coolbutuseless.github.io

Writing a nonogram solver in R

Nonograms are picture logic puzzles in which cells in a grid must be colored or left blank according to numbers at the side of the grid to reveal a hidden picture. In this short series of posts I will: A clue is a sequence of integers which are the run-length encoding of the filled-in squares in a row or column
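
To make the run-length idea concrete, here is a minimal sketch in Python (the post itself works in R). The function name `clue` and the representation of a row as a list of 0/1 values are assumptions made for illustration, not taken from the post.

```python
from itertools import groupby


def clue(cells):
    """Return the lengths of consecutive runs of filled cells.

    `cells` is any sequence where truthy values mean "filled"
    and falsy values mean "blank" (an assumed representation).
    """
    return [
        sum(1 for _ in run)           # length of this run
        for filled, run in groupby(cells, key=bool)
        if filled                     # keep only runs of filled cells
    ]


row = [0, 1, 1, 0, 1, 1, 1, 0]
print(clue(row))
```

A fully blank row gives an empty clue, and a fully filled row gives a single number equal to the row length.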


www.samabbott.co.uk

getTBinR 0.5.5 now on CRAN - 2017 data.

The main message from the 2017 data is that in 2017 there were again over 10 million estimated TB cases globally with only a 1.8% decrease in incidence rates compared to 2016. Over the last 10 years progress has been made with an average of a 1


www.redbandsports.net

A consideration of the 2018 Cy Young races

On the second-last Sunday of the season, the Rays’ Blake Snell threw at the Rogers Centre against the Blue Jays. The Blue Jays telecast was full of praise for Snell, who’s had a breakout season in 2018


coolbutuseless.github.io

Creating nonograms with 'nonogram' and 'magick' packages

Nonograms are picture logic puzzles in which cells in a grid must be colored or left blank according to numbers at the side of the grid to reveal a hidden picture


mailund.github.io/r-programmer-blog

Environments and formulae

In my two previous posts on non-standard evaluation, Scoping rules and NSE and Overscoping and eval, I explained: How R finds the bindings of variables by looking through chains of


martakolczynska.com

Harmonizing measures of income in cross-national surveys

Data; Number of response options; Item non-response; Distributions; Harmonized target variables; Next steps. With Przemek Powałko. Individual economic status is a necessary element of almost all sociological analyses, including studies of political attitudes and


matthewsmith.rbind.io

Job Update

In September 2018, I joined the University of Greenwich as a Lecturer in International Business (IB), after completing my postdoc at the University of Oxford


matthewsmith.rbind.io

Network Visualisation in R - Package Comparison

There are a number of packages available to visualise networks in R, ranging from those which implement other network analysis features to those which draw on the grammar of graphics visualisation techniques


www.smashingmagazine.com

Representing Web Developers In The W3C

One of the many things that I do is to be a part of the CSS Working Group as an Invited Expert


www.sastibe.de

Tired of Waiting for your R Scripts to Finish? Let AWS do the Work, Get Notified by E-Mail

Recently, I was faced with a problem: I had written a rather complex simulation of a discrete time queueing network, and I needed to let this simulation run. Anyways, the situation described above wouldn’t be problematic as such, but the runtime for a single one of these simulations increased exponentially: for 1500 observations it ran ~ 0


rviews.rstudio.com

August 2018

Package developers relaxed a bit in August; only 160 new packages went to CRAN that month


haapakorva.fi

Monthly change in mean temperature in Oulu, 1955–2018

I was born in the 1980s. I remember the winters of my childhood as snowy and cold. At the same time, I remember winters from this decade in which a continuous spell of frost lasted no more than a few weeks, or at least it feels that way


www.ashwinmalshe.com

MSDA Final Project Guidelines

You will submit: Here Group## will be your group number. For example, Group number 1 will submit Every group gets 20 minutes for the presentation. Out of these, 15 minutes are for the actual presentation and 5 minutes for Q&A


coolbutuseless.github.io

Nonograms in R - 'nonogram' package

Nonograms are picture logic puzzles in which cells in a grid must be colored or left blank according to numbers at the side of the grid to reveal a hidden picture. In this short series of posts I will: This is a basic example which shows you how to plot and solve a simple puzzle


www.smashingmagazine.com

Tough Interview(er) Questions For The Job-Seeking Designer

Whether you’re a multi-year veteran to the UX industry or fresh out of a higher education or boot camp style program, setting out into the job market can be a daunting task for any


gcppodcast.com

Wellio with Sivan Aldor-Noiman and Erik Andrejko

The data scientists at Wellio started from scratch (pun intended) to create their own food-related database and then began training models so the data could be organized and personalized. Using a combination of human power and machine learning techniques, Wellio learns your preferences, allergies, diets, etc


data-chips.com

Baby name pattern matching in R

stringr, rebus, babynames packages. Matching patterns in names. Girls names ending in “ita”. Boys names rhyming with “Aiden”. Girls names starting with J and ending in N sound. Boys names ending in “ter”. Names that rhyme with “Cory”. I want to keep doing this. I was recently introduced to the stringr and rebus R packages when I took a string manipulation course from


www.smashingmagazine.com

Building Websites With Tilda (Full Review)

(This is a sponsored article.) The modern web is very unified. Designers use the same patterns, and, as a result, websites created by different people look like clones. The only way to stand out from the crowd is via content. Content is what brings people to your website in the first place


ropensci.org/blog

Mapping the 2018 East Africa floods from space with smapr

First, let’s get spatial boundaries for the study area: This returns a data frame with one row per file - we can see here that there is one file available for that date


jenrichmond.rbind.io

Writing about LMM

Learning new things is HARD. Stats methods are particularly hard because it is not just about running the test. You have to… Part of what makes this learning hard is that our research methods training has drummed into us that there is a RIGHT WAY TO DO IT


www.rdatagen.net

simstudy update

And the point estimates for each variable on the probability scale: And just for fun, here is an example of how simulation might be used to investigate the performance of a model


www.stencilled.me

Colleges vs Breweries - What tops in your State ?

In this post we see how the no. of breweries stack up against no


www.smashingmagazine.com

Dark Patterns And Other Design No-Nos For Mobile

When it comes to making money, some companies will do whatever it takes to get people inside their


www.stencilled.me

Google Trends using gtrends and R shiny

Recently, while I was searching for trend data, I came across Google Trends. So in this post I am using gtrends, an R package, to understand the trends and display them in an R Shiny app using the shinymaterial package. The shinymaterial package tends to move away from the traditional Shiny dashboard layouts and design


www.rostrum.blog

Knitting Club

Matt Dray RAP and the rise of reproducibility The UK government has got better at producing statistical reports in a reproducible way


cevo.com.au

System Mapping

Every organisation has legacy systems, and people who know “where the bodies are buried” - hidden aspects of dependency, communication, process flow and whatnot, which every system relies on to function


www.stencilled.me

Visualizing your favourite song?

Recently I was searching for some gifts when I stumbled across Sound Viz and Cumberland Coast. These were your favorite songs, visualized and printed. So that’s when I started looking into how you could visualize them


jenrichmond.rbind.io

let me count the ways

When we want to learn how to do something new in R, we all start by googling it. Having worked out simple ANOVA, next on my list is repeated measures. HAHAHAHAHA- got me…. When we were talking one-way ANOVA we used this formula. When participants contribute to all levels of condition, we need to describe that in the error term


www.stevejburr.com

MakeoverMonday 17-09-2018

MakeoverMonday this week was on the topic of transport costs, and remaking a chart originally produced by DW.com. The original visualisation was a simple line chart showing the prices of train and plane tickets as the date of the journey approaches


www.stevejburr.com

Tidy Tuesday - 18-09-2018

For this #TidyTuesday there were two different datasets to look at: one was a table from an article in the magazine of the “Soaring Society of America” and the other was a dataset containing detailed information on US airports


www.ashwinmalshe.com

Using Rstudio Projects

For this tutorial you will need RStudio installed on your computer. The operating system doesn’t matter. I am using my MacBook Pro to record the videos


djnavarro.net

An 'academic' discussion on Twitter

This post is my attempt to explain why. A now-infamous article was posted to the Chronicle of Higher Education a couple of weeks ago, discussing some of the issues surrounding the open science movement. I’m not going to link to the article itself because it’s a very bad take on what the open science movement should be about, and I don’t see much value in encouraging others to read a bad


robchoudhury.netlify.com

Cyclosporiasis Outbreak in Texas 2017

Over a 20-day period, the several counties in Texas. Things really light up in the counties that contain big cities like Houston, Austin, and Dallas. Based on the CDC dataset, we can see that things get pretty bad, but start to settle down after mid-July


alaburda.rbind.io

Integrating image annotations into blog posts

Last year, I started preparing an online biology textbook for high school


haapakorva.fi

A year of Kivisydän

I have been measuring the use of Kivisydän and several other parking garages with my program for over a year now. Every year people speak out in favor of expanding the garage, even though it is not publicly known how much the garage is used


mailund.github.io/r-programmer-blog

Overscoping and eval

In my previous post I used the lm function for an example of scope rules, but I left a few details out


emmavestesson.netlify.com

Settlers of Catan - the random generator

Background: Back in March I wrote a blog post about my favourite board game, Settlers of Catan. The code generates a random Catan board but there is no interactivity for when you want to create a different random board


alaburda.rbind.io

Testing annotorious

Lorem ipsum, code is visible in post preview. Still kind of visible.


yihui.name/en

A Follow-Up Post on My Early Career Crisis Post (Finally)

Freedom, however, is not the last word. Freedom is only part of the story and half of the truth. Freedom is but the negative aspect of the whole phenomenon whose positive aspect is responsibleness


amateurdatasci.rbind.io

Rain Drops

Problem 49, page 189 in Simmons (2016). Newton’s second law of motion \[ F = ma = m \frac{dv}{dt} \] can be written in the form \[ F = \frac{d \left( mv \right)}{dt} \tag{1.1} \] in terms of the momentum \(mv\) of a particle of mass \(m\) and velocity \(v\), and remains valid even if \(m\) is not


coolbutuseless.github.io

The Zombie/Vampire Apocalypse - A use case for strict membership tests

Rather than R letting you know when your membership test is no longer up-to-date, the user must remain vigilant with every data update to ensure that it is still valid i.e


djnavarro.net

Thoughts on participation

Very often I encounter students who are quiet, thoughtful and tend not to speak much. This is totally okay - not everyone in the classroom needs to be talking a


blog.millerti.me

Tracking Docker Image Updates

I’m a huge fan of Docker, and so I run a few web applications at home on my own personal Docker host. Unfortunately, keeping these applications up-to-date with their latest Docker images often devolves into my remembering to peruse Docker Hub to see how out of date the version I declared in docker-compose


g-tierney.github.io

Data Science in Mental Health

I came across two articles recently that I thought spoke to each other in an interesting way


coolbutuseless.github.io

Find all sequences of a given length with the given sum

This problem popped up as a necessary step to solving a type of puzzle (which I’ll post about
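
As a rough illustration of the problem in the title, here is a minimal Python sketch (the post itself is about R). It assumes the sequences are made of non-negative integers and that order matters; the excerpt does not state either, and the function name `sequences_with_sum` is hypothetical.

```python
def sequences_with_sum(length, total):
    """Yield every tuple of `length` non-negative integers that sums to `total`.

    Assumption for this sketch: entries may be zero and order matters,
    so (0, 2) and (2, 0) are different sequences.
    """
    if length == 0:
        # Only the empty sequence has length 0, and it sums to 0.
        if total == 0:
            yield ()
        return
    for first in range(total + 1):
        # Fix the first entry, then fill the rest recursively.
        for rest in sequences_with_sum(length - 1, total - first):
            yield (first,) + rest


for seq in sequences_with_sum(2, 2):
    print(seq)
```

The recursion fixes one entry at a time, so the number of results grows quickly with both arguments; a generator keeps memory use flat when only some of the sequences are needed.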


mathlacome.rbind.io

How to compare one performance with previous ones

When we open our Tableau software, the first thing is to import our dataset. This dataset is simple - 1 column with the player name, one with date and one with a performance-related variable (here, Mechanical Work during the game). We start by moving the ‘Performance variable’ (Mechanical Work (a


rviews.rstudio.com

Interactive plots in Shiny

I asked a good friend at work to help me test the sample app provided in this post. She was able to run it easily, but then told me that she didn’t know that she was supposed to click on things. Adding interactive plots is one of the most important capabilities to include in a Shiny app


mikewk.com

Labelling dataviz

The combination of style changes and labels clearly made a big difference but, still, I don’t think the above plot is mind-blowing or overly


yihui.name/en

My Trick of Deciding if a Link Should Open a New Browser Tab

Pure technical issues are often relatively easy to resolve. Things will become complicated whenever humans are involved. Please don’t decide for the user


blog.rstudio.com

RStudio Connect 1.6.8 - Emails, APIs, and Titles

RStudio Connect 1.6.8 includes additions to custom emails, new user endpoints in the RStudio Connect Server API, support for content descriptions and title changes, and important security and authentication improvements


mailund.github.io/r-programmer-blog

Scoping Rules and NSE

Earlier this week, I wrote some tweets about how you have to be careful about scopes when you do non-standard evaluation


davemcg.github.io

#GI2018 - Day Three

Typos everywhere. Things may change dramatically over time as I scan back through notes. I’ve tried to respect #notwitter


dusty.phillips.codes

An Order to Learn to Program, Part 5

Part 5: Beginner programming, dynamically typed. This is part 5 in my series on the order to study topics related to


gcppodcast.com

DL Indaba

Vukosi and Yasin(i) share how Deep Learning Indaba is playing an important role to recognize and grow machine learning research and companies on the African continent


visualizingtheleague.com

How Much Does Thibs Love His Former Players?

Tom Thibodeau loves his former players. Since taking over as coach of the TWolves, Thibs (who also serves as president of basketball operations) has acquired the services of no fewer than six players he coached on the Bulls: Jimmy Butler, Taj Gibson, Derrick Rose, Aaron Brooks, John Lucas III, and most recently Luol Deng


haapakorva.fi

Every Oulu resident on a map

When I saw that Näytä data had drawn every Finn on a map (inspired by this and this) using open Finnish geospatial datasets and the free GIS software QGIS, I wanted to try whether I could program a similar image with R. In recent years R has developed rapidly


blog.wallaroolabs.com

Make Python Pandas go fast

Some Background Suppose you have a Data Analysis batch job that runs every hour on a dedicated machine. As the weeks go by, you notice that the inputs are getting larger and the time taken to run it gets longer, slowly nearing the one hour mark


www.rladiesnyc.org

Parallel Computing in R

We’re excited to host Jared Lander, Chief Data Scientist of Lander Analytics, the organizer of the New York Open Statistical Programming Meetup and the New York R Conference, and author of R for Everyone, to talk about parallel computing in


lcolladotor.github.io

Problems with an article from the Baltimore Sun covering Dr. Mahoney-Steel’s immigration issues

The article starts by framing


blog.rstudio.com

Radix for R Markdown

Reader-friendly typography that adapts well to mobile devices. Radix provides many flexible options for laying out figures


www.thecrosstab.com

The Political News That We're Googling Ahead of the Midterms

Below, I’ve sorted search traffic for “healthcare”, “supreme court”, “economy”, and “immigration” according to the days when each got the most relative hits. Aside from some blips in the news cycle, voters have been constantly concerned with immigration news in the US


aosmith.rbind.io

The log-0 problem

This isn’t necessarily an incorrect thing to do. However, I think it is important to step back and think about the study and those 0 values more before forging ahead with adding a constant to the data. Without getting into too much detail, below are some of the things I consider when I have 0 as well as positive values in a response variable


davemcg.github.io

#GI2018 - Day Two

Typos everywhere. Things may change dramatically over time as I scan back through notes. I’ve tried to respect #notwitter


www.rladiesnyc.org

Book club

Date: Saturday, October 20,


ropensci.org/blog

Building Reproducible Data Packages with DataPackageR

Sharing data sets for collaboration or publication has always been challenging, but it’s become increasingly problematic as complex and high dimensional data sets have become ubiquitous in the life sciences


engineering.pivotal.io

Concourse as a Service and Multi-CPI

As the Toolsmiths Team for Greenplum, we are tasked with doing everything in our power to enable other developers within the org


www.nomadic-hacker.com

From Zero to GPU 2 - Squeezing Neural Network Performance on CPU

Welcome to the second post in my From Zero to GPU series, where I talk about aspects of neural network implementations


www.ashwinmalshe.com

Homework 3-DA6233

This homework is based on Shiny app. Q1. Create the following 3 Shiny apps. The code to generate random numbers is as follows. Don’t use a random number seed for this exercise


jessesadler.com

Introducing debkeepr

The economic historian encounters the difficulties of handling non-decimal currencies in two main contexts


eliocamp.github.io/codigo-r

Multiple color (and fill) scales with ggplot2

(Versión en español) tl;dr: You can find the relevant code in this gist


cevo.com.au

Python as a cloud glue language

One promise of the cloud is that you have fewer moving parts to manage. No longer do you need to worry about drives or servers, or even networks. Just package up your application and deploy it to the cloud. For those of you working in the cloud, you’ll know it’s not that simple


yihui.name/en

Serving a Website or Shiny App at 0.0.0.0

This is a convenient way to show websites or apps to people in the same local network


jenrichmond.rbind.io

where is here?

As I add new projects to my rstats portfolio and work collaboratively on projects with students the issue of working directories is becoming more and more


davemcg.github.io

#GI2018 - Day One

Typos everywhere. Things may change dramatically over time as I scan back through notes. I’ve tried to respect #notwitter


www.rladiesnyc.org

Building Infrastructure with R

For this event we will explore how to build tools and infrastructure with R


yutani.rbind.io

Double dispatch of S3 method

So far, so good. But, this won’t work… Here’s examples to show them clearly: If you cannot wait, use S4


ewen.io

In search of the mixtape GOAT

A data-driven tribute to the best of the mixtape era


eliocamp.github.io/codigo-r

Multiple color scales in ggplot2

(English version) tl;dr: The important code is in this gist. A major pain point for some ggplot2 users is the impossibility of using more than one scale for each type of parameter


jacobbuckman.com

Tensorflow

Explicitly naming nodes is nonessential, but can be very useful when debugging. Oftentimes, when your Tensorflow code crashes, the error trace will refer to a specific operation. If you have many operations of the same type, it can be tough to figure out which one is problematic


giorasimchoni.com

The Actual Tidyverse

It’s been 7 months since my last post. A lot can happen in 7 months, is all I can say. Anyway, in this post I thought about how I would combine three of my passions: the Tidyverse, web scraping, and network analysis.

So I’m going to scrape the Tidyverse (blogosphere), and perform Network Analysis on it. Or, rather, Network Visualization


mikewk.com

Tick marks, variable names, and ggplot2

Regardless, while it’s definitely a good idea to provide your own summary variable names, you will invariably find yourself in a situation where you would like to plot summarized variables that were named using the text of the expressions used to create them


lcolladotor.github.io

What about a lawyer-like app as the minimum help for defendants in immigration cases?

The panel was composed by: I missed the first 30 minutes or so but I still got to listen to most of it


satopirka.com

Decision tree implementation with NumPy

Implementing a decision tree from scratch in Python


www.riinu.me

Global map of country names

More info: Now this is very good already with hardly any overlapping labels and the world is pretty recognisable. And really, you can make this plot with just 2 lines of code: So what these two lines make is already very amazing. This gives continents a much better shape, but it does severely misplace Polynesia. See if you can find where, e.g


www.jessemaegan.com

R4DS (v1 & v2)

As all amazing opportunities in my life are wont to do, it started with a tweet: It’s rare that I find myself dealing with imposter syndrome, but I did spend Thursday night eating all of my feelings of doubt and


maximewack.com

So, about those resolutions

It seems like they didn’t hold very long… Me being a French guy, you’re probably waiting for some kind of excuse, like “I’ve been too busy actually doing things that I didn’t have time to write about


djnavarro.net

A personal essay on Bayes factors

’Cause you’re hot then you’re cold You’re yes then you’re no You’re in then you’re out You’re up then you’re down You’re wrong when it’s right It’s black and it’s white - Katy Perry Or, in something closer to every day language: What’s not to love? Oh, you sweet summer


www.stevejburr.com

MakeoverMonday 10-09-2018

This week’s #MakeoverMonday challenge was to take a dataset contributed by ProPublica of spending by politicians/government agencies at properties owned by Donald Trump


energychisquared.com

Forecasting electricity consumption (I)

Often, when the concept of forecasting is applied in the electricity sector, it tends to be associated immediately with two concepts: consumption and price. The first is fundamental to avoid penalties for deviations


www.stevejburr.com

Tidy Tuesday - 11-09-2018

This is a slightly late submission, didn’t have time during the week as I was at EARL for most of the week (hope to write a few words on this soon!)


simplystatistics.org

Divergent and Convergent Phases of Data Analysis

One thing that I have found missing from many discussions about tooling in data analysis is an acknowledgment that data analysis tends to advance through different phases and that different tools can be more or less useful in each of those phases


rmflight.github.io

Don't do PCA After Statistical Testing!

I have seen this published at least once (in a Diabetes metabolomics paper, if anyone knows it, please send it to me so I can link it), and have seen collaborators do this after coaching from others in non-statistical departments


livefreeordichotomize.com

One year to dissertate

I’ve compiled some resources that I used when completing my dissertation and I wanted to share them with YOU! Throughout this post, I link to a bunch of different templates that I used throughout my process. You can find them all in this GitHub repo


dusty.phillips.codes

An Intermediate Guide To RSA

The venerable RSA public key encryption algorithm is very elegant. It requires a basic understanding of modular arithmetic, which may sound scary if you haven’t studied it. It reduces to taking the remainder after integer long division. The RSA Wikipedia article describes five simple steps to generate the keys
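
To show how little arithmetic is involved, here is a toy Python sketch of textbook RSA key generation and a round trip. The tiny primes, the exponent 17, and the use of Euler's totient are choices made for this illustration, not taken from the post; real keys use very large primes and padding, and this sketch is not secure. It needs Python 3.8 or later for the modular inverse via `pow`.

```python
# Toy textbook RSA with deliberately tiny numbers (illustration only).

p, q = 61, 53                 # step 1: two distinct primes (assumed values)
n = p * q                     # step 2: the modulus, shared by both keys
phi = (p - 1) * (q - 1)       # step 3: Euler's totient of n
e = 17                        # step 4: public exponent, coprime with phi
d = pow(e, -1, phi)           # step 5: private exponent, inverse of e mod phi


def encrypt(message):
    """Encrypt an integer 0 <= message < n with the public key (n, e)."""
    return pow(message, e, n)


def decrypt(ciphertext):
    """Decrypt with the private key (n, d)."""
    return pow(ciphertext, d, n)


message = 65
ciphertext = encrypt(message)
print(ciphertext, decrypt(ciphertext))
```

Each `pow(base, exponent, modulus)` call is repeated multiplication followed by taking a remainder, which is the sense in which the whole scheme reduces to remainders after integer division.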


toscano84.github.io

Combine Data frames in R

Sometimes, before we start to explore our data, we need to put them together. For instance, we might have them stored in different data frames and we have to join variables from two or more data frames in one. This post will talk about the different functions we can use to achieve that goal


lcolladotor.github.io

R Gene Regulatory Interaction Formulator For Inquiring Networks

Here you can find the tweet about the original blog post: This blog post was made possible thanks


evangelinereynolds.netlify.com

Slow ggplot

This post has lots in common with previous posts on “the layered presentation of graphics”. It is about building up plots, but now with a focus on this incremental change for teaching ggplot2


blog.schochastics.net

Stress based graph layouts

I academically grew up among graph drawers, that is, computer scientists and mathematicians interested in deriving two-dimensional depictions of graphs. One may despicably call it pixel science, yet a lot of hard theoretical work is put into producing pretty graph layouts


www.ashwinmalshe.com

DA 6813 Homework Solution

Get the summary of the data. Clearly, this will make model comparison a little bit tough but we have to give each model the best chance to perform even at this lower level of analysis