coolbutuseless.github.io
Writing a nonogram solver in R
Nonograms are picture logic puzzles in which cells in a grid must be colored or left blank according to numbers at the side of the grid to reveal a hidden picture. In this short series of posts I will: A clue is sequence of integers which are the run-length encoding of the filled-in squares in a row or column…
www.samabbott.co.uk
getTBinR 0.5.5 now on CRAN - 2017 data.
The main message from the 2017 data is that in 2017 there were again over 10 million estimated TB cases globally with only a 1.8% decrease in incidence rates compared to 2016. Over the last 10 years progress has been made with an average of a 1…
www.redbandsports.net
A consideration of the 2018 Cy Young races
On the second-last Sunday of the season, the Rays’ Blake Snell threw at the Rogers Centre against the Blue Jays. The Blue Jays telecast was full of praise for Snell, who’s had a breakout season in 2018…
coolbutuseless.github.io
Creating nonograms with 'nonogram' and 'magick' packages
Nonograms are picture logic puzzles in which cells in a grid must be colored or left blank according to numbers at the side of the grid to reveal a hidden picture…
mailund.github.io/r-programmer-blog
Environments and formulae
In my two previous posts on non-standard evaluation, Scoping rules and NSE and Overscoping and eval, I explained: How R finds the bindings of variables by looking through chains of…
martakolczynska.com
Harmonizing measures of income in cross-national surveys
Data Number of response options Item non-response Distributions Harmonized target variables Next steps with Przemek Powałko Individual economic status is a necessary element of almost all sociological analyses, including studies of political attitudes and…
matthewsmith.rbind.io
Job Update
In September 2018, I joined the University of Greenwich as a Lecturer in International Business (IB), after completing my postodc at the University of Oxford…
matthewsmith.rbind.io
Network Visualisation in R - Package Comparison
There are a number of packages available to visualisation networks in R - ranging from those which are implement other network analysis features to those which draw on the grammar of graphics visualisation techniques…
www.smashingmagazine.com
Representing Web Developers In The W3C
One of the many things that I do is to be a part of the CSS Working Group as an Invited Expert…
www.sastibe.de
Tired of Waiting for your R Scripts to Finish? Let AWS do the Work, Get Notified by E-Mail
Recently, I was faced with a problem: I had written a rather complex simulation of a discrete time queueing network, and I needed to let this simulation run Anyways, the situation described above wouldn’t be problematic as such, but the runtime for a single one of these simulations increased exponentially: for 1500 observations it ran ~ 0…
rviews.rstudio.com
August 2018
Package developers relaxed a bit in August.; only 160 new packages went to CRAN that month…
martakolczynska.com
Harmonizing measures of income in cross-national surveys
Data Number of response options Item non-response Distributions Harmonized target variables Next steps with Przemek Powałko Individual economic status is a necessary element of almost all sociological analyses, including studies of political attitudes and…
haapakorva.fi
Keskilämpötilan muutos kuukausittain Oulussa 1955–2018
Olen syntynyt 1980-luvulla. Muistelen lapsuuteni talvia lumisina ja kylminä. Samalla muistan tältä vuosikymmeneltä talvia, joilloin yhtäjaksoista pakkasjaksoa ei kestänyt kuin muutaman viikon, tai siltä ainakin tuntuu…
www.ashwinmalshe.com
MSDA Final Project Guidelines
You will submit: Here Group## will be your group number. For example, Group number 1 will submit Here Group## will be your group number. For example, Group number 1 will submit Every group gets 20 minutes for the presentation. Out of these, 15 minutes are for the actual presentation and 5 minutes for Q&A…
coolbutuseless.github.io
Nonograms in R - 'nonogram' package
Nonograms are picture logic puzzles in which cells in a grid must be colored or left blank according to numbers at the side of the grid to reveal a hidden picture. In this short series of posts I will: This is a basic example which shows you how to plot and solve a simple puzzle…
www.smashingmagazine.com
Tough Interview(er) Questions For The Job-Seeking Designer
Whether you’re a multi-year veteran to the UX industry or fresh out of a higher education or boot camp style program, setting out into the job market can be a daunting task for any…
gcppodcast.com
Wellio with Sivan Aldor-Noiman and Erik Andrejko
The data scientists at Wellio started from scratch (pun intended) to create their own food-related database and then began training models so the data could be organized and personalized. Using a combination of human power and machine learning techniques, Wellio learns your preferences, allergies, diets, etc…
data-chips.com
Baby name pattern matching in R
stringr, rebus, babynames packagesMatching patterns in namesGirls names ending in “ita”Boys names rhyming with “Aiden”Girls names starting with J and ending in N soundBoys names ending in “ter”Names that rhyme with “Cory”I want to keep doing thisI was recently introduced to the stringr and rebus R packages when I took a string manipulation course from…
www.smashingmagazine.com
Building Websites With Tilda (Full Review)
(This is a sponsored article.) The modern web is very unified. Designers use the same patterns, and, as a result, websites created by different people look like clones. The only way to stand out from the crowd is via content. Content is what brings people to your website in the first place…
ropensci.org/blog
Mapping the 2018 East Africa floods from space with smapr
First, let’s get spatial boundaries for the study area: This returns a data frame with one row per file - we can see here that there is one file available for that date…
jenrichmond.rbind.io
Writing about LMM
Learning new things is HARD. Stats methods are particularly hard because it not just about running the test. You have to… Part of what makes this learning hard is that our research methods training has drummed into us that there is a RIGHT WAY TO DO IT…
www.rdatagen.net
simstudy update
And the point estimates for each variable on the probability scale: And just for fun, here is an example of how simulation might be used to investigate the performance of a model…
www.stencilled.me
Colleges vs Breweries - What tops in your State ?
In this post we see how the no. of breweries stack up against no…
www.smashingmagazine.com
Dark Patterns And Other Design No-Nos For Mobile
When it comes to making money, some companies will do whatever it takes to get people inside their…
www.stencilled.me
Google Trends using gtrends and R shiny
Recently while I was searching for trend data I came across Google Trends. So in the post I am using gtrends, a r package to understand the trends and display it using a R shiny app using the shinymaterial package. The shinymaterial package tends to move away from the traditional shiny dashboard layouts and design…
www.rostrum.blog
Knitting Club
Matt Dray RAP and the rise of reproducibility The UK government has got better at producing statistical reports in a reproducible way…
cevo.com.au
System Mapping
Every organisation has legacy systems, and people who know “where the bodies are buried” - hidden aspects of dependency, communication, process flow and whatnot, which every system relies on to function…
www.stencilled.me
Visualizing your favourite song?
Recently I was searching for some gifts when I stumbled across Sound Viz and Cumberland Coast . These were your favorite songs which were visualized and printed. So that’s when I started looking into on how could you visualize them…
jenrichmond.rbind.io
let me count the ways
When we want to learn how to do something new in R, we all start by googling it. Having worked out simple ANOVA, next on my list is repeated measures. HAHAHAHAHA- got me…. When we were talking one-way ANOVA we used this formula. When participants contribute to all levels of condition, we need to describe that in the error term…
www.stevejburr.com
MakeoverMonday 17-09-2018
MakeoverMonday this week was on the topic of transport costs, and remaking a chart original produced by DW.com. The original visualisation was a simple line chart showing the prices of train and plane tickets as the date of the journey approaches…
www.stevejburr.com
Tidy Tueday - 18-09-2018
For this #TidyTuesday there we two different datasets to look at, one was a table from an article in the magazine of the “Soaring Society of America” and the other was a dataset containing detailed information on US airports…
www.ashwinmalshe.com
Using Rstudio Projects
For this tutorial you will need RStudio installed on your computer. The operating system doesn’t matter. I am using my MacBook Pro to record the videos…
djnavarro.net
An 'academic' discussion on Twitter
This post is my attempt to explain why. A now-infamous article was posted to the Chronicle of Higher Education a couple of weeks ago, discussing some of the issues surrounding the open science movement. I’m not going to link to the article itself because it’s a very bad take on what the open science movement should be about, and I don’t see much value in encouraging others to read a bad…
robchoudhury.netlify.com
Cyclosporiapsis Outbreak in Texas 2017
Over a 20-day period, the several counties in Texas. Things really light up in the counties that countain big cities like Houston, Austin, and Dallas. Based on the CDC dataset, we can see that things get pretty bad, but start to settle down after mid-July…
alaburda.rbind.io
Integrating image annotations into blog posts
Last year, I started preparing an online biology textbook for high school…
haapakorva.fi
Kivisydämen vuosi
Olen mitannut Kivisydämen ja useiden muiden pysäköintitalojen käyttöä ohjelmallani jo yli vuoden ajan. Hallin laajentamisen puolesta otetaan joka vuosi kantaa, vaikka julkisesti ei ole tiedossa, kuinka paljon hallia käytetään…
mailund.github.io/r-programmer-blog
Overscoping and eval
In my previous post I used the lm function for an example of scope rules, but I left a few details out…
emmavestesson.netlify.com
Settlers of Catan - the random generator
Background Back in March I wrote a blog post about my favourite board game Settlers of Catan. The code generates a random catan board but there is no interactivity for when you want to create a different random board…
alaburda.rbind.io
Testing annotorious
Lorem ipsum, code is visible in post preview. Still kind of visible. ); original_onload = window.onload || function(){}; window.onload = function() { original_onload(); anno.hideSelectionWidget(); var figures = document.getElementsByClassName(“annotatable”); for (var i = 0;…
yihui.name/en
A Follow-Up Post on My Early Career Crisis Post (Finally)
Freedom, however, is not the last word. Freedom is only part of the story and half of the truth. Freedom is but the negative aspect of the whole phenomenon whose positive aspect is responsibleness…
amateurdatasci.rbind.io
Rain Drops
1 Problem 2 Solution 3 Reference 1 Problem Problem 49, page 189 in Simmons (2016) Newton’s second law of motion [ F = ma = m \frac{dv}{dt} ] can be written in the form [ \begin{equation} F = \frac{d \left( mv \right)}{dt} \tag{1.1} \end{equation} ] in terms of the momentum (mv) of a particle of mass (m) and velocity (v), and remains valid even if (m) is not…
coolbutuseless.github.io
The Zombie/Vampire Apocalypse - A use case for strict membership tests
Rather than R letting you know when your membership test is no longer up-to-date, the user must remain vigilant with every data update to ensure that it is still valid i.e…
djnavarro.net
Thoughts on participation
Very often I encounter students who are quiet, thoughtful and tend not to speak much. This is totally okay - not everyone in the classroom needs to be talking a…
blog.millerti.me
Tracking Docker Image Updates
I’m a huge fan of Docker, and so I run a few web applications at home on my own personal Docker host. Unfortunately, keeping these applications up-to-date with their latest Docker images often devolves into my remembering to peruse Docker Hub to see how out of date the version I declared in docker-compose…
g-tierney.github.io
Data Science in Mental Health
I came across two articles recently that I thought spoke to each other in an interesting way…
coolbutuseless.github.io
Find all sequences of a given length with the given sum
This problem popped up as a necessary step to solving a type of puzzle (which I’ll post about…
mathlacome.rbind.io
How to compare one performance with previous ones
When we open our Tableau software, the first thing is to import our dataset. This dataset is simple - 1 column with the player name, one with date and one with a performance-related variable (here, Mechanical Work during the game). We start by moving the ‘Performance variable’ (Mechanical Work (a…
rviews.rstudio.com
Interactive plots in Shiny
I asked a good friend at work to help me test the sample app provided in this post. She was able to run it easily, but then told me that she didn’t know that she was supposed to click on things. Adding interactive plots is one of the most important capabilities to include in a Shiny app…
mikewk.com
Labelling dataviz
The combination of style changes and labels clearly made a big difference but, still, I don’t think the above plot is mind-blowing or overly…
yihui.name/en
My Trick of Deciding if a Link Should Open a New Browser Tab
Pure technical issues are often relatively easy to resolve. Things will become complicated whenever humans are involved. Please don’t decide for the user…
blog.rstudio.com
RStudio Connect 1.6.8 - Emails, APIs, and Titles
RStudio Connect 1.6.8 includes additions to custom emails, new user endpoints in the RStudio Connect Server API, support for content descriptions and title changes, and important security and authentication improvements…
mailund.github.io/r-programmer-blog
Scoping Rules and NSE
Earlier this week, I wrote some tweets about how you have to be careful about scopes when you do non-standard evaluation…
davemcg.github.io
#GI2018 - Day Three
Typos everywhere. Things may change dramatically over time as I scan back through notes. I’ve tried to respect #notwitter…
dusty.phillips.codes
An Order to Learn to Program, Part 5
Parts in this series An Order to Learn to Program, Part 1 An Order to Learn to Program, Part 2 An Order to Learn to Program, Part 3 An Order to Learn to Program, Part 4 An Order to Learn to Program, Part 5 Part 5: Beginner programming, dynamically typed This is part 5 in my series on the order to study topics related to…
gcppodcast.com
DL Indaba
Vukosi and Yasin(i) share how Deep Learning Indaba is playing an important role to recognize and grow machine learning research and companies on the African continent…
visualizingtheleague.com
How Much Does Thibs Love His Former Players?
Tom Thibodeau loves his former players. Since taking over as coach of the TWolves, Thibs (who also serves of president of basketball operations) has acquired the services of no fewer than six players he coached on the Bulls: Jimmy Butler, Taj Gibson, Derrick Rose, Aaron Brooks, John Lucas III, and most recently Luol Deng…
haapakorva.fi
Jokainen oululainen kartalla
Kun näin Näytä datan piirtäneen jokaisen suomalaisen kartalle (inspiroituneena tästä ja tästä) käyttäen avoimia kotimaisia paikkatietoaineistoja ja ilmaista paikkatieto-ohjelmistoa QGIS:iä, halusin kokeilla onnistuisiko minulta ohjelmoida vastaava kuva R:llä. Viime vuosina R on kehittynyt voimakkaasti…
blog.wallaroolabs.com
Make Python Pandas go fast
Some Background Suppose you have a Data Analysis batch job that runs every hour on a dedicated machine. As the weeks go by, you notice that the inputs are getting larger and the time taken to run it gets longer, slowly nearing the one hour mark…
www.rladiesnyc.org
Parallel Computing in R
We’re excited to host Jared Lander, Chief Data Scientist of Lander Analytics, the organizer of the New York Open Statistical Programming Meetup and the New York R Conference, and author of R for Everyone, to talk about parallel computing in…
lcolladotor.github.io
Problems with an article from the Baltimore Sun covering Dr. Mahoney-Steel’s immigration issues
The article starts by framing…
blog.rstudio.com
Radix for R Markdown
Reader-friendly typography that adapts well to mobile devices. Radix provides many flexible options for laying out figures…
www.thecrosstab.com
The Political News That We're Googling Ahead of the Midterms
Below, I’ve sorted search traffic for “healthcare”,” supreme court“,”economy“, and”immigration” according to the days when each got the most relative hits. Aside from some blips in the news cycle, voters have been constantly concerned with immigration news in the US…
aosmith.rbind.io
The log-0 problem
This isn’t necessarily an incorrect thing to do. However, I think it is important to step back and think about the study and those 0 values more before forging ahead with adding a constant to the data. Without getting into too much detail, below are some of the things I consider when I have 0 as well as positive values in a response variable…
davemcg.github.io
#GI2018 - Day Two
Typos everywhere. Things may change dramatically over time as I scan back through notes. I’ve tried to respect #notwitter…
ropensci.org/blog
Building Reproducible Data Packages with DataPackageR
Sharing data sets for collaboration or publication has always been challenging, but it’s become increasingly problematic as complex and high dimensional data sets have become ubiquitous in the life sciences…
engineering.pivotal.io
Concourse as a Service and Multi-CPI
As the Toolsmiths Team for Greenplum, we are tasked with doing everything in our power to enable other developers within the org…
www.nomadic-hacker.com
From Zero to GPU 2 - Squeezing Neural Network Performance on CPU
Welcome to the second post in my From Zero to GPU series. Where I talk about aspects of neural network implementations…
www.ashwinmalshe.com
Homework 3-DA6233
This homework is based on Shiny app. Q1. Create the following 3 Shiny apps. The code to generate random numbers is as follows. Don’t use a random number seed for this exercise…
jessesadler.com
Introducing debkeepr
The economic historian encounters the difficulties of handling non-decimal currencies in two main contexts…
eliocamp.github.io/codigo-r
Multiple color (and fill) scales with ggplot2
(Versión en español) tl;dr: You can find the relevant code in this gist…
cevo.com.au
Python as a cloud glue language
One promise of the cloud is that you have less moving parts to manage. No longer do you need to worry about drives or servers, or even networks. Just package up your application and deploy it to the cloud. For those of you working in the cloud, you’ll know it’s not that simple…
yihui.name/en
Serving a Website or Shiny App at 0.0.0.0
This is a convenient way to show websites or apps to people in the same local network…
jenrichmond.rbind.io
where is here?
As I add new projects to my rstats portfolio and work collaboratively on projects with students the issue of working directories is becoming more and more…
davemcg.github.io
#GI2018 - Day One
Typos everywhere. Things may change dramatically over time as I scan back through notes. I’ve tried to respect #notwitter…
www.rladiesnyc.org
Building Infrastructure with R
For this event we will explore how to build tools and infrastructure with R…
yutani.rbind.io
Double dispatch of S3 method
So far, so good. But, this won’t work… Here’s examples to show them clearly: If you cannot wait, use S4…
eliocamp.github.io/codigo-r
Múltiples escalas de colores en ggplot2
(English version) tl;dr: En este gist está el código importante Un dolor importante para algunos usuarios de ggplot2 es la imposibilidad de usar más de una escala para cada tipo de parámetro…
jacobbuckman.com
Tensorflow
Explicitly naming nodes is nonessential, but can be very useful when debugging. Oftentimes, when your Tensorflow code crashes, the error trace will refer to a specific operation. If you have many operations of the same type, it can be tough to figure out which one is problematic…
giorasimchoni.com
The Actual Tidyverse
It’s been 7 months since my last post. A lot can happen in 7 months, is all I can say. Anyway, in this post I thought how I would combine three of my passions: The Tidyverse Web Scraping Network Analysis
So I’m going to scrape the Tidyverse (blogsphere), and perform Network Analysis on it. Or, rather Network Visualization…
mikewk.com
Tick marks, variable names, and ggplot2
Regardless, while it’s definitely a good idea to provide your own summary variable names, you will invariably find yourself in a situation where you would like to plot summarized variables that were named using the text of the expressions used to create them…
lcolladotor.github.io
What about a lawyer-like app as the minimum help for defendants in immigration cases?
The panel was composed by: I missed the first 30 minutes or so but I still got to listen to most of it…
www.riinu.me
Global map of country names
More info: Now this is very good already with hardly any overlapping labels and the world is pretty recognisable. And really, you can make this plot with just 2 lines of code: So what these two lines make is already very amazing. This gives continents a much better shape, but it does severaly misplace Polynesia. See if you can find where, e.g…
www.jessemaegan.com
R4DS (v1 & v2)
As all amazing opportunities in my life are wont to do, it started with a tweet: It’s rare that I find myself dealing with imposter syndrome, but I did spend Thursday night eating all of my feelings of doubt and…
maximewack.com
So, about those resolutions
It seems like they didn’t hold very long… Me being a French guy, you’re probably waiting for some kind of excuse, like “I’ve been too busy actually doing things that I didn’t have time to write about…
djnavarro.net
A personal essay on Bayes factors
’Cause you’re hot then you’re cold You’re yes then you’re no You’re in then you’re out You’re up then you’re down You’re wrong when it’s right It’s black and it’s white - Katy Perry Or, in something closer to every day language: What’s not to love? Oh, you sweet summer…
www.stevejburr.com
MakeoverMonday 10-09-2018
This week’s #MakeoverMonday challenge was to take a dataset contributed by ProPublica of spending by politicians/government agencies at properties owned by Donald Trump…
energychisquared.com
Predicción del consumo eléctrico (I)
A menudo, cuando se aplica el concepto predicción en el sector eléctrico tiende a relacionarse inmediatamente con dos conceptos: el consumo y el precio. El primero el fundamental para no penalizar desvíos…
www.stevejburr.com
Tidy Tuesday - 11-09-2018
This is a slightly late submission, didn’t have time during the week as I was at EARL for most of the week (hope to write a few words on this soon!)…
simplystatistics.org
Divergent and Convergent Phases of Data Analysis
One thing that I have found missing from many discussions about tooling in data analysis is an acknowledgment that data analysis tends to advance through different phases and that different tools can be more or less useful in each of those phases…
rmflight.github.io
Don't do PCA After Statistical Testing!
I have seen this published at least once (in a Diabetes metabolomics paper, if anyone knows it, please send it to me so I can link it), and have seen collaborators do this after coaching from others in non-statistical departments…
livefreeordichotomize.com
One year to dissertate
I’ve compiled some resources that I used when completing my dissertation and I wanted to share them with YOU! Throughout this post, I link to a bunch of different templates that I used throughout my process. You can find them all in this GitHub repo…
dusty.phillips.codes
An Intermediate Guide To RSA
The venerable RSA public key encryption algorithm is very elegant. It requires a basic understanding of modular arithmetic, which may sound scary if you haven’t studied it. It reduces to taking the remainder after integer long division. The RSA Wikipedia article describes five simple steps to generate the keys…
toscano84.github.io
Combine Data frames in R
Sometimes, before we start to explore our data, we need to put them together. For instance, we might have them stored in different data frames and we have to join variables from two or more data frames in one. This post will talk about the different functions we can use to achieve that goal…
lcolladotor.github.io
R Gene Regulatory Interaction Formulator For Inquiring Networks
Here you can find the tweet about the original blog post: This blog post was made possible thanks…
evangelinereynolds.netlify.com
Slow ggplot
This post has lots in common with previous posts on “the layered presentation of graphics”. It is about building up plots, but now with a focus on this incramental change for teaching ggplot2…
blog.schochastics.net
Stress based graph layouts
I academically grew up among graph drawers, that is, computer scientists and mathematicians interested in deriving two-dimensional depictions of graphs. One may despicably call it pixel science, yet a lot of hard theoretical work is put into producing pretty graph layouts…
www.ashwinmalshe.com
DA 6813 Homework Solution
Get the summary of the data Clearly, this will make model comparison a little bit tough but we have to give each model the best chance to perform even at this lower level of analysis…