www.mytinyshinys.com
Hugo Shortcodes
One of the delights of using blogdown to create blogs or websites is that it allows access to Hugo shortcodes, which automatically create an HTML snippet based on the parameters you provide…
roelandtn.frama.io
Pepper et Carrot
Today I went to the library A livr’Ouvert because there was a signing session with David…
www.mytinyshinys.com
EPL Week 4
Game of the week: Stoke City 2-2 Manchester United. United’s first real test…
ernestoramirez.com
Examining Variability in Physical Activity Data with R
Earlier this summer, while I was traveling to a conference, I came across an interesting paper published by a group from the University of South Florida that explored strategies for interpreting highly variable data from long-term use of a Fitbit…
roelandtn.frama.io
UbuntuCon Europe 2017
I just came home from UbuntuCon Europe 2017, which was held at La Cité des Sciences et de l’Industrie in Paris. So I was able to attend it, although I followed one talk, about privacy on the internet…
giorasimchoni.com
You're in a room (The CastleOfR Package)
It has always been a dream of mine to create a text-based game (a.k.a interactive fiction or text adventure). Text-based games were common in the 1980s when I was a kid, and personal computers were lacking in high-end graphics…
www.tidyverse.org/articles
The summer of ggplot2 - wooooo!
My ggplot2 work consisted of a few different things: issue and pull request triage, bug fixes, and developing new features…
www.tidyverse.org/articles
👩💻🚙👯
My day to day back in April seems so foreign to the workflow Jenny and I have now established, but those first few weeks were incredibly useful for getting me steeped in all things Google Drive…
ritsokiguess.site/docs
Safely, possibly
Introduction You know how, when you’re reading documentation, you nod sagely as you read some part, but when you come back to it later, you realize that you really don’t understand it at all. This happened to me with the purrr documentation…
ewen.io
Unravelling the European Football Transfer Window
Scraping data on 2017 football transfers and following the money…
cevo.com.au
DevOps, Deindividuation and the DeLorean.
I’m fascinated by human interaction. If I was able to find a DeLorean I’d tell myself to fly back in time and study Psychology and not Computer Science…
timtrice.net
Hurricane Harvey Post Storm Report
On August 26, 2017, Hurricane Harvey made landfall along the central Texas gulf coast as a category four hurricane…
gcppodcast.com
What's AI with Melanie Warrick
What if you want to do an action whenever any Pod is created or destroyed on…
www.rdatagen.net
Further considerations of a hidden process underlying categorical responses
Certainly, group membership is not necessarily intrinsically ordered. In a general categorical or multinomial outcome, a group does not necessarily have any quantitative relationship vis a vis the other groups…
www.tidyverse.org/articles
Erratum tidyr 0.7.0
You can install the new version of tidyselect from CRAN: We thought this was a relatively uncommon occurrence in practice. However that broke a lot of code that had this form: Although the change was well-intentioned, it proved to be too disruptive and we have reverted it…
livefreeordichotomize.com
A Simple Slack Bot With Plumber
Intro To Plumber: Plumber is an R package that allows you to create web APIs in R…
www.ifconfig.it/hugo
AirPiConsole part two
Welcome back to AirPiConsole blog post, this is part two. If you read part one and followed the configuration steps you should now have a fully working Raspberry Pi Zero W connected to your WiFi network. You should also be able to connect via Bluetooth to get a console connection without knowing the IP address of the Raspi…
www.ifconfig.it/hugo
AirPiConsole part one
As a network engineer I spend a lot of time with my laptop connected via serial cable to various devices…
thug-r.life
Bucket List
Most people end up having a list of things they’d like to do in their life before they ‘kick the bucket’. Often this is made up of big events - a foreign trip, some daredevil adventure. But sometimes that list is a bunch of little things…
mlr-blog.netlify.com
OpenML Workshop 2017
What is OpenML? The field of Machine Learning has grown tremendously over the last years, and is a key component of data-driven science. Data analysis algorithms are being invented and used every day, but their results and experiments are published almost exclusively in journals or separated repositories…
www.mytinyshinys.com
EPL Week 3
Game of the week: A slightly fortuitous win for Manchester City. View the highlights with a Russian commentary. Arsenal leaking goals: Arsenal have gone from edging a seven-goal thriller against Leicester to a pummelling at the hands of Liverpool, via an unhappy trip to Stoke. Over this period, they have conceded eight goals…
batteriesnotincluded.rbind.io
Ad Hoc Testing
Unit tests are typically designed with a set function name in mind - that is, the function being tested will always have the same name, it may just have an updated definition. However, when testing interactively, I find that sometimes I want to test two different versions of a function to compare their behavior…
shotwell.ca/blog
Advice for non-traditional data scientists
I have a pretty strange background for a data scientist. In my career I’ve sold electric razors, worked on credit derivatives during the 2008 financial crash, written market reports on orthopaedic biomaterials, and practiced law…
lenkiefer.com
Low-key dataviz
IN THIS POST I WANT TO REWORK SOME VISUALIZATIONS we’ve already made. Inspired by this compendium of clean graphics in R, I want to remake some of the visualizations we made earlier this year in a low-key manner. Sometimes the visualizations we make here can be pretty intense…
jvera.netlify.com
Selecting packages while coding
Sometimes, you must choose a package to achieve some results while you’re coding. Let’s say, you’re wrangling data and noticed you need a quick way of getting the mode of a vector…
www.tidyverse.org/articles
googledrive initial release
Install googledrive with: The auth flow should “just work” for most people, especially for early and interactive use. For more advanced usage, functions are available that give the user much more control, i.e. to support non-interactive and remote usage…
www.samatkins.me
100 Days of Code Challenge Summary
Photo by Sai Kiran Anagani on Unsplash Earlier this year I completed the 100 days of code challenge…
www.rdatagen.net
A hidden process behind binary or other categorical outcomes?
I was thinking a lot about proportional-odds cumulative logit models last fall while designing a study to evaluate an intervention’s effect on meat consumption…
blog.davisvaughan.com
Financial Numerical Methods - Part 1
Where: Interestingly, we actually have the solution to this equation (one of the few we have analytical solutions for): More generally, this can be written as a formula providing us with the recursive equation: I was given some starting parameters: We all know loops are to be avoided when you can in R, and that you should instead vectorize the…
lenkiefer.com
Forecasting is hard (work)
IN THIS POST WE WILL STUDY FORECASTS OF US ECONOMIC CONDITIONS. Niels Bohr quipped: Prediction is very difficult, especially if it’s about the future. I’m a macroeconomist by training, and my day job sometimes requires me to forecast the future so I can relate…
lenkiefer.com
Forecasting is hard (work)-bonus charts
THIS POST IS A QUICK FOLLOW UP to my previous post on forecasting. I’m not going to add any R code or commentary on graphics, instead I’m just going to leave a few extra graphs related to the prior post. Check out the earlier post for more details on the data and how to build the charts…
www.blog.rdata.lu
How tidyeval could make your life easier
But maybe now you know how to use it, but not why and when you should use it… Basically, whenever you want to write a function that looks something like this:…
cattleguard.github.io
Kickstarting Conversation With Sarcastic Programming
While killing time on Twitter the other day, a Techlahoma tweet featuring WebVR and A-Frame grabbed my attention. If you know me, I’ve been excited about VR and AR for some time (despite not having any VR or AR capable equipment). So, a color changing sphere that bounces is still fairly fun for me…
blog.sellorm.com
Using Python to build RPM’s of non-python code
Anyone that I’ve ever talked to about RPMs knows that I can’t stand the ‘official’ rpmbuild tools. For whatever reason, I’ve just never got on with them. I know loads of people who use them all the time without complaint, but the whole process has always seemed dysfunctional to me…
roelandtn.frama.io
Holidays
Finally I was able to take some holidays in late August: one week visiting friends before getting back to school in early September, and starting my new job on September 5th. To be honest, I had one week of holidays in July. But it was for the FOSS4G-Eu 2017, as an organizer, so it was not really relaxing (but so enriching)…
roelandtn.frama.io
OSGeo-Live Project
OSGeo-Live is a self-contained ISO with around 50 FOSS applications dedicated to GIS. A virtual machine archive is also provided. I joined the project in November (or early December) 2015, as I was already a free-software advocate when I became a GIS technician…
batteriesnotincluded.rbind.io
Scraping Friends
Now that we’re set up and ready to go, let’s take a moment to think through what we need to do in order to build out this data set: 1. Download all scripts with identifying season and episode data; 2. Parse all scripts into speaker and line; 3. Clean up any errors; 4. Celebrate good times. Seems simple enough…
thug-r.life
purrr Tricks with All Subset Regression
All Subsets Regression: What is all subsets regression? It’s a technique for model building which involves taking a set of independent variables \(X_{1..i}\) and regressing them in sets of \(k\), where \(k\) is in \(\{1,2,\dots,i\}\), against the response variable \(Y\). The ‘all’ part of ‘all subsets’ means it’s every combination of \(X_{1..i}\) being drawn \(k\) at a…
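To make the "every combination" idea in the excerpt above concrete, here is a minimal sketch in Python (not the purrr approach the post itself uses). It only enumerates the candidate predictor sets; fitting a regression to each one is left out. The predictor names and the `all_subsets` helper are hypothetical, chosen purely for illustration.

```python
from itertools import combinations


def all_subsets(predictors):
    """Yield every non-empty subset of the predictors, smallest sets first."""
    for k in range(1, len(predictors) + 1):
        # combinations() draws k predictors at a time, without repetition
        for subset in combinations(predictors, k):
            yield subset


# Hypothetical predictor names, for illustration only
predictors = ["x1", "x2", "x3"]
subsets = list(all_subsets(predictors))

for subset in subsets:
    print(subset)
```

With i predictors there are 2^i - 1 non-empty subsets, so the number of candidate models grows quickly as predictors are added.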
www.mytinyshinys.com
EPL Week 2
Round 2: Not so many shocks in Round 2. Three teams remain with 100% records…
wirtel.be
I am presenting at PyCon France 2017
PyCon France 2017: PyCon France is the annual Python conference in France for the French community, and this year the conference will be in Toulouse, a town in the south of France, from 21st to 24th…
gcppodcast.com
The Future of Media with Machine Learning with Amit Pande
We have a new answer to the question from last…
lenkiefer.com
Visualizing house price acceleration
I HAVE BEEN BUILDING SOME NEW VISUALIZATIONS to study house price trends. In particular I have been thinking about rates of change of rates of change, or accelerations and decelerations in house price trends. I’ve got more to say on this topic, but for today, let’s create a few visualizations and contemplate an animation…
www.rdatagen.net
Be careful not to control for a post-exposure covariate
The field of causal inference is a rich one, and I won’t even scratch the surface here. My goal is to present the concepts of potential outcomes so that we can articulate at least one clear way to think about how a causal effect can be defined…
roelandtn.frama.io
First post!
First of all, welcome to this site! As the about page says, this blog’s purpose is to talk about OpenGIS, Python, R, data analysis and stuff like that. I will probably be publishing learning notebooks as I do them to improve my knowledge. When possible, I’ll publish original stuff, depending on current affairs…
www.tidyverse.org/articles
tidyr 0.7.0
We are happy to announce that tidyr 0.7.0 is now available on CRAN. There are two big changes: tidyr uses the new tidyselect package as selection backend…
blog.davisvaughan.com
Which RStudio blog posts “pleased” Hadley? A tidytext + web scraping analysis
A while back, I saw a conversation on Twitter about how Hadley uses the word “pleased” very often when introducing a new blog post (I couldn’t seem to find this tweet anymore. Can anyone help?)…
www.mytinyshinys.com
EPL Week 1
Round 1: Well, we’re back with an exciting start to the new season. Manchester United turned in their best performance in years and Huddersfield had a dream return to the top tier…
gcppodcast.com
Office of the CTO with Greg DeMichillie
Prior to joining Google, he had leadership roles at a variety of companies including Adobe and Amazon, as well as a decade at Microsoft where he was a developer on the first version of Visual C++, the development manager for Microsoft’s Java tools, and led the product team for the creation of…
wirtel.be
PythonFOSDEM 2018
Because I want to get ahead this year with the organization of PythonFOSDEM 2018, I have worked on the web site. The Call for Proposals will be announced once we have the “Go” from FOSDEM. So, the site is already prepared, and now I will wait for the notification from FOSDEM for the next devroom. I hope the answer will be positive…
www.tidyverse.org/articles
purrr 0.2.3
We are pleased to announce that purrr 0.2.3 is now on CRAN! Despite the small increment in the version number (our mistake, it should have been 0.3.0), this is a rather major release that includes many new features and bug fixes…
www.tidyverse.org/articles
Contributing code to the tidyverse
Contributing code to open source projects can be intimidating. These projects are often widely used and have well known maintainers…
livefreeordichotomize.com
The Exponential Power Series
I am a big fan of the Poisson distribution; there’s something about its simplicity and elegance (I really like \(\lambda\)s) that makes it way easier to deal with than some monstrosity like the gamma or normal distribution. Another thing I am a big fan of is the book I am currently reading: Surely You’re Joking, Mr Feynman…
livefreeordichotomize.com
Why you maybe shouldn't care about that p-value
Setup: Recently, there seems to have been an uptick in citations of studies or statistics about this or that in the news and on the internet. As a statistician I think this is great. Anytime we can start to bring objectivity into our assessments of situations it’s fantastic. This is not to say all data/stats are objective…
lenkiefer.com
The 100 second crisis
About 100 seconds into every talk I give there comes a crisis. This note is about that crisis and how I try to overcome it. If you’re in a situation where you give a presentation or speech (I call them “talks”), you might experience your own 100 second crisis. Perhaps my strategy could work for you…
thug-r.life
Automatic Twitter Updates
One of my goals with this site is to have as much as possible be automatic. Because it’s a static site, each time I write a post I have to copy all the files (new and updated) to the hosting location so you, the illustrious reader, can enjoy the content I create…
www.onceupondata.com
My First Steps into The World of Tidy Eval
The straightforward way was: We need to do the following: After getting acquainted with Tidy eval, I had other situations when I wanted to write general functions to avoid…
jvera.netlify.com
calculating date diffs per event with dplyr
Sometimes you’d need to calculate rolling date diffs from a vector or a data frame when you’re manipulating repeated events from different categories. It’s very easy and I’ll show you three different methods…
www.ifconfig.it/hugo
HP Procurve to Cisco switchport migration with Python and Netmiko
Summertime usually means a busy period for network engineers: customers are on holiday and we have the opportunity to perform all the changes that impact network connectivity. For me this usually means core switch replacement…
www.jamesuanhoro.com
On the interpretation of regression coefficients
TLDR: We should interpret regression coefficients for continuous variables as we would descriptive dummy variables, unless we intend to make causal claims. I am going to be teaching regression labs in the Fall, and somehow, I stumbled onto Gelman and Hill’s Data analysis using regression and multilevel/hierarchical models…
lenkiefer.com
Adding an animation progress bar
I LIKE TO MAKE ANIMATED GIFS with my data. But frankly, sometimes they can be confusing. I got a good suggestion via twitter in response to a gif I shared there: This is very cool…
www.mytinyshinys.com
EPL 2017/18
With the new Premier League season kicking off, I plan to produce plenty of posts over the coming months. I have yet to organize it all, but the idea is to present tables and charts without embedded…
nowosad.github.io
Geocomputation with R - the beginnings
The hard copy of the book will be published in late…
thug-r.life
Legislate by Executive Order
Executive Orders in the USA From Wikipedia: Executive orders are orders issued by United States Presidents and directed towards officers and agencies of the U.S. federal government…
vuorre.netlify.com
Sharing R products
Sharing R data (.rda) and R code (.R) files is becoming more common in the social sciences…
gcppodcast.com
Heroic Labs with Alim Jaffer and Mo Firouz
A member of the founding team, Alim joined Heroic Labs in 2016 as the VP of Product after having worked in startups focused in the games and health verticals. He is based in Vancouver, Canada and San Francisco…
www.rdatagen.net
Should we be concerned about incidence - prevalence bias?
What could explain this phenomenon? The title of this post provides a hint: cases earlier in a study are more likely to be prevalent ones (i.e. they have been sick for a while), whereas later in the study cases tend to be incident (i.e…
www.mytinyshinys.com
StatCan API's Discovered
My last post was a first look at StatCan data, which highlighted that there was a lot of data available but that it was not necessarily easily available or perfectly presented. Since then (and apologies where due), I have come across a couple of APIs. First off, StatCan do have a developers page, one of which provides access to hundreds of indicators in JSON…
livefreeordichotomize.com
How to make an R Markdown website (with RStudio!)
We’ve updated our R Markdown website tutorial to depend on RStudio for simplicity. You can find our previous version if you would rather not depend on RStudio…
giorasimchoni.com
Lambada! (The mocap Package)
In my continuing mission to explore strange new datasets, I’ve come across a little article analyzing data which came from… Samba dancing! The way the researchers turned Samba dancing into data was by capturing the motion of the dancers with Motion Capture (MoCap) technology, as often seen in “Behind the Scenes” segments of Andy Serkis…
jvera.netlify.com
How to navigate the world of R packages
One of the most powerful features of R is the vast package ecosystem. But there are more than 11,000 packages on CRAN alone, and thousands more if you take GIT and Bioconductor into account, so the issue is finding the proper package for your needs. There are a lot of sites to search for help if you need a hand choosing a package when programming in R…
lenkiefer.com
Plotting recent house price trends with R
IT IS TIME FOR AN UPDATE ON HOUSE PRICE TRENDS AROUND THE UNITED STATES. I have been experimenting with some new visualizations and updating some old favorites. Let’s collect them here. This post will be an extension of my Visual Meditations on House Prices series from last year…
thug-r.life
Theta Joins in R
Theta Joins Common Joins If you’re like me, you’re looking at that diagram wondering what the heck is a ‘theta’ join?! Well, that’s the formal name for a ‘fuzzy’ join. In fact, you can read about all the different join types in this StackOverflow answer…
blog.mgechev.com
Developing Statically Typed Programming Language
In this blog post we’ll go through a sample implementation of a type checker, interpreter and a transpiler for a basic purely functional programming language, which is based on the lambda calculus…
www.mytinyshinys.com
Bournemouth to bash on?
I have plans to hive off the soccer/football section to a separate blog, as many sports readers will not be interested in code, but until then, here is my latest effort. As the new season is now upon us it was time to post a new…
emil.tbjerglund.dk
Combining R and Python for data analysis
The main drawback of doing it this way is that I am losing out on the interactive explorative tools included in HyperSpy…
lenkiefer.com
Joyswarm
I LOOK AT A LOT OF DATA and the thing about data is it’s not always what it seems to be. A lot of data are uncertain, and based on estimates. We’ve talked about this before…
vuorre.netlify.com
SIPS 2017 Conference Tweets
The timeline of tweets reveals that SIPSers were most active on Twitter on Monday, and there’s a clear spike of activity at 6pm on Monday. Here are the top 5 SIPS non-retweets. Here are the top hashtags people used with the SIPS hashtag…
purrple.cat/blog
Unicode, utf-8, strings and emojis
So as of now, Unicode has 82719 runes; that’s way more than the 256 that can fit into a single byte (8 bits). However, we still want to be able to process text from back in the days when strings were in fact arrays of single bytes…
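As a small illustration of the point above, the sketch below (in Python, not the R used on the original blog) shows how UTF-8 spends one byte on ASCII characters and more bytes on everything else, which is how it stays compatible with old single-byte text. The sample characters are arbitrary choices for illustration.

```python
# Arbitrary sample characters: ASCII, Latin accented, currency sign, emoji
samples = ["a", "é", "€", "😀"]

# Number of bytes each character needs once encoded as UTF-8
utf8_lengths = {ch: len(ch.encode("utf-8")) for ch in samples}

for ch, n_bytes in utf8_lengths.items():
    print(f"U+{ord(ch):04X} {ch!r}: {n_bytes} byte(s)")

# Plain ASCII text is byte-for-byte identical in UTF-8
ascii_roundtrip = "abc".encode("utf-8") == b"abc"
```

The variable width is the design trade-off: old single-byte ASCII data remains valid UTF-8, at the cost of characters no longer having a fixed size in bytes.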
gcppodcast.com
Kubernetes 1.7 with Tim Hockin
Tim was one of the first engineers on Kubernetes and GKE, where he has been involved in things like networking, storage, node management, API, plugins, and more…
magesblog.com
Notes from the 5th R in Insurance Conference
The fifth R in Insurance conference took place at Ecole Nationale de la Statistique et de l’Administration Economique, Paris on 8 June 2017…
www.mytinyshinys.com
Pimping the StatCan Blog
Statistics Canada make available masses of useful data from Censuses and Surveys. One way they communicate is via twitter e.g…
www.njtierney.com
A guide to awesome slides
I think that good design of slides is super important, and this slidedeck does a really great job of discussing good principles in a digestible manner, stating that the goal of a presentation is: To allow your audience to absorb your…
jvera.netlify.com
Docker for R, first steps
Sometimes, you need to replace your hard disk or upgrade your OS, or you have recently bought a new laptop. When this time comes, it’s a real pain to get your RStudio up and running with all your favourite packages and settings…
giorasimchoni.com
Data Paintings (The kandinsky Package)
Recently I have moaned about not really knowing what I was doing with the grid package (see here and here). I’m happy to say, not only did I take the time to better understand the grid package, I also wrote my own package around it - the kandinsky package! To generate random Wassily Kandinsky paintings or even make any dataset into one…
www.tidyverse.org/articles
Welcome!
As well as radically improved graphic design, the site includes four new components: This blog! Expect to see regular updates on what’s happening in the tidyverse…
purrple.cat/blog
A fortune slack slash command using plumber
Here is the canonical example: So that we can support a variety of use…
ewen.io
Advocating for Open Policing with sparklyr and Shiny
Better monitoring of police activity with implementations of Spark and…
lenkiefer.com
Mortgage Rate Kandinsky
THINGS ARE ABOUT TO GET A BIT MORE ABSTRACT IN THIS SPACE. Today we make some Kandinsky-style images with R. This summer I was fortunate to spend some time at the Pompidou Centre in Paris. The Pompidou Centre houses the largest collection of modern art in Europe…
livefreeordichotomize.com
New and improved draw charts in shinysense
Background I didn’t expect many people to use the you-draw-it charts (henceforth referred to as ‘drawrs’) I put into my shiny modules package shinysense but a decent number did, and with that usage came… bugs…
www.mytinyshinys.com
When Babe Ruth met Kandinsky
When Babe Ruth first retired from baseball he found it tough…
jvera.netlify.com
janitor, a good R package for Data Wrangling
We all know the many hours spent cleaning and wrangling data. Sometimes I think my actual job is not “Data Scientist” but “Data Cleaner”…
gcppodcast.com
Customer Engineers with Jonathan Cham
Jonathan Cham is a customer engineer who helps customers be successful on the Google Cloud Platform…
lenkiefer.com
Joyful dataviz
I TOOK SOME TIME OFF OVER THE SUMMER, away from data visualizations. It’s good to get away from time to time, but oh boy did I miss out. I wasn’t gone long, but in the short time I was gone people came up with some wonderful things. Let me dive back into it with some joyful dataviz…
www.ifconfig.it/hugo
Networkcareer.net interview
My interview was published today on Networkcareer.net, the latest project from Daniel Dib and Kim Pedersen…
ritsokiguess.site/docs
A brief foray into list-columns
Introduction Let’s suppose we are trying to find the median of a bunch of binomial distributions…
lbusett.netlify.com
The new MODIStsp website (based on pkgdown) is online!
We hope that the revised website will allow users to navigate MODIStsp-related material much more easily than either GitHub or the standard CRAN documentation, and will therefore help users in better and more easily exploiting MODIStsp…