Experimental data Today I want to talk about two-sample hypothesis testing, or A/B testing in data science parlance. This topic has been on my mind a lot as of late because I’m planning on leaving academia for data science, and that means I may also be transitioning from analyzing survey data to running experiments on unstructured, “big” data. Because academia and business have different goals and resources available to them, the methods they use are also different, even when approaching a similar problem.
This week I’m not doing a data analysis project so much as a data cleaning project. One of the most common problems I come across in data cleaning is how to get summary statistics for various groups in the data. And it’s also one of the most annoying problems, because I invariably forget how to do it, and end up having to go back to old code to copy and paste.
Since moving to San Francisco at the end of February, I have been driving and walking around town, trying to get to know my new city. And even though I’ve only been here a few weeks, I already feel affinity1 for my own neighborhood. Which is good, because I’m also too lazy to leave it. I also learned recently that you can download your location history via google. In the spirit of discovering my new city and possibly also discovering something about my (homebody) self, today I am going to make a map of where I have been since moving here three weeks ago.
Last week was crazy because we moved from the Netherlands to San Francisco, and jetlag and babies don’t go well together. When your child wakes up at 11:30pm and doesn’t fall back asleep until 2am is exactly when you realize how difficult it must be to be a single parent. Father involvement And of course, it’s not only single parents who have full responsibility of staying up with the baby when she’s not sleeping.
Blogdown This week was the very first meeting of the R-Ladies Rotterdam chapter. Ingrid Szilagyi was the driving force behind starting an R-ladies chapter in the Netherlands, and I had the pleasure of working with her, the Py, R, and all things tech-Lady Elena He, and the uber-experienced Paloma Rojas Saunero, formerly a Buenos Aires R-Lady. At the meetup Ingrid gave an introduction to R-ladies, Elrozy Randrinopoulou gave one of the best intro to R talks I’ve ever seen, and I talked about using blogdown to harness the power of Rmarkdown for publishing to the web.