Posts List

Baby Names, Revisited

I often get asked by co-op students at work about how they can get started with using R. While sites like Kaggle are great for finding lots of datasets and entering competitions to see how many tenths of a point you can extract from your model, my advice to those starting it is to pick a topic or question that actually interests you. It’s a hundred times easier to do an analysis on something that you’ve been pondering than on fifty columns of anonymized, standardized numbers.

A/B Testing Overview

Overview and Definitions The purpose of A/B testing is to determine through the use of statistical methods whether an experiment generates enough of a practically significant effect to support implementation. This is not as simple as seeing if the rates of two different groups are different, because of the inherent randomness in sampling from a population. Consider this toy example: library(scales) set.seed(1234) pop_1 <- rnorm(100, 0, 1) pop_2 <- rnorm(100, 0, 1) paste("The mean of pop_1 is: ", comma(mean(pop_1))) ## [1] "The mean of pop_1 is: -0.