While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Song Lyrics Across the United States

Julia Silge

An analysis of the frequency of US state names in song lyrics of Billboard's Year-End Hot 100 from 1958 to 2015.

Read it!

From both sides now: the math of linear regression

Katherine Bailey

A journey starting from the standard formulation of linear regression, moving on to the probabilistic approach, and then progressing to Bayesian linear regression.

Read it!

Writing Robust Tests for Data & Machine Learning Pipelines

Eugene Yan

An in-depth analysis of why certain types of tests break more frequently than others, along with suggestions for creating more robust pipeline tests.

Read it!