While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Why Correlation Usually ≠ Causation

Gwern

"Despite this admonition, people are overconfident in claiming correlations to support favored causal interpretations and are surprised by the results of randomized experiments, suggesting that they are biased & systematically underestimate the prevalence of confounds / common-causation."

Read it!

Song Lyrics Across the United States

Julia Silge

An analysis of how often US state names appear in the lyrics of songs on Billboard's Year-End Hot 100, from 1958 to 2015.

Read it!

Finding bad flamingo drawings with recurrent neural networks

Colin Morris

Using Sketch-RNN as a probability estimator to find the worst flamingo sketches in 'Quick, Draw!'.

Read it!