While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Data Cleaning IS Analysis, Not Grunt Work

Randy Au

"The act of cleaning data is the act of preferentially transforming data so that your chosen analysis algorithm produces interpretable results. That is also the act of data analysis."

Read it!

The Bitter Lesson

Richard Sutton

"The biggest lesson that can be read from 70 years of AI research is that general methods that leverage computation are ultimately the most effective, and by a large margin."

Read it!

Understanding the beta distribution (using baseball statistics)

David Robinson

"The beta distribution is best for representing a probabilistic distribution of probabilities- the case where we don’t know what a probability is in advance, but we have some reasonable guesses."

Read it!