While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

A few reasons to be skeptical of machine learning

Julia Evans

"why, even though machine learning is really awesome and cool and you can do super powerful and interesting things with it – why you should still be skeptical"

Read it!

Prediction intervals for Random Forests

Ando Saabas

Prediction intervals are standard practice for linear models but are rarely used with random forests. Because a random forest can provide a conditional distribution rather than just the conditional mean, prediction intervals are relatively straightforward to compute in this context.
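As a rough illustration of the idea, the sketch below turns the individual tree predictions for a single sample into an interval by taking percentiles of their spread. The function names (`percentile`, `prediction_interval`) and the `coverage` parameter are hypothetical and not taken from the post. It also assumes the trees are grown deep enough that each tree's output behaves like a draw from the conditional distribution; with shallow trees, averaging inside the leaves would likely make the interval too narrow.

```python
def percentile(sorted_values, q):
    """Linear-interpolation percentile of a pre-sorted list, q in [0, 100]."""
    pos = (len(sorted_values) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] * (1 - frac) + sorted_values[hi] * frac


def prediction_interval(tree_predictions, coverage=0.90):
    """Return (lower, upper) bounds from per-tree predictions for ONE sample.

    tree_predictions: one number per tree in the forest (hypothetical input;
    with scikit-learn these could be collected by calling predict() on each
    fitted tree in a forest's estimators_ list).
    coverage: nominal share of the per-tree spread the interval should span.
    """
    if not 0 < coverage < 1:
        raise ValueError("coverage must be strictly between 0 and 1")
    if not tree_predictions:
        raise ValueError("need at least one tree prediction")
    values = sorted(tree_predictions)
    tail = (1 - coverage) / 2 * 100  # percent cut off in each tail
    return percentile(values, tail), percentile(values, 100 - tail)


if __name__ == "__main__":
    # Made-up per-tree predictions for a single sample, for illustration only.
    preds = [21.0, 19.5, 24.0, 22.5, 20.0, 23.0, 18.5, 25.5, 21.5, 22.0]
    low, high = prediction_interval(preds, coverage=0.80)
    print(f"80% interval: [{low:.2f}, {high:.2f}]")
```

The forest's usual point prediction is the mean of the same per-tree values, so the interval comes at almost no extra cost once the individual tree outputs are available.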

Read it!

The Four Jobs of the Data Scientist

Roger Peng

For each "Data Analytic Iteration," you need to embody the roles of a scientist, statistician, systems engineer, and politician.

Read it!