While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Bar charts: so popular, so abused

Martina Pugliese

Useful tips for creating better bar charts.

Read it!

Prediction intervals for Random Forests

Ando Saabas

Prediction intervals are commonly used for linear models but are often underused for random forests. Leveraging the fact that a random forest can provide a conditional distribution instead of just the conditional mean makes prediction intervals relatively straightforward to use in this context.

Read it!

How much data should you allocate to training and validation?

Francesco Pochetti

To avoid responding with "that's what Andrew NG said" when asked about the reason behind choosing an 80% training and 20% validation split, consider this explanation.

Read it!