While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Prediction intervals for Random Forests

Ando Saabas

Prediction intervals are commonly used for linear models but are often underused for random forests. Leveraging the fact that a random forest can provide a conditional distribution instead of just the conditional mean makes prediction intervals relatively straightforward to use in this context.

Read it!

What Should Data Scientists Learn?

Peter Baumgartner

Asking 'What happens before I do my job?' and 'What happens after I do my job?' can help with choosing what to learn next.

Read it!

Text analysis of Trump's tweets confirms he writes only the (angrier) Android half

David Robinson

An analysis of tweets from Donald Trump and his staff during the 2016 US Election campaign.

Read it!