While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Song Lyrics Across the United States

Julia Silge

An analysis of the frequency of US state names in song lyrics of Billboard's Year-End Hot 100 from 1958 to 2015.

Read it!

Why is machine learning 'hard'?

S. Zayd Enam

"The difficulty is that machine learning is a fundamentally hard debugging problem. Debugging for machine learning happens in two cases: 1) your algorithm doesn't work or 2) your algorithm doesn't work well enough."

Read it!

Stop Hard Coding in a Data Science Project – Use Config Files Instead

Khuyen Tran

In a data science project, certain values such as file names, train-test split ratios, and hyperparameters often undergo frequent changes. By using configuration files instead of hard-coding these values, you can achieve better maintainability and flexibility.

Read it!