But what if the key to serving your users best depends in large part upon training a machine learning algorithm? What if that ML algorithm needs a massive training dataset? In an age when machine learning is in its ascendancy, this is increasingly a critical design objective.

Seeing Like an Algorithm | Remains of the Day

This an older article — and long! — but if you do any kind of modeling to better understand the preferences of your users, it is well worth the read.