This document discusses distributing large-scale machine learning algorithms from GPUs to the cloud. It outlines Netflix's use of machine learning algorithms to power recommendations, describes how their data and needs have grown enormously over time, and discusses different levels of distributing and parallelizing machine learning algorithms in practice based on whether all the data is needed, what level of parallelization is appropriate, and what latency can be afforded.