This document summarizes the history and ongoing development of MLlib, Spark's machine learning library. MLlib was initially developed by the MLbase team in 2013 and has since grown significantly with over 80 contributors. It provides algorithms for classification, regression, clustering, collaborative filtering, and linear algebra/optimization. Recent improvements include new algorithms like random forests, pipelines for simplified ML workflows, and continued performance gains.