MLlib is the machine learning component of Apache Spark. It provides algorithms for tasks such as classification, regression, clustering, and collaborative filtering. MLlib 1.0 introduced new features such as decision trees, sparse data support, and distributed matrices. Future releases will focus on standardizing interfaces, adding more algorithms such as non-negative matrix factorization (NMF) and latent Dirichlet allocation (LDA), and integrating more closely with other Spark components. MLlib is developed as an open source project with contributions from many individuals and companies.
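
To make the sparse data support mentioned above more concrete, here is a minimal sketch in plain Python of the general idea behind a sparse vector: only the nonzero entries are stored, as parallel arrays of indices and values, alongside the vector's size. This is an illustration only and not MLlib's implementation; the class name `SparseVec` and its methods are hypothetical and do not correspond to any Spark API.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class SparseVec:
    """Hypothetical sketch: a vector stored as (size, indices, values)."""

    size: int
    indices: Sequence[int]    # strictly increasing positions of stored entries
    values: Sequence[float]   # values[k] is the entry at position indices[k]

    def __post_init__(self) -> None:
        # Validate the representation so later operations can trust it.
        if len(self.indices) != len(self.values):
            raise ValueError("indices and values must have the same length")
        if any(i < 0 or i >= self.size for i in self.indices):
            raise ValueError("index out of range")
        if any(a >= b for a, b in zip(self.indices, self.indices[1:])):
            raise ValueError("indices must be strictly increasing")

    def dot(self, dense: Sequence[float]) -> float:
        """Dot product with a dense vector, touching only stored entries."""
        if len(dense) != self.size:
            raise ValueError("dimension mismatch")
        return sum(v * dense[i] for i, v in zip(self.indices, self.values))

    def to_dense(self) -> List[float]:
        """Expand to a plain list, filling unstored positions with 0.0."""
        out = [0.0] * self.size
        for i, v in zip(self.indices, self.values):
            out[i] = v
        return out


# A length-5 vector with nonzeros at positions 1 and 3.
v = SparseVec(5, [1, 3], [2.0, 4.0])
print(v.to_dense())
print(v.dot([1.0, 1.0, 1.0, 1.0, 1.0]))
```

The dot product costs time proportional to the number of stored entries rather than the full dimension, which is the main reason sparse representations matter for high-dimensional, mostly-zero feature vectors.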