Hivemall is a scalable machine learning library built as a collection of Hive UDFs. It was accepted into the Apache Incubator in September 2016. Hivemall can be used across multiple platforms like Hive, Spark, Pig, and is designed to be easy to use, versatile, and scalable for large datasets. It allows SQL developers to perform machine learning tasks in a parallel and scalable way on Hadoop clusters.