This document summarizes a presentation about Apache Hivemall, a scalable machine learning library for Apache Hive, Spark, and Pig. Hivemall provides machine learning functions that are invoked from queries and run in parallel on large datasets. It supports a range of classification, regression, recommendation, and clustering algorithms. The presentation outlines Hivemall's capabilities, how it works with big data platforms such as Hive and Spark, and its ongoing development, including new features such as XGBoost integration and generalized linear models.