KeystoneML is a software framework for building scalable machine learning pipelines. It provides tools for data loading, feature extraction, model training, and evaluation that work across multiple domains like computer vision, NLP, and speech. Pipelines built with KeystoneML can achieve state-of-the-art results on large datasets using modest computing resources. The framework is open source and available on GitHub.