This document discusses converting Scikit-Learn machine learning pipelines to PMML (Predictive Model Markup Language) format. Key points include:
- Scikit-Learn pipelines can be serialized to PMML, allowing models to be deployed anywhere that supports PMML.
- PMML represents the fitted pipeline using standardized data structures, including feature and target field definitions.
- The sklearn2pmml Python library converts Scikit-Learn pipelines to PMML. It handles feature engineering, selection, estimator fitting, and model customization.
- Hyperparameter tuning and algorithm selection tools like GridSearchCV and TPOT can also have their best pipelines exported to PMML.