Automated Hyperparameter Tuning, Scaling and Tracking

Automated
Hyperparameter Tuning
June 20th, 2019

Logistics
• We can’t hear you…
• Recording will be available…
• Slides will be available…
• Code samples and notebooks will be available…
• Queue up Questions…
• Bookmark databricks.com/blog

About our speakers
Yifan Cao, Sr. Product Manager, Machine Learning at Databricks
• Product Area: ML/DL algorithms and Databricks Runtime for Machine
Learning
• Built and grew two ML products to multi-million dollars in annual
revenue
• B.S. Engineering from UC Berkeley; MBA from MIT
Joseph Bradley, Software Engineer, Machine Learning at Databricks
• Apache Spark PMC member
• Postdoc at UC Berkeley
• Ph.D. in Machine Learning from Carnegie Mellon

Accelerate innovation by unifying data science,
engineering and business
• Original creators of
• 2000+ global companies use our platform across big
data & machine learning lifecycle
VISION
WHO WE
ARE
Unified Analytics PlatformSOLUTION

DATA
ENGINEERS
x
Data & ML Tech and People are in Silos
DATA
SCIENTISTS

Hiring Data Scientists is a Key Blocker

“My team needs to build 100+
models this year, but it has
only got to 20%.”

What is Automated ML (AutoML)?
● Excel-like tool that enables anyone
to do machine learning
● Productivity tools for
data scientists

Raw Data
Model
Exploration
Feature
Engineering
ETL
Model
Scoring
Hyperparam
eter Tuning
Alerting &
Monitoring
Cross
Validation
Where does AutoML fit on Databricks?
DATA
ENGINEERS
DATA
SCIENTISTS
AutoML

Great Training
AutoML on Databricks (1/3)
AutoML librariesUSER CONTROL

Watch it now >
https://dbricks.co/zynga
Custom Solution: Zynga
Automating Predictive Modeling at Zynga with Pandas UDFs

Great Training
AutoML libraries
PartnershipsAUTOMATION
USER CONTROL

Databricks
ETL & ML
Databricks
ML Test & Model
Enable data scientists and citizen data scientists to accelerate and scale
the development and delivery of predictive models.
Run and deploy ML
models at Scale
14
Databricks and DataRobot Integration
Watch it now >
https://dbricks.co/datarobot

Great Training
AutoML libraries
Partnerships
Hyperopt
AUTOMATION
USER CONTROL
AUTOMATION +
CONTROL
Integrations MLlib
Today's Content

Great Training
A simple analogy
Manual Transmission
Semi AutonomousAUTOMATION
USER CONTROL
AUTOMATION +
CONTROL
Automatic Transmission
Today's Content

Use Case #1: Hyperparameter Tuning
Model
Exploration
Feature
Engineering
Model
Scoring
Hyperparam
eter Tuning
Alerting &
Monitoring
Cross
Validation
Scenarios:
● Automated hyperparameter search to select models after cross validation
● Automated hyperparameter search to optimize models in production
Our Oﬀerings:
● Distributed Hyperopt + Automated MLflow Tracking
Raw Data ETL

Use Case #2: Model Search
Model
Exploration
Feature
Engineering
Model
Scoring
Hyperparam
eter Tuning
Alerting &
Monitoring
Cross
Validation
Scenarios:
● Automated model search by exploring diﬀerent combinations of featuresets, algos,
hyperparameters
● Automated model search by extending a baseline model to 1000+ custom models
Our Oﬀerings:
● MLlib + Automated MLflow Tracking
● Distributed Hyperopt + Automated MLflow Tracking, with conditional hyperparameter tuning
Raw Data ETL

Scenarios:
● Automated end-to-end Machine Learning model generation pipelines incorporating
customer-specified logics
Our Oﬀerings:
● Leverage existing Databricks internal tools & frameworks on top of Databricks Runtime
ML
Use Case #3: End-to-end ML Pipeline
Model
Exploration
Feature
Engineering
Model
Scoring
Hyperparam
eter Tuning
Alerting &
Monitoring
Cross
Validation
Raw Data ETL

Hyperparameters
Express high-level concepts, such as statistical assumptions
E.g.: regularization
Are fixed before training or are hard to learn from data
E.g.: neural net architecture
Aﬀect objective, test time performance, computational cost
E.g.: # iterations or epochs

Tuning hyperparameters
E.g.: Fitting a
polynomial
Common goals:
• More flexible modeling process
• Reduced generalization error
• Faster training
• Plug & play ML

Challenges in tuning
Curse of dimensionality
Non-convex optimization
Computational cost
Unintuitive hyperparameters

Data prep: train-validation-test splits
Data

Training Data Test Data
ML Model

Training
Data
Validation
Data
Test Data
Final
ML Model
ML Model 1
ML Model 2
ML Model 3

A practical definition of tuning
ML Model
Featurization
Model family
selection
Hyperparameter
tuning
Parameters: configs which your ML library learns from data
Hyperparameters: configs which your ML library does not learn from data

Overview of tuning methods
•Manual search
•Grid search
•Random search
•Population-based algorithms
•Bayesian algorithms

Manual search
Select hyperparameter settings to try based on human intuition.
2 hyperparameters:
•[0, ..., 5]
•{A, B, ..., F}
A B C D E F
0
1
2
3
4
5
Expert knowledge tells us to try:
(2,C), (2,D), (2,E), (3,C), (3,D), (3,E)

Grid Search
Try points on a grid defined by ranges and step sizes
X-axis: {A,...,F}
Y-axis: 0-5, step = 1
A B C D E F
0
1
2
3
4
5

A B C D E F
0
1
2
3
4
5
Random Search
Sample from distributions over ranges
X-axis: Uniform({A,...,F})
Y-axis: Uniform([0,5])

Start with random search, then iterate:
•Use the previous “generation” to
inform the next generation
•E.g., sample from best performers &
then perturb them
Population Based Algorithms
A B C D E F
0
1
2
3
4
5

Model the loss function:
Hyperparameters ⇒ loss
Iteratively search space, trading oﬀ
between exploration and exploitation
A B C D E F
0
1
2
3
4
5
Bayesian Optimization

Get samples: Test new points in
hyperparameter space
A B C D E F
0
1
2
3
4
5

A B C D E F
0
1
2
3
4
5
Get samples: Test new points in
hyperparameter space
Update model of space:
Hyperparameters ⇒ loss

Comparing tuning methods
Iterative /
adaptive?
# evaluations
for P params
Model of
param space
Grid search No O(c^P) none
Random search No O(k) none
Population-based Yes O(k) implicit
Bayesian Yes O(k) explicit

Open-source tools for tuning
Grid
search
Random
search
Population
-based
Bayesian PyPi
downloads
last month
Github
stars
License
scikit-learn Yes Yes --- --- BSD
MLlib Yes --- --- Apache 2.0
scikit-opti
mize
Yes 49,189 1,278 BSD
Hyperopt Yes Yes 98,282 3,286 BSD
DEAP Yes 26,700 2,789 LGPL v3
TPOT Yes 9,057 5,609 LGPL v3
GPyOpt Yes 4,959 451 BSD
As of mid-April 2019

MLflow Overview
42
Tracking
Record and query
experiments: code,
data, config, results
Projects
Packaging format
for reproducible runs
on any platform
Models
General model format
that supports diverse
deployment tools
mlflow.org github.com/mlflow twitter.com/MLflowdatabricks.com/mlflow

Organizing with
Training Data Validation Data Test Data
Final ML ModelML Model 1
ML Model 2
ML Model 3
Experiment
Main run
Child runs
Tip: Tune full pipeline, not 1 model.

Instrumenting tuning with
MLflow concepts for tracking runs
Params: hyperparameters
Metrics: training & validation, loss & objective, multiple objectives
Tags: provenance, simple metadata
Artifacts: serialized model, large metadata

Analyzing how tuning performs
Questions to answer
• Am I tuning the right hyperparameters?
• Am I exploring the right parts of the search space?
• Do I need to do another round of tuning?
Examining results
• Simple case: visualize param vs metric
• Challenges: multiple params and metrics, iterative experimentation

Auto-tracking MLlib with
Training Data Validation Data Test Data
Final ML ModelML Model 1
ML Model 2
ML Model 3
Experiment
Main run
Child runs
In Databricks
• CrossValidator &
TrainValidationSplit
• 1 run per setting of
hyperparameters
• Avg metrics for CV folds(demo)

Hyperopt
Hyperparameter tuning in Python ML workflows
● Usable with any Python ML library
● Tuning algorithms:
○ Random search
○ Bayesian (Tree of Parzen Estimators)
● Open source (3-clause BSD license)
https://github.com/hyperopt/hyperopt

Distribute tuning across Spark clusters
● Each Spark task trains & evaluates 1 model (hyperparameter setting)
○ Applicable to single-machine ML workloads
● Via new SparkTrials plugin
● Contributing to open source Hyperopt:
github.com/hyperopt/hyperopt/pull/509
With automated MLflow tracking in Databricks
Available now in Databricks Runtime 5.4 ML
Hyperopt on Apache Spark
(demo)

Related Content
Blog:
• Hyperparameter Tuning with MLflow,
Apache Spark MLlib and Hyperopt
Webinar:
• How to Automate Machine Learning and
Scale Delivery
Tutorials
● Hyperparameter Tuning Documentation
● MLflow integrations with H20.ai GPyOpt,
HyperOpt
Notebooks
● MLlib + Automated MLflow Tracking
● Distributed Hyperopt + Automated MLflow
Tracking
● Basic Introduction to DataRobot via API
Videos
● Automating Predictive Modeling at Zynga
with PySpark and Pandas UDFs
● Best Practices for Hyperparameter Tuning
with MLflow
● Advanced Hyperparameter Optimization
for Deep Learning with MLflow

Getting started
MLflow
Managed MLflow
Generally Available in
Databricks
MLlib + automated
MLflow tracking
Public preview in
Databricks Runtime 5.4
& 5.4ML
Distributed Hyperopt
+ automated MLflow
tracking
Public preview in
Databricks Runtime 5.4ML
https://docs.databricks.com/spark/latest/mllib/index.html#hyperparameter-tuning
https://docs.azuredatabricks.net/spark/latest/mllib/index.html#hyperparameter-tuning
https://mlflow.org/

Automated Hyperparameter Tuning, Scaling and Tracking

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Automated Hyperparameter Tuning, Scaling and Tracking

Similar to Automated Hyperparameter Tuning, Scaling and Tracking (20)

More from Databricks

More from Databricks (20)

Recently uploaded

Recently uploaded (20)

Automated Hyperparameter Tuning, Scaling and Tracking