AutoML - Automated Machine Learning
Quoc Le
@quocleix
Joint work with many colleagues in the Google Brain team
Current:
Solution =
ML Expertise + Data + Computation
Current:
Solution =
ML Expertise + Data + Computation
But can we turn this into:
Solution =
Data + 100X Computation
Data processing
Machine
Learning Model
Data
Data processing
Machine
Learning Model
Data
Importance of architectures for Vision
● Designing neural network architectures is hard
● Lots of human efforts go into tuning them
● Can we try and learn good architectures automatically?
Canziani et al, 2017
Convolutional Architectures
Krizhevsky et al, 2012
Neural Architecture Search
● We can specify the structure and connectivity of a neural network by a
configuration string
○ [“Filter Width: 5”, “Filter Height: 3”, “Num Filters: 24”]
● Use a RNN (“Controller”) to generate this string (the “Child Network”)
● Train the Child Network to see how well it performs on a validation set
● Use reinforcement learning to update the Controller based on the accuracy of
the Child Network
Controller: proposes Child Networks Train & evaluate Child Networks
20K
times
Iterate to
find the
most
accurate
Child
Network
Controller: proposes Child Networks Train & evaluate Child Networks
20K
times
Iterate to
find the
most
accurate
Child
Network
Reinforcement Learning
or Evolution Search
Neural Architecture Search for Convolutional Networks
Controller RNN
Softmax classifier
Embedding
Neural Architecture Search for CIFAR-10
Performance of cell on ImageNet
Platform aware Architecture Search
Platform aware Architecture Search
Controller: proposes Child Networks Train & evaluate Child Networks
20K
times
Iterate to
find the
most
accurate
Child
Network
Reinforcement Learning
or Evolution Search
Architecture / Optimization Algorithm /
Nonlinearity
Confidential + Proprietary
Confidential + Proprietary
Confidential + Proprietary
Strange hump
Basically
linear
Confidential + Proprietary
Mobile NASNet-A on ImageNet
Data processing
Machine
Learning Model
Data
Focus of machine
learning research
Data processing
Machine
Learning Model
Data
Focus of machine
learning research
Very important but
manually tuned
Data Augmentation
Controller: proposes Child Networks Train & evaluate Child Networks
20K
times
Iterate to
find the
most
accurate
Child
Network
Reinforcement Learning
or Evolution Search
Architecture / Optimization Algorithm /
Nonlinearity / Augmentation Strategy
AutoAugment: Example Policy
Probability of applying
Magnitude
CIFAR-10
State-of-art: 2.1% error
AutoAugment: 1.5% error
ImageNet
State-of-art: 3.9% error
AutoAugment: 3.5% error
Controller: proposes Child Networks Train & evaluate Child Networks
20K
times
Iterate to
find the
most
accurate
Child
Network
Reinforcement Learning
or Evolution Search
Architecture / Optimization Algorithm /
Nonlinearity / Augmentation Strategy
Summary of AutoML and its progress
CIFAR-10
AutoML
Accuracy
ML Experts
ImageNet
AutoML
Top-1Accuracy
ML Experts
References
● Neural Architecture Search with Reinforcement Learning. Barret Zoph and Quoc
V. Le. ICLR, 2017
● Learning Transferable Architectures for Large Scale Image Recognition. Barret
Zoph, Vijay Vasudevan, Jonathon Shlens, Quoc V. Le. CVPR, 2018
● AutoAugment: Learning Augmentation Policies from Data. Ekin D. Cubuk, Barret
Zoph, Dandelion Mane, Vijay Vasudevan, Quoc V. Le. Arxiv, 2018
● Searching for Activation Functions. Prajit Ramachandran, Barret Zoph, Quoc Le.
ICLR Workshop, 2018

Quoc Le at AI Frontiers : Automated Machine Learning

Editor's Notes

  • #3 Many different algorithms could be used, in this we used the basic REINFORCE Goal is to make the Controller sampler better and better architectures over time
  • #4 Many different algorithms could be used, in this we used the basic REINFORCE Goal is to make the Controller sampler better and better architectures over time
  • #7 Bayesian optimization scales cubically, and can only work over fixed length spaces Evolutionary algorithms
  • #8 Bayesian optimization scales cubically, and can only work over fixed length spaces Evolutionary algorithms
  • #10 Many different algorithms could be used, in this we used the basic REINFORCE Goal is to make the Controller sampler better and better architectures over time
  • #11 Many different algorithms could be used, in this we used the basic REINFORCE Goal is to make the Controller sampler better and better architectures over time
  • #12 Each prediction is carried out by a softmax classifier Example of a convolutional network
  • #13 Mention the networks are much smaller than any of the other networks on the list
  • #17 Many different algorithms could be used, in this we used the basic REINFORCE Goal is to make the Controller sampler better and better architectures over time
  • #25 Many different algorithms could be used, in this we used the basic REINFORCE Goal is to make the Controller sampler better and better architectures over time
  • #28 Many different algorithms could be used, in this we used the basic REINFORCE Goal is to make the Controller sampler better and better architectures over time