NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo

Evolution Strategies
An Alternative Approach to AI

What is ES (and who uses it)?
● Class of heuristic search algorithms for black-box optimization
● Closely related to genetic algorithms (GA)
● Several groups have made serious commitments to developing additional
variants:

Why should I care?
Organizations have yet to adopt reinforcement learning, and as such are leaving
billions of dollars of value on the table
ES presents a perfect excuse to tackle serious problems with RL
It’s no panacea, but hard optimization problems become much easier
DRL is heavy and esoteric; ES is lightweight and intuitive
Once ES becomes explicable, then it’s game over

What do I use ES for?
● Algorithmic trading (c’mon, you knew that was going to come up)
● Token incentives in blockchain applications
● Opportunity cost problems in investment and staking
● Training one-shot computer vision models
● Modeling causal trends in property markets (2016-2017)

Pros
● Gradient-free
● Can run on arbitrary numbers
of CPUs, no GPUs needed
● Can optimize mixed-type
models
● Can handle non-convex
problems
● Conveniently sidesteps
exploding gradients
● Easy to extend/modify
Cons
● Less data-efficient
● No guarantee of convergence
● Still a black box*
● Without modifications, does
poorly with hard constraints as
reward cutoffs
● Can break down with highly
complex/multimodal solutions
● Can be much slower in typical
supervised learning settings

Core Procedure
Credit: OpenAI, https://arxiv.org/abs/1703.03864

2D visualization
Image credit: OpenAI

DRL ES
● Optimizing arbitrary parameters
for a given task
● Largely agnostic to underlying
model structure
● Can implement virtual batch
schemes, but not required for
good performance
● Value function is implicit
● Parameters are smooshed into
1D array
● Optimizing network(s) for a
given task, typically at
episode-level
● Highly sensitive to batch size
and training scheme
● Requires estimation of value
function
● Parameters are grouped by
component/layer

How to gray the black box?
● Feature importance / parameter saliency still a challenge
● Tried Bayesian attribution: too sparse, and still get artifacts from discretization
● One way that works:
○ Calculate MI(x_i’, y’)
○ x_i’ is Δε
○ y’ is change in reward for candidate solution
● For pairwise and partition-wise attribution, do partial information
decomposition in hierarchical fashion
● No free lunch: Attribution computation is expensive

PRESS
● PRincipled Evolution StrategieS (in keeping with weird acronym structures)
● Saliency attribution at the optimizer level
● Short version: find which parameters are important and which ones aren’t
● Long version: Use information-theoretic measures to attribute reward
saliency to parameter perturbations
● Step 1) estimate parameter saliency after N iterations
Step 2) apply saliency estimates as normalized prior in perturbation-sampling
step
● Bears similarity to integrated gradients, guided ES
● Essentially an approach to estimating the partial derivative ∂y/∂x_i
○ Where y is reward, x_i is parameter of a model such as a weight in a neural net, mean of a
probability distribution, etc.

Summary
● For RL problems, using ES solvers confers a number of benefits
● Where problems are mixed-mode and/or gradients are difficult to to evaluate,
ES shines
● Getting away from GPU dependency is especially nice - less $$, for one
● Impact on industry: Compared to other RL methods, ES is easier to apply,
faster, and often yields comparable or even better results. This makes it
easier and cheaper to deploy practitioners to a problem and get results.
● Areas of especially exciting application are synthetic biology, economics
(including economic policy), multi-agent modeling and simulation, and
diagnostic medicine

Resources & Reading
● OpenAI’s intro post: https://blog.openai.com/evolution-strategies/
● Great Python implementation: https://github.com/hardmaru/estool
● Google Brain’s variant: https://github.com/brain-research/guided-evolutionary-strategies
● Uber’s variants: https://arxiv.org/abs/1712.06560
● Uber’s implementations: https://github.com/uber-research/deep-neuroevolution
Blog post: https://eng.uber.com/deep-neuroevolution/
● OpenAI’s starter: https://github.com/openai/evolution-strategies-starter
● Natural ES paper: http://www.jmlr.org/papers/volume15/wierstra14a/wierstra14a.pdf

NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo

Similar to NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo (20)

More from Maryam Farooq

More from Maryam Farooq (15)

Recently uploaded

Recently uploaded (20)

NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo