SlideShare a Scribd company logo
Evolution Strategies
An Alternative Approach to AI
What is ES (and who uses it)?
● Class of heuristic search algorithms for black-box optimization
● Closely related to genetic algorithms (GA)
● Several groups have made serious commitments to developing additional
variants:
Why should I care?
Organizations have yet to adopt reinforcement learning, and as such are leaving
billions of dollars of value on the table
ES presents a perfect excuse to tackle serious problems with RL
It’s no panacea, but hard optimization problems become much easier
DRL is heavy and esoteric; ES is lightweight and intuitive
Once ES becomes explicable, then it’s game over
What do I use ES for?
● Algorithmic trading (c’mon, you knew that was going to come up)
● Token incentives in blockchain applications
● Opportunity cost problems in investment and staking
● Training one-shot computer vision models
● Modeling causal trends in property markets (2016-2017)
Pros
● Gradient-free
● Can run on arbitrary numbers
of CPUs, no GPUs needed
● Can optimize mixed-type
models
● Can handle non-convex
problems
● Conveniently sidesteps
exploding gradients
● Easy to extend/modify
Cons
● Less data-efficient
● No guarantee of convergence
● Still a black box*
● Without modifications, does
poorly with hard constraints as
reward cutoffs
● Can break down with highly
complex/multimodal solutions
● Can be much slower in typical
supervised learning settings
Core Procedure
Credit: OpenAI, https://arxiv.org/abs/1703.03864
2D visualization
Image credit: OpenAI
DRL ES
● Optimizing arbitrary parameters
for a given task
● Largely agnostic to underlying
model structure
● Can implement virtual batch
schemes, but not required for
good performance
● Value function is implicit
● Parameters are smooshed into
1D array
● Optimizing network(s) for a
given task, typically at
episode-level
● Highly sensitive to batch size
and training scheme
● Requires estimation of value
function
● Parameters are grouped by
component/layer
How to gray the black box?
● Feature importance / parameter saliency still a challenge
● Tried Bayesian attribution: too sparse, and still get artifacts from discretization
● One way that works:
○ Calculate MI(x_i’, y’)
○ x_i’ is Δε
○ y’ is change in reward for candidate solution
● For pairwise and partition-wise attribution, do partial information
decomposition in hierarchical fashion
● No free lunch: Attribution computation is expensive
PRESS
● PRincipled Evolution StrategieS (in keeping with weird acronym structures)
● Saliency attribution at the optimizer level
● Short version: find which parameters are important and which ones aren’t
● Long version: Use information-theoretic measures to attribute reward
saliency to parameter perturbations
● Step 1) estimate parameter saliency after N iterations
Step 2) apply saliency estimates as normalized prior in perturbation-sampling
step
● Bears similarity to integrated gradients, guided ES
● Essentially an approach to estimating the partial derivative ∂y/∂x_i
○ Where y is reward, x_i is parameter of a model such as a weight in a neural net, mean of a
probability distribution, etc.
Summary
● For RL problems, using ES solvers confers a number of benefits
● Where problems are mixed-mode and/or gradients are difficult to to evaluate,
ES shines
● Getting away from GPU dependency is especially nice - less $$, for one
● Impact on industry: Compared to other RL methods, ES is easier to apply,
faster, and often yields comparable or even better results. This makes it
easier and cheaper to deploy practitioners to a problem and get results.
● Areas of especially exciting application are synthetic biology, economics
(including economic policy), multi-agent modeling and simulation, and
diagnostic medicine
Resources & Reading
● OpenAI’s intro post: https://blog.openai.com/evolution-strategies/
● Great Python implementation: https://github.com/hardmaru/estool
● Google Brain’s variant: https://github.com/brain-research/guided-evolutionary-strategies
● Uber’s variants: https://arxiv.org/abs/1712.06560
● Uber’s implementations: https://github.com/uber-research/deep-neuroevolution
Blog post: https://eng.uber.com/deep-neuroevolution/
● OpenAI’s starter: https://github.com/openai/evolution-strategies-starter
● Natural ES paper: http://www.jmlr.org/papers/volume15/wierstra14a/wierstra14a.pdf

More Related Content

What's hot

Numerical methods and its applications
Numerical methods and its applicationsNumerical methods and its applications
Numerical methods and its applications
HaiderParekh1
 
supervised learning
supervised learningsupervised learning
supervised learning
Amar Tripathi
 
Ml introduction
Ml introductionMl introduction
Ml introduction
RanjithaM32
 
Machine Learning Using Python
Machine Learning Using PythonMachine Learning Using Python
Machine Learning Using Python
SavitaHanchinal
 
Object-Oriented Design Fundamentals.pptx
Object-Oriented Design Fundamentals.pptxObject-Oriented Design Fundamentals.pptx
Object-Oriented Design Fundamentals.pptx
RaflyRizky2
 
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
Eirini Ntoutsi
 
Bt0081 software engineering
Bt0081   software engineeringBt0081   software engineering
Bt0081 software engineering
smumbahelp
 
20211229120253D6323_PERT 06_ Ensemble Learning.pptx
20211229120253D6323_PERT 06_ Ensemble Learning.pptx20211229120253D6323_PERT 06_ Ensemble Learning.pptx
20211229120253D6323_PERT 06_ Ensemble Learning.pptx
RaflyRizky2
 
Problem Formulation in Artificial Inteligence Projects
Problem Formulation in Artificial Inteligence ProjectsProblem Formulation in Artificial Inteligence Projects
Problem Formulation in Artificial Inteligence Projects
Dr. C.V. Suresh Babu
 
Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science
Frank Kienle
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
Usama Fayyaz
 
NUMERICAL METHOD AND ITS APPLICATION
NUMERICAL METHOD AND ITS APPLICATIONNUMERICAL METHOD AND ITS APPLICATION
NUMERICAL METHOD AND ITS APPLICATION
REZAUL KARIM REFATH
 
Types of Mathematical Model.
Types of Mathematical Model.Types of Mathematical Model.
Types of Mathematical Model.
Megha Sharma
 
Supervised Machine Learning
Supervised Machine LearningSupervised Machine Learning
Supervised Machine Learning
Ankit Rai
 
10 Slide Intro to ML
10 Slide Intro to ML10 Slide Intro to ML
10 Slide Intro to ML
Nitesh Surana
 
Internship project report,Predictive Modelling
Internship project report,Predictive ModellingInternship project report,Predictive Modelling
Internship project report,Predictive ModellingAmit Kumar
 
WorkKeys Individual Score Report 11-20-2015
WorkKeys Individual Score Report 11-20-2015WorkKeys Individual Score Report 11-20-2015
WorkKeys Individual Score Report 11-20-2015jamie Justice
 
Bt8901 object oriented systems-de (1)
Bt8901 object oriented systems-de (1)Bt8901 object oriented systems-de (1)
Bt8901 object oriented systems-de (1)
smumbahelp
 
Introduction to ml
Introduction to mlIntroduction to ml
Introduction to ml
SuyashSingh70
 
MachineLlearning introduction
MachineLlearning introductionMachineLlearning introduction
MachineLlearning introduction
The IOT Academy
 

What's hot (20)

Numerical methods and its applications
Numerical methods and its applicationsNumerical methods and its applications
Numerical methods and its applications
 
supervised learning
supervised learningsupervised learning
supervised learning
 
Ml introduction
Ml introductionMl introduction
Ml introduction
 
Machine Learning Using Python
Machine Learning Using PythonMachine Learning Using Python
Machine Learning Using Python
 
Object-Oriented Design Fundamentals.pptx
Object-Oriented Design Fundamentals.pptxObject-Oriented Design Fundamentals.pptx
Object-Oriented Design Fundamentals.pptx
 
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
 
Bt0081 software engineering
Bt0081   software engineeringBt0081   software engineering
Bt0081 software engineering
 
20211229120253D6323_PERT 06_ Ensemble Learning.pptx
20211229120253D6323_PERT 06_ Ensemble Learning.pptx20211229120253D6323_PERT 06_ Ensemble Learning.pptx
20211229120253D6323_PERT 06_ Ensemble Learning.pptx
 
Problem Formulation in Artificial Inteligence Projects
Problem Formulation in Artificial Inteligence ProjectsProblem Formulation in Artificial Inteligence Projects
Problem Formulation in Artificial Inteligence Projects
 
Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
 
NUMERICAL METHOD AND ITS APPLICATION
NUMERICAL METHOD AND ITS APPLICATIONNUMERICAL METHOD AND ITS APPLICATION
NUMERICAL METHOD AND ITS APPLICATION
 
Types of Mathematical Model.
Types of Mathematical Model.Types of Mathematical Model.
Types of Mathematical Model.
 
Supervised Machine Learning
Supervised Machine LearningSupervised Machine Learning
Supervised Machine Learning
 
10 Slide Intro to ML
10 Slide Intro to ML10 Slide Intro to ML
10 Slide Intro to ML
 
Internship project report,Predictive Modelling
Internship project report,Predictive ModellingInternship project report,Predictive Modelling
Internship project report,Predictive Modelling
 
WorkKeys Individual Score Report 11-20-2015
WorkKeys Individual Score Report 11-20-2015WorkKeys Individual Score Report 11-20-2015
WorkKeys Individual Score Report 11-20-2015
 
Bt8901 object oriented systems-de (1)
Bt8901 object oriented systems-de (1)Bt8901 object oriented systems-de (1)
Bt8901 object oriented systems-de (1)
 
Introduction to ml
Introduction to mlIntroduction to ml
Introduction to ml
 
MachineLlearning introduction
MachineLlearning introductionMachineLlearning introduction
MachineLlearning introduction
 

Similar to NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo

Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial Industry
Subrat Panda, PhD
 
Decision Tree in Machine Learning
Decision Tree in Machine LearningDecision Tree in Machine Learning
Decision Tree in Machine Learning
Tutort Academy
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or reality
Awantik Das
 
C3 w5
C3 w5C3 w5
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Xavier Amatriain
 
Ad Click Prediction - Paper review
Ad Click Prediction - Paper reviewAd Click Prediction - Paper review
Ad Click Prediction - Paper review
Mazen Aly
 
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsBIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
Xavier Amatriain
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Xavier Amatriain
 
Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...
Egor Kraev
 
BSSML16 L5. Summary Day 1 Sessions
BSSML16 L5. Summary Day 1 SessionsBSSML16 L5. Summary Day 1 Sessions
BSSML16 L5. Summary Day 1 Sessions
BigML, Inc
 
Kaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML InterpretabilityKaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML Interpretability
Alberto Danese
 
VSSML17 Review. Summary Day 1 Sessions
VSSML17 Review. Summary Day 1 SessionsVSSML17 Review. Summary Day 1 Sessions
VSSML17 Review. Summary Day 1 Sessions
BigML, Inc
 
Decision Tree Machine Learning Detailed Explanation.
Decision Tree Machine Learning Detailed Explanation.Decision Tree Machine Learning Detailed Explanation.
Decision Tree Machine Learning Detailed Explanation.
DrezzingGaming
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
Knoldus Inc.
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In Industry
Xavier Amatriain
 
Software Design Principles and Best Practices - Satyajit Dey
Software Design Principles and Best Practices - Satyajit DeySoftware Design Principles and Best Practices - Satyajit Dey
Software Design Principles and Best Practices - Satyajit Dey
Cefalo
 
Moving from BI to AI : For decision makers
Moving from BI to AI : For decision makersMoving from BI to AI : For decision makers
Moving from BI to AI : For decision makers
zekeLabs Technologies
 
Machine learning - session 3
Machine learning - session 3Machine learning - session 3
Machine learning - session 3
Luis Borbon
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AI
Pramit Choudhary
 
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual RepresentationsPR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
Jinwon Lee
 

Similar to NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo (20)

Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial Industry
 
Decision Tree in Machine Learning
Decision Tree in Machine LearningDecision Tree in Machine Learning
Decision Tree in Machine Learning
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or reality
 
C3 w5
C3 w5C3 w5
C3 w5
 
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
 
Ad Click Prediction - Paper review
Ad Click Prediction - Paper reviewAd Click Prediction - Paper review
Ad Click Prediction - Paper review
 
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsBIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
 
Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...
 
BSSML16 L5. Summary Day 1 Sessions
BSSML16 L5. Summary Day 1 SessionsBSSML16 L5. Summary Day 1 Sessions
BSSML16 L5. Summary Day 1 Sessions
 
Kaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML InterpretabilityKaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML Interpretability
 
VSSML17 Review. Summary Day 1 Sessions
VSSML17 Review. Summary Day 1 SessionsVSSML17 Review. Summary Day 1 Sessions
VSSML17 Review. Summary Day 1 Sessions
 
Decision Tree Machine Learning Detailed Explanation.
Decision Tree Machine Learning Detailed Explanation.Decision Tree Machine Learning Detailed Explanation.
Decision Tree Machine Learning Detailed Explanation.
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In Industry
 
Software Design Principles and Best Practices - Satyajit Dey
Software Design Principles and Best Practices - Satyajit DeySoftware Design Principles and Best Practices - Satyajit Dey
Software Design Principles and Best Practices - Satyajit Dey
 
Moving from BI to AI : For decision makers
Moving from BI to AI : For decision makersMoving from BI to AI : For decision makers
Moving from BI to AI : For decision makers
 
Machine learning - session 3
Machine learning - session 3Machine learning - session 3
Machine learning - session 3
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AI
 
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual RepresentationsPR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
 

More from Maryam Farooq

Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)
Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)
Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)
Maryam Farooq
 
AI & COVID19: Ethics & Data Rights (NYAI x AISC)
AI & COVID19: Ethics & Data Rights (NYAI x AISC)AI & COVID19: Ethics & Data Rights (NYAI x AISC)
AI & COVID19: Ethics & Data Rights (NYAI x AISC)
Maryam Farooq
 
NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...
NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...
NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...
Maryam Farooq
 
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
Maryam Farooq
 
NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...
NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...
NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...
Maryam Farooq
 
NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)
NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)
NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)
Maryam Farooq
 
NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...
NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...
NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...
Maryam Farooq
 
NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...
NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...
NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...
Maryam Farooq
 
NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...
NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...
NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...
Maryam Farooq
 
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
Maryam Farooq
 
NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)
NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)
NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)
Maryam Farooq
 
"Understanding Humans with Machines" (Arthur Tisi)
"Understanding Humans with Machines" (Arthur Tisi)"Understanding Humans with Machines" (Arthur Tisi)
"Understanding Humans with Machines" (Arthur Tisi)
Maryam Farooq
 
NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...
NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...
NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...
Maryam Farooq
 
NYAI #13: "AI and Business Transformation" - Josh Sutton
NYAI #13: "AI and Business Transformation" - Josh SuttonNYAI #13: "AI and Business Transformation" - Josh Sutton
NYAI #13: "AI and Business Transformation" - Josh Sutton
Maryam Farooq
 
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
Maryam Farooq
 

More from Maryam Farooq (15)

Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)
Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)
Women in AI Social: Fall Edition (NYAI x Aggregate Intellect x AI Geeks)
 
AI & COVID19: Ethics & Data Rights (NYAI x AISC)
AI & COVID19: Ethics & Data Rights (NYAI x AISC)AI & COVID19: Ethics & Data Rights (NYAI x AISC)
AI & COVID19: Ethics & Data Rights (NYAI x AISC)
 
NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...
NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...
NLP Community Conference - Dr. Catherine Havasi (ConceptNet/MIT Media Lab/Lum...
 
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
 
NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...
NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...
NYAI #26: Federated Learning: Machine Learning on Edge Devices w/ Alice Albre...
 
NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)
NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)
NYAI #23: Using Cognitive Neuroscience to Create AI (w/ Dr. Peter Olausson)
 
NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...
NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...
NYAI #24: Developing Trust in Artificial Intelligence and Machine Learning fo...
 
NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...
NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...
NYAI #19: AI & UI - "AI + Emotion: It's all about Trust" by Steph Hay (VP Des...
 
NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...
NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...
NYAI #19: AI & UI - "Designing Intelligent Agents & a New Class of 'Perceived...
 
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
NYAI #18: Team Alignment for Human-Centered AI (Chris Butler - Director of AI...
 
NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)
NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)
NYAI #18: Designing for AI (Rob Strati & Jesse Schifano of ECHO)
 
"Understanding Humans with Machines" (Arthur Tisi)
"Understanding Humans with Machines" (Arthur Tisi)"Understanding Humans with Machines" (Arthur Tisi)
"Understanding Humans with Machines" (Arthur Tisi)
 
NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...
NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...
NYAI #13: "Designing AI by Learning from Enterprise" - Nicholas Borge (Impart...
 
NYAI #13: "AI and Business Transformation" - Josh Sutton
NYAI #13: "AI and Business Transformation" - Josh SuttonNYAI #13: "AI and Business Transformation" - Josh Sutton
NYAI #13: "AI and Business Transformation" - Josh Sutton
 
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
 

Recently uploaded

一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 

Recently uploaded (20)

一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 

NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo

  • 2. What is ES (and who uses it)? ● Class of heuristic search algorithms for black-box optimization ● Closely related to genetic algorithms (GA) ● Several groups have made serious commitments to developing additional variants:
  • 3. Why should I care? Organizations have yet to adopt reinforcement learning, and as such are leaving billions of dollars of value on the table ES presents a perfect excuse to tackle serious problems with RL It’s no panacea, but hard optimization problems become much easier DRL is heavy and esoteric; ES is lightweight and intuitive Once ES becomes explicable, then it’s game over
  • 4. What do I use ES for? ● Algorithmic trading (c’mon, you knew that was going to come up) ● Token incentives in blockchain applications ● Opportunity cost problems in investment and staking ● Training one-shot computer vision models ● Modeling causal trends in property markets (2016-2017)
  • 5. Pros ● Gradient-free ● Can run on arbitrary numbers of CPUs, no GPUs needed ● Can optimize mixed-type models ● Can handle non-convex problems ● Conveniently sidesteps exploding gradients ● Easy to extend/modify Cons ● Less data-efficient ● No guarantee of convergence ● Still a black box* ● Without modifications, does poorly with hard constraints as reward cutoffs ● Can break down with highly complex/multimodal solutions ● Can be much slower in typical supervised learning settings
  • 6. Core Procedure Credit: OpenAI, https://arxiv.org/abs/1703.03864
  • 8. DRL ES ● Optimizing arbitrary parameters for a given task ● Largely agnostic to underlying model structure ● Can implement virtual batch schemes, but not required for good performance ● Value function is implicit ● Parameters are smooshed into 1D array ● Optimizing network(s) for a given task, typically at episode-level ● Highly sensitive to batch size and training scheme ● Requires estimation of value function ● Parameters are grouped by component/layer
  • 9. How to gray the black box? ● Feature importance / parameter saliency still a challenge ● Tried Bayesian attribution: too sparse, and still get artifacts from discretization ● One way that works: ○ Calculate MI(x_i’, y’) ○ x_i’ is Δε ○ y’ is change in reward for candidate solution ● For pairwise and partition-wise attribution, do partial information decomposition in hierarchical fashion ● No free lunch: Attribution computation is expensive
  • 10. PRESS ● PRincipled Evolution StrategieS (in keeping with weird acronym structures) ● Saliency attribution at the optimizer level ● Short version: find which parameters are important and which ones aren’t ● Long version: Use information-theoretic measures to attribute reward saliency to parameter perturbations ● Step 1) estimate parameter saliency after N iterations Step 2) apply saliency estimates as normalized prior in perturbation-sampling step ● Bears similarity to integrated gradients, guided ES ● Essentially an approach to estimating the partial derivative ∂y/∂x_i ○ Where y is reward, x_i is parameter of a model such as a weight in a neural net, mean of a probability distribution, etc.
  • 11. Summary ● For RL problems, using ES solvers confers a number of benefits ● Where problems are mixed-mode and/or gradients are difficult to to evaluate, ES shines ● Getting away from GPU dependency is especially nice - less $$, for one ● Impact on industry: Compared to other RL methods, ES is easier to apply, faster, and often yields comparable or even better results. This makes it easier and cheaper to deploy practitioners to a problem and get results. ● Areas of especially exciting application are synthetic biology, economics (including economic policy), multi-agent modeling and simulation, and diagnostic medicine
  • 12. Resources & Reading ● OpenAI’s intro post: https://blog.openai.com/evolution-strategies/ ● Great Python implementation: https://github.com/hardmaru/estool ● Google Brain’s variant: https://github.com/brain-research/guided-evolutionary-strategies ● Uber’s variants: https://arxiv.org/abs/1712.06560 ● Uber’s implementations: https://github.com/uber-research/deep-neuroevolution Blog post: https://eng.uber.com/deep-neuroevolution/ ● OpenAI’s starter: https://github.com/openai/evolution-strategies-starter ● Natural ES paper: http://www.jmlr.org/papers/volume15/wierstra14a/wierstra14a.pdf