SlideShare a Scribd company logo
1 of 25
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Julien Simon
Principal Technical Evangelist, AI & Machine Learning, AWS
@julsimon
An Introduction
to Reinforcement Learning
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Supervised learning
Run an algorithm on a labelled data set, i.e. a data set containing samples
and answers. Gradually, the model learns how to correctly predict the right
answer. Regression and classification are examples of supervised learning.
Unsupervised learning
Run an algorithm on an unlabelled data set, i.e. a data set containing
samples only. Here, the model progressively learns patterns in data and
organizes samples accordingly. Clustering and topic modeling are examples
of unsupervised learning.
Types of Machine Learning
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Supervised learning
Unsupervised learning
Types of Machine LearningSOPHISTICATIONOFMLMODELS
AMOUNT OF TRAINING DATA REQUIRED
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Types of Machine Learning
AMOUNT OF TRAINING DATA REQUIRED
Supervised learning
Unsupervised learning
SOPHISTICATIONOFMLMODELS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Types of Machine Learning
Reinforcement learning
(RL)
Supervised learning
Unsupervised learning
AMOUNT OF TRAINING DATA REQUIRED
SOPHISTICATIONOFMLMODELS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Remember when you first learned this?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Or this?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
We didn’t have an extensive labelled data
set back then ☺
And yet we learned
How?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Defining Reinforcement Learning
An algorithm (aka an agent) interacts with its
environment.
The agent receives a positive or negative reward
for actions that it takes: rewards are computed by
a user-defined function which outputs a numeric
representation of the actions that should be
incentivized.
By trying to maximize the accumulation of rewards,
the agent learns an optimal strategy (aka policy)
for decision making.
Source: Wikipedia
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Use cases
• Large complex problems
• Uncertain, dynamic environments
• Continuous learning
• Supply chain management
• HVAC systems
• Industrial robotics
• Autonomous vehicles
• Portfolio management
• Oil exploration
• etc.
Caterpillar: 250-ton autonomous mining trucks
https://diginomica.com/2017/04/17/sending-disruption-mines/
https://www.cat.com/en_US/articles/customer-stories/built-for-it/thefutureisnow-driverless.html
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Example: navigating a maze
• Imagine an agent learning to navigate a maze. It can move in certain directions but is
blocked from going through walls.
• The agent discovers its environment (the current maze) one step at at time, receiving a
reward each time: stepping into a dead end is a negative reward, moving one step closer
to the exit is a positive reward.
• After a certain number of steps (or if we found the exit), the current episode ends.
• After a certain number of episodes, the agent uses the action/reward data points to train
a model, in order to make better decisions next time around.
• One critical thing to understand is that the RL model isn’t trained on a predefined set of
labelled mazes (that would be supervised learning).
• This cycle of exploring and training is central to RL: given enough mazes and enough
training time, we would soon enough know how to navigate any maze.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Environment
• The space in which the RL model operates.
• This can be either a real-world environment
or a simulator.
• If you train a physical autonomous vehicle
on a physical road, that would be a real-
world environment.
• If you train a computer program that
models an autonomous vehicle driving on a
road, that would be a simulator… probably a
much safer option!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Exploitation vs Exploration
• Selecting the next action is a balance
between exploitation (‘using what you’ve
learned’) and exploration (‘taking a chance
to learn new things’)
• If you favor exploitation, you may never
reach high-value rewards.
• If you favor exploration, you’ll probably run
into trouble very often!
• Initially, the agent will explore at random for
a fixed number of episodes (aka heatup
phase): this generates data for the first
round of training.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Training a RL model
1. Formulate the problem: goal, environment, state, actions, reward
2. Define the environment: real-world or simulator?
3. Define the presets
4. Write the training code and the value function
5. Train the model
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon SageMaker RL
Reinforcementlearningfor every developer and data scientist
Broad support
for frameworks
Broad support for simulation
environments including
SimuLink and MatLab
K E Y F E AT U R E S
TensorFlow, Apache
MXNet, Intel Coach, and
Ray RL support
2D & 3D physics
environments and
OpenAI Gym support
Supports Amazon Sumerian and
Amazon RoboMaker
Fully
managed
Example notebooks
and tutorials
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
How can we get developers rolling
with reinforcement learning?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Introducing AWS DeepRacer
Fullyautonomous1/18thscaleracecar, drivenbyreinforcementlearning
https://youtu.be/X-6v4RZy-TE
HD video camera
Dual-core Intel
processorFour-wheel drive
Dual power for
compute and drive
Accelerometer
Gyroscope
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS DeepRacer
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS DeepRacer League
Competitive racing league for AWS DeepRacer
Compete virtually onlineTrain models with RL
Race in trials Final at AWS re:Invent
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Getting started
http://aws.amazon.com/free
https://ml.aws
https://aws.amazon.com/sagemaker
https://aws.amazon.com/deepracer/
https://github.com/aws/sagemaker-python-sdk
https://github.com/awslabs/amazon-sagemaker-examples
https://medium.com/@julsimon
https://gitlab.com/juliensimon/dlnotebooks
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Julien Simon
Principal Technical Evangelist, AI & Machine Learning, AWS
@julsimon

More Related Content

What's hot

Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Julien SIMON
 
AWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with Tensorflow
AWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with TensorflowAWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with Tensorflow
AWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with TensorflowJulien SIMON
 
An Introduction to Reinforcement Learning (December 2018)
An Introduction to Reinforcement Learning (December 2018)An Introduction to Reinforcement Learning (December 2018)
An Introduction to Reinforcement Learning (December 2018)Julien SIMON
 
Build Text Analytics Solutions with Amazon Comprehend and Amazon Translate
Build Text Analytics Solutions with Amazon Comprehend and Amazon TranslateBuild Text Analytics Solutions with Amazon Comprehend and Amazon Translate
Build Text Analytics Solutions with Amazon Comprehend and Amazon TranslateAmazon Web Services
 
Preparing Your Team for a Cloud Transformation - AWS Online Tech Talks
Preparing Your Team for a Cloud Transformation - AWS Online Tech TalksPreparing Your Team for a Cloud Transformation - AWS Online Tech Talks
Preparing Your Team for a Cloud Transformation - AWS Online Tech TalksAmazon Web Services
 
Learnings from the Field: Best Practices for Making Money with Alexa Skills (...
Learnings from the Field: Best Practices for Making Money with Alexa Skills (...Learnings from the Field: Best Practices for Making Money with Alexa Skills (...
Learnings from the Field: Best Practices for Making Money with Alexa Skills (...Amazon Web Services
 
[NEW LAUNCH!] Introducing AWS DeepRacer (AIM367) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing AWS DeepRacer  (AIM367) - AWS re:Invent 2018[NEW LAUNCH!] Introducing AWS DeepRacer  (AIM367) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing AWS DeepRacer (AIM367) - AWS re:Invent 2018Amazon Web Services
 
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...Amazon Web Services
 
Automate for Efficiency with Amazon Transcribe & Amazon Translate
Automate for Efficiency with Amazon Transcribe & Amazon TranslateAutomate for Efficiency with Amazon Transcribe & Amazon Translate
Automate for Efficiency with Amazon Transcribe & Amazon TranslateAmazon Web Services
 
Build, train and deploy ML models with SageMaker (October 2019)
Build, train and deploy ML models with SageMaker (October 2019)Build, train and deploy ML models with SageMaker (October 2019)
Build, train and deploy ML models with SageMaker (October 2019)Julien SIMON
 
AWS re:Invent 2018 - ENT321 - SageMaker Workshop
AWS re:Invent 2018 - ENT321 - SageMaker WorkshopAWS re:Invent 2018 - ENT321 - SageMaker Workshop
AWS re:Invent 2018 - ENT321 - SageMaker WorkshopJulien SIMON
 
AWS Initiate - Otimização de Custos com AWS
AWS Initiate - Otimização de Custos com AWSAWS Initiate - Otimização de Custos com AWS
AWS Initiate - Otimização de Custos com AWSAmazon Web Services LATAM
 
Innovate - Building Intelligent Applications (No Machine Learning Experience ...
Innovate - Building Intelligent Applications (No Machine Learning Experience ...Innovate - Building Intelligent Applications (No Machine Learning Experience ...
Innovate - Building Intelligent Applications (No Machine Learning Experience ...Amazon Web Services
 
Scale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersScale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersAmazon Web Services
 
Machine Learning Key Lessons Learned for Developers
Machine Learning Key Lessons Learned for DevelopersMachine Learning Key Lessons Learned for Developers
Machine Learning Key Lessons Learned for DevelopersAmazon Web Services
 
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon Web Services
 
AWS Transformation Day 2018 - Charlotte NC
AWS Transformation Day 2018 - Charlotte NCAWS Transformation Day 2018 - Charlotte NC
AWS Transformation Day 2018 - Charlotte NCAmazon Web Services
 

What's hot (20)

Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
 
AWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with Tensorflow
AWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with TensorflowAWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with Tensorflow
AWS re:Invent 2018 - AIM401-R2 - Deep Learning Applications with Tensorflow
 
An Introduction to Reinforcement Learning (December 2018)
An Introduction to Reinforcement Learning (December 2018)An Introduction to Reinforcement Learning (December 2018)
An Introduction to Reinforcement Learning (December 2018)
 
Build Text Analytics Solutions with Amazon Comprehend and Amazon Translate
Build Text Analytics Solutions with Amazon Comprehend and Amazon TranslateBuild Text Analytics Solutions with Amazon Comprehend and Amazon Translate
Build Text Analytics Solutions with Amazon Comprehend and Amazon Translate
 
Amazon SageMaker
Amazon SageMakerAmazon SageMaker
Amazon SageMaker
 
Preparing Your Team for a Cloud Transformation - AWS Online Tech Talks
Preparing Your Team for a Cloud Transformation - AWS Online Tech TalksPreparing Your Team for a Cloud Transformation - AWS Online Tech Talks
Preparing Your Team for a Cloud Transformation - AWS Online Tech Talks
 
Learnings from the Field: Best Practices for Making Money with Alexa Skills (...
Learnings from the Field: Best Practices for Making Money with Alexa Skills (...Learnings from the Field: Best Practices for Making Money with Alexa Skills (...
Learnings from the Field: Best Practices for Making Money with Alexa Skills (...
 
[NEW LAUNCH!] Introducing AWS DeepRacer (AIM367) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing AWS DeepRacer  (AIM367) - AWS re:Invent 2018[NEW LAUNCH!] Introducing AWS DeepRacer  (AIM367) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing AWS DeepRacer (AIM367) - AWS re:Invent 2018
 
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
 
Automate for Efficiency with Amazon Transcribe & Amazon Translate
Automate for Efficiency with Amazon Transcribe & Amazon TranslateAutomate for Efficiency with Amazon Transcribe & Amazon Translate
Automate for Efficiency with Amazon Transcribe & Amazon Translate
 
Build, train and deploy ML models with SageMaker (October 2019)
Build, train and deploy ML models with SageMaker (October 2019)Build, train and deploy ML models with SageMaker (October 2019)
Build, train and deploy ML models with SageMaker (October 2019)
 
Machine Learning in Practice
Machine Learning in PracticeMachine Learning in Practice
Machine Learning in Practice
 
AWS re:Invent 2018 - ENT321 - SageMaker Workshop
AWS re:Invent 2018 - ENT321 - SageMaker WorkshopAWS re:Invent 2018 - ENT321 - SageMaker Workshop
AWS re:Invent 2018 - ENT321 - SageMaker Workshop
 
AWS Initiate - Otimização de Custos com AWS
AWS Initiate - Otimização de Custos com AWSAWS Initiate - Otimização de Custos com AWS
AWS Initiate - Otimização de Custos com AWS
 
Innovate - Building Intelligent Applications (No Machine Learning Experience ...
Innovate - Building Intelligent Applications (No Machine Learning Experience ...Innovate - Building Intelligent Applications (No Machine Learning Experience ...
Innovate - Building Intelligent Applications (No Machine Learning Experience ...
 
Scale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersScale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for Builders
 
Machine Learning Key Lessons Learned for Developers
Machine Learning Key Lessons Learned for DevelopersMachine Learning Key Lessons Learned for Developers
Machine Learning Key Lessons Learned for Developers
 
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
 
Tendências na Transformação Digital
Tendências na Transformação DigitalTendências na Transformação Digital
Tendências na Transformação Digital
 
AWS Transformation Day 2018 - Charlotte NC
AWS Transformation Day 2018 - Charlotte NCAWS Transformation Day 2018 - Charlotte NC
AWS Transformation Day 2018 - Charlotte NC
 

Similar to An Introduction to Reinforcement Learning with Amazon SageMaker

[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...
[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...
[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...Amazon Web Services
 
Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...
Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...
Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...Amazon Web Services
 
Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!AWS Germany
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon Web Services
 
[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...
[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...
[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...Amazon Web Services
 
The Future of AI on AWS
The Future of AI on AWSThe Future of AI on AWS
The Future of AI on AWSBoaz Ziniman
 
Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018
Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018
Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018Amazon Web Services
 
AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28Amazon Web Services
 
AI Services for Developers - Floor28
AI Services for Developers - Floor28AI Services for Developers - Floor28
AI Services for Developers - Floor28Boaz Ziniman
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksAmazon Web Services
 
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalisere:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon PersonaliseAmazon Web Services
 
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...Amazon Web Services
 
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018Yotam Yarden
 
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Amazon Web Services
 
BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...
BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...
BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...Amazon Web Services
 
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Amazon Web Services
 
A Gentle Intro to Deep Learning
A Gentle Intro to Deep LearningA Gentle Intro to Deep Learning
A Gentle Intro to Deep LearningGabe Hollombe
 
Accelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAccelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAmazon Web Services
 
Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018
Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018
Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018Amazon Web Services
 

Similar to An Introduction to Reinforcement Learning with Amazon SageMaker (20)

[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...
[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...
[NEW LAUNCH!] Introducing Amazon SageMaker RL - Build and Train Reinforcement...
 
Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...
Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...
Let’s Talk about Reinforcement Learning with Amazon SageMaker RL (AIM399) - A...
 
Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!
 
Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)Amazon SageMaker 內建機器學習演算法 (Level 400)
Amazon SageMaker 內建機器學習演算法 (Level 400)
 
[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...
[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...
[NEW LAUNCH!] [REPEAT 1] AWS DeepRacer Workshops –a new, fun way to learn rei...
 
The Future of AI on AWS
The Future of AI on AWSThe Future of AI on AWS
The Future of AI on AWS
 
Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018
Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018
Robocar Rally 2018 (AIM206-R20) - AWS re:Invent 2018
 
AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28
 
AI Services for Developers - Floor28
AI Services for Developers - Floor28AI Services for Developers - Floor28
AI Services for Developers - Floor28
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalisere:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
 
Introduction to Sagemaker
Introduction to SagemakerIntroduction to Sagemaker
Introduction to Sagemaker
 
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...
 
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
 
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
 
BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...
BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...
BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models,...
 
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
 
A Gentle Intro to Deep Learning
A Gentle Intro to Deep LearningA Gentle Intro to Deep Learning
A Gentle Intro to Deep Learning
 
Accelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAccelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMaker
 
Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018
Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018
Predicting the Future with Amazon SageMaker - AWS Summit Sydney 2018
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

An Introduction to Reinforcement Learning with Amazon SageMaker

  • 1. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Julien Simon Principal Technical Evangelist, AI & Machine Learning, AWS @julsimon An Introduction to Reinforcement Learning
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Supervised learning Run an algorithm on a labelled data set, i.e. a data set containing samples and answers. Gradually, the model learns how to correctly predict the right answer. Regression and classification are examples of supervised learning. Unsupervised learning Run an algorithm on an unlabelled data set, i.e. a data set containing samples only. Here, the model progressively learns patterns in data and organizes samples accordingly. Clustering and topic modeling are examples of unsupervised learning. Types of Machine Learning
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Supervised learning Unsupervised learning Types of Machine LearningSOPHISTICATIONOFMLMODELS AMOUNT OF TRAINING DATA REQUIRED
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Types of Machine Learning AMOUNT OF TRAINING DATA REQUIRED Supervised learning Unsupervised learning SOPHISTICATIONOFMLMODELS
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Types of Machine Learning Reinforcement learning (RL) Supervised learning Unsupervised learning AMOUNT OF TRAINING DATA REQUIRED SOPHISTICATIONOFMLMODELS
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Remember when you first learned this?
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Or this?
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. We didn’t have an extensive labelled data set back then ☺ And yet we learned How?
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Defining Reinforcement Learning An algorithm (aka an agent) interacts with its environment. The agent receives a positive or negative reward for actions that it takes: rewards are computed by a user-defined function which outputs a numeric representation of the actions that should be incentivized. By trying to maximize the accumulation of rewards, the agent learns an optimal strategy (aka policy) for decision making. Source: Wikipedia
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Use cases • Large complex problems • Uncertain, dynamic environments • Continuous learning • Supply chain management • HVAC systems • Industrial robotics • Autonomous vehicles • Portfolio management • Oil exploration • etc. Caterpillar: 250-ton autonomous mining trucks https://diginomica.com/2017/04/17/sending-disruption-mines/ https://www.cat.com/en_US/articles/customer-stories/built-for-it/thefutureisnow-driverless.html
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Example: navigating a maze • Imagine an agent learning to navigate a maze. It can move in certain directions but is blocked from going through walls. • The agent discovers its environment (the current maze) one step at at time, receiving a reward each time: stepping into a dead end is a negative reward, moving one step closer to the exit is a positive reward. • After a certain number of steps (or if we found the exit), the current episode ends. • After a certain number of episodes, the agent uses the action/reward data points to train a model, in order to make better decisions next time around. • One critical thing to understand is that the RL model isn’t trained on a predefined set of labelled mazes (that would be supervised learning). • This cycle of exploring and training is central to RL: given enough mazes and enough training time, we would soon enough know how to navigate any maze.
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Environment • The space in which the RL model operates. • This can be either a real-world environment or a simulator. • If you train a physical autonomous vehicle on a physical road, that would be a real- world environment. • If you train a computer program that models an autonomous vehicle driving on a road, that would be a simulator… probably a much safer option!
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Exploitation vs Exploration • Selecting the next action is a balance between exploitation (‘using what you’ve learned’) and exploration (‘taking a chance to learn new things’) • If you favor exploitation, you may never reach high-value rewards. • If you favor exploration, you’ll probably run into trouble very often! • Initially, the agent will explore at random for a fixed number of episodes (aka heatup phase): this generates data for the first round of training.
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Training a RL model 1. Formulate the problem: goal, environment, state, actions, reward 2. Define the environment: real-world or simulator? 3. Define the presets 4. Write the training code and the value function 5. Train the model
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon SageMaker RL Reinforcementlearningfor every developer and data scientist Broad support for frameworks Broad support for simulation environments including SimuLink and MatLab K E Y F E AT U R E S TensorFlow, Apache MXNet, Intel Coach, and Ray RL support 2D & 3D physics environments and OpenAI Gym support Supports Amazon Sumerian and Amazon RoboMaker Fully managed Example notebooks and tutorials
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. How can we get developers rolling with reinforcement learning?
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Introducing AWS DeepRacer Fullyautonomous1/18thscaleracecar, drivenbyreinforcementlearning https://youtu.be/X-6v4RZy-TE HD video camera Dual-core Intel processorFour-wheel drive Dual power for compute and drive Accelerometer Gyroscope © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS DeepRacer
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS DeepRacer League Competitive racing league for AWS DeepRacer Compete virtually onlineTrain models with RL Race in trials Final at AWS re:Invent
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Getting started http://aws.amazon.com/free https://ml.aws https://aws.amazon.com/sagemaker https://aws.amazon.com/deepracer/ https://github.com/aws/sagemaker-python-sdk https://github.com/awslabs/amazon-sagemaker-examples https://medium.com/@julsimon https://gitlab.com/juliensimon/dlnotebooks
  • 25. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Julien Simon Principal Technical Evangelist, AI & Machine Learning, AWS @julsimon