SlideShare a Scribd company logo
1 of 18
Download to read offline
Machine Learning & Robotics
DeepReinforcementLearning+ROS
Tommy Wu,
WCB IoT Solution Architect,
Microsoft
Today’s Machine Learning Categories
How to Play Games with Deep RL
An artificial agent for general Atari
game playing
• Learn to master 49 different
Atari games directly from
game screens
• Beat the best performing
learner from the same
domain in 43 games
• Excel human expert in 29
games
Let’s Play the Game with Reinforcement Learning
Single agent that plays both Space Invaders and Breakout on an ‘above human’ level
D:a9reinforcementLearningatari>python atari.py -g Breakout -r 'True'
What is the Reinforcement Learning
Create an agent that is capable of learning through trial
and error and ultimately solving the problem.
cart-pole system
Cartpole - The goal is to keep the cartpole
balanced by applying appropriate forces to
a pivot point.
Reinforcement Learning Key Factors
• RL is a general concept that can be simply described with
an agent that takes actions in an environment in order to
maximize its cumulative reward.
• Agents in RL algorithms are incentivized with punishments
for bad actions and rewards for good ones.
Deep Q-Learning (DQN)
• DQN is a RL technique that is aimed at choosing the best action
for given circumstances (observation). Each possible action for
each possible observation has its Q value, where ‘Q’ stands for
a quality of a given move.
• SARS (state, action, reward, state_next, terminal) and perform
Experience Replay.
• dqn_solver.remember(state, action, reward, state_next, terminal)
• dqn_solver.experience_replay()
OpenAI Gym toolkit for Reinforcement Leanring
• OpenAI Gym is a toolkit for developing and comparing reinforcement
learning algorithms.
• Gym open-source library - https://github.com/openai/gym
• A collection of environments to develop and test RL algorithms. It’s built on
a Markov chain model that is illustrated below.
Markov Chain
Simple Sample for DQN – Path On Right.
o-----T
Robotics ARM manipulation
Move a box by pushing it until it reaches the
desired goal in Simulation.
My experiment to
pickup the medicine
box
Robotics with Reinforcement Learning
The Robot Operating System (ROS) is a set of software libraries
and tools that help you build robot applications. From drivers to
state-of-the-art algorithms, and with powerful developer tools,
ROS has what you need for your next robotics project. And it's
all open source.
talker.py listener.cpp
ROS Node
ROS Core
/chatter
snoop.cpp
subscribe subscribe
publish
ROS Node ROS Node
Topic
Workspace Structure -
ROS packages for Autonomous System
• Gazebo - They provide the necessary interfaces
to simulate a robot in Gazebo using ROS
messages, services and dynamic reconfigure
• SLAM - The gmapping package provides laser-based
SLAM (Simultaneous Localization and Mapping), as a
ROS node. To create a 2-D occupancy grid map
• RVIZ - Rviz is a 3D visualizer for the Robot Operating
System (ROS) framework
Motion Detector with Camera in ROS
roslaunch motion_detector motion_detection.launch
rosrun rviz rviz
SLAM Navigation with Image Sensing (2D/3D)
Robotics Autonomous System Architecture on ROS – AI Care II
listen
robotcv
ROSCore
publish
/inference_objects
publish
/STT
Iot-central-app
listen
azure_cs_luis
(Speech)
/cmd_vel
Azure Custom
Vision
Action
Sets
Win10 ROS Melodic
Deep reinforcement learning&Robotics

More Related Content

Similar to Deep reinforcement learning&Robotics

Python_for_Visual_Effects_and_Animation_Pipelines
Python_for_Visual_Effects_and_Animation_PipelinesPython_for_Visual_Effects_and_Animation_Pipelines
Python_for_Visual_Effects_and_Animation_PipelinesRussell Darling
 
Utility Domestic Robot Presentation
Utility Domestic Robot PresentationUtility Domestic Robot Presentation
Utility Domestic Robot PresentationNouman Tahir
 
SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...
 SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros... SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...
SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...South Tyrol Free Software Conference
 
License Plate Recognition System using Python and OpenCV
License Plate Recognition System using Python and OpenCVLicense Plate Recognition System using Python and OpenCV
License Plate Recognition System using Python and OpenCVVishal Polley
 
Artificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
Artificial Intelligence and Deep Learning in Azure, CNTK and TensorflowArtificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
Artificial Intelligence and Deep Learning in Azure, CNTK and TensorflowJen Stirrup
 
alphablues - ML applied to text and image in chat bots
alphablues - ML applied to text and image in chat botsalphablues - ML applied to text and image in chat bots
alphablues - ML applied to text and image in chat botsAndré Karpištšenko
 
Open LLMs: Viable for Production or Low-Quality Toy?
Open LLMs: Viable for Production or Low-Quality Toy?Open LLMs: Viable for Production or Low-Quality Toy?
Open LLMs: Viable for Production or Low-Quality Toy?M Waleed Kadous
 
Remoticon - TinyML Workshop.pptx
Remoticon - TinyML Workshop.pptxRemoticon - TinyML Workshop.pptx
Remoticon - TinyML Workshop.pptxnaeembisma
 
ARCS Presentation 2008
ARCS Presentation 2008ARCS Presentation 2008
ARCS Presentation 2008Spondon Saha
 
Software Architectures, Week 2 - Decomposition techniques
Software Architectures, Week 2 - Decomposition techniquesSoftware Architectures, Week 2 - Decomposition techniques
Software Architectures, Week 2 - Decomposition techniquesAngelos Kapsimanis
 
Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?GetInData
 
Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)
Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)
Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)OpenBlend society
 
Develop Community-based Android Distribution and Upstreaming Experience
Develop Community-based Android Distribution and Upstreaming Experience Develop Community-based Android Distribution and Upstreaming Experience
Develop Community-based Android Distribution and Upstreaming Experience National Cheng Kung University
 
JRuby in Java Projects
JRuby in Java ProjectsJRuby in Java Projects
JRuby in Java Projectsjazzman1980
 
LJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdfLJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdfEmilyJiang23
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTKAshish Jaiman
 
Teaching an Actor-Critic Agent Through Optimal Scripted Agent Trajectories
Teaching an Actor-Critic Agent Through Optimal Scripted Agent TrajectoriesTeaching an Actor-Critic Agent Through Optimal Scripted Agent Trajectories
Teaching an Actor-Critic Agent Through Optimal Scripted Agent TrajectoriesChris Hoyean Song
 

Similar to Deep reinforcement learning&Robotics (20)

Python_for_Visual_Effects_and_Animation_Pipelines
Python_for_Visual_Effects_and_Animation_PipelinesPython_for_Visual_Effects_and_Animation_Pipelines
Python_for_Visual_Effects_and_Animation_Pipelines
 
Utility Domestic Robot Presentation
Utility Domestic Robot PresentationUtility Domestic Robot Presentation
Utility Domestic Robot Presentation
 
OpenAI Gym & Universe
OpenAI Gym & UniverseOpenAI Gym & Universe
OpenAI Gym & Universe
 
SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...
 SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros... SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...
SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...
 
RobotStudiopp.ppt
RobotStudiopp.pptRobotStudiopp.ppt
RobotStudiopp.ppt
 
ie450RobotStudio.ppt
ie450RobotStudio.pptie450RobotStudio.ppt
ie450RobotStudio.ppt
 
License Plate Recognition System using Python and OpenCV
License Plate Recognition System using Python and OpenCVLicense Plate Recognition System using Python and OpenCV
License Plate Recognition System using Python and OpenCV
 
Artificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
Artificial Intelligence and Deep Learning in Azure, CNTK and TensorflowArtificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
Artificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
 
alphablues - ML applied to text and image in chat bots
alphablues - ML applied to text and image in chat botsalphablues - ML applied to text and image in chat bots
alphablues - ML applied to text and image in chat bots
 
Open LLMs: Viable for Production or Low-Quality Toy?
Open LLMs: Viable for Production or Low-Quality Toy?Open LLMs: Viable for Production or Low-Quality Toy?
Open LLMs: Viable for Production or Low-Quality Toy?
 
Remoticon - TinyML Workshop.pptx
Remoticon - TinyML Workshop.pptxRemoticon - TinyML Workshop.pptx
Remoticon - TinyML Workshop.pptx
 
ARCS Presentation 2008
ARCS Presentation 2008ARCS Presentation 2008
ARCS Presentation 2008
 
Software Architectures, Week 2 - Decomposition techniques
Software Architectures, Week 2 - Decomposition techniquesSoftware Architectures, Week 2 - Decomposition techniques
Software Architectures, Week 2 - Decomposition techniques
 
Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?Hot to build continuously processing for 24/7 real-time data streaming platform?
Hot to build continuously processing for 24/7 real-time data streaming platform?
 
Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)
Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)
Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)
 
Develop Community-based Android Distribution and Upstreaming Experience
Develop Community-based Android Distribution and Upstreaming Experience Develop Community-based Android Distribution and Upstreaming Experience
Develop Community-based Android Distribution and Upstreaming Experience
 
JRuby in Java Projects
JRuby in Java ProjectsJRuby in Java Projects
JRuby in Java Projects
 
LJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdfLJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdf
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTK
 
Teaching an Actor-Critic Agent Through Optimal Scripted Agent Trajectories
Teaching an Actor-Critic Agent Through Optimal Scripted Agent TrajectoriesTeaching an Actor-Critic Agent Through Optimal Scripted Agent Trajectories
Teaching an Actor-Critic Agent Through Optimal Scripted Agent Trajectories
 

More from 湯米吳 Tommy Wu

Metaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdfMetaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdf湯米吳 Tommy Wu
 
Project AI-Care for COVID-19 prevention
Project AI-Care for COVID-19 preventionProject AI-Care for COVID-19 prevention
Project AI-Care for COVID-19 prevention湯米吳 Tommy Wu
 
AIoT and edge computing solutions
AIoT and edge computing solutionsAIoT and edge computing solutions
AIoT and edge computing solutions湯米吳 Tommy Wu
 
Tj bot 0317實作坊 組裝篇
Tj bot 0317實作坊 組裝篇Tj bot 0317實作坊 組裝篇
Tj bot 0317實作坊 組裝篇湯米吳 Tommy Wu
 
聊天機器人實作展示與解密 Tommy wut克邦
聊天機器人實作展示與解密 Tommy wut克邦聊天機器人實作展示與解密 Tommy wut克邦
聊天機器人實作展示與解密 Tommy wut克邦湯米吳 Tommy Wu
 
How to create your Smart Toy with bluemix & 7688 Duo board
How to create your Smart Toy with bluemix & 7688 Duo boardHow to create your Smart Toy with bluemix & 7688 Duo board
How to create your Smart Toy with bluemix & 7688 Duo board湯米吳 Tommy Wu
 
運用 Ibm watson技術快速打造智慧應用系統
運用 Ibm watson技術快速打造智慧應用系統運用 Ibm watson技術快速打造智慧應用系統
運用 Ibm watson技術快速打造智慧應用系統湯米吳 Tommy Wu
 
How to build your own robot with ibm bluemix&watson
How to build your own robot with ibm bluemix&watsonHow to build your own robot with ibm bluemix&watson
How to build your own robot with ibm bluemix&watson湯米吳 Tommy Wu
 
Bluemix 智能機器人情境展示
Bluemix 智能機器人情境展示Bluemix 智能機器人情境展示
Bluemix 智能機器人情境展示湯米吳 Tommy Wu
 
Cloud development technology sharing (BlueMix premier)
Cloud development technology sharing (BlueMix premier)Cloud development technology sharing (BlueMix premier)
Cloud development technology sharing (BlueMix premier)湯米吳 Tommy Wu
 
企業應用行動化開發架構
企業應用行動化開發架構企業應用行動化開發架構
企業應用行動化開發架構湯米吳 Tommy Wu
 

More from 湯米吳 Tommy Wu (11)

Metaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdfMetaverse and Digital Twins on Enterprise-Public.pdf
Metaverse and Digital Twins on Enterprise-Public.pdf
 
Project AI-Care for COVID-19 prevention
Project AI-Care for COVID-19 preventionProject AI-Care for COVID-19 prevention
Project AI-Care for COVID-19 prevention
 
AIoT and edge computing solutions
AIoT and edge computing solutionsAIoT and edge computing solutions
AIoT and edge computing solutions
 
Tj bot 0317實作坊 組裝篇
Tj bot 0317實作坊 組裝篇Tj bot 0317實作坊 組裝篇
Tj bot 0317實作坊 組裝篇
 
聊天機器人實作展示與解密 Tommy wut克邦
聊天機器人實作展示與解密 Tommy wut克邦聊天機器人實作展示與解密 Tommy wut克邦
聊天機器人實作展示與解密 Tommy wut克邦
 
How to create your Smart Toy with bluemix & 7688 Duo board
How to create your Smart Toy with bluemix & 7688 Duo boardHow to create your Smart Toy with bluemix & 7688 Duo board
How to create your Smart Toy with bluemix & 7688 Duo board
 
運用 Ibm watson技術快速打造智慧應用系統
運用 Ibm watson技術快速打造智慧應用系統運用 Ibm watson技術快速打造智慧應用系統
運用 Ibm watson技術快速打造智慧應用系統
 
How to build your own robot with ibm bluemix&watson
How to build your own robot with ibm bluemix&watsonHow to build your own robot with ibm bluemix&watson
How to build your own robot with ibm bluemix&watson
 
Bluemix 智能機器人情境展示
Bluemix 智能機器人情境展示Bluemix 智能機器人情境展示
Bluemix 智能機器人情境展示
 
Cloud development technology sharing (BlueMix premier)
Cloud development technology sharing (BlueMix premier)Cloud development technology sharing (BlueMix premier)
Cloud development technology sharing (BlueMix premier)
 
企業應用行動化開發架構
企業應用行動化開發架構企業應用行動化開發架構
企業應用行動化開發架構
 

Recently uploaded

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

Deep reinforcement learning&Robotics

  • 1. Machine Learning & Robotics DeepReinforcementLearning+ROS Tommy Wu, WCB IoT Solution Architect, Microsoft
  • 3. How to Play Games with Deep RL An artificial agent for general Atari game playing • Learn to master 49 different Atari games directly from game screens • Beat the best performing learner from the same domain in 43 games • Excel human expert in 29 games
  • 4. Let’s Play the Game with Reinforcement Learning Single agent that plays both Space Invaders and Breakout on an ‘above human’ level D:a9reinforcementLearningatari>python atari.py -g Breakout -r 'True'
  • 5. What is the Reinforcement Learning Create an agent that is capable of learning through trial and error and ultimately solving the problem. cart-pole system Cartpole - The goal is to keep the cartpole balanced by applying appropriate forces to a pivot point.
  • 6. Reinforcement Learning Key Factors • RL is a general concept that can be simply described with an agent that takes actions in an environment in order to maximize its cumulative reward. • Agents in RL algorithms are incentivized with punishments for bad actions and rewards for good ones.
  • 7. Deep Q-Learning (DQN) • DQN is a RL technique that is aimed at choosing the best action for given circumstances (observation). Each possible action for each possible observation has its Q value, where ‘Q’ stands for a quality of a given move. • SARS (state, action, reward, state_next, terminal) and perform Experience Replay. • dqn_solver.remember(state, action, reward, state_next, terminal) • dqn_solver.experience_replay()
  • 8. OpenAI Gym toolkit for Reinforcement Leanring • OpenAI Gym is a toolkit for developing and comparing reinforcement learning algorithms. • Gym open-source library - https://github.com/openai/gym • A collection of environments to develop and test RL algorithms. It’s built on a Markov chain model that is illustrated below. Markov Chain
  • 9. Simple Sample for DQN – Path On Right. o-----T
  • 10. Robotics ARM manipulation Move a box by pushing it until it reaches the desired goal in Simulation. My experiment to pickup the medicine box
  • 11. Robotics with Reinforcement Learning The Robot Operating System (ROS) is a set of software libraries and tools that help you build robot applications. From drivers to state-of-the-art algorithms, and with powerful developer tools, ROS has what you need for your next robotics project. And it's all open source.
  • 12. talker.py listener.cpp ROS Node ROS Core /chatter snoop.cpp subscribe subscribe publish ROS Node ROS Node Topic
  • 14. ROS packages for Autonomous System • Gazebo - They provide the necessary interfaces to simulate a robot in Gazebo using ROS messages, services and dynamic reconfigure • SLAM - The gmapping package provides laser-based SLAM (Simultaneous Localization and Mapping), as a ROS node. To create a 2-D occupancy grid map • RVIZ - Rviz is a 3D visualizer for the Robot Operating System (ROS) framework
  • 15. Motion Detector with Camera in ROS roslaunch motion_detector motion_detection.launch rosrun rviz rviz
  • 16. SLAM Navigation with Image Sensing (2D/3D)
  • 17. Robotics Autonomous System Architecture on ROS – AI Care II listen robotcv ROSCore publish /inference_objects publish /STT Iot-central-app listen azure_cs_luis (Speech) /cmd_vel Azure Custom Vision Action Sets Win10 ROS Melodic