Deep reinforcement learning&Robotics

•

0 likes•104 views

湯米吳 Tommy Wu

DRL & Robotics on ROS

Technology

Machine Learning & Robotics
DeepReinforcementLearning+ROS
Tommy Wu,
WCB IoT Solution Architect,
Microsoft

How to Play Games with Deep RL
An artificial agent for general Atari
game playing
• Learn to master 49 different
Atari games directly from
game screens
• Beat the best performing
learner from the same
domain in 43 games
• Excel human expert in 29
games

Let’s Play the Game with Reinforcement Learning
Single agent that plays both Space Invaders and Breakout on an ‘above human’ level
D:a9reinforcementLearningatari>python atari.py -g Breakout -r 'True'

What is the Reinforcement Learning
Create an agent that is capable of learning through trial
and error and ultimately solving the problem.
cart-pole system
Cartpole - The goal is to keep the cartpole
balanced by applying appropriate forces to
a pivot point.

Reinforcement Learning Key Factors
• RL is a general concept that can be simply described with
an agent that takes actions in an environment in order to
maximize its cumulative reward.
• Agents in RL algorithms are incentivized with punishments
for bad actions and rewards for good ones.

Deep Q-Learning (DQN)
• DQN is a RL technique that is aimed at choosing the best action
for given circumstances (observation). Each possible action for
each possible observation has its Q value, where ‘Q’ stands for
a quality of a given move.
• SARS (state, action, reward, state_next, terminal) and perform
Experience Replay.
• dqn_solver.remember(state, action, reward, state_next, terminal)
• dqn_solver.experience_replay()

OpenAI Gym toolkit for Reinforcement Leanring
• OpenAI Gym is a toolkit for developing and comparing reinforcement
learning algorithms.
• Gym open-source library - https://github.com/openai/gym
• A collection of environments to develop and test RL algorithms. It’s built on
a Markov chain model that is illustrated below.
Markov Chain

Simple Sample for DQN – Path On Right.
o-----T

Robotics ARM manipulation
Move a box by pushing it until it reaches the
desired goal in Simulation.
My experiment to
pickup the medicine
box

Robotics with Reinforcement Learning
The Robot Operating System (ROS) is a set of software libraries
and tools that help you build robot applications. From drivers to
state-of-the-art algorithms, and with powerful developer tools,
ROS has what you need for your next robotics project. And it's
all open source.

talker.py listener.cpp
ROS Node
ROS Core
/chatter
snoop.cpp
subscribe subscribe
publish
ROS Node ROS Node
Topic

ROS packages for Autonomous System
• Gazebo - They provide the necessary interfaces
to simulate a robot in Gazebo using ROS
messages, services and dynamic reconfigure
• SLAM - The gmapping package provides laser-based
SLAM (Simultaneous Localization and Mapping), as a
ROS node. To create a 2-D occupancy grid map
• RVIZ - Rviz is a 3D visualizer for the Robot Operating
System (ROS) framework

Motion Detector with Camera in ROS
roslaunch motion_detector motion_detection.launch
rosrun rviz rviz

SLAM Navigation with Image Sensing (2D/3D)

Robotics Autonomous System Architecture on ROS – AI Care II
listen
robotcv
ROSCore
publish
/inference_objects
publish
/STT
Iot-central-app
listen
azure_cs_luis
(Speech)
/cmd_vel
Azure Custom
Vision
Action
Sets
Win10 ROS Melodic

Similar to Deep reinforcement learning&Robotics

Python_for_Visual_Effects_and_Animation_PipelinesRussell Darling

Utility Domestic Robot PresentationNouman Tahir

OpenAI Gym & UniverseEntrepreneur / Startup

SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...South Tyrol Free Software Conference

RobotStudiopp.pptNhaTruongThanh

ie450RobotStudio.pptNhaTruongThanh

License Plate Recognition System using Python and OpenCVVishal Polley

Artificial Intelligence and Deep Learning in Azure, CNTK and TensorflowJen Stirrup

alphablues - ML applied to text and image in chat botsAndré Karpištšenko

Open LLMs: Viable for Production or Low-Quality Toy?M Waleed Kadous

Remoticon - TinyML Workshop.pptxnaeembisma

ARCS Presentation 2008Spondon Saha

Software Architectures, Week 2 - Decomposition techniquesAngelos Kapsimanis

Hot to build continuously processing for 24/7 real-time data streaming platform?GetInData

Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)OpenBlend society

Develop Community-based Android Distribution and Upstreaming Experience National Cheng Kung University

JRuby in Java Projectsjazzman1980

LJC-Unconference-2023-Keynote.pdfEmilyJiang23

Deep Learning with CNTKAshish Jaiman

Teaching an Actor-Critic Agent Through Optimal Scripted Agent TrajectoriesChris Hoyean Song

Similar to Deep reinforcement learning&Robotics (20)

Python_for_Visual_Effects_and_Animation_Pipelines

Utility Domestic Robot Presentation

OpenAI Gym & Universe

SFScon 2020 - Juri Strumpflohner - Beyond Basics - Scaling Development acros...

RobotStudiopp.ppt

ie450RobotStudio.ppt

License Plate Recognition System using Python and OpenCV

Artificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow

alphablues - ML applied to text and image in chat bots

Open LLMs: Viable for Production or Low-Quality Toy?

Remoticon - TinyML Workshop.pptx

ARCS Presentation 2008

Software Architectures, Week 2 - Decomposition techniques

Hot to build continuously processing for 24/7 real-time data streaming platform?

Byteman and The Jokre, Sanne Grinovero (JBoss by RedHat)

Develop Community-based Android Distribution and Upstreaming Experience

JRuby in Java Projects

LJC-Unconference-2023-Keynote.pdf

Deep Learning with CNTK

Teaching an Actor-Critic Agent Through Optimal Scripted Agent Trajectories

Recently uploaded

Vulnerability_Management_GRC_by Sohang Sengupta.pptxnull - The Open Security Community

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community

Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group

Pigging Solutions Piggable Sweeping ElbowsPigging Solutions

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Key Features Of Token Development (1).pptxLBM Solutions

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

Install Stable Diffusion in windows machinePadma Pradeep

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

AI as an Interface for Commercial BuildingsMemoori

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

CloudStudio User manual (basic edition):comworks

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

Recently uploaded (20)

Vulnerability_Management_GRC_by Sohang Sengupta.pptx

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation

Injustice - Developers Among Us (SciFiDevCon 2024)

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx

Snow Chain-Integrated Tire for a Safe Drive on Winter Roads

Pigging Solutions Piggable Sweeping Elbows

Presentation on how to chat with PDF using ChatGPT code interpreter

Key Features Of Token Development (1).pptx

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

Install Stable Diffusion in windows machine

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...

Advanced Test Driven-Development @ php[tek] 2024

AI as an Interface for Commercial Buildings

SQL Database Design For Developers at php[tek] 2024

CloudStudio User manual (basic edition):

08448380779 Call Girls In Civil Lines Women Seeking Men

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

Deep reinforcement learning&Robotics

1. Machine Learning & Robotics DeepReinforcementLearning+ROS Tommy Wu, WCB IoT Solution Architect, Microsoft

2. Today’s Machine Learning Categories

3. How to Play Games with Deep RL An artificial agent for general Atari game playing • Learn to master 49 different Atari games directly from game screens • Beat the best performing learner from the same domain in 43 games • Excel human expert in 29 games

4. Let’s Play the Game with Reinforcement Learning Single agent that plays both Space Invaders and Breakout on an ‘above human’ level D:a9reinforcementLearningatari>python atari.py -g Breakout -r 'True'

5. What is the Reinforcement Learning Create an agent that is capable of learning through trial and error and ultimately solving the problem. cart-pole system Cartpole - The goal is to keep the cartpole balanced by applying appropriate forces to a pivot point.

6. Reinforcement Learning Key Factors • RL is a general concept that can be simply described with an agent that takes actions in an environment in order to maximize its cumulative reward. • Agents in RL algorithms are incentivized with punishments for bad actions and rewards for good ones.

7. Deep Q-Learning (DQN) • DQN is a RL technique that is aimed at choosing the best action for given circumstances (observation). Each possible action for each possible observation has its Q value, where ‘Q’ stands for a quality of a given move. • SARS (state, action, reward, state_next, terminal) and perform Experience Replay. • dqn_solver.remember(state, action, reward, state_next, terminal) • dqn_solver.experience_replay()

8. OpenAI Gym toolkit for Reinforcement Leanring • OpenAI Gym is a toolkit for developing and comparing reinforcement learning algorithms. • Gym open-source library - https://github.com/openai/gym • A collection of environments to develop and test RL algorithms. It’s built on a Markov chain model that is illustrated below. Markov Chain

9. Simple Sample for DQN – Path On Right. o-----T

10. Robotics ARM manipulation Move a box by pushing it until it reaches the desired goal in Simulation. My experiment to pickup the medicine box

11. Robotics with Reinforcement Learning The Robot Operating System (ROS) is a set of software libraries and tools that help you build robot applications. From drivers to state-of-the-art algorithms, and with powerful developer tools, ROS has what you need for your next robotics project. And it's all open source.

12. talker.py listener.cpp ROS Node ROS Core /chatter snoop.cpp subscribe subscribe publish ROS Node ROS Node Topic

13. Workspace Structure -

14. ROS packages for Autonomous System • Gazebo - They provide the necessary interfaces to simulate a robot in Gazebo using ROS messages, services and dynamic reconfigure • SLAM - The gmapping package provides laser-based SLAM (Simultaneous Localization and Mapping), as a ROS node. To create a 2-D occupancy grid map • RVIZ - Rviz is a 3D visualizer for the Robot Operating System (ROS) framework

15. Motion Detector with Camera in ROS roslaunch motion_detector motion_detection.launch rosrun rviz rviz

16. SLAM Navigation with Image Sensing (2D/3D)

17. Robotics Autonomous System Architecture on ROS – AI Care II listen robotcv ROSCore publish /inference_objects publish /STT Iot-central-app listen azure_cs_luis (Speech) /cmd_vel Azure Custom Vision Action Sets Win10 ROS Melodic

Deep reinforcement learning&Robotics

Recommended

Recommended

More Related Content

Similar to Deep reinforcement learning&Robotics

Similar to Deep reinforcement learning&Robotics (20)

More from 湯米吳 Tommy Wu

More from 湯米吳 Tommy Wu (11)

Recently uploaded

Recently uploaded (20)

Deep reinforcement learning&Robotics