SlideShare a Scribd company logo
1 of 10
Download to read offline
Accelerating
Reinforcement Learning 
in Engineering Systems
Tham Chen Khong
with contributions from Zhou Chongyu and Le Van Duc 
Department of Electrical & Computer Engineering
National University of Singapore
Reinforcement Learning
• An agent learns to perform actions that maximize long term rewards
• Discovers solutions through interaction with the environment
• Different from supervised learning which a lot of deep learning is used for
• Deep learning can still be applied for RL, but target value changes
• RL is a powerful technique, but is usually computationally expensive and 
may be impractical since it requires many interactions with the environment
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 2
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 3
19 Oct 2017
Human‐level Control in Atari 2600 Games
Mnih et al, 2013/15
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 4
Deep Q‐Network
(DQN)
Engineering Systems
• Complex systems
• behaviour described by differential equations
• may have a number of operational modes
• Real‐time control is required
• Need to operate reliably and safely
• Examples
• vehicles, robots, manufacturing and communication and computing systems
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 5
Execution Time: 
CPU vs GPU on OpenAI Gym Tasks
0.00E+00
5.00E+02
1.00E+03
1.50E+03
2.00E+03
2.50E+03
3.00E+03
50 100 150
GPU 1.48E+03 7.13E+02 4.96E+02
CPU 2.92E+03 2.11E+03 2.30E+03
Execution Time(s)
Batch size
Acrobot
GPU CPU
0.00E+00
2.00E+02
4.00E+02
6.00E+02
8.00E+02
1.00E+03
1.20E+03
50 100 150
GPU 7.13E+02 3.88E+01 6.17E+01
CPU 1.07E+03 3.61E+02 2.24E+02
Execution Time(s)
Batch size
CartPole
GPU CPU
Training parameters: 
# Episodes: 1,000
# Iterations in each Episode: 500
CPU: Intel Core i7‐3770 CPU 3.40 GHz (4 cores, GB RAM) (power consumption: 47‐100 W)
GPU: NVIDIA GeForce GTX 1060 (1,280 cores, 6 GB) (~S$350; power consumption: 120 W)
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 6
Engineering Case Studies
• Next Generation Distributed Mobile Computing: 
5G ‐ IoT ‐ Fog/Edge Computing
• Learning Skills on a Robot Manipulator
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 7
Computation Offloading in an Ad‐hoc Mobile Cloud
(5G ‐ IoT ‐ Fog Computing operation scenario) 
• Consider an ad‐hoc mobile cloud where a mobile user can offload its computation tasks to nearby 
mobile cloudlets (e.g., smart phones and vehicles) with available computation resources
• Computation tasks are offloaded to the cloudlets via a device‐to‐device (D2D) communication‐
enabled cellular network
• Deep reinforcement learning (DRL) is applied for the user to learn an optimal offloading policy with 
the objective of maximizing the user’s utility, while minimizing the required payment, energy 
consumption, processing delay and task loss
D V Le, and C K Tham, A Deep Reinforcement Learning (DRL)‐based Offloading Scheme in Ad‐hoc Mobile Clouds, submitted to IEEE PerCom 2018
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 8
DQN‐based Offloading Decision Learning
• The user maintains a neural network to estimate Q‐values for all pairs of states and actions
• In the deep Q‐network, the experience replay and target network techniques are used to 
train the neural network at every learning step
• The  ‐greedy policy is adopted to select an action for each state based on the estimated 
Q‐values
Deep Q‐network
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 9
Offloading Results
Training parameters: 
# Episodes: 100
# Iterations in each Episode: 5,000
epsilon is decreased from 0.99 to 0.1 with decay rate 0.9999
0.00E+00
5.00E+03
1.00E+04
1.50E+04
2.00E+04
2.50E+04
GPU‐DQN GPU‐Q‐
learning
CPU‐DQN CPU‐Q‐
learning
Series1 1.79E+04 1.54E+04 22561.21006 21464.44345
Execution Time (s)
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 10
Achieves good payment, energy consumption, 
processing delay and task loss performance
CPU: Intel Core i7‐3770 CPU 3.40 GHz (4 cores, 16 GB RAM)
GPU: NVIDIA GeForce GTX 1060 (1,280 cores, 6 GB)
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 11
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 12
Robot Manipulator Control
• Goal: move end‐effector from any starting position to the destination 
position without colliding with any obstacle
• Generate torque commands to drive each link
• A dynamical system whose model is governed by differential 
equations
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 13
Train DQN
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 14
Robot Manipulator Results
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 15
CPU: Intel Core i7‐6700, 3.40 GHz (4 cores, 16 GB RAM)
GPU: NVIDIA GeForce GTX 1060 (1,280 cores, 6 GB)
Is this task too difficult for
Reinforcement Learning?
No. It can be solved …
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 17
Faster
Higher resolution
using a different function approximator
Composite Tasks
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 18
Conclusion
• Reinforcement Learning is capable of solving engineering tasks
• Hardware accelerators like GPUs* and algorithmic improvements 
have enabled RL to be accelerated in time and reduce the number of 
iterations or interactions with the environment that are required
• However, direct application of deep learning techniques may not yield 
the best results
• Further research into various aspects is needed to enhance RL’s ability 
to better solve engineering problems
NVIDIA AI Conference 24 Oct 2017 CK Tham ‐ Accelerating RL in Engineering Systems 19
*
Thank You
Questions?
Contact: Tham Chen Khong
E‐mail: eletck@nus.edu.sg

More Related Content

Similar to Accelerating reinforcement-learning-in-engineering

ICLR 2020 Recap
ICLR 2020 RecapICLR 2020 Recap
ICLR 2020 Recap
Sri Ambati
 
Ronan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_webRonan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_web
Ronan Rice
 
Elastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial InternetElastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial Internet
Real-Time Innovations (RTI)
 

Similar to Accelerating reinforcement-learning-in-engineering (20)

Univa Presentation at DAC 2020
Univa Presentation at DAC 2020 Univa Presentation at DAC 2020
Univa Presentation at DAC 2020
 
Jennifer Whyte - Digital innovation and the transformation of infrastructure ...
Jennifer Whyte - Digital innovation and the transformation of infrastructure ...Jennifer Whyte - Digital innovation and the transformation of infrastructure ...
Jennifer Whyte - Digital innovation and the transformation of infrastructure ...
 
Quality 4.0 and reimagining quality
Quality 4.0 and reimagining qualityQuality 4.0 and reimagining quality
Quality 4.0 and reimagining quality
 
Sodc 1 Introduction
Sodc 1 IntroductionSodc 1 Introduction
Sodc 1 Introduction
 
Cloud Computing Essentials in Alibaba Cloud
Cloud Computing Essentials in Alibaba CloudCloud Computing Essentials in Alibaba Cloud
Cloud Computing Essentials in Alibaba Cloud
 
Network of Electronic Self-Navigating Transports Presentation (NEST)
Network of Electronic Self-Navigating Transports Presentation (NEST)Network of Electronic Self-Navigating Transports Presentation (NEST)
Network of Electronic Self-Navigating Transports Presentation (NEST)
 
Gridcomputing
Gridcomputing Gridcomputing
Gridcomputing
 
IC Design Physical Verification
IC Design Physical VerificationIC Design Physical Verification
IC Design Physical Verification
 
ICLR 2020 Recap
ICLR 2020 RecapICLR 2020 Recap
ICLR 2020 Recap
 
IRJET-To Implement Cloud Computing by using Agile Methodology in Indian E-Gov...
IRJET-To Implement Cloud Computing by using Agile Methodology in Indian E-Gov...IRJET-To Implement Cloud Computing by using Agile Methodology in Indian E-Gov...
IRJET-To Implement Cloud Computing by using Agile Methodology in Indian E-Gov...
 
Orchestrating the lifecycle of network services
Orchestrating the lifecycle of network servicesOrchestrating the lifecycle of network services
Orchestrating the lifecycle of network services
 
EU-Taiwan Workshop on 5G Research, PRISTINE introduction
EU-Taiwan Workshop on 5G Research, PRISTINE introductionEU-Taiwan Workshop on 5G Research, PRISTINE introduction
EU-Taiwan Workshop on 5G Research, PRISTINE introduction
 
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
 
Ronan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_webRonan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_web
 
HPC in higher education
HPC in higher educationHPC in higher education
HPC in higher education
 
RECAP Project Overview
RECAP Project OverviewRECAP Project Overview
RECAP Project Overview
 
Introduction to SDN
Introduction to SDNIntroduction to SDN
Introduction to SDN
 
aristi1.pptx
aristi1.pptxaristi1.pptx
aristi1.pptx
 
IEEE SmartTech 2019 Next Generation Private and Hybrid Clouds, Edge Computing...
IEEE SmartTech 2019 Next Generation Private and Hybrid Clouds, Edge Computing...IEEE SmartTech 2019 Next Generation Private and Hybrid Clouds, Edge Computing...
IEEE SmartTech 2019 Next Generation Private and Hybrid Clouds, Edge Computing...
 
Elastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial InternetElastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial Internet
 

Recently uploaded

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 

Recently uploaded (20)

Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 

Accelerating reinforcement-learning-in-engineering