Introduction2drl

•

2 likes•161 views

Shenglin Zhao

introduction to deep reinforcement learning

Data & Analytics

Outline
•  Background
•  Deep Learning
•  Reinforcement Learning
•  Deep Reinforcement Learning
•  Conclusion

Milestone Issues
•  NIPS 2013, DeepMind, Playing Atari with Deep Reinforcement Learning,
https://arxiv.org/abs/1312.5602
•  Nature cover paper 2015, DeepMind, Human-level control through deep
reinforcement learning, www.nature.com/articles/nature14236
•  Nature cover paper 2016, DeepMind, Mastering the game of Go with deep
neural networks and tree search, www.nature.com/articles/nature16961

Reinforcement Learning in a nutshell
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Deep Learning in a nutshell
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Deep Reinforcement Learning:AI = RL + DL
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Examples of Deep RL@DeepMind
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Deep Learning = Learning Representations/Features
•  Traditional Model
•  Deep Learning
http://www.cs.nyu.edu/~yann/talks/lecun-ranzato-icml2013.pdf

Deep Representations
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Deep Neural Network
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Example-CNN
http://xrds.acm.org/blog/2016/06/convolutional-neural-networks-cnns-illustrated-explanation/
Convolutional Operator
Discrete Form
Matrix Element-wise Multiplication
1 2 0
0 1 0
0 0 1
2 2 7
5 0 7
1 2 1
= 7

Example-CNN
https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-Understanding-Convolutional-Neural-Networks/
Pixel representation of filter Visualization

Example-CNN
https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-Understanding-Convolutional-Neural-Networks/
Original image What we find

Example-CNN
http://www.cs.nyu.edu/~yann/talks/lecun-ranzato-icml2013.pdf
Low-Level
Feature
Mid-Level
Feature
High-Level
Feature
Trainable
Classifier

Reinforcement Learning
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Agent and Environment
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf
Agent
Environment

State
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Major Components
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Policy
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Value Function
Bellman equation
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Optimal Case
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Approaches to Reinforcement Learning
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

RL Example
•  Assumption
•  Suppose we have 5 rooms in a building connected by doors
•  The outside of the building can be thought of as one big room (5)
•  Target
•  Put an agent in any room, and from that room, go outside the building
http://mnemstudio.org/path-finding-q-learning-tutorial.htm

Q-learning for the RL Problem
•  Assuming rewards for each step, the goal is to reach the state with
the highest reward.
•  Terms— state: room, action: move decision, reward: 0 or 100
RL Problem Markov Decision Process
http://mnemstudio.org/path-finding-q-learning-tutorial.htm

Q-learning for the RL Problem
Reward Table Q-value Table

Q-learning for the RL Problem
•  Q-table is the brain of our agent, representing the memory of what
the agent has learned through experience.
•  The agent starts out knowing nothing, the matrix Q is initialized to
zero.
•  Simple transition rule of Q learning,
http://mnemstudio.org/path-finding-q-learning-tutorial.htm

Q-learning for the RL Problem
http://mnemstudio.org/path-finding-q-learning-tutorial.htm

Q-learning for the RL Problem
•  Example
•  Initial state: room 1, action: move to 5
http://mnemstudio.org/path-finding-q-learning-tutorial.htm

Q-learning for the RL Problem
•  Example
•  Initial state: room 3, action: move to 1
Q(3, 1) = 0 + 0.8 * 100 = 80
http://mnemstudio.org/path-finding-q-learning-tutorial.htm

Outline
•  Background
•  Deep Learning
•  Reinforcement Learning
•  Deep Reinforcement Learning
•  Conclusions

Deep Reinforcement Learning
http://videolectures.net/rldm2015_silver_reinforcement_learning/
Deep
Learning
Reinforcement
Learning
Deep
Reinforcement
Learning

DQN Architecture
Naive formulation of deep Q-network DQN in DeepMind paper
https://www.nervanasys.com/demystifying-deep-reinforcement-learning/

DQN Architecture
https://www.nervanasys.com/demystifying-deep-reinforcement-learning/

DQN
Loss function
Q-table update algorithm
https://www.nervanasys.com/demystifying-deep-reinforcement-learning/

Stability Issues with Deep RL
http://videolectures.net/rldm2015_silver_reinforcement_learning/

DQN
http://videolectures.net/rldm2015_silver_reinforcement_learning/

Experience Replay
http://videolectures.net/rldm2015_silver_reinforcement_learning/

Fixed Target Q-Network
http://videolectures.net/rldm2015_silver_reinforcement_learning/

Reward/Value Range
http://videolectures.net/rldm2015_silver_reinforcement_learning/

References
•  http://karpathy.github.io/2016/05/31/rl/
•  https://gym.openai.com/docs/rl
•  https://www.nervanasys.com/demystifying-deep-reinforcement-
learning/
•  http://mnemstudio.org/path-finding-q-learning-tutorial.htm
•  http://videolectures.net/rldm2015_silver_reinforcement_learning/
•  http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Similar to Introduction2drl

Strata San Jose 2016: Scalable Ensemble Learning with H2OSri Ambati

MDEC Data Matters Series: machine learning and Deep Learning, A PrimerPoo Kuan Hoong

Deep learning - a primerUwe Friedrichsen

Deep learning - a primerShirin Elsinghorst

Deep Learning Made Easy with Deep FeaturesTuri, Inc.

DeepLearningLecture.pptxssuserf07225

Development of Deep Learning ArchitecturePantech ProLabs India Pvt Ltd

An introduction to Machine Learning (and a little bit of Deep Learning)Thomas da Silva Paula

孫民/從電腦視覺看人工智慧 : 下一件大事台灣資料科學年會

DEF CON 24 - Clarence Chio - machine duping 101Felipe Prado

Deep learning with KerasQuantUniversity

Deep Learning Interview Questions and Answers | EdurekaEdureka!

Ml - A shallow diveGopi Krishna Nuti

Deep Learning with Python: A Complete Practical Course for ResearchersATITAcademy

Apache MXNet ODSC West 2018Apache MXNet

Intro to TensorFlow and PyTorch Workshop at Tubular LabsKendall

Deep learning to the rescue - solving long standing problems of recommender ...Balázs Hidasi

lecture01v1.pdf2014csb1020

Artificial Intelligence, Machine Learning and Deep LearningSujit Pal

Notes from 2016 bay area deep learning school Niketan Pansare

Similar to Introduction2drl (20)

Strata San Jose 2016: Scalable Ensemble Learning with H2O

MDEC Data Matters Series: machine learning and Deep Learning, A Primer

Deep learning - a primer

Deep Learning Made Easy with Deep Features

DeepLearningLecture.pptx

Development of Deep Learning Architecture

An introduction to Machine Learning (and a little bit of Deep Learning)

孫民/從電腦視覺看人工智慧 : 下一件大事

DEF CON 24 - Clarence Chio - machine duping 101

Deep learning with Keras

Deep Learning Interview Questions and Answers | Edureka

Ml - A shallow dive

Deep Learning with Python: A Complete Practical Course for Researchers

Apache MXNet ODSC West 2018

Intro to TensorFlow and PyTorch Workshop at Tubular Labs

Deep learning to the rescue - solving long standing problems of recommender ...

lecture01v1.pdf

Artificial Intelligence, Machine Learning and Deep Learning

Notes from 2016 bay area deep learning school

Recently uploaded

Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863

Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh

Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda

Brighton SEO | April 2024 | Data StorytellingNeil Barnes

From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

Spark3's new memory model/managementakshesh doshi

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo

Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H

VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat

RadioAdProWritingCinderellabyButleri.pdfgstagge

Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Recently uploaded (20)

Dubai Call Girls Wifey O52&786472 Call Girls Dubai

Schema on read is obsolete. Welcome metaprogramming..pdf

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

RA-11058_IRR-COMPRESS Do 198 series of 1998

Customer Service Analytics - Make Sense of All Your Data.pptx

Brighton SEO | April 2024 | Data Storytelling

From idea to production in a day – Leveraging Azure ML and Streamlit to build...

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

Spark3's new memory model/management

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改

Call Girls In Mahipalpur O9654467111 Escorts Service

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf

VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service

RadioAdProWritingCinderellabyButleri.pdf

Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...

Introduction2drl

1. Introduction to Deep Reinforcement Learning Shenglin Zhao Department of Computer Science & Engineering The Chinese University of Hong Kong

2. Outline •  Background •  Deep Learning •  Reinforcement Learning •  Deep Reinforcement Learning •  Conclusion

3. Outline •  Background •  Deep Learning •  Reinforcement Learning •  Deep Reinforcement Learning •  Conclusion

4. Milestone Issues •  NIPS 2013, DeepMind, Playing Atari with Deep Reinforcement Learning, https://arxiv.org/abs/1312.5602 •  Nature cover paper 2015, DeepMind, Human-level control through deep reinforcement learning, www.nature.com/articles/nature14236 •  Nature cover paper 2016, DeepMind, Mastering the game of Go with deep neural networks and tree search, www.nature.com/articles/nature16961

5. Reinforcement Learning in a nutshell http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

6. Deep Learning in a nutshell http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

7. Deep Reinforcement Learning:AI = RL + DL http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

8. Examples of Deep RL@DeepMind http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

9. Outline •  Background •  Deep Learning •  Reinforcement Learning •  Deep Reinforcement Learning •  Conclusion

10. Deep Learning = Learning Representations/Features •  Traditional Model •  Deep Learning http://www.cs.nyu.edu/~yann/talks/lecun-ranzato-icml2013.pdf

11. Deep Representations http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

12. Deep Neural Network http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

13. Example-CNN http://xrds.acm.org/blog/2016/06/convolutional-neural-networks-cnns-illustrated-explanation/ Convolutional Operator Discrete Form Matrix Element-wise Multiplication 1 2 0 0 1 0 0 0 1 2 2 7 5 0 7 1 2 1 = 7

14. Example-CNN https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-Understanding-Convolutional-Neural-Networks/ Pixel representation of filter Visualization

15. Example-CNN https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-Understanding-Convolutional-Neural-Networks/ Original image What we find

16. Example-CNN http://www.cs.nyu.edu/~yann/talks/lecun-ranzato-icml2013.pdf Low-Level Feature Mid-Level Feature High-Level Feature Trainable Classifier

17. Outline •  Background •  Deep Learning •  Reinforcement Learning •  Deep Reinforcement Learning •  Conclusion

18. Reinforcement Learning http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

19. Agent and Environment http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf Agent Environment

20. State http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

21. Major Components http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

22. Policy http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

23. Value Function Bellman equation http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

24. Optimal Case http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

25. Approaches to Reinforcement Learning http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

26. RL Example •  Assumption •  Suppose we have 5 rooms in a building connected by doors •  The outside of the building can be thought of as one big room (5) •  Target •  Put an agent in any room, and from that room, go outside the building http://mnemstudio.org/path-finding-q-learning-tutorial.htm

27. RL Example

28. Q-learning for the RL Problem •  Assuming rewards for each step, the goal is to reach the state with the highest reward. •  Terms— state: room, action: move decision, reward: 0 or 100 RL Problem Markov Decision Process http://mnemstudio.org/path-finding-q-learning-tutorial.htm

29. Q-learning for the RL Problem Reward Table Q-value Table

30. Q-learning for the RL Problem •  Q-table is the brain of our agent, representing the memory of what the agent has learned through experience. •  The agent starts out knowing nothing, the matrix Q is initialized to zero. •  Simple transition rule of Q learning, http://mnemstudio.org/path-finding-q-learning-tutorial.htm

31. Q-learning for the RL Problem http://mnemstudio.org/path-finding-q-learning-tutorial.htm

32. Q-learning for the RL Problem •  Example •  Initial state: room 1, action: move to 5 http://mnemstudio.org/path-finding-q-learning-tutorial.htm

33. Q-learning for the RL Problem •  Example •  Initial state: room 3, action: move to 1 Q(3, 1) = 0 + 0.8 * 100 = 80 http://mnemstudio.org/path-finding-q-learning-tutorial.htm

34. Q-learning for the RL Problem http://mnemstudio.org/path-finding-q-learning-tutorial.htm

35. Outline •  Background •  Deep Learning •  Reinforcement Learning •  Deep Reinforcement Learning •  Conclusions

36. Deep Reinforcement Learning http://videolectures.net/rldm2015_silver_reinforcement_learning/ Deep Learning Reinforcement Learning Deep Reinforcement Learning

37. Deep Q-learning Deep Q-Network (DQN)

38. DQN Architecture Naive formulation of deep Q-network DQN in DeepMind paper https://www.nervanasys.com/demystifying-deep-reinforcement-learning/

39. DQN Architecture https://www.nervanasys.com/demystifying-deep-reinforcement-learning/

40. DQN Loss function Q-table update algorithm https://www.nervanasys.com/demystifying-deep-reinforcement-learning/

41. Exploration-Exploitation

42. Value Iteration

43. Policy Iteration

44. Stability Issues with Deep RL http://videolectures.net/rldm2015_silver_reinforcement_learning/

45. DQN http://videolectures.net/rldm2015_silver_reinforcement_learning/

46. Experience Replay http://videolectures.net/rldm2015_silver_reinforcement_learning/

47. Fixed Target Q-Network http://videolectures.net/rldm2015_silver_reinforcement_learning/

48. Reward/Value Range http://videolectures.net/rldm2015_silver_reinforcement_learning/

49. DQN Results in Atari

50. DQN Atari Demo

51. Conclusion

52. Demo

53. References •  http://karpathy.github.io/2016/05/31/rl/ •  https://gym.openai.com/docs/rl •  https://www.nervanasys.com/demystifying-deep-reinforcement- learning/ •  http://mnemstudio.org/path-finding-q-learning-tutorial.htm •  http://videolectures.net/rldm2015_silver_reinforcement_learning/ •  http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf

Introduction2drl

Recommended

Recommended

More Related Content

Similar to Introduction2drl

Similar to Introduction2drl (20)

Recently uploaded

Recently uploaded (20)

Introduction2drl