Secrets behind AlphaGo

Behind AlphaGo
- Deep Reinforcement Learning
Houston Machine Learning
Yan Xu
08/05/2017

Roadmap
• Introduction and Feature Engineering (2 lectures)
• Supervised Learning (4 lectures)
• Unsupervised Learning (3 lectures)
• Deep Learning series (4 lectures)
• Optimization in Deep learning
• Behind AlphaGo
• Mastering the game of Go with deep neural networks and tree search
• Reinforcement learning 101 – Ravi
• Deep reinforcement learning
• Attention network
• Cuda Programming Hands-on - Martin
• Application of Deep Learning
• Object recognition
• Chatbot
Slides posted on:
http://www.slideshare.net/xuyangela

Outline
• Go game
• Deep learning recap
• Reinforcement learning 101
• AlphaGo system overview: Deep reinforcement
learning
More depth into deep reinforcement learning in upcoming
meetups

Deep Learning
+
Reinforcement Learning
=
Deep reinforcement learning

Deep Learning: Basic Component
Activation
function

Deep Learning: Architecture
Convolutional neural network
feature
map
Feature map

Agent
Environment

https://www.youtube.com/watch?v=xWe58WGWmlk&t=64s

AlphaGo: Deep Reinforcement Learning
http://techtalks.tv/talks/deep-reinforcement-learning/62360/
Mimic human
experts
Play against self
Estimate wins
Policy Network
Value Network

http://techtalks.tv/talks/deep-reinforcement-learning/62360/

More coming up!
• Behind AlphaGo
• Mastering the game of Go with deep neural networks and
tree search
• Reinforcement learning 101 – Ravi
• Deep reinforcement learning
• Attention network
• Cuda Programming Hands-on - Martin
• Application of Deep Learning
• Object recognition
• Chatbot
• Any proposal?

HML Speaker Hall of Fame
Recognize contribution to Houston Machine Learning Meetup

Thank you ~
Slides will be posted at:
http://www.slideshare.net/xuyangela
Leave a
group
review
please 

Secrets behind AlphaGo

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Secrets behind AlphaGo

Similar to Secrets behind AlphaGo (20)

More from Yan Xu

More from Yan Xu (20)

Recently uploaded

Recently uploaded (20)

Secrets behind AlphaGo