Playing the Snake Game with Deep Reinforcement Learning (by Chuyang Liu)

Playing the Snake Game with
Deep Reinforcement Learning
Chuyang Liu
https://github.com/chuyangliu/snake

Contents
01
02
03
04
Background
My Works
Algorithms
Experiments
05 Summary

Background
2015 2016 2017
DQN Optimizations
DDQN, Prioritized, Duel
Human-Level Control through Deep
Reinforcement Learning (DeepMind)
Deep Q-Network (DQN)
Snake AI
(Hawstein, Tobin Ehlis,
Christopher Lockhart, …)
Graph Theory, GP, RL, …
Snake AI (Chuyang Liu)
Path Search, Hamilton Cycle
Benchmark: Atari 2600 BFS, Longest Path

My Works
?
Designed global/local state vector
Discussed absolute/relative move direction
Feature Extraction
Implemented Deep Q-Network
(Experience Replay & Fixed Target-Net)
DQN
Compared the performance of
the three optimizations for DQN
DQN Optimizations
Designed a better reward
policy improving performance
Environment

𝜽 ← 𝜽 + 𝜼
𝝏
𝝏𝜽
𝒓 + 𝜸𝐦𝐚𝐱
𝒂′
𝑸 𝒕𝒂𝒓𝒈𝒆𝒕 𝒔′
, 𝒂′
; 𝜽′
− 𝑸 𝒆𝒗𝒂𝒍(𝒔, 𝒂; 𝜽)
𝟐
Policy
𝑎 ← 𝜋 𝑠
Eval-NetTarget-Net
Q-Target Q-Eval
Loss
state
action
state’reward
Environment
𝑟, 𝑠′
← 𝑒𝑛𝑣 𝑠, 𝑎
Algorithm: Computation Graph
assign

Environment
𝑟, 𝑠′
← 𝑒𝑛𝑣 𝑠, 𝑎
Algorithm: Environment
FOOD +1.0
DEAD -0.5 (game over)
ELSE -0.005
𝜽 ← 𝜽 + 𝜼
𝝏
𝝏𝜽
𝒓 + 𝜸𝐦𝐚𝐱
𝒂′
𝑸 𝒕𝒂𝒓𝒈𝒆𝒕 𝒔′
, 𝒂′
; 𝜽′
− 𝑸 𝒆𝒗𝒂𝒍(𝒔, 𝒂; 𝜽)
𝟐
Policy
𝑎 ← 𝜋 𝑠
state
action
state’reward

Eval-NetTarget-Net
Algorithm: Network & State Vector
assign

Target-Net
Algorithm: Network & State Vector
global (8x8x4)
local (1x3)
Conv3x3x32
Conv3x3x64
Conv2x2x128
Conv2x2x256
FC1024
FC512
Advantage1x3 Value1x1
Q1x3
FC512
Eval-Net
Same structure
but different weights and bias
assign

Summary
Double DQN
Prioritized Experience Replay
Dueling Network Structure
DQN
Feature Extraction
Environment Feedback
State & Action
Human experience
Pixels input
Network Inputs
Deep Neural Network
Deep Q-Learning from
Demonstrations (2018)
Training Efficiency

Thanks
Chuyang Liu
https://github.com/chuyangliu/snake

Playing the Snake Game with Deep Reinforcement Learning (by Chuyang Liu)

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Playing the Snake Game with Deep Reinforcement Learning (by Chuyang Liu)

Similar to Playing the Snake Game with Deep Reinforcement Learning (by Chuyang Liu) (20)

Recently uploaded

Recently uploaded (20)

Playing the Snake Game with Deep Reinforcement Learning (by Chuyang Liu)