2. 論文概要
• 著者
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski,
Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver
• AAAI 2018
https://arxiv.org/abs/1710.02298
• 被引用数:36件(2018/6/28)
• 複数のDQNの拡張モデルを組み合わせて出来たRainbowという
ハイパフォーマンスな手法を提案し、当時のstate-of-the-art(最先端技術)となった
1
19. 参考文献
• Deep Reinforcement Learning with Double Q-learning
Hado van Hasselt, Arthur Guez, David Silver
• Prioritized Experience Replay
Tom Schaul, John Quan, Ioannis Antonoglou, David Silver
• Dueling Network Architectures for Deep Reinforcement Learning
Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas
• A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare, Will Dabney, Rémi Munos
• Noisy Networks for Exploration
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex
Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane
Legg
• https://github.com/arXivTimes/arXivTimes/issues/439
18