Slideshare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our User Agreement and Privacy Policy.
Slideshare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our Privacy Policy and User Agreement for details.
Published on
ICLR2019に投稿されたR2D2 (Recurrent Replay Distributed DQN)アルゴリズムの紹介。
深層強化学習の分散化の流れ(A3C/Ape-X/IMPALA)やRNN活用(DRQN)についても合わせて概観しています。
※ 分かりにくい箇所や、不正確な記載があればコメントいただけると嬉しいです。
Login to see the comments