[DL Reading Group] Learning Self-Imitating Diverse Policies

1
DEEP LEARNING JP
[DL Papers]
http://deeplearning.jp/
“Learning Self-Imitating Diverse Policies (ICLR2019 under review)”
...

Bibliographic information
•URL
–OpenReview, ArXiv
•Authors
–Tanmay Gangwani, Qiang Liu, Jian Peng
–University of Illinois at Urbana-Champaign
•Status
–ICLR2019 under review
...

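Overview
•Background: reinforcement learning struggles with episodic, sparse, and noisy rewards
–Episodic reward: a reward is given only at the end of the episode
–Sparse reward: a reward is given only under certain conditions (e.g. Montezuma’s Revenge)
–Noisy reward: ...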
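
The episodic and sparse reward settings listed above can be illustrated with a small sketch. This is not code from the paper: the helper names `to_episodic` and `to_sparse`, the example reward values, and the choice to pay out the summed reward at the last step are all assumptions made for illustration. The noisy setting is left out because its definition is truncated in the slides.

```python
from typing import Callable, List


def to_episodic(step_rewards: List[float]) -> List[float]:
    """Withhold every per-step reward and pay out the total at the last step.

    Assumption: the end-of-episode reward is the sum of the per-step rewards.
    """
    if not step_rewards:
        return []
    return [0.0] * (len(step_rewards) - 1) + [float(sum(step_rewards))]


def to_sparse(step_rewards: List[float],
              condition: Callable[[int], bool]) -> List[float]:
    """Keep the reward only at time steps where the (hypothetical) condition holds."""
    return [r if condition(t) else 0.0 for t, r in enumerate(step_rewards)]


# Hypothetical dense reward sequence for a four-step episode.
dense = [1.0, 0.5, 2.0, 0.5]

episodic = to_episodic(dense)                 # reward appears only at the final step
sparse = to_sparse(dense, lambda t: t == 2)   # reward appears only at step 2

print(episodic)
print(sparse)
```

In both transformed sequences most time steps carry zero reward, so the agent gets little per-step learning signal, which is the difficulty the background bullet describes.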
