Bentham & Hooker's Classification. along with the merits and demerits of the ...
Suphx: Mastering Mahjong with Deep Reinforcement Learning
1. DLゼミ
Suphx: Mastering Mahjong with Deep
Reinforcement Learning
北海道⼤学 ⼤学院情報科学研究院
情報理⼯学部⾨ 複合情報⼯学分野 調和系⼯学研究室
修⼠課程 2年 織⽥ 智⽮
2. • Junjie Li, Sotetsu Koyamada, Qiwei Ye, Guoqing
Liu, Chao Wang, Ruihan Yang, Li Zhao,Tao Qin,
Tie-Yan Liu, Hsiao-Wuen Hon
• Microsoft Research Asia
• Kyoto University
• University of Science and Technology of China
• Tsinghua University
• Nankai University
• Arxiv Mar 2020
書誌情報 2