[DL輪読会]Fast and Slow Learning of Recurrent Independent Mechanisms

DEEP LEARNING JP
[DL Papers] Fast and Slow Learning of
Recurrent Independent Mechanisms
XIN ZHANG, Matsuo Lab
http://deeplearning.jp/

書誌情報
● タイトル：
○ Fast and Slow Learning of Recurrent Independent Mechanisms
● 著者
○ Kanika Madan, Rosemary Nan Ke, Anirudh Goyal, Bernhard Scholkopfm, Yoshua
Bengio.
● ICLR 2021
● 概要
○ 脳に存在する機能毎に独立な部分を,Modular Networkで実現しようと...
○ Recurrent Independent Mechanisms(RIM)はその一種.
○ RIMの学習を異なるStepで行う仕組みを提案し, 手法を改良した研究. 2

Introduction：Modular Networks
4
Deep Compositional Question Answering with Neural Module Networks 2016
➢ VQA：Parserで再利用な可能なModuleを選び, Networkを作成.

5
➢ 多めにネットワークを生成して, 進化論の思想で, 役立つModuleを残していく.

6
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer 2016
➢ ロボットのModule, タスクのModuleを学習して, 新たな組み合わせに汎化できる.

Meta Learning of Recurrent Independent Mechanisms

RIM:Recurrent Independent Mechanisms
8
➢ Inputを潜在空間にEncode, RIMを通すことで, Inputに関連したMemoryをOutput.
○ OutputをValue, Policyに分割して,PPOの学習に使う.
➢ RIMは, 独立したNこのModule, AttentionでInputに関連したK個のRIMを更新.

Meta Learning of RIM
9
➢ Fast Inner：RIM, Policy head.
➢ SLOW：Input Attention & Communication Attention, Value head.

提案手法：MIR
10
➢ PPOのLoss.
➢ θM, θA,でAttentionとModuleの更新異なるStepで行う.

- Modular Networks（Introdcution）
Related Work
11
- Meta Learning
- Modular meta-learning 2018
- Meta-Learning to Disentangle Causal Mechanisms
- A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms
- Learning neural causal models from unknown interventions

a: Improve sample efficiency?
13
➢ YES, 赤い線が提案手法, 横軸がFrame数.

b: Lead to policy that generalize better?
14
➢ Yes, More DifficultはZero shot Transfer, Baselineを大きくリード.

c: Fast adaptation to new distributions?
15
➢ 簡単な環境でPre-trainして,target 環境で成功率を測る.
○ もっと効率的に知識のピースを再利用していると言える.

Ablation: Meta-Learning setupが大事？
16
➢ Meta-learningの重要さを示す? Meta-LSTMがvanillaより良い図.

Ablation: Sparsity, Slow-factor of Outer loop
17
➢ 全部使うより, SparsityがModuleの機能性を向上させる.
n=4, k=2の例..

Ablation: Value function Visualization
18
➢ 左の図, Valueが上がったり下がったり...ゴールが見えている時は, 高い値を示す.
➢ Frame 12はゴールの目の前にいて,すごく高い, 13はタスク終了なので,低くなる.

Ablation: Visualizing Module Activations
19
➢ 左のInputで,活性化されるModuleを示している. n=5, k=3.
➢ F7のところで左の緑の点が見えて,M5が活性化される..

Ablation: Importance of Fast and Slow Update Loops.
20
➢ Inner loop, Outer loopの役割を交換すると,精度は落ちる. Vanilaと同じ程度に.
➢ AttentionのLearning rateだけを落としても,うまくいかない.(slowLR)

Ablation: Roles of the Active Modules
21
➢ Active Modulesを減らしたら,エピソードを完成するのに,より長い時間をかけた.

まとめ：
- 知識の分解と再利用を実現するのに, 必要なアーキテクチャに関する研究.
- 多くの関連分野(meta RL, HRL, time scale in RL, attention)をうまく繋げた面白い研
究.(OpenReview.)
- 具体的にはRIMをMete-Learning的な考え方で実現してみた.
- Meta-learningの活用で,汎化性能を挙げられることに期待.
まとめ&感想
23
感想：
- Modular Networkの研究が面白い, RIMはBengio先生が推してて重要な研究.
- それぞれのModuleが異なる役割をもっと明確に担当させるのに, 方法がありそう.
- DADS の

Appendix
- 関連研究:
- Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with
Attention over Modules
- ブログ RIM：
- https://www.zhihu.com/search?type=content&q=Recurrent%20independent%20mechanism
s

[DL輪読会]Fast and Slow Learning of Recurrent Independent Mechanisms

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to [DL輪読会]Fast and Slow Learning of Recurrent Independent Mechanisms

Similar to [DL輪読会]Fast and Slow Learning of Recurrent Independent Mechanisms (7)

More from Deep Learning JP

More from Deep Learning JP (20)

Recently uploaded

Recently uploaded (8)

[DL輪読会]Fast and Slow Learning of Recurrent Independent Mechanisms