1
DEEP LEARNING JP
[DL Papers]
http://deeplearning.jp/
Implicit Behavioral Cloning (CoRL 2021)
Koki Yamane, University of Tsukuba
2022/5/13 2
◼ Implicit Behavioral Cloning
◼ CoRL 2021
◼ Robotics at Google
 Pete Florence, Corey Lynch, Andy Zeng, Oscar Ramirez, Ayzaan Wahid,
Laura Downs, Adrian Wong, Johnny Lee, Igor Mordatch, Jonathan Tompson
◼ https://implicitbc.github.io/
(BC) EBM
2022/5/13 3
◼

◼

◼
2022/5/13 4
ෝ
𝒂 = 𝑭𝜽 𝒐
Behavior Cloning
BC
2022/5/13 5
ෝ
𝒂 = 𝑭𝜽 𝒐
Explicit Model ( )
ෝ
𝒂 = argmin 𝑬𝜽 𝒐, 𝒂
Implicit Model ( )
2022/5/13 6
ෝ
𝒂 = argmin 𝑬𝜽 𝒐, 𝒂
𝐿InfoNCE = ෍
𝑖=0
𝑁
−log ෦
𝑝𝜃 𝑦𝑖|𝑥, ෤
𝑦𝑖
𝑗
𝑗=1
𝑁𝑛𝑒𝑔
෦
𝑝𝜃 𝑦𝑖|𝑥, ෤
𝑦𝑖
𝑗
𝑗=1
𝑁𝑛𝑒𝑔
=
𝑒−𝐸𝜃 𝑥𝑖,𝑦𝑖
𝑒−𝐸𝜃 𝑥𝑖,𝑦𝑖 + σ𝑗=1
𝑁𝑛𝑒𝑔
𝑒
−𝐸𝜃 𝑥𝑖, ෤
𝑦𝑖
𝑗
(CNN+) MLP
2022/5/13 7
Implicit Model
2022/5/13 8
Implicit Model
2022/5/13 9
Implicit Model
2022/5/13 10
2022/5/13 11
Bi-Manual Sweeping Task
Explicit Model ( ) Implicit Model ( )
2022/5/13 12
Insertion Task
Explicit Model ( ) Implicit Model ( )
2022/5/13 13
Sorting Task
Explicit Model ( ) Implicit Model ( )
2022/5/13 14
2022/5/13 15
◼ BC EBM
◼
◼
◼

 RNN

【DL輪読会】Implicit Behavioral Cloning