33. Appendix1: A2C-同期分散学習-
2020/5/15
MARLとM^3RL@総合ゼミ
清原 明加
33
In a single CPU
(multi-threading)
π V
Network
Input
π V
Network
Input
π V
Network
Input
π V
Network
Input
Synchronizer
Global Parameters
gradients
Updating
parameters
distributed learning
with multi agents