(Reference) Yufei Wang, Tianwei Ni. Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient. In 7th ICML Workshop on Automated Machine Learning (2020).