Causal Confusion in Imitation Learning
Character Controllers using Motion VAEs
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
PRM-RL: Long-range Robotics Navigation Tasks by Combining Reinforcement Learning and Sampling-based Planning
Exploration Strategies in Reinforcement Learning
Maximum Entropy Reinforcement Learning (Stochastic Control)
Safe Reinforcement Learning
Planning and Learning with Tabular Methods