This document summarizes a presentation on deep reinforcement learning. It introduces Markov decision processes and describes how reinforcement learning can be used for prediction and control tasks. It then discusses different deep reinforcement learning algorithms like DQN for value-based methods and policy gradient methods like REINFORCE, TRPO, and PPO for policy-based optimization. Actor-critic algorithms like A3C and DDPG that combine value and policy learning are also covered. The presentation provides an overview of core concepts and algorithms in deep reinforcement learning.