Embed presentation
Download to read offline








This document provides an introduction to reinforcement learning. It defines reinforcement learning and distinguishes it from supervised and unsupervised learning. It presents a mathematical formula for calculating rewards received at each state and provides two examples of how reinforcement learning could be used: determining optimal ad placement on a web page to maximize revenue, and controlling a walking robot to reach a target destination. It also briefly mentions the epsilon greedy algorithm.







