Introduction to Reinforcement Learning.pptx

INTRODUCTIO
N
TO
Dr. Tri Basuki Kurniawan

RL vs Supervised and
Unsupervised Learning
1.
2.
3.
4.

Some Math Stuff
𝐺𝑡 = 𝑘=0
∞
𝛾𝑘
𝑅𝑡+𝑘+1𝑤ℎ𝑒𝑟𝑒 𝛾 𝜖[0,1)
𝑅𝑡+1 + 𝛾𝑅𝑡+2 + 𝛾2𝑅𝑡+3 …
Sigma (sum up)
Discount rate
Rewards receive at each state
Expanded from of the Equation

Example of Use Case
Agent: The program making
decision on how many ads are
appropriate for a page.
Environment: The web page.
Action: One of three: (1)
putting another ad on the
page; (2) dropping an ad from
the page; (3) neither adding
nor removing .
Reward: Positive when
revenue increase; negative
when revenue drops.
Determining the Placement of Ads on a Web Page

Example of Use Case
Controlling a Walking Robot
AGENT: THE PROGRAM
CONTROLLING A WALKING
ROBOT
ENVIRONMENT: THE REAL
WORLD.
ACTION: ONE OF FOUR MOVE: (1)
FORWARD; (2) BACKWARD; (3)
LEFT AND (4) RIGHT.
REWARD: POSITIVE WHEN IT
APPROACHES THE TARGET
DESTINATION; NEGATIVE WHEN IT
WASTES TIME, GOES IN THE
WRONG DIRECTION OR FALL
DOWN.

THANK YOU
Dr. Tri Basuki Kurniawan

Introduction to Reinforcement Learning.pptx

More Related Content

Recently uploaded

Featured

Introduction to Reinforcement Learning.pptx