AI - Introduction to Reinforcement Learning

•Download as PPTX, PDF•

1 like•307 views

This document provides an introduction to reinforcement learning, which is a machine learning method that allows an agent to learn how to take actions in an environment by interacting with it. The agent receives rewards or punishments that shape its behavior toward achieving goals. Through trial and error, the agent learns a policy that maps states to actions in a way that maximizes rewards over time without being explicitly programmed.

Technology

Artificial Intelligence
Reinforcement Learning
Introduction
Portland Data Science Group
Created by Andrew Ferlitsch
Community Outreach Officer
July, 2017

Introduction
• A machine learning method for adaptation to an
environment.
• An agent (e.g., robot) interacts with a dynamic
environment.
• An agent learns from interacting with the environment
the best actions to take.
• Concepts include:
• Markov Principles
• Exploration / Exploitation
• Dynamic Programming

Actions / Environment
Reflex Agent
Environment
Action Observation
Continuous Cycle:
Observe Environment,
Take Action,
Observe Environment,
Take Action
Actions are determined
based on predefined
rules.
Preprogrammed

Intelligent Agent
Wikipedia: An Intelligent Agent is an autonomous entity which observes through sensors
and acts upon an environment using actuators (i.e. it is an agent) and directs its activity
towards achieving goals (i.e. it is "rational", as defined in economics).
Intelligent Agent
Sensors
Actuators
Environment
Senses the environment (e.g., camera,
audio, LIDAR, GPS, ultrasonic)
Room, Street,
Warehouse, etc.
Modifies the environment
(e.g., walk, pickup, drive)
State
Actions
Current State of the Agent
relative to the environment.
Possible Actions the Agent
can take.
Policy
A set of rules that
map a state to an
action.

State / Reward
Intelligent Agent
Environment
Action Observation
State Reward
How the action
effected the agent
and environment.
How positive or
negative is the
new state.
LEARN
What was
learned from
the reward.
Policy
Learned set of
rules of:
States -> Actions
Example Positive Reward:
Robot Stands Up,
Closer to Destination
Example Negative Reward:
Robot Falls Down,
Further from Destination
Reinforcement Learning

Learn by Trial and Error
Un-programmed
Predefined set
of Actions
Reward
Function
Delivered From Factory
Trial and Error
Programmed
Predefined set
of Actions
Reward
Function
Reinforcement Learning
Policy
States ->
Actions

Simple Reinforcement Learning Example
Actions:
Stand
Walk
Run
Rewards:
Stand -> 0
Walk -> 1
Run -> 2
Fall -> -2
Un-programmed
Initial State -> Fall State->Action
Actions: Stand -> 0 Fall -> Stand
Walk -> Robot Falls Down -> -2
Run -> Robot Falls Down -> -2
State->Stand
Stand -> 0
Walk -> 1 Stand->Walk
Run -> Robot Falls Down -> -2
State->Walk
Stand -> 0
Walk -> 1
Run -> 2 Walk->Run
Learned PolicyTrials
Highest Reward

What's hot

Artificial intelligence(03)Nazir Ahmed

Lecture 02-agentsnisar haider bhatti

AISuveeksha

Intelligent Agent PerceptionMolly Maymar

Chapter 2 intelligent agentsLukasJohnny

An Early Warning System for Ambient Assisted LivingAndrea Monacchi

What's hot (6)

Artificial intelligence(03)

Lecture 02-agents

Intelligent Agent Perception

Chapter 2 intelligent agents

An Early Warning System for Ambient Assisted Living

Similar to AI - Introduction to Reinforcement Learning

Lecture 2Shiplu Hawlader

m2-agents.pptxRitwikNayan

AI Basic.pptxDharaDarji5

Lecture 2 Agents.pptxAndrewKuziwakwasheMu

Unit-1.pptxDharaDarji5

introduction to inteligent IntelligentAgent.pptdejene3

Intelligent agentsMohammed Alhabib

Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsPalGov

Agents1Amar Jukuntla

Lec 2-agentsTaymoor Nazmy

A.i lecture 04yarafghani

Ai u1Dr. Kavita Sharma

agents in ai pptPrasanth633635

Introduction To Artificial IntelligenceNeHal VeRma

Artificial IntelligenceVinod Kumar Meghwar

Lec 2 agentsEyob Sisay

Unit 1.pptGEETHAS668001

M2 agentsHadeel AbuShaireh

Similar to AI - Introduction to Reinforcement Learning (20)

Lecture 2

m2-agents.pptx

AI Basic.pptx

Lecture 2 Agents.pptx

Unit-1.pptx

introduction to inteligent IntelligentAgent.ppt

Intelligent agents

Jarrar.lecture notes.aai.2011s.ch2.intelligentagents

Agents1

Lec 2-agents

A.i lecture 04

Ai u1

agents in ai ppt

Introduction To Artificial Intelligence

Artificial Intelligence

Lec 2 agents

Unit 1.ppt

M2 agents

Recently uploaded

Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar

DMCC Future of Trade Web3 - Special EditionDubai Multi Commodity Centre

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang

New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

Vulnerability_Management_GRC_by Sohang Sengupta.pptxnull - The Open Security Community

Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

Pigging Solutions in Pet Food ManufacturingPigging Solutions

Understanding the Laravel MVC ArchitecturePixlogix Infotech

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group

"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

Recently uploaded (20)

Unleash Your Potential - Namagunga Girls Coding Club

DMCC Future of Trade Web3 - Special Edition

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)

New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners

Vulnerability_Management_GRC_by Sohang Sengupta.pptx

Unlocking the Potential of the Cloud for IBM Power Systems

SQL Database Design For Developers at php[tek] 2024

Benefits Of Flutter Compared To Other Frameworks

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

Pigging Solutions in Pet Food Manufacturing

Understanding the Laravel MVC Architecture

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

Snow Chain-Integrated Tire for a Safe Drive on Winter Roads

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...

Unblocking The Main Thread Solving ANRs and Frozen Frames

AI - Introduction to Reinforcement Learning

1. Artificial Intelligence Reinforcement Learning Introduction Portland Data Science Group Created by Andrew Ferlitsch Community Outreach Officer July, 2017

2. Introduction • A machine learning method for adaptation to an environment. • An agent (e.g., robot) interacts with a dynamic environment. • An agent learns from interacting with the environment the best actions to take. • Concepts include: • Markov Principles • Exploration / Exploitation • Dynamic Programming

3. Actions / Environment Reflex Agent Environment Action Observation Continuous Cycle: Observe Environment, Take Action, Observe Environment, Take Action Actions are determined based on predefined rules. Preprogrammed

4. Intelligent Agent Wikipedia: An Intelligent Agent is an autonomous entity which observes through sensors and acts upon an environment using actuators (i.e. it is an agent) and directs its activity towards achieving goals (i.e. it is "rational", as defined in economics). Intelligent Agent Sensors Actuators Environment Senses the environment (e.g., camera, audio, LIDAR, GPS, ultrasonic) Room, Street, Warehouse, etc. Modifies the environment (e.g., walk, pickup, drive) State Actions Current State of the Agent relative to the environment. Possible Actions the Agent can take. Policy A set of rules that map a state to an action.

5. State / Reward Intelligent Agent Environment Action Observation State Reward How the action effected the agent and environment. How positive or negative is the new state. LEARN What was learned from the reward. Policy Learned set of rules of: States -> Actions Example Positive Reward: Robot Stands Up, Closer to Destination Example Negative Reward: Robot Falls Down, Further from Destination Reinforcement Learning

6. Learn by Trial and Error Un-programmed Predefined set of Actions Reward Function Delivered From Factory Trial and Error Programmed Predefined set of Actions Reward Function Reinforcement Learning Policy States -> Actions

7. Simple Reinforcement Learning Example Actions: Stand Walk Run Rewards: Stand -> 0 Walk -> 1 Run -> 2 Fall -> -2 Un-programmed Initial State -> Fall State->Action Actions: Stand -> 0 Fall -> Stand Walk -> Robot Falls Down -> -2 Run -> Robot Falls Down -> -2 State->Stand Stand -> 0 Walk -> 1 Stand->Walk Run -> Robot Falls Down -> -2 State->Walk Stand -> 0 Walk -> 1 Run -> 2 Walk->Run Learned PolicyTrials Highest Reward

AI - Introduction to Reinforcement Learning

Recommended

Recommended

More Related Content

What's hot

What's hot (6)

Similar to AI - Introduction to Reinforcement Learning

Similar to AI - Introduction to Reinforcement Learning (20)

More from Andrew Ferlitsch

More from Andrew Ferlitsch (20)

Recently uploaded

Recently uploaded (20)

AI - Introduction to Reinforcement Learning