SlideShare a Scribd company logo
1 of 11
7376212CB107
ARCHAYA R S
REINFORCEMENT
LEARNING
and its impact on the future.
It is important to understand
the utilities of artificial
intelligence
New future
01 - What is it?
02 - Introduction
03 - Elements of RL
04 - Types of RL
05 - Application of RL
06 - Compare RL & SL
07 - Challenges of RL
08 - Conclusion
Reinforcement learning is an area of Artificial
Intelligence; it has emerged as an effective tool
towards building artificially intelligent systems
and solving sequential decision-making
problems. Reinforcement Learning has
achieved many impressive breakthroughs in
recent years and it was able to surpass the
human level in many fields; it is able to play and
win various games. Historically, reinforcement
learning was efficient in solving some control
system problems. Nowadays, it has a
growing range of applications.
ABSTRACT
01 - What is it?
Artificial
Human Intelligence
INTRODUCTION
The upraise of Artificial Intelligence is associated with Deep
Learning achievements in recent years. Deep Learning is
basically a set of multiple layers of neural networks connected to
each other. Reinforcement learning is learning through
interaction with an environment by taking different actions and
experiencing many failures and successes while trying to
maximize the received rewards. It is close to human learning. The
algorithm learns a policy of how to act in the environment. It is a
problem faced by an agent that learns behavior through trail-
and-error interactions with a dynamic environment.
• AGENT: Intelligent Program
• ENVIRONMENT: External Condition
• POLICY: Mapping from states to actions
• REWARD: Defines the goal in the RL problem,
Policy is altered to achieve this goal.
• VALUE: The value of a state is the total
amount of reward an agent can expect to
accumulate over the future, starting from that
state.
• MODEL OF ENVIRONMENT: Predict the
mimic behavior of the environment, then
predict the resultant of the next state and
reward.
ELEMENTS OF RL
Positive: Positive Reinforcement is defined as when an event,
occurs due to a particular behavior, and increases the strength
and the frequency of the behavior.
• Maximizes Performance
• Sustain Change for a long period of time
Negative: Negative Reinforcement is defined as the
strengthening of behavior because a negative condition is
stopped or avoided.
• Increases Behavior
• Provide defiance to a minimum standard of performance
• It Only provides enough to meet up the minimum behavior
TYPES OF REINFORCEMENT LEARNING
refinery’s
operation in
real time.
RL can be used
in large
environments
in the following
situations:
• A model of
the
environmen
t is known,
but an
analytic
solution is
not
available;
• Only a
APPLICATIONS OF RL:
Reinforcement learning, while high in potential, can
be difficult to deploy and remains limited in its
application. One of the barriers to the deployment of
this type of machine learning is its reliance on an
exploration of the environment.
For example, if you were to deploy a robot that was
reliant on reinforcement learning to navigate a complex
physical environment, it will seek new states and take
different actions as it moves. It is difficult to consistently
take the best actions in a real-world environment,
however, because of how frequently the environment
changes.
CHALLENGES OF APPLYING REINFORCEMENT LEARNING
COMPARING
REINFORCEMENT AND
SUPERVISED
LEARNING
However, it's crucial to acknowledge the existing
challenges and limitations. The exploration-exploitation
dilemma, high computational demands, and the need for
careful reward engineering remain obstacles that
researchers and practitioners must address. Striking a
balance between efficient learning and generalization
while minimizing negative societal impacts requires
ethical considerations to be woven into the fabric of
reinforcement learning research.
Looking ahead, the future of reinforcement learning
holds exciting prospects. Advances in algorithms, such as
deep reinforcement learning, along with the integration
of techniques from other disciplines like neuroscience,
psychology, and multi-agent systems, could lead to
breakthroughs in addressing current limitations.
CONCLUSI
ON
THANK
YOU

More Related Content

Similar to REINFORCEMENT LEARNING (reinforced through trial and error).pptx

Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven CuriosityUnlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Hung Le
 
Workshop: Monitoring, evaluation and impact assessment
Workshop: Monitoring, evaluation and impact assessmentWorkshop: Monitoring, evaluation and impact assessment
Workshop: Monitoring, evaluation and impact assessment
WorldFish
 
reinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdfreinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdf
VaishnavGhadge1
 
M Harmon RL Tutorial
M Harmon RL TutorialM Harmon RL Tutorial
M Harmon RL Tutorial
Mance Harmon
 
software engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyonesoftware engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyone
rebantaofficial
 
acai01-updated.ppt
acai01-updated.pptacai01-updated.ppt
acai01-updated.ppt
butest
 

Similar to REINFORCEMENT LEARNING (reinforced through trial and error).pptx (20)

Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
 
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven CuriosityUnlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
 
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
IRJET- A Review on Deep Reinforcement Learning Induced Autonomous Driving Fra...
 
Self Review Framework
Self Review FrameworkSelf Review Framework
Self Review Framework
 
CS3013 -MACHINE LEARNING.pptx
CS3013 -MACHINE LEARNING.pptxCS3013 -MACHINE LEARNING.pptx
CS3013 -MACHINE LEARNING.pptx
 
Workshop: Monitoring, evaluation and impact assessment
Workshop: Monitoring, evaluation and impact assessmentWorkshop: Monitoring, evaluation and impact assessment
Workshop: Monitoring, evaluation and impact assessment
 
V Jornadas eMadrid sobre "Educación Digital". Cristina Conati, University of ...
V Jornadas eMadrid sobre "Educación Digital". Cristina Conati, University of ...V Jornadas eMadrid sobre "Educación Digital". Cristina Conati, University of ...
V Jornadas eMadrid sobre "Educación Digital". Cristina Conati, University of ...
 
reinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdfreinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdf
 
The challenge of wicked problems in airlines engineering ahmad arafat
The challenge of wicked problems in airlines engineering   ahmad arafatThe challenge of wicked problems in airlines engineering   ahmad arafat
The challenge of wicked problems in airlines engineering ahmad arafat
 
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptxRL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
 
Reinforcement Learning with Deep Architectures
Reinforcement Learning with Deep ArchitecturesReinforcement Learning with Deep Architectures
Reinforcement Learning with Deep Architectures
 
M Harmon RL Tutorial
M Harmon RL TutorialM Harmon RL Tutorial
M Harmon RL Tutorial
 
A Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement LearningA Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement Learning
 
software engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyonesoftware engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyone
 
Slhi june 23 08 imp part 1
Slhi june 23 08 imp part 1Slhi june 23 08 imp part 1
Slhi june 23 08 imp part 1
 
Slhi june 23 08 imp part 1
Slhi june 23 08 imp part 1Slhi june 23 08 imp part 1
Slhi june 23 08 imp part 1
 
TEST PPT
TEST PPTTEST PPT
TEST PPT
 
AI: Learning in AI 2
AI: Learning in AI 2AI: Learning in AI 2
AI: Learning in AI 2
 
AI: Learning in AI 2
AI: Learning in AI  2AI: Learning in AI  2
AI: Learning in AI 2
 
acai01-updated.ppt
acai01-updated.pptacai01-updated.ppt
acai01-updated.ppt
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 

REINFORCEMENT LEARNING (reinforced through trial and error).pptx

  • 2. It is important to understand the utilities of artificial intelligence New future 01 - What is it? 02 - Introduction 03 - Elements of RL 04 - Types of RL 05 - Application of RL 06 - Compare RL & SL 07 - Challenges of RL 08 - Conclusion
  • 3. Reinforcement learning is an area of Artificial Intelligence; it has emerged as an effective tool towards building artificially intelligent systems and solving sequential decision-making problems. Reinforcement Learning has achieved many impressive breakthroughs in recent years and it was able to surpass the human level in many fields; it is able to play and win various games. Historically, reinforcement learning was efficient in solving some control system problems. Nowadays, it has a growing range of applications. ABSTRACT 01 - What is it?
  • 4. Artificial Human Intelligence INTRODUCTION The upraise of Artificial Intelligence is associated with Deep Learning achievements in recent years. Deep Learning is basically a set of multiple layers of neural networks connected to each other. Reinforcement learning is learning through interaction with an environment by taking different actions and experiencing many failures and successes while trying to maximize the received rewards. It is close to human learning. The algorithm learns a policy of how to act in the environment. It is a problem faced by an agent that learns behavior through trail- and-error interactions with a dynamic environment.
  • 5. • AGENT: Intelligent Program • ENVIRONMENT: External Condition • POLICY: Mapping from states to actions • REWARD: Defines the goal in the RL problem, Policy is altered to achieve this goal. • VALUE: The value of a state is the total amount of reward an agent can expect to accumulate over the future, starting from that state. • MODEL OF ENVIRONMENT: Predict the mimic behavior of the environment, then predict the resultant of the next state and reward. ELEMENTS OF RL
  • 6. Positive: Positive Reinforcement is defined as when an event, occurs due to a particular behavior, and increases the strength and the frequency of the behavior. • Maximizes Performance • Sustain Change for a long period of time Negative: Negative Reinforcement is defined as the strengthening of behavior because a negative condition is stopped or avoided. • Increases Behavior • Provide defiance to a minimum standard of performance • It Only provides enough to meet up the minimum behavior TYPES OF REINFORCEMENT LEARNING
  • 7. refinery’s operation in real time. RL can be used in large environments in the following situations: • A model of the environmen t is known, but an analytic solution is not available; • Only a APPLICATIONS OF RL:
  • 8. Reinforcement learning, while high in potential, can be difficult to deploy and remains limited in its application. One of the barriers to the deployment of this type of machine learning is its reliance on an exploration of the environment. For example, if you were to deploy a robot that was reliant on reinforcement learning to navigate a complex physical environment, it will seek new states and take different actions as it moves. It is difficult to consistently take the best actions in a real-world environment, however, because of how frequently the environment changes. CHALLENGES OF APPLYING REINFORCEMENT LEARNING
  • 10. However, it's crucial to acknowledge the existing challenges and limitations. The exploration-exploitation dilemma, high computational demands, and the need for careful reward engineering remain obstacles that researchers and practitioners must address. Striking a balance between efficient learning and generalization while minimizing negative societal impacts requires ethical considerations to be woven into the fabric of reinforcement learning research. Looking ahead, the future of reinforcement learning holds exciting prospects. Advances in algorithms, such as deep reinforcement learning, along with the integration of techniques from other disciplines like neuroscience, psychology, and multi-agent systems, could lead to breakthroughs in addressing current limitations. CONCLUSI ON