SlideShare a Scribd company logo
1 of 11
CS3013 -MACHINE LEARNING
UNIT 5 SEMINAR
SUBJECT MENTOR PRESENTED BY
Dr.T R .Saravanan Ms.J.SHERINE GLORY,
Assistant Professor, PC2113003013020
SRMIST (KTR CAMPUS) RESEARCH SCHOLAR (PART TIME EXTERNAL)
Reinforcement learning
 Close to human learning.
 Algorithm learns a policy of how to act in a given environment.
 Every action has some impact in the environment, and the environment provides
rewards that guides the learning algorithm.
INTRODUCTION
 Meaning of Reinforcement:Occurrence of an event, in the proper relation to a
response, that tends to increase the probability that the response will occur again in the
same situation.
 Reinforcement learning is the problem faced by an agent that learns behavior through
trial-and-error interactions with a dynamic environment.
 Reinforcement Learning is learning how to act in order to maximize a numerical reward.
 Reinforcement learning is not a type of neural network, nor is it an alternative to
neural networks. Rather, it is an orthogonal approach for Learning Machine.
 Reinforcement learning emphasizes learning feedback that evaluates the learner's
performance without providing standards of correctness in the form of behavioral
targets.
Example: Bicycle learning
Element of reinforcement learning
 Agent: Intelligent programs
 Environment: External condition
 Policy:
Defines the agent’s behavior at a given time.
A mapping from states to actions.
Lookup tables or simple function
Element of reinforcement learning
 Reward function :
Defines the goal in an RL problem.
Policy is altered to achieve this goal.
Value function:
Reward function indicates what is good in an immediate sense while a value
function specifies what is good in the long run.
Value of a state is the total amount of reward an agent can expect to
accumulate over the future, starting form that state.
Model of the environment :
Predict mimic behavior of environment.
Used for planning & if Know current state and action then predict the resultant
next state and next reward.
Steps for Reinforcement Learning
 1. The agent observes an input state
 2. An action is determined by a decision making function (policy).
 3. The action is performed.
 4. The agent receives a scalar reward or reinforcement from the environment.
 5. Information about the reward given for that state / action pair is recorded
Features of Reinforcement Learning
 Set of problems rather than a set of techniques.
 Without specifying how the task is to be achieved.
 “RL as a tool” point of view.
RL is training by rewards and punishments.
Train tool for the computer learning.
 The learning agent’s point of view:
RL is learning from trial and error with the world.
Eg. how much reward I much get if I get this.
Reinforcement Learning (Cont..)
 Reinforcement Learning uses Evaluative Feedback
 Purely Evaluative Feedback
Indicates how good the action taken is.
Not tell if it is the best or the worst action possible.
Basis of methods for function optimization.
 Purely Instructive Feedback
Indicates the correct action to take, independently of the action actually taken.
Eg: Supervised Learning
Associative & Non Associative Tasks
 Associative :
Situation Dependent
Mapping from situation to the actions that are best in that situation
 Non Associative:
Situation independent
No need for associating different action with different situations.
Learner either tries to find a single best action when the task is stationary, or
tries to track the best action as it changes over time when the task is non stationary.
Reinforcement Learning (Cont..)
 Exploration and exploitation
Greedy action: Action chosen with greatest estimated value. Greedy action: a
case of Exploitation.
Non greedy action: a case of Exploration, as it enables us to improve estimate
the non-greedy action's value.
 N-armed bandit Problem:
We have to choose from n different options or actions.
We will choose the one with maximum reward.
CS3013 -MACHINE LEARNING.pptx

More Related Content

Similar to CS3013 -MACHINE LEARNING.pptx

reiniforcement learning.ppt
reiniforcement learning.pptreiniforcement learning.ppt
reiniforcement learning.pptcharusharma165
 
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptxRL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptxdeeplearning6
 
A Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement LearningA Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement Learningijtsrd
 
RL_online _presentation_1.ppt
RL_online _presentation_1.pptRL_online _presentation_1.ppt
RL_online _presentation_1.pptssuser43a599
 
23 forms of learning-N.doc
23 forms of learning-N.doc23 forms of learning-N.doc
23 forms of learning-N.docjdinfo444
 
An efficient use of temporal difference technique in Computer Game Learning
An efficient use of temporal difference technique in Computer Game LearningAn efficient use of temporal difference technique in Computer Game Learning
An efficient use of temporal difference technique in Computer Game LearningPrabhu Kumar
 
reinforcement-learning-141009013546-conversion-gate02.pptx
reinforcement-learning-141009013546-conversion-gate02.pptxreinforcement-learning-141009013546-conversion-gate02.pptx
reinforcement-learning-141009013546-conversion-gate02.pptxMohibKhan79
 
Reinforcement Learning.ppt
Reinforcement Learning.pptReinforcement Learning.ppt
Reinforcement Learning.pptPOOJASHREEC1
 
Reinforcement learning in Machine learning
 Reinforcement learning in Machine learning Reinforcement learning in Machine learning
Reinforcement learning in Machine learningMegha Sharma
 
Reinforcement Learning
Reinforcement LearningReinforcement Learning
Reinforcement LearningSVijaylakshmi
 
Reinforcement Learning Guide For Beginners
Reinforcement Learning Guide For BeginnersReinforcement Learning Guide For Beginners
Reinforcement Learning Guide For Beginnersgokulprasath06
 
Fba and fa powerpoint
Fba and fa powerpointFba and fa powerpoint
Fba and fa powerpointjmcullenbsu
 
semi supervised Learning and Reinforcement learning (1).pptx
 semi supervised Learning and Reinforcement learning (1).pptx semi supervised Learning and Reinforcement learning (1).pptx
semi supervised Learning and Reinforcement learning (1).pptxDr.Shweta
 
ch.3 learning. for organisation developmentppt
ch.3 learning. for organisation developmentpptch.3 learning. for organisation developmentppt
ch.3 learning. for organisation developmentpptAyushsharma131736
 
Occupational Therapy and Reinforcement (part 3)
Occupational Therapy and Reinforcement (part 3)Occupational Therapy and Reinforcement (part 3)
Occupational Therapy and Reinforcement (part 3)Stephan Van Breenen
 

Similar to CS3013 -MACHINE LEARNING.pptx (20)

reiniforcement learning.ppt
reiniforcement learning.pptreiniforcement learning.ppt
reiniforcement learning.ppt
 
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptxRL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
RL_Dr.SNR Final ppt for Presentation 28.05.2021.pptx
 
A Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement LearningA Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement Learning
 
YijueRL.ppt
YijueRL.pptYijueRL.ppt
YijueRL.ppt
 
RL_online _presentation_1.ppt
RL_online _presentation_1.pptRL_online _presentation_1.ppt
RL_online _presentation_1.ppt
 
Q_Learning.ppt
Q_Learning.pptQ_Learning.ppt
Q_Learning.ppt
 
RL.ppt
RL.pptRL.ppt
RL.ppt
 
23 forms of learning-N.doc
23 forms of learning-N.doc23 forms of learning-N.doc
23 forms of learning-N.doc
 
An efficient use of temporal difference technique in Computer Game Learning
An efficient use of temporal difference technique in Computer Game LearningAn efficient use of temporal difference technique in Computer Game Learning
An efficient use of temporal difference technique in Computer Game Learning
 
Reinforcement learning
Reinforcement learningReinforcement learning
Reinforcement learning
 
reinforcement-learning-141009013546-conversion-gate02.pptx
reinforcement-learning-141009013546-conversion-gate02.pptxreinforcement-learning-141009013546-conversion-gate02.pptx
reinforcement-learning-141009013546-conversion-gate02.pptx
 
Reinforcement Learning.ppt
Reinforcement Learning.pptReinforcement Learning.ppt
Reinforcement Learning.ppt
 
Reinforcement learning in Machine learning
 Reinforcement learning in Machine learning Reinforcement learning in Machine learning
Reinforcement learning in Machine learning
 
Reinforcement Learning
Reinforcement LearningReinforcement Learning
Reinforcement Learning
 
Reinforcement Learning Guide For Beginners
Reinforcement Learning Guide For BeginnersReinforcement Learning Guide For Beginners
Reinforcement Learning Guide For Beginners
 
Fba and fa powerpoint
Fba and fa powerpointFba and fa powerpoint
Fba and fa powerpoint
 
semi supervised Learning and Reinforcement learning (1).pptx
 semi supervised Learning and Reinforcement learning (1).pptx semi supervised Learning and Reinforcement learning (1).pptx
semi supervised Learning and Reinforcement learning (1).pptx
 
ch.3 learning. for organisation developmentppt
ch.3 learning. for organisation developmentpptch.3 learning. for organisation developmentppt
ch.3 learning. for organisation developmentppt
 
Unit 2
Unit 2Unit 2
Unit 2
 
Occupational Therapy and Reinforcement (part 3)
Occupational Therapy and Reinforcement (part 3)Occupational Therapy and Reinforcement (part 3)
Occupational Therapy and Reinforcement (part 3)
 

More from logesswarisrinivasan

More from logesswarisrinivasan (6)

unit 4 for trees data structure notes it is
unit 4 for trees data structure notes it isunit 4 for trees data structure notes it is
unit 4 for trees data structure notes it is
 
8139350.ppt
8139350.ppt8139350.ppt
8139350.ppt
 
8486477.ppt
8486477.ppt8486477.ppt
8486477.ppt
 
21GE301-UHV-Unit4.pdf
21GE301-UHV-Unit4.pdf21GE301-UHV-Unit4.pdf
21GE301-UHV-Unit4.pdf
 
05 (1).pptx
05 (1).pptx05 (1).pptx
05 (1).pptx
 
Unit 2 [autosaved]
Unit 2 [autosaved]Unit 2 [autosaved]
Unit 2 [autosaved]
 

Recently uploaded

Delhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service Cuttack
VIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service CuttackVIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service Cuttack
VIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service CuttackSuhani Kapoor
 
Internshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University CertificateInternshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University CertificateSoham Mondal
 
CFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector ExperienceCFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector ExperienceSanjay Bokadia
 
内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士
内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士
内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士obuhobo
 
CALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual serviceanilsa9823
 
VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...Suhani Kapoor
 
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳anilsa9823
 
Call Girl in Low Price Delhi Punjabi Bagh 9711199012
Call Girl in Low Price Delhi Punjabi Bagh  9711199012Call Girl in Low Price Delhi Punjabi Bagh  9711199012
Call Girl in Low Price Delhi Punjabi Bagh 9711199012sapnasaifi408
 
Experience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdfExperience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdfSoham Mondal
 
The Impact of Socioeconomic Status on Education.pdf
The Impact of Socioeconomic Status on Education.pdfThe Impact of Socioeconomic Status on Education.pdf
The Impact of Socioeconomic Status on Education.pdftheknowledgereview1
 
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证obuhobo
 
Production Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbjProduction Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbjLewisJB
 
Dubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big Boody
Dubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big BoodyDubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big Boody
Dubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big Boodykojalkojal131
 
Vip Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...
Vip  Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...Vip  Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...
Vip Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...shivangimorya083
 
Delhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...
VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...
VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...Suhani Kapoor
 
Dubai Call Girls Naija O525547819 Call Girls In Dubai Home Made
Dubai Call Girls Naija O525547819 Call Girls In Dubai Home MadeDubai Call Girls Naija O525547819 Call Girls In Dubai Home Made
Dubai Call Girls Naija O525547819 Call Girls In Dubai Home Madekojalkojal131
 
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service CuttackLow Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service CuttackSuhani Kapoor
 
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...Suhani Kapoor
 

Recently uploaded (20)

Delhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls South Ex 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service Cuttack
VIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service CuttackVIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service Cuttack
VIP Call Girls in Cuttack Aarohi 8250192130 Independent Escort Service Cuttack
 
Internshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University CertificateInternshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University Certificate
 
CFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector ExperienceCFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector Experience
 
内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士
内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士
内布拉斯加大学林肯分校毕业证录取书( 退学 )学位证书硕士
 
CALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Gosainganj Lucknow best sexual service
 
VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Amravati Deepika 8250192130 Independent Escort Serv...
 
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳
 
Call Girl in Low Price Delhi Punjabi Bagh 9711199012
Call Girl in Low Price Delhi Punjabi Bagh  9711199012Call Girl in Low Price Delhi Punjabi Bagh  9711199012
Call Girl in Low Price Delhi Punjabi Bagh 9711199012
 
Experience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdfExperience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdf
 
The Impact of Socioeconomic Status on Education.pdf
The Impact of Socioeconomic Status on Education.pdfThe Impact of Socioeconomic Status on Education.pdf
The Impact of Socioeconomic Status on Education.pdf
 
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
 
Production Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbjProduction Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbj
 
Dubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big Boody
Dubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big BoodyDubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big Boody
Dubai Call Girls Demons O525547819 Call Girls IN DUbai Natural Big Boody
 
Vip Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...
Vip  Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...Vip  Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...
Vip Modals Call Girls (Delhi) Rohini 9711199171✔️ Full night Service for one...
 
Delhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Greater Noida 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...
VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...
VIP Call Girls in Jamshedpur Aarohi 8250192130 Independent Escort Service Jam...
 
Dubai Call Girls Naija O525547819 Call Girls In Dubai Home Made
Dubai Call Girls Naija O525547819 Call Girls In Dubai Home MadeDubai Call Girls Naija O525547819 Call Girls In Dubai Home Made
Dubai Call Girls Naija O525547819 Call Girls In Dubai Home Made
 
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service CuttackLow Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
 
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
 

CS3013 -MACHINE LEARNING.pptx

  • 1. CS3013 -MACHINE LEARNING UNIT 5 SEMINAR SUBJECT MENTOR PRESENTED BY Dr.T R .Saravanan Ms.J.SHERINE GLORY, Assistant Professor, PC2113003013020 SRMIST (KTR CAMPUS) RESEARCH SCHOLAR (PART TIME EXTERNAL)
  • 2. Reinforcement learning  Close to human learning.  Algorithm learns a policy of how to act in a given environment.  Every action has some impact in the environment, and the environment provides rewards that guides the learning algorithm.
  • 3. INTRODUCTION  Meaning of Reinforcement:Occurrence of an event, in the proper relation to a response, that tends to increase the probability that the response will occur again in the same situation.  Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment.  Reinforcement Learning is learning how to act in order to maximize a numerical reward.  Reinforcement learning is not a type of neural network, nor is it an alternative to neural networks. Rather, it is an orthogonal approach for Learning Machine.  Reinforcement learning emphasizes learning feedback that evaluates the learner's performance without providing standards of correctness in the form of behavioral targets. Example: Bicycle learning
  • 4. Element of reinforcement learning  Agent: Intelligent programs  Environment: External condition  Policy: Defines the agent’s behavior at a given time. A mapping from states to actions. Lookup tables or simple function
  • 5. Element of reinforcement learning  Reward function : Defines the goal in an RL problem. Policy is altered to achieve this goal. Value function: Reward function indicates what is good in an immediate sense while a value function specifies what is good in the long run. Value of a state is the total amount of reward an agent can expect to accumulate over the future, starting form that state. Model of the environment : Predict mimic behavior of environment. Used for planning & if Know current state and action then predict the resultant next state and next reward.
  • 6. Steps for Reinforcement Learning  1. The agent observes an input state  2. An action is determined by a decision making function (policy).  3. The action is performed.  4. The agent receives a scalar reward or reinforcement from the environment.  5. Information about the reward given for that state / action pair is recorded
  • 7. Features of Reinforcement Learning  Set of problems rather than a set of techniques.  Without specifying how the task is to be achieved.  “RL as a tool” point of view. RL is training by rewards and punishments. Train tool for the computer learning.  The learning agent’s point of view: RL is learning from trial and error with the world. Eg. how much reward I much get if I get this.
  • 8. Reinforcement Learning (Cont..)  Reinforcement Learning uses Evaluative Feedback  Purely Evaluative Feedback Indicates how good the action taken is. Not tell if it is the best or the worst action possible. Basis of methods for function optimization.  Purely Instructive Feedback Indicates the correct action to take, independently of the action actually taken. Eg: Supervised Learning
  • 9. Associative & Non Associative Tasks  Associative : Situation Dependent Mapping from situation to the actions that are best in that situation  Non Associative: Situation independent No need for associating different action with different situations. Learner either tries to find a single best action when the task is stationary, or tries to track the best action as it changes over time when the task is non stationary.
  • 10. Reinforcement Learning (Cont..)  Exploration and exploitation Greedy action: Action chosen with greatest estimated value. Greedy action: a case of Exploitation. Non greedy action: a case of Exploration, as it enables us to improve estimate the non-greedy action's value.  N-armed bandit Problem: We have to choose from n different options or actions. We will choose the one with maximum reward.