Terms used in Reinforcement Learning
• Agent
• Environment
• Action
• State
• Reward
• Policy
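How these six terms fit together can be sketched as a small interaction loop. This is only an illustrative sketch: the coin-guessing environment, the class name `CoinFlipEnv`, and the fixed `policy` function are hypothetical and do not come from the slides.

```python
import random


class CoinFlipEnv:
    """Hypothetical toy environment: the agent guesses a biased coin flip."""

    def __init__(self, p_heads=0.8, seed=0):
        self.p_heads = p_heads
        self.rng = random.Random(seed)
        self.state = "start"  # the state is the last outcome observed

    def step(self, action):
        """Apply an action, return (next_state, reward)."""
        outcome = "heads" if self.rng.random() < self.p_heads else "tails"
        reward = 1.0 if action == outcome else 0.0
        self.state = outcome
        return self.state, reward


def policy(state):
    """A fixed policy: map every state to the action 'heads'."""
    return "heads"


def run_episode(env, policy, steps=100):
    """The agent repeatedly observes the state, acts, and collects reward."""
    total_reward = 0.0
    state = env.state
    for _ in range(steps):
        action = policy(state)            # agent chooses an action
        state, reward = env.step(action)  # environment responds
        total_reward += reward
    return total_reward
```

Calling `run_episode(CoinFlipEnv(), policy)` returns the total reward collected over 100 steps; a learning agent would adjust its policy to increase that total.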
Smart healthcare diagnosis
Reinforcement Learning in news recommendation
User preferences can change frequently, so recommending news to
users based only on reviews and likes can quickly become
obsolete. With reinforcement learning, the system can also
track the reader's return behavior.
Building such a system involves obtaining news features,
reader features, context features, and reader-news features.
News features include, but are not limited to, the content,
headline, and publisher. Reader features describe how the
reader interacts with the content, e.g., clicks and shares.
Context features include aspects such as the timing and
freshness of the news. A reward is then defined based on these
user behaviors.
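A minimal sketch of how the state and reward could be assembled from these features follows. The field names, the weights, and the way return behavior is scored are all assumptions made for illustration; the slides do not specify them.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Interaction:
    """Hypothetical record of one reader's response to a recommended article."""
    clicked: bool
    shared: bool
    days_until_return: Optional[float]  # None if the reader has not returned


def build_state(news: List[float], reader: List[float],
                context: List[float], reader_news: List[float]) -> List[float]:
    """Concatenate the four feature groups into one state vector."""
    return list(news) + list(reader) + list(context) + list(reader_news)


def reward(interaction: Interaction, click_weight: float = 1.0,
           share_weight: float = 2.0, return_weight: float = 1.0) -> float:
    """Score user behavior; the weights are illustrative assumptions."""
    r = 0.0
    if interaction.clicked:
        r += click_weight
    if interaction.shared:
        r += share_weight
    if interaction.days_until_return is not None:
        # A quicker return earns a larger bonus.
        r += return_weight / (1.0 + interaction.days_until_return)
    return r
```

For example, `reward(Interaction(clicked=True, shared=False, days_until_return=None))` scores only the click, while a reader who also returns the same day earns the full return bonus on top.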
Self-driving car
Approaches to Implement Reinforcement Learning
1. Value Based
2. Policy Based
3. Model Based
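As an illustration of the value-based approach, the sketch below shows a tabular Q-learning update, one common value-based method. The slides do not name a specific algorithm, so the choice of Q-learning, the function names, and the default values of `alpha` and `gamma` are assumptions.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions,
             alpha=0.5, gamma=0.9):
    """One Q-learning step: move Q(state, action) toward the bootstrapped target."""
    best_next = max(q[(next_state, a)] for a in actions)
    target = reward + gamma * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


def greedy_action(q, state, actions):
    """Pick the action with the highest estimated value in this state."""
    return max(actions, key=lambda a: q[(state, a)])


# Usage: unseen (state, action) pairs default to a value of 0.0.
q_table = defaultdict(float)
q_update(q_table, "s0", "left", 1.0, "s1", ["left", "right"])
```

A policy-based method would instead adjust the policy's parameters directly, and a model-based method would first learn or use a model of the environment's transitions.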
Types of Reinforcement Learning
1. Positive Reinforcement
2. Negative Reinforcement

20. reinforcement learning.pptx
