Emotion-Driven Reinforcement Learning
Bob Marinier & John Laird
University of Michigan, Computer Science and Engineering
CogSci’08
Introduction
  • Interested in the functional benefits of emotion for a cognitive agent
      • Appraisal theories of emotion
      • PEACTIDM theory of cognitive control
  • Use emotion as a reward signal to a reinforcement learning agent
      • Demonstrates a functional benefit of emotion
      • Provides a theory of the origin of intrinsic reward
Outline
  • Background
  • Integration of emotion and cognition
  • Integration of emotion and reinforcement learning
  • Implementation in Soar
  • Learning task
  • Results
Appraisal Theories of Emotion
  • A situation is evaluated along a number of appraisal dimensions, many of which relate the situation to current goals
      • Novelty, goal relevance, goal conduciveness, expectedness, causal agency, etc.
  • Appraisals influence emotion
  • Emotion can then be coped with (via internal or external actions)
[Diagram labels: Situation, Goals, Appraisals, Emotion, Coping]
Appraisals to Emotions (Scherer 2001)
Cognitive Control: PEACTIDM (Newell 1990)
Unification of PEACTIDM and Appraisal Theories
[Figure: the PEACTIDM cycle (Perceive, Encode, Attend, Comprehend, Intend, Decode, Motor) annotated with the information passed between stages (Raw Perceptual Information, Stimulus Relevance, Stimulus chosen for processing, Current Situation Assessment, Action, Prediction, Motor Commands, Environmental Change) and the appraisals involved (Suddenness, Unpredictability, Goal Relevance, Intrinsic Pleasantness, Causal Agent/Motive, Discrepancy, Conduciveness, Control/Power, Outcome Probability)]
Distinction between emotion, mood, and feeling (Marinier & Laird 2007)
  • Emotion: result of appraisals; is about the current situation
  • Mood: “average” over recent emotions; provides historical context
  • Feeling: emotion “+” mood; what the agent actually perceives
Emotion, mood, and feeling
[Diagram labels: Cognition, Active Appraisals, Emotion, Mood, Pull, Decay, Combination Function, Feeling, Perceived Feeling]
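The emotion, mood, and feeling relationship described on these two slides can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the slides name a "pull" of mood toward emotion, a mood "decay", and a "combination function", but give no formulas. The constants PULL_RATE and DECAY_RATE, the clamped sum used as the combination function, and the class name AffectState are all assumptions made for the example.

```python
from dataclasses import dataclass, field

# Hypothetical constants; the slides give no numeric values.
PULL_RATE = 0.1    # fraction of the gap to the current emotion closed per step
DECAY_RATE = 0.01  # fraction of mood lost per step


def clamp(x, lo=-1.0, hi=1.0):
    """Keep a value inside [lo, hi]."""
    return max(lo, min(hi, x))


@dataclass
class AffectState:
    """Tracks a mood value for each appraisal dimension."""
    dims: tuple
    mood: dict = field(default_factory=dict)

    def __post_init__(self):
        # Mood starts neutral on every dimension.
        for d in self.dims:
            self.mood.setdefault(d, 0.0)

    def step(self, emotion):
        """Return the feeling for this step, then update mood."""
        feeling = {}
        for d in self.dims:
            e = emotion.get(d, 0.0)
            # Feeling: emotion "+" mood, here a clamped sum (an assumption).
            feeling[d] = clamp(e + self.mood[d])
            # Pull: mood moves toward the current emotion.
            pulled = self.mood[d] + PULL_RATE * (e - self.mood[d])
            # Decay: mood drifts back toward neutral.
            self.mood[d] = pulled * (1.0 - DECAY_RATE)
        return feeling


state = AffectState(dims=("conduciveness",))
first = state.step({"conduciveness": 0.5})   # mood is still neutral here
second = state.step({"conduciveness": 0.0})  # feeling now comes from mood alone
```

In the second step the emotion is neutral, yet the feeling is slightly positive because mood still carries the earlier emotion. This is the "historical context" role the slides assign to mood.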
Intrinsically Motivated Reinforcement Learning (Sutton & Barto 1998; Singh et al. 2004)
[Diagram labels: External Environment, Sensations, Actions, “Organism”, Internal Environment, Critic, Appraisal Process, States, Rewards, +/- Feeling Intensity, Decisions, Agent]
Reward = Intensity * Valence
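As a rough sketch of how the reward formula on this slide could feed a learner, the example below computes Reward = Intensity * Valence and applies it in a generic tabular Q-learning step. Only the reward formula comes from the slide. The Q-learning update is a standard textbook form used for illustration, not Soar's reinforcement learning mechanism, and ALPHA, GAMMA, the state and action names, and the treatment of valence as a signed number in [-1, 1] are assumptions.

```python
from collections import defaultdict

ALPHA = 0.1  # learning rate (assumed value)
GAMMA = 0.9  # discount factor (assumed value)


def feeling_reward(intensity, valence):
    """Reward = Intensity * Valence, as stated on the slide."""
    return intensity * valence


def q_update(q, state, action, reward, next_state, next_actions):
    """Apply one tabular Q-learning step and return the updated value."""
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    td_error = reward + GAMMA * best_next - q[(state, action)]
    q[(state, action)] += ALPHA * td_error
    return q[(state, action)]


# Hypothetical state and action names, for illustration only.
q = defaultdict(float)
r = feeling_reward(intensity=0.8, valence=-1.0)
updated = q_update(q, "s0", "move-north", r, "s1", ["move-south", "move-east"])
```

Because the reward is derived from the agent's feeling, it is available on every step where a feeling exists, rather than only when an external goal is reached.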
Extending Soar with Emotion (Marinier & Laird 2007)
[Architecture diagram labels: Symbolic Long-Term Memories (Procedural, Semantic, Episodic); Chunking, Reinforcement Learning, Semantic Learning, Episodic Learning; Appraisal Detector; Short-Term Memory (Situation, Goals); Decision Procedure; Visual Imagery; Perception, Action, Body]
Extending Soar with Emotion (Marinier & Laird 2007), continued
[Same architecture diagram with the emotion components added: Appraisals; Emotion (.5, .7, 0, -.4, .3, …); Mood (.7, -.2, .8, .3, .6, …); Feeling (.9, .6, .5, -.1, .8, …); Feelings in Short-Term Memory; +/- Intensity into Reinforcement Learning; regions marked Knowledge and Architecture]
Learning task
[Figure: task environment with Start and Goal marked]
Learning task: Encoding
Direction  Passable  On path  Progress
North      false     false    true
East       false     true     true
West       false     false    true
South      true      true     true
Learning task: Encoding & Appraisal
Direction  Intrinsic Pleasantness  Goal Relevance  Unpredictability
North      Low                     Low             High
East       Low                     High            High
West       Low                     Low             High
South      Neutral                 High            Low
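The step from the encoded features to the appraisals can be illustrated with a small sketch. The rules below are inferred only from the four example directions on these two slides and are one rule set consistent with them. The agent's actual appraisal rules are not given, so the function appraise, the Encoding type, and the rules themselves are assumptions. The Progress feature is carried along but unused, because it is true for every direction in the examples.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Encoding:
    """Encoded features of one direction, as listed on the Encoding slide."""
    passable: bool
    on_path: bool
    progress: bool


def appraise(enc):
    """Map an encoding to appraisals (rules inferred from the slide examples)."""
    return {
        "intrinsic_pleasantness": "Neutral" if enc.passable else "Low",
        "goal_relevance": "High" if enc.on_path else "Low",
        "unpredictability": "Low" if enc.passable else "High",
    }


# The four encodings shown on the Encoding slide.
directions = {
    "North": Encoding(passable=False, on_path=False, progress=True),
    "East": Encoding(passable=False, on_path=True, progress=True),
    "West": Encoding(passable=False, on_path=False, progress=True),
    "South": Encoding(passable=True, on_path=True, progress=True),
}

appraisals = {name: appraise(enc) for name, enc in directions.items()}
```

Running this over the four directions reproduces the appraisal values listed for North, East, West, and South.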
Learning task: Attending, Comprehending & Appraisal
South: Intrinsic Pleasantness: Neutral; Goal Relevance: High; Unpredictability: Low; Conduciveness: High; Control: High; …
Learning task: Tasking
Learning task: Tasking (Optimal Subtasks)
What is being learned?
  • When to Attend vs. Task
  • If Attending, what to Attend to
  • If Tasking, which subtask to create
  • When to Intend vs. Ignore
Learning Results
Results: With and without mood
Discussion
  • The agent learns both internal (tasking) and external (movement) actions
  • Emotion provides more frequent rewards, so the agent learns faster than with standard RL
  • Mood “fills in the gaps”, allowing even faster learning with less variability

Marinier Laird Cogsci 2008 Emotionrl Pres

  • 1. Emotion-Driven Reinforcement Learning Bob Marinier & John Laird University of Michigan, Computer Science and Engineering CogSci’08
  • 2. Introduction Interested in the functional benefits of emotion for a cognitive agent Appraisal theories of emotion PEACTIDM theory of cognitive control Use emotion as a reward signal to a reinforcement learning agent Demonstrates a functional benefit of emotion Provides a theory of the origin of intrinsic reward 2
  • 3. Outline Background Integration of emotion and cognition Integration of emotion and reinforcement learning Implementation in Soar Learning task Results 3
  • 4. Appraisal Theories of Emotion A situation is evaluated along a number of appraisal dimensions, many of which relate the situation to current goals Novelty, goal relevance, goal conduciveness, expectedness, causal agency, etc. Appraisals influence emotion Emotion can then be coped with (via internal or external actions) Situation Goals Appraisals Coping Emotion 4
  • 5. Appraisals to Emotions (Scherer 2001) 5
  • 6. Cognitive Control: PEACTIDM (Newell 1990) 6
  • 7. Unification of PEACTIDM and Appraisal Theories 7 Perceive Raw Perceptual Information Environmental Change Encode Motor Suddenness Unpredictability Goal Relevance Intrinsic Pleasantness Stimulus Relevance Motor Commands Prediction Outcome Probability Attend Decode Causal Agent/Motive Discrepancy Conduciveness Control/Power Stimulus chosen for processing Action Comprehend Intend Current Situation Assessment
  • 8. Distinction between emotion, mood, and feeling(Marinier & Laird 2007) Emotion: Result of appraisals Is about the current situation Mood: “Average” over recent emotions Provides historical context Feeling: Emotion “+” Mood What agent actually perceives 8
  • 9. Emotion, mood, and feeling. (Diagram labels: Cognition, Active Appraisals, Perceived Feeling, Emotion, Feeling, Combination Function, Pull, Mood, Decay.)
  • 10. Intrinsically Motivated Reinforcement Learning (Sutton & Barto 1998; Singh et al. 2004). (Diagram labels: External Environment, Environment, Actions, Sensations, Critic, “Organism”, Internal Environment, States, Rewards, Appraisal Process, Agent, +/- Feeling Intensity, Decisions.) Reward = Intensity * Valence
  • 11. Extending Soar with Emotion (Marinier & Laird 2007). (Diagram labels: Episodic; Semantic; Symbolic Long-Term Memories; Procedural; Semantic Learning; Episodic Learning; Chunking; Reinforcement Learning; Appraisal Detector; Short-Term Memory; Situation, Goals; Decision Procedure; Visual Imagery; Perception; Action; Body.)
  • 12. Extending Soar with Emotion (Marinier & Laird 2007). (Diagram labels: Episodic; Semantic; Symbolic Long-Term Memories; Procedural; Semantic Learning; Episodic Learning; Chunking; Reinforcement Learning; +/-Intensity; Appraisal Detector; Feeling .9,.6,.5,-.1,.8,…; Short-Term Memory; Situation, Goals; Feelings; Decision Procedure; Appraisals; Visual Imagery; Emotion .5,.7,0,-.4,.3,…; Mood .7,-.2,.8,.3,.6,…; Perception; Action; Knowledge; Body; Architecture.)
  • 14. Learning task: Encoding. North: Passable: false; On path: false; Progress: true. East: Passable: false; On path: true; Progress: true. West: Passable: false; On path: false; Progress: true. South: Passable: true; On path: true; Progress: true.
  • 15. Learning task: Encoding & Appraisal. North: Intrinsic Pleasantness: Low; Goal Relevance: Low; Unpredictability: High. East: Intrinsic Pleasantness: Low; Goal Relevance: High; Unpredictability: High. West: Intrinsic Pleasantness: Low; Goal Relevance: Low; Unpredictability: High. South: Intrinsic Pleasantness: Neutral; Goal Relevance: High; Unpredictability: Low.
  • 16. Learning task: Attending, Comprehending & Appraisal. South: Intrinsic Pleasantness: Neutral; Goal Relevance: High; Unpredictability: Low; Conduciveness: High; Control: High; …
  • 18. Learning task: Tasking. Optimal Subtasks
  • 19. What is being learned? When to Attend vs. Task. If Attending, what to Attend to. If Tasking, which subtask to create. When to Intend vs. Ignore.
  • 21. Results: With and without mood
  • 22. Discussion. The agent learns both internal (tasking) and external (movement) actions. Emotion allows for more frequent rewards, so the agent learns faster than with standard RL. Mood “fills in the gaps”, allowing for even faster learning and less variability.
  • 23. Conclusion & Future Work. Demonstrated a computational model that integrates emotion and cognitive control. Confirmed that emotion can drive reinforcement learning. We have already successfully demonstrated similar learning in a more complex domain. Would like to explore multi-agent scenarios.
  • 24. Circumplex models. Emotions can be described in terms of intensity and valence, as in a circumplex model. (Diagram labels: HIGH INTENSITY, alert, tense, excited, nervous, elated, stressed, happy, upset, NEGATIVE VALENCE, POSITIVE VALENCE, sad, contented, depressed, serene, lethargic, relaxed, fatigued, calm, LOW INTENSITY.) Adapted from Feldman Barrett & Russell (1998)
  • 25. Computing Feeling from Emotion and Mood. Assumption: appraisal dimensions are independent. Limited range: inputs and outputs are in [0,1] or [-1,1]. Distinguishability: very different inputs should lead to very different outputs. Non-linear: linearity would violate limited range and distinguishability.
  • 26. Computing Feeling Intensity. Motivation: intensity gives a summary of how important (i.e., how good or bad) the situation is. Limited range: should map onto [0,1]. No dominant appraisal: no single value should drown out all the others; can’t just multiply values, because if any are 0, then intensity is 0. Realization principle: expected events should be less intense than unexpected events.
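The intensity constraints on slide 26 and the reward rule on slide 10 (Reward = Intensity * Valence) can be illustrated with a small sketch. The slides do not give the authors' actual intensity equation, so the function below is only one hypothetical form that meets the stated constraints: it averages appraisal magnitudes (so a single zero does not force the result to zero and no single value dominates) and scales the average by a surprise factor (so expected events come out less intense than unexpected ones). The names `feeling_intensity`, `reward`, `outcome_probability`, and `discrepancy`, and the choice of averaging, are assumptions made for illustration.

```python
def feeling_intensity(appraisals, outcome_probability, discrepancy):
    """Hypothetical intensity in [0, 1]; NOT the equation from the talk.

    appraisals: appraisal values, each assumed to lie in [-1, 1] or [0, 1].
    outcome_probability: how strongly the agent predicted the outcome, in [0, 1].
    discrepancy: how far the actual outcome was from the prediction, in [0, 1].
    """
    if not appraisals:
        return 0.0
    # Average the magnitudes instead of multiplying them, so one
    # zero-valued appraisal cannot drive the whole intensity to zero.
    magnitude = sum(abs(v) for v in appraisals) / len(appraisals)
    # Surprise is high when a confident prediction fails, or when an
    # outcome the agent considered unlikely matches anyway; it is low
    # when a confident prediction is met (realization principle).
    surprise = ((1.0 - outcome_probability) * (1.0 - discrepancy)
                + outcome_probability * discrepancy)
    return magnitude * surprise


def reward(intensity, valence):
    """Reward = Intensity * Valence, as stated on slide 10.

    valence is assumed to lie in [-1, 1].
    """
    return intensity * valence
```

Because the surprise factor is a weighted mix of `discrepancy` and `1 - discrepancy`, it stays in [0, 1], so the product stays in the limited range the slide asks for as long as the appraisal values do.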

Editor's Notes

  1. Be careful about how to say the agent generates appraisal values
  2. Say prediction is our extension
  3. A cognitive architecture is a set of task-independent mechanisms that interact to give rise to behavior.
  4. In this environment, the agent’s sensing is limited: it can only see the cells immediately adjacent to it in the four cardinal directions. The agent has a sensor that tells it its Manhattan distance to the goal. However, the agent has no knowledge as to the effects of its actions, and thus cannot evaluate possible actions relative to the goal until it has actually performed them. Even then, it cannot always blindly move closer to the goal because given the shape of the maze, it must sometimes increase its Manhattan distance to the goal in order to make progress in the maze.
  5. Mention relaxation and direction
  6. 15 episodes; 50 trials; cutoff at 10k dcs; median
  7. 1st and 3rd quartiles shown. Reach optimality at the same time, but mood is less variable.
  8. This is an extension of previous work. These constraints define a set of equations. This is one possible equation, improving on previous work, that seems to work well for our current models.
  9. This is an extension of previous work. Unifies intensity for all feelings in one equation (others use different equations for each “kind” of feeling). Again, these constraints define a set of possible functions, of which this is one that seems to work well for us.
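The limited sensing described in note 4 can be sketched as follows. This is a minimal illustration, not the authors' Soar implementation: the grid representation (a set of wall coordinates, with y increasing to the south), the names `DIRECTIONS`, `manhattan_distance`, and `sense`, and the shape of the returned dictionary are all assumptions.

```python
# Hypothetical grid convention: (x, y) cells, y increases toward the south.
DIRECTIONS = {
    "north": (0, -1),
    "east": (1, 0),
    "south": (0, 1),
    "west": (-1, 0),
}


def manhattan_distance(cell, goal):
    """Sum of horizontal and vertical offsets between two cells."""
    return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])


def sense(walls, position, goal):
    """Return only what the agent in note 4 is said to perceive:
    whether each of the four adjacent cells is passable, plus the
    Manhattan distance from the current cell to the goal.

    walls: set of (x, y) cells the agent cannot enter (assumed to
    include the maze boundary).
    """
    x, y = position
    passable = {
        name: (x + dx, y + dy) not in walls
        for name, (dx, dy) in DIRECTIONS.items()
    }
    return {
        "passable": passable,
        "distance_to_goal": manhattan_distance(position, goal),
    }
```

With a wall directly between the agent and the goal, for example `walls = {(1, 0)}`, `position = (0, 0)`, and `goal = (2, 0)`, the only way forward is a detour such as stepping south, which raises the Manhattan distance from 2 to 3. That is the situation note 4 describes, where the agent cannot simply move closer to the goal on every step.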