SlideShare a Scribd company logo
1 of 37
Download to read offline
Emotions Evoked by 

Common Words and Phrases: 

Using Mechanical Turk to Create an Emotion Lexicon
Saif Mohammad and Peter Turney
National Research Council Canada
Painting

The Destroyer
- Frank Frazetta

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
2
Sentence

(Phil, from the San Francisco Chronicle)


speaker/writer
Death threats over South Park episode


Event

When your cartoon can get you killed

Extremists

listener/reader

Trey Parker, Matt Stone

Participants

Participants

3
Our focus: words

evokes joy

When your cartoon can get you killed

evokes sadness

4
Motivation for emotion detection
 

Devising automatic dialogue systems that respond
appropriately to different emotional states of the user.
◦  customer relation models
◦  intelligent tutoring systems

 
 
 
 
 

◦  emotion-aware games
Tracking sentiment towards politicians, movies, products.
Determining emotional intelligence.
Assisting in writing e-mails, documents, and other text to
convey desired emotion (and avoiding misinterpretation).
Detecting how people use emotion-bearing-words to
persuade and coerce others
Deception detection 

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
5
Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
6
Base emotions
pre-frontal cortex
complex emotions amygdala
base emotions

 

Ekman: 6 basic emotions

 

◦  joy, sadness, fear, anger, surprise, disgust
Plutchik: 8 

 

◦  Ekmanʼs 6 + anticipation + trust
◦  4 pairs of antonymous emotions
More proposals by Parrot, Loyban, and others

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
7
Plutchikʼs wheel of emotions
 

Similar emotions are 

adjacent

 

Contrasting emotions are
diametrically opposite

 

The radius indicates 

intensity

 

In the white spaces 

are the primary dyads –
emotions that are 

combinations of the primary
emotions 

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
8
Amazonʼs Mechanical Turk
 

Requester
◦  breaks task into small independent units – HITs 

 

◦  specifies: 
  compensation for solving each HIT
  # of independent annotations required for each HIT
a.k.a. # of assignments/HIT
◦  uploads HITs
Turkers

 

◦  attempt as many HITs as they wish
Requester
◦  inspects each assignment: approves or rejects
Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
9
 

Inexpensive
◦  $1/hour is not uncommon

 

Convenient
◦  Web-based
◦  Scripts to upload HITs and review assignments

 

Takes care of certain ethics issues
◦  Anonymity
◦  No pressure on workers to solve HITs

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
10
 

Malicious annotations
◦  Random selection or garbage data entry
◦  Deliberate incorrect annotation

 

Inadvertent and infrequent errors
◦  Turker attempts HITs for unfamiliar words too

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
11
Emotion annotation: Challenges
 

Words used in different senses and in different contexts
can evoke different emotions.
High aspect ratio wings allow low speed flight.
The fight or flight response is crucial for survival.

 

How to convey the target sense 

to the annotator?
◦  definitions are long
◦  need to discourage annotation for unfamiliar words

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
12
Our solution
Directions: Attempt HIT only if you are familiar with the word.
Words in different senses may have different emotion
associations. Question 1 will guide you to the intended sense. 
Q1. Which word is closest in meaning (most related) to flight?

buying
avoidance
doubt
boredom
 

Near-synonym is taken from a thesaurus.

 

◦  Categories in a thesaurus act as coarse senses
Three distracters are chosen at random
Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
13
Emotion annotation: Challenges
 

Words used in different senses and in different contexts
can evoke different emotions.
High aspect ratio wings allow low speed flight.
The fight or flight response is crucial for survival.

 

How to convey the target sense 

to the annotator?
◦  definitions are long
◦  need to discourage annotation for unfamiliar words

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
14
 

If the word choice question is answered wrongly, then
the whole assignment is discarded 

(answers to all questions in the HIT by the Turker are
discarded) 

 

If an annotator gets more than 1 in 3 questions wrong,
then we assume they are not following instructions.
◦  We reject all their assignments.

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
15
 

Malicious annotations
◦  Random selection or garbage data entry
◦  Deliberate incorrect annotation

ss
 

Inadvertent and infrequent errors
◦  Turker attempts HITs for unfamiliar words too
Will detect it 75% of the time.

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
16
Target n-grams
 

 

Conditions:
◦  Most frequent terms in the Google n-gram corpus
◦  Must be in the thesaurus in just one or two categories
Most frequent monosemous n-grams in each of the
following categories:
◦  noun unigrams (200)
◦ 
◦ 
◦ 
◦ 

noun bigrams (200)
verb unigrams (200)
verb bigrams (200)
adjective unigrams (200)

◦  adjective bigrams (200)
◦  adverb unigrams (200)
◦  adverb bigrams (200)

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
17
 

Most frequent monosemous terms in the General
Inquirer (GI) that are:
◦  marked as positive (200)
◦  marked as negative (200)

 

Terms in WordNet Affect Lexicon (WAL) that have one
or two senses and are:
◦  marked as anger terms (107)
◦ 
◦ 
◦ 
◦ 

marked as disgust terms (25)
marked as fear terms (58)
marked as joy terms (109)
marked as sadness terms (86)

◦  marked as surprise terms (39)

2176 terms in all.

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
18
Questions: 

1. Which word is closest in meaning (most related) to flight?


buying

avoidance
doubt
boredom
2. How positive (good, praising) is flight 

(for example, nice and excellent are strongly positive): 


 
 flight is not positive 

 
 flight is weakly positive 

 
 flight is moderately positive 

 
 flight is strongly positive 
Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
19
Questions (continued): 

3. How negative (bad, criticizing) is flight 

(for example, poor and pathetic are strongly negative): 


 
 flight is not negative 

 
 flight is weakly negative 

 
 flight is moderately negative 

 
 flight is strongly negative 
4. How much does flight evoke/produce the emotion joy 

(for example, happy and fun may strongly evoke joy): 


 
 flight does not evoke joy 

 
 flight weakly evokes joy 

 
 flight moderately evokes joy 

 
 flight strongly evokes joy 
Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
20
 
 
 
 
 
 

 

2176 (HITs) x 5 (assignments per HIT) = 10,880
assignments
Annotators: 1012 
Turkers spent on average about 1 minute per HIT
Hourly wage was about $2.40 (about 4 cents per HIT)
Total cost: US $470 (cost per term: about 22 cents)
More than 95% of the assignments had the correct
answer for the word choice question.
◦  The rest were discarded.
2081 terms had 3 or more valid assignments
◦  on average 4.75 assignments per HIT
Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
21
Evocative and non-evocative
 

Practical NLP applications may care for only two levels of
intensity

 

Example: vampire-fear
No fear



weak fear


moderate fear


strong fear

0 votes



1 vote


2 votes 


2 votes

non-evocative
1 vote

evocative
4 votes
Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
22
70
60
50
40
30
20
10
0

23
% of WAL anger terms evocative of
different emotions as per the Turkers
100
90
80
70
60
50
40
30
20
10
0

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
24
What was missed?
baffled
covetousness
exacerbate
gravel
pesky
pestering

25
Anger and Joy!
adjourn
credit card
find out
gloat
spontaneously
surprised

26
Agreement at two intensity levels:

Majority class (m) = 3, 4, 5
m = three

m = four

m = five

70

% of terms

60
50
40
30
20
10
0

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
27
Conclusions
 

 

Regular folks can produce high quality emotion
annotations with proper guidelines and checks:
◦  Annotations match those in GI and WAL
◦  High degree of agreement


  Anticipation and trust are sources of more
disagreement
A large number of commonly used terms are evocative:
◦  About 61% of the terms are evocative 

(evoke one or the other base emotion)

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
28
Current work

evoked associated with joy

When your cartoon can get you killed

evoked associated with sadness

29
% of terms where all 5 agree
Evokes

Associated

90
80
70
60
50
40
30
20
10
0

30
Current and future work

 

Determining which terms have strong color associations
and if there is a correlation with emotions.
Determine how much near synonyms vary in emotional
content.
Empirically verify if complex emotions are indeed
combinations of basic emotions.
Create a much larger lexicon (40,000 terms, say).

 

◦  Make lexicon publicly available.
Use lexicon in applications.

 
 
 

31
Questions.
32
GI positives
90
80
70
60
50
40
30
20
10
0

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
33
GI negatives
90
80
70
60
50
40
30
20
10
0

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
34
100
90
80
70
60
50
40
30
20
10
0

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
35
 

Annotator time is precious
◦  Minimum reading, minimum writing
◦  Maximum information throughput

 

Requestor time is precious
◦  Automatic review and assimilation of annotations 
One solution: Multiple choice questions, with
examples instead of explanations.

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
36
Example question
How much does vampire evoke/produce the emotion fear? 

(For example, horror and scary may strongly evoke fear.)

vampire does not evoke fear 

vampire weakly evokes fear 

vampire moderately evokes fear 

vampire strongly evokes fear 

Emotions evoked by common words and phrases.
Saif Mohammad and Peter Turney.
37

More Related Content

Similar to mohammad turney-emotion workshop-slides

Emotions
EmotionsEmotions
Emotions
TJHSST
 
Kinesics in public speaking
Kinesics in public speakingKinesics in public speaking
Kinesics in public speaking
Archana R Singh
 
1. Describe physical security threats to the United States as a re.docx
1. Describe physical security threats to the United States as a re.docx1. Describe physical security threats to the United States as a re.docx
1. Describe physical security threats to the United States as a re.docx
jeremylockett77
 
Communication & Challenging Conversations PCMA 2014 Montreal
Communication & Challenging Conversations PCMA 2014 MontrealCommunication & Challenging Conversations PCMA 2014 Montreal
Communication & Challenging Conversations PCMA 2014 Montreal
McKinley Solutions
 
Effective communication
Effective communicationEffective communication
Effective communication
laipeng
 

Similar to mohammad turney-emotion workshop-slides (20)

Empathy Course
Empathy CourseEmpathy Course
Empathy Course
 
Empathy Course
Empathy CourseEmpathy Course
Empathy Course
 
Empathy Course
Empathy CourseEmpathy Course
Empathy Course
 
Persuasive writing
Persuasive writingPersuasive writing
Persuasive writing
 
Communication & Conflict Resolution
Communication & Conflict ResolutionCommunication & Conflict Resolution
Communication & Conflict Resolution
 
Emotions
EmotionsEmotions
Emotions
 
Nonviolent Communication - XP Days Germany 2014
Nonviolent Communication - XP Days Germany 2014Nonviolent Communication - XP Days Germany 2014
Nonviolent Communication - XP Days Germany 2014
 
Debate
DebateDebate
Debate
 
Session 3
Session 3Session 3
Session 3
 
Kinesics in public speaking
Kinesics in public speakingKinesics in public speaking
Kinesics in public speaking
 
1. Describe physical security threats to the United States as a re.docx
1. Describe physical security threats to the United States as a re.docx1. Describe physical security threats to the United States as a re.docx
1. Describe physical security threats to the United States as a re.docx
 
3 COMUNICATION SKILLS.pptx
3 COMUNICATION SKILLS.pptx3 COMUNICATION SKILLS.pptx
3 COMUNICATION SKILLS.pptx
 
Non Verbal Communication Types
Non Verbal Communication TypesNon Verbal Communication Types
Non Verbal Communication Types
 
Week 9 Conflict
Week 9 ConflictWeek 9 Conflict
Week 9 Conflict
 
Ejved adj seminar
Ejved adj seminarEjved adj seminar
Ejved adj seminar
 
Eq 3 11 10
Eq 3 11 10Eq 3 11 10
Eq 3 11 10
 
Communication & Challenging Conversations PCMA 2014 Montreal
Communication & Challenging Conversations PCMA 2014 MontrealCommunication & Challenging Conversations PCMA 2014 Montreal
Communication & Challenging Conversations PCMA 2014 Montreal
 
Frank Luntz
Frank LuntzFrank Luntz
Frank Luntz
 
Interpersonal Relationship Skills.ppt
Interpersonal Relationship Skills.pptInterpersonal Relationship Skills.ppt
Interpersonal Relationship Skills.ppt
 
Effective communication
Effective communicationEffective communication
Effective communication
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

mohammad turney-emotion workshop-slides

  • 1. Emotions Evoked by 
 Common Words and Phrases: 
 Using Mechanical Turk to Create an Emotion Lexicon Saif Mohammad and Peter Turney National Research Council Canada
  • 2. Painting The Destroyer - Frank Frazetta Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 2
  • 3. Sentence (Phil, from the San Francisco Chronicle)
 speaker/writer Death threats over South Park episode
 Event When your cartoon can get you killed Extremists listener/reader Trey Parker, Matt Stone Participants Participants 3
  • 4. Our focus: words evokes joy When your cartoon can get you killed evokes sadness 4
  • 5. Motivation for emotion detection   Devising automatic dialogue systems that respond appropriately to different emotional states of the user. ◦  customer relation models ◦  intelligent tutoring systems           ◦  emotion-aware games Tracking sentiment towards politicians, movies, products. Determining emotional intelligence. Assisting in writing e-mails, documents, and other text to convey desired emotion (and avoiding misinterpretation). Detecting how people use emotion-bearing-words to persuade and coerce others Deception detection Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 5
  • 6. Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 6
  • 7. Base emotions pre-frontal cortex complex emotions amygdala base emotions   Ekman: 6 basic emotions   ◦  joy, sadness, fear, anger, surprise, disgust Plutchik: 8   ◦  Ekmanʼs 6 + anticipation + trust ◦  4 pairs of antonymous emotions More proposals by Parrot, Loyban, and others Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 7
  • 8. Plutchikʼs wheel of emotions   Similar emotions are 
 adjacent   Contrasting emotions are diametrically opposite   The radius indicates 
 intensity   In the white spaces 
 are the primary dyads – emotions that are 
 combinations of the primary emotions Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 8
  • 9. Amazonʼs Mechanical Turk   Requester ◦  breaks task into small independent units – HITs   ◦  specifies:   compensation for solving each HIT   # of independent annotations required for each HIT a.k.a. # of assignments/HIT ◦  uploads HITs Turkers   ◦  attempt as many HITs as they wish Requester ◦  inspects each assignment: approves or rejects Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 9
  • 10.   Inexpensive ◦  $1/hour is not uncommon   Convenient ◦  Web-based ◦  Scripts to upload HITs and review assignments   Takes care of certain ethics issues ◦  Anonymity ◦  No pressure on workers to solve HITs Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 10
  • 11.   Malicious annotations ◦  Random selection or garbage data entry ◦  Deliberate incorrect annotation   Inadvertent and infrequent errors ◦  Turker attempts HITs for unfamiliar words too Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 11
  • 12. Emotion annotation: Challenges   Words used in different senses and in different contexts can evoke different emotions. High aspect ratio wings allow low speed flight. The fight or flight response is crucial for survival.   How to convey the target sense 
 to the annotator? ◦  definitions are long ◦  need to discourage annotation for unfamiliar words Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 12
  • 13. Our solution Directions: Attempt HIT only if you are familiar with the word. Words in different senses may have different emotion associations. Question 1 will guide you to the intended sense. Q1. Which word is closest in meaning (most related) to flight? buying avoidance doubt boredom   Near-synonym is taken from a thesaurus.   ◦  Categories in a thesaurus act as coarse senses Three distracters are chosen at random Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 13
  • 14. Emotion annotation: Challenges   Words used in different senses and in different contexts can evoke different emotions. High aspect ratio wings allow low speed flight. The fight or flight response is crucial for survival.   How to convey the target sense 
 to the annotator? ◦  definitions are long ◦  need to discourage annotation for unfamiliar words Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 14
  • 15.   If the word choice question is answered wrongly, then the whole assignment is discarded 
 (answers to all questions in the HIT by the Turker are discarded)   If an annotator gets more than 1 in 3 questions wrong, then we assume they are not following instructions. ◦  We reject all their assignments. Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 15
  • 16.   Malicious annotations ◦  Random selection or garbage data entry ◦  Deliberate incorrect annotation ss   Inadvertent and infrequent errors ◦  Turker attempts HITs for unfamiliar words too Will detect it 75% of the time. Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 16
  • 17. Target n-grams     Conditions: ◦  Most frequent terms in the Google n-gram corpus ◦  Must be in the thesaurus in just one or two categories Most frequent monosemous n-grams in each of the following categories: ◦  noun unigrams (200) ◦  ◦  ◦  ◦  noun bigrams (200) verb unigrams (200) verb bigrams (200) adjective unigrams (200) ◦  adjective bigrams (200) ◦  adverb unigrams (200) ◦  adverb bigrams (200) Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 17
  • 18.   Most frequent monosemous terms in the General Inquirer (GI) that are: ◦  marked as positive (200) ◦  marked as negative (200)   Terms in WordNet Affect Lexicon (WAL) that have one or two senses and are: ◦  marked as anger terms (107) ◦  ◦  ◦  ◦  marked as disgust terms (25) marked as fear terms (58) marked as joy terms (109) marked as sadness terms (86) ◦  marked as surprise terms (39) 2176 terms in all. Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 18
  • 19. Questions: 1. Which word is closest in meaning (most related) to flight?
 buying avoidance doubt boredom 2. How positive (good, praising) is flight 
 (for example, nice and excellent are strongly positive):  flight is not positive flight is weakly positive flight is moderately positive flight is strongly positive Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 19
  • 20. Questions (continued): 3. How negative (bad, criticizing) is flight 
 (for example, poor and pathetic are strongly negative):  flight is not negative flight is weakly negative flight is moderately negative flight is strongly negative 4. How much does flight evoke/produce the emotion joy 
 (for example, happy and fun may strongly evoke joy):  flight does not evoke joy flight weakly evokes joy flight moderately evokes joy flight strongly evokes joy Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 20
  • 21.               2176 (HITs) x 5 (assignments per HIT) = 10,880 assignments Annotators: 1012 Turkers spent on average about 1 minute per HIT Hourly wage was about $2.40 (about 4 cents per HIT) Total cost: US $470 (cost per term: about 22 cents) More than 95% of the assignments had the correct answer for the word choice question. ◦  The rest were discarded. 2081 terms had 3 or more valid assignments ◦  on average 4.75 assignments per HIT Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 21
  • 22. Evocative and non-evocative   Practical NLP applications may care for only two levels of intensity   Example: vampire-fear No fear weak fear moderate fear strong fear 0 votes 1 vote 2 votes 2 votes non-evocative 1 vote evocative 4 votes Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 22
  • 24. % of WAL anger terms evocative of different emotions as per the Turkers 100 90 80 70 60 50 40 30 20 10 0 Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 24
  • 26. Anger and Joy! adjourn credit card find out gloat spontaneously surprised 26
  • 27. Agreement at two intensity levels:
 Majority class (m) = 3, 4, 5 m = three m = four m = five 70 % of terms 60 50 40 30 20 10 0 Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 27
  • 28. Conclusions     Regular folks can produce high quality emotion annotations with proper guidelines and checks: ◦  Annotations match those in GI and WAL ◦  High degree of agreement   Anticipation and trust are sources of more disagreement A large number of commonly used terms are evocative: ◦  About 61% of the terms are evocative 
 (evoke one or the other base emotion) Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 28
  • 29. Current work evoked associated with joy When your cartoon can get you killed evoked associated with sadness 29
  • 30. % of terms where all 5 agree Evokes Associated 90 80 70 60 50 40 30 20 10 0 30
  • 31. Current and future work   Determining which terms have strong color associations and if there is a correlation with emotions. Determine how much near synonyms vary in emotional content. Empirically verify if complex emotions are indeed combinations of basic emotions. Create a much larger lexicon (40,000 terms, say).   ◦  Make lexicon publicly available. Use lexicon in applications.       31
  • 33. GI positives 90 80 70 60 50 40 30 20 10 0 Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 33
  • 34. GI negatives 90 80 70 60 50 40 30 20 10 0 Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 34
  • 35. 100 90 80 70 60 50 40 30 20 10 0 Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 35
  • 36.   Annotator time is precious ◦  Minimum reading, minimum writing ◦  Maximum information throughput   Requestor time is precious ◦  Automatic review and assimilation of annotations One solution: Multiple choice questions, with examples instead of explanations. Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 36
  • 37. Example question How much does vampire evoke/produce the emotion fear? 
 (For example, horror and scary may strongly evoke fear.) vampire does not evoke fear vampire weakly evokes fear vampire moderately evokes fear vampire strongly evokes fear Emotions evoked by common words and phrases. Saif Mohammad and Peter Turney. 37