SlideShare a Scribd company logo
1 of 16
Download to read offline
SemEval 2018 Task 4:
Character Identification
on Multiparty Dialogues
Jinho D. Choi @ Emory University
HenryY. Chen @ Snap Inc.
https://github.com/emorynlp/semeval-2018-task4
http://nlp.mathcs.emory.edu
Character Mining
2
Extract or infer various information about
individual characters in multiparty dialogues
text chat
audio chat
video chat
Character Mining
3
Transcripts from the TV show, Friends
A total of 10 seasons, 236 episodes, 3,107 scenes
Character Mining
4
Character Reading
Comprehension
Emotion
Detection
Personality
TestIdentification
https://github.com/emorynlp/character-mining
5
Ross I told mom and dad last night, they seemed to take it pretty well.
Monica
Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll
never have grandchildren, I'll never have grandchildren." was what? A wrong number?
Ross Sorry.
Joey
Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.

Can I tell you what the answer is?
Jack Monica
Ross Joey
Judy
Character Identification
6
Ross I told mom and dad last night, they seemed to take it pretty well.
Monica
Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll
never have grandchildren, I'll never have grandchildren." was what? A wrong number?
Ross Sorry.
Joey
Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.

Can I tell you what the answer is?
Entity Linking or Coreference Resolution?
? ?
Cross-document resolution is required
Judy & Jack appear in some other dialogue
Character Identification
Corpus
7
Episodes Scenes Speakers Utterances Sentences Tokens
Season 1 24 326 107 5,968 10,790 81,453
Season 2 24 293 107 5,747 9,337 81,910
Total 48 619 171 11,715 20,127 163,363
Two seasons of Friends
Pseudo-annotated then
manually corrected
Manually double-annotated
by crowd workers
Nominals referring to humans Characters in the show
Mentions Entities
Mention Annotation
8
Named Entity Pronoun Gazeteer Total
Season 1 946 7,549 811 9,306
Season 2 1,037 7,459 806 9,302
Total 1,983 15,008 1,617 18,608
A noun phrase is a mention if it is either
1. A PERSON named entity, or
2. A pronoun or possessive pronoun excluding“it” and “they”, or
3. One of the personal noun gazetteers that are 603 common and singular
personal nouns selected from Freebase and DBPedia (e.g., mom, girlfriend).
Gives about the F1 score of 95.93%
Entity Annotation
9
Primary Secondary Generic Collective General Other Total
Season 1 5,160 2,526 178 1,150 111 187 9,312
Season 2 5,385 2,340 120 1,238 44 169 9,296
Total 10,545 4,866 298 2,388 155 356 18,608
1. Primary: six main characters of the show.
2. Secondary: all the non-primary characters.
3. Generic: real entities whose identities are not defined 

(e.g.,That waitress is really cute, I am going to ask her out)
4. Collective: the plural use of the pronoun you, which cannot be deterministically
distinguished from the singular use
5. General: mentions used in reference to a general case rather than an specific entity 

(e.g.,The ideal guy you look for doesn’t exist)
6. Other: indicates all the other kinds of mentions
Used for this year’s
shared task
Evaluation Metrics
10
Labeling Accuracy
# of all mentions
# of correctly identified mentions
Average Macro F1
F1 score of the i’th character
i = 1
N
N
1
6 Main Characters + Others All Characters
Overall Scores
11
Over 90 teams registered at CodaLab
Only 4 teams made submissions :-(
Main + Others All Characters
Acc F1 Acc F1
AMORE-UPF 77.23 79.36 74.72 41.05
KNU-CI 85.10 86.00 69.49 16.98
Kampfpudding 73.36 73.51 59.45 37.37
Zuma-AR 46.85 44.68 33.06 16.09
Main Characters + Others
12
Ross Rachel Chand Joey Pheob Monica OTHER
AMORE-UPF 78.57 82.98 81.36 79.83 86.52 85.22 61.02
KNU-CI 85.86 92.49 84.94 79.67 88.09 91.16 79.79
Kampfpudding 73.48 70.67 79.25 63.38 79.79 73.35 74.61
Zuma-AR 38.72 43.05 43.04 36.10 42.90 46.43 51.78
Evaluation 18.98 13.96 9.80 9.51 9.02 8.97 29.77
Training 13.93 12.37 11.43 9.43 8.79 10.61 33.44
All Characters
13
Be Ca Ed Pa Ju MB Ri Sc Cr Fr Ja OT
AMORE 50.00 57.14 80.60 35.56 72.73 64.52 80.85 10.00 61.54 0.00 42.11 7.89
KNU-CI 38.46 62.79 73.02 15.38 42.55 0.00 66.67 38.46 0.00 18.18 16.00 0.00
Kamp. 31.86 33.33 68.85 33.33 60.32 50.00 61.22 10.00 0.00 0.00 23.53 0.00
Zuma 0.00 12.24 44.44 0.00 27.91 15.38 77.78 0.00 38.46 0.00 12.50 0.00
TST 3.46 1.73 1.56 1.44 1.32 0.86 0.86 0.78 0.74 0.70 0.62 2.92
TRN 1.41 1.46 1.06 0.71 1.15 0.60 1.83 0.21 0.13 0.51 0.43 13.51
Next Step
14
Character Identification v2.0
Four seasons of Friends are completely annotated
Plural mentions are annotated
Ihttps://github.com/emorynlp/character-identification
They Exist! Introducing Plural Mentions to
Coreference Resolution and Entity Linking
Ethan Zhou and Jinho D. Choi
COLING 2018
ThankYou
15
Henry Chen Ethan Zhou
Questions
16

More Related Content

What's hot (16)

Silent hill essay script 1
Silent hill essay script 1Silent hill essay script 1
Silent hill essay script 1
 
michael
michaelmichael
michael
 
Gameplay and ludology
Gameplay and ludologyGameplay and ludology
Gameplay and ludology
 
Data gathering on gaming results
Data gathering on gaming resultsData gathering on gaming results
Data gathering on gaming results
 
Script presintation
Script presintation Script presintation
Script presintation
 
Q script
Q scriptQ script
Q script
 
Shotlist
Shotlist Shotlist
Shotlist
 
3. research (1)
3. research (1)3. research (1)
3. research (1)
 
The Zodiac Ciphers: What do we know? And when can we stop trying to solve them?
The Zodiac Ciphers: What do we know?  And when can we stop trying to solve them?The Zodiac Ciphers: What do we know?  And when can we stop trying to solve them?
The Zodiac Ciphers: What do we know? And when can we stop trying to solve them?
 
Game review dead island riptide
Game review  dead island riptideGame review  dead island riptide
Game review dead island riptide
 
What is a_game
What is a_gameWhat is a_game
What is a_game
 
Mafia 2
Mafia 2Mafia 2
Mafia 2
 
Q cards
Q cardsQ cards
Q cards
 
3. research
3. research3. research
3. research
 
Graphical styles
Graphical stylesGraphical styles
Graphical styles
 
Why games?
Why games?Why games?
Why games?
 

More from Jinho Choi

More from Jinho Choi (20)

Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
 
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
 
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
 
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
 
The Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference ResolutionThe Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference Resolution
 
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
 
Abstract Meaning Representation
Abstract Meaning RepresentationAbstract Meaning Representation
Abstract Meaning Representation
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role Labeling
 
CKY Parsing
CKY ParsingCKY Parsing
CKY Parsing
 
CS329 - WordNet Similarities
CS329 - WordNet SimilaritiesCS329 - WordNet Similarities
CS329 - WordNet Similarities
 
CS329 - Lexical Relations
CS329 - Lexical RelationsCS329 - Lexical Relations
CS329 - Lexical Relations
 
Automatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementAutomatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue Management
 
Attention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingAttention is All You Need for AMR Parsing
Attention is All You Need for AMR Parsing
 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to Dialogue
 
Real-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingReal-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue Understanding
 
Topological Sort
Topological SortTopological Sort
Topological Sort
 
Tries - Put
Tries - PutTries - Put
Tries - Put
 
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseMulti-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
 
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsBuilding Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
 
How to make Emora talk about Sports Intelligently
How to make Emora talk about Sports IntelligentlyHow to make Emora talk about Sports Intelligently
How to make Emora talk about Sports Intelligently
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

SemEval 2018 Task 4: Character Identification on Multiparty Dialogues

  • 1. SemEval 2018 Task 4: Character Identification on Multiparty Dialogues Jinho D. Choi @ Emory University HenryY. Chen @ Snap Inc. https://github.com/emorynlp/semeval-2018-task4 http://nlp.mathcs.emory.edu
  • 2. Character Mining 2 Extract or infer various information about individual characters in multiparty dialogues text chat audio chat video chat
  • 3. Character Mining 3 Transcripts from the TV show, Friends A total of 10 seasons, 236 episodes, 3,107 scenes
  • 5. 5 Ross I told mom and dad last night, they seemed to take it pretty well. Monica Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll never have grandchildren, I'll never have grandchildren." was what? A wrong number? Ross Sorry. Joey Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.
 Can I tell you what the answer is? Jack Monica Ross Joey Judy Character Identification
  • 6. 6 Ross I told mom and dad last night, they seemed to take it pretty well. Monica Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll never have grandchildren, I'll never have grandchildren." was what? A wrong number? Ross Sorry. Joey Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.
 Can I tell you what the answer is? Entity Linking or Coreference Resolution? ? ? Cross-document resolution is required Judy & Jack appear in some other dialogue Character Identification
  • 7. Corpus 7 Episodes Scenes Speakers Utterances Sentences Tokens Season 1 24 326 107 5,968 10,790 81,453 Season 2 24 293 107 5,747 9,337 81,910 Total 48 619 171 11,715 20,127 163,363 Two seasons of Friends Pseudo-annotated then manually corrected Manually double-annotated by crowd workers Nominals referring to humans Characters in the show Mentions Entities
  • 8. Mention Annotation 8 Named Entity Pronoun Gazeteer Total Season 1 946 7,549 811 9,306 Season 2 1,037 7,459 806 9,302 Total 1,983 15,008 1,617 18,608 A noun phrase is a mention if it is either 1. A PERSON named entity, or 2. A pronoun or possessive pronoun excluding“it” and “they”, or 3. One of the personal noun gazetteers that are 603 common and singular personal nouns selected from Freebase and DBPedia (e.g., mom, girlfriend). Gives about the F1 score of 95.93%
  • 9. Entity Annotation 9 Primary Secondary Generic Collective General Other Total Season 1 5,160 2,526 178 1,150 111 187 9,312 Season 2 5,385 2,340 120 1,238 44 169 9,296 Total 10,545 4,866 298 2,388 155 356 18,608 1. Primary: six main characters of the show. 2. Secondary: all the non-primary characters. 3. Generic: real entities whose identities are not defined 
 (e.g.,That waitress is really cute, I am going to ask her out) 4. Collective: the plural use of the pronoun you, which cannot be deterministically distinguished from the singular use 5. General: mentions used in reference to a general case rather than an specific entity 
 (e.g.,The ideal guy you look for doesn’t exist) 6. Other: indicates all the other kinds of mentions Used for this year’s shared task
  • 10. Evaluation Metrics 10 Labeling Accuracy # of all mentions # of correctly identified mentions Average Macro F1 F1 score of the i’th character i = 1 N N 1 6 Main Characters + Others All Characters
  • 11. Overall Scores 11 Over 90 teams registered at CodaLab Only 4 teams made submissions :-( Main + Others All Characters Acc F1 Acc F1 AMORE-UPF 77.23 79.36 74.72 41.05 KNU-CI 85.10 86.00 69.49 16.98 Kampfpudding 73.36 73.51 59.45 37.37 Zuma-AR 46.85 44.68 33.06 16.09
  • 12. Main Characters + Others 12 Ross Rachel Chand Joey Pheob Monica OTHER AMORE-UPF 78.57 82.98 81.36 79.83 86.52 85.22 61.02 KNU-CI 85.86 92.49 84.94 79.67 88.09 91.16 79.79 Kampfpudding 73.48 70.67 79.25 63.38 79.79 73.35 74.61 Zuma-AR 38.72 43.05 43.04 36.10 42.90 46.43 51.78 Evaluation 18.98 13.96 9.80 9.51 9.02 8.97 29.77 Training 13.93 12.37 11.43 9.43 8.79 10.61 33.44
  • 13. All Characters 13 Be Ca Ed Pa Ju MB Ri Sc Cr Fr Ja OT AMORE 50.00 57.14 80.60 35.56 72.73 64.52 80.85 10.00 61.54 0.00 42.11 7.89 KNU-CI 38.46 62.79 73.02 15.38 42.55 0.00 66.67 38.46 0.00 18.18 16.00 0.00 Kamp. 31.86 33.33 68.85 33.33 60.32 50.00 61.22 10.00 0.00 0.00 23.53 0.00 Zuma 0.00 12.24 44.44 0.00 27.91 15.38 77.78 0.00 38.46 0.00 12.50 0.00 TST 3.46 1.73 1.56 1.44 1.32 0.86 0.86 0.78 0.74 0.70 0.62 2.92 TRN 1.41 1.46 1.06 0.71 1.15 0.60 1.83 0.21 0.13 0.51 0.43 13.51
  • 14. Next Step 14 Character Identification v2.0 Four seasons of Friends are completely annotated Plural mentions are annotated Ihttps://github.com/emorynlp/character-identification They Exist! Introducing Plural Mentions to Coreference Resolution and Entity Linking Ethan Zhou and Jinho D. Choi COLING 2018