SlideShare a Scribd company logo
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
gathering gold standard annotations for relation extraction	

Crowd Truth
Harnessing Disagreement in
Crowdsourcing
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Gold Standard
Assumption
• typically in cognitive systems
• for each annotated instance there is a single right answer
• gold standard quality can be measured in inter-annotator
agreement
Let them disagree?
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Hypothesis
Annotator disagreement is not noise, but signal.
Not a problem to overcome but a source of information for machines
Artificially restricting humans does not help machines to learn.
They will learn better from diversity
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Position
disagreement is a sign of
intrinsic vagueness & ambiguity in human understanding
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Approach Principles
1.  Tolerate, capture & exploit disagreement
2.  Understand it by a space of possibilities (frequencies & similarities)
3.  Score the machine output based on where it falls in this space
4.  Adapt to new annotation tasks
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Relation Extraction
crowdsourcing gold standard data
Relations overlap in meaning
Sentences are vague and ambiguous
Experts have different interpretations
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Feeling the way the CHEST expands (PALPATION), can identify areas of
the lung that are full of fluid.
?PALPATIONIs CHEST related to
diagnose location associated
with
is_a otherpart_of
0 0 02 3 0 0 0 1 0 0 44 1
?CONJUNCTIVITISHYPERAEMIA related toIs
0 0 0 1 0 0 0 013 0 0 0 0 0
symptomcause
Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora)
of the eyes are symptoms common to all forms of CONJUNCTIVITIS.
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Harnessing Disagreement
• Sentence-relation score: core crowd truth metric for relation extraction, measured for each relation on
each sentence as the cosine of the unit vector for relation with sentence vector
• Sentence clarity: for each sentence - max relation score for that sentence. If all the workers selected the
same relation for a sentence, the max score is 1, indicating a clear sentence
• Relation similarity: pairwise conditional probability that if relation Ri is annotated in a sentence, Rj is as
well. Indicates how confusable the linguistic expression of two relations are
• Relation ambiguity: max relation similarity for a relation. If a relation is clear it has low score
• Relation clarity: max sentence-relation score for a relation over all sentences. If a relation has a high
clarity score, it means that it is at least possible to express the relation clearly
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
The Dark Side of Crowdsourcing
Disagreement
• spammers generate disagreement for the wrong reasons
• most spam detection requires gold standard
• Worker-sentence disagreement: the average of all the cosines between each
worker’s sentence vector and the full sentence vector (minus that worker).
Indicates how much a worker disagrees with the crowd on a sentence basis
• Worker-worker disagreement: a pairwise confusion matrix between workers
and the average agreement across the matrix for each worker. Indicates
whether there are consistently like-minded workers
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Questions?

More Related Content

Viewers also liked

Keynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV ContentKeynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV Content
Lora Aroyo
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013
Lora Aroyo
 
SealincMedia Accurator Demos
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator Demos
Lora Aroyo
 
AGORA Project: Final Review 2012
AGORA Project: Final Review 2012AGORA Project: Final Review 2012
AGORA Project: Final Review 2012
Lora Aroyo
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
Lora Aroyo
 
Europeana Tech 2011
Europeana Tech 2011Europeana Tech 2011
Europeana Tech 2011
Michiel Hildebrand
 
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the RijksmuseumStitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
Lora Aroyo
 
DIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital HumanitiesDIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital Humanities
Johan Oomen
 

Viewers also liked (8)

Keynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV ContentKeynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV Content
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013
 
SealincMedia Accurator Demos
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator Demos
 
AGORA Project: Final Review 2012
AGORA Project: Final Review 2012AGORA Project: Final Review 2012
AGORA Project: Final Review 2012
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
 
Europeana Tech 2011
Europeana Tech 2011Europeana Tech 2011
Europeana Tech 2011
 
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the RijksmuseumStitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
 
DIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital HumanitiesDIVE+: Explorative Search for Digital Humanities
DIVE+: Explorative Search for Digital Humanities
 

Similar to WebSci2013 Harnessing Disagreement in Crowdsourcing

Dartmouth 2018 writing assessment presentation Les Perelman
  Dartmouth 2018 writing assessment presentation Les Perelman  Dartmouth 2018 writing assessment presentation Les Perelman
Dartmouth 2018 writing assessment presentation Les Perelman
Les Perelman
 
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffnL6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
RwanEnan
 
School Essay Essays Format. Online assignment writing service.
School Essay Essays Format. Online assignment writing service.School Essay Essays Format. Online assignment writing service.
School Essay Essays Format. Online assignment writing service.
Carolina Abrams
 
Communities of Trust - from regulation to cooperation
Communities of Trust - from regulation to cooperationCommunities of Trust - from regulation to cooperation
Communities of Trust - from regulation to cooperation
Screamin Wrba
 
the relevance theory- pragmatics
the relevance theory- pragmaticsthe relevance theory- pragmatics
the relevance theory- pragmatics
kiran nazir
 
kiranppt-170704170919 (1).pdf
kiranppt-170704170919 (1).pdfkiranppt-170704170919 (1).pdf
kiranppt-170704170919 (1).pdf
SemaYILDIZHUSEYNOV1
 
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
Zachary Schendel
 
Sample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdfSample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdf
Andrea Santiago
 
Cbse Class 7 English Essay
Cbse Class 7 English EssayCbse Class 7 English Essay
Cbse Class 7 English Essay
Vanessa Henderson
 
Essay On Exam Stress. Online assignment writing service.
Essay On Exam Stress. Online assignment writing service.Essay On Exam Stress. Online assignment writing service.
Essay On Exam Stress. Online assignment writing service.
Amanda Anderson
 
Example Of Event Report Essay
Example Of Event Report EssayExample Of Event Report Essay
Example Of Event Report Essay
Emily Owusuansah
 
Recsys Presentation
Recsys PresentationRecsys Presentation
Recsys Presentation
Neal Lathia
 
Xmas Writing Paper
Xmas Writing PaperXmas Writing Paper
Xmas Writing Paper
Jennifer Perry
 
CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand AmbiguityCrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
Anca Dumitrache
 
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
Kimberly Gomez
 
Semantic Patterns for Sentiment Analysis of Twitter
Semantic Patterns for Sentiment Analysis of TwitterSemantic Patterns for Sentiment Analysis of Twitter
Semantic Patterns for Sentiment Analysis of Twitter
Knowledge Media Institute - The Open University
 
Dialogue based Meaning Negotiation
Dialogue based Meaning NegotiationDialogue based Meaning Negotiation
Dialogue based Meaning Negotiation
Terry Payne
 
Puppy Writing Stationary Writing, Puppies, Words
Puppy Writing Stationary Writing, Puppies, WordsPuppy Writing Stationary Writing, Puppies, Words
Puppy Writing Stationary Writing, Puppies, Words
Michelle Adams
 
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
Heather Wilkins
 
2000 Word Essay How Long Introduction. Online assignment writing service.
2000 Word Essay How Long Introduction. Online assignment writing service.2000 Word Essay How Long Introduction. Online assignment writing service.
2000 Word Essay How Long Introduction. Online assignment writing service.
Tammy Adams
 

Similar to WebSci2013 Harnessing Disagreement in Crowdsourcing (20)

Dartmouth 2018 writing assessment presentation Les Perelman
  Dartmouth 2018 writing assessment presentation Les Perelman  Dartmouth 2018 writing assessment presentation Les Perelman
Dartmouth 2018 writing assessment presentation Les Perelman
 
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffnL6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
 
School Essay Essays Format. Online assignment writing service.
School Essay Essays Format. Online assignment writing service.School Essay Essays Format. Online assignment writing service.
School Essay Essays Format. Online assignment writing service.
 
Communities of Trust - from regulation to cooperation
Communities of Trust - from regulation to cooperationCommunities of Trust - from regulation to cooperation
Communities of Trust - from regulation to cooperation
 
the relevance theory- pragmatics
the relevance theory- pragmaticsthe relevance theory- pragmatics
the relevance theory- pragmatics
 
kiranppt-170704170919 (1).pdf
kiranppt-170704170919 (1).pdfkiranppt-170704170919 (1).pdf
kiranppt-170704170919 (1).pdf
 
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
 
Sample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdfSample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdf
 
Cbse Class 7 English Essay
Cbse Class 7 English EssayCbse Class 7 English Essay
Cbse Class 7 English Essay
 
Essay On Exam Stress. Online assignment writing service.
Essay On Exam Stress. Online assignment writing service.Essay On Exam Stress. Online assignment writing service.
Essay On Exam Stress. Online assignment writing service.
 
Example Of Event Report Essay
Example Of Event Report EssayExample Of Event Report Essay
Example Of Event Report Essay
 
Recsys Presentation
Recsys PresentationRecsys Presentation
Recsys Presentation
 
Xmas Writing Paper
Xmas Writing PaperXmas Writing Paper
Xmas Writing Paper
 
CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand AmbiguityCrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
 
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16
 
Semantic Patterns for Sentiment Analysis of Twitter
Semantic Patterns for Sentiment Analysis of TwitterSemantic Patterns for Sentiment Analysis of Twitter
Semantic Patterns for Sentiment Analysis of Twitter
 
Dialogue based Meaning Negotiation
Dialogue based Meaning NegotiationDialogue based Meaning Negotiation
Dialogue based Meaning Negotiation
 
Puppy Writing Stationary Writing, Puppies, Words
Puppy Writing Stationary Writing, Puppies, WordsPuppy Writing Stationary Writing, Puppies, Words
Puppy Writing Stationary Writing, Puppies, Words
 
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.
 
2000 Word Essay How Long Introduction. Online assignment writing service.
2000 Word Essay How Long Introduction. Online assignment writing service.2000 Word Essay How Long Introduction. Online assignment writing service.
2000 Word Essay How Long Introduction. Online assignment writing service.
 

More from Lora Aroyo

NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfNeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
Lora Aroyo
 
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningCATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
Lora Aroyo
 
Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)
Lora Aroyo
 
Data excellence: Better data for better AI
Data excellence: Better data for better AIData excellence: Better data for better AI
Data excellence: Better data for better AI
Lora Aroyo
 
CHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumCHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH Symposium
Lora Aroyo
 
Semantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorSemantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP Demonstrator
Lora Aroyo
 
The Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataThe Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked Data
Lora Aroyo
 
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumKeynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Lora Aroyo
 
FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18
Lora Aroyo
 
Understanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsUnderstanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithms
Lora Aroyo
 
StorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesStorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & Machines
Lora Aroyo
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the Loop
Lora Aroyo
 
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoDigital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Lora Aroyo
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
Lora Aroyo
 
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Lora Aroyo
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
Lora Aroyo
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden University
Lora Aroyo
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
Lora Aroyo
 
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital AgeEuropeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Lora Aroyo
 
"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat
Lora Aroyo
 

More from Lora Aroyo (20)

NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfNeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
 
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningCATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
 
Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)
 
Data excellence: Better data for better AI
Data excellence: Better data for better AIData excellence: Better data for better AI
Data excellence: Better data for better AI
 
CHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumCHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH Symposium
 
Semantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorSemantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP Demonstrator
 
The Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataThe Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked Data
 
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumKeynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
 
FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18
 
Understanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsUnderstanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithms
 
StorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesStorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & Machines
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the Loop
 
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoDigital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora Aroyo
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
 
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden University
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
 
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital AgeEuropeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
 
"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat
 

Recently uploaded

Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Zilliz
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 

Recently uploaded (20)

Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 

WebSci2013 Harnessing Disagreement in Crowdsourcing

  • 1. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo gathering gold standard annotations for relation extraction Crowd Truth Harnessing Disagreement in Crowdsourcing
  • 2. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Gold Standard Assumption • typically in cognitive systems • for each annotated instance there is a single right answer • gold standard quality can be measured in inter-annotator agreement Let them disagree?
  • 3. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Hypothesis Annotator disagreement is not noise, but signal. Not a problem to overcome but a source of information for machines Artificially restricting humans does not help machines to learn. They will learn better from diversity
  • 4. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Position disagreement is a sign of intrinsic vagueness & ambiguity in human understanding
  • 5. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Approach Principles 1.  Tolerate, capture & exploit disagreement 2.  Understand it by a space of possibilities (frequencies & similarities) 3.  Score the machine output based on where it falls in this space 4.  Adapt to new annotation tasks
  • 6. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Relation Extraction crowdsourcing gold standard data Relations overlap in meaning Sentences are vague and ambiguous Experts have different interpretations
  • 7. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
  • 8. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Feeling the way the CHEST expands (PALPATION), can identify areas of the lung that are full of fluid. ?PALPATIONIs CHEST related to diagnose location associated with is_a otherpart_of 0 0 02 3 0 0 0 1 0 0 44 1 ?CONJUNCTIVITISHYPERAEMIA related toIs 0 0 0 1 0 0 0 013 0 0 0 0 0 symptomcause Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora) of the eyes are symptoms common to all forms of CONJUNCTIVITIS.
  • 9. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
  • 10. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Harnessing Disagreement • Sentence-relation score: core crowd truth metric for relation extraction, measured for each relation on each sentence as the cosine of the unit vector for relation with sentence vector • Sentence clarity: for each sentence - max relation score for that sentence. If all the workers selected the same relation for a sentence, the max score is 1, indicating a clear sentence • Relation similarity: pairwise conditional probability that if relation Ri is annotated in a sentence, Rj is as well. Indicates how confusable the linguistic expression of two relations are • Relation ambiguity: max relation similarity for a relation. If a relation is clear it has low score • Relation clarity: max sentence-relation score for a relation over all sentences. If a relation has a high clarity score, it means that it is at least possible to express the relation clearly
  • 11. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo The Dark Side of Crowdsourcing Disagreement • spammers generate disagreement for the wrong reasons • most spam detection requires gold standard • Worker-sentence disagreement: the average of all the cosines between each worker’s sentence vector and the full sentence vector (minus that worker). Indicates how much a worker disagrees with the crowd on a sentence basis • Worker-worker disagreement: a pairwise confusion matrix between workers and the average agreement across the matrix for each worker. Indicates whether there are consistently like-minded workers
  • 12. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Questions?