SlideShare a Scribd company logo
1 of 52
Download to read offline
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
gathering gold standard annotations for relation extraction	

Crowd Truth
Harnessing Disagreement in
Crowdsourcing
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
IBM Confidential
•  Open Domain Question-Answering Machine, that given
– Rich Natural Language Questions
– Over a Broad Domain of Knowledge
•  Won a 2-game Jeopardy match against the all-time winners
–  viewed by over 50,000,000
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Cognitive Computing
EXPANDS human cognition, makes the jobs we do easier,
like a cognitive prosthesis, especially when dealing with processing
massive data, or data that requires human interpretation	

LEARNS as you use it – most machine errors are easy for a
human to detect, and we can instrument usage of systems to
better understand the system and the problem it solves	

INTERACTS naturally. We need to bring machines closer to
their users, we have adapted ourselves enough to them, they should
understand natural language, spoken or written, be able to process
images and videos. These simple human problems are extremely
complex for machines, but are hallmarks of a new computing era.
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Watson MD
•  Adapt Watson to Medical QA
•  Mainly an NLP task
•  Cognitive computing systems need
human-annotated data for training, testing,
evaluation	

	

the human annotation task is one of semantic
interpretation	

Now answering
medical
questions!
Gadolinium agents are useful for patients with renal
impairment, but in patients with severe renal failure
requiring dialysis it presents a risk of nephrogenic
systemic fibrosis.
Mention detection: find the spans (begin, end) of relevant medical
terms (factors) in a passage.
Factor Typing: find the type of each mention
substance disorder
disorder
NER
disorder
treatment
NLP Tasks
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
NLP Tasks
Gadolinium agents are useful for patients with renal
impairment, but in patients with severe renal failure
requiring dialysis it presents a risk of nephrogenic
systemic fibrosis.
Mention detection: find the spans (begin, end) of relevant medical
terms (factors) in a passage.
Factor Typing: find the type of each mention
Factor (Entity) Identification: find the corresponding ids for a
mentioned factor in a knowledge-base
C0016911
C1408325
C0035078
C1619692
C0019004
NLP Tasks
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
NLP Tasks
Gadolinium agents are useful for patients with renal
impairment, but in patients with severe renal failure
requiring dialysis it presents a risk of nephrogenic
systemic fibrosis.
Mention detection: find the spans (begin, end) of relevant medical
terms (factors) in a passage.
Factor Typing: find the type of each mention
Factor (Entity) Identification: find the corresponding ids for a
mentioned factor in a knowledge-base
Relation detection: find relations that are expressed in a passage
between factors?
cause
treats
treats
contra-
indicates
NLP Tasks
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
NLP Tasks
Gadolinium agents are useful for patients with renal
impairment, but in patients with severe renal failure
requiring dialysis it presents a risk of nephrogenic
systemic fibrosis.
Mention detection: find the spans (begin, end) of relevant medical
terms (factors) in a passage.
Factor Typing: find the type of each mention
Factor (Entity) Identification: find the corresponding ids for a
mentioned factor in a knowledge-base
Relation detection: find relations that are expressed in a passage
between factors?
Coreference: Find the mentions in a sentence that refer to the same
factor.
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Gold Standard
Assumption
•  Cognitive systems need to be told what is right & what is wrong	

•  A gold standard or ground truth	

•  Performance is measured on test sets vetted by human experts
à never perfect, always improving against test data	

•  Historically, gold standards are created assuming that for each
annotated instance there is a single right answer
•  Gold standard quality is measured in inter-annotator
agreement à does not account for perspectives, for
reasonable alternative interpretations
but people don’t always agree…
Disagreement
Gadolinium agents are useful for patients with renal
impairment, but in patients with severe renal failure
requiring dialysis there is a risk of nephrogenic
systemic fibrosis.
cause
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Gadolinium agents are useful for patients with renal
impairment, but in patients with severe renal failure
requiring dialysis there is a risk of nephrogenic
systemic fibrosis.
side-effect The human annotation task is one
of semantic interpretation
Disagreement
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Why do people disagree?
Sentence
Relation Worker
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Key Question
How do we represent &
measure disagreement in a
way that it can be harnessed?
Why do people disagree?
Sign
Referent Observer
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Triangle of Reference
Position
maybe this disagreement is a signal and not noise?
can we harness it?
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Crowd Truth
Annotator disagreement is signal, not
noise.
It is indicative of the variation in
human semantic interpretation of
signs, and can indicate ambiguity,
vagueness, over-generality, etc.
http://www.freefoto.com/preview/01-47-44/Flock-of-Birds
Approach Principles
1. understand the range of disagreements by creating a
space of possibilities with frequencies & similarities
2. tolerate, capture & exploit disagreement
3. score machine output based on where it falls in this space
4. adaptable to new annotation tasks
Flickr: auroille
Crowd Watson
•  Crowdsourcing gold standard data for
•  Training Watson in medical domain, as well as for events extraction,
image annotations, video tagging and summarization
•  Crowdsourcing for Domain Adaptation
•  How to rapidly acquire knowledge for new domains
•  Platforms
•  CrowdFlower, Amazon Mechanical Turk
•  Crowdsourcing Games with a Purpose, e.g. Dr. Watson, Waisda?
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Relation Extraction
crowdsourcing gold standard data
Relations overlap in meaning
Sentences are vague and ambiguous
Experts have different interpretations
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
In distant supervision we take arguments that are known to
be related by a target relation in a knowledge base and we find
all sentences in a corpus that mention both arguments.
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Representation
Worker Vector
1	

 1	

 1
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Representation
Sentence Vector
1	

 1	

 1	

1	

 1	

1	

1	

 1	

1	

 1	

1	

 1	

1	

1	

1	

0	

 1	

 1	

 0	

 0	

 4	

 3	

 0	

 0	

 5	

 1	

 0
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Feeling the way the CHEST expands (PALPATION), can identify areas of
the lung that are full of fluid.
?PALPATIONIs CHEST related to
diagnose location associated
with
is_a otherpart_of
0 0 02 3 0 0 0 1 0 0 44 1
Disagreement for
Sentence Clarity
Unclear relationship between the two arguments
reflected in the disagreement
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
?CONJUNCTIVITISHYPERAEMIA related toIs
0 0 0 1 0 0 0 013 0 0 0 0 0
symptomcause
Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora)
of the eyes are symptoms common to all forms of CONJUNCTIVITIS.
Disagreement for
Sentence Clarity
Clearly expressed relation between the two
arguments reflected in the agreement
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Sentence-Relation Score
Measures how clearly a sentence
expresses a relation
0	

 1	

 1	

 0	

 0	

 4	

 3	

 0	

 0	

 5	

 1	

 0	

Unit vector for
relation R6	

Sentence
Vector	

Cosine = .55
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Worker Disagreement
Measured per worker
Worker-sentence disagreement
0	

 1	

 1	

 0	

 0	

 4	

 3	

 0	

 0	

 5	

 1	

 0	

Worker’s
sentence vector	

Sentence
Vector	

AVG (Cosine)
Crowd Truth Metrics
Relation Extraction
Three parts to understand human interpretations:
§  Sentence
•  How good is a sentence for relation extraction task?
§  Workers
•  How well does a worker understand the sentence?
§  Relations
•  Is the meaning of the relation clear?
•  How ambiguous/confusable is it?
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Crowd Truth Metrics
Based on the Triangle of Reference
Three parts to understand human interpretations:
§  Sign
•  How good is a sign for conveying information?
§  People
•  How well does a person understand the sign?
§  Ontology
•  Are the distinctions of the ontology clear?
•  How ambiguous/confusable are they?
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
The Dark Side of Crowdsourcing
Disagreement
• spammers generate disagreement for the wrong reasons
• most spam detection requires gold standard
• Worker-sentence disagreement: the average of all the cosines between
each worker’s sentence vector and the full sentence vector (minus that
worker). Indicates how much a worker disagrees with the crowd on a
sentence basis
• Worker-worker disagreement: a pairwise confusion matrix between workers
and the average agreement across the matrix for each worker. Indicates
whether there are consistently like-minded workers
Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Harnessing Disagreement
• Sentence-relation score: measured for each relation on each sentence as the cosine of
the unit vector for relation with sentence vector
• Sentence clarity: for each sentence - max relation score for that sentence. If all the
workers selected the same relation for a sentence, the max score is 1, indicating a
clear sentence
• Relation similarity: pairwise conditional probability that if relation Ri is annotated in a
sentence, then Rj is as well. Indicates how confusable linguistic expression of two
relations are
• Relation ambiguity: max relation similarity for a relation. If a relation is clear score is
low
• Relation clarity: max sentence-relation score for a relation over all sentences. If a
relation has a high clarity score, it means that it is at least possible to express the
relation clearly
• Worker Quality: avg. cosine of worker vector with sentence vector for all sentences the
worker annotated.
Disagreement metrics
•  Diverging opinions cluster around the most
plausible options.
•  Identify workers who systematically disagree
1.  With the opinion of the majority (worker-sentence disag)
o  Compare worker opinion with that of the majority
2.  With the rest of their co-workers (worker-worker disag)
o  Workers with the same opinion as worker W.
3.  + Avg. number of relations / sentence
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Task completion time
Task completion time
Task completion time
Spam in a channel
Conclusions
•  Crowd Truth can help us understand the
diversity of interpretations
•  with adequate representation & metrics
•  dispense with the “one correct answer” assumption
•  Disagreement metrics can be augmented by
content filters for better spam detection
•  explanations by workers can be useful
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
The Crew
•  Lora Aroyo (VU)
•  Chris Welty (IBM)
•  Guillermo Soberon (VU)
•  Hui Lin (IBM)
•  Anca Dumitrache (VU)
•  Oana Inel (VU)
•  Manfred Overmeen (IBM)
•  Robert-Jan Sips (IBM)
http://crowd-watson.nl
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Questions?
Accuracy pred. low quality (1)
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Accuracy pred. low quality (2)
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Spamming scenarios
Dev. Test
•  12 spammers / 110
workers
•  139 "spammed"
sentences out of 1302
(11%)
•  100% accuracy spam
detection
•  20 spammers / 93
workers
•  386 "spammed"
sentences out of 1291
(30%)
•  89% accuracy (10
spammers missed)
Can we do better?
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Data collected
•  Annotations
o  12 relations + OTH / NON
o  Behaviour with respect to the crowd
Disagreement
Filters
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
•  Annotations
o  12 relations + OTH / NON
o  Behaviour with respect to the crowd
•  Explanations
o  Selected Words (justify the choice)
o  Explanation (for OTHER or NONE)
o  Individual behaviour patterns.
Disagreement
Filters
Explanation
filters
Data collected
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Relation Extraction
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Explanations analysis
Four patterns in worker behaviour indicating
spam:
o  No Valid Words were used for the text
o  Using the same text for all the annotations
o  Using the same text for both "Selected words" and
"Explanation"
o  Bad understanding (not following) of the task
instructions:
§  Selecting "None" and "Other" in combination
with other relations
§  Including explanations when are not required.
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Spam patterns analysis
None / Other Rep. Response Rep. Text No Valid Words
Spam
Candidates
22 8 14 12
Overlap with
disagreement
18% 37% 36% 42%
30 unique workers were identified ONLY
by the Explanation filters as possible low quality
workers.
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Spam patterns analysis
None / Other Rep. Response Rep. Text No Valid Words
Spam
Candidates
22 8 14 12
Overlap with
disagreement
18% 37% 36% 42%
30 unique workers were identified ONLY
by the Explanation filters as possible low quality
workers.
Explanation Filters ⊄ Disagreement metrics
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
Results
•  Linear combination of Disagreement metrics
+ Explanation filters
o  "No Valid Words" and Avg. Num Relations / sent a
bit more weight than the rest
•  Results
o  95% accuracy and .88 F1 score
o  16 spammers out of 20
•  Previously, only with disagreement metrics:
o  88% Accuracy, .66 F1 score
o  10 spammers out of 20
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty

More Related Content

Viewers also liked

Lecture 7: Social Web Challenges (2012)
Lecture 7: Social Web Challenges (2012)Lecture 7: Social Web Challenges (2012)
Lecture 7: Social Web Challenges (2012)Lora Aroyo
 
PATCH 2014 Workshop: Personalized Access to Cultural Heritage
PATCH 2014 Workshop: Personalized Access to Cultural HeritagePATCH 2014 Workshop: Personalized Access to Cultural Heritage
PATCH 2014 Workshop: Personalized Access to Cultural HeritageLora Aroyo
 
Lecture 3: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 3: Human-Computer Interaction Course (2015) @VU University AmsterdamLecture 3: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 3: Human-Computer Interaction Course (2015) @VU University AmsterdamLora Aroyo
 
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...Lora Aroyo
 
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons with DIVE+
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons  with DIVE+DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons  with DIVE+
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons with DIVE+Lora Aroyo
 
CrowdTruth Games @NLeSc eHumanities day 2015
CrowdTruth Games @NLeSc eHumanities day 2015CrowdTruth Games @NLeSc eHumanities day 2015
CrowdTruth Games @NLeSc eHumanities day 2015Lora Aroyo
 
Lecture 2: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 2: Human-Computer Interaction Course (2015) @VU University AmsterdamLecture 2: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 2: Human-Computer Interaction Course (2015) @VU University AmsterdamLora Aroyo
 
"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to SnapchatLora Aroyo
 
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)Lora Aroyo
 

Viewers also liked (9)

Lecture 7: Social Web Challenges (2012)
Lecture 7: Social Web Challenges (2012)Lecture 7: Social Web Challenges (2012)
Lecture 7: Social Web Challenges (2012)
 
PATCH 2014 Workshop: Personalized Access to Cultural Heritage
PATCH 2014 Workshop: Personalized Access to Cultural HeritagePATCH 2014 Workshop: Personalized Access to Cultural Heritage
PATCH 2014 Workshop: Personalized Access to Cultural Heritage
 
Lecture 3: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 3: Human-Computer Interaction Course (2015) @VU University AmsterdamLecture 3: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 3: Human-Computer Interaction Course (2015) @VU University Amsterdam
 
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...
 
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons with DIVE+
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons  with DIVE+DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons  with DIVE+
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons with DIVE+
 
CrowdTruth Games @NLeSc eHumanities day 2015
CrowdTruth Games @NLeSc eHumanities day 2015CrowdTruth Games @NLeSc eHumanities day 2015
CrowdTruth Games @NLeSc eHumanities day 2015
 
Lecture 2: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 2: Human-Computer Interaction Course (2015) @VU University AmsterdamLecture 2: Human-Computer Interaction Course (2015) @VU University Amsterdam
Lecture 2: Human-Computer Interaction Course (2015) @VU University Amsterdam
 
"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat
 
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)
 

Similar to CCCT University of Amsterdam Seminars 2013: Crowdsourcing Session

CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand AmbiguityCrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand AmbiguityAnca Dumitrache
 
Transition From Childhood To Adulthood College Essay Examples
Transition From Childhood To Adulthood College Essay ExamplesTransition From Childhood To Adulthood College Essay Examples
Transition From Childhood To Adulthood College Essay ExamplesAnna May
 
Sample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdfSample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdfAndrea Santiago
 
Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...
Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...
Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...Holly Bell
 
Example Of Event Report Essay
Example Of Event Report EssayExample Of Event Report Essay
Example Of Event Report EssayEmily Owusuansah
 
Ap English Language And Composition Essay Scoring Rubric
Ap English Language And Composition Essay Scoring RubricAp English Language And Composition Essay Scoring Rubric
Ap English Language And Composition Essay Scoring RubricTracy Walker
 
Example Of Rogerian Argument Essay
Example Of Rogerian Argument EssayExample Of Rogerian Argument Essay
Example Of Rogerian Argument EssayAna Hall
 
PPT - Top Essay Writing Companies PowerPoint Presentat
PPT - Top Essay Writing Companies PowerPoint PresentatPPT - Top Essay Writing Companies PowerPoint Presentat
PPT - Top Essay Writing Companies PowerPoint PresentatAshley Davis
 
😊 Research Paper Analysis. Applied Behavior Analysis
😊 Research Paper Analysis. Applied Behavior Analysis😊 Research Paper Analysis. Applied Behavior Analysis
😊 Research Paper Analysis. Applied Behavior AnalysisCynthia Smith
 
Loss Of Innocence Essay
Loss Of Innocence EssayLoss Of Innocence Essay
Loss Of Innocence EssayMichelle Sykes
 
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...Sri Ambati
 
Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Andre Freitas
 
Architecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose TutorialArchitecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose TutorialWill Gallego
 
Quotes About Short Story Writing (47 Quotes)
Quotes About Short Story Writing (47 Quotes)Quotes About Short Story Writing (47 Quotes)
Quotes About Short Story Writing (47 Quotes)Susan Anderson
 
A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...
A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...
A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...Lean Kanban Central Europe
 
How To Do A Compare And Contrast Essay. How T
How To Do A Compare And Contrast Essay. How THow To Do A Compare And Contrast Essay. How T
How To Do A Compare And Contrast Essay. How TAlyssa Jefferson
 
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfNeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfLora Aroyo
 
Using artificial intelligence to enhance your customer experience
Using artificial intelligence to enhance your customer experienceUsing artificial intelligence to enhance your customer experience
Using artificial intelligence to enhance your customer experienceAmazon Web Services
 

Similar to CCCT University of Amsterdam Seminars 2013: Crowdsourcing Session (20)

CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand AmbiguityCrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity
 
Transition From Childhood To Adulthood College Essay Examples
Transition From Childhood To Adulthood College Essay ExamplesTransition From Childhood To Adulthood College Essay Examples
Transition From Childhood To Adulthood College Essay Examples
 
Sample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdfSample Self Evaluation Essay.pdf
Sample Self Evaluation Essay.pdf
 
BlueHat v18 || MSRC listens
BlueHat v18 || MSRC listensBlueHat v18 || MSRC listens
BlueHat v18 || MSRC listens
 
Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...
Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...
Merchant Of Venice Essay Topics. Essay: Merchant of Venice - GCSE Miscellaneo...
 
Example Of Event Report Essay
Example Of Event Report EssayExample Of Event Report Essay
Example Of Event Report Essay
 
Ap English Language And Composition Essay Scoring Rubric
Ap English Language And Composition Essay Scoring RubricAp English Language And Composition Essay Scoring Rubric
Ap English Language And Composition Essay Scoring Rubric
 
Example Of Rogerian Argument Essay
Example Of Rogerian Argument EssayExample Of Rogerian Argument Essay
Example Of Rogerian Argument Essay
 
PPT - Top Essay Writing Companies PowerPoint Presentat
PPT - Top Essay Writing Companies PowerPoint PresentatPPT - Top Essay Writing Companies PowerPoint Presentat
PPT - Top Essay Writing Companies PowerPoint Presentat
 
😊 Research Paper Analysis. Applied Behavior Analysis
😊 Research Paper Analysis. Applied Behavior Analysis😊 Research Paper Analysis. Applied Behavior Analysis
😊 Research Paper Analysis. Applied Behavior Analysis
 
Loss Of Innocence Essay
Loss Of Innocence EssayLoss Of Innocence Essay
Loss Of Innocence Essay
 
Linked Data and Ontology Tutorial (for RD-Connect)
Linked Data and Ontology Tutorial (for RD-Connect)Linked Data and Ontology Tutorial (for RD-Connect)
Linked Data and Ontology Tutorial (for RD-Connect)
 
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
 
Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)
 
Architecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose TutorialArchitecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose Tutorial
 
Quotes About Short Story Writing (47 Quotes)
Quotes About Short Story Writing (47 Quotes)Quotes About Short Story Writing (47 Quotes)
Quotes About Short Story Writing (47 Quotes)
 
A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...
A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...
A3 THINKING FOR SOLVING COMPLEX PROBLEMS AND EVOLUTIONARY CHANGE (ALEXEI ZHEG...
 
How To Do A Compare And Contrast Essay. How T
How To Do A Compare And Contrast Essay. How THow To Do A Compare And Contrast Essay. How T
How To Do A Compare And Contrast Essay. How T
 
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfNeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
 
Using artificial intelligence to enhance your customer experience
Using artificial intelligence to enhance your customer experienceUsing artificial intelligence to enhance your customer experience
Using artificial intelligence to enhance your customer experience
 

More from Lora Aroyo

CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningCATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningLora Aroyo
 
Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Lora Aroyo
 
Data excellence: Better data for better AI
Data excellence: Better data for better AIData excellence: Better data for better AI
Data excellence: Better data for better AILora Aroyo
 
CHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumCHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumLora Aroyo
 
Semantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorSemantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorLora Aroyo
 
The Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataThe Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataLora Aroyo
 
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumKeynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumLora Aroyo
 
FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18Lora Aroyo
 
Understanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsUnderstanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsLora Aroyo
 
StorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesStorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesLora Aroyo
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the LoopLora Aroyo
 
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoDigital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoLora Aroyo
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...Lora Aroyo
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneLora Aroyo
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityLora Aroyo
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchLora Aroyo
 
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital AgeEuropeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital AgeLora Aroyo
 
UMAP 2016 Opening Ceremony
UMAP 2016 Opening CeremonyUMAP 2016 Opening Ceremony
UMAP 2016 Opening CeremonyLora Aroyo
 
Crowdsourcing & Nichesourcing: Enriching Cultural Heritage with Experts & Cr...
Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...
Crowdsourcing & Nichesourcing: Enriching Cultural Heritage with Experts & Cr...Lora Aroyo
 
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the RijksmuseumStitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the RijksmuseumLora Aroyo
 

More from Lora Aroyo (20)

CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningCATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
 
Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)
 
Data excellence: Better data for better AI
Data excellence: Better data for better AIData excellence: Better data for better AI
Data excellence: Better data for better AI
 
CHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumCHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH Symposium
 
Semantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorSemantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP Demonstrator
 
The Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataThe Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked Data
 
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumKeynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
 
FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18
 
Understanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsUnderstanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithms
 
StorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesStorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & Machines
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the Loop
 
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoDigital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora Aroyo
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden University
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
 
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital AgeEuropeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
 
UMAP 2016 Opening Ceremony
UMAP 2016 Opening CeremonyUMAP 2016 Opening Ceremony
UMAP 2016 Opening Ceremony
 
Crowdsourcing & Nichesourcing: Enriching Cultural Heritage with Experts & Cr...
Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...
Crowdsourcing & Nichesourcing: Enriching Cultural Heritage with Experts & Cr...
 
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the RijksmuseumStitch by Stitch: Annotating Fashion at the Rijksmuseum
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

CCCT University of Amsterdam Seminars 2013: Crowdsourcing Session

  • 1. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty gathering gold standard annotations for relation extraction Crowd Truth Harnessing Disagreement in Crowdsourcing
  • 2. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty IBM Confidential •  Open Domain Question-Answering Machine, that given – Rich Natural Language Questions – Over a Broad Domain of Knowledge •  Won a 2-game Jeopardy match against the all-time winners –  viewed by over 50,000,000
  • 3. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty Cognitive Computing EXPANDS human cognition, makes the jobs we do easier, like a cognitive prosthesis, especially when dealing with processing massive data, or data that requires human interpretation LEARNS as you use it – most machine errors are easy for a human to detect, and we can instrument usage of systems to better understand the system and the problem it solves INTERACTS naturally. We need to bring machines closer to their users, we have adapted ourselves enough to them, they should understand natural language, spoken or written, be able to process images and videos. These simple human problems are extremely complex for machines, but are hallmarks of a new computing era.
  • 4. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty Watson MD •  Adapt Watson to Medical QA •  Mainly an NLP task •  Cognitive computing systems need human-annotated data for training, testing, evaluation the human annotation task is one of semantic interpretation Now answering medical questions!
  • 5. Gadolinium agents are useful for patients with renal impairment, but in patients with severe renal failure requiring dialysis it presents a risk of nephrogenic systemic fibrosis. Mention detection: find the spans (begin, end) of relevant medical terms (factors) in a passage. Factor Typing: find the type of each mention substance disorder disorder NER disorder treatment NLP Tasks Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 6. NLP Tasks Gadolinium agents are useful for patients with renal impairment, but in patients with severe renal failure requiring dialysis it presents a risk of nephrogenic systemic fibrosis. Mention detection: find the spans (begin, end) of relevant medical terms (factors) in a passage. Factor Typing: find the type of each mention Factor (Entity) Identification: find the corresponding ids for a mentioned factor in a knowledge-base C0016911 C1408325 C0035078 C1619692 C0019004 NLP Tasks Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 7. NLP Tasks Gadolinium agents are useful for patients with renal impairment, but in patients with severe renal failure requiring dialysis it presents a risk of nephrogenic systemic fibrosis. Mention detection: find the spans (begin, end) of relevant medical terms (factors) in a passage. Factor Typing: find the type of each mention Factor (Entity) Identification: find the corresponding ids for a mentioned factor in a knowledge-base Relation detection: find relations that are expressed in a passage between factors? cause treats treats contra- indicates NLP Tasks Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 8. NLP Tasks Gadolinium agents are useful for patients with renal impairment, but in patients with severe renal failure requiring dialysis it presents a risk of nephrogenic systemic fibrosis. Mention detection: find the spans (begin, end) of relevant medical terms (factors) in a passage. Factor Typing: find the type of each mention Factor (Entity) Identification: find the corresponding ids for a mentioned factor in a knowledge-base Relation detection: find relations that are expressed in a passage between factors? Coreference: Find the mentions in a sentence that refer to the same factor.
  • 9. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty Gold Standard Assumption •  Cognitive systems need to be told what is right & what is wrong •  A gold standard or ground truth •  Performance is measured on test sets vetted by human experts à never perfect, always improving against test data •  Historically, gold standards are created assuming that for each annotated instance there is a single right answer •  Gold standard quality is measured in inter-annotator agreement à does not account for perspectives, for reasonable alternative interpretations
  • 10. but people don’t always agree…
  • 11. Disagreement Gadolinium agents are useful for patients with renal impairment, but in patients with severe renal failure requiring dialysis there is a risk of nephrogenic systemic fibrosis. cause Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 12. Gadolinium agents are useful for patients with renal impairment, but in patients with severe renal failure requiring dialysis there is a risk of nephrogenic systemic fibrosis. side-effect The human annotation task is one of semantic interpretation Disagreement Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 13. Why do people disagree? Sentence Relation Worker Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 14. Key Question How do we represent & measure disagreement in a way that it can be harnessed?
  • 15. Why do people disagree? Sign Referent Observer Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty Triangle of Reference
  • 16. Position maybe this disagreement is a signal and not noise? can we harness it?
  • 17. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty Crowd Truth Annotator disagreement is signal, not noise. It is indicative of the variation in human semantic interpretation of signs, and can indicate ambiguity, vagueness, over-generality, etc. http://www.freefoto.com/preview/01-47-44/Flock-of-Birds
  • 18. Approach Principles 1. understand the range of disagreements by creating a space of possibilities with frequencies & similarities 2. tolerate, capture & exploit disagreement 3. score machine output based on where it falls in this space 4. adaptable to new annotation tasks Flickr: auroille
  • 19. Crowd Watson •  Crowdsourcing gold standard data for •  Training Watson in medical domain, as well as for events extraction, image annotations, video tagging and summarization •  Crowdsourcing for Domain Adaptation •  How to rapidly acquire knowledge for new domains •  Platforms •  CrowdFlower, Amazon Mechanical Turk •  Crowdsourcing Games with a Purpose, e.g. Dr. Watson, Waisda? Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 20. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Relation Extraction crowdsourcing gold standard data Relations overlap in meaning Sentences are vague and ambiguous Experts have different interpretations
  • 21. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo In distant supervision we take arguments that are known to be related by a target relation in a knowledge base and we find all sentences in a corpus that mention both arguments.
  • 22. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
  • 23. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Representation Worker Vector 1 1 1
  • 24. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Representation Sentence Vector 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 1 0 0 4 3 0 0 5 1 0
  • 25. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Feeling the way the CHEST expands (PALPATION), can identify areas of the lung that are full of fluid. ?PALPATIONIs CHEST related to diagnose location associated with is_a otherpart_of 0 0 02 3 0 0 0 1 0 0 44 1 Disagreement for Sentence Clarity Unclear relationship between the two arguments reflected in the disagreement
  • 26. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo ?CONJUNCTIVITISHYPERAEMIA related toIs 0 0 0 1 0 0 0 013 0 0 0 0 0 symptomcause Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora) of the eyes are symptoms common to all forms of CONJUNCTIVITIS. Disagreement for Sentence Clarity Clearly expressed relation between the two arguments reflected in the agreement
  • 27. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Sentence-Relation Score Measures how clearly a sentence expresses a relation 0 1 1 0 0 4 3 0 0 5 1 0 Unit vector for relation R6 Sentence Vector Cosine = .55
  • 28. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Worker Disagreement Measured per worker Worker-sentence disagreement 0 1 1 0 0 4 3 0 0 5 1 0 Worker’s sentence vector Sentence Vector AVG (Cosine)
  • 29. Crowd Truth Metrics Relation Extraction Three parts to understand human interpretations: §  Sentence •  How good is a sentence for relation extraction task? §  Workers •  How well does a worker understand the sentence? §  Relations •  Is the meaning of the relation clear? •  How ambiguous/confusable is it? Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 30. Crowd Truth Metrics Based on the Triangle of Reference Three parts to understand human interpretations: §  Sign •  How good is a sign for conveying information? §  People •  How well does a person understand the sign? §  Ontology •  Are the distinctions of the ontology clear? •  How ambiguous/confusable are they? Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 31. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo The Dark Side of Crowdsourcing Disagreement • spammers generate disagreement for the wrong reasons • most spam detection requires gold standard • Worker-sentence disagreement: the average of all the cosines between each worker’s sentence vector and the full sentence vector (minus that worker). Indicates how much a worker disagrees with the crowd on a sentence basis • Worker-worker disagreement: a pairwise confusion matrix between workers and the average agreement across the matrix for each worker. Indicates whether there are consistently like-minded workers
  • 32. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Harnessing Disagreement • Sentence-relation score: measured for each relation on each sentence as the cosine of the unit vector for relation with sentence vector • Sentence clarity: for each sentence - max relation score for that sentence. If all the workers selected the same relation for a sentence, the max score is 1, indicating a clear sentence • Relation similarity: pairwise conditional probability that if relation Ri is annotated in a sentence, then Rj is as well. Indicates how confusable linguistic expression of two relations are • Relation ambiguity: max relation similarity for a relation. If a relation is clear score is low • Relation clarity: max sentence-relation score for a relation over all sentences. If a relation has a high clarity score, it means that it is at least possible to express the relation clearly • Worker Quality: avg. cosine of worker vector with sentence vector for all sentences the worker annotated.
  • 33. Disagreement metrics •  Diverging opinions cluster around the most plausible options. •  Identify workers who systematically disagree 1.  With the opinion of the majority (worker-sentence disag) o  Compare worker opinion with that of the majority 2.  With the rest of their co-workers (worker-worker disag) o  Workers with the same opinion as worker W. 3.  + Avg. number of relations / sentence Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 37.
  • 38. Spam in a channel
  • 39. Conclusions •  Crowd Truth can help us understand the diversity of interpretations •  with adequate representation & metrics •  dispense with the “one correct answer” assumption •  Disagreement metrics can be augmented by content filters for better spam detection •  explanations by workers can be useful Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 40. The Crew •  Lora Aroyo (VU) •  Chris Welty (IBM) •  Guillermo Soberon (VU) •  Hui Lin (IBM) •  Anca Dumitrache (VU) •  Oana Inel (VU) •  Manfred Overmeen (IBM) •  Robert-Jan Sips (IBM)
  • 42. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty Questions?
  • 43. Accuracy pred. low quality (1) Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 44. Accuracy pred. low quality (2) Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 45. Spamming scenarios Dev. Test •  12 spammers / 110 workers •  139 "spammed" sentences out of 1302 (11%) •  100% accuracy spam detection •  20 spammers / 93 workers •  386 "spammed" sentences out of 1291 (30%) •  89% accuracy (10 spammers missed) Can we do better? Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 46. Data collected •  Annotations o  12 relations + OTH / NON o  Behaviour with respect to the crowd Disagreement Filters Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 47. •  Annotations o  12 relations + OTH / NON o  Behaviour with respect to the crowd •  Explanations o  Selected Words (justify the choice) o  Explanation (for OTHER or NONE) o  Individual behaviour patterns. Disagreement Filters Explanation filters Data collected Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 48. Relation Extraction Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 49. Explanations analysis Four patterns in worker behaviour indicating spam: o  No Valid Words were used for the text o  Using the same text for all the annotations o  Using the same text for both "Selected words" and "Explanation" o  Bad understanding (not following) of the task instructions: §  Selecting "None" and "Other" in combination with other relations §  Including explanations when are not required. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 50. Spam patterns analysis None / Other Rep. Response Rep. Text No Valid Words Spam Candidates 22 8 14 12 Overlap with disagreement 18% 37% 36% 42% 30 unique workers were identified ONLY by the Explanation filters as possible low quality workers. Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 51. Spam patterns analysis None / Other Rep. Response Rep. Text No Valid Words Spam Candidates 22 8 14 12 Overlap with disagreement 18% 37% 36% 42% 30 unique workers were identified ONLY by the Explanation filters as possible low quality workers. Explanation Filters ⊄ Disagreement metrics Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
  • 52. Results •  Linear combination of Disagreement metrics + Explanation filters o  "No Valid Words" and Avg. Num Relations / sent a bit more weight than the rest •  Results o  95% accuracy and .88 F1 score o  16 spammers out of 20 •  Previously, only with disagreement metrics: o  88% Accuracy, .66 F1 score o  10 spammers out of 20 Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty