SlideShare a Scribd company logo
CrowdED: Guideline for
Optimal Crowdsourcing
Amrapali Zaveri, Pedro Hernandez Serrano, Manisha
Desai, Michel Dumontier
HumL@WWW2018 @AmrapaliZ 24 April, 20181
Crowdsourcing Tasks
❖ Tasks based on human skills
not yet replicable by machines
❖ Highly parallelizable tasks
❖ Every human (worker) must
be provided with a monetary
reward for an answer
❖ Consolidated answers solve
scientific problems
!2
Crowdsourcing Design
❖ Gold standard
questions

❖ Master Workers

❖ Majority voting 

❖ Overall accuracy
!3
Crowdsourcing Use Case
Biomedical Metadata Quality Assessment*
!4
*MetaCrowd: Crowdsourcing Biomedical Metadata Quality Assessment. 
Amrapali Zaveri and Michel Dumontier. Bio-Ontologies 2017.
How CrowdED is too
crowded?
BUT
!5
Research Question
Can we a-priori estimate optimal
workers and tasks' assignment to obtain
maximum accuracy on all tasks?
!6
CrowdED
a two-staged statistical
Crowdsourcing
Experimental Design
!7
Related Studies
!8
Adaptive Model
Active Learning
KB Test Questions
Self Assessment
Cost-Time

&

Cost-Quality

Optimization
CrowdED
CrowdED offers a two-staged statistical model to estimate a-priori worker
and task assignment to achieve maximum accuracy.
Stage 1: 

• Train all
workers

• On a proportion
of tasks

• Identify best
workers &

• Hard tasks
2 Stages
!9
!
Stage 2:
• Assign best
workers to

• Hard tasks

• Remaining tasks

• Calculate
Overall
Accuracy
!
Stage 1
!
Stage 1
Easy Hard
Good Poor
Workers
Tasks
!10
Assign Tasks to Workers
!
Stage 1
Easy Hard
Good Poor
Workers
Tasks
Task
Label
Truth
1 1 hard_task age
1 2 hard_task age
1 3 hard_task age
1 4 easy_task age
1 5 easy_task age
Simulate
Odd no.
Proportion
of tasks to train
!11
Worker
Label
Truth
1 1 good_worker age
2 1 poor_worker age
3 1 good_worker age
4 1 good_worker age
5 1 poor_worker age
Workerview
Taskview
Calculate Worker Accuracy
& Task Difficulty
!12
Task
Label
Truth
Task
Difficulty
1 1 hard_task age 0.54
1 2 hard_task age 0.42
1 3 hard_task age 0.45
1 4 easy_task age 0.80
1 5 easy_task age 0.70
Worker
Label
Truth
Worker
Accuracy
1 1 good_worker age 0.75
2 1 poor_worker age 0.58
3 1 good_worker age 0.78
4 1 good_worker age 0.95
5 1 poor_worker age 0.54
Workerview
Taskview
Simulate Worker Answer
!13
Task
Label
Truth
Task
Difficulty
Worker
Answer
1 1 hard_task age 0.54 age
1 2 hard_task age 0.42 tissue
1 3 hard_task age 0.45 disease
1 4 easy_task age 0.80 age
1 5 easy_task age 0.70 age
Worker
Label
Truth
Worker
Accuracy
Worker
Answer
1 1 good_worker age 0.75 age
2 1 poor_worker age 0.58 age
3 1 good_worker age 0.78 age
4 1 good_worker age 0.95 tissue
5 1 poor_worker age 0.54 age
!13
Workerview
Taskview
Calculate Worker
Performance
Avg. proportion of times a

worker is in agreement with other 

workers for a given task 

vs. 

all tasks performed by the worker
Range

[0…1]
Threshold
identify
!
Easy Hard
Good Poor
!14
Easy Tasks
!15
Hard Tasks!
Worker
Label
Truth
Worker
Accuracy
Worker
Answer
1 1 good_worker age 0.75 age
2 1 poor_worker age 0.58 age
3 1 good_worker age 0.78 age
4 1 good_worker age 0.95 tissue
5 1 poor_worker age 0.54 age
Worker
Label
Truth
Worker
Accuracy
Worker
Answer
2 2 good_worker age 0.75 treatment
3 2 poor_worker age 0.58 disease
15 2 good_worker age 0.78 age
17 2 poor_worker age 0.95 tissue
20 2 poor_worker age 0.54
Taskview
Taskview
Stage 1: 

• Train all
workers

• On a proportion
of tasks

• Identify best
workers &

• Hard tasks
2 Stages
!16
!
Stage 2:
• Assign best
workers to

• Hard tasks & 

• Remaining tasks

• Calculate
Overall
Accuracy
!
Stage 2
!
Easy Hard
Good Poor
Stage 2
!17
Simulate Worker Answer
Stage 2
!
Hard
Good
simulate
Remaining 

Tasks
!18
Task
Label
Truth
Task
Difficulty
Worker
Answer
1 1 hard_task age 0.54 age
1 2 hard_task age 0.42 tissue
1 3 hard_task age 0.45 disease
1 4 easy_task age 0.80 age
1 5 easy_task age 0.70 age
Workerview
Merge Stage 1 and 2
& Assign Answers
!19
Worker
Label
Truth
Worker
Accuracy
Worker
Answer
1 1 good_worker age 0.75 age
2 1 poor_worker age 0.58 age
3 1 good_worker age 0.78 age
4 1 good_worker age 0.95 tissue
5 1 poor_worker age 0.54 age
Taskview
Answer = age
Assessing Design
Merged Dataset
calculate
!20
Overall Accuracy
avg. of all the tasks
which had consensus
Worker
Label
Truth
Worker
Accuracy
Worker
Answer
1 1 good_worker age 0.75 age
2 1 poor_worker age 0.58 age
3 1 good_worker age 0.78 age
4 1 good_worker age 0.95 tissue
5 1 poor_worker age 0.54 age
Taskview
Experimental Evaluation
• tasks = [60, 80, 100, 120, 140, 160, 180]

• workers = [20, 30, 40]

• answers key = ["liver", "blood", "lung", "brain",
“heart"]

• good workers = [0.1, 0.3, 0.5, 0.7, 0.9]

• hard tasks = [0.1, 0.3, 0.5, 0.7, 0.9]

• proportion of training tasks = [0.2, 0.3, 0.4, 0.5, 0.6]

• workers per task = [3, 5, 7, 9, 11]
13,125 combinations
!21
• Results support the
intuition that reduced
difficulty (10%) in tasks
result in higher
accuracy
!22
• calculating the
performance of the
workers in combination
with whether she was a
good worker (from the
beginning) ensures that
she is the best worker

• adopting the two-
staged algorithm
ensures that only the
best workers are chosen
to perform all the tasks
!23
Results
!24
CrowdED recommendation
• no. of workers should be 40-60% of the total number
of tasks

• train workers on 40-60% of the tasks in Stage 1

• set the number of workers per task to be either 3, 5 or
7 (fewer than 9)

• reduce the number of hard tasks 

• adopt the two-staged algorithm to identify the best
workers
!25
https://pedrohserrano.shinyapps.io/crowdapp/
!26
Conclusion & Future Work
• Two-staged statistical design for designing optimal crowdsourcing experiments 

• a-priori estimate optimal workers and tasks' assignment to obtain maximum
accuracy on all tasks

• Implemented in Python, open source, Jupyter notebook

• Future work

• Training the workers vs. not training

• Real-world experiments and comparison with baseline approaches

• Include budgetary constraints 

• Extend the interface to allow user to vary parameters and observe sensitivity the
design is to various assumptions
!27
@AmrapaliZamrapali.zaveri@maastrichtuniversity.nl
Thank You!
Questions?
Try it yourself

https://github.com/MaastrichtU-IDS/crowdED
Feedback welcome !
!28

More Related Content

Similar to CrowdED: Guideline for optimal Crowdsourcing Experimental Design

Software estimation is crap
Software estimation is crapSoftware estimation is crap
Software estimation is crap
Ian Garrison
 
Testing for everyone agile yorkshire
Testing for everyone agile yorkshireTesting for everyone agile yorkshire
Testing for everyone agile yorkshire
Ady Stokes
 
Dymystify Statistics Day 1.pdf
Dymystify Statistics Day 1.pdfDymystify Statistics Day 1.pdf
Dymystify Statistics Day 1.pdf
KristineIbaez2
 
Statistics-1 : The Basics of Statistics
Statistics-1 : The Basics of StatisticsStatistics-1 : The Basics of Statistics
Statistics-1 : The Basics of Statistics
Giridhar Chandrasekaran
 
Assignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docx
Assignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docxAssignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docx
Assignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docx
sherni1
 
“Job Quality, Labour Market Performance and Well-Being”_Parent thirion
“Job Quality, Labour Market Performance and Well-Being”_Parent thirion“Job Quality, Labour Market Performance and Well-Being”_Parent thirion
“Job Quality, Labour Market Performance and Well-Being”_Parent thirion
StatsCommunications
 
Optimising selection success through best practice
Optimising selection success through best practiceOptimising selection success through best practice
Optimising selection success through best practice
OPRA Psychology Group
 
Hendrix 2015 composite endpoints redacted
Hendrix 2015 composite endpoints redacted Hendrix 2015 composite endpoints redacted
Hendrix 2015 composite endpoints redacted
Alzforum
 
MLSEV Virtual. Evaluations
MLSEV Virtual. EvaluationsMLSEV Virtual. Evaluations
MLSEV Virtual. Evaluations
BigML, Inc
 
Employee productivity
Employee productivityEmployee productivity
Employee productivity
Self-employed
 
3 brooke-ifa 2012 29.5 libby ppt
3 brooke-ifa 2012 29.5  libby  ppt3 brooke-ifa 2012 29.5  libby  ppt
3 brooke-ifa 2012 29.5 libby pptifa2012
 
Employee Retension Capstone Project - Neeraj Bubby.pptx
Employee Retension Capstone Project - Neeraj Bubby.pptxEmployee Retension Capstone Project - Neeraj Bubby.pptx
Employee Retension Capstone Project - Neeraj Bubby.pptx
Boston Institute of Analytics
 
Principles of management
Principles of managementPrinciples of management
Principles of management
Sahil Jindal
 
171 Red beads The company as a system - Essential Lean 2014 01
171 Red beads   The company as a system - Essential Lean 2014 01171 Red beads   The company as a system - Essential Lean 2014 01
171 Red beads The company as a system - Essential Lean 2014 01
Francisco Pulgar-Vidal, MBA, Lean Six Sigma MBB
 
Performance Appraisal HRM
Performance Appraisal HRMPerformance Appraisal HRM
Performance Appraisal HRM
Aditya Gupta
 
Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...
Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...
Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...
NHS Improving Quality
 
Employee productivity and Role of HR
Employee productivity and Role of HREmployee productivity and Role of HR
Employee productivity and Role of HR
Self-employed
 
How Experienced Workers are Re-energizing the Workforce
How Experienced Workers  are Re-energizing the WorkforceHow Experienced Workers  are Re-energizing the Workforce
How Experienced Workers are Re-energizing the Workforce
AARP
 

Similar to CrowdED: Guideline for optimal Crowdsourcing Experimental Design (20)

Software estimation is crap
Software estimation is crapSoftware estimation is crap
Software estimation is crap
 
Testing for everyone agile yorkshire
Testing for everyone agile yorkshireTesting for everyone agile yorkshire
Testing for everyone agile yorkshire
 
Dymystify Statistics Day 1.pdf
Dymystify Statistics Day 1.pdfDymystify Statistics Day 1.pdf
Dymystify Statistics Day 1.pdf
 
Statistics-1 : The Basics of Statistics
Statistics-1 : The Basics of StatisticsStatistics-1 : The Basics of Statistics
Statistics-1 : The Basics of Statistics
 
Assignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docx
Assignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docxAssignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docx
Assignment 2 1 of 32Exercise 2-1 Testing Herzbergs Job .docx
 
“Job Quality, Labour Market Performance and Well-Being”_Parent thirion
“Job Quality, Labour Market Performance and Well-Being”_Parent thirion“Job Quality, Labour Market Performance and Well-Being”_Parent thirion
“Job Quality, Labour Market Performance and Well-Being”_Parent thirion
 
Optimising selection success through best practice
Optimising selection success through best practiceOptimising selection success through best practice
Optimising selection success through best practice
 
Hendrix 2015 composite endpoints redacted
Hendrix 2015 composite endpoints redacted Hendrix 2015 composite endpoints redacted
Hendrix 2015 composite endpoints redacted
 
Job Evaluation
Job EvaluationJob Evaluation
Job Evaluation
 
MLSEV Virtual. Evaluations
MLSEV Virtual. EvaluationsMLSEV Virtual. Evaluations
MLSEV Virtual. Evaluations
 
Employee productivity
Employee productivityEmployee productivity
Employee productivity
 
3 brooke-ifa 2012 29.5 libby ppt
3 brooke-ifa 2012 29.5  libby  ppt3 brooke-ifa 2012 29.5  libby  ppt
3 brooke-ifa 2012 29.5 libby ppt
 
Employee Retension Capstone Project - Neeraj Bubby.pptx
Employee Retension Capstone Project - Neeraj Bubby.pptxEmployee Retension Capstone Project - Neeraj Bubby.pptx
Employee Retension Capstone Project - Neeraj Bubby.pptx
 
Principles of management
Principles of managementPrinciples of management
Principles of management
 
171 Red beads The company as a system - Essential Lean 2014 01
171 Red beads   The company as a system - Essential Lean 2014 01171 Red beads   The company as a system - Essential Lean 2014 01
171 Red beads The company as a system - Essential Lean 2014 01
 
Performance Appraisal HRM
Performance Appraisal HRMPerformance Appraisal HRM
Performance Appraisal HRM
 
Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...
Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...
Transforming End of Life Care in Acute Hospitals PM Workshop 3: Vital Signs ‘...
 
Employee productivity and Role of HR
Employee productivity and Role of HREmployee productivity and Role of HR
Employee productivity and Role of HR
 
Unit 2 - Statistics
Unit 2 - StatisticsUnit 2 - Statistics
Unit 2 - Statistics
 
How Experienced Workers are Re-energizing the Workforce
How Experienced Workers  are Re-energizing the WorkforceHow Experienced Workers  are Re-energizing the Workforce
How Experienced Workers are Re-energizing the Workforce
 

More from Amrapali Zaveri, PhD

Data Quality and the FAIR principles
Data Quality and the FAIR principlesData Quality and the FAIR principles
Data Quality and the FAIR principles
Amrapali Zaveri, PhD
 
Workshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in WikidataWorkshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in Wikidata
Amrapali Zaveri, PhD
 
ESOF Panel 2018
ESOF Panel 2018ESOF Panel 2018
ESOF Panel 2018
Amrapali Zaveri, PhD
 
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality AssessmentMetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
Amrapali Zaveri, PhD
 
smartAPI: Towards a more intelligent network of Web APIs
smartAPI: Towards a more intelligent network of Web APIssmartAPI: Towards a more intelligent network of Web APIs
smartAPI: Towards a more intelligent network of Web APIs
Amrapali Zaveri, PhD
 
Introduction to Bio SPARQL
Introduction to Bio SPARQL Introduction to Bio SPARQL
Introduction to Bio SPARQL
Amrapali Zaveri, PhD
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
Amrapali Zaveri, PhD
 
Linked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyLinked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A Survey
Amrapali Zaveri, PhD
 
Amrapali Zaveri Defense
Amrapali Zaveri DefenseAmrapali Zaveri Defense
Amrapali Zaveri Defense
Amrapali Zaveri, PhD
 
LDQ 2014 DQ Methodology
LDQ 2014 DQ MethodologyLDQ 2014 DQ Methodology
LDQ 2014 DQ Methodology
Amrapali Zaveri, PhD
 
LOD-SEM
LOD-SEMLOD-SEM
TripleCheckMate
TripleCheckMateTripleCheckMate
TripleCheckMate
Amrapali Zaveri, PhD
 
Towards Biomedical Data Integration for Analyzing the Evolution of Cognition
Towards Biomedical Data Integration for Analyzing the Evolution of CognitionTowards Biomedical Data Integration for Analyzing the Evolution of Cognition
Towards Biomedical Data Integration for Analyzing the Evolution of Cognition
Amrapali Zaveri, PhD
 
User-driven Quality Evaluation of DBpedia
User-driven Quality Evaluation of DBpediaUser-driven Quality Evaluation of DBpedia
User-driven Quality Evaluation of DBpedia
Amrapali Zaveri, PhD
 
ReDD-Observatory
ReDD-ObservatoryReDD-Observatory
ReDD-Observatory
Amrapali Zaveri, PhD
 

More from Amrapali Zaveri, PhD (16)

Data Quality and the FAIR principles
Data Quality and the FAIR principlesData Quality and the FAIR principles
Data Quality and the FAIR principles
 
Workshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in WikidataWorkshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in Wikidata
 
ESOF Panel 2018
ESOF Panel 2018ESOF Panel 2018
ESOF Panel 2018
 
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality AssessmentMetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
 
smartAPI: Towards a more intelligent network of Web APIs
smartAPI: Towards a more intelligent network of Web APIssmartAPI: Towards a more intelligent network of Web APIs
smartAPI: Towards a more intelligent network of Web APIs
 
Introduction to Bio SPARQL
Introduction to Bio SPARQL Introduction to Bio SPARQL
Introduction to Bio SPARQL
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
Linked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyLinked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A Survey
 
Amrapali Zaveri Defense
Amrapali Zaveri DefenseAmrapali Zaveri Defense
Amrapali Zaveri Defense
 
LDQ 2014 DQ Methodology
LDQ 2014 DQ MethodologyLDQ 2014 DQ Methodology
LDQ 2014 DQ Methodology
 
LOD-SEM
LOD-SEMLOD-SEM
LOD-SEM
 
TripleCheckMate
TripleCheckMateTripleCheckMate
TripleCheckMate
 
Towards Biomedical Data Integration for Analyzing the Evolution of Cognition
Towards Biomedical Data Integration for Analyzing the Evolution of CognitionTowards Biomedical Data Integration for Analyzing the Evolution of Cognition
Towards Biomedical Data Integration for Analyzing the Evolution of Cognition
 
User-driven Quality Evaluation of DBpedia
User-driven Quality Evaluation of DBpediaUser-driven Quality Evaluation of DBpedia
User-driven Quality Evaluation of DBpedia
 
Converting GHO to RDF
Converting GHO to RDFConverting GHO to RDF
Converting GHO to RDF
 
ReDD-Observatory
ReDD-ObservatoryReDD-Observatory
ReDD-Observatory
 

Recently uploaded

Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 

Recently uploaded (20)

Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 

CrowdED: Guideline for optimal Crowdsourcing Experimental Design

  • 1. CrowdED: Guideline for Optimal Crowdsourcing Amrapali Zaveri, Pedro Hernandez Serrano, Manisha Desai, Michel Dumontier HumL@WWW2018 @AmrapaliZ 24 April, 20181
  • 2. Crowdsourcing Tasks ❖ Tasks based on human skills not yet replicable by machines ❖ Highly parallelizable tasks ❖ Every human (worker) must be provided with a monetary reward for an answer ❖ Consolidated answers solve scientific problems !2
  • 3. Crowdsourcing Design ❖ Gold standard questions ❖ Master Workers ❖ Majority voting ❖ Overall accuracy !3
  • 4. Crowdsourcing Use Case Biomedical Metadata Quality Assessment* !4 *MetaCrowd: Crowdsourcing Biomedical Metadata Quality Assessment.  Amrapali Zaveri and Michel Dumontier. Bio-Ontologies 2017.
  • 5. How CrowdED is too crowded? BUT !5
  • 6. Research Question Can we a-priori estimate optimal workers and tasks' assignment to obtain maximum accuracy on all tasks? !6
  • 8. Related Studies !8 Adaptive Model Active Learning KB Test Questions Self Assessment Cost-Time & Cost-Quality Optimization CrowdED CrowdED offers a two-staged statistical model to estimate a-priori worker and task assignment to achieve maximum accuracy.
  • 9. Stage 1: • Train all workers • On a proportion of tasks • Identify best workers & • Hard tasks 2 Stages !9 ! Stage 2: • Assign best workers to • Hard tasks • Remaining tasks • Calculate Overall Accuracy !
  • 10. Stage 1 ! Stage 1 Easy Hard Good Poor Workers Tasks !10
  • 11. Assign Tasks to Workers ! Stage 1 Easy Hard Good Poor Workers Tasks Task Label Truth 1 1 hard_task age 1 2 hard_task age 1 3 hard_task age 1 4 easy_task age 1 5 easy_task age Simulate Odd no. Proportion of tasks to train !11 Worker Label Truth 1 1 good_worker age 2 1 poor_worker age 3 1 good_worker age 4 1 good_worker age 5 1 poor_worker age Workerview Taskview
  • 12. Calculate Worker Accuracy & Task Difficulty !12 Task Label Truth Task Difficulty 1 1 hard_task age 0.54 1 2 hard_task age 0.42 1 3 hard_task age 0.45 1 4 easy_task age 0.80 1 5 easy_task age 0.70 Worker Label Truth Worker Accuracy 1 1 good_worker age 0.75 2 1 poor_worker age 0.58 3 1 good_worker age 0.78 4 1 good_worker age 0.95 5 1 poor_worker age 0.54 Workerview Taskview
  • 13. Simulate Worker Answer !13 Task Label Truth Task Difficulty Worker Answer 1 1 hard_task age 0.54 age 1 2 hard_task age 0.42 tissue 1 3 hard_task age 0.45 disease 1 4 easy_task age 0.80 age 1 5 easy_task age 0.70 age Worker Label Truth Worker Accuracy Worker Answer 1 1 good_worker age 0.75 age 2 1 poor_worker age 0.58 age 3 1 good_worker age 0.78 age 4 1 good_worker age 0.95 tissue 5 1 poor_worker age 0.54 age !13 Workerview Taskview
  • 14. Calculate Worker Performance Avg. proportion of times a worker is in agreement with other workers for a given task vs. all tasks performed by the worker Range [0…1] Threshold identify ! Easy Hard Good Poor !14
  • 15. Easy Tasks !15 Hard Tasks! Worker Label Truth Worker Accuracy Worker Answer 1 1 good_worker age 0.75 age 2 1 poor_worker age 0.58 age 3 1 good_worker age 0.78 age 4 1 good_worker age 0.95 tissue 5 1 poor_worker age 0.54 age Worker Label Truth Worker Accuracy Worker Answer 2 2 good_worker age 0.75 treatment 3 2 poor_worker age 0.58 disease 15 2 good_worker age 0.78 age 17 2 poor_worker age 0.95 tissue 20 2 poor_worker age 0.54 Taskview Taskview
  • 16. Stage 1: • Train all workers • On a proportion of tasks • Identify best workers & • Hard tasks 2 Stages !16 ! Stage 2: • Assign best workers to • Hard tasks & • Remaining tasks • Calculate Overall Accuracy !
  • 17. Stage 2 ! Easy Hard Good Poor Stage 2 !17
  • 18. Simulate Worker Answer Stage 2 ! Hard Good simulate Remaining Tasks !18 Task Label Truth Task Difficulty Worker Answer 1 1 hard_task age 0.54 age 1 2 hard_task age 0.42 tissue 1 3 hard_task age 0.45 disease 1 4 easy_task age 0.80 age 1 5 easy_task age 0.70 age Workerview
  • 19. Merge Stage 1 and 2 & Assign Answers !19 Worker Label Truth Worker Accuracy Worker Answer 1 1 good_worker age 0.75 age 2 1 poor_worker age 0.58 age 3 1 good_worker age 0.78 age 4 1 good_worker age 0.95 tissue 5 1 poor_worker age 0.54 age Taskview Answer = age
  • 20. Assessing Design Merged Dataset calculate !20 Overall Accuracy avg. of all the tasks which had consensus Worker Label Truth Worker Accuracy Worker Answer 1 1 good_worker age 0.75 age 2 1 poor_worker age 0.58 age 3 1 good_worker age 0.78 age 4 1 good_worker age 0.95 tissue 5 1 poor_worker age 0.54 age Taskview
  • 21. Experimental Evaluation • tasks = [60, 80, 100, 120, 140, 160, 180] • workers = [20, 30, 40] • answers key = ["liver", "blood", "lung", "brain", “heart"] • good workers = [0.1, 0.3, 0.5, 0.7, 0.9] • hard tasks = [0.1, 0.3, 0.5, 0.7, 0.9] • proportion of training tasks = [0.2, 0.3, 0.4, 0.5, 0.6] • workers per task = [3, 5, 7, 9, 11] 13,125 combinations !21
  • 22. • Results support the intuition that reduced difficulty (10%) in tasks result in higher accuracy !22
  • 23. • calculating the performance of the workers in combination with whether she was a good worker (from the beginning) ensures that she is the best worker • adopting the two- staged algorithm ensures that only the best workers are chosen to perform all the tasks !23
  • 25. CrowdED recommendation • no. of workers should be 40-60% of the total number of tasks • train workers on 40-60% of the tasks in Stage 1 • set the number of workers per task to be either 3, 5 or 7 (fewer than 9) • reduce the number of hard tasks • adopt the two-staged algorithm to identify the best workers !25
  • 27. Conclusion & Future Work • Two-staged statistical design for designing optimal crowdsourcing experiments • a-priori estimate optimal workers and tasks' assignment to obtain maximum accuracy on all tasks • Implemented in Python, open source, Jupyter notebook • Future work • Training the workers vs. not training • Real-world experiments and comparison with baseline approaches • Include budgetary constraints • Extend the interface to allow user to vary parameters and observe sensitivity the design is to various assumptions !27
  • 28. @AmrapaliZamrapali.zaveri@maastrichtuniversity.nl Thank You! Questions? Try it yourself https://github.com/MaastrichtU-IDS/crowdED Feedback welcome ! !28