SlideShare a Scribd company logo
HUMAN
COMPUTATION
FUNDAMENTALS
ELENA SIMPERL
UNIVERSITY OF SOUTHAMPTON
7/18/2013
Tutorial@ESWC2013
1
HUMAN COMPUTATION
REVISITED
• Outsourcing tasks that machines
find difficult to solve to humans
• Difficult not the same as impossible
• Accuracy, efficiency, cost
• Historically humans were the first
computers
• 17th century: Halley‘s commet
• 19th century: computing factories
• 20th century: professionalization of
human computation
• Characteristics: division of labor,
redundancy, multiple methods to find
or check the correctness of a solution
7/18/2013
Tutorial@ESWC2013
2
DIMENSIONS
OF HUMAN
COMPUTATION
HUMAN COMPUTATION FUNDAMENTALS
7/18/2013
3
Tutorial@ESWC201
DIMENSIONS OF HUMAN
COMPUTATION
WHAT IS
OUTSOURCED
• Tasks based on human skills
not easily replicable by
machines (visual
recognition, language
understanding, knowledge
acquisition, basic human
communication etc)
HOW IS THE TASK
OUTSOURCED
• Explicit vs. implicit
participation
• Tasks broken down into
smaller units undertaken
in parallel by different
people
• Coordination required to
handle cases with more
complex workflows
• Partial or independent
answers consolidated
and aggregated into
complete solution
7/18/2013
Tutorial@ESWC2013
4
See also [Quinn & Bederson, 2012]
WHO IS THE CROWD
• Open call
• Call may target specific skills
and expertise
• Requester typically knows
less about the workers than
in other work environments
EXAMPLE: CITIZEN
SCIENCE VIA HUMAN
COMPUTATION
WHAT IS OUTSOURCED
• Object recognition, labeling,
categorization in media content
WHO IS THE CROWD
• Anyone
HOW IS THE TASK
OUTSOURCED
• Highly parallelizable tasks
• Every item is handled by multiple
annotators
• Every annotator provides an answer
• Consolidated answers solve scientific
problems
7/18/2013
Tutorial@ESWC2013
5
Country of residence
• United States: 46.80%
• India: 34.00%
• Miscellaneous: 19.20%
7/18/2013
Tutorial@ESWC201
6
A LARGE, BUT NOT
ALWAYS DIVERSE
CROWD
60% OF WORKERS SPEND
MORE THAN 4 HOURS A
WEEK ON MTURK
7
http://www.mturk-tracker.com/
SIGNIFICANT AMOUNT
OF RESOURCES AND
TIMELY DELIVERY
7/18/2013
Tutorial@ESWC2013
8
BROAD RANGE OF
TASKS
7/18/2013
Tutorial@ESWC2013
9
Users aware of how their
input contributes to the
achievement of
application’s goal (and
identify themselves with it)
vs.
Tasks are hidden behind
the application narratives.
Engagement ensured
through other incentives
7/18/2013
Tutorial@ESWC201
10
EXPLICIT VS. IMPLICIT
CONTRIBUTION - AFFECTS
MOTIVATION AND ENGAGEMENT
COMPLEX WORKFLOWS CANNOT
ALWAYS BE DIRECTLY
IMPLEMENTED
7/18/2013
Tutorial@ESWC201
11
http://www.youtube.com/watch?v=n_miZqsPwsc
WHAT IS OUTSOURCED
• Text shortening, proof-
reading, open editing
WHO IS THE CROWD
• MTurk
HOW IS THE TASK
OUTSOURCED
• Text divided into paragraphs
• Select-fix-verify pattern
• Multiple workers in each step
See also [Bernstein et al., 2010]
DIMENSIONS OF HUMAN
COMPUTATION (2)
HOW ARE THE
RESULTS VALIDATED
• Solutions space closed
vs. open
• Performance
measurements/ground
truth
• Statistical techniques
employed to predict
accurate solutions
• May take into account
confidence values of
algorithmically
generated solutions
HOW CAN THE
PROCESS BE
OPTIMIZED
• Incentives and
motivators
• Assigning tasks to
people based on their
skills and performance
(as opposed to random
assignments
• Symbiotic
combinations of
human- and machine-
driven computation,
including combinations
of different forms of
crowdsourcing
7/18/2013
Tutorial@ESWC2013
12
See also [Quinn & Bederson, 2012]
OPEN SOLUTION SPACES
Selecting
the right
option vs.
assessing
the quality
of the work
7/18/2013
Tutorial@ESWC2013
13
The goal is to
undertake
much of the
assessment
either
automatically
or use the
crowd for it.
MEASURING PERFORMANCE
CAN BE CHALLENGING
WHO AND HOW
• Redundancy
• Excluding spam and
obviously wrong answers
• Voting and ratings by the
crowd
• Assessment by the
requester
• Where does the ground
truth come from and is it
needed
• Note: improving recall of
algorithms
WHEN
• Real-time constraints
in games
• Near-real-time
microtasks, see
Bernstein et al.
Crowds in Two
Seconds: Enabling
Realtime Crowd-
Powered Interfaces.
In Proc. UIST 2011.
7/18/2013
Tutorial@ESWC2013
14
ALIGNING INCENTIVES
IS ESSENTIAL
Motivation: driving force that
makes humans achieve their
goals
Incentives: ‘rewards’ assigned
by an external ‘judge’ to a
performer for undertaking a
specific task
• Common belief (among
economists): incentives can be
translated into a sum of money
for all practical purposes.
Incentives can be related to
both extrinsic and intrinsic
motivations.
Extrinsic motivation if task is
considered boring, dangerous,
useless, socially undesirable,
dislikable by the performer.
Intrinsic motivation is driven by
an interest or enjoyment in the
task itself.
IN THIS TUTORIAL
GAMES WITH A
PURPOSE
MICROTASKS
HYBRID SYSTEMS
HUMAN COMPUTATION FUNDAMENTALS
7/18/2013
16
Tutorial@ESWC201
GAMES WITH A
PURPOSE (GWAP)
Human computation disguised as casual games
Tasks are divided into parallelizable atomic units
(challenges) solved (consensually) by players
Game models
• Single vs. multi-player
• Selection agreement vs. input agreement vs. inversion-
problem games
See also [van Ahn & Dabbish, 2008]
7/18/2013
17
MICROTASK
CROWDSOURCING
Similar types of tasks, but different incentives model
(monetary reward)
Successfully applied to transcription, classification, and
content generation, data collection, image tagging, website
feedback, usability tests…
7/18/2013
Tutorial@ESWC2013
18
THE SAME, BUT
DIFFERENT
• Tasks leveraging common human skills, appealing to large
audiences
• Selection of domain and task more constrained in games to
create typical UX
• Tasks decomposed into smaller units of work to be solved
independently
• Complex workflows
• Creating a casual game experience vs. patterns in microtasks
• Quality assurance
• Synchronous interaction in games
• Levels of difficulty and near-real-time feedback in games
• Many methods applied in both cases (redundancy, votes,
statistical techniques)
• Different set of incentives and motivators
7/18/2013
Tutorial@ESWC2013
19
Physical World
(people and devices)
HYBRID SYSTEMS
Design and
composition
Participation and
data supply
Model of social interaction
Virtual world
(Network of
social interactions)
Dave Robertson
EXAMPLE: HYBRID IMAGE SEARCH
21
Yan, Kumar, Ganesan, CrowdSearch: Exploiting Crowds for Accurate Real-time Image
Search on Mobile Phones, Mobisys 2010.
Not sure
EXAMPLE: HYBRID DATA
INTEGRATION
paper conf
Data integration VLDB-01
Data mining SIGMOD-02
title author email
OLAP Mike mike@a
Social media Jane jane@b
Generate plausible matches
– paper = title, paper = author, paper = email, paper = venue
– conf = title, conf = author, conf = email, conf = venue
Ask users to verify
paper conf
Data integration VLDB-01
Data mining SIGMOD-02
title author email venue
OLAP Mike mike@a ICDE-02
Social media Jane jane@b PODS-05
Does attribute paper match attribute author?
NoYes
McCann, Shen, Doan: Matching Schemas in Online Communities. ICDE, 2008
22
EXAMPLE: HYBRID QUERY
PROCESSING
23
Use the crowd to
answer DB-hard
queries
Where to use the crowd:
Find missing data
Make subjective comparisons
Recognize patterns
But not:
Anything the computer already does
well
M. Franklin, D. Kossmann, T. Kraska, S. Ramesh and R. Xin .
CrowdDB: Answering Queries with Crowdsourcing, SIGMOD 2011
WHAT’S NEXT
HUMAN COMPUTATION FUNDAMENTALS
7/18/2013
24
Tutorial@ESWC2013
More people
MoremachinesTHE BIGGER PICTURE
Machines using people
e.g., human computation
People using machines
e.g., collective action
Dave de Roure
More people
Moremachines
OPEN QUESTION: HOW TO BUILD
SOCIAL SYSTEMS AT SCALE?
Big Data
Big Compute
Conventional
Computation
The Future!
Social
Networking
e-infrastructure
online
R&D
Dave de Roure

More Related Content

What's hot

Proposal defense
Proposal defenseProposal defense
Proposal defense
leducmills
 
hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)
hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)
hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)
City University London
 
Algorithmic Accountability & Learning Analytics (UCL)
Algorithmic Accountability & Learning Analytics (UCL)Algorithmic Accountability & Learning Analytics (UCL)
Algorithmic Accountability & Learning Analytics (UCL)
Simon Buckingham Shum
 

What's hot (20)

Metrocon-Rise-Of-Crowd-Computing
Metrocon-Rise-Of-Crowd-ComputingMetrocon-Rise-Of-Crowd-Computing
Metrocon-Rise-Of-Crowd-Computing
 
Computational Thinking - A Revolution in 4 Steps
Computational Thinking - A Revolution in 4 StepsComputational Thinking - A Revolution in 4 Steps
Computational Thinking - A Revolution in 4 Steps
 
"So UX Designers Just Make Things Pretty, Right?" and Other Myths Debunked
"So UX Designers Just Make Things Pretty, Right?" and Other Myths Debunked"So UX Designers Just Make Things Pretty, Right?" and Other Myths Debunked
"So UX Designers Just Make Things Pretty, Right?" and Other Myths Debunked
 
Rise of Crowd Computing (December 2012)
Rise of Crowd Computing (December 2012)Rise of Crowd Computing (December 2012)
Rise of Crowd Computing (December 2012)
 
Proposal defense
Proposal defenseProposal defense
Proposal defense
 
Can Computers Design? Presented at interaction16, March 2, 2016, Helsinki by ...
Can Computers Design? Presented at interaction16, March 2, 2016, Helsinki by ...Can Computers Design? Presented at interaction16, March 2, 2016, Helsinki by ...
Can Computers Design? Presented at interaction16, March 2, 2016, Helsinki by ...
 
Usability First - Introduction to User-Centered Design
Usability First - Introduction to User-Centered DesignUsability First - Introduction to User-Centered Design
Usability First - Introduction to User-Centered Design
 
BCII 2016 - Visualizing Complexity
BCII 2016 - Visualizing ComplexityBCII 2016 - Visualizing Complexity
BCII 2016 - Visualizing Complexity
 
hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)
hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)
hcid2011 - Creativity for open spaces - Dr Sara Jones (HCID)
 
Provocations for CLA Dashboard
Provocations for CLA DashboardProvocations for CLA Dashboard
Provocations for CLA Dashboard
 
Computational Thinking - a 4 step approach and a new pedagogy
Computational Thinking - a 4 step approach and a new pedagogyComputational Thinking - a 4 step approach and a new pedagogy
Computational Thinking - a 4 step approach and a new pedagogy
 
Junkbots and computational thinking
Junkbots and computational thinkingJunkbots and computational thinking
Junkbots and computational thinking
 
Applying Computational Thinking
Applying  Computational ThinkingApplying  Computational Thinking
Applying Computational Thinking
 
Algorithmic Accountability & Learning Analytics (UCL)
Algorithmic Accountability & Learning Analytics (UCL)Algorithmic Accountability & Learning Analytics (UCL)
Algorithmic Accountability & Learning Analytics (UCL)
 
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
 
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
 
What is computational thinking? Who needs it? Why? How can it be learnt? ...
What is computational thinking?  Who needs it?  Why?  How can it be learnt?  ...What is computational thinking?  Who needs it?  Why?  How can it be learnt?  ...
What is computational thinking? Who needs it? Why? How can it be learnt? ...
 
Qsite Presentation computational thinking 2013
Qsite Presentation   computational thinking 2013Qsite Presentation   computational thinking 2013
Qsite Presentation computational thinking 2013
 
Computational Thinking: Why It is Important for All Students
Computational Thinking: Why It is Important for All StudentsComputational Thinking: Why It is Important for All Students
Computational Thinking: Why It is Important for All Students
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
 

Similar to Fundamentals of human computation

Crowdsourcing Linked Data management
Crowdsourcing Linked Data managementCrowdsourcing Linked Data management
Crowdsourcing Linked Data management
Elena Simperl
 
Yang presentation 2
Yang presentation 2Yang presentation 2
Yang presentation 2
Yang Huang
 
Yang presentation 2
Yang presentation 2Yang presentation 2
Yang presentation 2
Yang Huang
 
Towards a classification framework for social machines
Towards a classification framework for social machinesTowards a classification framework for social machines
Towards a classification framework for social machines
Elena Simperl
 
Towards a classification framework for social machines
Towards a classification  framework for social machinesTowards a classification  framework for social machines
Towards a classification framework for social machines
SOCIAM Project
 

Similar to Fundamentals of human computation (20)

A brief introduction to crowdsourcing for data collection
A brief introduction to crowdsourcing for data collectionA brief introduction to crowdsourcing for data collection
A brief introduction to crowdsourcing for data collection
 
Crowdsourcing for research libraries
Crowdsourcing for research librariesCrowdsourcing for research libraries
Crowdsourcing for research libraries
 
Incentives-driven technology design
Incentives-driven technology designIncentives-driven technology design
Incentives-driven technology design
 
Crowdsourcing Linked Data management
Crowdsourcing Linked Data managementCrowdsourcing Linked Data management
Crowdsourcing Linked Data management
 
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
 
Human factor in big data qrowd bdve
Human factor in big data qrowd bdveHuman factor in big data qrowd bdve
Human factor in big data qrowd bdve
 
Yang presentation 2
Yang presentation 2Yang presentation 2
Yang presentation 2
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big Data
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big Data
 
We are the data
We are the dataWe are the data
We are the data
 
Human computation and the Semantic Web (examples)
Human computation and the Semantic Web (examples)Human computation and the Semantic Web (examples)
Human computation and the Semantic Web (examples)
 
Yang presentation 2
Yang presentation 2Yang presentation 2
Yang presentation 2
 
Graphic design and UI efficiency
Graphic design and UI efficiencyGraphic design and UI efficiency
Graphic design and UI efficiency
 
Social machines: theory design and incentives
Social machines: theory design and incentivesSocial machines: theory design and incentives
Social machines: theory design and incentives
 
Advanced Methods for User Evaluation in Enterprise AR
Advanced Methods for User Evaluation in Enterprise ARAdvanced Methods for User Evaluation in Enterprise AR
Advanced Methods for User Evaluation in Enterprise AR
 
UXPA2019 UX fundamentals for adapting science-based interfaces for non-techni...
UXPA2019 UX fundamentals for adapting science-based interfaces for non-techni...UXPA2019 UX fundamentals for adapting science-based interfaces for non-techni...
UXPA2019 UX fundamentals for adapting science-based interfaces for non-techni...
 
Snips and snails and puppy dog tails: the need to preserve complexity in math...
Snips and snails and puppy dog tails: the need to preserve complexity in math...Snips and snails and puppy dog tails: the need to preserve complexity in math...
Snips and snails and puppy dog tails: the need to preserve complexity in math...
 
ICS3211 lntelligent Interfaces
ICS3211 lntelligent InterfacesICS3211 lntelligent Interfaces
ICS3211 lntelligent Interfaces
 
Towards a classification framework for social machines
Towards a classification framework for social machinesTowards a classification framework for social machines
Towards a classification framework for social machines
 
Towards a classification framework for social machines
Towards a classification  framework for social machinesTowards a classification  framework for social machines
Towards a classification framework for social machines
 

More from Elena Simperl

One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
Elena Simperl
 

More from Elena Simperl (20)

When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
This talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing science
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generation
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impact
 
Ten myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfTen myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdf
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Data commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfData commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdf
 
Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Crowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesCrowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart cities
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on Twitter
 
High-value datasets: from publication to impact
High-value datasets: from publication to impactHigh-value datasets: from publication to impact
High-value datasets: from publication to impact
 
The story of Data Stories
The story of Data StoriesThe story of Data Stories
The story of Data Stories
 
The human face of AI: how collective and augmented intelligence can help sol...
The human face of AI:  how collective and augmented intelligence can help sol...The human face of AI:  how collective and augmented intelligence can help sol...
The human face of AI: how collective and augmented intelligence can help sol...
 
Qrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesQrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart cities
 
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
 

Recently uploaded

Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated crops
parmarsneha2
 

Recently uploaded (20)

Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
B.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdfB.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdf
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated crops
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptxSolid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 

Fundamentals of human computation

  • 1. HUMAN COMPUTATION FUNDAMENTALS ELENA SIMPERL UNIVERSITY OF SOUTHAMPTON 7/18/2013 Tutorial@ESWC2013 1
  • 2. HUMAN COMPUTATION REVISITED • Outsourcing tasks that machines find difficult to solve to humans • Difficult not the same as impossible • Accuracy, efficiency, cost • Historically humans were the first computers • 17th century: Halley‘s commet • 19th century: computing factories • 20th century: professionalization of human computation • Characteristics: division of labor, redundancy, multiple methods to find or check the correctness of a solution 7/18/2013 Tutorial@ESWC2013 2
  • 3. DIMENSIONS OF HUMAN COMPUTATION HUMAN COMPUTATION FUNDAMENTALS 7/18/2013 3 Tutorial@ESWC201
  • 4. DIMENSIONS OF HUMAN COMPUTATION WHAT IS OUTSOURCED • Tasks based on human skills not easily replicable by machines (visual recognition, language understanding, knowledge acquisition, basic human communication etc) HOW IS THE TASK OUTSOURCED • Explicit vs. implicit participation • Tasks broken down into smaller units undertaken in parallel by different people • Coordination required to handle cases with more complex workflows • Partial or independent answers consolidated and aggregated into complete solution 7/18/2013 Tutorial@ESWC2013 4 See also [Quinn & Bederson, 2012] WHO IS THE CROWD • Open call • Call may target specific skills and expertise • Requester typically knows less about the workers than in other work environments
  • 5. EXAMPLE: CITIZEN SCIENCE VIA HUMAN COMPUTATION WHAT IS OUTSOURCED • Object recognition, labeling, categorization in media content WHO IS THE CROWD • Anyone HOW IS THE TASK OUTSOURCED • Highly parallelizable tasks • Every item is handled by multiple annotators • Every annotator provides an answer • Consolidated answers solve scientific problems 7/18/2013 Tutorial@ESWC2013 5
  • 6. Country of residence • United States: 46.80% • India: 34.00% • Miscellaneous: 19.20% 7/18/2013 Tutorial@ESWC201 6 A LARGE, BUT NOT ALWAYS DIVERSE CROWD
  • 7. 60% OF WORKERS SPEND MORE THAN 4 HOURS A WEEK ON MTURK 7 http://www.mturk-tracker.com/
  • 8. SIGNIFICANT AMOUNT OF RESOURCES AND TIMELY DELIVERY 7/18/2013 Tutorial@ESWC2013 8
  • 10. Users aware of how their input contributes to the achievement of application’s goal (and identify themselves with it) vs. Tasks are hidden behind the application narratives. Engagement ensured through other incentives 7/18/2013 Tutorial@ESWC201 10 EXPLICIT VS. IMPLICIT CONTRIBUTION - AFFECTS MOTIVATION AND ENGAGEMENT
  • 11. COMPLEX WORKFLOWS CANNOT ALWAYS BE DIRECTLY IMPLEMENTED 7/18/2013 Tutorial@ESWC201 11 http://www.youtube.com/watch?v=n_miZqsPwsc WHAT IS OUTSOURCED • Text shortening, proof- reading, open editing WHO IS THE CROWD • MTurk HOW IS THE TASK OUTSOURCED • Text divided into paragraphs • Select-fix-verify pattern • Multiple workers in each step See also [Bernstein et al., 2010]
  • 12. DIMENSIONS OF HUMAN COMPUTATION (2) HOW ARE THE RESULTS VALIDATED • Solutions space closed vs. open • Performance measurements/ground truth • Statistical techniques employed to predict accurate solutions • May take into account confidence values of algorithmically generated solutions HOW CAN THE PROCESS BE OPTIMIZED • Incentives and motivators • Assigning tasks to people based on their skills and performance (as opposed to random assignments • Symbiotic combinations of human- and machine- driven computation, including combinations of different forms of crowdsourcing 7/18/2013 Tutorial@ESWC2013 12 See also [Quinn & Bederson, 2012]
  • 13. OPEN SOLUTION SPACES Selecting the right option vs. assessing the quality of the work 7/18/2013 Tutorial@ESWC2013 13 The goal is to undertake much of the assessment either automatically or use the crowd for it.
  • 14. MEASURING PERFORMANCE CAN BE CHALLENGING WHO AND HOW • Redundancy • Excluding spam and obviously wrong answers • Voting and ratings by the crowd • Assessment by the requester • Where does the ground truth come from and is it needed • Note: improving recall of algorithms WHEN • Real-time constraints in games • Near-real-time microtasks, see Bernstein et al. Crowds in Two Seconds: Enabling Realtime Crowd- Powered Interfaces. In Proc. UIST 2011. 7/18/2013 Tutorial@ESWC2013 14
  • 15. ALIGNING INCENTIVES IS ESSENTIAL Motivation: driving force that makes humans achieve their goals Incentives: ‘rewards’ assigned by an external ‘judge’ to a performer for undertaking a specific task • Common belief (among economists): incentives can be translated into a sum of money for all practical purposes. Incentives can be related to both extrinsic and intrinsic motivations. Extrinsic motivation if task is considered boring, dangerous, useless, socially undesirable, dislikable by the performer. Intrinsic motivation is driven by an interest or enjoyment in the task itself.
  • 16. IN THIS TUTORIAL GAMES WITH A PURPOSE MICROTASKS HYBRID SYSTEMS HUMAN COMPUTATION FUNDAMENTALS 7/18/2013 16 Tutorial@ESWC201
  • 17. GAMES WITH A PURPOSE (GWAP) Human computation disguised as casual games Tasks are divided into parallelizable atomic units (challenges) solved (consensually) by players Game models • Single vs. multi-player • Selection agreement vs. input agreement vs. inversion- problem games See also [van Ahn & Dabbish, 2008] 7/18/2013 17
  • 18. MICROTASK CROWDSOURCING Similar types of tasks, but different incentives model (monetary reward) Successfully applied to transcription, classification, and content generation, data collection, image tagging, website feedback, usability tests… 7/18/2013 Tutorial@ESWC2013 18
  • 19. THE SAME, BUT DIFFERENT • Tasks leveraging common human skills, appealing to large audiences • Selection of domain and task more constrained in games to create typical UX • Tasks decomposed into smaller units of work to be solved independently • Complex workflows • Creating a casual game experience vs. patterns in microtasks • Quality assurance • Synchronous interaction in games • Levels of difficulty and near-real-time feedback in games • Many methods applied in both cases (redundancy, votes, statistical techniques) • Different set of incentives and motivators 7/18/2013 Tutorial@ESWC2013 19
  • 20. Physical World (people and devices) HYBRID SYSTEMS Design and composition Participation and data supply Model of social interaction Virtual world (Network of social interactions) Dave Robertson
  • 21. EXAMPLE: HYBRID IMAGE SEARCH 21 Yan, Kumar, Ganesan, CrowdSearch: Exploiting Crowds for Accurate Real-time Image Search on Mobile Phones, Mobisys 2010.
  • 22. Not sure EXAMPLE: HYBRID DATA INTEGRATION paper conf Data integration VLDB-01 Data mining SIGMOD-02 title author email OLAP Mike mike@a Social media Jane jane@b Generate plausible matches – paper = title, paper = author, paper = email, paper = venue – conf = title, conf = author, conf = email, conf = venue Ask users to verify paper conf Data integration VLDB-01 Data mining SIGMOD-02 title author email venue OLAP Mike mike@a ICDE-02 Social media Jane jane@b PODS-05 Does attribute paper match attribute author? NoYes McCann, Shen, Doan: Matching Schemas in Online Communities. ICDE, 2008 22
  • 23. EXAMPLE: HYBRID QUERY PROCESSING 23 Use the crowd to answer DB-hard queries Where to use the crowd: Find missing data Make subjective comparisons Recognize patterns But not: Anything the computer already does well M. Franklin, D. Kossmann, T. Kraska, S. Ramesh and R. Xin . CrowdDB: Answering Queries with Crowdsourcing, SIGMOD 2011
  • 24. WHAT’S NEXT HUMAN COMPUTATION FUNDAMENTALS 7/18/2013 24 Tutorial@ESWC2013
  • 25. More people MoremachinesTHE BIGGER PICTURE Machines using people e.g., human computation People using machines e.g., collective action Dave de Roure
  • 26. More people Moremachines OPEN QUESTION: HOW TO BUILD SOCIAL SYSTEMS AT SCALE? Big Data Big Compute Conventional Computation The Future! Social Networking e-infrastructure online R&D Dave de Roure