Pattern-Based Specification of Crowdsourcing Applications

Alessandro Bozzon
Alessandro BozzonAssistant Professor at Delft University of Technology
Pattern-Based Specification of
Crowdsourcing Applications
Alessandro Bozzon (TU Delft)
Marco Brambilla (Politecnico di Milano)

Stefano Ceri (Politecnico di Milano)

Andrea Mauri (Politecnico di Milano)

Riccardo Volonterio (Politecnico di Milano)
Crowdsourcing and Human Computation
It works like magic!
Endless Applications
Endless Success Stories
2008 Olympics Opening Ceremony
Actually…
Often a Try&HopeError process
Task Design Matters
Crowd can be unreliable
($) Incentives Matter
Quality Control Matters
Platform of Execution Matters
Pattern-Based Specification of Crowdsourcing Applications
Setting Nº1
Global Annotations with
simple counting
Setting Nº2
Local Annotations
with Bounding Boxes
Pattern-Based Specification of Crowdsourcing Applications
Pattern-Based Specification of Crowdsourcing Applications
Setting Nº3
Local Annotations with Verified Bounding Boxes
Setting Nº3
Local Annotations with Verified Bounding Boxes
Setting Nº3
Local Annotations with Verified Bounding Boxes
Ok, so what?
#Workers
#Useful
Workers
#Executions Cost $
Time
(hours)
Precision
Setting
Nº1
732
44
(6%)
488 72 ~40 ~67%
Setting
Nº2
498
25
(5%)
547 48 ~169 ~67%
Setting
Nº3
1420
464

(32%)
3387 83 ~184 ~75%
Total 2152 508 4422 203 ~16 days
Our study
Our study
Our Contribution
GOAL
!
Simplify and systematize the design, deploy, and
monitoring of applications (including experiments)
Contributions
An Abstract
Model of
Crowdsourcing
Activities
A Composition
model for
Crowdsourcing
Activities
A Library of
crowdsourcing
Patterns
• A conceptual framework
• A specification paradigm
• A reactive execution control
environment
Models
DEMO VIDEO
Crowd Task
[T operation types]
(intra-task patterns)
Object Type
block size
min #obj
(cons)
input buffer
batch flow (on closed task)
stream flow (on closed object)
MicroTask
[MT operation types]
r
data manipulator
Case Study: Movie Scene Analysis
Scenario 1: Scene Positioning
Spoiler Alert!
Order Scenes
Scene in Beg/Mid/End
Scenario 2: Actor Identification
Find Actor
Validate Actor
Pattern-Based Specification of Crowdsourcing Applications
Position Scenes
[Classify]
(Static Agreement@3)
MicroTask [Classify]
Scene
block 1
min 1
Spoiler Scenes
[Like]
(Static Agreement@3)
MicroTask [Like]
Scene
block 1
min 1
5
Order Scenes
[Order] (SortByLiking)
MicroTask [Like]
Scene
block 2
min 2
[Class=E]
[Class=B OR M]
Example of Scenario 1 Model
Patterns
Intra-Task Auxiliary Workflow
Intra-Task
Pre-Processing
Post-
Processing
Task
Consensus
Splitting
Assignment
Aggregation
microTaskmicroTaskmicroTask
microTaskmicroTaskmicroTask
microTaskmicroTaskmicroTask
• Consensus
• Join
• Sort
• Grouping
• Performer Control
• Planning
• Assignment
• Aggregation
• Quality & Performer
Auxiliary Intra-Task
Pre-Processing
Post-
Processing
Task
Consensus
Splitting
Assignment
Aggregation
microTaskmicroTaskmicroTask
microTaskmicroTaskmicroTask
microTaskmicroTaskmicroTask
• Pruning • Tie Breaking
• Operations before or
after the execution
Workflow
• Set of heterogeneous
tasks
Create Decide
Improve
Compare
/ Verify
Find Fix
(a)
(b)
(c)
Auxiliary
Task
Create/Decide
Improve/Compare
Find/Fix/Verify
Experiments
1700 Executions
39$
September 2013
Streaming Vs. Batch Execution
Position Scenes
[Classify]
(Static Agreement@3)
MicroTask [Classify]
Scene
block 1
min 1
Spoiler Scenes
[Like]
(Static Agreement@3)
MicroTask [Like]
Scene
block 1
min 1
7
Order Scenes
[Order]
(SortByLiking)
MicroTask [Like]
Scene
block 2
min 2
[Class=E]
[Class=B OR M]
(P1)
5
3
Position Scenes
[Classify]
(Static Agreement@3)
MicroTask [Classify]
Scene
block 1
min 1
Spoiler Scenes
[Like]
(Static Agreement@3)
MicroTask [Like]
Scene
block 1
min 1
7
Order Scenes
[Order]
(SortByLiking)
MicroTask [Like]
Scene
block 2
min 2
Cons.
[Class=E]
[Class=B OR M]
(P2)
5
3
P.Beg P.Mid P.End
P1 0.5 1 0.11
P2 0.5 0.8 0.33
Spear.Beg Spear. Mid

.MidP1 0.5 0.54
P2 0.9 0.51
Position
Order
P1 P
b) Elapsed Tim
#ClosedObjects
1
10
20
30
40
50
60
70
80
5 60 120 180 240 300 360 5
Position
Order
P1
Position
Order
P2
b) Elapsed Time (Mins)
#ClosedObjects
1
10
20
30
40
50
60
70
80
5 60 120 180 240 300 360 5 60 120 180 240 300 360
Position
Order
P1 P
b) Elapsed Tim
#ClosedObjects
1
10
20
30
40
50
60
70
80
5 60 120 180 240 300 360 5
Intra-Task Consensus Vs. Workflow Decision
A4 A5 A6
Precision
 0.99 0.95 0.89
Recall 0.90 1 0.96
F-Score 0.93 0.97 0.90
Find Actors
[Tag]
(Static Agreement@3)
MicroTask [Tag]
Scene
block 1
min 1
Validate Actors
[Like]
MicroTask [Like]
Scene+Actor
block All
min 1
5
(A4)
Find Actors
[Tag]
MicroTask [Tag]
Scene
block 1
min 1
Validate Actors
[Like]
(Majority Voting@2)
MicroTask [Like]
Scene+Actor
block All
min 1
5 3
(A5)
(A6)
Find Actors
[Tag]
(Static Agreement@3)
MicroTask [Tag]
Scene
block 1
min 1
Validate Actors
[Like]
(Majority Voting@2)
MicroTask [Like]
Scene+Actor
block All
min 1
5 3
count(Actor.Like)<=1
Actor
Validate
A3
Actor
Validate
A4 A5
a) Elapsed Time (
#ClosedObjects
0
10
20
30
40
50
60
5 30 60 90 120 160 5 30 60 90 120 160 5
Actor
Validate
Actor
Validate
A4
Actor
Validate
A5
a) Elapsed Time (Mins)
30 60 90 120 160 5 30 60 90 120 160 5 60 120 180 240 60
Actor
Validate
Actor
Validate
A5
a) Elapsed Time (Mins)
Actor
Validate
A6
30 60 90 120 160 5 60 120 180 240 60 300 540 780
Actor
Validate
A3 A4
#ClosedObjects
0
10
20
30
40
50
60
5 30 60 90 120 160 5
Actor
Validate
A3 A4
#ClosedObjects
0
10
20
30
40
50
60
5 30 60 90 120 160 5
Actor
Validate
A3 A4
#ClosedObjects
0
10
20
30
40
50
60
5 30 60 90 120 160 5
Take-Home Message
Engineering approaches should be
applied to people management too
In crowdsourcing, control and
monitoring are key for successful
outcomes
Questions?
1 of 31

Recommended

City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in... by
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...Emanuele Della Valle
4.5K views18 slides
CUbRIK is by
CUbRIK is CUbRIK is
CUbRIK is CUbRIK Project
4.7K views13 slides
SAIL 2015 Crowdmanagement Experiment. Pitch slides by
SAIL 2015 Crowdmanagement Experiment. Pitch slidesSAIL 2015 Crowdmanagement Experiment. Pitch slides
SAIL 2015 Crowdmanagement Experiment. Pitch slidesAlessandro Bozzon
2.8K views12 slides
SealincMedia Accurator Demos by
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator DemosLora Aroyo
9K views32 slides
Reactive crowdsourcing by
Reactive crowdsourcingReactive crowdsourcing
Reactive crowdsourcingAlessandro Bozzon
9.7K views42 slides
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo... by
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...Marco Brambilla
6.9K views35 slides

More Related Content

More from Alessandro Bozzon

Weaving the Web of People and Things for Intelligent Cities by
Weaving the Web of People and Things for Intelligent CitiesWeaving the Web of People and Things for Intelligent Cities
Weaving the Web of People and Things for Intelligent CitiesAlessandro Bozzon
518 views63 slides
Trustworthy Micro-task Crowdsourcing: Challenges and Opportunities by
Trustworthy Micro-task Crowdsourcing: Challenges and OpportunitiesTrustworthy Micro-task Crowdsourcing: Challenges and Opportunities
Trustworthy Micro-task Crowdsourcing: Challenges and OpportunitiesAlessandro Bozzon
1K views55 slides
Social Data Science For Intelligent Cities by
Social Data Science For Intelligent CitiesSocial Data Science For Intelligent Cities
Social Data Science For Intelligent CitiesAlessandro Bozzon
835 views25 slides
ICWE 2013 - Slides From The Poster And Demo Session by
ICWE 2013 - Slides From The Poster And Demo SessionICWE 2013 - Slides From The Poster And Demo Session
ICWE 2013 - Slides From The Poster And Demo SessionAlessandro Bozzon
3.8K views63 slides
An Introduction to Human Computation and Games With A Purpose - Part I by
An Introduction to Human Computation and Games With A Purpose - Part IAn Introduction to Human Computation and Games With A Purpose - Part I
An Introduction to Human Computation and Games With A Purpose - Part IAlessandro Bozzon
5.3K views173 slides
A Service-Based Architecture for Multi-domain Search on the Web by
A Service-Based Architecture for Multi-domain Search on the WebA Service-Based Architecture for Multi-domain Search on the Web
A Service-Based Architecture for Multi-domain Search on the WebAlessandro Bozzon
917 views22 slides

More from Alessandro Bozzon(9)

Weaving the Web of People and Things for Intelligent Cities by Alessandro Bozzon
Weaving the Web of People and Things for Intelligent CitiesWeaving the Web of People and Things for Intelligent Cities
Weaving the Web of People and Things for Intelligent Cities
Alessandro Bozzon518 views
Trustworthy Micro-task Crowdsourcing: Challenges and Opportunities by Alessandro Bozzon
Trustworthy Micro-task Crowdsourcing: Challenges and OpportunitiesTrustworthy Micro-task Crowdsourcing: Challenges and Opportunities
Trustworthy Micro-task Crowdsourcing: Challenges and Opportunities
Social Data Science For Intelligent Cities by Alessandro Bozzon
Social Data Science For Intelligent CitiesSocial Data Science For Intelligent Cities
Social Data Science For Intelligent Cities
Alessandro Bozzon835 views
ICWE 2013 - Slides From The Poster And Demo Session by Alessandro Bozzon
ICWE 2013 - Slides From The Poster And Demo SessionICWE 2013 - Slides From The Poster And Demo Session
ICWE 2013 - Slides From The Poster And Demo Session
Alessandro Bozzon3.8K views
An Introduction to Human Computation and Games With A Purpose - Part I by Alessandro Bozzon
An Introduction to Human Computation and Games With A Purpose - Part IAn Introduction to Human Computation and Games With A Purpose - Part I
An Introduction to Human Computation and Games With A Purpose - Part I
Alessandro Bozzon5.3K views
A Service-Based Architecture for Multi-domain Search on the Web by Alessandro Bozzon
A Service-Based Architecture for Multi-domain Search on the WebA Service-Based Architecture for Multi-domain Search on the Web
A Service-Based Architecture for Multi-domain Search on the Web
Alessandro Bozzon917 views
Engineering Web Search Applications by Alessandro Bozzon
Engineering Web Search ApplicationsEngineering Web Search Applications
Engineering Web Search Applications
Alessandro Bozzon10.7K views
Liquid Query: Multi-domain Exploratory Search on the Web by Alessandro Bozzon
Liquid Query: Multi-domain Exploratory Search on the WebLiquid Query: Multi-domain Exploratory Search on the Web
Liquid Query: Multi-domain Exploratory Search on the Web
Alessandro Bozzon1.9K views

Recently uploaded

Nitrosamine & NDSRI.pptx by
Nitrosamine & NDSRI.pptxNitrosamine & NDSRI.pptx
Nitrosamine & NDSRI.pptxNileshBonde4
8 views22 slides
Conventional and non-conventional methods for improvement of cucurbits.pptx by
Conventional and non-conventional methods for improvement of cucurbits.pptxConventional and non-conventional methods for improvement of cucurbits.pptx
Conventional and non-conventional methods for improvement of cucurbits.pptxgandhi976
16 views35 slides
"How can I develop my learning path in bioinformatics? by
"How can I develop my learning path in bioinformatics?"How can I develop my learning path in bioinformatics?
"How can I develop my learning path in bioinformatics?Bioinformy
18 views13 slides
Ecology by
Ecology Ecology
Ecology Abhijith Raj.R
6 views10 slides
application of genetic engineering 2.pptx by
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptxSankSurezz
6 views12 slides
ENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptx by
ENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptxENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptx
ENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptxMN
6 views13 slides

Recently uploaded(20)

Conventional and non-conventional methods for improvement of cucurbits.pptx by gandhi976
Conventional and non-conventional methods for improvement of cucurbits.pptxConventional and non-conventional methods for improvement of cucurbits.pptx
Conventional and non-conventional methods for improvement of cucurbits.pptx
gandhi97616 views
"How can I develop my learning path in bioinformatics? by Bioinformy
"How can I develop my learning path in bioinformatics?"How can I develop my learning path in bioinformatics?
"How can I develop my learning path in bioinformatics?
Bioinformy18 views
application of genetic engineering 2.pptx by SankSurezz
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptx
SankSurezz6 views
ENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptx by MN
ENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptxENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptx
ENTOMOLOGY PPT ON BOMBYCIDAE AND SATURNIIDAE.pptx
MN6 views
Pollination By Nagapradheesh.M.pptx by MNAGAPRADHEESH
Pollination By Nagapradheesh.M.pptxPollination By Nagapradheesh.M.pptx
Pollination By Nagapradheesh.M.pptx
MNAGAPRADHEESH15 views
Guinea Pig as a Model for Translation Research by PervaizDar1
Guinea Pig as a Model for Translation ResearchGuinea Pig as a Model for Translation Research
Guinea Pig as a Model for Translation Research
PervaizDar111 views
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
How to be(come) a successful PhD student by Tom Mens
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD student
Tom Mens422 views
Ethical issues associated with Genetically Modified Crops and Genetically Mod... by PunithKumars6
Ethical issues associated with Genetically Modified Crops and Genetically Mod...Ethical issues associated with Genetically Modified Crops and Genetically Mod...
Ethical issues associated with Genetically Modified Crops and Genetically Mod...
PunithKumars618 views
Open Access Publishing in Astrophysics by Peter Coles
Open Access Publishing in AstrophysicsOpen Access Publishing in Astrophysics
Open Access Publishing in Astrophysics
Peter Coles543 views
Distinct distributions of elliptical and disk galaxies across the Local Super... by Sérgio Sacani
Distinct distributions of elliptical and disk galaxies across the Local Super...Distinct distributions of elliptical and disk galaxies across the Local Super...
Distinct distributions of elliptical and disk galaxies across the Local Super...
Sérgio Sacani30 views
Artificial Intelligence Helps in Drug Designing and Discovery.pptx by abhinashsahoo2001
Artificial Intelligence Helps in Drug Designing and Discovery.pptxArtificial Intelligence Helps in Drug Designing and Discovery.pptx
Artificial Intelligence Helps in Drug Designing and Discovery.pptx
abhinashsahoo2001117 views
himalay baruah acid fast staining.pptx by HimalayBaruah
himalay baruah acid fast staining.pptxhimalay baruah acid fast staining.pptx
himalay baruah acid fast staining.pptx
HimalayBaruah5 views
A training, certification and marketing scheme for informal dairy vendors in ... by ILRI
A training, certification and marketing scheme for informal dairy vendors in ...A training, certification and marketing scheme for informal dairy vendors in ...
A training, certification and marketing scheme for informal dairy vendors in ...
ILRI10 views

Pattern-Based Specification of Crowdsourcing Applications