Pick-A-Crowd: Tell Me What You Like,
and I’ll Tell You What to Do
A Crowdsourcing Platform for Personalized
Human Intelligence Task Assignment Based on Social
Networks

Djellel E. Difallah, Gianluca Demartini, Philippe Cudré-Mauroux
eXascale Infolab
University of Fribourg, Switzerland
15th May 2013, WWW 2013 - Rio De Janeiro, Brazil

1
Crowdsourcing
• Exploit human intelligence to solve tasks that
are simple for humans and complex for
machines
• Examples:
– Wikipedia, reCaptcha, Duolingo

• Incentives
– Financial, fun, visibility

2
Motivation
• The Pull Methodology is suboptimal

[Figure labels: Actual workers, Effective workers, Max Overlap]

3
Motivation
• The Push Methodology is a Task-to-Worker
Recommender System.

4
Contribution and Claim
• Pick-A-Crowd: A system architecture that uses
Task-to-Worker matching:
– The worker’s social profile
– The task context

• Workers can provide higher quality answers
on tasks they relate to

5
Worker Social Profiling

“YouAreWhatYouLike”

7
Problem Definition (1) – The Human Intelligence Task (HIT)
Categorization
Survey
Image Tagging
Data Collection

Batch of Tasks:
Title
Batch Instruction
Specific task instruction*
Task data:
- Text.
- Options.
- Additional data (image, URL)
List of categories*

8
Problem Definition (2) – The Worker

Completed HITs: 256
Approval Rate: 96%
Qualification Types
Generic Qualifications

Page:
- Title
- Category
- Description
- Feed, etc.
9
Problem Definition (3) –
Task-to-Worker Matching
Batch of Tasks:
Title
Batch Instruction
Specific task instruction*
Task data:
- Text.
- Options.
- Additional data (image, URL)
List of categories*

Page:
- Title
- Category
- Description
- Feed, etc.

1- Task-to-Page Matching Function
- Category
- Expert finding
- Semantic

2- Worker Ranking

10
Matching Models (1/3) –
Category Based
• The requester provides a list of categories related to the batch
• We create a subset of pages whose category is in the category
list of the batch
• Rank the workers by the number of liked pages in the subset
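
The three category-based steps above can be sketched as follows. This is a minimal illustration, not the Pick-A-Crowd implementation: the function name, the dictionary layout, and the sample pages and workers are all assumptions made up for the example.

```python
def rank_workers_by_category(batch_categories, page_category, worker_likes):
    """Rank workers by how many of their liked pages fall in the batch categories.

    batch_categories: categories supplied by the requester (hypothetical input).
    page_category:    page id -> category of that page.
    worker_likes:     worker id -> list of liked page ids.
    """
    wanted = set(batch_categories)
    # Subset of pages whose category is in the category list of the batch.
    subset = {page for page, cat in page_category.items() if cat in wanted}
    # Score each worker by the number of liked pages inside that subset.
    scores = {w: len(set(likes) & subset) for w, likes in worker_likes.items()}
    # Highest score first; worker id breaks ties so the order is deterministic.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy data, invented for illustration only.
page_category = {
    "FC Barcelona": "Soccer",
    "Real Madrid": "Soccer",
    "The Beatles": "Music",
    "Naruto": "Animes",
}
worker_likes = {
    "alice": ["FC Barcelona", "Real Madrid", "The Beatles"],
    "bob": ["The Beatles", "Naruto"],
    "carol": ["FC Barcelona"],
}
ranking = rank_workers_by_category(["Soccer"], page_category, worker_likes)
```

With this toy data, a batch tagged "Soccer" would be pushed first to the worker with the most soccer-related likes.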

11
Matching Models (2/3) –
Expert Finding
• Build an inverted index on the pages’ titles and descriptions
• Use the title/description of the tasks as a keyword query on the
inverted index and get a subset of pages
• Rank the workers by the number of liked pages in the subset
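
A small sketch of the expert-finding steps above, under stated assumptions: the tokenizer, the OR-style keyword query (any shared token counts as a match, with no stop-word removal or relevance scoring), and all names and sample data are hypothetical and not taken from the talk.

```python
import re
from collections import defaultdict


def tokenize(text):
    # Lowercase alphanumeric tokens; a deliberately naive tokenizer.
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(pages):
    """Map each token to the set of page ids whose title/description contain it."""
    index = defaultdict(set)
    for page_id, fields in pages.items():
        for token in tokenize(fields["title"] + " " + fields["description"]):
            index[token].add(page_id)
    return index


def match_pages(index, task_text):
    """Use the task title/description as a keyword query (OR over tokens)."""
    matched = set()
    for token in tokenize(task_text):
        matched |= index.get(token, set())
    return matched


def rank_workers(matched_pages, worker_likes):
    """Rank workers by the number of liked pages in the matched subset."""
    scores = {w: len(set(likes) & matched_pages) for w, likes in worker_likes.items()}
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy data, invented for illustration only.
pages = {
    "p1": {"title": "Cricket World", "description": "News about cricket matches"},
    "p2": {"title": "Jazz Club", "description": "Live music every night"},
    "p3": {"title": "Test Cricket Fans", "description": "Scores and players"},
}
worker_likes = {"dave": ["p1", "p3"], "erin": ["p2"], "frank": ["p3", "p2"]}

index = build_inverted_index(pages)
matched = match_pages(index, "Answer questions about cricket players")
ranking = rank_workers(matched, worker_likes)
```

A real system would likely weight matches (for example with TF-IDF) rather than count any shared token, so treat the query step here as the simplest possible stand-in.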

12
Matching Models (3/3) –
Semantic Based
• Link the context to an external knowledge base (e.g., DBpedia)
• Exploit the underlying graph structure to determine the similarity between HITs and pages
– Assumption that a worker who likes a page is able to answer questions about related entities
– Worker who likes a page is able to answer questions about entities of the same type
• Rank the workers by the number of liked pages in the subset

[Figure labels: HIT, FB Pages, Similarity, Relatedness, Type-Similarity]
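
The two semantic notions on this slide (same type, related entity) can be illustrated with a toy sketch. Nothing here queries DBpedia: plain dictionaries stand in for the knowledge base, and the entity names, types, relations, and function names are hand-made assumptions for the example only.

```python
def pages_by_type(hit_entities, page_entity, entity_type):
    """Pages linked to an entity that has the same type as an entity in the HIT."""
    hit_types = {entity_type[e] for e in hit_entities if e in entity_type}
    return {p for p, e in page_entity.items() if entity_type.get(e) in hit_types}


def pages_by_relatedness(hit_entities, page_entity, related):
    """Pages linked to an entity directly connected to an entity in the HIT."""
    neighbours = set()
    for e in hit_entities:
        neighbours |= related.get(e, set())
    return {p for p, e in page_entity.items() if e in neighbours}


def rank_workers(page_subset, worker_likes):
    """Rank workers by the number of liked pages in the subset."""
    scores = {w: len(set(likes) & page_subset) for w, likes in worker_likes.items()}
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy stand-in for a knowledge graph (hypothetical identifiers and edges).
entity_type = {
    "Lionel_Messi": "SoccerPlayer",
    "Cristiano_Ronaldo": "SoccerPlayer",
    "FC_Barcelona": "SoccerClub",
    "The_Beatles": "Band",
}
related = {"Lionel_Messi": {"FC_Barcelona"}}
# Each liked page is assumed to be already linked to one entity.
page_entity = {
    "page_ronaldo": "Cristiano_Ronaldo",
    "page_barca": "FC_Barcelona",
    "page_beatles": "The_Beatles",
}
worker_likes = {"gina": ["page_ronaldo", "page_beatles"], "hugo": ["page_barca"]}

hit_entities = {"Lionel_Messi"}
type_subset = pages_by_type(hit_entities, page_entity, entity_type)
related_subset = pages_by_relatedness(hit_entities, page_entity, related)
type_ranking = rank_workers(type_subset, worker_likes)
related_ranking = rank_workers(related_subset, worker_likes)
```

The two variants can rank the same workers differently, which is why they are worth evaluating separately.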

13
Pick-A-Crowd Architecture

15
Experimental Evaluation
• The Facebook app OpenTurk implements part
of the Pick-A-Crowd architecture:
– More than 170 registered workers participated
– Over 12k pages crawled

• Covered both multiple-choice questions and
open-ended questions
– 50 images, each with a multiple-choice question and 5 candidate answers
(Soccer, Actors, Music, Authors, Movies, Animes)
– 20 open-ended questions related to one topic (Cricket)

16
OpenTurk app

18
Evaluation –
Correlation between the crowd accuracy and
the number of relevant likes (Category Based)

[Plot axis labels: WORKER PRECISION, NUMBER OF RELEVANT LIKES]

19
Evaluation (Baseline) –
Amazon Mechanical Turk (AMT)

AMT 3 = Majority vote of 3 workers
AMT 5 = Majority vote of 5 workers
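
For readers unfamiliar with the baseline aggregation, here is a minimal sketch of majority voting over the answers of 3 or 5 workers. The tie-breaking rule (alphabetical) is an assumption added for determinism; the slides do not say how ties were handled, and the sample answers are invented.

```python
from collections import Counter


def majority_vote(answers):
    """Return the most frequent answer among the workers' answers.

    Ties are broken alphabetically; this rule is an assumption of the sketch.
    """
    if not answers:
        raise ValueError("at least one answer is required")
    counts = Counter(answers)
    best_count = max(counts.values())
    winners = sorted(a for a, c in counts.items() if c == best_count)
    return winners[0]


# Toy answers from 3 and 5 hypothetical workers on one multiple-choice HIT.
amt3 = majority_vote(["Soccer", "Music", "Soccer"])
amt5 = majority_vote(["Actors", "Soccer", "Actors", "Music", "Actors"])
```

With five candidate answers per question, even an odd number of workers can produce a tie (for example, three workers giving three different answers), so some tie-breaking rule is needed in practice.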
20
Evaluation –
HIT Assignment Models
CATEGORY APPROACH

21
Evaluation –
HIT Assignment Models
EXPERT FINDING BASED

TITLE/INSTRUCTION

CONTENT

22
Evaluation –
HIT Assignment Models
SEMANTIC BASED

TYPE

RELATEDNESS

23
Evaluation Comparison With Mechanical Turk

[Figure labels: PICK-A-CROWD, AMT]

24
Conclusions and Future Work
• Pull vs. Push methodologies in Crowdsourcing
• Pick-A-Crowd system architecture with Task-to-Worker recommendation
• Experimental comparison with AMT shows a
consistent quality improvement
“Workers Know what they Like”
• Exploit more of the social activity, and handle
content-less tasks
25
Next Step
• We are building a Crowdsourcing platform for
the research community
• Pre-register on:

www.openturk.com
Thank You!
26

More Related Content

What's hot

OSCOSS: Opening Scholarly Communication in Social Sciences
OSCOSS: Opening Scholarly Communication in Social SciencesOSCOSS: Opening Scholarly Communication in Social Sciences
OSCOSS: Opening Scholarly Communication in Social Sciences
Christoph Lange
 
CrowdSourcing- Location based Quries
CrowdSourcing- Location based QuriesCrowdSourcing- Location based Quries
CrowdSourcing- Location based Quries
purushottam02468
 
06 Network Study Design: Ethical Considerations and Safeguards
06 Network Study Design: Ethical Considerations and Safeguards06 Network Study Design: Ethical Considerations and Safeguards
06 Network Study Design: Ethical Considerations and Safeguards
dnac
 
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Academia Sinica
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
Patti Anklam
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis Workshop
Data Works MD
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Xiaohan Zeng
 
Proposal final
Proposal finalProposal final
Proposal final
Mido Razaz
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
Toronto Metropolitan University
 
Webometrics report
Webometrics reportWebometrics report
Webometrics report
Orpha Mangaco
 
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Timo Wandhoefer
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview.
Doug Needham
 
A Community of Quality: Using Social Network Analysis to Study University-Wid...
A Community of Quality: Using Social Network Analysis to Study University-Wid...A Community of Quality: Using Social Network Analysis to Study University-Wid...
A Community of Quality: Using Social Network Analysis to Study University-Wid...
Stephanie Richter
 
Collaboration, Technology and Libraries
Collaboration, Technology and LibrariesCollaboration, Technology and Libraries
Collaboration, Technology and Libraries
Dr. Margaret (Meg) Westbury
 
Future of Journalism - civil discourse technologies
Future of Journalism - civil discourse technologiesFuture of Journalism - civil discourse technologies
Future of Journalism - civil discourse technologies
Simon Buckingham Shum
 
Mattmiddaghscatterplotppt
MattmiddaghscatterplotpptMattmiddaghscatterplotppt
Mattmiddaghscatterplotpptmattmidd
 
Online survey tools ppt 30-01-2016
Online survey tools ppt 30-01-2016Online survey tools ppt 30-01-2016
Online survey tools ppt 30-01-2016
Vasantha Raju N
 
Overview of Digital Publishing
Overview of Digital PublishingOverview of Digital Publishing
Overview of Digital Publishing
Philip Bourne
 
Tech Tools for Music Industry Teaching and Exploration
Tech Tools for Music Industry Teaching and ExplorationTech Tools for Music Industry Teaching and Exploration
Tech Tools for Music Industry Teaching and Exploration
Gigi Johnson
 
12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...
dnac
 

What's hot (20)

OSCOSS: Opening Scholarly Communication in Social Sciences
OSCOSS: Opening Scholarly Communication in Social SciencesOSCOSS: Opening Scholarly Communication in Social Sciences
OSCOSS: Opening Scholarly Communication in Social Sciences
 
CrowdSourcing- Location based Quries
CrowdSourcing- Location based QuriesCrowdSourcing- Location based Quries
CrowdSourcing- Location based Quries
 
06 Network Study Design: Ethical Considerations and Safeguards
06 Network Study Design: Ethical Considerations and Safeguards06 Network Study Design: Ethical Considerations and Safeguards
06 Network Study Design: Ethical Considerations and Safeguards
 
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis Workshop
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
 
Proposal final
Proposal finalProposal final
Proposal final
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Webometrics report
Webometrics reportWebometrics report
Webometrics report
 
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview.
 
A Community of Quality: Using Social Network Analysis to Study University-Wid...
A Community of Quality: Using Social Network Analysis to Study University-Wid...A Community of Quality: Using Social Network Analysis to Study University-Wid...
A Community of Quality: Using Social Network Analysis to Study University-Wid...
 
Collaboration, Technology and Libraries
Collaboration, Technology and LibrariesCollaboration, Technology and Libraries
Collaboration, Technology and Libraries
 
Future of Journalism - civil discourse technologies
Future of Journalism - civil discourse technologiesFuture of Journalism - civil discourse technologies
Future of Journalism - civil discourse technologies
 
Mattmiddaghscatterplotppt
MattmiddaghscatterplotpptMattmiddaghscatterplotppt
Mattmiddaghscatterplotppt
 
Online survey tools ppt 30-01-2016
Online survey tools ppt 30-01-2016Online survey tools ppt 30-01-2016
Online survey tools ppt 30-01-2016
 
Overview of Digital Publishing
Overview of Digital PublishingOverview of Digital Publishing
Overview of Digital Publishing
 
Tech Tools for Music Industry Teaching and Exploration
Tech Tools for Music Industry Teaching and ExplorationTech Tools for Music Industry Teaching and Exploration
Tech Tools for Music Industry Teaching and Exploration
 
12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...
 

Viewers also liked

Bay dc loco gen and motors 12 4-13
Bay dc loco gen and motors 12 4-13Bay dc loco gen and motors 12 4-13
Bay dc loco gen and motors 12 4-13Imarinwcmr
 
Medycyna holistyczna
Medycyna holistycznaMedycyna holistyczna
Medycyna holistycznaolgalasek
 
2013 11-25 d effective solutions -prostream
2013 11-25 d effective solutions -prostream2013 11-25 d effective solutions -prostream
2013 11-25 d effective solutions -prostream
Emil Hristov
 
Balore yeni baski part 1
Balore  yeni baski part 1Balore  yeni baski part 1
Balore yeni baski part 1EUROPAGES
 
Jurisdiction Chart Required at the time of Service Tax Registration in Delhi
Jurisdiction Chart Required at the time of Service Tax Registration in DelhiJurisdiction Chart Required at the time of Service Tax Registration in Delhi
Jurisdiction Chart Required at the time of Service Tax Registration in Delhi
RVG & CO
 
Liberalization Privatization Globalization (LPG)
Liberalization Privatization Globalization (LPG)Liberalization Privatization Globalization (LPG)
Liberalization Privatization Globalization (LPG)
mayankravi
 
2013.10苦勞網收支
2013.10苦勞網收支2013.10苦勞網收支
2013.10苦勞網收支顥中 王
 
70 ejercicios practicos
70 ejercicios practicos70 ejercicios practicos
70 ejercicios practicos
Maikol Rojas Arias
 
Dinamicas grupales y tecnica grupales
Dinamicas grupales y tecnica grupalesDinamicas grupales y tecnica grupales
Dinamicas grupales y tecnica grupales
Maikol Rojas Arias
 
Paisajes de colombia
Paisajes de colombiaPaisajes de colombia
Paisajes de colombia
tatianagaa
 

Viewers also liked (11)

Bay dc loco gen and motors 12 4-13
Bay dc loco gen and motors 12 4-13Bay dc loco gen and motors 12 4-13
Bay dc loco gen and motors 12 4-13
 
Medycyna holistyczna
Medycyna holistycznaMedycyna holistyczna
Medycyna holistyczna
 
2013 11-25 d effective solutions -prostream
2013 11-25 d effective solutions -prostream2013 11-25 d effective solutions -prostream
2013 11-25 d effective solutions -prostream
 
Balore yeni baski part 1
Balore  yeni baski part 1Balore  yeni baski part 1
Balore yeni baski part 1
 
Jurisdiction Chart Required at the time of Service Tax Registration in Delhi
Jurisdiction Chart Required at the time of Service Tax Registration in DelhiJurisdiction Chart Required at the time of Service Tax Registration in Delhi
Jurisdiction Chart Required at the time of Service Tax Registration in Delhi
 
Liberalization Privatization Globalization (LPG)
Liberalization Privatization Globalization (LPG)Liberalization Privatization Globalization (LPG)
Liberalization Privatization Globalization (LPG)
 
2013.10苦勞網收支
2013.10苦勞網收支2013.10苦勞網收支
2013.10苦勞網收支
 
Bankruptcy advice U.K?
Bankruptcy advice U.K?Bankruptcy advice U.K?
Bankruptcy advice U.K?
 
70 ejercicios practicos
70 ejercicios practicos70 ejercicios practicos
70 ejercicios practicos
 
Dinamicas grupales y tecnica grupales
Dinamicas grupales y tecnica grupalesDinamicas grupales y tecnica grupales
Dinamicas grupales y tecnica grupales
 
Paisajes de colombia
Paisajes de colombiaPaisajes de colombia
Paisajes de colombia
 

Similar to Pick a Crowd

Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013Marco Brambilla
 
Sweeny group think-ias2015
Sweeny group think-ias2015Sweeny group think-ias2015
Sweeny group think-ias2015
Marianne Sweeny
 
Human Computation for Big Data
Human Computation for Big DataHuman Computation for Big Data
Human Computation for Big Data
eXascale Infolab
 
Social Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student PresentationsSocial Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student PresentationsLora Aroyo
 
Social machines: theory design and incentives
Social machines: theory design and incentivesSocial machines: theory design and incentives
Social machines: theory design and incentives
Elena Simperl
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine Learning
Julian Bright
 
Final PhD defense presentation
Final PhD defense presentationFinal PhD defense presentation
Final PhD defense presentation
Hussein Hazimeh
 
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingSLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
Umair ul Hassan
 
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
GUANGYUAN PIAO
 
Human Computation
Human ComputationHuman Computation
Human Computation
Irene Celino
 
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...
Marco Brambilla
 
Recommender Problems Introduction
Recommender Problems IntroductionRecommender Problems Introduction
Recommender Problems Introduction
Minh Nguyen
 
Information Access on Social Web
Information Access on Social WebInformation Access on Social Web
Information Access on Social Web
Daqing He
 
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
eswcsummerschool
 
Artificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of IntelligenceArtificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of Intelligence
Abhishek Upadhyay
 
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
Karthikeyan Umapathy
 
Researching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisResearching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media Analysis
Farida Vis
 
Crowdsourced Data Processing: Industry and Academic Perspectives
Crowdsourced Data Processing: Industry and Academic PerspectivesCrowdsourced Data Processing: Industry and Academic Perspectives
Crowdsourced Data Processing: Industry and Academic Perspectives
Aditya Parameswaran
 
AI @ Wholi - Bucharest.AI Meetup #5
AI @ Wholi - Bucharest.AI Meetup #5AI @ Wholi - Bucharest.AI Meetup #5
AI @ Wholi - Bucharest.AI Meetup #5
Traian Rebedea
 
Lecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationLecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and Visualisation
Marieke van Erp
 

Similar to Pick a Crowd (20)

Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013Choosing the right crowd. Expert finding in social networks. edbt 2013
Choosing the right crowd. Expert finding in social networks. edbt 2013
 
Sweeny group think-ias2015
Sweeny group think-ias2015Sweeny group think-ias2015
Sweeny group think-ias2015
 
Human Computation for Big Data
Human Computation for Big DataHuman Computation for Big Data
Human Computation for Big Data
 
Social Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student PresentationsSocial Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student Presentations
 
Social machines: theory design and incentives
Social machines: theory design and incentivesSocial machines: theory design and incentives
Social machines: theory design and incentives
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine Learning
 
Final PhD defense presentation
Final PhD defense presentationFinal PhD defense presentation
Final PhD defense presentation
 
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingSLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
 
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
 
Human Computation
Human ComputationHuman Computation
Human Computation
 
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...
Answering Search Queries with CrowdSearcher: a crowdsourcing and social netwo...
 
Recommender Problems Introduction
Recommender Problems IntroductionRecommender Problems Introduction
Recommender Problems Introduction
 
Information Access on Social Web
Information Access on Social WebInformation Access on Social Web
Information Access on Social Web
 
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
 
Artificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of IntelligenceArtificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of Intelligence
 
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
 
Researching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisResearching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media Analysis
 
Crowdsourced Data Processing: Industry and Academic Perspectives
Crowdsourced Data Processing: Industry and Academic PerspectivesCrowdsourced Data Processing: Industry and Academic Perspectives
Crowdsourced Data Processing: Industry and Academic Perspectives
 
AI @ Wholi - Bucharest.AI Meetup #5
AI @ Wholi - Bucharest.AI Meetup #5AI @ Wholi - Bucharest.AI Meetup #5
AI @ Wholi - Bucharest.AI Meetup #5
 
Lecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationLecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and Visualisation
 

More from eXascale Infolab

Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictionBeyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
eXascale Infolab
 
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
eXascale Infolab
 
Representation Learning on Complex Graphs
Representation Learning on Complex GraphsRepresentation Learning on Complex Graphs
Representation Learning on Complex Graphs
eXascale Infolab
 
A force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory mapA force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory map
eXascale Infolab
 
Cikm 2018
Cikm 2018Cikm 2018
Cikm 2018
eXascale Infolab
 
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
eXascale Infolab
 
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
eXascale Infolab
 
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data OceansDependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
eXascale Infolab
 
Crowd scheduling www2016
Crowd scheduling www2016Crowd scheduling www2016
Crowd scheduling www2016
eXascale Infolab
 
SANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutionSANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference Resolution
eXascale Infolab
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataEfficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked Data
eXascale Infolab
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data Management
eXascale Infolab
 
SSSW 2015 Sense Making
SSSW 2015 Sense MakingSSSW 2015 Sense Making
SSSW 2015 Sense Making
eXascale Infolab
 
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked DataLDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
eXascale Infolab
 
Executing Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web DataExecuting Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web Data
eXascale Infolab
 
The Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task CrowdsourcingThe Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task Crowdsourcing
eXascale Infolab
 
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
eXascale Infolab
 
CIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition rankingCIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition ranking
eXascale Infolab
 
OLTP-Bench
OLTP-BenchOLTP-Bench
OLTP-Bench
eXascale Infolab
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big Data
eXascale Infolab
 

More from eXascale Infolab (20)

Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictionBeyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
 
Pick a Crowd

  • 1. Pick-A-Crowd: Tell Me What You Like, and I’ll Tell You What to Do A Crowdsourcing Platform for Personalized Human Intelligence Task Assignment Based on Social Networks Djellel E. Difallah, Gianluca Demartini, Philippe Cudré-Mauroux eXascale Infolab University of Fribourg, Switzerland 15th May 2013, WWW 2013 - Rio De Janeiro, Brazil 1
  • 2. Crowdsourcing • Exploit human intelligence to solve tasks that are simple for Humans and complex for machines • Examples: – Wikipedia, reCaptcha, Duolingo • Incentives – Financial, fun, visibility 2
  • 3. Motivation • The Pull Methodology is suboptimal Actual workers Max Overlap Effective workers 3
  • 4. Motivation • The Push Methodology is a Task-to-Worker Recommender System. 4
  • 5. Contribution and Claim • Pick-A-Crowd: A system architecture that uses Task-to-Worker matching: – The worker’s social profile – The task context • Workers can provide higher quality answers on tasks they relate to 5
  • 7. Problem Definition (1) – The Human Intelligence Task (HIT) Task types: Categorization, Survey, Image Tagging, Data Collection Batch of Tasks: Title Batch Instruction Specific task instruction* Task data: - Text. - Options. - Additional data (image, URL) List of categories* 8
  • 8. Problem Definition (2) – The Worker Completed HITs: 256 Approval Rate: 96% Qualification Types Generic Qualifications Page: - Title - Category - Description - Feed, etc. 9
  • 9. Problem Definition (3) – Task-to-Worker Matching Batch of Tasks: Title Batch Instruction Specific task instruction* Task data: - Text. - Options. - Additional data (image, URL) List of categories* Page: - Title - Category - Description - Feed, etc. 1- Task-to-Page Matching Function - Category - Expert finding - Semantic 2- Worker Ranking 10
  • 10. Matching Models (1/3) – Category Based • The requester provides a list of categories related to the batch • We create a subset of pages whose category is in the category list of the batch • Rank the workers by the number of liked pages in the subset 11
  • 11. Matching Models (2/3) – Expert Finding • Build an inverted index on the pages’ titles and descriptions • Use the title/description of the tasks as a keyword query on the inverted index and get a subset of pages • Rank the workers by the number of liked pages in the subset 12
  • 12. Matching Models (3/3) – Semantic Based • Link the context to an external knowledge base (e.g., DBpedia) • Exploit the underlying graph structure to determine the similarity between HITs and pages – Relatedness: assumption that a worker who likes a page is able to answer questions about related entities – Type-similarity: a worker who likes a page is able to answer questions about entities of the same type • Rank the workers by the number of liked pages in the subset [Figure labels: HIT, FB Pages; Similarity, Relatedness, Type-Similarity] 13
  • 14. Experimental Evaluation • The Facebook app OpenTurk implements part of the Pick-A-Crowd architecture: – More than 170 registered workers participated – Over 12k pages crawled • Covered both multiple-choice questions and open-ended questions – 50 images, each with a multiple-choice question and 5 candidate answers (Soccer, Actors, Music, Authors, Movies, Animes) – 20 open-ended questions related to one topic (Cricket) 16
  • 16. Evaluation - WORKER PRECISION Correlation between the crowd accuracy and the number of relevant likes (Category Based) NUMBER OF RELEVANT LIKES 19
  • 17. Evaluation (Baseline) – Amazon Mechanical Turk (AMT) AMT 3 = Majority vote of 3 workers AMT 5 = Majority vote of 5 workers 20
  • 18. Evaluation – HIT Assignment Models CATEGORY APPROACH 21
  • 19. Evaluation – HIT Assignment Models EXPERT FINDING BASED TITLE/INSTRUCTION CONTENT 22
  • 20. Evaluation – HIT Assignment Models SEMANTIC BASED TYPE RELATEDNESS 23
  • 22. Conclusions and Future Work • Pull vs. Push methodologies in Crowdsourcing • Pick-A-Crowd system architecture with Task-to-Worker recommendation • Experimental comparison with AMT shows a consistent quality improvement “Workers Know what they Like” • Exploit more of the social activity, and handle content-less tasks 25
  • 23. Next Step • We are building a Crowdsourcing platform for the research community • Pre-register on: www.openturk.com Thank You! 26
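
The category-based and expert-finding matching models (slides 10 and 11) and the majority-vote baseline (slide 17) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Pick-A-Crowd or OpenTurk implementation: the toy `PAGES` and `LIKES` data, all function names, and the whitespace tokenizer are hypothetical, and the expert-finding step here uses a plain set-based inverted index rather than whatever retrieval engine the authors used.

```python
from collections import Counter, defaultdict

# Hypothetical toy data: page id -> (title, category, description)
PAGES = {
    "p1": ("FC Barcelona", "Soccer", "official page of the soccer club"),
    "p2": ("Lionel Messi", "Soccer", "argentine soccer player"),
    "p3": ("Radiohead", "Music", "english rock band"),
}
# Hypothetical worker id -> set of liked page ids
LIKES = {
    "alice": {"p1", "p2"},
    "bob": {"p3"},
    "carol": {"p2", "p3"},
}


def rank_workers(page_subset, likes):
    """Rank workers by the number of liked pages in the subset (descending).

    Workers with no liked page in the subset are dropped; ties are broken
    alphabetically, which is an arbitrary choice made for this sketch.
    """
    scores = {w: len(liked & page_subset) for w, liked in likes.items()}
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(w, s) for w, s in ranked if s > 0]


def category_match(batch_categories, pages):
    """Category-based model: pages whose category is in the batch's list."""
    cats = {c.lower() for c in batch_categories}
    return {pid for pid, (_, cat, _) in pages.items() if cat.lower() in cats}


def build_inverted_index(pages):
    """Map each lowercase token of a page's title/description to page ids."""
    index = defaultdict(set)
    for pid, (title, _, desc) in pages.items():
        for token in f"{title} {desc}".lower().split():
            index[token].add(pid)
    return index


def keyword_match(query, index):
    """Expert-finding model: pages sharing at least one token with the query."""
    hits = set()
    for token in query.lower().split():
        hits |= index.get(token, set())
    return hits


def majority_vote(answers):
    """Aggregate several workers' answers by taking the most common one."""
    return Counter(answers).most_common(1)[0][0]


if __name__ == "__main__":
    soccer_pages = category_match(["Soccer"], PAGES)
    print(rank_workers(soccer_pages, LIKES))

    index = build_inverted_index(PAGES)
    band_pages = keyword_match("Which rock band is this", index)
    print(rank_workers(band_pages, LIKES))

    print(majority_vote(["A", "B", "A"]))
```

Both models reduce to the same final step, ranking workers by how many of their liked pages fall in the selected subset; they differ only in how that subset of pages is chosen. The semantic model (slide 12) would swap in a third selection function backed by a knowledge base, which is omitted here because it needs external data.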