SlideShare a Scribd company logo
1 of 38
Download to read offline
Crowdsourcing Pareto-Optimal Object
Finding by Pairwise Comparisons
Abolfazl Asudeh, Gensheng Zhang, Naeemul Hassan, Chengkai Li, Gergely V. Zaruba
Department of Computer Science and Engineering
University of Texas at Arlington
CIKM, October 21, 2015, Melbourne, Australia
©2015 The University of Texas at Arlington. All Rights Reserved.
Humans’ Obsession: Comparing Things
Who is the better footballer?
Which is the better company to work for?
http://visual.ly/messi-vs-ronaldo
©2015 The University of Texas at Arlington. All Rights Reserved.
Facebook vs. Google
http://www.businessinsider.com/facebook-vs-google-best-employer-2013-11
©2015 The University of Texas at Arlington. All Rights Reserved.
Facebook vs. Google
Overall
satisfaction
CEO approval Employee
confidence in the
future
Perks and salaries Interview
difficulty
≻ ≻ ≻ ≻≻
Multiple criteria
Which one is better?
©2015 The University of Texas at Arlington. All Rights Reserved.
Overview
1. Ask crowd to compare objects on individual criteria
2. Derive partial knowledge about preference relations based
on responses from crowd
3. Find all Pareto-optimal objects without exhausting all
possible comparison questions
General framework and its instantiations (algorithms), with the
goal of minimizing number of questions
Preference (Better-Than) Relation Pc
(x , y) ∈ Pc (or x ≻c y)
“x is better than (preferred over) y with regard to criterion c”
Assumptions on data model:
o Pc is a strict partial order (as opposed to a total order)
Irreflexivity, Transitivity, Asymmetry
o No explicit attribute representation,
thus no equivalence on a criterion
Notations:
o (indifferent) x ∼c y ⇔ (x, y) ∉ Pc ∧ (y, x) ∉ Pc
o (not better than) x ⊁c y ⇔ (x, y) ∉ Pc
Pareto-optimal Objects
Object dominance: consider objects O, criteria C
y ≻ x ⇔ ∀c ∈ C : x ⊁c y and ∃c ∈ C such that y ≻c x (i.e., x is not
better than y by any criterion and y is better than x by at least one criterion)
x ∈ O is Pareto-optimal ⇔ x is not dominated by any
other object
Example:
o c ≻ d ⇐ c ≻story d, c ≻music d, c ∼acting d
o Only one Pareto-optimal object: b
Deriving Preference Relations by Aggregating
Crowd’s Responses to Pairwise Comparisons
Pareto-Optimal Object Finding
Problem statement: Given objects O and criteria C, find all Pareto-optimal
objects, using pairwise comparisons by individual criteria
Cost metric
o Goal: as few pairwise comparison questions as possible
o Simple, but reflect real-world monetary cost and time delay
o Brute-force approach : |C| x |O| x (|O|-1)/2 questions
Assumptions on execution model
o Sequential execution: get rlt(qi) before asking qi+1
o No consideration of worker quality
Applications
Collecting Public Opinion
o Best companies to work for, best cities to live in
Group Decision Making
o Where for lunch, which product to use, which candidate to hire
Information Exploration
o Compare photos by color, sharpness, and landscape
Back to the “which one is better”?
o After finding Pareto-optimal objects, further actions (ranking, filtering,
visualization) to find desirable objects
Related Work
[12]Chenetal.WSDM13 || [15]Davidsonetal. ICDT13 || [17]Grozetal. PODS15 ||
[23]Lofietal. EDBT13 || [25]Polychronopoulosetal.WebDB13 || [4] technicalreportof
thispaper
Otherrelatedwork:collaborativefiltering,learningtorank,…
Related Work
Explicit attribute representation
o Total order on ordinal attributes (sizes of houses, ratings of
restaurants), Partial-order on categorical attributes (genres of movies)
o Not always easy to model and/or for users to provide missing values
(e.g., story of movies)
Pairwise comparisons
o Known to be easier, faster, and less error-prone. Widely used in social
choice and welfare, preferences, and voting
Partial order vs. Total order
o A direct effect of using pairwise comparisons
o Not always natural to enforce a total order
General, Iterative Algorithm Framework
4-steps in each iteration
(1) Question selection
(2) Outcome derivation
(3) Contradiction resolution
(4) Termination test
(2) Outcome Derivation
{
Question
o q = x ?c y (compare x and y by criterion c)
Three possible outcomes based on voting
x ≻c y
rlt(q) = y ≻c x
x ∼c y
The framework is agnostic to the outcome derivation method.
Other conceivable method can be plugged in.
(3) Contradiction Resolution
Assume transitivity in preference relation, and enforce it.
x
y
z
criterion c
case 1
(3) Contradiction Resolution
Assume transitivity in preference relation, and enforce it.
x
y
z
criterion c
assumed
case 1
(3) Contradiction Resolution
Assume transitivity in preference relation, and enforce it.
x
y
z
criterion c
assumed
x y
z
~
criterion c
case 1 case 2
(3) Contradiction Resolution
Assume transitivity in preference relation, and enforce it.
x
y
z
criterion c
assumed
x y
z
~
~
criterion c
assumedcase 1 case 2
(4) Termination Test
At the end of each iteration, objects are partitioned into 3 sets, based on
incomplete preference relations R+
(Q) so far.
o O√ : Pareto-optimal objects
o O× : Non Pareto-optimal objects
o O? : R+
(Q) is insufficient for discerning these objects’ Pareto-optimality
o O? = ∅ ⇒ terminate
(1) Question Selection
The process of executing a question sequence Q = 〈q1, ..., qn〉
o Q is a terminal sequence if O? = ∅ based on R+
(Q).
o Goal: among many terminal sequences, execute a short sequence
Lower bound
o Theorem2: Atldeast(|O|−k)×|C|+(k−1)×2pairwisecomparisonquestionsare
necessary,wherekisthe numberof Pareto-optimalobjects.
Bad news
o Worst-case : |C| x |O| x (|O|-1)/2 questions; cannot do better than brute-force
o E.g., suppose all objects are indifferent by every criterion. If any comparison x ?c y is
skipped, we will not be able to determine if x and y are indifferent or if one dominates
another.
Transitivity of Object Dominance: Doesn’t Hold
A cost-saving property for skyline queries
o Object dominance transitivity: x ≻ y, y ≻ z ⇒ x ≻ z
o Immediately prune a dominated object from further comparison.
(Any object dominated by y is also dominated by x.)
Transitivity of Object Dominance: Doesn’t Hold
Fundamental reason: lack of explicit attribute representation
o In skyline/preference queries:
on any attribute, x >= y, y>=z ⇒ x>=z.
o In Pareto-optimal object finding:
x ≻c or ∼c y, y ≻c or ∼c z ⇒ x ≻c or ∼c z (not true)
Even possible that an object is dominated by only one non-
Pareto optimal object.
Can Still Benefit From A Similar Idea
For a non-Pareto optimal object, we only need to know at least
one object dominates it. We don’t care about which other objects
also dominate it.
Overriding principle of the framework:
o Identify non-Pareto objects as early as possible
o Postpone their comparisons with other objects as much as possible
Candidate Questions
Given asked questions Q = 〈q1, ..., qn〉, x ?c y is a
candidate question iff it satisfies 3 conditions:
(i) The outcome of x ?c y is unknown yet, i.e., rlt(x ?c y) ∉ R+
(Q)
(ii) x ∈ O?
(iii) Based on R+(Q), y ≻ x is not ruled out yet.
How to rule out y ≻ x ?
∃c∈C such that x ≻c y ∈ R+
(Q) ⇒ y⊁x
Only Choosing from Candidate Questions
Sufficient
Property 2: Qcan = ∅ ⇔ O? = ∅
Efficient
Theorem 1: If Q contains non-candidate questions, there exists a shorter
or equally long sequence Q’ without non-candidate questions such that Q’
finds at least all dominated objects found by Q.
Macro-ordering, Micro-ordering
Both guided by the overriding principle
Macro-ordering:Whenavailable,wechoosea candidatequestionx?c y suchthaty ∉ O×
Micro-ordering:Severalquestionorderingheuristics:
o Random Question (RandomQ): randomly choose a candidate question x ?c y
o Random Pair (RandomP): randomly choose a candidate question x ?c y,
continue to finish all remaining candidate questions between x and y.
o Fewest Remaining Questions (FRQ): Choose a pair with the fewest
remaining questions. Ties are broken based on how many objects are
better/worse than x and y on the criterion.
Experiments by Simulation
o Used an 10000-tuple NBA dataset that records players’ per-
season performance on 10 criteria (points-per-game, …)
o Simulated Partial orders based on players’ performance
comparison, with some perturbations.
Effectiveness of Candidate Questions and Macro-
Ordering
Effectiveness of Micro-Ordering
Experiments using Amazon Mechanical Turk
o Compare 100 photos of UT-Arlington campus, by color, sharpness,
landscape.
o All 14, 850 possible pairwise questions were partitioned into 1, 650 tasks,
each containing 9 questions on a criterion.
o Worker qualification:
 responded to at least 100 HITs before with at least 90% approval rate
 2 additional validation questions mixed in each task
Experiments using Amazon Mechanical Turk
Limitations and Future Work
No performance guarantee: as bad as brute-force in worst-case
Non-deterministic results: due to contradiction resolution
Possibly empty result: due to lack of object dominance transitivity
No consideration of different levels of confidence on question
outcomes or crowdsourcers' quality.
Future work: Pareto-optimal objects in probabilistic sense?
No consideration of parallel/batch-execution scheme
Future work: Parallel scheme
Thank You! Questions?
Chengkai Li
http://ranger.uta.edu/~cli
cli@uta.edu
©2015 The University of Texas at Arlington. All Rights Reserved.
(3) Contradiction Resolution
How often does contradiction happen?
o Depends on data itself, k, and θ
o It may not happen a lot.
 Intuitively: as long as the underlying relation is transitive,
collective wisdom of crowd should reflect it.
 Preference judgments of relevance in document retrieval are
transitive [27, 11].
6 objects, 3 criteria: 45 comparisons
Brute-Force on the Toy Example
RandomQ on the Toy Example
RandomP on the Toy Example
FRQ on the Toy Example

More Related Content

Similar to Crowdsourcing Pareto-Optimal Object Finding by Pairwise Comparisons

week9_Machine_Learning.ppt
week9_Machine_Learning.pptweek9_Machine_Learning.ppt
week9_Machine_Learning.ppt
butest
 
Machine learning and Neural Networks
Machine learning and Neural NetworksMachine learning and Neural Networks
Machine learning and Neural Networks
butest
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
butest
 
3_learning.ppt
3_learning.ppt3_learning.ppt
3_learning.ppt
butest
 
Topic_6
Topic_6Topic_6
Topic_6
butest
 
CS364 Artificial Intelligence Machine Learning
CS364 Artificial Intelligence Machine LearningCS364 Artificial Intelligence Machine Learning
CS364 Artificial Intelligence Machine Learning
butest
 
Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...
Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...
Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...
Codiax
 
k Nearest Neighbor
k Nearest Neighbork Nearest Neighbor
k Nearest Neighbor
butest
 

Similar to Crowdsourcing Pareto-Optimal Object Finding by Pairwise Comparisons (20)

Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017
 
Machine learning in science and industry — day 1
Machine learning in science and industry — day 1Machine learning in science and industry — day 1
Machine learning in science and industry — day 1
 
nnml.ppt
nnml.pptnnml.ppt
nnml.ppt
 
week9_Machine_Learning.ppt
week9_Machine_Learning.pptweek9_Machine_Learning.ppt
week9_Machine_Learning.ppt
 
Item analysis
Item analysisItem analysis
Item analysis
 
Machine learning and Neural Networks
Machine learning and Neural NetworksMachine learning and Neural Networks
Machine learning and Neural Networks
 
Item Response Theory in Constructing Measures
Item Response Theory in Constructing MeasuresItem Response Theory in Constructing Measures
Item Response Theory in Constructing Measures
 
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Machine learning by Dr. Vivek Vijay and Dr. Sandeep YadavMachine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Machine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.pptMachine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.ppt
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learning
 
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAIDeep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
 
3_learning.ppt
3_learning.ppt3_learning.ppt
3_learning.ppt
 
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross EntropyRecommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
 
Topic_6
Topic_6Topic_6
Topic_6
 
CS364 Artificial Intelligence Machine Learning
CS364 Artificial Intelligence Machine LearningCS364 Artificial Intelligence Machine Learning
CS364 Artificial Intelligence Machine Learning
 
Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...
Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...
Sean Holden (University of Cambridge) - Proving Theorems_ Still A Major Test ...
 
k Nearest Neighbor
k Nearest Neighbork Nearest Neighbor
k Nearest Neighbor
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 

More from The Innovative Data Intelligence Research (IDIR) Laboratory, University of Texas at Arlington

Enabling Computational Journalism: Automated Fact-Checking and Story-Finding
Enabling Computational Journalism: Automated Fact-Checking and Story-FindingEnabling Computational Journalism: Automated Fact-Checking and Story-Finding
Enabling Computational Journalism: Automated Fact-Checking and Story-Finding
The Innovative Data Intelligence Research (IDIR) Laboratory, University of Texas at Arlington
 
Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...
Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...
Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...
The Innovative Data Intelligence Research (IDIR) Laboratory, University of Texas at Arlington
 
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous GraphsTackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
The Innovative Data Intelligence Research (IDIR) Laboratory, University of Texas at Arlington
 
Dynamic Symbolic Database Application Testing
Dynamic Symbolic Database Application TestingDynamic Symbolic Database Application Testing
Incremental Discovery of Prominent Situational Facts
Incremental Discovery of Prominent Situational FactsIncremental Discovery of Prominent Situational Facts
Towards a Query-by-Example System for Knowledge Graphs
Towards a Query-by-Example System for Knowledge GraphsTowards a Query-by-Example System for Knowledge Graphs

More from The Innovative Data Intelligence Research (IDIR) Laboratory, University of Texas at Arlington (20)

Enabling Computational Journalism: Automated Fact-Checking and Story-Finding
Enabling Computational Journalism: Automated Fact-Checking and Story-FindingEnabling Computational Journalism: Automated Fact-Checking and Story-Finding
Enabling Computational Journalism: Automated Fact-Checking and Story-Finding
 
Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...
Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...
Restoring Trust by Computing: Data-driven Fact-checking and Exceptional Fact ...
 
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous GraphsTackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
 
Dynamic Symbolic Database Application Testing
Dynamic Symbolic Database Application TestingDynamic Symbolic Database Application Testing
Dynamic Symbolic Database Application Testing
 
Facetedpedia: Dynamic Generation of Query-Dependent Faceted Interfaces for Wi...
Facetedpedia: Dynamic Generation of Query-Dependent Faceted Interfaces for Wi...Facetedpedia: Dynamic Generation of Query-Dependent Faceted Interfaces for Wi...
Facetedpedia: Dynamic Generation of Query-Dependent Faceted Interfaces for Wi...
 
Entity-Relationship Queries over Wikipedia
Entity-Relationship Queries over WikipediaEntity-Relationship Queries over Wikipedia
Entity-Relationship Queries over Wikipedia
 
Prominent Streak Discovery in Sequence Data
Prominent Streak Discovery in Sequence DataProminent Streak Discovery in Sequence Data
Prominent Streak Discovery in Sequence Data
 
On Skyline Groups
On Skyline GroupsOn Skyline Groups
On Skyline Groups
 
Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScout
Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScoutAnything You Can Do, I Can Do Better: Finding Expert Teams by CrewScout
Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScout
 
Incremental Discovery of Prominent Situational Facts
Incremental Discovery of Prominent Situational FactsIncremental Discovery of Prominent Situational Facts
Incremental Discovery of Prominent Situational Facts
 
Towards a Query-by-Example System for Knowledge Graphs
Towards a Query-by-Example System for Knowledge GraphsTowards a Query-by-Example System for Knowledge Graphs
Towards a Query-by-Example System for Knowledge Graphs
 
Data In, Fact Out: Automated Monitoring of Facts by FactWatcher
Data In, Fact Out: Automated Monitoring of Facts by FactWatcherData In, Fact Out: Automated Monitoring of Facts by FactWatcher
Data In, Fact Out: Automated Monitoring of Facts by FactWatcher
 
Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScoutCrewsc...
Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScoutCrewsc...Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScoutCrewsc...
Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScoutCrewsc...
 
VIIQ: Auto-suggestion Enabled Visual Interface for Interactive Graph Query Fo...
VIIQ: Auto-suggestion Enabled Visual Interface for Interactive Graph Query Fo...VIIQ: Auto-suggestion Enabled Visual Interface for Interactive Graph Query Fo...
VIIQ: Auto-suggestion Enabled Visual Interface for Interactive Graph Query Fo...
 
Generating Preview Tables for Entity Graphs
Generating Preview Tables for Entity GraphsGenerating Preview Tables for Entity Graphs
Generating Preview Tables for Entity Graphs
 
Comparing Automated Factual Claim Detection Against Judgments of Journalism O...
Comparing Automated Factual Claim Detection Against Judgments of Journalism O...Comparing Automated Factual Claim Detection Against Judgments of Journalism O...
Comparing Automated Factual Claim Detection Against Judgments of Journalism O...
 
ClaimBuster: The First-ever End-to-end Fact-checking System
ClaimBuster: The First-ever End-to-end Fact-checking SystemClaimBuster: The First-ever End-to-end Fact-checking System
ClaimBuster: The First-ever End-to-end Fact-checking System
 
Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by Clai...
Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by Clai...Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by Clai...
Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by Clai...
 
TableView: A Visual Interface for Generating Preview Tables of Entity Graphs
TableView: A Visual Interface for Generating Preview Tables of Entity GraphsTableView: A Visual Interface for Generating Preview Tables of Entity Graphs
TableView: A Visual Interface for Generating Preview Tables of Entity Graphs
 
Maverick: Discovering Exceptional Facts from Knowledge Graphs
Maverick: Discovering Exceptional Facts from Knowledge GraphsMaverick: Discovering Exceptional Facts from Knowledge Graphs
Maverick: Discovering Exceptional Facts from Knowledge Graphs
 

Recently uploaded

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 

Recently uploaded (20)

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 

Crowdsourcing Pareto-Optimal Object Finding by Pairwise Comparisons

  • 1. Crowdsourcing Pareto-Optimal Object Finding by Pairwise Comparisons Abolfazl Asudeh, Gensheng Zhang, Naeemul Hassan, Chengkai Li, Gergely V. Zaruba Department of Computer Science and Engineering University of Texas at Arlington CIKM, October 21, 2015, Melbourne, Australia ©2015 The University of Texas at Arlington. All Rights Reserved.
  • 2. Humans’ Obsession: Comparing Things Who is the better footballer? Which is the better company to work for? http://visual.ly/messi-vs-ronaldo ©2015 The University of Texas at Arlington. All Rights Reserved.
  • 4. Facebook vs. Google Overall satisfaction CEO approval Employee confidence in the future Perks and salaries Interview difficulty ≻ ≻ ≻ ≻≻ Multiple criteria Which one is better? ©2015 The University of Texas at Arlington. All Rights Reserved.
  • 5. Overview 1. Ask crowd to compare objects on individual criteria 2. Derive partial knowledge about preference relations based on responses from crowd 3. Find all Pareto-optimal objects without exhausting all possible comparison questions General framework and its instantiations (algorithms), with the goal of minimizing number of questions
  • 6. Preference (Better-Than) Relation Pc (x , y) ∈ Pc (or x ≻c y) “x is better than (preferred over) y with regard to criterion c” Assumptions on data model: o Pc is a strict partial order (as opposed to a total order) Irreflexivity, Transitivity, Asymmetry o No explicit attribute representation, thus no equivalence on a criterion Notations: o (indifferent) x ∼c y ⇔ (x, y) ∉ Pc ∧ (y, x) ∉ Pc o (not better than) x ⊁c y ⇔ (x, y) ∉ Pc
  • 7. Pareto-optimal Objects Object dominance: consider objects O, criteria C y ≻ x ⇔ ∀c ∈ C : x ⊁c y and ∃c ∈ C such that y ≻c x (i.e., x is not better than y by any criterion and y is better than x by at least one criterion) x ∈ O is Pareto-optimal ⇔ x is not dominated by any other object Example: o c ≻ d ⇐ c ≻story d, c ≻music d, c ∼acting d o Only one Pareto-optimal object: b
  • 8. Deriving Preference Relations by Aggregating Crowd’s Responses to Pairwise Comparisons
  • 9. Pareto-Optimal Object Finding Problem statement: Given objects O and criteria C, find all Pareto-optimal objects, using pairwise comparisons by individual criteria Cost metric o Goal: as few pairwise comparison questions as possible o Simple, but reflect real-world monetary cost and time delay o Brute-force approach : |C| x |O| x (|O|-1)/2 questions Assumptions on execution model o Sequential execution: get rlt(qi) before asking qi+1 o No consideration of worker quality
  • 10. Applications Collecting Public Opinion o Best companies to work for, best cities to live in Group Decision Making o Where for lunch, which product to use, which candidate to hire Information Exploration o Compare photos by color, sharpness, and landscape Back to the “which one is better”? o After finding Pareto-optimal objects, further actions (ranking, filtering, visualization) to find desirable objects
  • 11. Related Work [12]Chenetal.WSDM13 || [15]Davidsonetal. ICDT13 || [17]Grozetal. PODS15 || [23]Lofietal. EDBT13 || [25]Polychronopoulosetal.WebDB13 || [4] technicalreportof thispaper Otherrelatedwork:collaborativefiltering,learningtorank,…
  • 12. Related Work Explicit attribute representation o Total order on ordinal attributes (sizes of houses, ratings of restaurants), Partial-order on categorical attributes (genres of movies) o Not always easy to model and/or for users to provide missing values (e.g., story of movies) Pairwise comparisons o Known to be easier, faster, and less error-prone. Widely used in social choice and welfare, preferences, and voting Partial order vs. Total order o A direct effect of using pairwise comparisons o Not always natural to enforce a total order
  • 13. General, Iterative Algorithm Framework 4-steps in each iteration (1) Question selection (2) Outcome derivation (3) Contradiction resolution (4) Termination test
  • 14. (2) Outcome Derivation { Question o q = x ?c y (compare x and y by criterion c) Three possible outcomes based on voting x ≻c y rlt(q) = y ≻c x x ∼c y The framework is agnostic to the outcome derivation method. Other conceivable method can be plugged in.
  • 15. (3) Contradiction Resolution Assume transitivity in preference relation, and enforce it. x y z criterion c case 1
  • 16. (3) Contradiction Resolution Assume transitivity in preference relation, and enforce it. x y z criterion c assumed case 1
  • 17. (3) Contradiction Resolution Assume transitivity in preference relation, and enforce it. x y z criterion c assumed x y z ~ criterion c case 1 case 2
  • 18. (3) Contradiction Resolution Assume transitivity in preference relation, and enforce it. x y z criterion c assumed x y z ~ ~ criterion c assumedcase 1 case 2
  • 19. (4) Termination Test At the end of each iteration, objects are partitioned into 3 sets, based on incomplete preference relations R+ (Q) so far. o O√ : Pareto-optimal objects o O× : Non Pareto-optimal objects o O? : R+ (Q) is insufficient for discerning these objects’ Pareto-optimality o O? = ∅ ⇒ terminate
  • 20. (1) Question Selection The process of executing a question sequence Q = 〈q1, ..., qn〉 o Q is a terminal sequence if O? = ∅ based on R+ (Q). o Goal: among many terminal sequences, execute a short sequence Lower bound o Theorem2: Atldeast(|O|−k)×|C|+(k−1)×2pairwisecomparisonquestionsare necessary,wherekisthe numberof Pareto-optimalobjects. Bad news o Worst-case : |C| x |O| x (|O|-1)/2 questions; cannot do better than brute-force o E.g., suppose all objects are indifferent by every criterion. If any comparison x ?c y is skipped, we will not be able to determine if x and y are indifferent or if one dominates another.
  • 21. Transitivity of Object Dominance: Doesn’t Hold A cost-saving property for skyline queries o Object dominance transitivity: x ≻ y, y ≻ z ⇒ x ≻ z o Immediately prune a dominated object from further comparison. (Any object dominated by y is also dominated by x.)
  • 22. Transitivity of Object Dominance: Doesn’t Hold Fundamental reason: lack of explicit attribute representation o In skyline/preference queries: on any attribute, x >= y, y>=z ⇒ x>=z. o In Pareto-optimal object finding: x ≻c or ∼c y, y ≻c or ∼c z ⇒ x ≻c or ∼c z (not true) Even possible that an object is dominated by only one non- Pareto optimal object.
  • 23. Can Still Benefit From A Similar Idea For a non-Pareto optimal object, we only need to know at least one object dominates it. We don’t care about which other objects also dominate it. Overriding principle of the framework: o Identify non-Pareto objects as early as possible o Postpone their comparisons with other objects as much as possible
  • 24. Candidate Questions Given asked questions Q = 〈q1, ..., qn〉, x ?c y is a candidate question iff it satisfies 3 conditions: (i) The outcome of x ?c y is unknown yet, i.e., rlt(x ?c y) ∉ R+ (Q) (ii) x ∈ O? (iii) Based on R+(Q), y ≻ x is not ruled out yet. How to rule out y ≻ x ? ∃c∈C such that x ≻c y ∈ R+ (Q) ⇒ y⊁x
  • 25. Only Choosing from Candidate Questions Sufficient Property 2: Qcan = ∅ ⇔ O? = ∅ Efficient Theorem 1: If Q contains non-candidate questions, there exists a shorter or equally long sequence Q’ without non-candidate questions such that Q’ finds at least all dominated objects found by Q.
  • 26. Macro-ordering, Micro-ordering Both guided by the overriding principle Macro-ordering:Whenavailable,wechoosea candidatequestionx?c y suchthaty ∉ O× Micro-ordering:Severalquestionorderingheuristics: o Random Question (RandomQ): randomly choose a candidate question x ?c y o Random Pair (RandomP): randomly choose a candidate question x ?c y, continue to finish all remaining candidate questions between x and y. o Fewest Remaining Questions (FRQ): Choose a pair with the fewest remaining questions. Ties are broken based on how many objects are better/worse than x and y on the criterion.
  • 27. Experiments by Simulation o Used an 10000-tuple NBA dataset that records players’ per- season performance on 10 criteria (points-per-game, …) o Simulated Partial orders based on players’ performance comparison, with some perturbations.
  • 28. Effectiveness of Candidate Questions and Macro- Ordering
  • 30. Experiments using Amazon Mechanical Turk o Compare 100 photos of UT-Arlington campus, by color, sharpness, landscape. o All 14, 850 possible pairwise questions were partitioned into 1, 650 tasks, each containing 9 questions on a criterion. o Worker qualification:  responded to at least 100 HITs before with at least 90% approval rate  2 additional validation questions mixed in each task
  • 31. Experiments using Amazon Mechanical Turk
  • 32. Limitations and Future Work No performance guarantee: as bad as brute-force in worst-case Non-deterministic results: due to contradiction resolution Possibly empty result: due to lack of object dominance transitivity No consideration of different levels of confidence on question outcomes or crowdsourcers' quality. Future work: Pareto-optimal objects in probabilistic sense? No consideration of parallel/batch-execution scheme Future work: Parallel scheme
  • 33. Thank You! Questions? Chengkai Li http://ranger.uta.edu/~cli cli@uta.edu ©2015 The University of Texas at Arlington. All Rights Reserved.
  • 34. (3) Contradiction Resolution How often does contradiction happen? o Depends on data itself, k, and θ o It may not happen a lot.  Intuitively: as long as the underlying relation is transitive, collective wisdom of crowd should reflect it.  Preference judgments of relevance in document retrieval are transitive [27, 11].
  • 35. 6 objects, 3 criteria: 45 comparisons Brute-Force on the Toy Example
  • 36. RandomQ on the Toy Example
  • 37. RandomP on the Toy Example
  • 38. FRQ on the Toy Example