SlideShare a Scribd company logo
1 of 25
Efficient Computation of
Regret-ratio Minimizing Set:
A Compact Maxima Representative
ABOLFAZL ASUDEH
AZADE NAZI
NAN ZHANG
GAUTAM DAS
SIGMOD’17 © 2017 ACM. ISBN 978-1-4503-4197-4/17/05
UNIVERSITY OF TEXAS AT ARLINGTON
UNIVERSITY OF TEXAS AT ARLINGTON
GEORGE WASHINGTON UNIVERSITY
UNIVERSITY OF TEXAS AT ARLINGTON
Outline
Motivation and Problem statement
2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set)
HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set)
Experiments
2
Maxima Queries
𝑓 = ∑𝑤𝑖 𝐴𝑖
… to give the best trade-off b/w
price, duration, number of stops, …
3
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Y X
𝑓 = 𝑥 + 𝑦
Example 𝑡𝑖

4
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Y X
Example
Convex hull (sky convex) 


5
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Y X
Example







6
A subset of skyline:
the set of non-dominated points
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Y X
Example
Convex hull (sky convex) 
7
Convex hull size Problem
Curvature effect
8
Convex hull size Problem
effect of the number of attributes (m)
m=2m=3m=4m=5m=6
Regret-Ratio Minimizing Set
10

𝑓 𝑡 − 𝑓(𝑡′
)
𝑓 𝑡 − 𝑓(𝑡′
)
𝑓(𝑡)
Problem:
Find a subset of size at most r
that minimizes the maximum
Regret-ratio over all functions
Overview of the literature,
Our contributions
The regret-ratio notion and the problem was first proposed at [Nanongkai et. al. VLDB 2010].
In two dimensional data:
◦ [Chester et. al. VLDB 2014]: Sweeping line 𝑂(𝑟. 𝑛2
)
◦ We: a dynamic algorithm O r. s. log s . log c < O r. n. (log n)2
-- s: skyline size; c: convex hull size.
In higher dimensional data:
◦ Complexity: NP-complete
◦ For arbitrary dimensions: [Chester et. al. VLDB 2014]
◦ Recently for fixed dimensions: [W. Cao et. al. ICDT 2017], [P. K. Agrawal et. al. Arxiv:1702.01446, 2017]
◦ Existing work: (a) a greedy heuristic with unproven theoretical guarantee, (b) a simple attribute
space discretization with a fixed upper bound on the regret-ratio of output [Nanongkai et. al. VLDB
2010].
◦ We: a linearithmic time approximation algorithm that guarantees a regret ratio, within any
arbitrarily small user-controllable distance from the optimal regret ratio.
◦ Assumption: fixed number of dimensions
11
Outline
Motivation and Problem statement
2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set)
HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set)
Experiments
12
High-level idea
Order the skyline points from top-left to bottom right, add two
dummy points t0 and ts+1, and construct a complete
weighted graph on these points
13
t1
t5
t6
t0
t2
t3
t4
t7
Weight of an edge is the Max. regret ratio of removing all the
points in its top-right half-space




High-level idea
14
t1
t5
t6
t0
t2
t3
t4
t7
Order the skyline points from top-left to bottom right, add two
dummy points t0 and ts+1, and construct a complete
weighted graph on these points
Weight of an edge is the Max. regret ratio of removing all the
points in its top-right half-space  use binary search
High-level idea
Order the skyline points from top-left to bottom right, add two
dummy points t0 and ts+1, and construct a complete
weighted graph on these points
15
t1
t5
t6
t0
t2
t3
t4
t7
Weight of an edge is the Max. regret ratio of removing all the
points in its top-right half-space  use binary search




Apply the Dynamic programming, DP(ti,r’): optimal solution
from ti to ts+1 with at most r’ intermediate steps
𝑂(𝑟. 𝑠. log 𝑠 log 𝑐)
Outline
Motivation and Problem statement
2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set)
HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set)
Experiments
16
Steps
RRMS
• Start with a conceptual model
• Discuss its problems
DMM
• Propose the idea of function space discretization
• Transform RRMS to a Min Max problem
MRST
• Define the intermediate problem “Min Rows Satisfying a Threshold”
• Transform MRST to a fixed-size instance of Set-cover problem
17
Conceptual Model
18
𝑡1
𝑡2
𝑡 𝑠
...
f
MinMax ( )
F (all possible functions)
Regret-ratio on 𝑓 if
only
𝑡2 is remained
Transform the problem to a min-max problem
Problem1:
◦ F is continuous  infinite number of
columns
◦ Matrix Discritization
Problem2:
◦ Even if could construct the matrix,
𝑛
𝑟
to solve it
◦ Transform to fixed-size set-cover
instances
Matrix Discretization
19
f
𝜃2
𝜃1
Arbitrarily small user-controllable
distance from the optimal solution
DMM: Discretized Min Max Problem
20
𝑡1
𝑡2
𝑡 𝑠
...
f
MinMax ( )
F (all possible functions)F(discretized function space)
Observation: the optimal regret-ratio is one of the cell values!
Define an intermediate problem:
◦ Min. rows satisfying the threshold (MRST)
Order the values in M.
Do a binary search over the values and for each value
Convert M to a (fixed-size) binary matrix
Convert MRST to a (fixed size) set-cover instance
f
F(discretized function space)
𝑡𝑖
1 if regret-ratio of t for f is at
most threshold, 0 otherwise
For fixed values of 𝑚 and 𝛾, can be solved in constant time.
 The running time of HD-RRMS is 𝑂(𝑛 log 𝑛)
Practical HD-RRMS: Use greedy approximate algorithm for solving the
set-cover instances
1. Accept a result if its size is at most 𝑟𝑚𝑙𝑜𝑔(𝛾): Index size increase, no
change in quality of output
2. Accept the result if size is at most r: index size does not change,
output quality may increase.
Outline
Motivation and Problem statement
2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set)
HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set)
Experiments
21
Setup
22
Synthetic Data:
◦ Three datasets (correlated, independent, and anti-correlated) 10M tuples over 10 ordinal
attributes.
Real-world Datasets
◦ Airline dataset: 5.8M records over two ordinal attributes.
◦ US Department of Transportation (DOT) dataset: 457K records over 7 ordinal attributes.
◦ NBA dataset: 21K tuples over 17 ordinal attributes.
2D-RRMS
NBA dataset
23
Airline dataset
HD-RRMS
24
DOT dataset NBA dataset
Thank You!
25

More Related Content

What's hot

Approximate Thin Plate Spline Mappings
Approximate Thin Plate Spline MappingsApproximate Thin Plate Spline Mappings
Approximate Thin Plate Spline MappingsArchzilon Eshun-Davies
 
Lecture 11 (Digital Image Processing)
Lecture 11 (Digital Image Processing)Lecture 11 (Digital Image Processing)
Lecture 11 (Digital Image Processing)VARUN KUMAR
 
Gaussian Quadrature Formula
Gaussian Quadrature FormulaGaussian Quadrature Formula
Gaussian Quadrature FormulaDhaval Shukla
 
Applied numerical methods lec14
Applied numerical methods lec14Applied numerical methods lec14
Applied numerical methods lec14Yasser Ahmed
 
Applied numerical methods lec10
Applied numerical methods lec10Applied numerical methods lec10
Applied numerical methods lec10Yasser Ahmed
 
Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...
Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...
Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...Arthur Weglein
 
NUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULA
NUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULANUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULA
NUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULAKHORASIYA DEVANSU
 
Numerical integration
Numerical integrationNumerical integration
Numerical integrationTarun Gehlot
 
Karatsuba algorithm for fast mltiplication
Karatsuba algorithm for fast mltiplicationKaratsuba algorithm for fast mltiplication
Karatsuba algorithm for fast mltiplicationAtul Singh
 
Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...
Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...
Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...Eastern Mediterranean University
 
Numerical integration;Gaussian integration one point, two point and three poi...
Numerical integration;Gaussian integration one point, two point and three poi...Numerical integration;Gaussian integration one point, two point and three poi...
Numerical integration;Gaussian integration one point, two point and three poi...vaibhav tailor
 

What's hot (20)

Lecture 6
Lecture 6Lecture 6
Lecture 6
 
Triple integrals and applications
Triple integrals and applicationsTriple integrals and applications
Triple integrals and applications
 
Regression
RegressionRegression
Regression
 
Approximate Thin Plate Spline Mappings
Approximate Thin Plate Spline MappingsApproximate Thin Plate Spline Mappings
Approximate Thin Plate Spline Mappings
 
Lecture 11 (Digital Image Processing)
Lecture 11 (Digital Image Processing)Lecture 11 (Digital Image Processing)
Lecture 11 (Digital Image Processing)
 
Gaussian Quadrature Formula
Gaussian Quadrature FormulaGaussian Quadrature Formula
Gaussian Quadrature Formula
 
L 4 4
L 4 4L 4 4
L 4 4
 
Applied numerical methods lec14
Applied numerical methods lec14Applied numerical methods lec14
Applied numerical methods lec14
 
Applied numerical methods lec10
Applied numerical methods lec10Applied numerical methods lec10
Applied numerical methods lec10
 
Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...
Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...
Finite-difference modeling, accuracy, and boundary conditions- Arthur Weglein...
 
Quadrature
QuadratureQuadrature
Quadrature
 
Cmb part3
Cmb part3Cmb part3
Cmb part3
 
Presentation
PresentationPresentation
Presentation
 
R09 optimal control theory
R09 optimal control theoryR09 optimal control theory
R09 optimal control theory
 
NUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULA
NUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULANUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULA
NUMERICAL INTEGRATION : ERROR FORMULA, GAUSSIAN QUADRATURE FORMULA
 
Data Analysis Homework Help
Data Analysis Homework HelpData Analysis Homework Help
Data Analysis Homework Help
 
Numerical integration
Numerical integrationNumerical integration
Numerical integration
 
Karatsuba algorithm for fast mltiplication
Karatsuba algorithm for fast mltiplicationKaratsuba algorithm for fast mltiplication
Karatsuba algorithm for fast mltiplication
 
Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...
Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...
Particle Collision near 1+1- D Horava-Lifshitz Black Holes (Karl Schwarzschil...
 
Numerical integration;Gaussian integration one point, two point and three poi...
Numerical integration;Gaussian integration one point, two point and three poi...Numerical integration;Gaussian integration one point, two point and three poi...
Numerical integration;Gaussian integration one point, two point and three poi...
 

Similar to Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Representative

A Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT ProblemsA Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT ProblemsKarla Adamson
 
Intro to Quant Trading Strategies (Lecture 7 of 10)
Intro to Quant Trading Strategies (Lecture 7 of 10)Intro to Quant Trading Strategies (Lecture 7 of 10)
Intro to Quant Trading Strategies (Lecture 7 of 10)Adrian Aley
 
[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...
[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...
[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...Yuko Kuroki (黒木祐子)
 
MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4arogozhnikov
 
Solving the Poisson Equation
Solving the Poisson EquationSolving the Poisson Equation
Solving the Poisson EquationShahzaib Malik
 
Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...AmirParnianifard1
 
Binary Vector Reconstruction via Discreteness-Aware Approximate Message Passing
Binary Vector Reconstruction via Discreteness-Aware Approximate Message PassingBinary Vector Reconstruction via Discreteness-Aware Approximate Message Passing
Binary Vector Reconstruction via Discreteness-Aware Approximate Message PassingRyo Hayakawa
 
ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...
ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...
ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...Hiroyuki KASAI
 
Algorithm review
Algorithm reviewAlgorithm review
Algorithm reviewchidabdu
 
Chapter 4 Simplex Method ppt
Chapter 4  Simplex Method pptChapter 4  Simplex Method ppt
Chapter 4 Simplex Method pptDereje Tigabu
 
Estimating Future Initial Margin with Machine Learning
Estimating Future Initial Margin with Machine LearningEstimating Future Initial Margin with Machine Learning
Estimating Future Initial Margin with Machine LearningAndres Hernandez
 
2014 on exact solutions for the minmax regret aspanning tree problem
2014   on exact solutions for the minmax regret aspanning tree problem2014   on exact solutions for the minmax regret aspanning tree problem
2014 on exact solutions for the minmax regret aspanning tree problemFrancisco Pérez
 
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...WILIAMMAURICIOCAHUAT1
 
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...Varad Meru
 
MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsElvis DOHMATOB
 
MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION
MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION
MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION ijscai
 

Similar to Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Representative (20)

A Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT ProblemsA Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT Problems
 
Presentation.pdf
Presentation.pdfPresentation.pdf
Presentation.pdf
 
Intro to Quant Trading Strategies (Lecture 7 of 10)
Intro to Quant Trading Strategies (Lecture 7 of 10)Intro to Quant Trading Strategies (Lecture 7 of 10)
Intro to Quant Trading Strategies (Lecture 7 of 10)
 
[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...
[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...
[AAAI2021] Combinatorial Pure Exploration with Full-bandit or Partial Linear ...
 
MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4
 
Solving the Poisson Equation
Solving the Poisson EquationSolving the Poisson Equation
Solving the Poisson Equation
 
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
 
Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...
 
MUMS: Transition & SPUQ Workshop - Practical Bayesian Optimization for Urban ...
MUMS: Transition & SPUQ Workshop - Practical Bayesian Optimization for Urban ...MUMS: Transition & SPUQ Workshop - Practical Bayesian Optimization for Urban ...
MUMS: Transition & SPUQ Workshop - Practical Bayesian Optimization for Urban ...
 
Binary Vector Reconstruction via Discreteness-Aware Approximate Message Passing
Binary Vector Reconstruction via Discreteness-Aware Approximate Message PassingBinary Vector Reconstruction via Discreteness-Aware Approximate Message Passing
Binary Vector Reconstruction via Discreteness-Aware Approximate Message Passing
 
ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...
ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...
ICML2016: Low-rank tensor completion: a Riemannian manifold preconditioning a...
 
Algorithm review
Algorithm reviewAlgorithm review
Algorithm review
 
Input analysis
Input analysisInput analysis
Input analysis
 
Chapter 4 Simplex Method ppt
Chapter 4  Simplex Method pptChapter 4  Simplex Method ppt
Chapter 4 Simplex Method ppt
 
Estimating Future Initial Margin with Machine Learning
Estimating Future Initial Margin with Machine LearningEstimating Future Initial Margin with Machine Learning
Estimating Future Initial Margin with Machine Learning
 
2014 on exact solutions for the minmax regret aspanning tree problem
2014   on exact solutions for the minmax regret aspanning tree problem2014   on exact solutions for the minmax regret aspanning tree problem
2014 on exact solutions for the minmax regret aspanning tree problem
 
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
 
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
 
MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priors
 
MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION
MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION
MARGINAL PERCEPTRON FOR NON-LINEAR AND MULTI CLASS CLASSIFICATION
 

More from Abolfazl Asudeh

Query Reranking As A Service
Query Reranking As A ServiceQuery Reranking As A Service
Query Reranking As A ServiceAbolfazl Asudeh
 
[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise Comparisons
[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise Comparisons[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise Comparisons
[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise ComparisonsAbolfazl Asudeh
 
MapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large ClustersMapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large ClustersAbolfazl Asudeh
 
GBLENDER: Towards blending visual query formulation and query processing in g...
GBLENDER: Towards blending visual query formulation and query processing in g...GBLENDER: Towards blending visual query formulation and query processing in g...
GBLENDER: Towards blending visual query formulation and query processing in g...Abolfazl Asudeh
 
Using incompletely cooperative game theory in wireless sensor networks
Using incompletely cooperative game theory in wireless sensor networksUsing incompletely cooperative game theory in wireless sensor networks
Using incompletely cooperative game theory in wireless sensor networksAbolfazl Asudeh
 
PREGEL a system for large scale graph processing
PREGEL a system for large scale graph processingPREGEL a system for large scale graph processing
PREGEL a system for large scale graph processingAbolfazl Asudeh
 

More from Abolfazl Asudeh (6)

Query Reranking As A Service
Query Reranking As A ServiceQuery Reranking As A Service
Query Reranking As A Service
 
[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise Comparisons
[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise Comparisons[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise Comparisons
[Slides] Crowdsourcing Pareto-Optimal Object Finding By Pairwise Comparisons
 
MapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large ClustersMapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large Clusters
 
GBLENDER: Towards blending visual query formulation and query processing in g...
GBLENDER: Towards blending visual query formulation and query processing in g...GBLENDER: Towards blending visual query formulation and query processing in g...
GBLENDER: Towards blending visual query formulation and query processing in g...
 
Using incompletely cooperative game theory in wireless sensor networks
Using incompletely cooperative game theory in wireless sensor networksUsing incompletely cooperative game theory in wireless sensor networks
Using incompletely cooperative game theory in wireless sensor networks
 
PREGEL a system for large scale graph processing
PREGEL a system for large scale graph processingPREGEL a system for large scale graph processing
PREGEL a system for large scale graph processing
 

Recently uploaded

18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 

Recently uploaded (20)

18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 

Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Representative

  • 1. Efficient Computation of Regret-ratio Minimizing Set: A Compact Maxima Representative ABOLFAZL ASUDEH AZADE NAZI NAN ZHANG GAUTAM DAS SIGMOD’17 © 2017 ACM. ISBN 978-1-4503-4197-4/17/05 UNIVERSITY OF TEXAS AT ARLINGTON UNIVERSITY OF TEXAS AT ARLINGTON GEORGE WASHINGTON UNIVERSITY UNIVERSITY OF TEXAS AT ARLINGTON
  • 2. Outline Motivation and Problem statement 2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set) HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set) Experiments 2
  • 3. Maxima Queries 𝑓 = ∑𝑤𝑖 𝐴𝑖 … to give the best trade-off b/w price, duration, number of stops, … 3
  • 4. 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Y X 𝑓 = 𝑥 + 𝑦 Example 𝑡𝑖  4
  • 5. 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Y X Example Convex hull (sky convex)    5
  • 6. 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Y X Example        6 A subset of skyline: the set of non-dominated points
  • 7. 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Y X Example Convex hull (sky convex)  7
  • 8. Convex hull size Problem Curvature effect 8
  • 9. Convex hull size Problem effect of the number of attributes (m) m=2m=3m=4m=5m=6
  • 10. Regret-Ratio Minimizing Set 10  𝑓 𝑡 − 𝑓(𝑡′ ) 𝑓 𝑡 − 𝑓(𝑡′ ) 𝑓(𝑡) Problem: Find a subset of size at most r that minimizes the maximum Regret-ratio over all functions
  • 11. Overview of the literature, Our contributions The regret-ratio notion and the problem was first proposed at [Nanongkai et. al. VLDB 2010]. In two dimensional data: ◦ [Chester et. al. VLDB 2014]: Sweeping line 𝑂(𝑟. 𝑛2 ) ◦ We: a dynamic algorithm O r. s. log s . log c < O r. n. (log n)2 -- s: skyline size; c: convex hull size. In higher dimensional data: ◦ Complexity: NP-complete ◦ For arbitrary dimensions: [Chester et. al. VLDB 2014] ◦ Recently for fixed dimensions: [W. Cao et. al. ICDT 2017], [P. K. Agrawal et. al. Arxiv:1702.01446, 2017] ◦ Existing work: (a) a greedy heuristic with unproven theoretical guarantee, (b) a simple attribute space discretization with a fixed upper bound on the regret-ratio of output [Nanongkai et. al. VLDB 2010]. ◦ We: a linearithmic time approximation algorithm that guarantees a regret ratio, within any arbitrarily small user-controllable distance from the optimal regret ratio. ◦ Assumption: fixed number of dimensions 11
  • 12. Outline Motivation and Problem statement 2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set) HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set) Experiments 12
  • 13. High-level idea Order the skyline points from top-left to bottom right, add two dummy points t0 and ts+1, and construct a complete weighted graph on these points 13 t1 t5 t6 t0 t2 t3 t4 t7 Weight of an edge is the Max. regret ratio of removing all the points in its top-right half-space    
  • 14. High-level idea 14 t1 t5 t6 t0 t2 t3 t4 t7 Order the skyline points from top-left to bottom right, add two dummy points t0 and ts+1, and construct a complete weighted graph on these points Weight of an edge is the Max. regret ratio of removing all the points in its top-right half-space  use binary search
  • 15. High-level idea Order the skyline points from top-left to bottom right, add two dummy points t0 and ts+1, and construct a complete weighted graph on these points 15 t1 t5 t6 t0 t2 t3 t4 t7 Weight of an edge is the Max. regret ratio of removing all the points in its top-right half-space  use binary search     Apply the Dynamic programming, DP(ti,r’): optimal solution from ti to ts+1 with at most r’ intermediate steps 𝑂(𝑟. 𝑠. log 𝑠 log 𝑐)
  • 16. Outline Motivation and Problem statement 2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set) HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set) Experiments 16
  • 17. Steps RRMS • Start with a conceptual model • Discuss its problems DMM • Propose the idea of function space discretization • Transform RRMS to a Min Max problem MRST • Define the intermediate problem “Min Rows Satisfying a Threshold” • Transform MRST to a fixed-size instance of Set-cover problem 17
  • 18. Conceptual Model 18 𝑡1 𝑡2 𝑡 𝑠 ... f MinMax ( ) F (all possible functions) Regret-ratio on 𝑓 if only 𝑡2 is remained Transform the problem to a min-max problem Problem1: ◦ F is continuous  infinite number of columns ◦ Matrix Discritization Problem2: ◦ Even if could construct the matrix, 𝑛 𝑟 to solve it ◦ Transform to fixed-size set-cover instances
  • 19. Matrix Discretization 19 f 𝜃2 𝜃1 Arbitrarily small user-controllable distance from the optimal solution
  • 20. DMM: Discretized Min Max Problem 20 𝑡1 𝑡2 𝑡 𝑠 ... f MinMax ( ) F (all possible functions)F(discretized function space) Observation: the optimal regret-ratio is one of the cell values! Define an intermediate problem: ◦ Min. rows satisfying the threshold (MRST) Order the values in M. Do a binary search over the values and for each value Convert M to a (fixed-size) binary matrix Convert MRST to a (fixed size) set-cover instance f F(discretized function space) 𝑡𝑖 1 if regret-ratio of t for f is at most threshold, 0 otherwise For fixed values of 𝑚 and 𝛾, can be solved in constant time.  The running time of HD-RRMS is 𝑂(𝑛 log 𝑛) Practical HD-RRMS: Use greedy approximate algorithm for solving the set-cover instances 1. Accept a result if its size is at most 𝑟𝑚𝑙𝑜𝑔(𝛾): Index size increase, no change in quality of output 2. Accept the result if size is at most r: index size does not change, output quality may increase.
  • 21. Outline Motivation and Problem statement 2D-RRMS (Two-Dimensional Regret-Ratio Minimizing Set) HD-RRMS (Higher-Dimensional Regret-Ratio Minimizing Set) Experiments 21
  • 22. Setup 22 Synthetic Data: ◦ Three datasets (correlated, independent, and anti-correlated) 10M tuples over 10 ordinal attributes. Real-world Datasets ◦ Airline dataset: 5.8M records over two ordinal attributes. ◦ US Department of Transportation (DOT) dataset: 457K records over 7 ordinal attributes. ◦ NBA dataset: 21K tuples over 17 ordinal attributes.