SlideShare a Scribd company logo
1 of 20
Skyline Operator
on Anti-correlated Distribution
Proceedings of the VLDB(2013) Endowment, Vol. 6 No. 9
Haichuan Shang, Masaru Kitsuregawa
Presenter:
WooSung Choi
(ws_choi@korea.ac.kr)
DataKnow. Lab
Korea UNIV.
Background
Related work
Preliminaries
• Formal definition of Dominates (≺)
 Given a set of d-dimensional points 𝑇
 We say that a point t1 ∈ 𝑇 DOMINATES another point t2 ∈ 𝑇
 If and only if
 ∀𝑖 ∈ 1, 2, 3, … , 𝑑 , 𝑡1 𝑖 ≤ 𝑡2[𝑖]
 ∃𝑗 ∈ 1, 2, 3, … , 𝑑 , 𝑡1 𝑗 < 𝑡2[𝑗]
 and Denoted by t1 ≺ t2
 (simply saying, t1 이 자명하게 선호됨)
Definition from http://www.comp.nus.edu.sg/~atung/publication/k_dominant.pdf
Note that
the meaning of ‘dominates’ may differ
according to type of application
www.caranddriver.com
formal Definition (skyline)
• The Skyline operator
 Input - Given a set of objects P = {𝑝1, 𝑝2, … , 𝑝 𝑁}
 𝑆𝐾𝑌𝐿𝐼𝑁𝐸 𝑃 = {𝑝𝑖| 𝑝𝑖 ∈ 𝑃 𝑎𝑛𝑑 ∄ 𝑝∗
∈ 𝑃 𝑠. 𝑡. 𝑝∗
≺ 𝑝𝑖}
A
B
C
D
E
F
Dominating Area(B)
x axis
yaxis
G
Common misconceptions
“𝐵 ∈ 𝑂𝑢𝑝𝑢𝑡 s𝑖𝑛𝑐𝑒 𝐵 ≺ 𝐶 , D, F” , wrong
“𝐵 ∈ 𝑂𝑢𝑝𝑢𝑡,
s𝑖𝑛𝑐𝑒 𝑛𝑜 𝑜𝑡ℎ𝑒𝑟 𝑝𝑜𝑖𝑛𝑡 𝑃 ≺ 𝐵”, correct
 Suppose there are n objects in the given set
 𝐷 𝑥 = {𝑜1, 𝑜2, … , 𝑜 𝑛}
 Algorithm -Naïve 1
 𝑓𝑜𝑟 𝑒𝑎𝑐ℎ 𝑜𝑏𝑗𝑒𝑐𝑡 𝑜 𝑥 ∈ 𝐷
 𝑏𝑜𝑜𝑙𝑒𝑎𝑛 𝑖𝑠𝐷𝑜𝑚𝑖𝑛𝑎𝑡𝑒𝑑 = 𝑓𝑎𝑙𝑠𝑒
 𝑓𝑜𝑟 𝑒𝑎𝑐ℎ 𝑜𝑏𝑗𝑒𝑐𝑡 𝑜 𝑦 ∈ 𝐷
 𝑖𝑓 ¬(𝑜 𝑥 = 𝑜 𝑦) 𝐴𝑁𝐷 ¬ 𝑜 𝑦 ≺ 𝑜 𝑥 𝑡ℎ𝑒𝑛 𝑐𝑜𝑛𝑡𝑖𝑛𝑢𝑒;
 𝑒𝑙𝑠𝑒
 𝑡ℎ𝑒𝑛 𝑖𝑠𝐷𝑜𝑚𝑖𝑛𝑎𝑡𝑒𝑑 = 𝑡𝑟𝑢𝑒;
 break;
 𝑖𝑓 ! 𝑖𝑠𝐷𝑜𝑚𝑖𝑛𝑎𝑡𝑒𝑑 𝑆 ∪ {𝑜 𝑥}
Naïveapproach
NestedLoopStructure
Computational Cost - 𝑂(𝑛2
)
Motivation
Data Distribution
Data Distribution?
Related Work: Summary
• Worst-case Analysis (2.1)
 worst case complexity on arbitrary data distributions
 Ω(𝑛𝑙𝑜𝑔𝑛)[16], O( N/B logM/B
𝑑−2
N/B )[12]
• Elimination Category (2.2)
 Average Complexity with dimensional independence
 Idea: Eliminate non-skyline objects quickly!
 BNL[7], SFS[9], LESS[12], …
 O(dnm)[20], where 𝑚 is the skyline cardinalityO(dnm)[20], where 𝑚 is the skyline cardinality
Anti-Correlation은 왜 중요한가?
Anti-Correlated (2)
•A relationship in which
 the value in one dimension increases as the values in the other
dimensions decrease
•Skyline Queries
are used to find a set of non-dominated data points
for Multi-Criteria Decision Making
•Data in real world
 is more likely to be anti- correlated
Anti-Correlated (3)
• The anti-correlation significantly limits the practical
usage of the existing algorithms
• and yields the demand of effective mathematical
models and efficient algorithms on anti-correlated data
O(dnm)[20], where 𝑚 is the skyline cardinality
𝑚 tends to increase on anti-correlated distribution
These existing algorithms fall back to O(dn2)
뭘 하겠다는 연구인가?
공헌도
Contribution
• 1) General model for the anti-correlated distribution
• 2) Polynomial Estimation of the lower bound of the
expected value of skyline cardinality
• 3) a “Determination and Elimination Framework” for
efficient computation of skyline on anti-correlated
distribution
3. PRELIMINIARIES
Definition & Expectation of Skyline Cardinality
Model: Anti-Correlated Distribution
0
1000
2000
3000
4000
5000
6000
7000
8000
0 2000 4000 6000 8000 10000 12000
Uniform
0
1000
2000
3000
4000
5000
6000
0 2000 4000 6000 8000 10000 12000
Anti c=1
0
1000
2000
3000
4000
5000
6000
0 2000 4000 6000 8000 10000 12000
Anti c=0.1
1) General model for the anti-correlated distribution
1K Tuples
0
1000
2000
3000
4000
5000
6000
7000
8000
0 2000 4000 6000 8000 10000 12000
Uniform
0
1000
2000
3000
4000
5000
6000
0 2000 4000 6000 8000 10000 12000
Anti c=1
0
1000
2000
3000
4000
5000
6000
0 2000 4000 6000 8000 10000 12000
Anti c=0.1
12 57 116
1) General model for the anti-correlated distribution
1K Tuples
0
1000
2000
3000
4000
5000
6000
0 2000 4000 6000 8000 10000 12000
Anti c=1
57
𝑆2,1000,1 ≈ 1000 ∗ 𝜋 − 1
= 55.0499122
2) Polynomal Estimation of the lowerbound of the expected value of skyline cardinality
Generalization
• Theorem 3
 The expected value 𝑆 𝑑,𝑛,𝑐 of the skyline cardinality
 𝑆 𝑑,𝑛,1 ≤ 𝑆 𝑑,𝑛,𝑐 ≤ 𝑆 𝑑,𝑛,0 = 𝑛
 𝑆 𝑑,𝑛,1 = 𝑘=1
𝑑
−1 𝑘−1 𝑑−1
𝑘−1
𝑛
Γ
𝑘
𝑑
Γ(n)
Γ(𝑛+
𝑘
𝑑
)
 ≈ 𝑘=1
𝑑
−1 𝑘−1 𝑑−1
𝑘−1
Γ
𝑘
𝑑
𝑛1−
𝑘
𝑑
 when d ≥ 2
• Where Γ 𝑛 =
1
2𝜋 0
∞
𝑒−𝑡
𝑡 𝑛
𝑑𝑡
2) Polynomal Estimation of the lowerbound of the expected value of skyline cardinality
O(dnm)[20], where 𝑚 is the skyline cardinality
𝑚 tends to increase on anti-correlated distribution
These existing algorithms: O(𝑑𝑛(2𝑑−1)/𝑑) ~ O(dn2)
Pearson Correlation Coefficient
or covariance based model
공분산
• 확률론과 통계학에서, 공분산(共分散, 영어: covariance)
은 2개의 확률변수의 상관정도를 나타내는 값
• 만약 2개의 변수중 하나의 값이 상승하는 경향을 보일
때, 다른 값도 상승하는 경향의 상관관계에 있다면, 공분
산의 값은 양수
• 반대로 2개의 변수중 하나의 값이 상승하는 경향을 보일
때, 다른 값이 하강하는 경향을 보인다면 공분산의 값은
음수

More Related Content

What's hot

DDPG algortihm for angry birds
DDPG algortihm for angry birdsDDPG algortihm for angry birds
DDPG algortihm for angry birdsWangyu Han
 
MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4arogozhnikov
 
MLHEP 2015: Introductory Lecture #2
MLHEP 2015: Introductory Lecture #2MLHEP 2015: Introductory Lecture #2
MLHEP 2015: Introductory Lecture #2arogozhnikov
 
Graph Regularised Hashing
Graph Regularised HashingGraph Regularised Hashing
Graph Regularised HashingSean Moran
 
Hyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradientHyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradientFabian Pedregosa
 
Introduction to Diffusion Models
Introduction to Diffusion ModelsIntroduction to Diffusion Models
Introduction to Diffusion ModelsSangwoo Mo
 
Visualization using tSNE
Visualization using tSNEVisualization using tSNE
Visualization using tSNEYan Xu
 
Paper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelinePaper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelineChenYiHuang5
 
K means and dbscan
K means and dbscanK means and dbscan
K means and dbscanYan Xu
 
IJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsIJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsAkisato Kimura
 
Paper study: Attention, learn to solve routing problems!
Paper study: Attention, learn to solve routing problems!Paper study: Attention, learn to solve routing problems!
Paper study: Attention, learn to solve routing problems!ChenYiHuang5
 
Paper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satPaper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satChenYiHuang5
 
Spectral clustering Tutorial
Spectral clustering TutorialSpectral clustering Tutorial
Spectral clustering TutorialZitao Liu
 
K-Means Clustering Simply
K-Means Clustering SimplyK-Means Clustering Simply
K-Means Clustering SimplyEmad Nabil
 
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...홍배 김
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks남주 김
 
"Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ..."Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ...Adrian Florea
 
Parallel Algorithms for Geometric Graph Problems (at Stanford)
Parallel Algorithms for Geometric Graph Problems (at Stanford)Parallel Algorithms for Geometric Graph Problems (at Stanford)
Parallel Algorithms for Geometric Graph Problems (at Stanford)Grigory Yaroslavtsev
 
MLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic trackMLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic trackarogozhnikov
 
Machine learning applications in aerospace domain
Machine learning applications in aerospace domainMachine learning applications in aerospace domain
Machine learning applications in aerospace domain홍배 김
 

What's hot (20)

DDPG algortihm for angry birds
DDPG algortihm for angry birdsDDPG algortihm for angry birds
DDPG algortihm for angry birds
 
MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4
 
MLHEP 2015: Introductory Lecture #2
MLHEP 2015: Introductory Lecture #2MLHEP 2015: Introductory Lecture #2
MLHEP 2015: Introductory Lecture #2
 
Graph Regularised Hashing
Graph Regularised HashingGraph Regularised Hashing
Graph Regularised Hashing
 
Hyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradientHyperparameter optimization with approximate gradient
Hyperparameter optimization with approximate gradient
 
Introduction to Diffusion Models
Introduction to Diffusion ModelsIntroduction to Diffusion Models
Introduction to Diffusion Models
 
Visualization using tSNE
Visualization using tSNEVisualization using tSNE
Visualization using tSNE
 
Paper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelinePaper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipeline
 
K means and dbscan
K means and dbscanK means and dbscan
K means and dbscan
 
IJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsIJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphs
 
Paper study: Attention, learn to solve routing problems!
Paper study: Attention, learn to solve routing problems!Paper study: Attention, learn to solve routing problems!
Paper study: Attention, learn to solve routing problems!
 
Paper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satPaper study: Learning to solve circuit sat
Paper study: Learning to solve circuit sat
 
Spectral clustering Tutorial
Spectral clustering TutorialSpectral clustering Tutorial
Spectral clustering Tutorial
 
K-Means Clustering Simply
K-Means Clustering SimplyK-Means Clustering Simply
K-Means Clustering Simply
 
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
 
"Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ..."Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ...
 
Parallel Algorithms for Geometric Graph Problems (at Stanford)
Parallel Algorithms for Geometric Graph Problems (at Stanford)Parallel Algorithms for Geometric Graph Problems (at Stanford)
Parallel Algorithms for Geometric Graph Problems (at Stanford)
 
MLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic trackMLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic track
 
Machine learning applications in aerospace domain
Machine learning applications in aerospace domainMachine learning applications in aerospace domain
Machine learning applications in aerospace domain
 

Similar to [Vldb 2013] skyline operator on anti correlated distributions

Theories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection ProblemTheories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection ProblemSeongcheol Baek
 
Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4Fabian Pedregosa
 
Time series clustering presentation
Time series clustering presentationTime series clustering presentation
Time series clustering presentationEleni Stamatelou
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsUniversity of Glasgow
 
An Efficient Convex Hull Algorithm for a Planer Set of Points
An Efficient Convex Hull Algorithm for a Planer Set of PointsAn Efficient Convex Hull Algorithm for a Planer Set of Points
An Efficient Convex Hull Algorithm for a Planer Set of PointsKasun Ranga Wijeweera
 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingHsing-chuan Hsieh
 
Passive network-redesign-ntua
Passive network-redesign-ntuaPassive network-redesign-ntua
Passive network-redesign-ntuaIEEE NTUA SB
 
Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...
Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...
Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...Atsushi Nitanda
 
TPDM Presentation Slide (ICCV23)
TPDM Presentation Slide (ICCV23)TPDM Presentation Slide (ICCV23)
TPDM Presentation Slide (ICCV23)Suhyeon Lee
 
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAminaRepo
 
Wereszczynski Molecular Dynamics
Wereszczynski Molecular DynamicsWereszczynski Molecular Dynamics
Wereszczynski Molecular DynamicsSciCompIIT
 
Participation costs dismiss the advantage of heterogeneous networks in evolut...
Participation costs dismiss the advantage of heterogeneous networks in evolut...Participation costs dismiss the advantage of heterogeneous networks in evolut...
Participation costs dismiss the advantage of heterogeneous networks in evolut...Naoki Masuda
 
Euler lagrange equations of motion mit-holonomic constraints_lecture7
Euler lagrange equations of motion  mit-holonomic  constraints_lecture7Euler lagrange equations of motion  mit-holonomic  constraints_lecture7
Euler lagrange equations of motion mit-holonomic constraints_lecture7JOHN OBIDI
 

Similar to [Vldb 2013] skyline operator on anti correlated distributions (20)

Theories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection ProblemTheories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection Problem
 
Lecture8 xing
Lecture8 xingLecture8 xing
Lecture8 xing
 
Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4
 
Time series clustering presentation
Time series clustering presentationTime series clustering presentation
Time series clustering presentation
 
Delta Like Robot
Delta Like RobotDelta Like Robot
Delta Like Robot
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamics
 
An Efficient Convex Hull Algorithm for a Planer Set of Points
An Efficient Convex Hull Algorithm for a Planer Set of PointsAn Efficient Convex Hull Algorithm for a Planer Set of Points
An Efficient Convex Hull Algorithm for a Planer Set of Points
 
VoxelNet
VoxelNetVoxelNet
VoxelNet
 
GDRR Opening Workshop - Variance Reduction for Reliability Assessment with St...
GDRR Opening Workshop - Variance Reduction for Reliability Assessment with St...GDRR Opening Workshop - Variance Reduction for Reliability Assessment with St...
GDRR Opening Workshop - Variance Reduction for Reliability Assessment with St...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketching
 
Passive network-redesign-ntua
Passive network-redesign-ntuaPassive network-redesign-ntua
Passive network-redesign-ntua
 
Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...
Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...
Stochastic Gradient Descent with Exponential Convergence Rates of Expected Cl...
 
TPDM Presentation Slide (ICCV23)
TPDM Presentation Slide (ICCV23)TPDM Presentation Slide (ICCV23)
TPDM Presentation Slide (ICCV23)
 
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
 
Wereszczynski Molecular Dynamics
Wereszczynski Molecular DynamicsWereszczynski Molecular Dynamics
Wereszczynski Molecular Dynamics
 
Participation costs dismiss the advantage of heterogeneous networks in evolut...
Participation costs dismiss the advantage of heterogeneous networks in evolut...Participation costs dismiss the advantage of heterogeneous networks in evolut...
Participation costs dismiss the advantage of heterogeneous networks in evolut...
 
TunUp final presentation
TunUp final presentationTunUp final presentation
TunUp final presentation
 
Euler lagrange equations of motion mit-holonomic constraints_lecture7
Euler lagrange equations of motion  mit-holonomic  constraints_lecture7Euler lagrange equations of motion  mit-holonomic  constraints_lecture7
Euler lagrange equations of motion mit-holonomic constraints_lecture7
 
New name-400-4
New name-400-4New name-400-4
New name-400-4
 

Recently uploaded

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Intro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdfIntro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdfrs7054576148
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxJuliansyahHarahap1
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfRagavanV2
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01KreezheaRecto
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptMsecMca
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoordharasingh5698
 

Recently uploaded (20)

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Intro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdfIntro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdf
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 

[Vldb 2013] skyline operator on anti correlated distributions

  • 1. Skyline Operator on Anti-correlated Distribution Proceedings of the VLDB(2013) Endowment, Vol. 6 No. 9 Haichuan Shang, Masaru Kitsuregawa Presenter: WooSung Choi (ws_choi@korea.ac.kr) DataKnow. Lab Korea UNIV.
  • 3. Preliminaries • Formal definition of Dominates (≺)  Given a set of d-dimensional points 𝑇  We say that a point t1 ∈ 𝑇 DOMINATES another point t2 ∈ 𝑇  If and only if  ∀𝑖 ∈ 1, 2, 3, … , 𝑑 , 𝑡1 𝑖 ≤ 𝑡2[𝑖]  ∃𝑗 ∈ 1, 2, 3, … , 𝑑 , 𝑡1 𝑗 < 𝑡2[𝑗]  and Denoted by t1 ≺ t2  (simply saying, t1 이 자명하게 선호됨) Definition from http://www.comp.nus.edu.sg/~atung/publication/k_dominant.pdf Note that the meaning of ‘dominates’ may differ according to type of application www.caranddriver.com
  • 4. formal Definition (skyline) • The Skyline operator  Input - Given a set of objects P = {𝑝1, 𝑝2, … , 𝑝 𝑁}  𝑆𝐾𝑌𝐿𝐼𝑁𝐸 𝑃 = {𝑝𝑖| 𝑝𝑖 ∈ 𝑃 𝑎𝑛𝑑 ∄ 𝑝∗ ∈ 𝑃 𝑠. 𝑡. 𝑝∗ ≺ 𝑝𝑖} A B C D E F Dominating Area(B) x axis yaxis G Common misconceptions “𝐵 ∈ 𝑂𝑢𝑝𝑢𝑡 s𝑖𝑛𝑐𝑒 𝐵 ≺ 𝐶 , D, F” , wrong “𝐵 ∈ 𝑂𝑢𝑝𝑢𝑡, s𝑖𝑛𝑐𝑒 𝑛𝑜 𝑜𝑡ℎ𝑒𝑟 𝑝𝑜𝑖𝑛𝑡 𝑃 ≺ 𝐵”, correct
  • 5.  Suppose there are n objects in the given set  𝐷 𝑥 = {𝑜1, 𝑜2, … , 𝑜 𝑛}  Algorithm -Naïve 1  𝑓𝑜𝑟 𝑒𝑎𝑐ℎ 𝑜𝑏𝑗𝑒𝑐𝑡 𝑜 𝑥 ∈ 𝐷  𝑏𝑜𝑜𝑙𝑒𝑎𝑛 𝑖𝑠𝐷𝑜𝑚𝑖𝑛𝑎𝑡𝑒𝑑 = 𝑓𝑎𝑙𝑠𝑒  𝑓𝑜𝑟 𝑒𝑎𝑐ℎ 𝑜𝑏𝑗𝑒𝑐𝑡 𝑜 𝑦 ∈ 𝐷  𝑖𝑓 ¬(𝑜 𝑥 = 𝑜 𝑦) 𝐴𝑁𝐷 ¬ 𝑜 𝑦 ≺ 𝑜 𝑥 𝑡ℎ𝑒𝑛 𝑐𝑜𝑛𝑡𝑖𝑛𝑢𝑒;  𝑒𝑙𝑠𝑒  𝑡ℎ𝑒𝑛 𝑖𝑠𝐷𝑜𝑚𝑖𝑛𝑎𝑡𝑒𝑑 = 𝑡𝑟𝑢𝑒;  break;  𝑖𝑓 ! 𝑖𝑠𝐷𝑜𝑚𝑖𝑛𝑎𝑡𝑒𝑑 𝑆 ∪ {𝑜 𝑥} Naïveapproach NestedLoopStructure Computational Cost - 𝑂(𝑛2 )
  • 8. Related Work: Summary • Worst-case Analysis (2.1)  worst case complexity on arbitrary data distributions  Ω(𝑛𝑙𝑜𝑔𝑛)[16], O( N/B logM/B 𝑑−2 N/B )[12] • Elimination Category (2.2)  Average Complexity with dimensional independence  Idea: Eliminate non-skyline objects quickly!  BNL[7], SFS[9], LESS[12], …  O(dnm)[20], where 𝑚 is the skyline cardinalityO(dnm)[20], where 𝑚 is the skyline cardinality
  • 10. Anti-Correlated (2) •A relationship in which  the value in one dimension increases as the values in the other dimensions decrease •Skyline Queries are used to find a set of non-dominated data points for Multi-Criteria Decision Making •Data in real world  is more likely to be anti- correlated
  • 11. Anti-Correlated (3) • The anti-correlation significantly limits the practical usage of the existing algorithms • and yields the demand of effective mathematical models and efficient algorithms on anti-correlated data O(dnm)[20], where 𝑚 is the skyline cardinality 𝑚 tends to increase on anti-correlated distribution These existing algorithms fall back to O(dn2)
  • 13. Contribution • 1) General model for the anti-correlated distribution • 2) Polynomial Estimation of the lower bound of the expected value of skyline cardinality • 3) a “Determination and Elimination Framework” for efficient computation of skyline on anti-correlated distribution
  • 14. 3. PRELIMINIARIES Definition & Expectation of Skyline Cardinality
  • 15. Model: Anti-Correlated Distribution 0 1000 2000 3000 4000 5000 6000 7000 8000 0 2000 4000 6000 8000 10000 12000 Uniform 0 1000 2000 3000 4000 5000 6000 0 2000 4000 6000 8000 10000 12000 Anti c=1 0 1000 2000 3000 4000 5000 6000 0 2000 4000 6000 8000 10000 12000 Anti c=0.1 1) General model for the anti-correlated distribution
  • 16. 1K Tuples 0 1000 2000 3000 4000 5000 6000 7000 8000 0 2000 4000 6000 8000 10000 12000 Uniform 0 1000 2000 3000 4000 5000 6000 0 2000 4000 6000 8000 10000 12000 Anti c=1 0 1000 2000 3000 4000 5000 6000 0 2000 4000 6000 8000 10000 12000 Anti c=0.1 12 57 116 1) General model for the anti-correlated distribution
  • 17. 1K Tuples 0 1000 2000 3000 4000 5000 6000 0 2000 4000 6000 8000 10000 12000 Anti c=1 57 𝑆2,1000,1 ≈ 1000 ∗ 𝜋 − 1 = 55.0499122 2) Polynomal Estimation of the lowerbound of the expected value of skyline cardinality
  • 18. Generalization • Theorem 3  The expected value 𝑆 𝑑,𝑛,𝑐 of the skyline cardinality  𝑆 𝑑,𝑛,1 ≤ 𝑆 𝑑,𝑛,𝑐 ≤ 𝑆 𝑑,𝑛,0 = 𝑛  𝑆 𝑑,𝑛,1 = 𝑘=1 𝑑 −1 𝑘−1 𝑑−1 𝑘−1 𝑛 Γ 𝑘 𝑑 Γ(n) Γ(𝑛+ 𝑘 𝑑 )  ≈ 𝑘=1 𝑑 −1 𝑘−1 𝑑−1 𝑘−1 Γ 𝑘 𝑑 𝑛1− 𝑘 𝑑  when d ≥ 2 • Where Γ 𝑛 = 1 2𝜋 0 ∞ 𝑒−𝑡 𝑡 𝑛 𝑑𝑡 2) Polynomal Estimation of the lowerbound of the expected value of skyline cardinality O(dnm)[20], where 𝑚 is the skyline cardinality 𝑚 tends to increase on anti-correlated distribution These existing algorithms: O(𝑑𝑛(2𝑑−1)/𝑑) ~ O(dn2)
  • 19. Pearson Correlation Coefficient or covariance based model
  • 20. 공분산 • 확률론과 통계학에서, 공분산(共分散, 영어: covariance) 은 2개의 확률변수의 상관정도를 나타내는 값 • 만약 2개의 변수중 하나의 값이 상승하는 경향을 보일 때, 다른 값도 상승하는 경향의 상관관계에 있다면, 공분 산의 값은 양수 • 반대로 2개의 변수중 하나의 값이 상승하는 경향을 보일 때, 다른 값이 하강하는 경향을 보인다면 공분산의 값은 음수