SlideShare a Scribd company logo
Generalized Notions of Data Depth
Spring 2015 Data Reading Seminar
Mukund Raj
12th Mar, 2015
1 / 25
Outline
1 Data Depth Background
What is Data Depth?
Geometrical Data Depth
General Properties of Data Depth
2 Generalized Notions of Data Depth
Functions
Multivariate Curves
Sets
Paths (on a graph)
3 Discussion
Relaxed Formulations
Advantages and Limitations of Data Depth
2 / 25
What is Data Depth?
A means of measuring how deep a data point p is within a
cloud of points {p1, . . . , pn}.
Multivariate data analysis approach to generate order statistics
which capture high-dimensional features and relationships.
Descriptive nonparametric method of statistical analysis.
3 / 25
Why is Data Depth Interesting?
Estimate the location from center outward ( with respect to
parent distribution ).
Identify outliers.
Formulate quantitative and graphical methods for analyzing
distributional characteristics such as location, scale, e.t.c as
well as hypothesis testing.
Robustness.
4 / 25
Various Formulations of Data Depth
Geometrical (for Data in
Euclidean Space)
L2 depth
Mahalanobis depth
Oja depth
Expected convex hull depth
Zonoid depth
Simplex depth
Half Space depth or Tukey
depth or Location depth
Generalized (for Complex Data)
Functional Band Depth
Depth for Multivariate
Curves
Sets
Paths on a Graph
5 / 25
Geometrical data depth
Depth based on distances / volumes
L2 depth
Mahalanobis depth
Oja depth
Depth based on weighted means
Zonoid depth
Expected Convex Hull depth
Depth based on half spaces and simplices
Tukey depth
Simplicial depth
[Mosler 2012]
6 / 25
General Properties of Data Depth
1 Zero at infinity
2 Maximality at Center
3 Monotonicity
4 Affine Invariance
[Zuo and Serfling, 2000]
7 / 25
Outline
1 Data Depth Background
What is Data Depth?
Geometrical Data Depth
General Properties of Data Depth
2 Generalized Notions of Data Depth
Functions
Multivariate Curves
Sets
Paths (on a graph)
3 Discussion
Relaxed Formulations
Advantages and Limitations of Data Depth
8 / 25
Function Ensembles
A function ensemble can be defined as:
{xi (t), i = 1, . . . , n, t ∈ I} where I is an interval in and
xi : →
Time series observations annual trend of temperature or
precipitation, prices of commodities, heights of children versus
age e.t.c.
9 / 25
Motivation for Functional Band Depth
Challenge with regular multivariate analysis of functions
Curve ensembles that are sampled at different points.
Curse of dimensionality in case of current methods (e.g.
PCA).
Contribution by [L´opez-Pintado et. al. 2009]
Given an ensemble of functions (sampled from a distribution),
a formulation of data depth associated with the function.
10 / 25
Functional Band Depth Formulation
Figure: A functional band [Lopez-Pintado et. al. 2009].
Functional band formulation:
g ⊂ B(f1, · · · , fj ) iff ∀x min
i∈{1...j}
{fi(x)} ≤ g(x) ≤ max
i∈{1...j}
{fi(x)}
(1)
Functional band depth formulation:
BDj (g) = P (g ⊂ B(f1, · · · , fj)) (2)
11 / 25
Visualization of Data Depth for Functions
Figure: Visualization of function
ensemble [Lopez-Pintados et. al.
2009].
Figure: Boxplot visualization of
function ensemble [Sun et. al. 2011,
Whitaker et. al. 2013].
12 / 25
Multivariate Curve Ensembles
A parameterized curve can be defined in terms
of an independent parameter s as:
c(s) = ˜x(s) c : D → R D ⊂ R, R ⊂ Rd
Hurricane paths.
Brain tractography data.
Pathline ensemble in fluid simulation. Figure: A synthetic
ensemble of
multivariate curves in
[Mirzargar et. al.
2014]
13 / 25
Data Depth Formulation for Multivariate Curves
(a) (b)
Figure: Band formed by 3 multivariate curves [Lopez-Pintado et. al.
2014, Mirzargar et. al. 2014]
Curve band formulation:
g ⊂ B(ci1 , · · · , cij
) iff ∀x g(x) ∈ simplex ci1 (x), · · · , cij (x)
(3)
Curve band depth formulation:
SBDj (g) = P g ⊂ B(fc1 , · · · , cij ) (4)
14 / 25
Visualization of Data Depth for Curves
Figure: Chinese Script replicated
100 times [Lopez-Pintado 2014].
Figure: Curve boxplot for hurricane
path ensemble [Mirzargar et. al.
2014]
15 / 25
Set / Isocontour Ensembles
Given an ensemble of real valued functions
f (x, y), the sublevel and superlevel sets for any
particular isovalue.
Isocontours of temperature field.
Isocontours of pressure field in fluid
dynamics simulations.
Figure: A synthetic
ensemble of contours
in [Whitaker et. al.
2013]
16 / 25
Data Depth Formulation for Sets
Figure: Examples of set band [Whitaker et. al. 2013]
Set band formulation:
S ∈ sB(S1, . . . , Dj ) ↔
j
k=1
Sk ⊂ S ⊂
j
k=1
Sk (5)
Set band depth formulation:
sBDj (S) = P (S ⊂ sB(S1, . . . , Sj ) (6)
17 / 25
Visualization of Data Depth for Sets
(a)
(b)
Figure: Contour boxplot for an ensemble of isocontours of pressure field
[Whitaker et. al. 2013]
18 / 25
Paths (on a graph)
Let G = {V , E, W }. A path p can be denoted
as p : I → V where index set I = (1, . . . , m)
Paths of packets in computer networks.
Paths on transportation networks
modelled as graphs.
Figure: A synthetic
ensemble of paths on
a graph.
19 / 25
Data Depth Formulation for Paths
Figure: Illustration of band formed by 3 paths.
Path band formulation:
p ∈ B[Pj ] iff p(l) ∈ H[p1(l), . . . , pj (l)] ∀l ∈ I (7)
Path band depth formulation:
pBDj (p) = E [χ(p ∈ B(pj ))] (8)
20 / 25
Visualization of Data Depth for Paths
(a) (b)
Figure: Path boxplots for paths on AS and road graphs.
21 / 25
Outline
1 Data Depth Background
What is Data Depth?
Geometrical Data Depth
General Properties of Data Depth
2 Generalized Notions of Data Depth
Functions
Multivariate Curves
Sets
Paths (on a graph)
3 Discussion
Relaxed Formulations
Advantages and Limitations of Data Depth
22 / 25
Relaxed formulations
1 Modified Band Depth - Instead of an indicator function,
measure object inside the band.
2 Subsets - Indicator function with a relaxed threshold.
23 / 25
Advantages and Limitations
For Combinatorial Data Depth Formulations for Complex Data
Advantages
No assumption required for the underlying distribution.
Captures nonlocal relationships
Robust.
Limitations
Computationally expensive for large ensembles.
24 / 25
Thank You
Questions?
25 / 25

More Related Content

What's hot

Introduction to image processing-Class Notes
Introduction to image processing-Class NotesIntroduction to image processing-Class Notes
Introduction to image processing-Class Notes
Dr.YNM
 
11 praktikum operasi sinyal
11 praktikum operasi sinyal11 praktikum operasi sinyal
11 praktikum operasi sinyal
Simon Patabang
 
Defying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital ConversionDefying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital Conversion
Distinguished Lecturer Series - Leon The Mathematician
 
Graph terminologies & special type graphs
Graph terminologies & special type graphsGraph terminologies & special type graphs
Graph terminologies & special type graphs
Nabeel Ahsen
 
Telekomunikasi Analog dan Digital - Slide week 7 derau dalam sistem komunikasi
Telekomunikasi Analog dan Digital - Slide week 7   derau dalam sistem komunikasiTelekomunikasi Analog dan Digital - Slide week 7   derau dalam sistem komunikasi
Telekomunikasi Analog dan Digital - Slide week 7 derau dalam sistem komunikasiBeny Nugraha
 
Run-Length Encoding algorithm
Run-Length Encoding algorithmRun-Length Encoding algorithm
Run-Length Encoding algorithm
Hyeon Sik Song
 
DSP, Differences between Fourier series ,Fourier Transform and Z transform
DSP, Differences between  Fourier series ,Fourier Transform and Z transform DSP, Differences between  Fourier series ,Fourier Transform and Z transform
DSP, Differences between Fourier series ,Fourier Transform and Z transform
Naresh Biloniya
 
The Digital Image Processing Q@A
The Digital Image Processing Q@AThe Digital Image Processing Q@A
The Digital Image Processing Q@A
Chung Hua Universit
 
Parseval's Theorem
Parseval's TheoremParseval's Theorem
Parseval's Theorem
COMSATS Abbottabad
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentation
Haitham Ahmed
 
Makalah perbedaan analog dan digital
Makalah perbedaan analog dan digitalMakalah perbedaan analog dan digital
Makalah perbedaan analog dan digitalEsir R UKI Toraja
 
Wavelet based image compression technique
Wavelet based image compression techniqueWavelet based image compression technique
Wavelet based image compression techniquePriyanka Pachori
 
SPIHT(Set Partitioning In Hierarchical Trees)
SPIHT(Set Partitioning In Hierarchical Trees)SPIHT(Set Partitioning In Hierarchical Trees)
SPIHT(Set Partitioning In Hierarchical Trees)
M.k. Praveen
 
1. Sinyal (1).ppt
1. Sinyal (1).ppt1. Sinyal (1).ppt
1. Sinyal (1).ppt
ndah11
 
Multimodal Assessment of Parkinson’s Disease: A Deep Learning Approach
Multimodal Assessment of Parkinson’s Disease: A Deep Learning ApproachMultimodal Assessment of Parkinson’s Disease: A Deep Learning Approach
Multimodal Assessment of Parkinson’s Disease: A Deep Learning Approach
Juan Camilo Vasquez
 
History and definition of statistics
History and definition of statisticsHistory and definition of statistics
History and definition of statistics
Muhammad Kamran
 
Fundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image ComponentsFundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image Components
Kalyan Acharjya
 
Signals & systems
Signals & systems Signals & systems
Signals & systems
SathyaVigneshR
 

What's hot (20)

Introduction to image processing-Class Notes
Introduction to image processing-Class NotesIntroduction to image processing-Class Notes
Introduction to image processing-Class Notes
 
analisis kluster
analisis klusteranalisis kluster
analisis kluster
 
11 praktikum operasi sinyal
11 praktikum operasi sinyal11 praktikum operasi sinyal
11 praktikum operasi sinyal
 
Defying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital ConversionDefying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital Conversion
 
Patent 4 UK Certificate-of-Grant
Patent 4 UK Certificate-of-GrantPatent 4 UK Certificate-of-Grant
Patent 4 UK Certificate-of-Grant
 
Graph terminologies & special type graphs
Graph terminologies & special type graphsGraph terminologies & special type graphs
Graph terminologies & special type graphs
 
Telekomunikasi Analog dan Digital - Slide week 7 derau dalam sistem komunikasi
Telekomunikasi Analog dan Digital - Slide week 7   derau dalam sistem komunikasiTelekomunikasi Analog dan Digital - Slide week 7   derau dalam sistem komunikasi
Telekomunikasi Analog dan Digital - Slide week 7 derau dalam sistem komunikasi
 
Run-Length Encoding algorithm
Run-Length Encoding algorithmRun-Length Encoding algorithm
Run-Length Encoding algorithm
 
DSP, Differences between Fourier series ,Fourier Transform and Z transform
DSP, Differences between  Fourier series ,Fourier Transform and Z transform DSP, Differences between  Fourier series ,Fourier Transform and Z transform
DSP, Differences between Fourier series ,Fourier Transform and Z transform
 
The Digital Image Processing Q@A
The Digital Image Processing Q@AThe Digital Image Processing Q@A
The Digital Image Processing Q@A
 
Parseval's Theorem
Parseval's TheoremParseval's Theorem
Parseval's Theorem
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentation
 
Makalah perbedaan analog dan digital
Makalah perbedaan analog dan digitalMakalah perbedaan analog dan digital
Makalah perbedaan analog dan digital
 
Wavelet based image compression technique
Wavelet based image compression techniqueWavelet based image compression technique
Wavelet based image compression technique
 
SPIHT(Set Partitioning In Hierarchical Trees)
SPIHT(Set Partitioning In Hierarchical Trees)SPIHT(Set Partitioning In Hierarchical Trees)
SPIHT(Set Partitioning In Hierarchical Trees)
 
1. Sinyal (1).ppt
1. Sinyal (1).ppt1. Sinyal (1).ppt
1. Sinyal (1).ppt
 
Multimodal Assessment of Parkinson’s Disease: A Deep Learning Approach
Multimodal Assessment of Parkinson’s Disease: A Deep Learning ApproachMultimodal Assessment of Parkinson’s Disease: A Deep Learning Approach
Multimodal Assessment of Parkinson’s Disease: A Deep Learning Approach
 
History and definition of statistics
History and definition of statisticsHistory and definition of statistics
History and definition of statistics
 
Fundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image ComponentsFundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image Components
 
Signals & systems
Signals & systems Signals & systems
Signals & systems
 

Viewers also liked

Between the two ages
Between the two agesBetween the two ages
Between the two ages
Mohsen Youssef
 
The barbecue at pidgeon court
The barbecue at pidgeon courtThe barbecue at pidgeon court
The barbecue at pidgeon court
Bernard Tisman
 
Yeni microsoft office power point sunusu
Yeni microsoft office power point sunusuYeni microsoft office power point sunusu
Yeni microsoft office power point sunusuOğuzhan Özekinci
 
The worktatorship
The  worktatorshipThe  worktatorship
The worktatorship
Bernard Tisman
 
Img 2637.jpg
Img 2637.jpgImg 2637.jpg
Img 2637.jpg
Nitin Jolly
 
Satellite Madrid Informe
Satellite Madrid InformeSatellite Madrid Informe
Satellite Madrid InformeNieves Alonso
 
Del modelo manicomial al modelo comunitario
Del modelo manicomial al modelo comunitarioDel modelo manicomial al modelo comunitario
Del modelo manicomial al modelo comunitario
Vanessa Herrera Lopez
 
Presentation on Nutritional Supplements
 Presentation on Nutritional Supplements  Presentation on Nutritional Supplements
Presentation on Nutritional Supplements
Pratheesh Jacob
 
Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016
Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016
Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016
Diana Benner
 
Cultura fenicia diapositivas info
Cultura fenicia diapositivas infoCultura fenicia diapositivas info
Cultura fenicia diapositivas info
Pam Pompa
 

Viewers also liked (11)

Between the two ages
Between the two agesBetween the two ages
Between the two ages
 
Man eating bus
Man eating busMan eating bus
Man eating bus
 
The barbecue at pidgeon court
The barbecue at pidgeon courtThe barbecue at pidgeon court
The barbecue at pidgeon court
 
Yeni microsoft office power point sunusu
Yeni microsoft office power point sunusuYeni microsoft office power point sunusu
Yeni microsoft office power point sunusu
 
The worktatorship
The  worktatorshipThe  worktatorship
The worktatorship
 
Img 2637.jpg
Img 2637.jpgImg 2637.jpg
Img 2637.jpg
 
Satellite Madrid Informe
Satellite Madrid InformeSatellite Madrid Informe
Satellite Madrid Informe
 
Del modelo manicomial al modelo comunitario
Del modelo manicomial al modelo comunitarioDel modelo manicomial al modelo comunitario
Del modelo manicomial al modelo comunitario
 
Presentation on Nutritional Supplements
 Presentation on Nutritional Supplements  Presentation on Nutritional Supplements
Presentation on Nutritional Supplements
 
Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016
Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016
Social Media: Legal Pitfalls and Best Practices - SXSWedu 2016
 
Cultura fenicia diapositivas info
Cultura fenicia diapositivas infoCultura fenicia diapositivas info
Cultura fenicia diapositivas info
 

Similar to Generalized Notions of Data Depth

R and Visualization: A match made in Heaven
R and Visualization: A match made in HeavenR and Visualization: A match made in Heaven
R and Visualization: A match made in Heaven
Edureka!
 
Application of Graph Theory in Computer science using Data Structure.pdf
Application of Graph Theory in Computer science using Data Structure.pdfApplication of Graph Theory in Computer science using Data Structure.pdf
Application of Graph Theory in Computer science using Data Structure.pdf
Nancy Ideker
 
Multimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution MethodMultimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution Method
IJERA Editor
 
Interval Pattern Structures: An introdution
Interval Pattern Structures: An introdutionInterval Pattern Structures: An introdution
Interval Pattern Structures: An introdution
INSA Lyon - L'Institut National des Sciences Appliquées de Lyon
 
How to Decide the Best Fuzzy Model in ANFIS
How to Decide the Best Fuzzy Model in ANFIS How to Decide the Best Fuzzy Model in ANFIS
Dual-time Modeling and Forecasting in Consumer Banking (2016)
Dual-time Modeling and Forecasting in Consumer Banking (2016)Dual-time Modeling and Forecasting in Consumer Banking (2016)
Dual-time Modeling and Forecasting in Consumer Banking (2016)
Aijun Zhang
 
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
NTNU
 
Introduction to GIS
Introduction to GISIntroduction to GIS
Introduction to GIS
Uday kumar Devalla
 
Triggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphsTriggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphs
INSA Lyon - L'Institut National des Sciences Appliquées de Lyon
 
Pre-computation for ABC in image analysis
Pre-computation for ABC in image analysisPre-computation for ABC in image analysis
Pre-computation for ABC in image analysis
Matt Moores
 
Data Compression in Data mining and Business Intelligencs
Data Compression in Data mining and Business Intelligencs Data Compression in Data mining and Business Intelligencs
Data Compression in Data mining and Business Intelligencs
ShahDhruv21
 
A Diffusion Wavelet Approach For 3 D Model Matching
A Diffusion Wavelet Approach For 3 D Model MatchingA Diffusion Wavelet Approach For 3 D Model Matching
A Diffusion Wavelet Approach For 3 D Model Matchingrafi
 
Estimation, Detection & Comparison of Soil Nutrients using Matlab
Estimation, Detection & Comparison of Soil Nutrients using MatlabEstimation, Detection & Comparison of Soil Nutrients using Matlab
Estimation, Detection & Comparison of Soil Nutrients using Matlab
IRJET Journal
 
C documents and settings_administrator_local settings_application data_mozil...
C  documents and settings_administrator_local settings_application data_mozil...C  documents and settings_administrator_local settings_application data_mozil...
C documents and settings_administrator_local settings_application data_mozil...Anuar Ahmad
 
Computed Tomography Image Reconstruction in 3D VoxelSpace
Computed Tomography Image Reconstruction in 3D VoxelSpaceComputed Tomography Image Reconstruction in 3D VoxelSpace
Computed Tomography Image Reconstruction in 3D VoxelSpace
International Journal of Modern Research in Engineering and Technology
 
Visual analysis of large graphs state of the art and future research challenges
Visual analysis of large graphs state of the art and future research challengesVisual analysis of large graphs state of the art and future research challenges
Visual analysis of large graphs state of the art and future research challenges
Asliza Hamzah
 

Similar to Generalized Notions of Data Depth (20)

R and Visualization: A match made in Heaven
R and Visualization: A match made in HeavenR and Visualization: A match made in Heaven
R and Visualization: A match made in Heaven
 
Lec-3 DIP.pptx
Lec-3 DIP.pptxLec-3 DIP.pptx
Lec-3 DIP.pptx
 
Application of Graph Theory in Computer science using Data Structure.pdf
Application of Graph Theory in Computer science using Data Structure.pdfApplication of Graph Theory in Computer science using Data Structure.pdf
Application of Graph Theory in Computer science using Data Structure.pdf
 
Multimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution MethodMultimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution Method
 
Interval Pattern Structures: An introdution
Interval Pattern Structures: An introdutionInterval Pattern Structures: An introdution
Interval Pattern Structures: An introdution
 
How to Decide the Best Fuzzy Model in ANFIS
How to Decide the Best Fuzzy Model in ANFIS How to Decide the Best Fuzzy Model in ANFIS
How to Decide the Best Fuzzy Model in ANFIS
 
Dual-time Modeling and Forecasting in Consumer Banking (2016)
Dual-time Modeling and Forecasting in Consumer Banking (2016)Dual-time Modeling and Forecasting in Consumer Banking (2016)
Dual-time Modeling and Forecasting in Consumer Banking (2016)
 
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
 
R Language Introduction
R Language IntroductionR Language Introduction
R Language Introduction
 
Introduction to GIS
Introduction to GISIntroduction to GIS
Introduction to GIS
 
Triggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphsTriggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphs
 
Pre-computation for ABC in image analysis
Pre-computation for ABC in image analysisPre-computation for ABC in image analysis
Pre-computation for ABC in image analysis
 
Data Compression in Data mining and Business Intelligencs
Data Compression in Data mining and Business Intelligencs Data Compression in Data mining and Business Intelligencs
Data Compression in Data mining and Business Intelligencs
 
A Diffusion Wavelet Approach For 3 D Model Matching
A Diffusion Wavelet Approach For 3 D Model MatchingA Diffusion Wavelet Approach For 3 D Model Matching
A Diffusion Wavelet Approach For 3 D Model Matching
 
Estimation, Detection & Comparison of Soil Nutrients using Matlab
Estimation, Detection & Comparison of Soil Nutrients using MatlabEstimation, Detection & Comparison of Soil Nutrients using Matlab
Estimation, Detection & Comparison of Soil Nutrients using Matlab
 
C documents and settings_administrator_local settings_application data_mozil...
C  documents and settings_administrator_local settings_application data_mozil...C  documents and settings_administrator_local settings_application data_mozil...
C documents and settings_administrator_local settings_application data_mozil...
 
Dtm Quality Assesment
Dtm Quality AssesmentDtm Quality Assesment
Dtm Quality Assesment
 
Computed Tomography Image Reconstruction in 3D VoxelSpace
Computed Tomography Image Reconstruction in 3D VoxelSpaceComputed Tomography Image Reconstruction in 3D VoxelSpace
Computed Tomography Image Reconstruction in 3D VoxelSpace
 
Visual analysis of large graphs state of the art and future research challenges
Visual analysis of large graphs state of the art and future research challengesVisual analysis of large graphs state of the art and future research challenges
Visual analysis of large graphs state of the art and future research challenges
 
PAKDD2013
PAKDD2013PAKDD2013
PAKDD2013
 

Recently uploaded

Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 

Recently uploaded (20)

Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 

Generalized Notions of Data Depth

  • 1. Generalized Notions of Data Depth Spring 2015 Data Reading Seminar Mukund Raj 12th Mar, 2015 1 / 25
  • 2. Outline 1 Data Depth Background What is Data Depth? Geometrical Data Depth General Properties of Data Depth 2 Generalized Notions of Data Depth Functions Multivariate Curves Sets Paths (on a graph) 3 Discussion Relaxed Formulations Advantages and Limitations of Data Depth 2 / 25
  • 3. What is Data Depth? A means of measuring how deep a data point p is within a cloud of points {p1, . . . , pn}. Multivariate data analysis approach to generate order statistics which capture high-dimensional features and relationships. Descriptive nonparametric method of statistical analysis. 3 / 25
  • 4. Why is Data Depth Interesting? Estimate the location from center outward ( with respect to parent distribution ). Identify outliers. Formulate quantitative and graphical methods for analyzing distributional characteristics such as location, scale, e.t.c as well as hypothesis testing. Robustness. 4 / 25
  • 5. Various Formulations of Data Depth Geometrical (for Data in Euclidean Space) L2 depth Mahalanobis depth Oja depth Expected convex hull depth Zonoid depth Simplex depth Half Space depth or Tukey depth or Location depth Generalized (for Complex Data) Functional Band Depth Depth for Multivariate Curves Sets Paths on a Graph 5 / 25
  • 6. Geometrical data depth Depth based on distances / volumes L2 depth Mahalanobis depth Oja depth Depth based on weighted means Zonoid depth Expected Convex Hull depth Depth based on half spaces and simplices Tukey depth Simplicial depth [Mosler 2012] 6 / 25
  • 7. General Properties of Data Depth 1 Zero at infinity 2 Maximality at Center 3 Monotonicity 4 Affine Invariance [Zuo and Serfling, 2000] 7 / 25
  • 8. Outline 1 Data Depth Background What is Data Depth? Geometrical Data Depth General Properties of Data Depth 2 Generalized Notions of Data Depth Functions Multivariate Curves Sets Paths (on a graph) 3 Discussion Relaxed Formulations Advantages and Limitations of Data Depth 8 / 25
  • 9. Function Ensembles A function ensemble can be defined as: {xi (t), i = 1, . . . , n, t ∈ I} where I is an interval in and xi : → Time series observations annual trend of temperature or precipitation, prices of commodities, heights of children versus age e.t.c. 9 / 25
  • 10. Motivation for Functional Band Depth Challenge with regular multivariate analysis of functions Curve ensembles that are sampled at different points. Curse of dimensionality in case of current methods (e.g. PCA). Contribution by [L´opez-Pintado et. al. 2009] Given an ensemble of functions (sampled from a distribution), a formulation of data depth associated with the function. 10 / 25
  • 11. Functional Band Depth Formulation Figure: A functional band [Lopez-Pintado et. al. 2009]. Functional band formulation: g ⊂ B(f1, · · · , fj ) iff ∀x min i∈{1...j} {fi(x)} ≤ g(x) ≤ max i∈{1...j} {fi(x)} (1) Functional band depth formulation: BDj (g) = P (g ⊂ B(f1, · · · , fj)) (2) 11 / 25
  • 12. Visualization of Data Depth for Functions Figure: Visualization of function ensemble [Lopez-Pintados et. al. 2009]. Figure: Boxplot visualization of function ensemble [Sun et. al. 2011, Whitaker et. al. 2013]. 12 / 25
  • 13. Multivariate Curve Ensembles A parameterized curve can be defined in terms of an independent parameter s as: c(s) = ˜x(s) c : D → R D ⊂ R, R ⊂ Rd Hurricane paths. Brain tractography data. Pathline ensemble in fluid simulation. Figure: A synthetic ensemble of multivariate curves in [Mirzargar et. al. 2014] 13 / 25
  • 14. Data Depth Formulation for Multivariate Curves (a) (b) Figure: Band formed by 3 multivariate curves [Lopez-Pintado et. al. 2014, Mirzargar et. al. 2014] Curve band formulation: g ⊂ B(ci1 , · · · , cij ) iff ∀x g(x) ∈ simplex ci1 (x), · · · , cij (x) (3) Curve band depth formulation: SBDj (g) = P g ⊂ B(fc1 , · · · , cij ) (4) 14 / 25
  • 15. Visualization of Data Depth for Curves Figure: Chinese Script replicated 100 times [Lopez-Pintado 2014]. Figure: Curve boxplot for hurricane path ensemble [Mirzargar et. al. 2014] 15 / 25
  • 16. Set / Isocontour Ensembles Given an ensemble of real valued functions f (x, y), the sublevel and superlevel sets for any particular isovalue. Isocontours of temperature field. Isocontours of pressure field in fluid dynamics simulations. Figure: A synthetic ensemble of contours in [Whitaker et. al. 2013] 16 / 25
  • 17. Data Depth Formulation for Sets Figure: Examples of set band [Whitaker et. al. 2013] Set band formulation: S ∈ sB(S1, . . . , Dj ) ↔ j k=1 Sk ⊂ S ⊂ j k=1 Sk (5) Set band depth formulation: sBDj (S) = P (S ⊂ sB(S1, . . . , Sj ) (6) 17 / 25
  • 18. Visualization of Data Depth for Sets (a) (b) Figure: Contour boxplot for an ensemble of isocontours of pressure field [Whitaker et. al. 2013] 18 / 25
  • 19. Paths (on a graph) Let G = {V , E, W }. A path p can be denoted as p : I → V where index set I = (1, . . . , m) Paths of packets in computer networks. Paths on transportation networks modelled as graphs. Figure: A synthetic ensemble of paths on a graph. 19 / 25
  • 20. Data Depth Formulation for Paths Figure: Illustration of band formed by 3 paths. Path band formulation: p ∈ B[Pj ] iff p(l) ∈ H[p1(l), . . . , pj (l)] ∀l ∈ I (7) Path band depth formulation: pBDj (p) = E [χ(p ∈ B(pj ))] (8) 20 / 25
  • 21. Visualization of Data Depth for Paths (a) (b) Figure: Path boxplots for paths on AS and road graphs. 21 / 25
  • 22. Outline 1 Data Depth Background What is Data Depth? Geometrical Data Depth General Properties of Data Depth 2 Generalized Notions of Data Depth Functions Multivariate Curves Sets Paths (on a graph) 3 Discussion Relaxed Formulations Advantages and Limitations of Data Depth 22 / 25
  • 23. Relaxed formulations 1 Modified Band Depth - Instead of an indicator function, measure object inside the band. 2 Subsets - Indicator function with a relaxed threshold. 23 / 25
  • 24. Advantages and Limitations For Combinatorial Data Depth Formulations for Complex Data Advantages No assumption required for the underlying distribution. Captures nonlocal relationships Robust. Limitations Computationally expensive for large ensembles. 24 / 25