SlideShare a Scribd company logo
On the Smallest Enclosing Information Disk
Frank Nielsen1

Richard Nock2

1 Sony

Computer Science Laboratories, Inc.
Fundamental Research Laboratory
Frank.Nielsen@acm.org
2 University

of Antilles-Guyanne
DSI-GRIMAAG
Richard.Nock@martinique.univ-ag.fr

August 2006

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Smallest Enclosing Balls
Problem
Given S = {s1 , ..., sn }, compute a simplified description, called
the center, that fits well S (i.e., summarizes S).
Two optimization criteria:
M IN AVG Find a center c∗ which minimizes the average
distortion w.r.t S: c∗ = argminc i d(c, si ).
M IN M AX Find a center c∗ which minimizes the maximal
distortion w.r.t S: c∗ = argminc maxi d(c, si ).
Investigated in Applied Mathematics:
Computational geometry (1-center problem),
Computational statistics (1-point estimator),
Machine learning (1-class classification),
F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Smallest Enclosing Balls in Computational Geometry
Distortion measure d(·, ·) is the geometric distance:
Euclidean distance L2 .
c∗ is the circumcenter of S for M IN M AX,
Squared Euclidean distance L2 .
2
c∗ is the centroid of S for M IN AVG (→ k -means),
Euclidean distance L2 .
c∗ is the Fermat-Weber point for M IN AVG.

Centroid
M IN AVG L2
2

Circumcenter
M IN M AX L2

F. Nielsen and R. Nock

Fermat-Weber
M IN AVG L2

On the Smallest Enclosing Information Disk
M IN M AX in Computational Geometry (MINIBALL)
M IN M AX point set
Smallest Enclosing Ball [NN’04]
Pioneered by Sylvester (1857),
Unique circumcenter c∗ (radius r ∗ ),
LP-type, linear-time randomized
algorithm (fixed dimension d),
Weakly polynomial.
Efficient SOCP numerical solver,
Fast combinatorial heuristics
(d ≥ 1000).

M IN M AX ball set
F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Distortions: Bregman Divergences
Definition
Bregman divergences are parameterized (F ) families of
distortions.
Let F : X −→ R, such that F is strictly convex and differentiable
on int(X ), for a convex domain X ⊆ Rd .
Bregman divergence DF :
DF (x, y) = F (x) − F (y) − x − y,

F

·, ·

:
:

F (y)

.

gradient operator of F
Inner product (dot product)
(→ DF is the tail of a Taylor expansion of F )

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Visualizing F and DF
F (·)

DF (x, y)
x − y,

y

F (y)

x

DF (x, y) = F (x) − F (y) − x − y,

F (y)

.

(→ DF is the a truncated Taylor expansion of F )
F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Bregman Balls (Information Balls)

Euclidean Ball: Bc,r = {x ∈ X : x − c
(r : squared radius.

L2 :
2

2
2

≤ r}

Bregman divergence F (x) =

d
i=1

xi2 )

Theorem [BMDG’04]
The M IN AVG Ball for Bregman divergences is the centroid .

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Two types of Bregman balls

First-type:
Bc,r = {x ∈ X : DF ( c , x) ≤ r },
Second-type:
Bc,r = {x ∈ X : DF (x, c ) ≤ r }
Lemma
The smallest enclosing Bregman balls Bc∗ ,r ∗ and B c∗ ,r ∗ of S
are unique.
−→ Consider first-type Bregman balls.
(The second-type is obtained as a first-type ball on the dual
divergence DF ∗ using the Legendre-Fenchel transformation.)

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Applications of Bregman Balls
Circumcenters of the smallest enclosing Bregman balls encode:
Euclidean squared distance.
The closest point to a set of points.
d

(qi − pi )2 = ||p||2 + ||q||2 − 2 p, q .

DF (p, q) =
i=1

Itakura-Saito divergence. The closest (sound) signal to a set of
signals (speech recognition).
d

DF (p, q) =
i=1

pi
pi
( − log − 1), [← F (x) = −
qi
qi

d

log xi ]
i=1

Kullback-Leibler. The closest distribution to a set of
distributions (density estimation).
d

DF (p, q) =
i=1

pi
pi log − pi + qi , [F (x) = −
qi

F. Nielsen and R. Nock

d

xi log xi ]
i=1

On the Smallest Enclosing Information Disk
Information Disks
Problem
Given a set S = {s1 , ..., sn } of n 2D vector points, compute the
M IN M AX center: c∗ = argminc maxi d(c, si ).
handle geometric points for various distortions,
handle parametric distributions
(e.g., Normal distributions are parameterized by (µ, σ)).

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Information Disk is LP-type
Monotonicity. For any F and G such that F ⊆ G ⊆ X ,
r ∗ (F) ≤ r ∗ (G).
Locality. For any F and G such that F ⊆ G ⊆ X with
r ∗ (F) = r ∗ (G), and any point p ∈ X ,
r ∗ (G) < r ∗ (G ∪ {p}) → r ∗ (F) < r ∗ (F ∪ {p}).
M INI I NFO B ALL(S = {p1 , ..., pn }, B):
Initially B = ∅. Returns B ∗ = (c∗ , r ∗ )
IF |S ∪ B| ≤ 3
RETURN B=S OLVE I NFO B ASIS(S ∪ B)
ELSE
Select at random p ∈ S
B ∗ =M INI I NFO B ALL(S{p}, B)
IF p ∈ B ∗
Then add p to the basis
M INI I NFO B ALL(S{p}, B ∪ {p})
F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Computing basis (S OLVE I NFO B ASIS)
Lemma
The first-type Bregman bisector
Bisector(p, q) = {c ∈ X | DF (c, p) = DF (c, q)} is linear.
This is a linear equation in c (an hyperplane). Bisector
Bisector(p, q) = {x | x, dpq + kpq = 0} with
dpq = F (p) − F (q) a vector, and
kpq = F (p) − F (q) + q, F (q) − p, F (p) a constant

(Itakura-Saito divergence)
F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Computing basis (S OLVE I NFO B ASIS)
Basis 3 : The circumcenter is the trisector.
(intersection of 3 linear bisectors, enough to consider any two
of them).
c∗ = l12 × l13 = l12 × l23 = l13 × l23 ,
lij : projective point associated to the linear bisector
Bisector(pi , pj ) (×: cross-product)

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Computing basis (S OLVE I NFO B ASIS)
Basis 2 : Either minimize DF (c, p) s.t. c∗ ∈ Bisector(p, q), or
better perform a logarithmic search on λ ∈ [0, 1] s. t.
rλ = F −1 ((1 − λ) F (p) + λ F (q)) is on the geodesic of pq
( F −1 : reciprocal gradient).

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Live Demo

http://www.csl.sony.co.jp/person/nielsen/
BregmanBall/MINIBALL/

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Statistical application example

Univariate Normal law distribution:
2
1
N(x|µ, σ) = √ exp(− (x−µ) ).
2σ 2
σ 2π

Consider the Kullback-Leibler divergence of two distributions:
f (x)
KL(f , g) = x f (x) log g(x) .
Canonical form of an exponential family:
N(x|µ, σ) = √ 1
exp{ θ, f(x) } with:
2πZ (θ)

2

µ
Z (θ) = σ exp{ 2σ2 } =

θ2

1
2
− 2θ1 exp{− 4θ1 },

f(x) = [x 2 x]T : sufficient statistics,
1
θ = [− 2σ2

µ T
] :
σ2

natural parameters.

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Kullback-Leibler of parametric exponential family is a Bregman
divergence for F = log Z .
Z (θ )
KL(θp ||θq ) = DF (θp , θq ) = (θp − θq ), θp [f] + log Z (θq )
p
θp [f] =

x2
x Z (θp )
x
x Z (θp )

exp{ θp , f(x) }
exp{ θp , f(x) }

=

2
µ2 + σp
p
µp

Z (θ )

p
Bisector (θp − θq ), θc [f] + log Z (θq ) = 0.

1D Gaussian distribution: change variables
(µ, σ) → (µ2 + σ 2 , µ) = (x, y) (with x > y > 0).
y2
It comes Z (x, y ) = x − y 2 exp{ 2(x−y 2 ) },
log Z (x, y ) = log
F (x, y)

y2
and
2(x−y 2 )
y2
y3
,
).
2(x−y 2 ) (x−y 2 )2

x − y2 +

1
= ( 2(x−y 2 ) −

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Statistical application example (cont’d)

M IN M AX: (µ∗ , σ ∗ )
r∗

(2.67446, 1.08313) and

0.801357,

M IN AVG: (µ∗ , σ ∗ ) = (2.40909, 1.10782).
Note that KL(Ni , Nj ) =

1
2

σi2
σj2

σ

+ 2 log σji − 1 +

F. Nielsen and R. Nock

(µj −µi )2
σj2

.

On the Smallest Enclosing Information Disk
Java Applet online:
www.csl.sony.co.jp/person/nielsen/BregmanBall/
MINIBALL/

Source code: Basic MiniBall, Line intersection by
projective geometry
Visual Computing: Geometry, Graphics, and Vision, ISBN
1-58450-427-7, 2005.
˘
In high dimensions, extend Badoiu & Clarkson core-set
See On approximating the smallest enclosing Bregman
Balls (SoCG’06 video)
F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
3D Bregman balls (video)

Relative entropy (KL)

F. Nielsen and R. Nock

Itakura-Saito

On the Smallest Enclosing Information Disk
Bregman Voronoi/Delaunay

EXP

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk
Bibliography

Welzl, ”Smallest Enclosing Disks (Balls and Ellipsoids)”,
LNCS 555:359-370, 1991.
Crammer & Singer, ”Learning Algorithms for Enclosing
Points in Bregmanian Spheres”, COLT03.
Nock & Nielsen, ”Fitting the smallest Bregman ball”,
ECML05 (SoCG06 video).
Banerjee et. al, ”Clustering with Bregman divergences” ,
JMLR05.

F. Nielsen and R. Nock

On the Smallest Enclosing Information Disk

More Related Content

What's hot

Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
Computing F-blowups
Computing F-blowupsComputing F-blowups
Computing F-blowups
Takehiko Yasuda
 
Notes up to_ch7_sec3
Notes up to_ch7_sec3Notes up to_ch7_sec3
Notes up to_ch7_sec3
Leonardo Nosce
 
On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...
On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...
On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...
BRNSS Publication Hub
 
Cubic BF-Algebra
Cubic BF-AlgebraCubic BF-Algebra
Cubic BF-Algebra
AM Publications
 
Números primos repunits
Números primos repunitsNúmeros primos repunits
Números primos repunits
lenixez
 
The wild McKay correspondence
The wild McKay correspondenceThe wild McKay correspondence
The wild McKay correspondence
Takehiko Yasuda
 
Calc 7.1b
Calc 7.1bCalc 7.1b
Calc 7.1b
hartcher
 
Analysis Solutions CVI
Analysis Solutions CVIAnalysis Solutions CVI
Analysis Solutions CVI
Leonardo Di Giosia
 
Business math
Business mathBusiness math
Business math
sanjida_yeasmin
 
Week 2 - Trigonometry
Week 2 - TrigonometryWeek 2 - Trigonometry
Week 2 - Trigonometry
Carlos Vázquez
 
Definite Integral Review
Definite Integral ReviewDefinite Integral Review
Definite Integral Review
Sharon Henry
 
Variations on the method of Coleman-Chabauty
Variations on the method of Coleman-ChabautyVariations on the method of Coleman-Chabauty
Variations on the method of Coleman-Chabauty
mmasdeu
 
Pc12 sol c04_4-2
Pc12 sol c04_4-2Pc12 sol c04_4-2
Pc12 sol c04_4-2
Garden City
 
Week 3 - Trigonometry
Week 3 - TrigonometryWeek 3 - Trigonometry
Week 3 - Trigonometry
Carlos Vázquez
 
Lesson 25: Evaluating Definite Integrals (Section 10 version)
Lesson 25: Evaluating Definite Integrals (Section 10 version)Lesson 25: Evaluating Definite Integrals (Section 10 version)
Lesson 25: Evaluating Definite Integrals (Section 10 version)
Matthew Leingang
 
Non-informative reparametrisation for location-scale mixtures
Non-informative reparametrisation for location-scale mixturesNon-informative reparametrisation for location-scale mixtures
Non-informative reparametrisation for location-scale mixtures
Christian Robert
 
Approximate Integration
Approximate IntegrationApproximate Integration
Approximate Integration
Silvius
 

What's hot (18)

Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Computing F-blowups
Computing F-blowupsComputing F-blowups
Computing F-blowups
 
Notes up to_ch7_sec3
Notes up to_ch7_sec3Notes up to_ch7_sec3
Notes up to_ch7_sec3
 
On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...
On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...
On Some Geometrical Properties of Proximal Sets and Existence of Best Proximi...
 
Cubic BF-Algebra
Cubic BF-AlgebraCubic BF-Algebra
Cubic BF-Algebra
 
Números primos repunits
Números primos repunitsNúmeros primos repunits
Números primos repunits
 
The wild McKay correspondence
The wild McKay correspondenceThe wild McKay correspondence
The wild McKay correspondence
 
Calc 7.1b
Calc 7.1bCalc 7.1b
Calc 7.1b
 
Analysis Solutions CVI
Analysis Solutions CVIAnalysis Solutions CVI
Analysis Solutions CVI
 
Business math
Business mathBusiness math
Business math
 
Week 2 - Trigonometry
Week 2 - TrigonometryWeek 2 - Trigonometry
Week 2 - Trigonometry
 
Definite Integral Review
Definite Integral ReviewDefinite Integral Review
Definite Integral Review
 
Variations on the method of Coleman-Chabauty
Variations on the method of Coleman-ChabautyVariations on the method of Coleman-Chabauty
Variations on the method of Coleman-Chabauty
 
Pc12 sol c04_4-2
Pc12 sol c04_4-2Pc12 sol c04_4-2
Pc12 sol c04_4-2
 
Week 3 - Trigonometry
Week 3 - TrigonometryWeek 3 - Trigonometry
Week 3 - Trigonometry
 
Lesson 25: Evaluating Definite Integrals (Section 10 version)
Lesson 25: Evaluating Definite Integrals (Section 10 version)Lesson 25: Evaluating Definite Integrals (Section 10 version)
Lesson 25: Evaluating Definite Integrals (Section 10 version)
 
Non-informative reparametrisation for location-scale mixtures
Non-informative reparametrisation for location-scale mixturesNon-informative reparametrisation for location-scale mixtures
Non-informative reparametrisation for location-scale mixtures
 
Approximate Integration
Approximate IntegrationApproximate Integration
Approximate Integration
 

Similar to On the smallest enclosing information disk

Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Frank Nielsen
 
Lecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhsLecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhs
Stéphane Canu
 
On Approximating the Smallest Enclosing Bregman Balls
On Approximating the Smallest Enclosing Bregman Balls On Approximating the Smallest Enclosing Bregman Balls
On Approximating the Smallest Enclosing Bregman Balls
Frank Nielsen
 
Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...
Frank Nielsen
 
QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017
Fred J. Hickernell
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
The Statistical and Applied Mathematical Sciences Institute
 
Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...
Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...
Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...
Frank Nielsen
 
k-MLE: A fast algorithm for learning statistical mixture models
k-MLE: A fast algorithm for learning statistical mixture modelsk-MLE: A fast algorithm for learning statistical mixture models
k-MLE: A fast algorithm for learning statistical mixture models
Frank Nielsen
 
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Frank Nielsen
 
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Frank Nielsen
 
Bayesian inference on mixtures
Bayesian inference on mixturesBayesian inference on mixtures
Bayesian inference on mixtures
Christian Robert
 
Patch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective DivergencesPatch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective Divergences
Frank Nielsen
 
On approximating the Riemannian 1-center
On approximating the Riemannian 1-centerOn approximating the Riemannian 1-center
On approximating the Riemannian 1-center
Frank Nielsen
 
Murphy: Machine learning A probabilistic perspective: Ch.9
Murphy: Machine learning A probabilistic perspective: Ch.9Murphy: Machine learning A probabilistic perspective: Ch.9
Murphy: Machine learning A probabilistic perspective: Ch.9
Daisuke Yoneoka
 
Slides: The Burbea-Rao and Bhattacharyya centroids
Slides: The Burbea-Rao and Bhattacharyya centroidsSlides: The Burbea-Rao and Bhattacharyya centroids
Slides: The Burbea-Rao and Bhattacharyya centroids
Frank Nielsen
 
Slides: A glance at information-geometric signal processing
Slides: A glance at information-geometric signal processingSlides: A glance at information-geometric signal processing
Slides: A glance at information-geometric signal processing
Frank Nielsen
 
On learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodOn learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihood
Frank Nielsen
 
Evaluating definite integrals
Evaluating definite integralsEvaluating definite integrals
Evaluating definite integrals
منتدى الرياضيات المتقدمة
 
Compressive Spectral Image Sensing, Processing, and Optimization
Compressive Spectral Image Sensing, Processing, and OptimizationCompressive Spectral Image Sensing, Processing, and Optimization
Compressive Spectral Image Sensing, Processing, and Optimization
Distinguished Lecturer Series - Leon The Mathematician
 
Slides: The Centroids of Symmetrized Bregman Divergences
Slides: The Centroids of Symmetrized Bregman DivergencesSlides: The Centroids of Symmetrized Bregman Divergences
Slides: The Centroids of Symmetrized Bregman Divergences
Frank Nielsen
 

Similar to On the smallest enclosing information disk (20)

Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
 
Lecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhsLecture4 kenrels functions_rkhs
Lecture4 kenrels functions_rkhs
 
On Approximating the Smallest Enclosing Bregman Balls
On Approximating the Smallest Enclosing Bregman Balls On Approximating the Smallest Enclosing Bregman Balls
On Approximating the Smallest Enclosing Bregman Balls
 
Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...
 
QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017QMC Error SAMSI Tutorial Aug 2017
QMC Error SAMSI Tutorial Aug 2017
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...
Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...
Slides: Total Jensen divergences: Definition, Properties and k-Means++ Cluste...
 
k-MLE: A fast algorithm for learning statistical mixture models
k-MLE: A fast algorithm for learning statistical mixture modelsk-MLE: A fast algorithm for learning statistical mixture models
k-MLE: A fast algorithm for learning statistical mixture models
 
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
 
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
 
Bayesian inference on mixtures
Bayesian inference on mixturesBayesian inference on mixtures
Bayesian inference on mixtures
 
Patch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective DivergencesPatch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective Divergences
 
On approximating the Riemannian 1-center
On approximating the Riemannian 1-centerOn approximating the Riemannian 1-center
On approximating the Riemannian 1-center
 
Murphy: Machine learning A probabilistic perspective: Ch.9
Murphy: Machine learning A probabilistic perspective: Ch.9Murphy: Machine learning A probabilistic perspective: Ch.9
Murphy: Machine learning A probabilistic perspective: Ch.9
 
Slides: The Burbea-Rao and Bhattacharyya centroids
Slides: The Burbea-Rao and Bhattacharyya centroidsSlides: The Burbea-Rao and Bhattacharyya centroids
Slides: The Burbea-Rao and Bhattacharyya centroids
 
Slides: A glance at information-geometric signal processing
Slides: A glance at information-geometric signal processingSlides: A glance at information-geometric signal processing
Slides: A glance at information-geometric signal processing
 
On learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodOn learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihood
 
Evaluating definite integrals
Evaluating definite integralsEvaluating definite integrals
Evaluating definite integrals
 
Compressive Spectral Image Sensing, Processing, and Optimization
Compressive Spectral Image Sensing, Processing, and OptimizationCompressive Spectral Image Sensing, Processing, and Optimization
Compressive Spectral Image Sensing, Processing, and Optimization
 
Slides: The Centroids of Symmetrized Bregman Divergences
Slides: The Centroids of Symmetrized Bregman DivergencesSlides: The Centroids of Symmetrized Bregman Divergences
Slides: The Centroids of Symmetrized Bregman Divergences
 

Recently uploaded

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 

Recently uploaded (20)

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 

On the smallest enclosing information disk

  • 1. On the Smallest Enclosing Information Disk Frank Nielsen1 Richard Nock2 1 Sony Computer Science Laboratories, Inc. Fundamental Research Laboratory Frank.Nielsen@acm.org 2 University of Antilles-Guyanne DSI-GRIMAAG Richard.Nock@martinique.univ-ag.fr August 2006 F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 2. Smallest Enclosing Balls Problem Given S = {s1 , ..., sn }, compute a simplified description, called the center, that fits well S (i.e., summarizes S). Two optimization criteria: M IN AVG Find a center c∗ which minimizes the average distortion w.r.t S: c∗ = argminc i d(c, si ). M IN M AX Find a center c∗ which minimizes the maximal distortion w.r.t S: c∗ = argminc maxi d(c, si ). Investigated in Applied Mathematics: Computational geometry (1-center problem), Computational statistics (1-point estimator), Machine learning (1-class classification), F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 3. Smallest Enclosing Balls in Computational Geometry Distortion measure d(·, ·) is the geometric distance: Euclidean distance L2 . c∗ is the circumcenter of S for M IN M AX, Squared Euclidean distance L2 . 2 c∗ is the centroid of S for M IN AVG (→ k -means), Euclidean distance L2 . c∗ is the Fermat-Weber point for M IN AVG. Centroid M IN AVG L2 2 Circumcenter M IN M AX L2 F. Nielsen and R. Nock Fermat-Weber M IN AVG L2 On the Smallest Enclosing Information Disk
  • 4. M IN M AX in Computational Geometry (MINIBALL) M IN M AX point set Smallest Enclosing Ball [NN’04] Pioneered by Sylvester (1857), Unique circumcenter c∗ (radius r ∗ ), LP-type, linear-time randomized algorithm (fixed dimension d), Weakly polynomial. Efficient SOCP numerical solver, Fast combinatorial heuristics (d ≥ 1000). M IN M AX ball set F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 5. Distortions: Bregman Divergences Definition Bregman divergences are parameterized (F ) families of distortions. Let F : X −→ R, such that F is strictly convex and differentiable on int(X ), for a convex domain X ⊆ Rd . Bregman divergence DF : DF (x, y) = F (x) − F (y) − x − y, F ·, · : : F (y) . gradient operator of F Inner product (dot product) (→ DF is the tail of a Taylor expansion of F ) F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 6. Visualizing F and DF F (·) DF (x, y) x − y, y F (y) x DF (x, y) = F (x) − F (y) − x − y, F (y) . (→ DF is the a truncated Taylor expansion of F ) F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 7. Bregman Balls (Information Balls) Euclidean Ball: Bc,r = {x ∈ X : x − c (r : squared radius. L2 : 2 2 2 ≤ r} Bregman divergence F (x) = d i=1 xi2 ) Theorem [BMDG’04] The M IN AVG Ball for Bregman divergences is the centroid . F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 8. Two types of Bregman balls First-type: Bc,r = {x ∈ X : DF ( c , x) ≤ r }, Second-type: Bc,r = {x ∈ X : DF (x, c ) ≤ r } Lemma The smallest enclosing Bregman balls Bc∗ ,r ∗ and B c∗ ,r ∗ of S are unique. −→ Consider first-type Bregman balls. (The second-type is obtained as a first-type ball on the dual divergence DF ∗ using the Legendre-Fenchel transformation.) F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 9. Applications of Bregman Balls Circumcenters of the smallest enclosing Bregman balls encode: Euclidean squared distance. The closest point to a set of points. d (qi − pi )2 = ||p||2 + ||q||2 − 2 p, q . DF (p, q) = i=1 Itakura-Saito divergence. The closest (sound) signal to a set of signals (speech recognition). d DF (p, q) = i=1 pi pi ( − log − 1), [← F (x) = − qi qi d log xi ] i=1 Kullback-Leibler. The closest distribution to a set of distributions (density estimation). d DF (p, q) = i=1 pi pi log − pi + qi , [F (x) = − qi F. Nielsen and R. Nock d xi log xi ] i=1 On the Smallest Enclosing Information Disk
  • 10. Information Disks Problem Given a set S = {s1 , ..., sn } of n 2D vector points, compute the M IN M AX center: c∗ = argminc maxi d(c, si ). handle geometric points for various distortions, handle parametric distributions (e.g., Normal distributions are parameterized by (µ, σ)). F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 11. Information Disk is LP-type Monotonicity. For any F and G such that F ⊆ G ⊆ X , r ∗ (F) ≤ r ∗ (G). Locality. For any F and G such that F ⊆ G ⊆ X with r ∗ (F) = r ∗ (G), and any point p ∈ X , r ∗ (G) < r ∗ (G ∪ {p}) → r ∗ (F) < r ∗ (F ∪ {p}). M INI I NFO B ALL(S = {p1 , ..., pn }, B): Initially B = ∅. Returns B ∗ = (c∗ , r ∗ ) IF |S ∪ B| ≤ 3 RETURN B=S OLVE I NFO B ASIS(S ∪ B) ELSE Select at random p ∈ S B ∗ =M INI I NFO B ALL(S{p}, B) IF p ∈ B ∗ Then add p to the basis M INI I NFO B ALL(S{p}, B ∪ {p}) F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 12. Computing basis (S OLVE I NFO B ASIS) Lemma The first-type Bregman bisector Bisector(p, q) = {c ∈ X | DF (c, p) = DF (c, q)} is linear. This is a linear equation in c (an hyperplane). Bisector Bisector(p, q) = {x | x, dpq + kpq = 0} with dpq = F (p) − F (q) a vector, and kpq = F (p) − F (q) + q, F (q) − p, F (p) a constant (Itakura-Saito divergence) F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 13. Computing basis (S OLVE I NFO B ASIS) Basis 3 : The circumcenter is the trisector. (intersection of 3 linear bisectors, enough to consider any two of them). c∗ = l12 × l13 = l12 × l23 = l13 × l23 , lij : projective point associated to the linear bisector Bisector(pi , pj ) (×: cross-product) F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 14. Computing basis (S OLVE I NFO B ASIS) Basis 2 : Either minimize DF (c, p) s.t. c∗ ∈ Bisector(p, q), or better perform a logarithmic search on λ ∈ [0, 1] s. t. rλ = F −1 ((1 − λ) F (p) + λ F (q)) is on the geodesic of pq ( F −1 : reciprocal gradient). F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 15. Live Demo http://www.csl.sony.co.jp/person/nielsen/ BregmanBall/MINIBALL/ F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 16. Statistical application example Univariate Normal law distribution: 2 1 N(x|µ, σ) = √ exp(− (x−µ) ). 2σ 2 σ 2π Consider the Kullback-Leibler divergence of two distributions: f (x) KL(f , g) = x f (x) log g(x) . Canonical form of an exponential family: N(x|µ, σ) = √ 1 exp{ θ, f(x) } with: 2πZ (θ) 2 µ Z (θ) = σ exp{ 2σ2 } = θ2 1 2 − 2θ1 exp{− 4θ1 }, f(x) = [x 2 x]T : sufficient statistics, 1 θ = [− 2σ2 µ T ] : σ2 natural parameters. F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 17. Kullback-Leibler of parametric exponential family is a Bregman divergence for F = log Z . Z (θ ) KL(θp ||θq ) = DF (θp , θq ) = (θp − θq ), θp [f] + log Z (θq ) p θp [f] = x2 x Z (θp ) x x Z (θp ) exp{ θp , f(x) } exp{ θp , f(x) } = 2 µ2 + σp p µp Z (θ ) p Bisector (θp − θq ), θc [f] + log Z (θq ) = 0. 1D Gaussian distribution: change variables (µ, σ) → (µ2 + σ 2 , µ) = (x, y) (with x > y > 0). y2 It comes Z (x, y ) = x − y 2 exp{ 2(x−y 2 ) }, log Z (x, y ) = log F (x, y) y2 and 2(x−y 2 ) y2 y3 , ). 2(x−y 2 ) (x−y 2 )2 x − y2 + 1 = ( 2(x−y 2 ) − F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 18. Statistical application example (cont’d) M IN M AX: (µ∗ , σ ∗ ) r∗ (2.67446, 1.08313) and 0.801357, M IN AVG: (µ∗ , σ ∗ ) = (2.40909, 1.10782). Note that KL(Ni , Nj ) = 1 2 σi2 σj2 σ + 2 log σji − 1 + F. Nielsen and R. Nock (µj −µi )2 σj2 . On the Smallest Enclosing Information Disk
  • 19. Java Applet online: www.csl.sony.co.jp/person/nielsen/BregmanBall/ MINIBALL/ Source code: Basic MiniBall, Line intersection by projective geometry Visual Computing: Geometry, Graphics, and Vision, ISBN 1-58450-427-7, 2005. ˘ In high dimensions, extend Badoiu & Clarkson core-set See On approximating the smallest enclosing Bregman Balls (SoCG’06 video) F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 20. 3D Bregman balls (video) Relative entropy (KL) F. Nielsen and R. Nock Itakura-Saito On the Smallest Enclosing Information Disk
  • 21. Bregman Voronoi/Delaunay EXP F. Nielsen and R. Nock On the Smallest Enclosing Information Disk
  • 22. Bibliography Welzl, ”Smallest Enclosing Disks (Balls and Ellipsoids)”, LNCS 555:359-370, 1991. Crammer & Singer, ”Learning Algorithms for Enclosing Points in Bregmanian Spheres”, COLT03. Nock & Nielsen, ”Fitting the smallest Bregman ball”, ECML05 (SoCG06 video). Banerjee et. al, ”Clustering with Bregman divergences” , JMLR05. F. Nielsen and R. Nock On the Smallest Enclosing Information Disk