SlideShare a Scribd company logo
1 of 23
Download to read offline
Simplifying Gaussian Mixture Models
Via Entropic Quantization
Frank Nielsen1 2 , Vincent Garcia1 , and Richard Nock3
1 Ecole Polytechnique (Paris, France)
Sony Computer Science Laboratories (Tokyo, Japan)
Universit´ des Antilles et de la Guyane (Guadeloupe, France)
e
2

3

28th august 2009

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

1 / 23
Introduction

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

2 / 23
Introduction

Mixture models

Mixture models

Mixture model is a powerful framework to estimate PDF
Mixture model f

n

f (x) =

αi fi (x)
i=1

where αi ≥ 0 denotes a weight with

n
i=1 αi

=1

If f is a Gaussian mixture model (GMM),
(x − µi )T Σ−1 (x − µi )
1
i
fi (x) =
exp −
2
(2π)d/2 |Σi |1/2
with µi mean and Σi covariance matrix

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

3 / 23
Introduction

Problem

Problem
2.5

2

1.5

1

0.5

0
−0.5

0

0.5

1

1.5

Density estimation using kernel-based Parzen estimator
Mixture models usually contain a lot of components
Estimation of statistical measures is computationally expensive
Need to reduce the number of components
Re-lear a simpler mixture model from dataset
Simplify the mixture model f
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

4 / 23
Introduction

Mixture model simplification

Mixture model simplification

Given a mixture model f of n components
n

f (x) =

αi fi (x)
i=1

Compute a mixture model g of m components
m

αj gj (x)

g (x) =
j=1

such as g is the best approximation of f

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

5 / 23
Mixture model simplification

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

6 / 23
Mixture model simplification

KLD and Bregman divergence

Relative entropy and Bregman divergence
The fundamental measure between statistical distributions is the
relative entropy, also called the Kullback-Leibler divergence
Given fi and fj two distributions, the KLD is given by
KLD(fi ||fj ) =

fi (x) log

fi (x)
dx
fj (x)

In the case of normal distriubtions
det Σj
1
1
KLD(fi ||fj ) = log
+ tr Σ−1 Σi
j
2
det Σi
2
1
d
+ (µj − µi )T Σ−1 (µj − µi ) −
j
2
2

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

7 / 23
Mixture model simplification

KLD and Bregman divergence

Relative entropy and Bregman divergence
Nomral distributions belong to the class of exponential families
Canonical form of exponential families
f (x) = exp

˜
˜
Θ, t(x) − F (Θ) + C (x)

Estimation of the KLD by computing the Bregman divergence defined
for the log normalizer F
˜ ˜
KLD(fi ||fj ) = DF (Θj ||Θi )
where
˜ ˜
˜
˜
˜
˜
˜
DF (Θj ||Θi ) = F (Θj ) − F (Θi ) − Θj − Θi , F (Θi )

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

8 / 23
Mixture model simplification

KLD and Bregman divergence

Relative entropy and Bregman divergence
For multivariate normal distributions
Sufficient statistics

1
t(x) = (x, − xx T )
2

Natural parameters
1
˜
Θ = (θ, Θ) = (Σ−1 µ, Σ−1 )
2
Log normalizer
1
d
1
˜
F (Θ) = tr(Θ−1 θθT ) − log det Θ + log π
4
2
2
˜
F (Θ) =

V. Garcia (X, Paris, France)

1 −1
1
1
Θ θ , − Θ−1 − (Θ−1 θ)(Θ−1 θ)T
2
2
4
Simplifying GMMs

28th august 2009

9 / 23
Mixture model simplification

Sided BKMC

Bregman k-means clustering
K-means clustering
Set of points
Initialize k centroids = k classes
Repetition until convergence
Repartition step (distance)
Computation of centroids (centers of mass)

Bregman K-means clustering
Set of distributions
Initialize k centroids (αi , gi ) = GMM with k components
Repetition until convergence
Repartition step (sided Bregman divergence)
Computation of centroids (sided centroids)
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

10 / 23
Mixture model simplification

Sided BKMC

Sided centroids

5 multivariate Gaussians
Right-centroid
Left-centroid
http://www.sonycsl.co.jp/person/nielsen/BNCj/
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

11 / 23
Mixture model simplification

Sided BKMC

Right-sided BKMC algorithm
1: Initialize the GMM g
2: repeat
3:
Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if

˜ ˜
˜ ˜
DF (Θi Θj ) < DF (Θi Θl ), ∀l ∈ [1, m]  {j}
4:

Compute the centroids: the weight and the natural parameters of the j-th
centroid (i.e. Gaussian gj ) are given by:
αj =

αi ,
i

The sum

i

θj =

i

αi θi
,
i αi

Θj =

i

αi Θi
i αi

is performed on i ∈ [1, m] such as fi ∈ Cj

5: until the cluster does not change between two iterations

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

12 / 23
Mixture model simplification

Sided BKMC

Left-sided BKMC algorithm
1: Initialize the GMM g
2: repeat
3:
Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if

˜ ˜
˜ ˜
DF (Θj Θi ) < DF (Θl Θi ), ∀l ∈ [1, m]  {j}
4:

Compute the centroids: the weight and the natural parameters of the j-th
centroid (i.e. Gaussian gj ) are given by:
αj =

αi ,

˜
Θj =

F −1

i

i

where
˜
F −1 (Θ) =

− Θ + θθT

−1

θ, −

αi
αj

˜
F Θi

1
Θ + θθT
2

−1

5: until the cluster does not change between two iterations
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

13 / 23
Mixture model simplification

Symmetric BKMC

Symmetric BKMC algorithm
Symmetric similarity measure can be required (e.g. CBIR)
Repartition step: Symmetric Bregman divergence
˜ ˜
SDF (Θp , Θq ) =

˜ ˜
˜ ˜
DF (Θq ||Θp ) + DF (Θp ||Θq )
2

Computation of symmetric centroid:
Compute right and left centroids (cr and cl )
The symmetric centroid cs belongs to the geodesic link joining cr and cl
cλ =

F −1 (λ F (cr ) + (1 − λ) F (cl ))

The symmetric centroid cs = cλ verifies
SDF (cλ , cr ) = SDF (cλ , cl ).

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

14 / 23
Mixture model simplification

jMEF

jMEF

jMEF : Java library for Mixture of Exponential Families
Create and manage MEF
Simplify MEF using BKMC
Available on line at www.lix.polytechnique.fr/∼nielsen/MEF

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

15 / 23
Experiments

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

16 / 23
Experiments

Quality measure and initialization

Quality measure and initialization
Simplification quality measure
KLD(f g ) (right-sided)
No closed-form expression
Draw 10,000 points to estimate this KLD (Monte-Carlo)
Initial GMM f
Learnt from an image
K-means on RGB pixels ⇒ 32 classes
EM algorithm ⇒ fi
Weights αi : proportion of pixels in each class

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

17 / 23
Experiments

Sided BKMC

Sided BKMC

Evolution of KLD(f g ) as a function of m
The simplification quality increases with m
Left-sided BKMC provides the best results
Right-sided BKMC provides the worst results
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

18 / 23
Experiments

BKMC vs UTAC

BKMC vs UTAC

UTAC algorithm based on sigma points + EM algorithm
BKMC provides better results than UTAC
BKMC is faster than UTAC: 20ms vs 100ms
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

19 / 23
Experiments

BKMC vs UTAC

Clustering-based image segmentation
Image

UTAC

BKMC

KLD=0.23

KLD=0.11

KLD=0.16
V. Garcia (X, Paris, France)

f

KLD=0.13

Simplifying GMMs

28th august 2009

20 / 23
Experiments

BKMC vs UTAC

Clustering-based image segmentation
Image

UTAC

BKMC

KLD=0.69

KLD=0.53

KLD=0.36
V. Garcia (X, Paris, France)

f

KLD=0.18

Simplifying GMMs

28th august 2009

21 / 23
Conclusion

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

22 / 23
Conclusion

Conclusion

GMM simplification algorithm based on k-means and Bregman
divergence
BKMC is faster and provides better results than UTAC algorithm
BKMC extends to mixtures of exponential families
jMEF available on line at www.lix.polytechnique.fr/∼nielsen/MEF
Included features:
Create/manage mixtures of exponential families
BKMC algorithm
Hierarchical GMM (ACCV 2009)

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

23 / 23

More Related Content

What's hot

Path Contraction Faster than 2^n
Path Contraction Faster than 2^nPath Contraction Faster than 2^n
Path Contraction Faster than 2^nAkankshaAgrawal55
 
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CHENNAKESAVA KADAPA
 
Large strain computational solid dynamics: An upwind cell centred Finite Volu...
Large strain computational solid dynamics: An upwind cell centred Finite Volu...Large strain computational solid dynamics: An upwind cell centred Finite Volu...
Large strain computational solid dynamics: An upwind cell centred Finite Volu...Jibran Haider
 
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO Systems
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO SystemsPilot Optimization and Channel Estimation for Multiuser Massive MIMO Systems
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO SystemsT. E. BOGALE
 
An upwind cell centred Finite Volume Method for nearly incompressible explici...
An upwind cell centred Finite Volume Method for nearly incompressible explici...An upwind cell centred Finite Volume Method for nearly incompressible explici...
An upwind cell centred Finite Volume Method for nearly incompressible explici...Jibran Haider
 
Bagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time SeriesBagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time SeriesTiago Mendes Dantas
 
Traffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equationsTraffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equationsGuillaume Costeseque
 
Second order traffic flow models on networks
Second order traffic flow models on networksSecond order traffic flow models on networks
Second order traffic flow models on networksGuillaume Costeseque
 
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...T. E. BOGALE
 
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...BRTCoE
 
Transportation and assignment_problem
Transportation and assignment_problemTransportation and assignment_problem
Transportation and assignment_problemAnkit Katiyar
 
Craig-Bampton Method
Craig-Bampton MethodCraig-Bampton Method
Craig-Bampton Methodsiavoshani
 
New tools from the bandit literature to improve A/B Testing
New tools from the bandit literature to improve A/B TestingNew tools from the bandit literature to improve A/B Testing
New tools from the bandit literature to improve A/B Testingrecsysfr
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtures
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: MixturesCVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtures
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtureszukun
 
New approaches for boosting to uniformity
New approaches for boosting to uniformityNew approaches for boosting to uniformity
New approaches for boosting to uniformityNikita Kazeev
 
A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?Dmitrii Ignatov
 
Second order traffic flow models on networks
Second order traffic flow models on networksSecond order traffic flow models on networks
Second order traffic flow models on networksGuillaume Costeseque
 

What's hot (19)

Path Contraction Faster than 2^n
Path Contraction Faster than 2^nPath Contraction Faster than 2^n
Path Contraction Faster than 2^n
 
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
 
Large strain computational solid dynamics: An upwind cell centred Finite Volu...
Large strain computational solid dynamics: An upwind cell centred Finite Volu...Large strain computational solid dynamics: An upwind cell centred Finite Volu...
Large strain computational solid dynamics: An upwind cell centred Finite Volu...
 
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO Systems
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO SystemsPilot Optimization and Channel Estimation for Multiuser Massive MIMO Systems
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO Systems
 
An upwind cell centred Finite Volume Method for nearly incompressible explici...
An upwind cell centred Finite Volume Method for nearly incompressible explici...An upwind cell centred Finite Volume Method for nearly incompressible explici...
An upwind cell centred Finite Volume Method for nearly incompressible explici...
 
Bagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time SeriesBagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time Series
 
Traffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equationsTraffic flow modeling on road networks using Hamilton-Jacobi equations
Traffic flow modeling on road networks using Hamilton-Jacobi equations
 
Second order traffic flow models on networks
Second order traffic flow models on networksSecond order traffic flow models on networks
Second order traffic flow models on networks
 
Slides
SlidesSlides
Slides
 
Pixel rf
Pixel rfPixel rf
Pixel rf
 
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
 
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...
 
Transportation and assignment_problem
Transportation and assignment_problemTransportation and assignment_problem
Transportation and assignment_problem
 
Craig-Bampton Method
Craig-Bampton MethodCraig-Bampton Method
Craig-Bampton Method
 
New tools from the bandit literature to improve A/B Testing
New tools from the bandit literature to improve A/B TestingNew tools from the bandit literature to improve A/B Testing
New tools from the bandit literature to improve A/B Testing
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtures
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: MixturesCVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtures
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtures
 
New approaches for boosting to uniformity
New approaches for boosting to uniformityNew approaches for boosting to uniformity
New approaches for boosting to uniformity
 
A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?
 
Second order traffic flow models on networks
Second order traffic flow models on networksSecond order traffic flow models on networks
Second order traffic flow models on networks
 

Viewers also liked

теорема безу
теорема безутеорема безу
теорема безуnatali7441
 
eWingslab Marketing Plan 2015 (CHI) M'sia
eWingslab Marketing Plan 2015 (CHI) M'siaeWingslab Marketing Plan 2015 (CHI) M'sia
eWingslab Marketing Plan 2015 (CHI) M'siaUdd Tzei
 
Tan Emci "Kar Kedisi"
Tan Emci "Kar Kedisi"Tan Emci "Kar Kedisi"
Tan Emci "Kar Kedisi"BrandCritique
 
QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...
QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...
QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...Moses Talibita
 
Program residen dan non residen(rev.1)
Program residen dan non residen(rev.1)Program residen dan non residen(rev.1)
Program residen dan non residen(rev.1)Zifalaniasta
 
Article catagery
Article catageryArticle catagery
Article catageryunnatisilks
 
Податкові аспекти ведення бізнесу в Україні (2015)
Податкові аспекти ведення бізнесу в Україні (2015)Податкові аспекти ведення бізнесу в Україні (2015)
Податкові аспекти ведення бізнесу в Україні (2015)BDO Ukraine LLC
 
Land usage power point
Land usage power pointLand usage power point
Land usage power pointKatia Bezzina
 
A2 advanced portfolio evaluation q2
A2 advanced portfolio evaluation q2A2 advanced portfolio evaluation q2
A2 advanced portfolio evaluation q2gerigj
 
La Plaza de Pisa/ Italy
La Plaza de Pisa/ ItalyLa Plaza de Pisa/ Italy
La Plaza de Pisa/ ItalyLavennder M
 
ログイン前セッションフィクセイション攻撃の脅威と対策
ログイン前セッションフィクセイション攻撃の脅威と対策ログイン前セッションフィクセイション攻撃の脅威と対策
ログイン前セッションフィクセイション攻撃の脅威と対策Hiroshi Tokumaru
 
Hyperthyroidism During pregnancy
Hyperthyroidism During pregnancyHyperthyroidism During pregnancy
Hyperthyroidism During pregnancyAboubakr Elnashar
 
ОПРОС: Что важно в электронном курсе?
ОПРОС: Что важно в электронном курсе?ОПРОС: Что важно в электронном курсе?
ОПРОС: Что важно в электронном курсе?Elena Tikhomirova
 
Trabajo en grupo de jona
Trabajo en grupo de jonaTrabajo en grupo de jona
Trabajo en grupo de jonaJonathan Perez
 

Viewers also liked (20)

The blog
The blogThe blog
The blog
 
теорема безу
теорема безутеорема безу
теорема безу
 
Resume
ResumeResume
Resume
 
eWingslab Marketing Plan 2015 (CHI) M'sia
eWingslab Marketing Plan 2015 (CHI) M'siaeWingslab Marketing Plan 2015 (CHI) M'sia
eWingslab Marketing Plan 2015 (CHI) M'sia
 
Tan Emci "Kar Kedisi"
Tan Emci "Kar Kedisi"Tan Emci "Kar Kedisi"
Tan Emci "Kar Kedisi"
 
QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...
QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...
QUESTIONING 2O YEARS OF THE 1995 CONSTITUTION OF THE REPUBLIC OF UGANDA AND A...
 
Program residen dan non residen(rev.1)
Program residen dan non residen(rev.1)Program residen dan non residen(rev.1)
Program residen dan non residen(rev.1)
 
Cert_4089835
Cert_4089835Cert_4089835
Cert_4089835
 
20080701
2008070120080701
20080701
 
Article catagery
Article catageryArticle catagery
Article catagery
 
Ai 5 12
Ai 5 12Ai 5 12
Ai 5 12
 
Податкові аспекти ведення бізнесу в Україні (2015)
Податкові аспекти ведення бізнесу в Україні (2015)Податкові аспекти ведення бізнесу в Україні (2015)
Податкові аспекти ведення бізнесу в Україні (2015)
 
Land usage power point
Land usage power pointLand usage power point
Land usage power point
 
A2 advanced portfolio evaluation q2
A2 advanced portfolio evaluation q2A2 advanced portfolio evaluation q2
A2 advanced portfolio evaluation q2
 
Proyecto herbitex
Proyecto herbitexProyecto herbitex
Proyecto herbitex
 
La Plaza de Pisa/ Italy
La Plaza de Pisa/ ItalyLa Plaza de Pisa/ Italy
La Plaza de Pisa/ Italy
 
ログイン前セッションフィクセイション攻撃の脅威と対策
ログイン前セッションフィクセイション攻撃の脅威と対策ログイン前セッションフィクセイション攻撃の脅威と対策
ログイン前セッションフィクセイション攻撃の脅威と対策
 
Hyperthyroidism During pregnancy
Hyperthyroidism During pregnancyHyperthyroidism During pregnancy
Hyperthyroidism During pregnancy
 
ОПРОС: Что важно в электронном курсе?
ОПРОС: Что важно в электронном курсе?ОПРОС: Что важно в электронном курсе?
ОПРОС: Что важно в электронном курсе?
 
Trabajo en grupo de jona
Trabajo en grupo de jonaTrabajo en grupo de jona
Trabajo en grupo de jona
 

Similar to Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)

Generalized Nonlinear Models in R
Generalized Nonlinear Models in RGeneralized Nonlinear Models in R
Generalized Nonlinear Models in Rhtstatistics
 
Optimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-peripheryOptimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-peripheryFrancesco Tudisco
 
On learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodOn learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodFrank Nielsen
 
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...Varad Meru
 
Inria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCCInria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCCStéphanie Roger
 
SIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithmsSIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithmsJagadeeswaran Rathinavel
 
Number theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-newNumber theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-newChristos Loizos
 
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian ProcessesAccelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian ProcessesMatt Moores
 
Bayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse ProblemsBayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse ProblemsMatt Moores
 
Putting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReducePutting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReduceDmitrii Ignatov
 

Similar to Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009) (20)

QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Generalized Nonlinear Models in R
Generalized Nonlinear Models in RGeneralized Nonlinear Models in R
Generalized Nonlinear Models in R
 
Optimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-peripheryOptimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-periphery
 
On learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodOn learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihood
 
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
 
Beck Workshop on Modelling and Simulation of Coal-fired Power Generation and ...
Beck Workshop on Modelling and Simulation of Coal-fired Power Generation and ...Beck Workshop on Modelling and Simulation of Coal-fired Power Generation and ...
Beck Workshop on Modelling and Simulation of Coal-fired Power Generation and ...
 
Introduction to logistic regression
Introduction to logistic regressionIntroduction to logistic regression
Introduction to logistic regression
 
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
 
Clustering lect
Clustering lectClustering lect
Clustering lect
 
Inria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCCInria Tech Talk - La classification de données complexes avec MASSICCC
Inria Tech Talk - La classification de données complexes avec MASSICCC
 
Subquad multi ff
Subquad multi ffSubquad multi ff
Subquad multi ff
 
SIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithmsSIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithms
 
QMC: Operator Splitting Workshop, Perturbed (accelerated) Proximal-Gradient A...
QMC: Operator Splitting Workshop, Perturbed (accelerated) Proximal-Gradient A...QMC: Operator Splitting Workshop, Perturbed (accelerated) Proximal-Gradient A...
QMC: Operator Splitting Workshop, Perturbed (accelerated) Proximal-Gradient A...
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
 
計算材料学
計算材料学計算材料学
計算材料学
 
Number theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-newNumber theoretic-rsa-chailos-new
Number theoretic-rsa-chailos-new
 
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian ProcessesAccelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
 
Bayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse ProblemsBayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse Problems
 
Putting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReducePutting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReduce
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
 

Recently uploaded

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 

Recently uploaded (20)

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 

Slides: Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)

  • 1. Simplifying Gaussian Mixture Models Via Entropic Quantization Frank Nielsen1 2 , Vincent Garcia1 , and Richard Nock3 1 Ecole Polytechnique (Paris, France) Sony Computer Science Laboratories (Tokyo, Japan) Universit´ des Antilles et de la Guyane (Guadeloupe, France) e 2 3 28th august 2009 V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 1 / 23
  • 2. Introduction Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 2 / 23
  • 3. Introduction Mixture models Mixture models Mixture model is a powerful framework to estimate PDF Mixture model f n f (x) = αi fi (x) i=1 where αi ≥ 0 denotes a weight with n i=1 αi =1 If f is a Gaussian mixture model (GMM), (x − µi )T Σ−1 (x − µi ) 1 i fi (x) = exp − 2 (2π)d/2 |Σi |1/2 with µi mean and Σi covariance matrix V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 3 / 23
  • 4. Introduction Problem Problem 2.5 2 1.5 1 0.5 0 −0.5 0 0.5 1 1.5 Density estimation using kernel-based Parzen estimator Mixture models usually contain a lot of components Estimation of statistical measures is computationally expensive Need to reduce the number of components Re-lear a simpler mixture model from dataset Simplify the mixture model f V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 4 / 23
  • 5. Introduction Mixture model simplification Mixture model simplification Given a mixture model f of n components n f (x) = αi fi (x) i=1 Compute a mixture model g of m components m αj gj (x) g (x) = j=1 such as g is the best approximation of f V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 5 / 23
  • 6. Mixture model simplification Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 6 / 23
  • 7. Mixture model simplification KLD and Bregman divergence Relative entropy and Bregman divergence The fundamental measure between statistical distributions is the relative entropy, also called the Kullback-Leibler divergence Given fi and fj two distributions, the KLD is given by KLD(fi ||fj ) = fi (x) log fi (x) dx fj (x) In the case of normal distriubtions det Σj 1 1 KLD(fi ||fj ) = log + tr Σ−1 Σi j 2 det Σi 2 1 d + (µj − µi )T Σ−1 (µj − µi ) − j 2 2 V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 7 / 23
  • 8. Mixture model simplification KLD and Bregman divergence Relative entropy and Bregman divergence Nomral distributions belong to the class of exponential families Canonical form of exponential families f (x) = exp ˜ ˜ Θ, t(x) − F (Θ) + C (x) Estimation of the KLD by computing the Bregman divergence defined for the log normalizer F ˜ ˜ KLD(fi ||fj ) = DF (Θj ||Θi ) where ˜ ˜ ˜ ˜ ˜ ˜ ˜ DF (Θj ||Θi ) = F (Θj ) − F (Θi ) − Θj − Θi , F (Θi ) V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 8 / 23
  • 9. Mixture model simplification KLD and Bregman divergence Relative entropy and Bregman divergence For multivariate normal distributions Sufficient statistics 1 t(x) = (x, − xx T ) 2 Natural parameters 1 ˜ Θ = (θ, Θ) = (Σ−1 µ, Σ−1 ) 2 Log normalizer 1 d 1 ˜ F (Θ) = tr(Θ−1 θθT ) − log det Θ + log π 4 2 2 ˜ F (Θ) = V. Garcia (X, Paris, France) 1 −1 1 1 Θ θ , − Θ−1 − (Θ−1 θ)(Θ−1 θ)T 2 2 4 Simplifying GMMs 28th august 2009 9 / 23
  • 10. Mixture model simplification Sided BKMC Bregman k-means clustering K-means clustering Set of points Initialize k centroids = k classes Repetition until convergence Repartition step (distance) Computation of centroids (centers of mass) Bregman K-means clustering Set of distributions Initialize k centroids (αi , gi ) = GMM with k components Repetition until convergence Repartition step (sided Bregman divergence) Computation of centroids (sided centroids) V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 10 / 23
  • 11. Mixture model simplification Sided BKMC Sided centroids 5 multivariate Gaussians Right-centroid Left-centroid http://www.sonycsl.co.jp/person/nielsen/BNCj/ V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 11 / 23
  • 12. Mixture model simplification Sided BKMC Right-sided BKMC algorithm 1: Initialize the GMM g 2: repeat 3: Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if ˜ ˜ ˜ ˜ DF (Θi Θj ) < DF (Θi Θl ), ∀l ∈ [1, m] {j} 4: Compute the centroids: the weight and the natural parameters of the j-th centroid (i.e. Gaussian gj ) are given by: αj = αi , i The sum i θj = i αi θi , i αi Θj = i αi Θi i αi is performed on i ∈ [1, m] such as fi ∈ Cj 5: until the cluster does not change between two iterations V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 12 / 23
  • 13. Mixture model simplification Sided BKMC Left-sided BKMC algorithm 1: Initialize the GMM g 2: repeat 3: Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if ˜ ˜ ˜ ˜ DF (Θj Θi ) < DF (Θl Θi ), ∀l ∈ [1, m] {j} 4: Compute the centroids: the weight and the natural parameters of the j-th centroid (i.e. Gaussian gj ) are given by: αj = αi , ˜ Θj = F −1 i i where ˜ F −1 (Θ) = − Θ + θθT −1 θ, − αi αj ˜ F Θi 1 Θ + θθT 2 −1 5: until the cluster does not change between two iterations V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 13 / 23
  • 14. Mixture model simplification Symmetric BKMC Symmetric BKMC algorithm Symmetric similarity measure can be required (e.g. CBIR) Repartition step: Symmetric Bregman divergence ˜ ˜ SDF (Θp , Θq ) = ˜ ˜ ˜ ˜ DF (Θq ||Θp ) + DF (Θp ||Θq ) 2 Computation of symmetric centroid: Compute right and left centroids (cr and cl ) The symmetric centroid cs belongs to the geodesic link joining cr and cl cλ = F −1 (λ F (cr ) + (1 − λ) F (cl )) The symmetric centroid cs = cλ verifies SDF (cλ , cr ) = SDF (cλ , cl ). V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 14 / 23
  • 15. Mixture model simplification jMEF jMEF jMEF : Java library for Mixture of Exponential Families Create and manage MEF Simplify MEF using BKMC Available on line at www.lix.polytechnique.fr/∼nielsen/MEF V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 15 / 23
  • 16. Experiments Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 16 / 23
  • 17. Experiments Quality measure and initialization Quality measure and initialization Simplification quality measure KLD(f g ) (right-sided) No closed-form expression Draw 10,000 points to estimate this KLD (Monte-Carlo) Initial GMM f Learnt from an image K-means on RGB pixels ⇒ 32 classes EM algorithm ⇒ fi Weights αi : proportion of pixels in each class V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 17 / 23
  • 18. Experiments Sided BKMC Sided BKMC Evolution of KLD(f g ) as a function of m The simplification quality increases with m Left-sided BKMC provides the best results Right-sided BKMC provides the worst results V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 18 / 23
  • 19. Experiments BKMC vs UTAC BKMC vs UTAC UTAC algorithm based on sigma points + EM algorithm BKMC provides better results than UTAC BKMC is faster than UTAC: 20ms vs 100ms V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 19 / 23
  • 20. Experiments BKMC vs UTAC Clustering-based image segmentation Image UTAC BKMC KLD=0.23 KLD=0.11 KLD=0.16 V. Garcia (X, Paris, France) f KLD=0.13 Simplifying GMMs 28th august 2009 20 / 23
  • 21. Experiments BKMC vs UTAC Clustering-based image segmentation Image UTAC BKMC KLD=0.69 KLD=0.53 KLD=0.36 V. Garcia (X, Paris, France) f KLD=0.18 Simplifying GMMs 28th august 2009 21 / 23
  • 22. Conclusion Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 22 / 23
  • 23. Conclusion Conclusion GMM simplification algorithm based on k-means and Bregman divergence BKMC is faster and provides better results than UTAC algorithm BKMC extends to mixtures of exponential families jMEF available on line at www.lix.polytechnique.fr/∼nielsen/MEF Included features: Create/manage mixtures of exponential families BKMC algorithm Hierarchical GMM (ACCV 2009) V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 23 / 23