This document provides a unifying review of linear Gaussian models, covering static and dynamic models with both continuous and discrete hidden states. The basic models involve hidden states that evolve linearly over time with Gaussian noise, and observations of those states that are likewise corrupted by Gaussian noise. Common techniques such as Kalman filtering, expectation-maximization, and principal component analysis arise as special cases of inference and learning in these basic models. The review highlights relationships between the models and opportunities for future extensions to other distributions or mixture-state formulations.
3. The Basic Model
Basic model taxonomy:

                     Continuous state                    Discrete state
  A = 0 (static)     Factor analysis (R diagonal)        Mixture of Gaussians
                     SPCA (R = εI)                       VQ (R = lim_{ε→0} εI)
                     PCA (R = lim_{ε→0} εI)
  A ≠ 0 (dynamic)    Kalman filter                       HMM
x_{t+1} = A x_t + w_t,   w_t ~ N(0, Q)   (state evolution noise)
y_t     = C x_t + v_t,   v_t ~ N(0, R)   (observation noise)
The hidden state sequence x (dimension k) serves as a compact explanation of the complicated observation sequence y (dimension p), with k << p.
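The generative process above can be sketched in a few lines of NumPy (the dimensions k = 2, p = 5 and the particular A, C, Q, R values are illustrative assumptions, not values from the review):

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: k-dimensional hidden state, p-dimensional observations, k << p.
k, p, T = 2, 5, 100

A = 0.9 * np.eye(k)              # state transition matrix
C = rng.standard_normal((p, k))  # observation (generative) matrix
Q = np.eye(k)                    # state evolution noise covariance
R = 0.1 * np.eye(p)              # observation noise covariance

x = np.zeros((T, k))
y = np.zeros((T, p))
x[0] = rng.multivariate_normal(np.zeros(k), Q)
y[0] = C @ x[0] + rng.multivariate_normal(np.zeros(p), R)
for t in range(1, T):
    x[t] = A @ x[t - 1] + rng.multivariate_normal(np.zeros(k), Q)  # x_{t+1} = A x_t + w_t
    y[t] = C @ x[t] + rng.multivariate_normal(np.zeros(p), R)      # y_t = C x_t + v_t
```

Setting A = 0 recovers the static models in the table; A ≠ 0 gives the dynamic ones.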
4. The Basic Model: Inference
Probability computation (why Gaussian?)
Sums and linear transformations of Gaussian variables remain Gaussian, so every distribution needed for inference stays Gaussian and is fully described by its mean and covariance.
Filtering and smoothing (given initial parameters):
Total likelihood: P(y_1, …, y_τ)
Filtering: P(x_t | y_1, …, y_t)
Smoothing: P(x_t | y_1, …, y_τ), where τ > t
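For the continuous dynamic case, one step of the filtering recursion is the standard Kalman predict/update pair; a minimal sketch (variable names follow common convention, not the slides' notation):

```python
import numpy as np

def kalman_step(mu, V, y, A, C, Q, R):
    """One filtering step: p(x_t | y_1..y_t) from p(x_{t-1} | y_1..y_{t-1})."""
    # Predict: propagate the Gaussian belief through the linear dynamics.
    mu_pred = A @ mu
    V_pred = A @ V @ A.T + Q
    # Update: condition on the new observation y_t; everything stays Gaussian.
    S = C @ V_pred @ C.T + R                 # innovation covariance
    K = V_pred @ C.T @ np.linalg.inv(S)      # Kalman gain
    mu_new = mu_pred + K @ (y - C @ mu_pred)
    V_new = V_pred - K @ C @ V_pred
    return mu_new, V_new
```

Smoothing adds a backward recursion over the same Gaussian quantities.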
5. The Basic Model: Learning
Maximize the likelihood of the observed data
Use the solutions to filtering and smoothing
Expectation-Maximization (EM) algorithm:
Y – observed data
X – hidden data
θ – parameters of the model
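As a concrete instance of the E-step/M-step loop, here is EM for a toy two-component Gaussian mixture (the discrete-state static case from the taxonomy); the data, initialization, and iteration count are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(1)
# Toy data: 1-D observations Y from two Gaussian clusters; hidden X = cluster labels.
y = np.concatenate([rng.normal(-2, 1, 200), rng.normal(3, 1, 200)])

# Parameters theta = (means, shared variance, mixing weights); crude initialization.
mu, var, pi = np.array([-1.0, 1.0]), 1.0, np.array([0.5, 0.5])

for _ in range(50):
    # E-step: use the inference solution -- posterior responsibilities p(x | y, theta).
    lik = pi * np.exp(-(y[:, None] - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
    resp = lik / lik.sum(axis=1, keepdims=True)
    # M-step: update theta by maximizing the expected complete-data log likelihood.
    Nk = resp.sum(axis=0)
    mu = (resp * y[:, None]).sum(axis=0) / Nk
    var = (resp * (y[:, None] - mu) ** 2).sum() / len(y)
    pi = Nk / len(y)
```

The same two-step structure carries over to every model in the taxonomy; only the inference solution used in the E-step changes.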
6. Continuous-state Static Modeling
Assume Q = I, without loss of generality
Inference: X is continuous, so computing posteriors requires integrating over x (rather than summing)
Learning: identifying the matrices C and R
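With Q = I the integral works out in closed form: the posterior over x given y is Gaussian, obtained by simple matrix algebra. A sketch (the C and R below are randomly generated placeholders):

```python
import numpy as np

rng = np.random.default_rng(2)
k, p = 2, 5
C = rng.standard_normal((p, k))
R = np.diag(rng.uniform(0.1, 0.5, p))   # diagonal observation noise

# Posterior p(x | y) is Gaussian with mean beta @ y and covariance I - beta @ C,
# where beta = C^T (C C^T + R)^{-1}  (using Q = I).
beta = C.T @ np.linalg.inv(C @ C.T + R)
post_cov = np.eye(k) - beta @ C

y = rng.standard_normal(p)
x_mean = beta @ y   # E[x | y]
```

Note that the posterior covariance does not depend on y, a hallmark of the linear Gaussian setup.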
7. Continuous-state Static Modeling: Factor Analysis
Conditions:
R – diagonal (this restriction makes the solution identifiable)
Q – identity matrix
C – factor loading matrix
Process:
Marginal distribution: y ~ N(0, C Cᵀ + R)
EM provides a unified approach for learning
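An EM loop for factor analysis, using only the sample covariance as the sufficient statistic, might be sketched as follows (dimensions, synthetic data, and iteration count are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(3)
k, p, N = 2, 6, 500

# Synthetic data from a true factor model (illustrative values).
C_true = rng.standard_normal((p, k))
Y = rng.standard_normal((N, k)) @ C_true.T + 0.3 * rng.standard_normal((N, p))
S = Y.T @ Y / N                      # sample covariance (data assumed zero-mean)

C = rng.standard_normal((p, k))      # factor loading matrix, random initialization
R = np.eye(p)                        # diagonal observation noise covariance

for _ in range(100):
    # E-step: posterior moments of the hidden factors, via beta = C^T (C C^T + R)^{-1}.
    beta = C.T @ np.linalg.inv(C @ C.T + R)
    Exx = np.eye(k) - beta @ C + beta @ S @ beta.T   # average E[x x^T | y]
    # M-step: re-estimate the loadings and the diagonal noise covariance.
    C = S @ beta.T @ np.linalg.inv(Exx)
    R = np.diag(np.diag(S - C @ beta @ S))
```

Shrinking R toward εI in this loop recovers SPCA and, in the limit, PCA, per the taxonomy table.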
12. Discrete-state Dynamic Modeling: Hidden Markov Models
State transition matrix T (assume Q = I without loss of generality; T can be modeled equivalently by A and Q)
Emission probabilities: all states share the same covariance
Initial distribution ⇐ mean μ₁
A ≠ 0 (dynamic model)
Inference: forward-backward algorithm
Learning: Baum-Welch algorithm
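The forward half of the forward-backward algorithm can be sketched for a toy two-state HMM with shared-covariance Gaussian emissions (all parameter values are made up for illustration; a practical implementation would rescale alpha at each step to avoid underflow on long sequences):

```python
import numpy as np

# Illustrative 2-state HMM with 1-D Gaussian emissions (shared unit variance).
T_mat = np.array([[0.9, 0.1],
                  [0.2, 0.8]])       # state transition matrix T
means = np.array([-1.0, 2.0])        # per-state emission means
init = np.array([0.5, 0.5])          # initial state distribution

def gauss(y, mu):
    """N(y; mu, 1) density, evaluated for each state's mean."""
    return np.exp(-0.5 * (y - mu) ** 2) / np.sqrt(2 * np.pi)

def forward(ys):
    """alpha[i] = p(y_1..y_t, x_t = i); summing over i gives the likelihood."""
    alpha = init * gauss(ys[0], means)
    for y in ys[1:]:
        alpha = (alpha @ T_mat) * gauss(y, means)
    return alpha.sum()   # total likelihood p(y_1..y_T)
```

The backward pass has the same structure run in reverse; Baum-Welch combines the two to get the posteriors needed for the M-step.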
13. Network Interpretations and Regularization
[Figure: three-layer network — input layer (observations y), hidden layer (states x), output layer (reconstructed y); recognition weights map input to hidden, generative weights map hidden back to output.]
Factor Analysis:
Recognition weights are tied to the generative weights; learning minimizes a cost function.
Mixtures of Gaussians:
Linear recognition followed by a softmax nonlinearity; learning minimizes a cost function.
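To illustrate the mixture-of-Gaussians case: with a shared emission covariance, the exact posterior over components is a linear map of y followed by a softmax. A sketch with made-up 1-D parameters:

```python
import numpy as np

# Illustrative 1-D mixture: two components, shared unit variance.
mu = np.array([-2.0, 3.0])
pi_ = np.array([0.4, 0.6])

# With shared covariance, log p(j | y) is linear in y up to a shared constant,
# so a linear layer plus softmax computes the exact posterior.
W = mu                        # weights: R^{-1} mu_j (here R = 1)
b = np.log(pi_) - 0.5 * mu ** 2

def recognize(y):
    logits = W * y + b
    e = np.exp(logits - logits.max())
    return e / e.sum()        # softmax -> responsibilities p(x = j | y)
```

The quadratic term in y cancels across components, which is why the recognition network needs only a linear layer before the softmax.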
14. Summary & Reflection
Pros & Cons
Advantages:
Simplifies computation
Highlights relationships between models
Suggests natural extensions
Disadvantages:
The unified treatment is not the most efficient approach for any single model
Future Work
• Explore other distributions
• Develop mixture-state models
• Spatially adaptive observation noise for continuous-state models
Inference
• Input: observed data (measurements)
• Output: posterior distribution over the unobserved (hidden) states
Learning
• Initialize parameters
• E-step: use the inference solutions
• M-step: update the parameters
• Output: updated parameters
15. References
Roweis, S. and Ghahramani, Z. (1999). A Unifying Review of Linear Gaussian Models. Neural Computation, 11(2), pp. 305–345.
En.wikipedia.org (2017). Independent component analysis. Available at: https://en.wikipedia.org/wiki/Independent_component_analysis [Accessed 18 Jan. 2017].
En.wikipedia.org (2017). Kalman filter. Available at: https://en.wikipedia.org/wiki/Kalman_filter [Accessed 18 Jan. 2017].
En.wikipedia.org (2017). Kalman filter (figure: basic concept of Kalman filtering). Available at: https://en.wikipedia.org/wiki/Kalman_filter#/media/File:Basic_concept_of_Kalman_filtering.svg [Accessed 18 Jan. 2017].