This document summarizes the key ideas of auto-encoding variational Bayes. It discusses representation learning using latent variables to model high-dimensional sparse data on low-dimensional manifolds. It then explains generative modeling and the challenge of directly estimating complex data generating distributions. Finally, it introduces variational autoencoders as a way to approximate intractable posterior distributions over latent variables using variational inference and maximize a tractable evidence lower bound objective using the reparameterization trick, allowing end-to-end training of the encoder and decoder networks.
Slide 4: Representation Learning
Representation & manifold hypothesis
X: location of a car (as in satellite navigation)
• 3-D representation: X = (latitude, longitude, altitude)
• 1-D representation: X = (distance from the datum along the road)
Slide 6: Representation Learning
Latent variable: a hidden variable that is not measured directly, but has a significant impact on the variation of the data points.
Manifold learning: learning the non-linear subspace, dense with data points, that is built by hidden factors of variation (a lower-dimensional, dense space).
Slide 7: Representation Learning
Manifold hypothesis in supervised learning
Goal: learn a function that maps input x to output y.
The behavior of the intermediate layers (a sketch of this visualization follows below):
• All features are projected down to two dimensions (for visualization).
• The classes become increasingly linearly separable.
• Layers sequentially “straighten” the data manifold.
https://deeplearning.cs.cmu.edu/F20/document/slides/lec17.representations.pdf
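To make the visualization concrete, here is a minimal sketch; the model, the random stand-in data, and the use of PCA for the 2-D projection are illustrative assumptions, not the CMU lecture's setup. It collects the activations after each hidden layer and projects them to two dimensions.

```python
# Minimal sketch: project each intermediate layer's features down to 2-D.
# Model, data, and PCA projection are illustrative assumptions.
import torch
import torch.nn as nn
from sklearn.decomposition import PCA

model = nn.Sequential(
    nn.Linear(784, 256), nn.ReLU(),
    nn.Linear(256, 64), nn.ReLU(),
    nn.Linear(64, 10),
)

x = torch.randn(512, 784)      # stand-in for a batch of inputs
features, h = [], x
for layer in model:
    h = layer(h)
    if isinstance(layer, nn.ReLU):
        features.append(h.detach())  # activations after each hidden layer

for i, f in enumerate(features):
    f2d = PCA(n_components=2).fit_transform(f.numpy())  # 2-D view of the manifold
    print(f"hidden layer {i}: projected to shape {f2d.shape}")
```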
Slide 8: Representation Learning
Manifold hypothesis in unsupervised learning
Goal: learn some underlying hidden structure of the data (latent variables can be used for this; a linear-manifold sketch follows below).
(Figure: a linear manifold.)
http://cs231n.stanford.edu/slides/2021/lecture_12.pdf
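As a concrete, minimal example of a linear manifold (the synthetic data and PCA-via-SVD are illustrative assumptions): a single hidden factor of variation embedded linearly in three dimensions can be recovered from the dominant singular direction.

```python
# Minimal sketch: PCA (via SVD) recovers a linear manifold, i.e. the
# low-dimensional linear subspace on which the data concentrates.
# The synthetic data below is an illustrative assumption.
import numpy as np

rng = np.random.default_rng(0)
z = rng.normal(size=(1000, 1))                      # 1-D hidden factor of variation
basis = np.array([[2.0, -1.0, 0.5]])                # embed it linearly in 3-D
x = z @ basis + 0.05 * rng.normal(size=(1000, 3))   # noisy 3-D observations

x_centered = x - x.mean(axis=0)
_, s, vt = np.linalg.svd(x_centered, full_matrices=False)
print("singular values:", np.round(s, 1))           # one dominant value => ~1-D manifold
z_hat = x_centered @ vt[0]                          # recovered 1-D representation
```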
Slide 10: Generative Modeling
Discriminative model
• Training: conditional probability estimation, P_θ(y|x), or learning a direct map, y = f_θ(x)
• Use in prediction: f : x → y

Generative model
• Training: density estimation, P_θ(x), or P_θ(x, y) or P_θ(x|y)
• Use in data generation: g : seed → x, or g : (seed, y) → x
Slide 11: Generative Modeling
Data generating distribution P_data(x): the process by which natural images occur, according to a probability distribution.
Generative model: we want to learn P_model(x; θ) that is similar to P_data(x).
(Machine Learning, Ilseok Oh, lecture slide)
* image from Fei-Fei Li, Justin Johnson, Serena Yeung, cs231n Stanford
Slide 13: Generative Modeling
We want to learn P_model(x; θ) similar to P_data(x).
Can we estimate it directly via argmax_θ P_model(x; θ)? This is very challenging:
• Intractable
• Requires strong constraints
Latent variable (generative) model: learn a mapping from some latent variable z to a complicated distribution on x (a minimal decoder sketch follows below).
* slide from Aaron Courville, IFT6266 Hiver 2017
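A minimal sketch of such a mapping; the architecture, dimensions, and Bernoulli likelihood are illustrative assumptions, not the paper's exact model. A small network turns a simple prior over z into a complicated distribution over x.

```python
# Minimal sketch: a decoder maps a latent z to the parameters of P(x|z).
# Architecture, dimensions, and the Bernoulli likelihood are illustrative assumptions.
import torch
import torch.nn as nn

class Decoder(nn.Module):
    def __init__(self, z_dim=2, x_dim=784, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(z_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, x_dim), nn.Sigmoid(),  # pixel-wise Bernoulli means
        )

    def forward(self, z):
        return self.net(z)  # parameters of P(x|z)

# Ancestral sampling: z ~ P(z) = N(0, I), then x ~ P(x|z).
decoder = Decoder()
z = torch.randn(16, 2)       # sample latent codes from a simple prior
x_mean = decoder(z)          # complicated distribution on x induced by the prior
x = torch.bernoulli(x_mean)  # draw binary samples
```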
Slide 15: Variational Auto-encoder
Story so far
• The data we observe in the real world is very high-dimensional and sparse.
• A low-dimensional, high-density nonlinear manifold exists in the space where the observed data are defined.
• There is a latent variable describing the manifold, and it is very closely related to the variation of the observed data x.
• We want a model that generates data similar to the observed data x.
• To do that, we need to estimate the distribution of the data, P(x).
• However, direct estimation of P(x) is challenging.
• Instead, let's model a conditional distribution P(x|z) using the latent variable z (the marginal this induces is written out below).
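Written out (a standard identity, not stated explicitly on the slide), the latent-variable model defines P(x) as a mixture over z; this integral is exactly the quantity that is hard to estimate directly:

```latex
p_\theta(\mathbf{x}) = \int p_\theta(\mathbf{x} \mid \mathbf{z})\, p(\mathbf{z})\, d\mathbf{z}
```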
Slide 16: Variational Auto-encoder
Problem
• Where does z come from?
• How can z be defined and obtained?
Since z is literally a latent variable, it is very difficult to define manually and impossible to measure directly.
* image from cs236, Stanford 2019f - Deep Generative Models, lecture 5
Slide 17: Variational Auto-encoder
Overview of the data generating process: distributional assumptions
• x_i ~ P(x|z): assume a familiar distribution.
• z_i ~ P(z): assume a familiar distribution. This sample can be used directly, but the performance is not good.
• z_i ~ P(z|x): learn the distribution of the latent variable z that is well explained by x, and sample z from that distribution.
Still, there is a problem: P(z|x) = P(x|z)P(z)/P(x) is intractable (see below).
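Spelling out why (combining Bayes' rule from this slide with the marginal from slide 15): the posterior's denominator is an integral over all of z, which cannot be computed for an expressive decoder.

```latex
p_\theta(\mathbf{z} \mid \mathbf{x})
  = \frac{p_\theta(\mathbf{x} \mid \mathbf{z})\, p(\mathbf{z})}{p_\theta(\mathbf{x})},
\qquad
p_\theta(\mathbf{x}) = \int p_\theta(\mathbf{x} \mid \mathbf{z})\, p(\mathbf{z})\, d\mathbf{z}
  \;\;\text{(intractable)}
```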
Slide 18: Variational Auto-encoder
Variational inference
p_θ(z|x) ≈ q_φ(z|x)
(intractable ≈ tractable, familiar)
A general family of methods for approximating complicated densities with a simpler class of densities.
* slide from Shakir Mohamed (Google DeepMind), Imperial College London, 2015
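This approximation yields the evidence lower bound (ELBO) mentioned in the summary above; the derivation below is the standard one, not shown on this slide. Since the KL term to the true posterior is non-negative, maximizing the ELBO both tightens the bound on log p_θ(x) and pulls q_φ(z|x) toward p_θ(z|x).

```latex
\log p_\theta(\mathbf{x})
  = \underbrace{\mathbb{E}_{q_\phi(\mathbf{z}\mid\mathbf{x})}\!\big[\log p_\theta(\mathbf{x}\mid\mathbf{z})\big]
    - D_{\mathrm{KL}}\!\big(q_\phi(\mathbf{z}\mid\mathbf{x})\,\|\,p(\mathbf{z})\big)}_{\text{ELBO (tractable)}}
  + \underbrace{D_{\mathrm{KL}}\!\big(q_\phi(\mathbf{z}\mid\mathbf{x})\,\|\,p_\theta(\mathbf{z}\mid\mathbf{x})\big)}_{\ge\, 0}
```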
Slide 24: Variational Auto-encoder
End-to-end learning (a minimal training sketch follows below)
Pros
• Interpretable latent space
• Allows inference of q(z|x), which can be a useful feature representation for other tasks
Cons
• Only approximately optimal (it maximizes a lower bound)
• Samples are blurrier
* slide from Aaron Courville, IFT6266 Hiver 2017
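Putting the pieces together, here is a minimal end-to-end training sketch; the architecture, dimensions, and hyperparameters are illustrative assumptions, not the paper's exact setup. The encoder parameterizes q_φ(z|x), the reparameterization trick z = μ + σ·ε makes the sample differentiable, and the loss is the negative ELBO.

```python
# Minimal end-to-end VAE sketch in PyTorch. Architecture, dimensions, and
# hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    def __init__(self, x_dim=784, z_dim=2, hidden=256):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(x_dim, hidden), nn.ReLU())
        self.mu = nn.Linear(hidden, z_dim)       # mean of q_phi(z|x)
        self.logvar = nn.Linear(hidden, z_dim)   # log-variance of q_phi(z|x)
        self.dec = nn.Sequential(
            nn.Linear(z_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, x_dim),            # Bernoulli logits for p_theta(x|z)
        )

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        eps = torch.randn_like(mu)               # reparameterization trick:
        z = mu + eps * torch.exp(0.5 * logvar)   # z = mu + sigma * eps, eps ~ N(0, I)
        return self.dec(z), mu, logvar

def neg_elbo(x, logits, mu, logvar):
    # negative ELBO = reconstruction term + KL(q_phi(z|x) || N(0, I))
    recon = F.binary_cross_entropy_with_logits(logits, x, reduction="sum")
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl

model = VAE()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.rand(128, 784)  # stand-in batch; real data would be e.g. binarized MNIST
for step in range(3):
    logits, mu, logvar = model(x)
    loss = neg_elbo(x, logits, mu, logvar)
    opt.zero_grad()
    loss.backward()
    opt.step()
    print(f"step {step}: negative ELBO = {loss.item():.1f}")
```

Sampling z directly from q_φ(z|x) would block gradients to the encoder; rewriting the sample as a deterministic function of (μ, σ) plus independent noise ε is what allows backpropagation to train the encoder and decoder networks end to end.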
Slide 27: Main Reference
Paper
• Auto-Encoding Variational Bayes, Diederik P. Kingma, Max Welling, 2013. [link]
• Tutorial on Variational Autoencoders, Carl Doersch, 2016. [link]
• NIPS 2016 Tutorial: Generative Adversarial Networks, Ian Goodfellow, 2016. [link]
Slide
• cs231n lecture slides, Stanford, 2021s. [link]
• cs236 lecture slides, Stanford, 2019f. [link]
• IFT6266-H2017, University of Montreal. [link]
Book
• Deep Learning, Ian Goodfellow et al., 2016. [e-book]
• Machine Learning, Ilseok Oh, 2018.
Etc.
• Tutorial - What is a variational autoencoder? [link]
• Everything about the autoencoder. [video]
Speaker notes

The position of a car is not randomly distributed in three-dimensional space; it is distributed along a one-dimensional nonlinear space, the road. Most cars are on the road, though occasionally one strays onto the shoulder. No car floats in mid-air; with very low probability one might be caught flying in a typhoon.

Manifold hypothesis: samples are not randomly distributed in the d-dimensional space in which the raw data is represented, but lie in a space of much lower dimension.

Next, let's look at image data concretely. That count is the number of dimensions; since each pixel can actually take values from 0 to 255, the number of representable images is 256^(that number of dimensions).

Latent variable: a variable latent inside the data, rather than a variable or feature defined artificially by a human. Manifold learning can be seen as a representative example of representation learning. (The slide image is actually an animated GIF.)

Dimensionality reduction, clustering, density estimation, and so on. (Density estimation may seem unrelated to manifolds, but it is often performed using some useful latent variable, and that useful latent variable forms the basis of the manifold.)

The x → y relationship of an ordinary discriminative model corresponds here to z → x in the latent-variable generative model. For the distribution x|z, z is given and the prediction target is real-world data, so it can be assumed to be an easy-to-handle distribution. The marginal distribution of z, P(z), can be assumed to be arbitrarily simple, and z can be sampled from p(z) and passed to the decoder; however, we would like to sample a z more closely related to x and pass that to the generative model. For the distribution z|x, x is given and the prediction target is a latent variable that cannot be observed in the real world, so it must be obtained as p(x|z)·p(z)/p(x), but the distribution p(x) cannot be computed.