Generating high-resolution, photo-realistic images has been a long-standing goal in machine learning.
A 15-minute seminar explaining the PPGN model.
The paper: https://arxiv.org/abs/1612.00005
Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space
1. Plug & Play Generative Networks:
Conditional Iterative Generation of
Images in Latent Space
Anh Nguyen, Jason Yosinski, Yoshua Bengio, Alexey Dosovitskiy, Jeff Clune
2017
By Safaa Alnabulsi
2. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
Author | Safaa Alnabulsi 2
3. Motivation
Challenges solved in generating Images:
High-quality images at higher resolutions (227 × 227):
Current image generative models often work well at low resolutions (e.g. 32 × 32) but struggle to generate high-resolution images (e.g. 128 × 128 or higher), due to many challenges, including difficulty in training and computationally expensive sampling procedures.
4. Motivation
Challenges solved in generating Images:
High-resolution images
Realistic and diverse samples within a class
5. Motivation
Challenges solved in generating Images:
High-resolution images
Realistic and diverse samples within a class
Works for all 1000 ImageNet categories
6. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
7. What is GAN?
A GAN consists of two models:
- Generative model (G): generates new data instances / models the distribution of individual classes: p(x|c)
- Discriminative model (D): evaluates them for authenticity / learns the boundary between classes
8. How does GAN work?
GANs are formulated as a game between two networks, and it is important (and tricky!) to keep them in balance!
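The slide's figure is not reproduced here, but the game it depicts is the standard GAN minimax objective (Goodfellow et al., 2014), shown for reference:

```latex
\min_G \max_D \;
  \mathbb{E}_{x \sim p_{\text{data}}}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_z}\big[\log\big(1 - D(G(z))\big)\big]
```

D is pushed to label real images 1 and generated images 0, while G is pushed to make D output 1 on its samples.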
[Figure: GAN training loop; the discriminator labels images as Real / Fake]
9. Plug and Play Generative Networks
10. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
11. Probabilistic interpretation of iterative
image generation methods
MALA-approx: an approximate Metropolis-adjusted Langevin algorithm (MALA), a Markov chain
Monte Carlo (MCMC) method, which uses the following transition operator:
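The transition operator itself did not survive extraction; reconstructed here from the paper's notation (ε₁₂ scales the gradient step, ε₃ the noise):

```latex
x_{t+1} = x_t + \epsilon_{12}\, \nabla \log p(x_t) + N(0,\, \epsilon_3^2)
```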
12. Probabilistic framework for Activation
Maximization
• ε₁ (prior): take a step from the current image x_t toward one that looks more like a generic image (an image from any class).
• ε₂ (condition): take a step from the current image x_t toward an image that causes the classifier to output higher confidence in the chosen class.
• ε₃ (noise): add a small amount of noise to jump around the search space and encourage a diversity of images.
Together, prior and condition form the joint model.
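The three steps above correspond term by term to the MALA-approx update applied to the joint model p(x, y) (reconstructed from the paper's notation):

```latex
x_{t+1} = x_t
  + \underbrace{\epsilon_1 \frac{\partial \log p(x_t)}{\partial x_t}}_{\text{prior}}
  + \underbrace{\epsilon_2 \frac{\partial \log p(y = y_c \mid x_t)}{\partial x_t}}_{\text{condition}}
  + \underbrace{N(0,\, \epsilon_3^2)}_{\text{noise}}
```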
14. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
16. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
17. Method | DGN-AM: sampling without a
learned prior
Deep Generator Network-based Activation Maximization
Perform sampling in this lower-dimensional h-space.
h in this case represents features extracted from the first fully connected layer (called fc6) of AlexNet pre-trained on the 1000-class ImageNet classification task.
18. Method | DGN-AM: sampling without a
learned prior
Once the network G is trained, we obtain the DGN-AM update from the MALA-approx equation by dropping two terms: no learned prior (ε₁ = 0) and no noise (ε₃ = 0).
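Under those settings, the update in h-space reduces to the following (a paraphrase of the paper's equation; C is the condition/classifier network):

```latex
h_{t+1} = h_t + \epsilon_2\, \frac{\partial \log C(y = y_c \mid G(h_t))}{\partial h_t}
```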
19. Method | DGN-AM: sampling without a
learned prior
Pros:
• Sampling in the input space h is faster than in the image space x.
• Produces realistic images at high resolution.
• Can also produce interesting new types of images that G never saw during training.
Cons:
• Slow mixing speed
• Same image after many steps
• Lack of diversity
20. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
21. Method | PPGN-x: DAE model of p(x)
What is a DAE (Denoising Autoencoder)?
x + noise → DAE → R(x): the DAE takes a noise-corrupted image x and reconstructs a clean version R(x) of it.
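The reason a DAE is useful here: a DAE trained with Gaussian corruption of variance σ² approximately estimates the score of the data distribution (Alain & Bengio, 2014), which is exactly the prior-gradient term that MALA-approx needs:

```latex
\frac{\partial \log p(x)}{\partial x} \;\approx\; \frac{R_x(x) - x}{\sigma^2}
```

Substituting this into the MALA-approx update (folding σ² into ε₁) yields the PPGN-x sampler:

```latex
x_{t+1} = x_t + \epsilon_1 \big(R_x(x_t) - x_t\big)
  + \epsilon_2\, \frac{\partial \log p(y = y_c \mid x_t)}{\partial x_t}
  + N(0,\, \epsilon_3^2)
```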
23. Method | PPGN-x: DAE model of p(x)
Pros:
• Sampling from the entire model.
Cons:
• It models the data distribution poorly.
• The chain mixes slowly (sampling happens in the high-dimensional image space).
24. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h) <-- (This paper's model)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
25. Method | PPGN-h: Generator and DAE
model of p(h)
To address the poor mixing speed of DGN-AM, they incorporate a proper p(h) prior, learned via a DAE, into the sampling procedure.
26. Method | PPGN-h: Generator and DAE
model of p(h)
The update rule to sample h from this model:
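Reconstructed from the paper's notation (R_h is the DAE for h, C the classifier network; treat as a paraphrase):

```latex
h_{t+1} = h_t + \epsilon_1 \big(R_h(h_t) - h_t\big)
  + \epsilon_2\, \frac{\partial \log C(y = y_c \mid G(h_t))}{\partial h_t}
  + N(0,\, \epsilon_3^2)
```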
27. Method | PPGN-h: Generator and DAE model
of p(h)
Pros:
• The chain mixes faster than PPGN-x.
Cons:
• Samples from PPGN-h are qualitatively similar to those from DGN-AM.
• Samples still lack quality and diversity (due to the poor p(h) model learned by the DAE).
28. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
29. Generating images with different condition
networks | Captioning
PPGNs can be flexibly turned into a text-to-image model by combining the
prior with an image captioning network, and this process does not even
require additional training.
30. Generating images with different condition
networks | Multifaceted Feature Visualization
Instead of conditioning on a class output neuron, here we condition on a
hidden neuron, revealing many facets that a neuron has learned to detect
Figure 6: Images synthesized to activate a hidden neuron (number 196) previously identified as a
“face detector neuron”
31. Index
Motivation
What is GAN?
How does GAN work?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
32. Inpainting
Because PPGNs can be interpreted
probabilistically, we can also sample
from them conditioned on part of an
image (in addition to the class
condition) to perform inpainting.
33. Index
Motivation
What is GAN?
Probabilistic interpretation of iterative image generation methods
Methods and Experiments
o DGN-AM: sampling without a learned prior
o PPGN-x: DAE model of p(x)
o PPGN-h: Generator and DAE model of p(h)
Additional Results
o Generating images with different condition networks (Captioning, Multifaceted Feature Visualization)
o Inpainting
Conclusion
34. Conclusion
The P&P model generates 227 × 227 images, which is considered high resolution among image generation models.
The most useful property of PPGNs is the "plug and play" capability: one can drop in a replaceable condition network and generate images according to a condition (class, caption, or neuron) specified at test time.
PPGNs can be used to synthesize images for videos or to create art with one or even multiple condition networks at the same time.
The approach is modality-agnostic and can be applied to many types of data.
Here are the steps a GAN takes:
- The generator takes in random numbers and returns an image.
- This generated image is fed into the discriminator alongside a stream of images taken from the actual dataset.
- The discriminator takes in both real and fake images and returns probabilities, a number between 0 and 1, with 1 representing a prediction of authenticity and 0 representing fake.
So you have a double feedback loop:
- The discriminator is in a feedback loop with the ground truth of the images, which we know.
- The generator is in a feedback loop with the discriminator.
Ref: https://skymind.ai/wiki/generative-adversarial-network-gan
In Plug and Play, to generate images they combine two different types of networks:
The generator can be thought of as a generic painter that draws a wide variety of images and shows them to the conditioner, which looks at the images and tells the generator what to draw next.
This process is iterative.
replaceable conditioner
For obtaining random samples, they use the Metropolis-adjusted Langevin algorithm (MALA), a Markov chain Monte Carlo (MCMC) method for sampling from a probability distribution for which direct sampling is difficult.
The future state is based on the current state plus a gradient term plus noise.
joint model p(x, y), which can be decomposed into an image model and a classification model:
In every update they encourage the image to be realistic, class specific and diverse.
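As a toy illustration of that three-term update (a hypothetical 1-D sketch, not the paper's code: the "prior" is a standard normal, the "condition" is a sigmoid classifier p(y=1|x), and the step sizes ε₁, ε₂, ε₃ are made-up values):

```python
import math
import random

def sample_mala_approx(steps=50000, eps1=0.01, eps2=0.01, eps3=0.1, seed=0):
    """Toy MALA-approx chain:
    x_{t+1} = x_t + eps1 * grad log p(x_t)          (realistic / prior)
            + eps2 * grad log p(y=1 | x_t)          (class-specific / condition)
            + N(0, eps3^2)                          (diverse / noise)
    Prior: standard normal, so grad log p(x) = -x.
    Condition: p(y=1|x) = sigmoid(x), so grad log = 1 - sigmoid(x)."""
    rng = random.Random(seed)
    x = 0.0
    samples = []
    for _ in range(steps):
        grad_prior = -x                                # pull toward "generic" samples
        grad_cond = 1.0 - 1.0 / (1.0 + math.exp(-x))   # push toward higher p(y=1|x)
        x = x + eps1 * grad_prior + eps2 * grad_cond + rng.gauss(0.0, eps3)
        samples.append(x)
    return samples
```

With the condition term on, the chain's mean drifts positive (the toy classifier favors large x); with eps2=0 the chain stays centered at 0, i.e. it samples only the prior.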
It mixes fast because we sample in the latent space.
I will cover that in the upcoming slides.
In this paper, they propose a class of models called PPGNs that are composed of 1) a generator network G that is trained to draw a wide range of image types, and 2) a replaceable “condition” network C that tells G what to draw
Starting from b
Instead of sampling in the image space (i.e. in the space of individual pixels)
they sample in the abstract, high-level feature space h of a generator G trained to reconstruct images x from compressed features h extracted from a pre-trained encoder E (f).
So the input here is no longer an image but rather a random noise vector.
Because the generator network was trained to produce realistic images, it serves as a prior on p(x) since it ideally can only generate real images.
However, this model has no learned prior on p(h)
Explain the update equation:
x is a deterministic variable, so we can simplify the model
they define a Gaussian p(h) centered at 0
The final h is pushed through G to produce an image sample.
They train a DAE for images and incorporate it to the sampling procedure as a p(x) prior to avoid fooling examples
Here, explain what the DAE is: a trained network which reconstructs an image after some random noise has been added to it.
We use it to estimate the score function of p(x) by subtracting the input from the output and dividing by the noise variance (σ²).
We take the MALA equation and put the DAE formula into the first term; this represents how we generate images.
I will be mentioning the mixing speed in the presentation, which means the "mixing time" of the Markov chain:
The mixing time has a direct impact on sampling quality since, the smaller the mixing time, the faster the convergence of the Markov chain to the stationary distribution, and the smaller the correlation in the samples.
For this approach, point out the cons listed on the slide.
Here they include all three ε terms.
"mixing time" of the Markov chain:
The mixing time has a direct impact on sampling quality since, the smaller the mixing time, the faster the convergence of the Markov chain to the stationary distribution, and the smaller the correlation in the samples.
Inpainting is filling in missing pixels given the observed context regions.
The model must understand the entire image to be able to reasonably fill in a large masked-out region that is positioned randomly.
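A minimal sketch of that idea (hypothetical, not the paper's setup): two jointly Gaussian variables stand in for "context pixels" and "masked pixels"; the observed one is clamped and only the masked one is updated by Langevin steps, so the chain samples the conditional distribution of the missing part given the observed part:

```python
import math
import random

def inpaint_sample(x_obs=1.0, rho=0.8, steps=50000, eps=0.01, seed=0):
    """Toy 'inpainting' by conditional Langevin sampling.
    Joint prior: (x_obs, x_miss) is standard bivariate normal with
    correlation rho. Clamping x_obs and taking Langevin steps on
    x_miss samples p(x_miss | x_obs) = N(rho * x_obs, 1 - rho**2)."""
    rng = random.Random(seed)
    x = 0.0  # the masked variable, initialized arbitrarily
    samples = []
    for _ in range(steps):
        # gradient of the log joint density w.r.t. the masked variable only
        grad = -(x - rho * x_obs) / (1.0 - rho ** 2)
        x = x + eps * grad + math.sqrt(2.0 * eps) * rng.gauss(0.0, 1.0)
        samples.append(x)
    return samples
```

The observed value never changes; the sampler fills in the missing value consistently with it, which is the 1-D analogue of filling a masked region consistently with the surrounding image.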