Successfully reported this slideshow.
Your SlideShare is downloading. ×

Variational Discriminator Bottleneck

More Related Content

Related Books

Free with a 30 day trial from Scribd

See all

Related Audiobooks

Free with a 30 day trial from Scribd

See all

Variational Discriminator Bottleneck

  1. 1. VARIATIONAL DISCRIMINATOR BOTTLENECK: IMPROVING IMITATION LEARNING, INVERSE RL, AND GANS BY CONSTRAINING INFORMATION FLOW Yawei Luo
  2. 2. Notoriously D & G D can always find out the nonessential information from G(z) to make a judgement “fake”. -> Uninformative gradients -> Unstable training! How to force D to focus on essential information of G(z)?
  3. 3. Preliminaries • Mutual Information • Object function in information theoretic view
  4. 4. Preliminaries • Information Bottleneck
  5. 5. Preliminaries
  6. 6. Preliminaries q: decoder E: encoder
  7. 7. Back to GANs
  8. 8. Back to GANs Vanilla GAN: GAN with VIB:
  9. 9. Training I(X, Z) > Ic -> beta ++ I(X, Z) < Ic -> beta --
  10. 10. Experiments - IMITATION LEARNING
  11. 11. Experiments - IMITATION LEARNING
  12. 12. Experiments - INVERSE REINFORCEMENT LEARNING
  13. 13. Experiments – image generation

×