This document discusses generative adversarial networks (GANs) and their training. A GAN pairs a generator and a discriminator in an adversarial setup: the generator tries to produce fake images that fool the discriminator, while the discriminator tries to distinguish real images from fake ones accurately. Training is a two-player minimax game between the two networks. Common failure modes include discriminator saturation and mode collapse. Newer methods such as the Wasserstein GAN address these by changing the loss function so that the discriminator acts more like a "critic", giving the generator feedback that helps it match the real data distribution.
16. Min-Max Game
● The generator tries to fool the discriminator
● The discriminator needs to distinguish fake from real as accurately as possible
● Since the two networks are adversaries, this resembles the min-max setup in game theory
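Formally, this is the two-player minimax objective from the original GAN paper (linked in the resources), where D maximizes the value function and G minimizes it:

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_{\text{real}}}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_z}\big[\log\big(1 - D(G(z))\big)\big]
```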
18. Two-Way Optimization
❖ In supervised learning we train the network with a single objective function
❖ Here, however, we have to train the generator and the discriminator separately, alternating between their two objectives (see the sketch below)
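A minimal PyTorch-style sketch of this alternating optimization. The names G, D, opt_G, opt_D and z_dim are placeholders, and it assumes the discriminator ends in a sigmoid and outputs a (batch, 1) probability:

```python
import torch
import torch.nn.functional as F

def gan_train_step(G, D, opt_G, opt_D, real, z_dim=100):
    """One alternating GAN update: first the discriminator, then the generator."""
    batch = real.size(0)
    ones = torch.ones(batch, 1)
    zeros = torch.zeros(batch, 1)

    # Discriminator step: push D(real) -> 1 and D(fake) -> 0.
    z = torch.randn(batch, z_dim)
    fake = G(z).detach()  # detach so this step does not update G
    d_loss = F.binary_cross_entropy(D(real), ones) \
           + F.binary_cross_entropy(D(fake), zeros)
    opt_D.zero_grad()
    d_loss.backward()
    opt_D.step()

    # Generator step: push D(G(z)) -> 1 (the non-saturating form).
    z = torch.randn(batch, z_dim)
    g_loss = F.binary_cross_entropy(D(G(z)), ones)
    opt_G.zero_grad()
    g_loss.backward()
    opt_G.step()

    return d_loss.item(), g_loss.item()
```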
30. Main Problem - Discriminator Saturation
● The discriminator becomes too good :(
● The generator then gets almost no gradient signal, so it has no chance to learn anything
[Figure credit: Yunjey Choi]
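A standard mitigation, described in the original GAN paper, is the non-saturating generator loss: instead of minimizing log(1 - D(G(z))), whose gradient vanishes when the discriminator confidently rejects fakes, the generator maximizes log D(G(z)):

```latex
% Saturating (minimax) form: gradient vanishes when D(G(z)) is near 0
L_G = \mathbb{E}_{z}\big[\log\big(1 - D(G(z))\big)\big]
% Non-saturating heuristic: useful gradient even when D is confident
L_G = -\,\mathbb{E}_{z}\big[\log D(G(z))\big]
```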
32. ● The GAN's task is to make the generated distribution (Pmodel) match the real data distribution (Preal)
33. ● There are standard ways to measure the similarity of two distributions, e.g.:
○ KL divergence
○ Jensen–Shannon divergence
It can be shown that optimizing the GAN loss function is equivalent to minimizing the Jensen–Shannon divergence between the two distributions
35. When we have an optimal discriminator:
Optimizing the loss = minimizing the Jensen–Shannon divergence (as shown below)
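Concretely, from the original GAN paper: for a fixed generator the optimal discriminator is the ratio below, and substituting it back into the value function leaves the Jensen–Shannon divergence up to a constant:

```latex
D^{*}(x) = \frac{p_{\text{real}}(x)}{p_{\text{real}}(x) + p_{\text{model}}(x)},
\qquad
V(G, D^{*}) = 2\,\mathrm{JSD}\big(p_{\text{real}} \,\|\, p_{\text{model}}\big) - \log 4
```

Minimizing this over the generator therefore minimizes the JS divergence, which is zero exactly when Pmodel = Preal.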
36. Challenges in training!
● Non-convergence: the parameters oscillate and keep destabilizing rather than converging (the two-player game may fail to reach a Nash equilibrium).
● Mode collapse: the generator collapses onto a few outputs, producing only a limited variety of samples.
37. Yes! There are more stable methods now!
❖ Wasserstein GAN (WGAN)
WGAN vs. GAN: similar in architecture and overall setup
The only thing that changes is the loss function!
38. Now the loss function is more of a critic!
❖ Previously the discriminator and the generator worked directly against each
other
❖ Now the discriminator instead tries to give the generator an idea of how far
its generated data deviates from the real data distribution
❖ No log probabilities - no diminishing gradients
❖ Uses the Earth Mover's (EM) distance to model the loss function (a sketch of the training loop follows below)
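A minimal sketch of the WGAN training loop with weight clipping, following the WGAN paper linked in the resources. The names G, C (the critic), opt_G, opt_C, z_dim, clip and n_critic are placeholders; the paper samples a fresh real batch for every critic step, while this sketch reuses one batch for brevity:

```python
import torch

def wgan_train_step(G, C, opt_G, opt_C, real, z_dim=100, clip=0.01, n_critic=5):
    """One WGAN update: several critic steps with weight clipping, then one generator step."""
    batch = real.size(0)

    # Critic: maximize E[C(real)] - E[C(fake)], i.e. minimize the negative.
    for _ in range(n_critic):
        z = torch.randn(batch, z_dim)
        c_loss = -(C(real).mean() - C(G(z).detach()).mean())
        opt_C.zero_grad()
        c_loss.backward()
        opt_C.step()
        # Weight clipping: a crude way to keep the critic roughly 1-Lipschitz.
        for p in C.parameters():
            p.data.clamp_(-clip, clip)

    # Generator: minimize -E[C(G(z))], i.e. raise the critic's score on fakes.
    z = torch.randn(batch, z_dim)
    g_loss = -C(G(z)).mean()
    opt_G.zero_grad()
    g_loss.backward()
    opt_G.step()

    return c_loss.item(), g_loss.item()
```

The Improved WGAN paper in the resources replaces weight clipping with a gradient penalty, which enforces the Lipschitz constraint more gently and trains more stably.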
39. Wasserstein Distance (EM Distance)
This measures how much "work" the generator has to do to move its distribution onto the
distribution of the real images
This is why we call it a critic!
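For reference, the WGAN paper defines the Earth Mover's distance and then uses the Kantorovich–Rubinstein duality to make it computable; the critic plays the role of the 1-Lipschitz function f:

```latex
W(p_{\text{real}}, p_{\text{model}})
  = \inf_{\gamma \in \Pi(p_{\text{real}},\, p_{\text{model}})}
      \mathbb{E}_{(x, y) \sim \gamma}\big[\lVert x - y \rVert\big]
  = \sup_{\lVert f \rVert_{L} \le 1}
      \mathbb{E}_{x \sim p_{\text{real}}}\big[f(x)\big]
      - \mathbb{E}_{x \sim p_{\text{model}}}\big[f(x)\big]
```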
40. Reducing the distance between generated samples and real samples
[Figure: the critic scores samples from the generator distribution and the real distribution, guiding the generator toward the real data.]
45. Resources
GAN - https://arxiv.org/abs/1406.2661
WGAN - https://arxiv.org/abs/1701.07875
Improved WGAN - https://arxiv.org/abs/1704.00028
Towards Principled Methods for Training Generative Adversarial Networks - https://openreview.net/pdf?id=Hk4_qw5xe
An excellent series of articles by Jonathan Hui:
https://medium.com/@jonathan_hui/gan-whats-generative-adversarial-networks-and-its-application-f39ed278ef09