Generative Adversarial Networks : Basic architecture and variants

•

2 likes•1,793 views

In this presentation we review the fundamentals behind GANs and look at different variants. We quickly review the theory such as the cost functions, training procedure, challenges and go on to look at variants such as CycleGAN, SAGAN etc.

Software

Generative Adversarial
Networks
Palacode Narayana Iyer Anantharaman
29 Oct 2018

References
• https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb
• https://github.com/keras-team/keras/blob/master/examples/mnist_acgan.py
• https://skymind.ai/wiki/generative-adversarial-network-gan
• https://junyanz.github.io/CycleGAN/
• Self Attention Generative Adversarial Networks: Zhang et al

Why GAN?
• GANs can learn to mimic any distribution and generate data
• The data may be images, speech or music
• The outputs from GANs are found to be quite realistic and impressive
• Thus, GANs have a number of applications: From being a feature in products like
Photoshop to generating synthetic datasets for image augmentation

Generator
• Generates synthetic images given the input noise z
• G is differentiable
• Typically a Gaussian distribution

Training
• Train on 2 mini batches simultaneously
• Training samples
• Generated samples
• Cost

Different Variants of GAN
Ref: https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb

Cycle GAN (2017)
• Original Paper: “Unpaired Image-to-Image Translation using Cycle-Consistent
Adversarial Networks”, Zhu et al

Image to Image Translation
• Image to image translation is aimed at finding a mapping
between an input image (X) and its corresponding output
image (Y), where the pair X, Y are provided in the dataset
• This assumes that we are provided with such a labelled
dataset with pairings
• CycleGAN attempts to find a mapping between images from
source and target domains in the absence of paired
examples
Learn G: X → Y such that the distribution of images from G(X) is
indistinguishable from the distribution Y using an adversarial
loss.
Couple this with an inverse mapping F: Y → X and enforce a
cycle consistency loss to enforce F(G(X)) ≈ X

Cycle GAN: Objective Function
• Two discriminators: Dx and Dy where Dx aims to distinguish between images {x}
and translated images {F(y)}. In the same way Dy aims to discriminate between {y}
and {G(x)}
• The objective function has 2 parts representing the losses:
• adversarial losses for matching the distribution of generated images to the data distribution
in the target domain
• Cycle consistency losses that prevent the learned mappings G and F from contradicting each
other

Exercises
• Go through the original paper and answer the following:
• How is the model evaluated? What are the metrics?
• What are the main applications discussed in the paper?
• What are the limitations and future work?

SAGAN (2018) Zhang et al Abstract
• GANs often use a CNN as a generator
• CNNs capture short range dependencies very well (local receptive fields) but not
effective to capture long distance correlations
• Self Attention Generative Adversarial Networks (SAGAN) is aimed at generating
images that take in to account both short and long distance dependencies in the
source images

Generative Adversarial Networks : Basic architecture and variants

What's hot

Dimensionality ReductionKnoldus Inc.

Dimensionality ReductionSaad Elbeleidy

Ml10 dimensionality reduction-and_advanced_topicsankit_ppt

08 neural networksankit_ppt

Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...Sangwoo Mo

Algorithms Design PatternsAshwin Shiv

07 dimensionality reductionMarco Quartulli

02 image processingankit_ppt

Machine learning Algorithms with a Sagemaker demoHridyesh Bisht

Context-aware preference modeling with factorizationBalázs Hidasi

Ranking and Diversity in Recommendations - RecSys Stammtisch at SoundCloud, B...Alexandros Karatzoglou

Utilizing additional information in factorization methods (research overview,...Balázs Hidasi

Beginners Guide to Non-Negative Matrix FactorizationBenjamin Bengfort

Tutorial on Deep Generative ModelsMLReview

07 learningankit_ppt

Generative Models for General AudiencesSangwoo Mo

06 image featuresankit_ppt

An overview of Hidden Markov Models (HMM)ananth

An overview of machine learningdrcfetr

Session-Based Recommendations with Recurrent Neural Networks(Balazs Hidasi, ...hyunsung lee

What's hot (20)

Dimensionality Reduction

Ml10 dimensionality reduction-and_advanced_topics

08 neural networks

Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...

Algorithms Design Patterns

07 dimensionality reduction

02 image processing

Machine learning Algorithms with a Sagemaker demo

Context-aware preference modeling with factorization

Ranking and Diversity in Recommendations - RecSys Stammtisch at SoundCloud, B...

Utilizing additional information in factorization methods (research overview,...

Beginners Guide to Non-Negative Matrix Factorization

Tutorial on Deep Generative Models

07 learning

Generative Models for General Audiences

06 image features

An overview of Hidden Markov Models (HMM)

An overview of machine learning

Session-Based Recommendations with Recurrent Neural Networks(Balazs Hidasi, ...

Similar to Generative Adversarial Networks : Basic architecture and variants

Vladislav Kolbasin “Introduction to Generative Adversarial Networks (GANs)”Lviv Startup Club

Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...Codiax

DiscoGANIl Gu Yi

Cahall Final Intern PresentationDaniel Cahall

20200322 inpaintingX 37

Volodymyr Lyubinets “Generative models for images”Lviv Startup Club

Distributed deep learningAlireza Shafaei

Unpaired Image Translations Using GANs: A ReviewIRJET Journal

Reading group gan - 20170417Shuai Zhang

Large-scale Recommendation Systems on Just a PCAapo Kyrölä

brief Introduction to Different Kinds of GANsParham Zilouchian

gan.pdfDr.rukmani Devi

Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018Universitat Politècnica de Catalunya

Generative Adversarial Network (GAN) for Image SynthesisRiwaz Mahat

ExplainingMLModels.pdfLHong526661

Weave-D - 2nd Progress Evaluation Presentationlasinducharith

Generative Adversarial Networks and Their Applications in Medical ImagingSanghoon Hong

Deep Generative ModellingPetko Nikolov

Brief introduction on GANDai-Hai Nguyen

Collaborative 3D Modeling by the CrowdRyohei Suzuki

Similar to Generative Adversarial Networks : Basic architecture and variants (20)

Vladislav Kolbasin “Introduction to Generative Adversarial Networks (GANs)”

Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...

DiscoGAN

Cahall Final Intern Presentation

20200322 inpainting

Volodymyr Lyubinets “Generative models for images”

Distributed deep learning

Unpaired Image Translations Using GANs: A Review

Reading group gan - 20170417

Large-scale Recommendation Systems on Just a PC

brief Introduction to Different Kinds of GANs

gan.pdf

Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018

Generative Adversarial Network (GAN) for Image Synthesis

ExplainingMLModels.pdf

Weave-D - 2nd Progress Evaluation Presentation

Generative Adversarial Networks and Their Applications in Medical Imaging

Deep Generative Modelling

Brief introduction on GAN

Collaborative 3D Modeling by the Crowd

Recently uploaded

Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh

Active Directory Penetration Testing, cionsystems.com.pdfCionsystems

Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq

Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions

SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI

What is Binary Language? Computer Number SystemsJheuzeDellosa

Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.

The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.

Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin

why an Opensea Clone Script might be your perfect match.pdfjoe51371421

How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes

Right Money Management App For Your Financial GoalsJhone kinadey

CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823

Test Automation Strategy for Frontend and BackendArshad QA

Software Quality Assurance Interview QuestionsArshad QA

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171

Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveCall Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy

Recently uploaded (20)

Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...

Active Directory Penetration Testing, cionsystems.com.pdf

Salesforce Certified Field Service Consultant

Advancing Engineering with AI through the Next Generation of Strategic Projec...

SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI

What is Binary Language? Computer Number Systems

Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...

The Ultimate Test Automation Guide_ Best Practices and Tips.pdf

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...

Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide

why an Opensea Clone Script might be your perfect match.pdf

How To Troubleshoot Collaboration Apps for the Modern Connected Worker

Right Money Management App For Your Financial Goals

CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️

Test Automation Strategy for Frontend and Backend

Software Quality Assurance Interview Questions

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf

Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live

Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications

Generative Adversarial Networks : Basic architecture and variants

1. Generative Adversarial Networks Palacode Narayana Iyer Anantharaman 29 Oct 2018

2. References • https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb • https://github.com/keras-team/keras/blob/master/examples/mnist_acgan.py • https://skymind.ai/wiki/generative-adversarial-network-gan • https://junyanz.github.io/CycleGAN/ • Self Attention Generative Adversarial Networks: Zhang et al

3. Why GAN? • GANs can learn to mimic any distribution and generate data • The data may be images, speech or music • The outputs from GANs are found to be quite realistic and impressive • Thus, GANs have a number of applications: From being a feature in products like Photoshop to generating synthetic datasets for image augmentation

4. GAN Architecture

5. GAN Architecture

6. GAN Workflow

7. Generator • Generates synthetic images given the input noise z • G is differentiable • Typically a Gaussian distribution

8. Training • Train on 2 mini batches simultaneously • Training samples • Generated samples • Cost

9. Cost Function

10. Different Variants of GAN Ref: https://github.com/lukedeo/keras-acgan/blob/master/acgan-analysis.ipynb

11. Cycle GAN (2017) • Original Paper: “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”, Zhu et al

12. Image to Image Translation • Image to image translation is aimed at finding a mapping between an input image (X) and its corresponding output image (Y), where the pair X, Y are provided in the dataset • This assumes that we are provided with such a labelled dataset with pairings • CycleGAN attempts to find a mapping between images from source and target domains in the absence of paired examples Learn G: X → Y such that the distribution of images from G(X) is indistinguishable from the distribution Y using an adversarial loss. Couple this with an inverse mapping F: Y → X and enforce a cycle consistency loss to enforce F(G(X)) ≈ X

13. CycleGAN Approach

14. Cycle GAN: Objective Function • Two discriminators: Dx and Dy where Dx aims to distinguish between images {x} and translated images {F(y)}. In the same way Dy aims to discriminate between {y} and {G(x)} • The objective function has 2 parts representing the losses: • adversarial losses for matching the distribution of generated images to the data distribution in the target domain • Cycle consistency losses that prevent the learned mappings G and F from contradicting each other

15. Losses

16. Exercises • Go through the original paper and answer the following: • How is the model evaluated? What are the metrics? • What are the main applications discussed in the paper? • What are the limitations and future work?

17. SAGAN (2018) Zhang et al Abstract • GANs often use a CNN as a generator • CNNs capture short range dependencies very well (local receptive fields) but not effective to capture long distance correlations • Self Attention Generative Adversarial Networks (SAGAN) is aimed at generating images that take in to account both short and long distance dependencies in the source images

18. SAGAN

19. SAGAN Architecture

Generative Adversarial Networks : Basic architecture and variants

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Generative Adversarial Networks : Basic architecture and variants

Similar to Generative Adversarial Networks : Basic architecture and variants (20)

More from ananth

More from ananth (20)

Recently uploaded

Recently uploaded (20)

Generative Adversarial Networks : Basic architecture and variants