SlideShare a Scribd company logo
Evolution of the
StyleGAN family
presented by
Vitaly Bondar
Generative DL enthusiast
johngull @ gmail
Recap: GAN
Goodfellow et al. Generative Adversarial Networks, 2014
Recap: DCGAN
Radford et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, 2015
ProgressiveGAN
Karras et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation, 2017
ProgressiveGAN
Karras et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation, 2017
ProgressiveGAN
Karras et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation, 2017
● Leaky ReLU
● Add minibatch standard deviation as value layer at the end of
discriminator
● Equalized learning rate (initialize weights as N(0;1), normalize with
He at runtime)
● PixelNorm
● Exponential running average for the weights of the generator
● Small mini-batches
● WGAN-GP + discriminator drift regularization | LSGAN + noise at
discriminator
● lr and batch size changes with the resolution changes
StyleGAN
Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
StyleGAN
Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
StyleGAN
Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
● Leaky ReLU
● Equalized learning rate (initialize weights as N(0;1), normalize with
He at runtime)
● Exponential running average for the weights of the generator
● Small mini-batches
● lr and batch size changes with the resolution changes
● NS GAN Loss + R1 regularization
● lr for mapping network = 0.01*lr
● Truncation trick
StyleGAN
Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
StyleGAN
Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
StyleGAN
Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN waterdrops
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN2 solution: weights demodulation
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN2 solution: weights demodulation
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN: low FID, but high PPL
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN2 solution: lazy path length regularization
...
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN: “phase” artifacts
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN2 solution: skip connections
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
● Almost all previous :)
● Weights modulation/demodulation (no demodulation in tRGB)
● NS GAN Loss + R1 regularization + PPL regularization
● Regularization rare applying
● Skip and residual connections instead of progressive growing
● Double size of last resolution
StyleGAN2
Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
StyleGAN2-ADA
Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
GANs problem: data hungry
StyleGAN2-ADA
Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
Usual CV vs GANs
Dataset augmentation
StyleGAN2-ADA
Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
Usual CV vs GANs
Dataset augmentation Generator output
StyleGAN2-ADA
Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
StyleGAN2-ADA
Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
Leaking of augmetation
StyleGAN2-ADA
Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
StyleGAN2-ADA
Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
StyleGAN3 (Alias-Free GAN)
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN2 problem
StyleGAN3 (Alias-Free GAN)
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN2 problem
Despite their hierarchical convolutional nature, the synthesis process of typical generative
adversarial networks depends on absolute pixel coordinates in an unhealthy manner.
Unintentional positional references available to the intermediate layers:
● through image borders
● per-pixel noise inputs
● positional encodings, and aliasing
Authors identified two sources:
● non-ideal upsampling filters (e.g., nearest, bilinear, strided convolutions)
● the pointwise application of nonlinearities such as ReLU.
StyleGAN3 (Alias-Free GAN)
One-line idea:
Use contiguous space equivariant operations to remove alias and
borders-using effects.
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN3 (Alias-Free GAN)
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN2 (without skip)
StyleGAN3 (Alias-Free GAN)
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN3 (Alias-Free GAN)
Convolution - everything good
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN3 (Alias-Free GAN)
Upsampling/downsampling.
● Identity operations in the continuous space, we just need to change sampling
rate.
● Upsampling ~= add zeros for interleaving and convolve with discretized filter
● Downsampling = low-pass filter and sampling =
~= discrete convolution + dropping points
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN3 (Alias-Free GAN)
Nonlinearity
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN3 (Alias-Free GAN)
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN3 (Alias-Free GAN)
StyleGAN3 (Alias-Free GAN)
Karras et al. Alias-Free Generative Adversarial Networks, 2021
StyleGAN3 (Alias-Free GAN)
● In FFHQ (1024×1024) the three generators (StyleGan2, Alias-Free-T,
Alias-Free-R) had 30.0M, 22.3M and 15.8M parameters, while the training times
were 1106, 1576 (+42%) and 2248 (+103%) GPU hours.
● The present research consumed 92 GPU years and 225 MWh of electricity on
an in-house cluster of NVIDIA V100s.
Karras et al. Alias-Free Generative Adversarial Networks, 2021
Karras et al. Alias-Free Generative Adversarial Networks, 2021
Evolution of the StyleGAN family (oversimplified and as for October 2022)

More Related Content

What's hot

BigGAN: Large Scale GAN Training for High Fidelity Natural Image Synthesis
BigGAN: Large Scale GAN Training for High Fidelity Natural Image SynthesisBigGAN: Large Scale GAN Training for High Fidelity Natural Image Synthesis
BigGAN: Large Scale GAN Training for High Fidelity Natural Image Synthesis
Young Seok Kim
 
Basic Generative Adversarial Networks
Basic Generative Adversarial NetworksBasic Generative Adversarial Networks
Basic Generative Adversarial Networks
Dong Heon Cho
 
GANs and Applications
GANs and ApplicationsGANs and Applications
GANs and Applications
Hoang Nguyen
 
Generative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging ApplicationsGenerative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging Applications
Kyuhwan Jung
 
Generative Adversarial Networks
Generative Adversarial NetworksGenerative Adversarial Networks
Generative Adversarial Networks
Mustafa Yagmur
 
GAN in medical imaging
GAN in medical imagingGAN in medical imaging
GAN in medical imaging
Cheng-Bin Jin
 
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Universitat Politècnica de Catalunya
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
남주 김
 
Graph Neural Network in practice
Graph Neural Network in practiceGraph Neural Network in practice
Graph Neural Network in practice
tuxette
 
Generative Adversarial Networks and Their Applications in Medical Imaging
Generative Adversarial Networks  and Their Applications in Medical ImagingGenerative Adversarial Networks  and Their Applications in Medical Imaging
Generative Adversarial Networks and Their Applications in Medical Imaging
Sanghoon Hong
 
InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...
Shuhei Yoshida
 
Generative Adversarial Networks (GANs) - Ian Goodfellow, OpenAI
Generative Adversarial Networks (GANs) - Ian Goodfellow, OpenAIGenerative Adversarial Networks (GANs) - Ian Goodfellow, OpenAI
Generative Adversarial Networks (GANs) - Ian Goodfellow, OpenAI
WithTheBest
 
Encoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image TranslationEncoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image Translation
taeseon ryu
 
Generative models (Geek hub 2021 lecture)
Generative models (Geek hub 2021 lecture)Generative models (Geek hub 2021 lecture)
Generative models (Geek hub 2021 lecture)
Vitaly Bondar
 
Variational Autoencoders For Image Generation
Variational Autoencoders For Image GenerationVariational Autoencoders For Image Generation
Variational Autoencoders For Image Generation
Jason Anderson
 
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
 A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs) A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
Thomas da Silva Paula
 
Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)
Manohar Mukku
 
Generative adversarial text to image synthesis
Generative adversarial text to image synthesisGenerative adversarial text to image synthesis
Generative adversarial text to image synthesis
Universitat Politècnica de Catalunya
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
Yunjey Choi
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
Shunta Saito
 

What's hot (20)

BigGAN: Large Scale GAN Training for High Fidelity Natural Image Synthesis
BigGAN: Large Scale GAN Training for High Fidelity Natural Image SynthesisBigGAN: Large Scale GAN Training for High Fidelity Natural Image Synthesis
BigGAN: Large Scale GAN Training for High Fidelity Natural Image Synthesis
 
Basic Generative Adversarial Networks
Basic Generative Adversarial NetworksBasic Generative Adversarial Networks
Basic Generative Adversarial Networks
 
GANs and Applications
GANs and ApplicationsGANs and Applications
GANs and Applications
 
Generative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging ApplicationsGenerative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging Applications
 
Generative Adversarial Networks
Generative Adversarial NetworksGenerative Adversarial Networks
Generative Adversarial Networks
 
GAN in medical imaging
GAN in medical imagingGAN in medical imaging
GAN in medical imaging
 
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
 
Graph Neural Network in practice
Graph Neural Network in practiceGraph Neural Network in practice
Graph Neural Network in practice
 
Generative Adversarial Networks and Their Applications in Medical Imaging
Generative Adversarial Networks  and Their Applications in Medical ImagingGenerative Adversarial Networks  and Their Applications in Medical Imaging
Generative Adversarial Networks and Their Applications in Medical Imaging
 
InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gen...
 
Generative Adversarial Networks (GANs) - Ian Goodfellow, OpenAI
Generative Adversarial Networks (GANs) - Ian Goodfellow, OpenAIGenerative Adversarial Networks (GANs) - Ian Goodfellow, OpenAI
Generative Adversarial Networks (GANs) - Ian Goodfellow, OpenAI
 
Encoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image TranslationEncoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image Translation
 
Generative models (Geek hub 2021 lecture)
Generative models (Geek hub 2021 lecture)Generative models (Geek hub 2021 lecture)
Generative models (Geek hub 2021 lecture)
 
Variational Autoencoders For Image Generation
Variational Autoencoders For Image GenerationVariational Autoencoders For Image Generation
Variational Autoencoders For Image Generation
 
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
 A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs) A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
 
Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)Generative Adversarial Networks (GAN)
Generative Adversarial Networks (GAN)
 
Generative adversarial text to image synthesis
Generative adversarial text to image synthesisGenerative adversarial text to image synthesis
Generative adversarial text to image synthesis
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
 

More from Vitaly Bondar

Stable Diffusion path
Stable Diffusion pathStable Diffusion path
Stable Diffusion path
Vitaly Bondar
 
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Vitaly Bondar
 
Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2
Vitaly Bondar
 
GANs: key ideas
GANs:  key ideasGANs:  key ideas
GANs: key ideas
Vitaly Bondar
 
Deep learning revolution
Deep learning revolutionDeep learning revolution
Deep learning revolution
Vitaly Bondar
 
A tale about 3rd place on "Agricultural crop cover classification challenge"
A tale about 3rd  place on  "Agricultural crop cover classification challenge"A tale about 3rd  place on  "Agricultural crop cover classification challenge"
A tale about 3rd place on "Agricultural crop cover classification challenge"
Vitaly Bondar
 

More from Vitaly Bondar (6)

Stable Diffusion path
Stable Diffusion pathStable Diffusion path
Stable Diffusion path
 
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
 
Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2
 
GANs: key ideas
GANs:  key ideasGANs:  key ideas
GANs: key ideas
 
Deep learning revolution
Deep learning revolutionDeep learning revolution
Deep learning revolution
 
A tale about 3rd place on "Agricultural crop cover classification challenge"
A tale about 3rd  place on  "Agricultural crop cover classification challenge"A tale about 3rd  place on  "Agricultural crop cover classification challenge"
A tale about 3rd place on "Agricultural crop cover classification challenge"
 

Recently uploaded

Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
gdsczhcet
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
Jayaprasanna4
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
Jayaprasanna4
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 

Recently uploaded (20)

Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 

Evolution of the StyleGAN family

  • 1. Evolution of the StyleGAN family presented by Vitaly Bondar Generative DL enthusiast johngull @ gmail
  • 2. Recap: GAN Goodfellow et al. Generative Adversarial Networks, 2014
  • 3. Recap: DCGAN Radford et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, 2015
  • 4. ProgressiveGAN Karras et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation, 2017
  • 5. ProgressiveGAN Karras et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation, 2017
  • 6. ProgressiveGAN Karras et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation, 2017 ● Leaky ReLU ● Add minibatch standard deviation as value layer at the end of discriminator ● Equalized learning rate (initialize weights as N(0;1), normalize with He at runtime) ● PixelNorm ● Exponential running average for the weights of the generator ● Small mini-batches ● WGAN-GP + discriminator drift regularization | LSGAN + noise at discriminator ● lr and batch size changes with the resolution changes
  • 7. StyleGAN Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
  • 8. StyleGAN Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
  • 9. StyleGAN Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018 ● Leaky ReLU ● Equalized learning rate (initialize weights as N(0;1), normalize with He at runtime) ● Exponential running average for the weights of the generator ● Small mini-batches ● lr and batch size changes with the resolution changes ● NS GAN Loss + R1 regularization ● lr for mapping network = 0.01*lr ● Truncation trick
  • 10. StyleGAN Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
  • 11. StyleGAN Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
  • 12. StyleGAN Karras et al. A Style-Based Generator Architecture for Generative Adversarial Networks, 2018
  • 13. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 StyleGAN waterdrops
  • 14. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 StyleGAN2 solution: weights demodulation
  • 15. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 StyleGAN2 solution: weights demodulation
  • 16. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 StyleGAN: low FID, but high PPL
  • 17. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 StyleGAN2 solution: lazy path length regularization ...
  • 18. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 StyleGAN: “phase” artifacts
  • 19. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 StyleGAN2 solution: skip connections
  • 20. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
  • 21. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
  • 22. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019 ● Almost all previous :) ● Weights modulation/demodulation (no demodulation in tRGB) ● NS GAN Loss + R1 regularization + PPL regularization ● Regularization rare applying ● Skip and residual connections instead of progressive growing ● Double size of last resolution
  • 23. StyleGAN2 Karras et al.Analyzing and Improving the Image Quality of StyleGAN, 2019
  • 24. StyleGAN2-ADA Karras et al. Training Generative Adversarial Networks with Limited Data, 2020 GANs problem: data hungry
  • 25. StyleGAN2-ADA Karras et al. Training Generative Adversarial Networks with Limited Data, 2020 Usual CV vs GANs Dataset augmentation
  • 26. StyleGAN2-ADA Karras et al. Training Generative Adversarial Networks with Limited Data, 2020 Usual CV vs GANs Dataset augmentation Generator output
  • 27. StyleGAN2-ADA Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
  • 28. StyleGAN2-ADA Karras et al. Training Generative Adversarial Networks with Limited Data, 2020 Leaking of augmetation
  • 29. StyleGAN2-ADA Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
  • 30. StyleGAN2-ADA Karras et al. Training Generative Adversarial Networks with Limited Data, 2020
  • 31. StyleGAN3 (Alias-Free GAN) Karras et al. Alias-Free Generative Adversarial Networks, 2021 StyleGAN2 problem
  • 32. StyleGAN3 (Alias-Free GAN) Karras et al. Alias-Free Generative Adversarial Networks, 2021 StyleGAN2 problem Despite their hierarchical convolutional nature, the synthesis process of typical generative adversarial networks depends on absolute pixel coordinates in an unhealthy manner. Unintentional positional references available to the intermediate layers: ● through image borders ● per-pixel noise inputs ● positional encodings, and aliasing Authors identified two sources: ● non-ideal upsampling filters (e.g., nearest, bilinear, strided convolutions) ● the pointwise application of nonlinearities such as ReLU.
  • 33. StyleGAN3 (Alias-Free GAN) One-line idea: Use contiguous space equivariant operations to remove alias and borders-using effects. Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 34. StyleGAN3 (Alias-Free GAN) Karras et al. Alias-Free Generative Adversarial Networks, 2021 StyleGAN2 (without skip)
  • 35. StyleGAN3 (Alias-Free GAN) Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 36. StyleGAN3 (Alias-Free GAN) Convolution - everything good Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 37. StyleGAN3 (Alias-Free GAN) Upsampling/downsampling. ● Identity operations in the continuous space, we just need to change sampling rate. ● Upsampling ~= add zeros for interleaving and convolve with discretized filter ● Downsampling = low-pass filter and sampling = ~= discrete convolution + dropping points Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 38. StyleGAN3 (Alias-Free GAN) Nonlinearity Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 39. StyleGAN3 (Alias-Free GAN) Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 41. StyleGAN3 (Alias-Free GAN) Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 42. StyleGAN3 (Alias-Free GAN) ● In FFHQ (1024×1024) the three generators (StyleGan2, Alias-Free-T, Alias-Free-R) had 30.0M, 22.3M and 15.8M parameters, while the training times were 1106, 1576 (+42%) and 2248 (+103%) GPU hours. ● The present research consumed 92 GPU years and 225 MWh of electricity on an in-house cluster of NVIDIA V100s. Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 43. Karras et al. Alias-Free Generative Adversarial Networks, 2021
  • 44.
  • 45. Evolution of the StyleGAN family (oversimplified and as for October 2022)