SlideShare a Scribd company logo
1 of 18
Download to read offline
InfoGAN: Bridging the
Gap Between Data and
Understanding in GANs
Presenter
Faezeh Maghsoodifar
PhD Student
Introduction
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
Think!
How can we give specific meaning to each dimension within a model's latent
space to control specific attributes of the generated data?
2
GANs
InfoGANs Process Of Maximizing
Introduction Experiments Conclusion
Introduction
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
3
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
GANs
InfoGANs Process Of Maximizing
Introduction Experiments Conclusion
InfoGANs
4
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Traditional GAN
with extra
component Q
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
GANs Process Of Maximizing
InfoGANs
Introduction Experiments Conclusion
Mutual information (MI)
Is between c, G
InfoGANs
5
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Traditional GAN
with extra
component Q
DOI: 10.1115/1.4044076
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
GANs Process Of Maximizing
InfoGANs
Introduction Experiments Conclusion
Generative Adversarial Networks
6
Generative Adversarial Networks
Generate data
(fake data)
Generator & discriminator,
each competing to win.
Fully Connected
Generator
trying to fake
Discriminator,
trying not to be fooled
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
6
6
6
GANs Process Of Maximizing
InfoGANs
Introduction Experiments Conclusion
Deep Convolutional
Generative Adversarial Networks
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Problem with Traditional GANs
1. Changing one dimension in a multi-dimensional does not have a clear associated meaning.
2. Latent vectors lack interpretable semantics, leading to unpredictable changes in outputs.
Formulation of GANs
7
7
GANs Process Of Maximizing
InfoGANs
Introduction Experiments Conclusion
InfoGANs
8
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Mutual information (MI)
Is between
DOI: 10.1115/1.4044076
1 Latent Code → c
2 Generated Images → G(z,c)
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
GANs Process Of Maximizing
InfoGANs
Introduction Experiments Conclusion
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Mutual information measures the
reduction of uncertainty in one
variable when another is observed, a
concept central to InfoGAN's approach.
The lower bound of mutual
information can be approximated and
maximized using Monte Carlo
simulation, streamlining the training
process.
01
02
03
Information
Theory
Maximization
Technique
Monte Carlo
Simulation
P
r
o
c
e
s
s
O
f
M
a
x
i
m
i
zing
Process Of Maximizing MI
InfoGAN employs variational
information maximization, a
technique that provides a lower
bound to the mutual information,
facilitating its maximization.
By using Lemma:
Q directly and
G via the re-parametrization trick
Can be maximized
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
9
GANs
InfoGANs
Introduction Process Of Maximizing
9
Experiments Conclusion
InfoGANs
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Finally, the resulting algorithm calls
Information Maximizing Generative Adversarial Networks (InfoGAN)
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
10
GANs
InfoGANs
Introduction Process Of Maximizing
1
0
Experiments Conclusion
Datasets
MNIST Dataset
CelebA Dataset
SVHN Dataset
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
3D Faces Dataset
3D Chairs Dataset
GANs
InfoGANs Process Of Maximizing
Introduction
11
Experiments Conclusion
What They
Got
MNIST Dataset
Purpose: To disentangle digit shape from style.
The model was used to change the types of digits and styles, such as rotation and
width, and to show the model's ability to generalize well beyond its training range.
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
Disentangled Representation
Evaluation
InfoGAN successfully disentangles digit shape from style, with latent codes capturing
rotation and width, demonstrating natural-looking variations.
GANs
InfoGANs Process Of Maximizing
Introduction
12
Experiments Conclusion
What They
Got
3D Faces Dataset
On the faces’ dataset, InfoGAN learns to represent azimuth, elevation, and lighting as
continuous latent variables without supervision.
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
Disentangled Representation
Evaluation
Purpose: To learn interpretable representations of facial features without supervision.
InfoGAN was used to manipulate features such as pose (azimuth), elevation, and
lighting, demonstrating its ability to discover variations autonomously.
GANs
InfoGANs Process Of Maximizing
Introduction
13
Experiments Conclusion
What They
Got
3D Chairs Dataset
InfoGAN demonstrates its ability to interpolate between chair types, capturing
rotation and width variations with continuous latent codes.
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
Disentangled Representation
Evaluation
Purpose: To learn representations of object features such as rotation and width.
The model learned to interpolate between similar types of chairs and adjust their
widths using a single continuous code, showing its capability in understanding and
varying object dimensions.
GANs
InfoGANs Process Of Maximizing
Introduction
14
Experiments Conclusion
What They
Got
Street View House Numbers (SVHN) dataset
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
Disentangled Representation
Evaluation
Purpose: To learn interpretable representations from a noisy and less uniform dataset.
InfoGAN was tested on its ability to handle real-world complexity and variability in
image resolution and background distractions.
GANs
InfoGANs Process Of Maximizing
Introduction
15
Experiments Conclusion
What They
Got
CelebA dataset (celebrity faces)
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
Disentangled Representation
Evaluation
Purpose: To learn and disentangle complex visual concepts from a dataset with large
variations.
The model was used to control and understand diverse attributes in celebrity
images, such as pose, presence of eyeglasses, hairstyles, and emotions, even
without having multiple images of the same person in different poses
GANs
InfoGANs Process Of Maximizing
Introduction
16
Experiments Conclusion
Conclusion
Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024
Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016).
1. InfoGAN assigns clear meanings to each dimension of the
latent space.
2. Each dimension of the hidden variable represents a distinct
semantic feature.
3. Enables control over specific attributes like handwriting style,
digit shape, and background in images.
4. InfoGAN aims to maximize mutual information between c and
G to retain the meaning of c in the generated images.
5. Introduces a lower bound on mutual information that can be
maximized during training.
GANs
InfoGANs Experiments
Process Of Maximizing
Introduction
17
Conclusion
Thank you
Faezeh Maghsoodifar
fmaghsoodifar@crimson.ua.edu

More Related Content

Similar to InfoGAN:Bridging the Gap Between Data and Understanding in GANs

Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstractsbutest
 
Data and Knowledge as Commodities
Data and Knowledge as CommoditiesData and Knowledge as Commodities
Data and Knowledge as CommoditiesMathieu d'Aquin
 
Remote Patient & Elderly Care Monitoring
Remote Patient & Elderly Care MonitoringRemote Patient & Elderly Care Monitoring
Remote Patient & Elderly Care MonitoringVeselin Pizurica
 
Reviews on Deep Generative Models in the early days / GANs & VAEs paper review
Reviews on Deep Generative Models in the early days / GANs & VAEs paper reviewReviews on Deep Generative Models in the early days / GANs & VAEs paper review
Reviews on Deep Generative Models in the early days / GANs & VAEs paper reviewchangedaeoh
 
Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...
Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...
Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...Chris Hammerschmidt
 
Generative Adversarial Networks 2
Generative Adversarial Networks 2Generative Adversarial Networks 2
Generative Adversarial Networks 2Alireza Shafaei
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESSIJDKP
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESSIJDKP
 
Scientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous ArchitecturesScientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous Architecturesinside-BigData.com
 
A Complete Analysis of Human Action Recognition Procedures
A Complete Analysis of Human Action Recognition ProceduresA Complete Analysis of Human Action Recognition Procedures
A Complete Analysis of Human Action Recognition Proceduresijtsrd
 
The Evolution of Generative AI Models_ From GANs to Transformers.pdf
The Evolution of Generative AI Models_ From GANs to Transformers.pdfThe Evolution of Generative AI Models_ From GANs to Transformers.pdf
The Evolution of Generative AI Models_ From GANs to Transformers.pdfSam H
 
Review on Hand Gesture Recognition
Review on Hand Gesture RecognitionReview on Hand Gesture Recognition
Review on Hand Gesture Recognitiondbpublications
 
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...Jihoo Kim
 
Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...
Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...
Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...IJDKP
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021hala Skaf
 
An Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’sAn Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’sijtsrd
 

Similar to InfoGAN:Bridging the Gap Between Data and Understanding in GANs (20)

Medical ImageNet
Medical ImageNetMedical ImageNet
Medical ImageNet
 
Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstracts
 
Data and Knowledge as Commodities
Data and Knowledge as CommoditiesData and Knowledge as Commodities
Data and Knowledge as Commodities
 
Remote Patient & Elderly Care Monitoring
Remote Patient & Elderly Care MonitoringRemote Patient & Elderly Care Monitoring
Remote Patient & Elderly Care Monitoring
 
Reviews on Deep Generative Models in the early days / GANs & VAEs paper review
Reviews on Deep Generative Models in the early days / GANs & VAEs paper reviewReviews on Deep Generative Models in the early days / GANs & VAEs paper review
Reviews on Deep Generative Models in the early days / GANs & VAEs paper review
 
Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...
Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...
Generative Adversarial Networks (GANs) at the Data Science Meetup Luxembourg ...
 
Generative Adversarial Networks 2
Generative Adversarial Networks 2Generative Adversarial Networks 2
Generative Adversarial Networks 2
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
 
LINK MINING PROCESS
LINK MINING PROCESSLINK MINING PROCESS
LINK MINING PROCESS
 
Scientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous ArchitecturesScientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous Architectures
 
A Complete Analysis of Human Action Recognition Procedures
A Complete Analysis of Human Action Recognition ProceduresA Complete Analysis of Human Action Recognition Procedures
A Complete Analysis of Human Action Recognition Procedures
 
The Evolution of Generative AI Models_ From GANs to Transformers.pdf
The Evolution of Generative AI Models_ From GANs to Transformers.pdfThe Evolution of Generative AI Models_ From GANs to Transformers.pdf
The Evolution of Generative AI Models_ From GANs to Transformers.pdf
 
Review on Hand Gesture Recognition
Review on Hand Gesture RecognitionReview on Hand Gesture Recognition
Review on Hand Gesture Recognition
 
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
 
Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...
Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...
Top Cited Articles in Data Mining - International Journal of Data Mining & Kn...
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021
 
Ml in genomics
Ml in genomicsMl in genomics
Ml in genomics
 
An Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’sAn Extensive Review on Generative Adversarial Networks GAN’s
An Extensive Review on Generative Adversarial Networks GAN’s
 
graziani_bias.pdf
graziani_bias.pdfgraziani_bias.pdf
graziani_bias.pdf
 
GANs in Digital Pathology
GANs in Digital PathologyGANs in Digital Pathology
GANs in Digital Pathology
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 

InfoGAN:Bridging the Gap Between Data and Understanding in GANs

  • 1. InfoGAN: Bridging the Gap Between Data and Understanding in GANs Presenter Faezeh Maghsoodifar PhD Student
  • 2. Introduction Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). Think! How can we give specific meaning to each dimension within a model's latent space to control specific attributes of the generated data? 2 GANs InfoGANs Process Of Maximizing Introduction Experiments Conclusion
  • 3. Introduction Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 3 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). GANs InfoGANs Process Of Maximizing Introduction Experiments Conclusion
  • 4. InfoGANs 4 Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Traditional GAN with extra component Q Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). GANs Process Of Maximizing InfoGANs Introduction Experiments Conclusion Mutual information (MI) Is between c, G
  • 5. InfoGANs 5 Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Traditional GAN with extra component Q DOI: 10.1115/1.4044076 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). GANs Process Of Maximizing InfoGANs Introduction Experiments Conclusion
  • 6. Generative Adversarial Networks 6 Generative Adversarial Networks Generate data (fake data) Generator & discriminator, each competing to win. Fully Connected Generator trying to fake Discriminator, trying not to be fooled Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 6 6 6 GANs Process Of Maximizing InfoGANs Introduction Experiments Conclusion Deep Convolutional
  • 7. Generative Adversarial Networks Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Problem with Traditional GANs 1. Changing one dimension in a multi-dimensional does not have a clear associated meaning. 2. Latent vectors lack interpretable semantics, leading to unpredictable changes in outputs. Formulation of GANs 7 7 GANs Process Of Maximizing InfoGANs Introduction Experiments Conclusion
  • 8. InfoGANs 8 Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Mutual information (MI) Is between DOI: 10.1115/1.4044076 1 Latent Code → c 2 Generated Images → G(z,c) Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). GANs Process Of Maximizing InfoGANs Introduction Experiments Conclusion
  • 9. Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Mutual information measures the reduction of uncertainty in one variable when another is observed, a concept central to InfoGAN's approach. The lower bound of mutual information can be approximated and maximized using Monte Carlo simulation, streamlining the training process. 01 02 03 Information Theory Maximization Technique Monte Carlo Simulation P r o c e s s O f M a x i m i zing Process Of Maximizing MI InfoGAN employs variational information maximization, a technique that provides a lower bound to the mutual information, facilitating its maximization. By using Lemma: Q directly and G via the re-parametrization trick Can be maximized Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). 9 GANs InfoGANs Introduction Process Of Maximizing 9 Experiments Conclusion
  • 10. InfoGANs Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Finally, the resulting algorithm calls Information Maximizing Generative Adversarial Networks (InfoGAN) Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). 10 GANs InfoGANs Introduction Process Of Maximizing 1 0 Experiments Conclusion
  • 11. Datasets MNIST Dataset CelebA Dataset SVHN Dataset Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). 3D Faces Dataset 3D Chairs Dataset GANs InfoGANs Process Of Maximizing Introduction 11 Experiments Conclusion
  • 12. What They Got MNIST Dataset Purpose: To disentangle digit shape from style. The model was used to change the types of digits and styles, such as rotation and width, and to show the model's ability to generalize well beyond its training range. Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). Disentangled Representation Evaluation InfoGAN successfully disentangles digit shape from style, with latent codes capturing rotation and width, demonstrating natural-looking variations. GANs InfoGANs Process Of Maximizing Introduction 12 Experiments Conclusion
  • 13. What They Got 3D Faces Dataset On the faces’ dataset, InfoGAN learns to represent azimuth, elevation, and lighting as continuous latent variables without supervision. Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). Disentangled Representation Evaluation Purpose: To learn interpretable representations of facial features without supervision. InfoGAN was used to manipulate features such as pose (azimuth), elevation, and lighting, demonstrating its ability to discover variations autonomously. GANs InfoGANs Process Of Maximizing Introduction 13 Experiments Conclusion
  • 14. What They Got 3D Chairs Dataset InfoGAN demonstrates its ability to interpolate between chair types, capturing rotation and width variations with continuous latent codes. Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). Disentangled Representation Evaluation Purpose: To learn representations of object features such as rotation and width. The model learned to interpolate between similar types of chairs and adjust their widths using a single continuous code, showing its capability in understanding and varying object dimensions. GANs InfoGANs Process Of Maximizing Introduction 14 Experiments Conclusion
  • 15. What They Got Street View House Numbers (SVHN) dataset Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). Disentangled Representation Evaluation Purpose: To learn interpretable representations from a noisy and less uniform dataset. InfoGAN was tested on its ability to handle real-world complexity and variability in image resolution and background distractions. GANs InfoGANs Process Of Maximizing Introduction 15 Experiments Conclusion
  • 16. What They Got CelebA dataset (celebrity faces) Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). Disentangled Representation Evaluation Purpose: To learn and disentangle complex visual concepts from a dataset with large variations. The model was used to control and understand diverse attributes in celebrity images, such as pose, presence of eyeglasses, hairstyles, and emotions, even without having multiple images of the same person in different poses GANs InfoGANs Process Of Maximizing Introduction 16 Experiments Conclusion
  • 17. Conclusion Faezeh Maghsoodifar InfoGAN: Bridging the Gap Between Data and Understanding in GANs Feb 2024 Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." https://arxiv.org/abs/1606.03657 (2016). 1. InfoGAN assigns clear meanings to each dimension of the latent space. 2. Each dimension of the hidden variable represents a distinct semantic feature. 3. Enables control over specific attributes like handwriting style, digit shape, and background in images. 4. InfoGAN aims to maximize mutual information between c and G to retain the meaning of c in the generated images. 5. Introduces a lower bound on mutual information that can be maximized during training. GANs InfoGANs Experiments Process Of Maximizing Introduction 17 Conclusion