5. Introduction
Related work - DCGAN
Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." arXiv preprint arXiv:1511.06434 (2015).
6. Methods
Mutual Information for Inducing Latent Codes
• Input noise vector is decomposed into two parts, z and c (see the sketch below)
• z: source of incompressible noise
• c: latent code targeting the salient structured semantic features of the data distribution
• In a standard GAN, the generator is free to ignore c by finding a solution satisfying P_G(x|c) = P_G(x)
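As a minimal sketch of this decomposition (PyTorch assumed; z_dim = 62 with a 10-way categorical code follows the paper's MNIST setup), the generator input can be formed by concatenating z with a one-hot encoding of c:

```python
import torch
import torch.nn.functional as F

batch_size, z_dim, n_classes = 64, 62, 10

z = torch.randn(batch_size, z_dim)                # incompressible noise z
idx = torch.randint(0, n_classes, (batch_size,))  # sampled categorical code
c = F.one_hot(idx, n_classes).float()             # structured latent code c (one-hot)

gen_input = torch.cat([z, c], dim=1)              # input to G(z, c); shape (64, 72)
```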
7. Methods
Mutual Information for Inducing Latent Codes
• Mutual information I(X; Y): the "amount of information" learned from knowledge of random variable Y about the other random variable X, i.e. I(X; Y) = H(X) − H(X|Y) = H(Y) − H(Y|X)
• Cost function: the information-regularized minimax game min_G max_D V_I(D, G) = V(D, G) − λ I(c; G(z, c)), where I(c; G(z, c)) is maximized in practice through a variational lower bound L_I(G, Q) (an implementation sketch follows)
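A minimal sketch of the mutual-information term for a categorical code (PyTorch assumed; `info_loss`, `q_logits`, and `lam` are hypothetical names): since H(c) is constant for a fixed prior, maximizing the lower bound L_I amounts to minimizing the cross-entropy between the code fed to G and Q's prediction of it from the generated sample.

```python
import torch.nn.functional as F

def info_loss(q_logits, c_idx, lam=1.0):
    # q_logits: Q(c|x) logits for generated samples, shape (B, K)
    # c_idx:    indices of the codes that were fed to G, shape (B,)
    # F.cross_entropy = -E[log Q(c|x)], so minimizing this maximizes L_I
    # (the constant H(c) is dropped).
    return lam * F.cross_entropy(q_logits, c_idx)
```

This term is added to both the generator's and Q's losses; the paper sets λ = 1 for discrete latent codes.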
10. Implementation
• Q and D share all convolutional layers (see the sketch after this list)
• InfoGAN only adds a negligible computation cost to GAN
• L_I(G, Q) always converges faster than the normal GAN objective
• InfoGAN essentially comes for free with GAN
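A minimal sketch of that sharing (PyTorch assumed; layer sizes are illustrative for 28×28 MNIST inputs): D and Q reuse one convolutional trunk and differ only in their final linear heads, which is why Q adds almost no computation.

```python
import torch.nn as nn

class SharedDQ(nn.Module):
    def __init__(self, n_classes=10):
        super().__init__()
        self.trunk = nn.Sequential(                      # shared by D and Q
            nn.Conv2d(1, 64, 4, stride=2, padding=1),    # 28x28 -> 14x14
            nn.LeakyReLU(0.1),
            nn.Conv2d(64, 128, 4, stride=2, padding=1),  # 14x14 -> 7x7
            nn.LeakyReLU(0.1),
            nn.Flatten(),
            nn.Linear(128 * 7 * 7, 1024),
            nn.LeakyReLU(0.1),
        )
        self.d_head = nn.Linear(1024, 1)           # real/fake logit for D
        self.q_head = nn.Linear(1024, n_classes)   # logits for Q(c|x)

    def forward(self, x):
        h = self.trunk(x)                # one forward pass serves both heads
        return self.d_head(h), self.q_head(h)
```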
11. Experiments - Mutual Information Maximization
Goal: to investigate if mutual information can be maximized efficiently
Dataset: MNIST
Latent code: uniform categorical distribution c ~ Cat(K = 10, p = 0.1) (a monitoring sketch follows)
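A minimal sketch of how this can be checked (hypothetical helper name; PyTorch assumed): for a uniform 10-way categorical code, L_I = H(c) + E[log Q(c|x)] is bounded above by H(c) = ln 10 ≈ 2.30, so tracking the estimate over training shows how quickly the bound is maximized.

```python
import math
import torch.nn.functional as F

def estimate_L_I(q_logits, c_idx, n_classes=10):
    h_c = math.log(n_classes)                  # H(c) for a uniform categorical prior
    log_q = F.log_softmax(q_logits, dim=1)
    # Monte Carlo estimate of E[log Q(c|x)] over the batch
    e_log_q = log_q.gather(1, c_idx.unsqueeze(1)).mean()
    return h_c + e_log_q.item()                # approaches ln(10) ~ 2.30 when maximized
```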
12. Experiments - Disentangled Representation
Goal: to evaluate if InfoGAN can learn disentangled and interpretable representations
Dataset: MNIST
Latent code: one categorical code c1 ~ Cat(K = 10, p = 0.1) and two continuous codes c2, c3 ~ Unif(−1, 1) (a traversal sketch follows)
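A minimal sketch of the latent-traversal inspection used here (hypothetical `G` and helper name; assumes the code layout above with two continuous codes): hold z and the categorical code fixed and sweep one continuous code, e.g. from −2 to 2, to see which factor it controls (rotation and width in the paper).

```python
import torch

def traverse(G, z, c_cat, cont_dim=0, steps=10):
    # z and the categorical code stay fixed; one continuous code is swept
    rows = []
    for v in torch.linspace(-2.0, 2.0, steps):  # beyond the Unif(-1, 1) training range
        c_cont = torch.zeros(z.size(0), 2)
        c_cont[:, cont_dim] = v                 # vary only this code
        rows.append(G(torch.cat([z, c_cat, c_cont], dim=1)))
    return torch.stack(rows)                    # grid of generated images
```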
13. Experiments - Disentangled Representation
Dataset: 3D Faces
Latent code: five continuous codes c1, ..., c5 ~ Unif(−1, 1)
14. Experiments - Disentangled Representation
Dataset: 3D Chairs
Latent code: four categorical codes c1, ..., c4 ~ Cat(K = 20, p = 0.05) and one continuous code c5 ~ Unif(−1, 1)
16. Conclusion
• InfoGAN is completely unsupervised and learns interpretable and disentangled representations on challenging datasets.
• InfoGAN adds only negligible computation cost on top of GAN and is easy to train.
• The core idea of using mutual information to induce representations can be applied to other methods like VAE.