SlideShare a Scribd company logo
On Different Distances Between Distributions
and Generative Adversarial Networks
Martin Arjovsky
Unsupervised learning
- We have samples from an unknown
distribution
Unsupervised learning
- We have samples from an unknown
distribution
- We want to approximate it by a parametric
distribution that’s close to in some sense.
Unsupervised learning
- We have samples from an unknown
distribution
- We want to approximate it by a parametric
distribution that’s close to in some sense.
- Close how?
Maximum Likelihood
- Maximum likelihood:
Maximum Likelihood
- Maximum likelihood:
- Assumptions: continuous with full support.
Maximum Likelihood
- Maximum likelihood:
- Assumptions: continuous with full support.
- Problems: restricted capacity distributes mass.
Modeling low dimensional distributions is impossible.
Kullback-Leibler Divergence
- Closeness measured by KL divergence (equivalent to
ML):
Kullback-Leibler Divergence
- Closeness measured by KL divergence (equivalent to
ML):
- When
integrand goes to infinity: high cost for mode
dropping.
Kullback-Leibler Divergence
- Closeness measured by KL divergence (equivalent to
ML):
- When
integrand goes to infinity: high cost for mode
dropping.
Generative Adversarial Networks
- Let be the law of for some simple (e.g.
Gaussian) r.v Z, passed through a complex function.
Generative Adversarial Networks
- Let be the law of for some simple (e.g.
Gaussian) r.v Z, passed through a complex function.
- Discriminator maximizes and generator minimizes
Generative Adversarial Networks
- Under optimal discriminator, minimizes
- Problems: vanishing gradients very quickly when D’s
accuracy is high.
Discriminator is pretty good...
Vanishing gradients, original cost
Alternate update
- Alternate update that has less vanishing gradients
Alternate update
- Alternate update that has less vanishing gradients
- Under optimality optimizes
Alternate update
- Alternate update that has less vanishing gradients
- Under optimality optimizes
- Problems: JSD with the wrong sign, reverse KL has
High variance updates
Problems of GANs (and divergences)
- When and lie on low dimensional manifolds,
there’s always a perfect discriminator, that provides
no usable gradients.
Problems of GANs (and divergences)
- When and lie on low dimensional manifolds,
there’s always a perfect discriminator, that provides
no usable gradients.
- Under the same assumptions
Problems of JSD, KLs et al.
- Doesn’t need to be a continuous function
of .
- Learning parallel lines.
Distances between distributions
- The topology JSD induces on probability measures is
too big, therefore few mappings to this space are
continuous.
Distances between distributions
- The topology JSD induces on probability measures is
too big, therefore few mappings to this space are
continuous.
- We can use the weak* topology, given by
Wasserstein
Wasserstein distance
- Wasserstein loss is continuous (lines ex):
Regularity of Wasserstein
Hierarchy of distances
Idea: optimize Wasserstein
- Wasserstein has a dual problem
- Idea: train one net f to maximize the dual, then do
gradient descent on theta.
Little Gaussian Experiment
WGAN loss correlates with sample quality!
Correlation for a normal GAN is terrible
Improved model stability
Improved model stability (cont.)
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?
- How much mode dropping / sample quality focused are they?
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?
- How much mode dropping / sample quality focused are they?
- How do we optimize them? They all have duals, but they are much more
complicated. (E.g. for W2 replace lipschitz by convex and convex conj).
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?
- How much mode dropping / sample quality focused are they?
- How do we optimize them? They all have duals, but they are much more
complicated. (E.g. for W2 replace lipschitz by convex and convex conj).
- Wasserstein requires a metric in X.
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?
- How much mode dropping / sample quality focused are they?
- How do we optimize them? They all have duals, but they are much more
complicated. (E.g. for W2 replace lipschitz by convex and convex conj).
- Wasserstein requires a metric in X.
- Which one is wgan using? (Some combination of features / samples L2?)
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?
- How much mode dropping / sample quality focused are they?
- How do we optimize them? They all have duals, but they are much more
complicated. (E.g. for W2 replace lipschitz by convex and convex conj).
- Wasserstein requires a metric in X.
- Which one is wgan using? (Some combination of features / samples L2?)
- Can we optimize W for a given metric? (And construct geodesics!)
Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?
- How much mode dropping / sample quality focused are they?
- How do we optimize them? They all have duals, but they are much more
complicated. (E.g. for W2 replace lipschitz by convex and convex conj).
- Wasserstein requires a metric in X.
- Which one is wgan using? (Some combination of features / samples L2?)
- Can we optimize W for a given metric? (And construct geodesics!)
- Can we learn the metric simultaneously? (And learn geodesics!)
Martin Arjovsky - Wasserstein GAN - Creative AI meetup

More Related Content

More from Luba Elliott

Luba Elliott - AI art - ICCV Conference
Luba Elliott - AI art - ICCV ConferenceLuba Elliott - AI art - ICCV Conference
Luba Elliott - AI art - ICCV Conference
Luba Elliott
 
Luba Elliott - AI in contemporary art practice - Oxford
Luba Elliott - AI in contemporary art practice - OxfordLuba Elliott - AI in contemporary art practice - Oxford
Luba Elliott - AI in contemporary art practice - Oxford
Luba Elliott
 
Luba Elliott - AI in recent art practice - ML Prague
Luba Elliott - AI in recent art practice - ML PragueLuba Elliott - AI in recent art practice - ML Prague
Luba Elliott - AI in recent art practice - ML Prague
Luba Elliott
 
Three Images of the New - Richard Hames - Creative AI meetup
Three Images of the New - Richard Hames - Creative AI meetupThree Images of the New - Richard Hames - Creative AI meetup
Three Images of the New - Richard Hames - Creative AI meetup
Luba Elliott
 
AI Art Gallery Overview - Luba Elliott - NeurIPS Creativity Workshop
AI Art Gallery Overview - Luba Elliott - NeurIPS Creativity WorkshopAI Art Gallery Overview - Luba Elliott - NeurIPS Creativity Workshop
AI Art Gallery Overview - Luba Elliott - NeurIPS Creativity Workshop
Luba Elliott
 
Creativity is Intelligence - Kenneth Stanley - NeurIPS Creativity Workshop
Creativity is Intelligence - Kenneth Stanley - NeurIPS Creativity WorkshopCreativity is Intelligence - Kenneth Stanley - NeurIPS Creativity Workshop
Creativity is Intelligence - Kenneth Stanley - NeurIPS Creativity Workshop
Luba Elliott
 
Seen by machine: Computational Spectatorship in the BBC Archive
Seen by machine: Computational Spectatorship in the BBC ArchiveSeen by machine: Computational Spectatorship in the BBC Archive
Seen by machine: Computational Spectatorship in the BBC Archive
Luba Elliott
 
Luba Elliott AI art overview
Luba Elliott AI art overview Luba Elliott AI art overview
Luba Elliott AI art overview
Luba Elliott
 
Natasha Jaques - Learning via Social Awareness - Creative AI meetup
Natasha Jaques - Learning via Social Awareness - Creative AI meetupNatasha Jaques - Learning via Social Awareness - Creative AI meetup
Natasha Jaques - Learning via Social Awareness - Creative AI meetup
Luba Elliott
 
Sander Dieleman - Generating music in the raw audio domain - Creative AI meetup
Sander Dieleman - Generating music in the raw audio domain - Creative AI meetupSander Dieleman - Generating music in the raw audio domain - Creative AI meetup
Sander Dieleman - Generating music in the raw audio domain - Creative AI meetup
Luba Elliott
 
Design in AI
Design in AIDesign in AI
Design in AI
Luba Elliott
 
Marco Marchesi - Practical uses of style transfer in the creative industry
Marco Marchesi - Practical uses of style transfer in the creative industryMarco Marchesi - Practical uses of style transfer in the creative industry
Marco Marchesi - Practical uses of style transfer in the creative industry
Luba Elliott
 
Hooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to Sky
Hooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to SkyHooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to Sky
Hooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to Sky
Luba Elliott
 
Luba Elliott - Seeing AI through Art
Luba Elliott - Seeing AI through ArtLuba Elliott - Seeing AI through Art
Luba Elliott - Seeing AI through Art
Luba Elliott
 
Georgia Ward Dyer - O Time thy pyramids - Creative AI meetup
Georgia Ward Dyer - O Time thy pyramids - Creative AI meetupGeorgia Ward Dyer - O Time thy pyramids - Creative AI meetup
Georgia Ward Dyer - O Time thy pyramids - Creative AI meetup
Luba Elliott
 
Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...
Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...
Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...
Luba Elliott
 
Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...
Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...
Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...
Luba Elliott
 
Katriona Beales - Intelligence is not enough - Creative AI meetup
Katriona Beales - Intelligence is not enough - Creative AI meetupKatriona Beales - Intelligence is not enough - Creative AI meetup
Katriona Beales - Intelligence is not enough - Creative AI meetup
Luba Elliott
 
Kate Storrs - Closing the loop between biological and artificial vision - Cre...
Kate Storrs - Closing the loop between biological and artificial vision - Cre...Kate Storrs - Closing the loop between biological and artificial vision - Cre...
Kate Storrs - Closing the loop between biological and artificial vision - Cre...
Luba Elliott
 
Creative AI meetup: Tate IK Prize 2016 presentation by Ros Lawler
Creative AI meetup: Tate IK Prize 2016 presentation by Ros LawlerCreative AI meetup: Tate IK Prize 2016 presentation by Ros Lawler
Creative AI meetup: Tate IK Prize 2016 presentation by Ros Lawler
Luba Elliott
 

More from Luba Elliott (20)

Luba Elliott - AI art - ICCV Conference
Luba Elliott - AI art - ICCV ConferenceLuba Elliott - AI art - ICCV Conference
Luba Elliott - AI art - ICCV Conference
 
Luba Elliott - AI in contemporary art practice - Oxford
Luba Elliott - AI in contemporary art practice - OxfordLuba Elliott - AI in contemporary art practice - Oxford
Luba Elliott - AI in contemporary art practice - Oxford
 
Luba Elliott - AI in recent art practice - ML Prague
Luba Elliott - AI in recent art practice - ML PragueLuba Elliott - AI in recent art practice - ML Prague
Luba Elliott - AI in recent art practice - ML Prague
 
Three Images of the New - Richard Hames - Creative AI meetup
Three Images of the New - Richard Hames - Creative AI meetupThree Images of the New - Richard Hames - Creative AI meetup
Three Images of the New - Richard Hames - Creative AI meetup
 
AI Art Gallery Overview - Luba Elliott - NeurIPS Creativity Workshop
AI Art Gallery Overview - Luba Elliott - NeurIPS Creativity WorkshopAI Art Gallery Overview - Luba Elliott - NeurIPS Creativity Workshop
AI Art Gallery Overview - Luba Elliott - NeurIPS Creativity Workshop
 
Creativity is Intelligence - Kenneth Stanley - NeurIPS Creativity Workshop
Creativity is Intelligence - Kenneth Stanley - NeurIPS Creativity WorkshopCreativity is Intelligence - Kenneth Stanley - NeurIPS Creativity Workshop
Creativity is Intelligence - Kenneth Stanley - NeurIPS Creativity Workshop
 
Seen by machine: Computational Spectatorship in the BBC Archive
Seen by machine: Computational Spectatorship in the BBC ArchiveSeen by machine: Computational Spectatorship in the BBC Archive
Seen by machine: Computational Spectatorship in the BBC Archive
 
Luba Elliott AI art overview
Luba Elliott AI art overview Luba Elliott AI art overview
Luba Elliott AI art overview
 
Natasha Jaques - Learning via Social Awareness - Creative AI meetup
Natasha Jaques - Learning via Social Awareness - Creative AI meetupNatasha Jaques - Learning via Social Awareness - Creative AI meetup
Natasha Jaques - Learning via Social Awareness - Creative AI meetup
 
Sander Dieleman - Generating music in the raw audio domain - Creative AI meetup
Sander Dieleman - Generating music in the raw audio domain - Creative AI meetupSander Dieleman - Generating music in the raw audio domain - Creative AI meetup
Sander Dieleman - Generating music in the raw audio domain - Creative AI meetup
 
Design in AI
Design in AIDesign in AI
Design in AI
 
Marco Marchesi - Practical uses of style transfer in the creative industry
Marco Marchesi - Practical uses of style transfer in the creative industryMarco Marchesi - Practical uses of style transfer in the creative industry
Marco Marchesi - Practical uses of style transfer in the creative industry
 
Hooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to Sky
Hooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to SkyHooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to Sky
Hooman Shayani - CAD/CAM in the Age of AI: Designers’ Journey from Earth to Sky
 
Luba Elliott - Seeing AI through Art
Luba Elliott - Seeing AI through ArtLuba Elliott - Seeing AI through Art
Luba Elliott - Seeing AI through Art
 
Georgia Ward Dyer - O Time thy pyramids - Creative AI meetup
Georgia Ward Dyer - O Time thy pyramids - Creative AI meetupGeorgia Ward Dyer - O Time thy pyramids - Creative AI meetup
Georgia Ward Dyer - O Time thy pyramids - Creative AI meetup
 
Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...
Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...
Daniel Berio - Graffiti synthesis, a motion centric approach - Creative AI me...
 
Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...
Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...
Ali Eslami - Artificial Intelligence and Computer Aided Design - Creative AI ...
 
Katriona Beales - Intelligence is not enough - Creative AI meetup
Katriona Beales - Intelligence is not enough - Creative AI meetupKatriona Beales - Intelligence is not enough - Creative AI meetup
Katriona Beales - Intelligence is not enough - Creative AI meetup
 
Kate Storrs - Closing the loop between biological and artificial vision - Cre...
Kate Storrs - Closing the loop between biological and artificial vision - Cre...Kate Storrs - Closing the loop between biological and artificial vision - Cre...
Kate Storrs - Closing the loop between biological and artificial vision - Cre...
 
Creative AI meetup: Tate IK Prize 2016 presentation by Ros Lawler
Creative AI meetup: Tate IK Prize 2016 presentation by Ros LawlerCreative AI meetup: Tate IK Prize 2016 presentation by Ros Lawler
Creative AI meetup: Tate IK Prize 2016 presentation by Ros Lawler
 

Recently uploaded

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 

Recently uploaded (20)

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 

Martin Arjovsky - Wasserstein GAN - Creative AI meetup

  • 1. On Different Distances Between Distributions and Generative Adversarial Networks Martin Arjovsky
  • 2. Unsupervised learning - We have samples from an unknown distribution
  • 3. Unsupervised learning - We have samples from an unknown distribution - We want to approximate it by a parametric distribution that’s close to in some sense.
  • 4. Unsupervised learning - We have samples from an unknown distribution - We want to approximate it by a parametric distribution that’s close to in some sense. - Close how?
  • 6. Maximum Likelihood - Maximum likelihood: - Assumptions: continuous with full support.
  • 7. Maximum Likelihood - Maximum likelihood: - Assumptions: continuous with full support. - Problems: restricted capacity distributes mass. Modeling low dimensional distributions is impossible.
  • 8. Kullback-Leibler Divergence - Closeness measured by KL divergence (equivalent to ML):
  • 9. Kullback-Leibler Divergence - Closeness measured by KL divergence (equivalent to ML): - When integrand goes to infinity: high cost for mode dropping.
  • 10. Kullback-Leibler Divergence - Closeness measured by KL divergence (equivalent to ML): - When integrand goes to infinity: high cost for mode dropping.
  • 11. Generative Adversarial Networks - Let be the law of for some simple (e.g. Gaussian) r.v Z, passed through a complex function.
  • 12. Generative Adversarial Networks - Let be the law of for some simple (e.g. Gaussian) r.v Z, passed through a complex function. - Discriminator maximizes and generator minimizes
  • 13. Generative Adversarial Networks - Under optimal discriminator, minimizes - Problems: vanishing gradients very quickly when D’s accuracy is high.
  • 16. Alternate update - Alternate update that has less vanishing gradients
  • 17. Alternate update - Alternate update that has less vanishing gradients - Under optimality optimizes
  • 18. Alternate update - Alternate update that has less vanishing gradients - Under optimality optimizes - Problems: JSD with the wrong sign, reverse KL has
  • 20. Problems of GANs (and divergences) - When and lie on low dimensional manifolds, there’s always a perfect discriminator, that provides no usable gradients.
  • 21. Problems of GANs (and divergences) - When and lie on low dimensional manifolds, there’s always a perfect discriminator, that provides no usable gradients. - Under the same assumptions
  • 22. Problems of JSD, KLs et al. - Doesn’t need to be a continuous function of . - Learning parallel lines.
  • 23. Distances between distributions - The topology JSD induces on probability measures is too big, therefore few mappings to this space are continuous.
  • 24. Distances between distributions - The topology JSD induces on probability measures is too big, therefore few mappings to this space are continuous. - We can use the weak* topology, given by Wasserstein
  • 25. Wasserstein distance - Wasserstein loss is continuous (lines ex):
  • 28. Idea: optimize Wasserstein - Wasserstein has a dual problem - Idea: train one net f to maximize the dual, then do gradient descent on theta.
  • 30. WGAN loss correlates with sample quality!
  • 31. Correlation for a normal GAN is terrible
  • 34. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints!
  • 35. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM:
  • 36. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different?
  • 37. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they?
  • 38. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj).
  • 39. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X.
  • 40. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X. - Which one is wgan using? (Some combination of features / samples L2?)
  • 41. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X. - Which one is wgan using? (Some combination of features / samples L2?) - Can we optimize W for a given metric? (And construct geodesics!)
  • 42. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X. - Which one is wgan using? (Some combination of features / samples L2?) - Can we optimize W for a given metric? (And construct geodesics!) - Can we learn the metric simultaneously? (And learn geodesics!)