Martin Arjovsky - Wasserstein GAN - Creative AI meetup

•Download as PPTX, PDF•

0 likes•541 views

The document discusses different approaches to measuring the distance between distributions for unsupervised learning tasks. It describes maximum likelihood and Kullback-Leibler divergence, which can result in mode dropping. Generative adversarial networks (GANs) are introduced as an alternative that trains a generator and discriminator in opposition. However, GANs suffer from vanishing gradients. The document proposes that the Wasserstein distance may be a better measure as it is continuous and avoids mode dropping. It presents preliminary experiments showing the Wasserstein GAN loss correlates better with sample quality than other methods. However, further work is needed to optimize Wasserstein distances and enforce Lipschitz constraints without weight clipping.

Technology

On Different Distances Between Distributions
and Generative Adversarial Networks
Martin Arjovsky

Unsupervised learning
- We have samples from an unknown
distribution

Unsupervised learning
- We have samples from an unknown
distribution
- We want to approximate it by a parametric
distribution that’s close to in some sense.

Maximum Likelihood
- Maximum likelihood:

Maximum Likelihood
- Maximum likelihood:
- Assumptions: continuous with full support.

Maximum Likelihood
- Maximum likelihood:
- Assumptions: continuous with full support.
- Problems: restricted capacity distributes mass.
Modeling low dimensional distributions is impossible.

Kullback-Leibler Divergence
- Closeness measured by KL divergence (equivalent to
ML):

Kullback-Leibler Divergence
- Closeness measured by KL divergence (equivalent to
ML):
- When
integrand goes to infinity: high cost for mode
dropping.

Generative Adversarial Networks
- Let be the law of for some simple (e.g.
Gaussian) r.v Z, passed through a complex function.

Generative Adversarial Networks
- Let be the law of for some simple (e.g.
Gaussian) r.v Z, passed through a complex function.
- Discriminator maximizes and generator minimizes

Generative Adversarial Networks
- Under optimal discriminator, minimizes
- Problems: vanishing gradients very quickly when D’s
accuracy is high.

Alternate update
- Alternate update that has less vanishing gradients

Alternate update
- Alternate update that has less vanishing gradients
- Under optimality optimizes

Alternate update
- Alternate update that has less vanishing gradients
- Under optimality optimizes
- Problems: JSD with the wrong sign, reverse KL has

Problems of GANs (and divergences)
- When and lie on low dimensional manifolds,
there’s always a perfect discriminator, that provides
no usable gradients.

Problems of JSD, KLs et al.
- Doesn’t need to be a continuous function
of .
- Learning parallel lines.

Distances between distributions
- The topology JSD induces on probability measures is
too big, therefore few mappings to this space are
continuous.

Wasserstein distance
- Wasserstein loss is continuous (lines ex):

Idea: optimize Wasserstein
- Wasserstein has a dual problem
- Idea: train one net f to maximize the dual, then do
gradient descent on theta.

WGAN loss correlates with sample quality!

Correlation for a normal GAN is terrible

Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!

Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:

Further work needed
- Weight clipping is a terrible way to enforce Lipschitz constraints!
- There are many Wasserstein distances aside from EM:
- What properties do they share? Make them different?

Martin Arjovsky - Wasserstein GAN - Creative AI meetup

In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation. We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios. Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!

FIDO Alliance Osaka Seminar: Overview.pdf

FIDO Alliance

Recently uploaded

Essentials of Automations: The Art of Triggers and Actions in FME

Safe Software

FIDO Alliance Osaka Seminar: Overview.pdf

FIDO Alliance

Quantum Computing: Current Landscape and the Future Role of APIs

Vlad Stirbu

Introduction to CHERI technology - Cybersecurity

mikeeftimakis1

zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs

Alex Pruden

This paper presents Reef, a system for generating publicly verifiable succinct non-interactive zero-knowledge proofs that a committed document matches or does not match a regular expression. We describe applications such as proving the strength of passwords, the provenance of email despite redactions, the validity of oblivious DNS queries, and the existence of mutations in DNA. Reef supports the Perl Compatible Regular Expression syntax, including wildcards, alternation, ranges, capture groups, Kleene star, negations, and lookarounds. Reef introduces a new type of automata, Skipping Alternating Finite Automata (SAFA), that skips irrelevant parts of a document when producing proofs without undermining soundness, and instantiates SAFA with a lookup argument. Our experimental evaluation confirms that Reef can generate proofs for documents with 32M characters; the proofs are small and cheap to verify (under a second). Paper: https://eprint.iacr.org/2023/1886

LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...

DanBrown980551

Do you want to learn how to model and simulate an electrical network from scratch in under an hour? Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)! During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook. PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides: - A fully editable and extendable library for grid component modelling; - Visualization tools to display your network; - Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses; The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well. What you will learn during the webinar: - For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills; - For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.

PHP Frameworks: I want to break free (IPC Berlin 2024)

Ralf Eggert

In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development. This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.

Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx

nkrafacyberclub

RESUME BUILDER APPLICATION Project for students

KAMESHS29

UiPath Test Automation using UiPath Test Suite series, part 4

DianaGray10

Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap. The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies. Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques What will you get from this session? 1. Insights into SAP testing best practices 2. Heatmap utilization for testing 3. Optimization of testing processes 4. Demo Topics covered: Execution from the test manager Orchestrator execution result Defect reporting SAP heatmap example with demo Speaker: Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP

Assure Contact Center Experiences for Your Customers With ThousandEyes

ThousandEyes

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf

FIDO Alliance

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Aggregage

Epistemic Interaction - tuning interfaces to provide information for AI support

Alan Dix

Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024 https://alandix.com/academic/papers/synergy2024-epistemic/ As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.

Climate Impact of Software Testing at Nordic Testing Days

Kari Kakkonen

My slides at Nordic Testing Days 6.6.2024 Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.

GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...

James Anderson

Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management. The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM). Speakers: Bob Boule Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle. Gopinath Rebala Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...

Sri Ambati

Pushing the limits of ePRTC: 100ns holdover for 100 days

Adtran

PCI PIN Basics Webinar from the Controlcase Team

ControlCase

DevOps and Testing slides at DASA Connect

Kari Kakkonen

Recently uploaded (20)

Essentials of Automations: The Art of Triggers and Actions in FME

FIDO Alliance Osaka Seminar: Overview.pdf

Quantum Computing: Current Landscape and the Future Role of APIs

Introduction to CHERI technology - Cybersecurity

zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs

LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...

PHP Frameworks: I want to break free (IPC Berlin 2024)

Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx

RESUME BUILDER APPLICATION Project for students

UiPath Test Automation using UiPath Test Suite series, part 4

Assure Contact Center Experiences for Your Customers With ThousandEyes

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Epistemic Interaction - tuning interfaces to provide information for AI support

Climate Impact of Software Testing at Nordic Testing Days

GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...

Pushing the limits of ePRTC: 100ns holdover for 100 days

PCI PIN Basics Webinar from the Controlcase Team

DevOps and Testing slides at DASA Connect

Martin Arjovsky - Wasserstein GAN - Creative AI meetup

1. On Different Distances Between Distributions and Generative Adversarial Networks Martin Arjovsky

2. Unsupervised learning - We have samples from an unknown distribution

3. Unsupervised learning - We have samples from an unknown distribution - We want to approximate it by a parametric distribution that’s close to in some sense.

4. Unsupervised learning - We have samples from an unknown distribution - We want to approximate it by a parametric distribution that’s close to in some sense. - Close how?

5. Maximum Likelihood - Maximum likelihood:

6. Maximum Likelihood - Maximum likelihood: - Assumptions: continuous with full support.

7. Maximum Likelihood - Maximum likelihood: - Assumptions: continuous with full support. - Problems: restricted capacity distributes mass. Modeling low dimensional distributions is impossible.

8. Kullback-Leibler Divergence - Closeness measured by KL divergence (equivalent to ML):

9. Kullback-Leibler Divergence - Closeness measured by KL divergence (equivalent to ML): - When integrand goes to infinity: high cost for mode dropping.

10. Kullback-Leibler Divergence - Closeness measured by KL divergence (equivalent to ML): - When integrand goes to infinity: high cost for mode dropping.

11. Generative Adversarial Networks - Let be the law of for some simple (e.g. Gaussian) r.v Z, passed through a complex function.

12. Generative Adversarial Networks - Let be the law of for some simple (e.g. Gaussian) r.v Z, passed through a complex function. - Discriminator maximizes and generator minimizes

13. Generative Adversarial Networks - Under optimal discriminator, minimizes - Problems: vanishing gradients very quickly when D’s accuracy is high.

14. Discriminator is pretty good...

15. Vanishing gradients, original cost

16. Alternate update - Alternate update that has less vanishing gradients

17. Alternate update - Alternate update that has less vanishing gradients - Under optimality optimizes

18. Alternate update - Alternate update that has less vanishing gradients - Under optimality optimizes - Problems: JSD with the wrong sign, reverse KL has

19. High variance updates

20. Problems of GANs (and divergences) - When and lie on low dimensional manifolds, there’s always a perfect discriminator, that provides no usable gradients.

21. Problems of GANs (and divergences) - When and lie on low dimensional manifolds, there’s always a perfect discriminator, that provides no usable gradients. - Under the same assumptions

22. Problems of JSD, KLs et al. - Doesn’t need to be a continuous function of . - Learning parallel lines.

23. Distances between distributions - The topology JSD induces on probability measures is too big, therefore few mappings to this space are continuous.

24. Distances between distributions - The topology JSD induces on probability measures is too big, therefore few mappings to this space are continuous. - We can use the weak* topology, given by Wasserstein

25. Wasserstein distance - Wasserstein loss is continuous (lines ex):

26. Regularity of Wasserstein

27. Hierarchy of distances

28. Idea: optimize Wasserstein - Wasserstein has a dual problem - Idea: train one net f to maximize the dual, then do gradient descent on theta.

29. Little Gaussian Experiment

30. WGAN loss correlates with sample quality!

31. Correlation for a normal GAN is terrible

32. Improved model stability

33. Improved model stability (cont.)

34. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints!

35. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM:

36. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different?

37. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they?

38. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj).

39. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X.

40. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X. - Which one is wgan using? (Some combination of features / samples L2?)

41. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X. - Which one is wgan using? (Some combination of features / samples L2?) - Can we optimize W for a given metric? (And construct geodesics!)

42. Further work needed - Weight clipping is a terrible way to enforce Lipschitz constraints! - There are many Wasserstein distances aside from EM: - What properties do they share? Make them different? - How much mode dropping / sample quality focused are they? - How do we optimize them? They all have duals, but they are much more complicated. (E.g. for W2 replace lipschitz by convex and convex conj). - Wasserstein requires a metric in X. - Which one is wgan using? (Some combination of features / samples L2?) - Can we optimize W for a given metric? (And construct geodesics!) - Can we learn the metric simultaneously? (And learn geodesics!)

Martin Arjovsky - Wasserstein GAN - Creative AI meetup

Recommended

Recommended

More Related Content

More from Luba Elliott

More from Luba Elliott (20)

Recently uploaded

Recently uploaded (20)

Martin Arjovsky - Wasserstein GAN - Creative AI meetup