SlideShare a Scribd company logo
Michael Jordan
University of California, Berkeley
On Blending Machine
Learning with
Microeconomics
Where is Intelligence Found on Earth?
University of California, Berkeley
Where is Intelligence Found on Earth?
• Brains and Minds
University of California, Berkeley
Brains and Minds
• But our understanding of how intelligence arises in
brains and minds is very primitive; we’re still at the
stage of metaphor
University of California, Berkeley
Brains and Minds
• But our understanding of how intelligence arises in
brains and minds is very primitive; we’re still at the
stage of metaphor
• Moreover, to build intelligent systems it is neither
necessary nor sufficient that the components of the
system exhibit human-like intelligence
University of California, Berkeley
Brains and Minds
• But our understanding of how intelligence arises in
brains and minds is very primitive; we’re still in the
stage of metaphor
• Moreover, to build intelligent systems it is neither
necessary nor sufficient that the components of the
system exhibit human-like intelligence
• Are there other examples of intelligence worth
mimicking in computer algorithms on the planet?
University of California, Berkeley
Where is Intelligence Found on Earth?
• Brains and Minds
• Markets
University of California, Berkeley
Markets
• Markets can be viewed as decentralized algorithms
• They accomplish complex tasks like bringing the
necessary goods into a city day in and day out
• They are adaptive (accommodating change in physical
or social structure), robust (working rain or shine),
scalable (working in small villages and big cities), and
they can have a very long lifetime
– indeed, they can work for decades or centuries
– if we’re looking for principles for lifelong adaptation, we should be
considering markets as intelligent systems!
• Of course, markets aren’t perfect, which simply means
that there are research opportunities
University of California, Berkeley
Examples at the Interface of ML and Econ
• Multi-way markets in which the individual agents need to
explore to learn their preferences
• Large-scale multi-way markets in which agents view other
sides of the market via recommendation systems
• Inferential methods for mitigating information asymmetries
• Latent variable inference in game theory
• Data collection in strategic settings
• Information sharing, free riding
• The goal is to discover new principles to build healthy (e.g.,
fair) learning-based markets that are stabilized over long
stretches of time
University of California, Berkeley
AI (aka Machine Learning) Successes
• First Generation (‘90-’00): the backend
– e.g., fraud detection, search, supply-chain management
• Second Generation (‘00-’10): the human side
– e.g., recommendation systems, commerce, social media
• Third Generation (‘10-now): pattern recognition
– e.g., speech recognition, computer vision, translation
• Fourth Generation (emerging): markets
– not just one agent making a decision or sequence of
decisions
– but a huge interconnected web of data, agents, decisions
University of California, Berkeley
Decisions
• It’s not just a matter of a threshold
• Real-world decisions with consequences
– counterfactuals, provenance, relevance, dialog
• Sets of decisions across a network
– false-discovery rate (instead of precision/recall/accuracy)
• Sets of decisions across a network over time
– streaming, asynchronous decisions (cf. Zrnic, Ramdas & Jordan,
Asynchronous online testing of multiple hypotheses, arXiv, 2019)
• Decisions when there is scarcity and competition
– need for an economic perspective
Consider Classical Recommendation
Systems
• A record is kept of each customer’s purchases
• Customers are “similar” if they buy similar sets of
items
• Items are “similar” are they are bought together by
multiple customers
• Recommendations are made on the basis of these
similarities
• These systems have become a commodity
Multiple Decisions with Competition
• Suppose that recommending a certain movie is a good
business decision (e.g., because it’s very popular)
• Is it OK to recommend the same movie to everyone?
• Is it OK to recommend the same book to everyone?
• Is it OK to recommend the same restaurant to
everyone?
• Is it OK to recommend the same street to every driver?
• Is it OK to recommend the same stock purchase to
everyone?
The Alternative: Create a Market
• A two-way market between consumers and producers
– based on recommendation systems on both sides
• E.g., diners are one side of the market, and restaurants
on the other side
• E.g., drivers are one side of the market, and street
segments on the other side
• This isn’t just classical microeconomics; the use of
recommendation systems is key
University of California, Berkeley
The Alternative: Create a Market
• A two-way market between consumers and producers
– based on recommendation systems on both sides
• E.g., diners are one side of the market, and restaurants
on the other side
• E.g., drivers are one side of the market, and street
segments on the other side
• This isn’t just classical microeconomics; the use of
recommendation systems is key
University of California, Berkeley
Machine Learning Meets Economics
• Bandits that compete (with Lydia Liu and Horia Mania)
– how to explore and exploit when others may also be attempting to
explore and exploit
• Finding Nash equilibria with gradient-based algorithms in
high-dimensional action spaces (with Eric Mazumdar)
– avoiding saddle points that are not Nash equilibria
• Local Stackelberg equilibria in nonconvex-nonconcave
games (with Chi Jin and Praneeth Netrapalli)
– and how does this relate to SGD?
• Asynchronous, online false-discovery rate control (with
Tijana Zrnic and Aaditya Ramdas)
– the real-world A/B testing problem
University of California, Berkeley
Lydia Liu Horia Mania
Competing Bandits in
Matching Markets
• MABs offer a natural platform to understand exploration /
exploitation trade-offs
1
2
3
.
Multi-Armed Bandits
Buyers / Demand Sellers / Supply
1 > 3 > 2
Suppose we have a market in which the participants have
preferences:
2 > 3 > 1
1 > 2 > 3
1 > 2 > 3
3 > 1 > 2
2 > 1 > 3
Gale and Shapley introduced this problem in 1962 and proposed
a celebrated algorithm that always finds a stable match
In this algorithm one side of the market iteratively makes
proposals to the other side
Matching Markets
What if the participants in the market do not know their
preferences a priori, but observe noisy utilities through
repeated interactions?
Matching Markets Meet Learning
Now the participants have an exploration/exploitation problem,
in the context of other participants
1
.
2
.
3
.
0
Competing Agents
• We conceive of a bandit market: agents on one side, arms on
the other side.
Agents get noisy rewards when they pull arms.
Arms have preferences over agents (these
preferences can also express agents’ skill
levels)
When multiple agents pull the same arm only
the most preferred agent gets a reward.
Bandit Markets
Then it is natural to define the regret of agent i up to time n
as:
Mean reward of
stable match
Reward at time t
Minimizing this regret is natural. It says that agents should
expect rewards as good as their stable match in hindsight.
Regret in Bandit Markets
Gale-Shapley upper confidence bounds (GS-UCB):
• Agents rank arms according to upper confidence bounds
for the mean rewards.
• Agents submit rankings to a matching platform.
• The platform uses these rankings to run the Gale-Shapley
algorithm to match agents and arms.
• Agents receive rewards and update upper confidence
bounds.
• Repeat.
Regret-Minimizing Algorithm
Theorem (informal): If there are N agents and K arms and
GS-UCB is run, the regret of agent i satisfies
Reward gap of possibly other agents.
• In other words, if the bear decides to explore more, the human
might have higher regret.
• See paper for refinements of this bound and further discussion of
exploration-exploitation trade-offs in this setting.
• Finally, we note that GS-UCB is incentive compatible. No single
agent has an incentive to deviate from the method.
Theorem
Eric Mazumdar
Finding Nash Equilibria (and Only
Nash Equilibria) with Gradients
Why zero-sum games?
• A lot of recent interest in finding the local Nash equilibria of zero-sum
continuous games.
• e.g. adversarial learning, training GANs, robust reinforcement learning
Zero-Sum Games
Issues with gradient-play in 2-player zero-sum games
Not all locally
asymptotically
stable equilibria
are differential
Nash equilibria!
Simultaneous Gradient
descent (simGD) has
two main issues:
• Convergence to limit
cycles.
• Convergence to non-
Nash fixed points.
2
Recent algorithms do not solve these problems
2
9
On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS
Behaviors in Zero-sum Games
Algorithm
Avoid some
local Nash
eq.
Converge to
non-Nash
eq.
Oscillate
around eq.
Converge to
limit cycles
Gradient
play
Symplectic
Gradient
Adjustment
Consensus
Optimization
New algorithm for gradient-based learning in zero-sum
games:
3
0
Local Symplectic Surgery:
On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS
By cancelling out the symplectic part of the vector field
around critical points, the only equilibria to which this method
can converge are the local Nash equilibria of the game.
31On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS
GAN experiments on a test for mode-collapse
LSS
(~50,000 iters)
simGD
(~200,000 iters)
Consensus opt.
(~100,000 iters)
Curvature exploitation
(~15,000 iters)
Ground truth is a mixture of 16 Gaussians used to test for mode collapse, with covariance 0.005 I2
Generator and discriminator are Tanh neural networks with 4 hidden layers of 100 neurons each.
3
2
On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS
LSS seems to converge to better solutions
LSS
(~50,000
iters)
simGD
(~200,000
iters)
Consensus
(~100,000
iters)
Curvature
(~15,000 iters)
From an initialization where
simGD quickly converges to an
incorrect distribution, LSS
recovers the ground truth.
• Consensus optimization, without
careful hyper-parameter tuning
gives rise to new non-Nash
equilibria, which results in worse
performance.
• Local curvature exploitation is
hard to implement in high
dimensional settings due to the
need to find eigenvalue/eigenvector
pairs of the diagonal blocks of J at
each iteration
What is Local Optimality in Nonconvex-Nonconcave
Minimax Optimization?
Chi Jin1, Praneeth Netrapalli2, Michael I. Jordan1
1
University of California, Berkeley. 2
Microsoft Research, India.
Minmax Optimization
Multi-agent decision making:
or
Framework:
min
x∈Rd
max
y∈Rd
f (x, y)
where f is nonconvex in x and nonconcave in y.
Applications
Adversarial Training
Generative Adversarial Network (GAN)
Algorithms and Questions
Gradient Descent Ascent (GDA):



xt+1 = xt − ηx f (xt ).
yt+1 = yt + ηy f (yt ).
−2 0 2
x1
−2
−1
0
1
2
x2
Fundamental questions:
• What “optimal” points should we find?
• What are the points that GDA converges to? (if it converges)
Nash Equilibrium
Point (x , y ) is a Nash equilibrium if:
• y is a maximum of f (x , ·); x is a minimum of f (·, y ).
Convex-concave case: GDA converges to a Nash equilibrium.
Local Nash Equilibrium
Nonconvex-nonconcave case: It’s NP-hard to find Nash equilibrium.
Find a local Nash equilibrium—point (x , y ) such that
• y is a local maximum of f (x , ·); x is a local minimum of f (·, y ).
Are local Nash equilibria what we want in GAN/adversarial training?
Simultaneous vs. Sequential
simultaneous
vs.
sequential
Nash equilibria come from simultaneous games—both act simultaneously.
GAN/adversarial training are sequential games—one leader, one follower.
For nonconvex-nonconcave f , which player acts first is crucial, since in general
min
x∈Rd
max
y∈Rd
f (x, y) = max
y∈Rd
min
x∈Rd
f (x, y)
Minimax Point (Stackelberg Equilibrium)
Point (x , y ) is a Minimax Point (or Stackelberg Equilibrium) if:
• y is a maximum of f (x , ·);
• x is a minimum of φ(·) where φ(x) = maxy f (x, y).
[Leader always prepares for the best follower.]
Minimax points always exist even for nonconvex-nonconcave functions.
A New Notion of Local Optimality
[This Work] Point (x , y ) is a local minimax point if:
• y is a local maximum of f (x , ·);
• ∃ 0 > 0, so that x is a local minimum of g (·) for any ≤ 0,
where g (x) = maxy: y−y ≤ f (x, y).
Limit Points of GDA
[DP18, MR18]: The stable limit points of GDA need not to be local Nash!
Theorem [This Work]
When learning rate ηy /ηx → ∞, the stable limit points of GDA are exactly
local minimax points up to some degenerate points.
γ-GDA
Local
Minimax
(∞-GDA)
Local
Maximin
Local
Nash

More Related Content

Similar to AISF19 - On Blending Machine Learning with Microeconomics

Knowledge based reasoning using aiml.pdf
Knowledge based reasoning using aiml.pdfKnowledge based reasoning using aiml.pdf
Knowledge based reasoning using aiml.pdf
anuabhi0107
 
1 artificial intelligence
1  artificial intelligence1  artificial intelligence
1 artificial intelligence
Ahmad sohail Kakar
 
wellington1.ppt
wellington1.pptwellington1.ppt
Utah Code Camp 2014 - Learning from Data by Thomas Holloway
Utah Code Camp 2014 - Learning from Data by Thomas HollowayUtah Code Camp 2014 - Learning from Data by Thomas Holloway
Utah Code Camp 2014 - Learning from Data by Thomas Holloway
Thomas Holloway
 
Foundations of Competitive Strategy (Part I)
Foundations of Competitive Strategy (Part I)Foundations of Competitive Strategy (Part I)
Foundations of Competitive Strategy (Part I)
ProfessorDeitz
 
AppTheories_T4
AppTheories_T4AppTheories_T4
AppTheories_T4
Manu Muñoz H
 
Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...
Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...
Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...
John Blue
 
Market making in the third sector, rob macmillan, sra seminar april 2013
Market making in the third sector, rob macmillan, sra seminar april 2013Market making in the third sector, rob macmillan, sra seminar april 2013
Market making in the third sector, rob macmillan, sra seminar april 2013
Third Sector Research Centre
 
Animal disease control and value chain practices: Incorporating economics and...
Animal disease control and value chain practices: Incorporating economics and...Animal disease control and value chain practices: Incorporating economics and...
Animal disease control and value chain practices: Incorporating economics and...
ILRI
 
E marketing[1]
E marketing[1]E marketing[1]
E marketing[1]
nanote12
 
Culbert recommender systems
Culbert recommender systemsCulbert recommender systems
Culbert recommender systems
Aravindharamanan S
 
Recommendation Systems : Selection vs Fulfillment
Recommendation Systems : Selection vs FulfillmentRecommendation Systems : Selection vs Fulfillment
Recommendation Systems : Selection vs Fulfillment
Akansha Kumar, Ph.D.
 
Future of AI-powered automation in business
Future of AI-powered automation in businessFuture of AI-powered automation in business
Future of AI-powered automation in business
Louis Dorard
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
Rumman Chowdhury
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
Data Con LA
 
Artificial Intelligence and The Complexity
Artificial Intelligence and The ComplexityArtificial Intelligence and The Complexity
Artificial Intelligence and The Complexity
Hendri Karisma
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
Stanley Wang
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
ananth
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Alan Said
 
Machine learning
Machine learningMachine learning
Machine learning
Dr Geetha Mohan
 

Similar to AISF19 - On Blending Machine Learning with Microeconomics (20)

Knowledge based reasoning using aiml.pdf
Knowledge based reasoning using aiml.pdfKnowledge based reasoning using aiml.pdf
Knowledge based reasoning using aiml.pdf
 
1 artificial intelligence
1  artificial intelligence1  artificial intelligence
1 artificial intelligence
 
wellington1.ppt
wellington1.pptwellington1.ppt
wellington1.ppt
 
Utah Code Camp 2014 - Learning from Data by Thomas Holloway
Utah Code Camp 2014 - Learning from Data by Thomas HollowayUtah Code Camp 2014 - Learning from Data by Thomas Holloway
Utah Code Camp 2014 - Learning from Data by Thomas Holloway
 
Foundations of Competitive Strategy (Part I)
Foundations of Competitive Strategy (Part I)Foundations of Competitive Strategy (Part I)
Foundations of Competitive Strategy (Part I)
 
AppTheories_T4
AppTheories_T4AppTheories_T4
AppTheories_T4
 
Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...
Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...
Dr. Stephen Koontz - Thinning Cash Fed Cattle Trade: How Thin is Too Thin & W...
 
Market making in the third sector, rob macmillan, sra seminar april 2013
Market making in the third sector, rob macmillan, sra seminar april 2013Market making in the third sector, rob macmillan, sra seminar april 2013
Market making in the third sector, rob macmillan, sra seminar april 2013
 
Animal disease control and value chain practices: Incorporating economics and...
Animal disease control and value chain practices: Incorporating economics and...Animal disease control and value chain practices: Incorporating economics and...
Animal disease control and value chain practices: Incorporating economics and...
 
E marketing[1]
E marketing[1]E marketing[1]
E marketing[1]
 
Culbert recommender systems
Culbert recommender systemsCulbert recommender systems
Culbert recommender systems
 
Recommendation Systems : Selection vs Fulfillment
Recommendation Systems : Selection vs FulfillmentRecommendation Systems : Selection vs Fulfillment
Recommendation Systems : Selection vs Fulfillment
 
Future of AI-powered automation in business
Future of AI-powered automation in businessFuture of AI-powered automation in business
Future of AI-powered automation in business
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
 
Artificial Intelligence and The Complexity
Artificial Intelligence and The ComplexityArtificial Intelligence and The Complexity
Artificial Intelligence and The Complexity
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Machine learning
Machine learningMachine learning
Machine learning
 

More from Bill Liu

Walk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectWalk Through a Real World ML Production Project
Walk Through a Real World ML Production Project
Bill Liu
 
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Bill Liu
 
Productizing Machine Learning at the Edge
Productizing Machine Learning at the EdgeProductizing Machine Learning at the Edge
Productizing Machine Learning at the Edge
Bill Liu
 
Transformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroTransformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to Hero
Bill Liu
 
Deep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps WorkflowsDeep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps Workflows
Bill Liu
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
Bill Liu
 
Practical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScalePractical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at Scale
Bill Liu
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBuilding large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
Bill Liu
 
Deep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsDeep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its Applications
Bill Liu
 
Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19
Bill Liu
 
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsHighly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Bill Liu
 
Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...
Bill Liu
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
Bill Liu
 
Weekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on MobileWeekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on Mobile
Bill Liu
 
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine LearningWeekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Bill Liu
 
AISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldAISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First World
Bill Liu
 
AISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the EdgeAISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the Edge
Bill Liu
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
Bill Liu
 
Toronto meetup 20190917
Toronto meetup 20190917Toronto meetup 20190917
Toronto meetup 20190917
Bill Liu
 
Feature Engineering for NLP
Feature Engineering for NLPFeature Engineering for NLP
Feature Engineering for NLP
Bill Liu
 

More from Bill Liu (20)

Walk Through a Real World ML Production Project
Walk Through a Real World ML Production ProjectWalk Through a Real World ML Production Project
Walk Through a Real World ML Production Project
 
Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...Redefining MLOps with Model Deployment, Management and Observability in Produ...
Redefining MLOps with Model Deployment, Management and Observability in Produ...
 
Productizing Machine Learning at the Edge
Productizing Machine Learning at the EdgeProductizing Machine Learning at the Edge
Productizing Machine Learning at the Edge
 
Transformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to HeroTransformers in Vision: From Zero to Hero
Transformers in Vision: From Zero to Hero
 
Deep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps WorkflowsDeep AutoViML For Tensorflow Models and MLOps Workflows
Deep AutoViML For Tensorflow Models and MLOps Workflows
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
 
Practical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at ScalePractical Crowdsourcing for ML at Scale
Practical Crowdsourcing for ML at Scale
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudiBuilding large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
 
Deep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its ApplicationsDeep Reinforcement Learning and Its Applications
Deep Reinforcement Learning and Its Applications
 
Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19Big Data and AI in Fighting Against COVID-19
Big Data and AI in Fighting Against COVID-19
 
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world ApplicationsHighly-scalable Reinforcement Learning RLlib for Real-world Applications
Highly-scalable Reinforcement Learning RLlib for Real-world Applications
 
Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...Build computer vision models to perform object detection and classification w...
Build computer vision models to perform object detection and classification w...
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
Weekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on MobileWeekly #106: Deep Learning on Mobile
Weekly #106: Deep Learning on Mobile
 
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine LearningWeekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
Weekly #105: AutoViz and Auto_ViML Visualization and Machine Learning
 
AISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First WorldAISF19 - Travel in the AI-First World
AISF19 - Travel in the AI-First World
 
AISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the EdgeAISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the Edge
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
 
Toronto meetup 20190917
Toronto meetup 20190917Toronto meetup 20190917
Toronto meetup 20190917
 
Feature Engineering for NLP
Feature Engineering for NLPFeature Engineering for NLP
Feature Engineering for NLP
 

Recently uploaded

Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
Tobias Schneck
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
Ortus Solutions, Corp
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Neo4j
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
leebarnesutopia
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Ukraine
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
Sunil Jagani
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
AlexanderRichford
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 

Recently uploaded (20)

Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 

AISF19 - On Blending Machine Learning with Microeconomics

  • 1. Michael Jordan University of California, Berkeley On Blending Machine Learning with Microeconomics
  • 2. Where is Intelligence Found on Earth? University of California, Berkeley
  • 3. Where is Intelligence Found on Earth? • Brains and Minds University of California, Berkeley
  • 4. Brains and Minds • But our understanding of how intelligence arises in brains and minds is very primitive; we’re still at the stage of metaphor University of California, Berkeley
  • 5. Brains and Minds • But our understanding of how intelligence arises in brains and minds is very primitive; we’re still at the stage of metaphor • Moreover, to build intelligent systems it is neither necessary nor sufficient that the components of the system exhibit human-like intelligence University of California, Berkeley
  • 6. Brains and Minds • But our understanding of how intelligence arises in brains and minds is very primitive; we’re still in the stage of metaphor • Moreover, to build intelligent systems it is neither necessary nor sufficient that the components of the system exhibit human-like intelligence • Are there other examples of intelligence worth mimicking in computer algorithms on the planet? University of California, Berkeley
  • 7. Where is Intelligence Found on Earth? • Brains and Minds • Markets University of California, Berkeley
  • 8. Markets • Markets can be viewed as decentralized algorithms • They accomplish complex tasks like bringing the necessary goods into a city day in and day out • They are adaptive (accommodating change in physical or social structure), robust (working rain or shine), scalable (working in small villages and big cities), and they can have a very long lifetime – indeed, they can work for decades or centuries – if we’re looking for principles for lifelong adaptation, we should be considering markets as intelligent systems! • Of course, markets aren’t perfect, which simply means that there are research opportunities University of California, Berkeley
  • 9. Examples at the Interface of ML and Econ • Multi-way markets in which the individual agents need to explore to learn their preferences • Large-scale multi-way markets in which agents view other sides of the market via recommendation systems • Inferential methods for mitigating information asymmetries • Latent variable inference in game theory • Data collection in strategic settings • Information sharing, free riding • The goal is to discover new principles to build healthy (e.g., fair) learning-based markets that are stabilized over long stretches of time University of California, Berkeley
  • 10. AI (aka Machine Learning) Successes • First Generation (‘90-’00): the backend – e.g., fraud detection, search, supply-chain management • Second Generation (‘00-’10): the human side – e.g., recommendation systems, commerce, social media • Third Generation (‘10-now): pattern recognition – e.g., speech recognition, computer vision, translation • Fourth Generation (emerging): markets – not just one agent making a decision or sequence of decisions – but a huge interconnected web of data, agents, decisions University of California, Berkeley
  • 11. Decisions • It’s not just a matter of a threshold • Real-world decisions with consequences – counterfactuals, provenance, relevance, dialog • Sets of decisions across a network – false-discovery rate (instead of precision/recall/accuracy) • Sets of decisions across a network over time – streaming, asynchronous decisions (cf. Zrnic, Ramdas & Jordan, Asynchronous online testing of multiple hypotheses, arXiv, 2019) • Decisions when there is scarcity and competition – need for an economic perspective
  • 12. Consider Classical Recommendation Systems • A record is kept of each customer’s purchases • Customers are “similar” if they buy similar sets of items • Items are “similar” are they are bought together by multiple customers • Recommendations are made on the basis of these similarities • These systems have become a commodity
  • 13. Multiple Decisions with Competition • Suppose that recommending a certain movie is a good business decision (e.g., because it’s very popular) • Is it OK to recommend the same movie to everyone? • Is it OK to recommend the same book to everyone? • Is it OK to recommend the same restaurant to everyone? • Is it OK to recommend the same street to every driver? • Is it OK to recommend the same stock purchase to everyone?
  • 14. The Alternative: Create a Market • A two-way market between consumers and producers – based on recommendation systems on both sides • E.g., diners are one side of the market, and restaurants on the other side • E.g., drivers are one side of the market, and street segments on the other side • This isn’t just classical microeconomics; the use of recommendation systems is key University of California, Berkeley
  • 15. The Alternative: Create a Market • A two-way market between consumers and producers – based on recommendation systems on both sides • E.g., diners are one side of the market, and restaurants on the other side • E.g., drivers are one side of the market, and street segments on the other side • This isn’t just classical microeconomics; the use of recommendation systems is key University of California, Berkeley
  • 16. Machine Learning Meets Economics • Bandits that compete (with Lydia Liu and Horia Mania) – how to explore and exploit when others may also be attempting to explore and exploit • Finding Nash equilibria with gradient-based algorithms in high-dimensional action spaces (with Eric Mazumdar) – avoiding saddle points that are not Nash equilibria • Local Stackelberg equilibria in nonconvex-nonconcave games (with Chi Jin and Praneeth Netrapalli) – and how does this relate to SGD? • Asynchronous, online false-discovery rate control (with Tijana Zrnic and Aaditya Ramdas) – the real-world A/B testing problem University of California, Berkeley
  • 17. Lydia Liu Horia Mania Competing Bandits in Matching Markets
  • 18. • MABs offer a natural platform to understand exploration / exploitation trade-offs 1 2 3 . Multi-Armed Bandits
  • 19. Buyers / Demand Sellers / Supply 1 > 3 > 2 Suppose we have a market in which the participants have preferences: 2 > 3 > 1 1 > 2 > 3 1 > 2 > 3 3 > 1 > 2 2 > 1 > 3 Gale and Shapley introduced this problem in 1962 and proposed a celebrated algorithm that always finds a stable match In this algorithm one side of the market iteratively makes proposals to the other side Matching Markets
  • 20. What if the participants in the market do not know their preferences a priori, but observe noisy utilities through repeated interactions? Matching Markets Meet Learning Now the participants have an exploration/exploitation problem, in the context of other participants
  • 22. • We conceive of a bandit market: agents on one side, arms on the other side. Agents get noisy rewards when they pull arms. Arms have preferences over agents (these preferences can also express agents’ skill levels) When multiple agents pull the same arm only the most preferred agent gets a reward. Bandit Markets
  • 23. Then it is natural to define the regret of agent i up to time n as: Mean reward of stable match Reward at time t Minimizing this regret is natural. It says that agents should expect rewards as good as their stable match in hindsight. Regret in Bandit Markets
  • 24. Gale-Shapley upper confidence bounds (GS-UCB): • Agents rank arms according to upper confidence bounds for the mean rewards. • Agents submit rankings to a matching platform. • The platform uses these rankings to run the Gale-Shapley algorithm to match agents and arms. • Agents receive rewards and update upper confidence bounds. • Repeat. Regret-Minimizing Algorithm
  • 25. Theorem (informal): If there are N agents and K arms and GS-UCB is run, the regret of agent i satisfies Reward gap of possibly other agents. • In other words, if the bear decides to explore more, the human might have higher regret. • See paper for refinements of this bound and further discussion of exploration-exploitation trade-offs in this setting. • Finally, we note that GS-UCB is incentive compatible. No single agent has an incentive to deviate from the method. Theorem
  • 26. Eric Mazumdar Finding Nash Equilibria (and Only Nash Equilibria) with Gradients
  • 27. Why zero-sum games? • A lot of recent interest in finding the local Nash equilibria of zero-sum continuous games. • e.g. adversarial learning, training GANs, robust reinforcement learning Zero-Sum Games
  • 28. Issues with gradient-play in 2-player zero-sum games Not all locally asymptotically stable equilibria are differential Nash equilibria! Simultaneous Gradient descent (simGD) has two main issues: • Convergence to limit cycles. • Convergence to non- Nash fixed points. 2
  • 29. Recent algorithms do not solve these problems 2 9 On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS Behaviors in Zero-sum Games Algorithm Avoid some local Nash eq. Converge to non-Nash eq. Oscillate around eq. Converge to limit cycles Gradient play Symplectic Gradient Adjustment Consensus Optimization
  • 30. New algorithm for gradient-based learning in zero-sum games: 3 0 Local Symplectic Surgery: On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS By cancelling out the symplectic part of the vector field around critical points, the only equilibria to which this method can converge are the local Nash equilibria of the game.
  • 31. 31On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS GAN experiments on a test for mode-collapse LSS (~50,000 iters) simGD (~200,000 iters) Consensus opt. (~100,000 iters) Curvature exploitation (~15,000 iters) Ground truth is a mixture of 16 Gaussians used to test for mode collapse, with covariance 0.005 I2 Generator and discriminator are Tanh neural networks with 4 hidden layers of 100 neurons each.
  • 32. 3 2 On finding local Nash equilibria (and only local Nash equilibria) in zero-sum continuous games. E. Mazumdar, M. Jordan, S. Sastry NeurIPS LSS seems to converge to better solutions LSS (~50,000 iters) simGD (~200,000 iters) Consensus (~100,000 iters) Curvature (~15,000 iters) From an initialization where simGD quickly converges to an incorrect distribution, LSS recovers the ground truth. • Consensus optimization, without careful hyper-parameter tuning gives rise to new non-Nash equilibria, which results in worse performance. • Local curvature exploitation is hard to implement in high dimensional settings due to the need to find eigenvalue/eigenvector pairs of the diagonal blocks of J at each iteration
  • 33. What is Local Optimality in Nonconvex-Nonconcave Minimax Optimization? Chi Jin1, Praneeth Netrapalli2, Michael I. Jordan1 1 University of California, Berkeley. 2 Microsoft Research, India.
  • 34. Minmax Optimization Multi-agent decision making: or Framework: min x∈Rd max y∈Rd f (x, y) where f is nonconvex in x and nonconcave in y.
  • 36. Algorithms and Questions Gradient Descent Ascent (GDA):    xt+1 = xt − ηx f (xt ). yt+1 = yt + ηy f (yt ). −2 0 2 x1 −2 −1 0 1 2 x2 Fundamental questions: • What “optimal” points should we find? • What are the points that GDA converges to? (if it converges)
  • 37. Nash Equilibrium Point (x , y ) is a Nash equilibrium if: • y is a maximum of f (x , ·); x is a minimum of f (·, y ). Convex-concave case: GDA converges to a Nash equilibrium.
  • 38. Local Nash Equilibrium Nonconvex-nonconcave case: It’s NP-hard to find Nash equilibrium. Find a local Nash equilibrium—point (x , y ) such that • y is a local maximum of f (x , ·); x is a local minimum of f (·, y ). Are local Nash equilibria what we want in GAN/adversarial training?
  • 39. Simultaneous vs. Sequential simultaneous vs. sequential Nash equilibria come from simultaneous games—both act simultaneously. GAN/adversarial training are sequential games—one leader, one follower. For nonconvex-nonconcave f , which player acts first is crucial, since in general min x∈Rd max y∈Rd f (x, y) = max y∈Rd min x∈Rd f (x, y)
  • 40. Minimax Point (Stackelberg Equilibrium) Point (x , y ) is a Minimax Point (or Stackelberg Equilibrium) if: • y is a maximum of f (x , ·); • x is a minimum of φ(·) where φ(x) = maxy f (x, y). [Leader always prepares for the best follower.] Minimax points always exist even for nonconvex-nonconcave functions.
  • 41. A New Notion of Local Optimality [This Work] Point (x , y ) is a local minimax point if: • y is a local maximum of f (x , ·); • ∃ 0 > 0, so that x is a local minimum of g (·) for any ≤ 0, where g (x) = maxy: y−y ≤ f (x, y).
  • 42. Limit Points of GDA [DP18, MR18]: The stable limit points of GDA need not to be local Nash! Theorem [This Work] When learning rate ηy /ηx → ∞, the stable limit points of GDA are exactly local minimax points up to some degenerate points. γ-GDA Local Minimax (∞-GDA) Local Maximin Local Nash