SlideShare a Scribd company logo
Sarah H.Q. Li
Ph.D. Candidate
Department of Aeronautics and Astronautics, University of Washington
Incentivizing autonomous
vehicles/spacecraft toward
constraint satisfaction
2 of 27
Autonomous decision making in stochastic environments
- UAV delivery
- In-orbit servicing satellite constellation
- Routing of competitive autonomous vehicles
Motivations
3 of 27
Application: In-orbit satellite
service planning
4 of 27
Extension to MDP
congestion games
Outline
Finite time MDP
01
03
02
04
05
06
Application: congestion
control of competitive
ride-share systems
Incentive design for MDP
congestion games
Frank-Wolfe-Based
Learning method
Bibliography
5 of 27
Markov decision processes
6 of 27
Dynamic programming
7 of 27
Extension to multiple players
8 of 27
MDP Wardrop equilibrium
9 of 27
MDP Wardrop equilibrium
10 of 27
Potential game formulation
[1]: Calderone, Dan, and S. Shankar Sastry. "Markov decision process routing games." In 2017 ACM/IEEE 8th International
Conference on Cyber-Physical Systems (ICCPS), pp. 273-280. IEEE, 2017.
11 of 27
Summary
12 of 27
Algorithms for MDP congestion
game
13 of 27
Frank-Wolfe algorithm for MDP
congestion games
14 of 27
Learning algorithm for MDP
Wardrop equilibrium
[2] Yu, Yue, Dan Calderone, Sarah HQ Li, Lillian J. Ratliff, and Behçet Açıkmeşe. "A Primal-Dual Approach to Markovian Network
Optimization." under review for Automatica.
15 of 27
Application: urban ride-share
16 of 27
Application: urban ride-share
17 of 27
Incentivizing desirable player
behavior
18 of 27
Incentivizing desirable player
behavior
[3] Li, Sarah HQ, Yue Yu, Daniel Calderone, Lillian Ratliff, and Behçet Açikmeşe. "Tolling for constraint satisfaction in markov decision
process congestion games." In 2019 American Control Conference (ACC), pp. 1238-1243. IEEE, 2019.
19 of 27
Finding the minimum tolling value
20 of 27
Finding the minimum tolling value
21 of 27
Finding the minimum tolling value
22 of 27
Finding the minimum tolling value
[4] Li, Sarah HQ, Yue Yu, Dan Calderone, Lillian J. Ratliff, and Behcet Acikmese. "Online Constraint Satisfaction via Tolls in MDP
Congestion Games." arXiv preprint arXiv:1907.08912 (2021).
23 of 27
Incentivizing desirable player
behavior
24 of 27
Collaborators
CREDITS: This presentation template was created by Slidesgo, including
icons by Flaticon, and infographics & images by Freepik.
Please keep this slide for attribution.
25 of 27
Does anyone have any questions?
sarahli@uw.edu
THANKS
[1]: Calderone, Dan, and S. Shankar Sastry. "Markov decision process routing games." In 2017
ACM/IEEE 8th International Conference on Cyber-Physical Systems (ICCPS), pp. 273-280. IEEE, 2017.
[2] Yu, Yue, Dan Calderone, Sarah HQ Li, Lillian J. Ratliff, and Behçet Açıkmeşe. "A Primal-Dual
Approach to Markovian Network Optimization." under review for Automatica.
[3] Li, Sarah HQ, Yue Yu, Daniel Calderone, Lillian Ratliff, and Behçet Açikmeşe. "Tolling for constraint
satisfaction in markov decision process congestion games." In 2019 American Control Conference (ACC),
pp. 1238-1243. IEEE, 2019.
[4] Li, Sarah HQ, Yue Yu, Dan Calderone, Lillian J. Ratliff, and Behcet Acikmese. "Online Constraint
Satisfaction via Tolls in MDP Congestion Games." arXiv preprint arXiv:1907.08912 (2021).
26 of 27
Connection to routing games
27 of 27
Application: In-orbit satellite
service planning

More Related Content

Similar to Mdpcg presentation

A Mashup-based Approach for Virtual SDN Management
A Mashup-based Approach for Virtual SDN ManagementA Mashup-based Approach for Virtual SDN Management
A Mashup-based Approach for Virtual SDN Management
Oscar Caicedo
 
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression AnalysisICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
Ahmed Gad
 
Scientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous ArchitecturesScientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous Architectures
inside-BigData.com
 

Similar to Mdpcg presentation (20)

Lecture 1 - Learning Dynamical Systems from Demonstrations
Lecture 1 - Learning Dynamical Systems from DemonstrationsLecture 1 - Learning Dynamical Systems from Demonstrations
Lecture 1 - Learning Dynamical Systems from Demonstrations
 
Software Architecture - Principles, Patterns and Practices - OSI Days - 2017
Software Architecture - Principles, Patterns and Practices - OSI Days - 2017Software Architecture - Principles, Patterns and Practices - OSI Days - 2017
Software Architecture - Principles, Patterns and Practices - OSI Days - 2017
 
Query Recommendation - Barcelona 2017
Query Recommendation - Barcelona 2017Query Recommendation - Barcelona 2017
Query Recommendation - Barcelona 2017
 
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
 
00 Automatic Mental Health Classification in Online Settings and Language Emb...
00 Automatic Mental Health Classification in Online Settings and Language Emb...00 Automatic Mental Health Classification in Online Settings and Language Emb...
00 Automatic Mental Health Classification in Online Settings and Language Emb...
 
VSlam 2017 11_20(張閎智)
VSlam 2017 11_20(張閎智)VSlam 2017 11_20(張閎智)
VSlam 2017 11_20(張閎智)
 
Agent-Based Modeling & Python @ PyData DC 2016
Agent-Based Modeling & Python @ PyData DC 2016Agent-Based Modeling & Python @ PyData DC 2016
Agent-Based Modeling & Python @ PyData DC 2016
 
EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overview
 
A Mashup-based Approach for Virtual SDN Management
A Mashup-based Approach for Virtual SDN ManagementA Mashup-based Approach for Virtual SDN Management
A Mashup-based Approach for Virtual SDN Management
 
2. visualization in data mining
2. visualization in data mining2. visualization in data mining
2. visualization in data mining
 
Advanced Deep Architectures (D2L6 Deep Learning for Speech and Language UPC 2...
Advanced Deep Architectures (D2L6 Deep Learning for Speech and Language UPC 2...Advanced Deep Architectures (D2L6 Deep Learning for Speech and Language UPC 2...
Advanced Deep Architectures (D2L6 Deep Learning for Speech and Language UPC 2...
 
New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...
 
NVIDIA @ Infinite Conference, London
NVIDIA @ Infinite Conference, LondonNVIDIA @ Infinite Conference, London
NVIDIA @ Infinite Conference, London
 
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression AnalysisICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
 
Scientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous ArchitecturesScientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous Architectures
 
Comparative Analysis of RMSE and MAP Metrices for Evaluating CNN and LSTM Mod...
Comparative Analysis of RMSE and MAP Metrices for Evaluating CNN and LSTM Mod...Comparative Analysis of RMSE and MAP Metrices for Evaluating CNN and LSTM Mod...
Comparative Analysis of RMSE and MAP Metrices for Evaluating CNN and LSTM Mod...
 
Trb2016 alireza naimi
Trb2016 alireza naimiTrb2016 alireza naimi
Trb2016 alireza naimi
 
Algorithm Design.pdf
Algorithm Design.pdfAlgorithm Design.pdf
Algorithm Design.pdf
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing Recommendations
 
Azure machine learning
Azure machine learningAzure machine learning
Azure machine learning
 

More from Advanced-Concepts-Team

Pablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA DatalabsPablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Advanced-Concepts-Team
 
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Advanced-Concepts-Team
 
EDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in AntarcticaEDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in Antarctica
Advanced-Concepts-Team
 
How to give a robot a soul
How to give a robot a soulHow to give a robot a soul
How to give a robot a soul
Advanced-Concepts-Team
 
Vernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studiesVernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studies
Advanced-Concepts-Team
 

More from Advanced-Concepts-Team (20)

2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
 
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonicsIsabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
 
The ExoGRAVITY project - observations of exoplanets from the ground with opti...
The ExoGRAVITY project - observations of exoplanets from the ground with opti...The ExoGRAVITY project - observations of exoplanets from the ground with opti...
The ExoGRAVITY project - observations of exoplanets from the ground with opti...
 
MOND_famaey.pdf
MOND_famaey.pdfMOND_famaey.pdf
MOND_famaey.pdf
 
Pablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA DatalabsPablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA Datalabs
 
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
 
Towards an Artificial Muse for new Ideas in Quantum Physics
Towards an Artificial Muse for new Ideas in Quantum PhysicsTowards an Artificial Muse for new Ideas in Quantum Physics
Towards an Artificial Muse for new Ideas in Quantum Physics
 
EDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in AntarcticaEDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in Antarctica
 
How to give a robot a soul
How to give a robot a soulHow to give a robot a soul
How to give a robot a soul
 
Information processing with artificial spiking neural networks
Information processing with artificial spiking neural networksInformation processing with artificial spiking neural networks
Information processing with artificial spiking neural networks
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine Learning
 
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
 
HORUS: Peering into Lunar Shadowed Regions with AI
HORUS: Peering into Lunar Shadowed Regions with AIHORUS: Peering into Lunar Shadowed Regions with AI
HORUS: Peering into Lunar Shadowed Regions with AI
 
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
 
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and TechnologiesThe Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
 
Black Holes and Bright Quasars
Black Holes and Bright QuasarsBlack Holes and Bright Quasars
Black Holes and Bright Quasars
 
In vitro simulation of spaceflight environment to elucidate combined effect o...
In vitro simulation of spaceflight environment to elucidate combined effect o...In vitro simulation of spaceflight environment to elucidate combined effect o...
In vitro simulation of spaceflight environment to elucidate combined effect o...
 
The Large Interferometer For Exoplanets (LIFE): the science of characterising...
The Large Interferometer For Exoplanets (LIFE): the science of characterising...The Large Interferometer For Exoplanets (LIFE): the science of characterising...
The Large Interferometer For Exoplanets (LIFE): the science of characterising...
 
Vernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studiesVernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studies
 
Keeping a Sentinel Eye on the Volcanoes – from Space!
Keeping a Sentinel Eye on the Volcanoes – from Space!Keeping a Sentinel Eye on the Volcanoes – from Space!
Keeping a Sentinel Eye on the Volcanoes – from Space!
 

Recently uploaded

Online blood donation management system project.pdf
Online blood donation management system project.pdfOnline blood donation management system project.pdf
Online blood donation management system project.pdf
Kamal Acharya
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
Kamal Acharya
 
Digital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfDigital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdf
AbrahamGadissa
 

Recently uploaded (20)

Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical EngineeringIntroduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
 
İTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering WorkshopİTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering Workshop
 
Introduction to Casting Processes in Manufacturing
Introduction to Casting Processes in ManufacturingIntroduction to Casting Processes in Manufacturing
Introduction to Casting Processes in Manufacturing
 
Halogenation process of chemical process industries
Halogenation process of chemical process industriesHalogenation process of chemical process industries
Halogenation process of chemical process industries
 
Introduction to Machine Learning Unit-4 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-4 Notes for II-II Mechanical EngineeringIntroduction to Machine Learning Unit-4 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-4 Notes for II-II Mechanical Engineering
 
Online blood donation management system project.pdf
Online blood donation management system project.pdfOnline blood donation management system project.pdf
Online blood donation management system project.pdf
 
2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
Event Management System Vb Net Project Report.pdf
Event Management System Vb Net  Project Report.pdfEvent Management System Vb Net  Project Report.pdf
Event Management System Vb Net Project Report.pdf
 
Toll tax management system project report..pdf
Toll tax management system project report..pdfToll tax management system project report..pdf
Toll tax management system project report..pdf
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
Digital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfDigital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdf
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
Explosives Industry manufacturing process.pdf
Explosives Industry manufacturing process.pdfExplosives Industry manufacturing process.pdf
Explosives Industry manufacturing process.pdf
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.
 
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdfA CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
 
shape functions of 1D and 2 D rectangular elements.pptx
shape functions of 1D and 2 D rectangular elements.pptxshape functions of 1D and 2 D rectangular elements.pptx
shape functions of 1D and 2 D rectangular elements.pptx
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 

Mdpcg presentation

  • 1. Sarah H.Q. Li Ph.D. Candidate Department of Aeronautics and Astronautics, University of Washington Incentivizing autonomous vehicles/spacecraft toward constraint satisfaction
  • 2. 2 of 27 Autonomous decision making in stochastic environments - UAV delivery - In-orbit servicing satellite constellation - Routing of competitive autonomous vehicles Motivations
  • 3. 3 of 27 Application: In-orbit satellite service planning
  • 4. 4 of 27 Extension to MDP congestion games Outline Finite time MDP 01 03 02 04 05 06 Application: congestion control of competitive ride-share systems Incentive design for MDP congestion games Frank-Wolfe-Based Learning method Bibliography
  • 5. 5 of 27 Markov decision processes
  • 6. 6 of 27 Dynamic programming
  • 7. 7 of 27 Extension to multiple players
  • 8. 8 of 27 MDP Wardrop equilibrium
  • 9. 9 of 27 MDP Wardrop equilibrium
  • 10. 10 of 27 Potential game formulation [1]: Calderone, Dan, and S. Shankar Sastry. "Markov decision process routing games." In 2017 ACM/IEEE 8th International Conference on Cyber-Physical Systems (ICCPS), pp. 273-280. IEEE, 2017.
  • 12. 12 of 27 Algorithms for MDP congestion game
  • 13. 13 of 27 Frank-Wolfe algorithm for MDP congestion games
  • 14. 14 of 27 Learning algorithm for MDP Wardrop equilibrium [2] Yu, Yue, Dan Calderone, Sarah HQ Li, Lillian J. Ratliff, and Behçet Açıkmeşe. "A Primal-Dual Approach to Markovian Network Optimization." under review for Automatica.
  • 15. 15 of 27 Application: urban ride-share
  • 16. 16 of 27 Application: urban ride-share
  • 17. 17 of 27 Incentivizing desirable player behavior
  • 18. 18 of 27 Incentivizing desirable player behavior [3] Li, Sarah HQ, Yue Yu, Daniel Calderone, Lillian Ratliff, and Behçet Açikmeşe. "Tolling for constraint satisfaction in markov decision process congestion games." In 2019 American Control Conference (ACC), pp. 1238-1243. IEEE, 2019.
  • 19. 19 of 27 Finding the minimum tolling value
  • 20. 20 of 27 Finding the minimum tolling value
  • 21. 21 of 27 Finding the minimum tolling value
  • 22. 22 of 27 Finding the minimum tolling value [4] Li, Sarah HQ, Yue Yu, Dan Calderone, Lillian J. Ratliff, and Behcet Acikmese. "Online Constraint Satisfaction via Tolls in MDP Congestion Games." arXiv preprint arXiv:1907.08912 (2021).
  • 23. 23 of 27 Incentivizing desirable player behavior
  • 25. CREDITS: This presentation template was created by Slidesgo, including icons by Flaticon, and infographics & images by Freepik. Please keep this slide for attribution. 25 of 27 Does anyone have any questions? sarahli@uw.edu THANKS [1]: Calderone, Dan, and S. Shankar Sastry. "Markov decision process routing games." In 2017 ACM/IEEE 8th International Conference on Cyber-Physical Systems (ICCPS), pp. 273-280. IEEE, 2017. [2] Yu, Yue, Dan Calderone, Sarah HQ Li, Lillian J. Ratliff, and Behçet Açıkmeşe. "A Primal-Dual Approach to Markovian Network Optimization." under review for Automatica. [3] Li, Sarah HQ, Yue Yu, Daniel Calderone, Lillian Ratliff, and Behçet Açikmeşe. "Tolling for constraint satisfaction in markov decision process congestion games." In 2019 American Control Conference (ACC), pp. 1238-1243. IEEE, 2019. [4] Li, Sarah HQ, Yue Yu, Dan Calderone, Lillian J. Ratliff, and Behcet Acikmese. "Online Constraint Satisfaction via Tolls in MDP Congestion Games." arXiv preprint arXiv:1907.08912 (2021).
  • 26. 26 of 27 Connection to routing games
  • 27. 27 of 27 Application: In-orbit satellite service planning