SlideShare a Scribd company logo
1 of 20
Download to read offline
A Game Theoretical Approach to
           Multi-Agent Synchronization




Sofie Haesaert, DCSC
co-authors: prof. dr. R. Babuˇka and prof. dr. F.L. Lewis
                             s
March 28, 2012
Linear-Quadratic Discrete-Time Graphical Game
                                Game-Theoretical Solution
                     Example: Five Agent Synchronization


Multi-Agent Synchronization


         Leader-Follower synchronization :
              Cooperative
              Game Theory
              Communication graph

         Multitude of applications in:
              Computer science
              Spacecraft
              Unmanned air vehicles



2 / 18                                                      A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                                Game-Theoretical Solution
                     Example: Five Agent Synchronization


Outline



         Linear-Quadratic Discrete-Time Graphical Game
            Communication Graph
            Local Tracking Error
            Performance Indices

         Game-Theoretical Solution
           Global Nash Equilibrium
           Coupled Riccati Equations

         Example: Five Agent Synchronization



3 / 18                                                      A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game    Communication Graph
                              Game-Theoretical Solution   Local Tracking Error
                   Example: Five Agent Synchronization    Performance Indices


Leader-Follower Synchronization



                                                                                       z0
 State of Leader Agent
 z0 (k + 1) = Az0 (k)                                                        z1                  z2
 State of i-th Agent
 zi (k + 1) = Azi (k) + Bi ui (k)                 ∀i ∈ {1, ..., N}                     z3

 Objective:                                                                  z4                  z5
 zi (k) → z0 (k)          ∀i ∈ {1, ..., N}



                                                                             [Wang, and Chen, 2002]
4 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game    Communication Graph
                         Game-Theoretical Solution   Local Tracking Error
              Example: Five Agent Synchronization    Performance Indices


Communication Graph



                                                                            z0
  Graph G
      Nodes V(G) = {z1 , z2 , . . . , zN }                          z1           z2


                                                                            z3


                                                                    z4           z5
Linear-Quadratic Discrete-Time Graphical Game    Communication Graph
                              Game-Theoretical Solution   Local Tracking Error
                   Example: Five Agent Synchronization    Performance Indices


Communication Graph



                                                                                    z0
     Graph G
           Nodes V(G) = {z1 , z2 , . . . , zN }                           z1                  z2
           Edges E ⊆ V × V
                                                                         e13               e23
           Edge weights                                                             z3
                  Edge weights eij                                        e34               e35
                                                                          z4                  z5
                                                                                  e45




5 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game    Communication Graph
                              Game-Theoretical Solution   Local Tracking Error
                   Example: Five Agent Synchronization    Performance Indices


Communication Graph



                                                                                    z0
     Graph G                                                               g1               g2
           Nodes V(G) = {z1 , z2 , . . . , zN }                                  g3
                                                                          z1                  z2
           Edges E ⊆ V × V
           Edge weights                                                             z3
                  Edge weights eij
                  Pinning gains: gi
                                                                          z4                  z5




5 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game     Communication Graph
                              Game-Theoretical Solution    Local Tracking Error
                   Example: Five Agent Synchronization     Performance Indices


Local Tracking Error


  Local Tracking Error:
                                                                                       z0
  xi (k) =             eij (zi,k − zj,k ) + gi (zi,k − z0,k )                  g1
                j∈Ni
                                                                             z1                  z2

  Dynamics                                                                   e13
                                                                                       z3
  xi (k + 1) = Axi,k +                    eij + gi        Bi ui,k
                                   j∈Ni
                                                                             z4                  z5
                             −          eij Bj uj,k
                                 j∈Ni




6 / 18                                                     A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game    Communication Graph
                               Game-Theoretical Solution   Local Tracking Error
                    Example: Five Agent Synchronization    Performance Indices


Local Tracking Error


     Local Tracking Error Dynamics:
                                                                                     z0
     xi (k + 1) = Axi,k +                    eij + gi Bi ui,k                g1
                                      j∈Ni
                                                                           x1                  x2
                                −          eij Bj uj,k
                                    j∈Ni
                                                                                     x3

     The states xi of the agents in the graph
     can be combined into the global state:                                x4                  x5


         x(k) = [x1 (k) x2 (k) . . . xN (k)]T
                  T      T            T


                                                                        [Khoo, Xie, and Man, 2009]
7 / 18                                                     A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game      Communication Graph
                              Game-Theoretical Solution     Local Tracking Error
                   Example: Five Agent Synchronization      Performance Indices


Performance Index


     Each agent optimizes its own
     performance index, consisting of                                                z0
                                                                              g1
           Local tracking error xi
           Cost for own actions ui,k                                       x1                   x2
           Cost for actions of neighbors uj,k
                ∞
                         T                 T                                          x3
         Ji =          (xi,k Qii xi,k ) + ui,k Rii ui,k
                k=0
                                                                           x4                   x5
                                             T
                                 +          uj,k Rij uj,k
                                     j∈Ni



                                                                   [Vamvoudakis, and Lewis, 2011]
8 / 18                                                      A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                                                            Global Nash Equilibrium
                                Game-Theoretical Solution
                                                            Coupled Riccati Equations
                     Example: Five Agent Synchronization


                                                                                             z0
                                                                                        g1

                                                                                    x1                  x2

           All agents follow a policy πi such that:
                                                                                              x3
              ui (k) = πi x(k)            ∀i ∈ {1, . . . , N}
                                                                                    x4                  x5
         Definition (Global Nash Equilibrium)

                                        ∗          ∗
         An N-tuple of policies Π = {π1 , . . . , πN } constitutes a global Nash
         equilibrium solution for an N-agent game if every agent is in its
         best response to all the other agents in the graph.



                                                                               [Basar and Olsder, 1999]
9 / 18                                                      A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                                                           Global Nash Equilibrium
                               Game-Theoretical Solution
                                                           Coupled Riccati Equations
                    Example: Five Agent Synchronization




      The expected cost the i-th agent in the global Nash equilibrium
      can be expressed as:

                                     VΠ,i (x(k)) = x T (k)Si x(k)

      with:
                  x(k) The global tracking error state
                                                  ∗            ∗
                      Π The N-tuple of policies {π1 , . . . , πN }
                      Si The Riccati matrix for the i-th agent




10 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                                                               Global Nash Equilibrium
                               Game-Theoretical Solution
                                                               Coupled Riccati Equations
                    Example: Five Agent Synchronization




      The Coupled Riccati equations are
                         ¯
                    Si = Qi + ΛT Si Λ +                               ∗T     ∗         ∀i ∈ N
                                                           j∈{Ni ,i} πj Rij πj

      With:
                     ¯
                     Qi The state weighting is such that:
                               ¯
                        x T (k)Qi x(k) = xiT (k)Qii xi (k)
                      Λ The global closed loop matrix :
                                                       −1 ¯
                         Λ = I + i∈N Bi Rii BiT Siq
                                          ¯ −1 ¯          A
                     π ∗ The policies π ∗ = −R −1 B T S Λ
                                                  ¯ i
                       i                         i             ii    i




11 / 18                                                        A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                                                           Global Nash Equilibrium
                               Game-Theoretical Solution
                                                           Coupled Riccati Equations
                    Example: Five Agent Synchronization




      The Coupled Riccati equations are
                        ¯
                   Si = Qi + ΛT Si Λ
                                        ¯ −1        −1 ¯
                      + j∈{Ni ,i} ΛT Sj Bj Rjj Rij Rjj BjT Sj Λ                        ∀i ∈ N

      The positive definite solution of Coupled Riccati Equations:
             Asymptotically Stabilizes the states xi                      ∀i ∈ {1, . . . , N}
             The Global Nash Equilibrium solution




12 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                                                             Global Nash Equilibrium
                               Game-Theoretical Solution
                                                             Coupled Riccati Equations
                    Example: Five Agent Synchronization


Difference Coupled Riccati Equations



      Iterative solution of the Coupled Riccati Equations:

           Siq+1 =Qi + Λq
                  ¯                  T
                                         Siq Λq
                            +               Λq    T
                                                      Sjq Bj Rjj Rij Rjj BjT Sjq Λq
                                                          ¯ −1        −1 ¯
                                                                                             ∀i ∈ N
                                j∈{Ni ,i}


      The iteration exists if the inverse of I +                          i∈N   Bi Rii BiT Siq
                                                                                ¯ −1 ¯
      exists for all q.




13 / 18                                                      A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                                Game-Theoretical Solution
                     Example: Five Agent Synchronization


Synchronization
                                                                                                       z0
                                                                                              1                1
     Considering the following dynamics:                                                 z1
                                                                                                       1            z2

     z0 (k + 1) = Az0 (k)                                                                     1                1
                                                                                                       z3
     zi (k + 1) = Azi (k) + Bi ui (k)                   ∀i ∈ {1, ..., N}                       1                1
                                                                                         z4                         z5
      with :                                                                                           1
                    0.995  0.0998                                         2                        2
          A=                                       , B1 = B2 =                  , B3 =                     ,
                   −0.0998 0.995                                          3                        2
                         1                     1 0
          B4 = B5               , Qii =                     , Rij = 1, for all i, j ∈ {1, . . . , N}
                         2                     0 1


14 / 18                                                       A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                               Game-Theoretical Solution
                    Example: Five Agent Synchronization


Synchronization: z




                                                                                             z0


                                                                                      z1           z2


                                                                                             z3


                                                                                      z4           z5




15 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                               Game-Theoretical Solution
                    Example: Five Agent Synchronization


Synchronization: The local tracking error x




                                                                                            z0
                                                                                      x1           x2


                                                                                             x3


                                                                                      x4           x5




16 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                               Game-Theoretical Solution
                    Example: Five Agent Synchronization


Conclusion



      Exact solution of the linear-quadratic discrete-time graphical game.


      Future work:

      Use solution to quantify accuracy of approximative (learning)
      algorithms for linear-quadratic discrete-time graphical game.




17 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization
Linear-Quadratic Discrete-Time Graphical Game
                               Game-Theoretical Solution
                    Example: Five Agent Synchronization




      Thank you for your time

                                    Are there any questions?




18 / 18                                                    A Game Theoretical Approach to Multi-Agent Synchronization

More Related Content

What's hot

Design and Analysis of Algorithms
Design and Analysis of AlgorithmsDesign and Analysis of Algorithms
Design and Analysis of AlgorithmsArvind Krishnaa
 
Au2419291933
Au2419291933Au2419291933
Au2419291933IJMER
 
Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...
Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...
Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...Editor IJCATR
 
DESIGN AND ANALYSIS OF ALGORITHMS
DESIGN AND ANALYSIS OF ALGORITHMSDESIGN AND ANALYSIS OF ALGORITHMS
DESIGN AND ANALYSIS OF ALGORITHMSGayathri Gaayu
 
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSESTHE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSESgraphhoc
 
Dixon Deep Learning
Dixon Deep LearningDixon Deep Learning
Dixon Deep LearningSciCompIIT
 
01 graphical models
01 graphical models01 graphical models
01 graphical modelszukun
 
Elliptic curve scalar multiplier using karatsuba
Elliptic curve scalar multiplier using karatsubaElliptic curve scalar multiplier using karatsuba
Elliptic curve scalar multiplier using karatsubaIAEME Publication
 
Elliptic Curve Cryptography for those who are afraid of maths
Elliptic Curve Cryptography for those who are afraid of mathsElliptic Curve Cryptography for those who are afraid of maths
Elliptic Curve Cryptography for those who are afraid of mathsMartijn Grooten
 
Cs6402 design and analysis of algorithms may june 2016 answer key
Cs6402 design and analysis of algorithms may june 2016 answer keyCs6402 design and analysis of algorithms may june 2016 answer key
Cs6402 design and analysis of algorithms may june 2016 answer keyappasami
 
An ElGamal Encryption Scheme of Adjacency Matrix and Finite Machines
An ElGamal Encryption Scheme of Adjacency Matrix and Finite MachinesAn ElGamal Encryption Scheme of Adjacency Matrix and Finite Machines
An ElGamal Encryption Scheme of Adjacency Matrix and Finite MachinesComputer Science Journals
 
Dynamic Programming - Part 1
Dynamic Programming - Part 1Dynamic Programming - Part 1
Dynamic Programming - Part 1Amrinder Arora
 
Graph Traversal Algorithms - Depth First Search Traversal
Graph Traversal Algorithms - Depth First Search TraversalGraph Traversal Algorithms - Depth First Search Traversal
Graph Traversal Algorithms - Depth First Search TraversalAmrinder Arora
 

What's hot (20)

Design and Analysis of Algorithms
Design and Analysis of AlgorithmsDesign and Analysis of Algorithms
Design and Analysis of Algorithms
 
Au2419291933
Au2419291933Au2419291933
Au2419291933
 
Ecc2
Ecc2Ecc2
Ecc2
 
Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...
Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...
Ijcatr03051008Implementation of Matrix based Mapping Method Using Elliptic Cu...
 
DESIGN AND ANALYSIS OF ALGORITHMS
DESIGN AND ANALYSIS OF ALGORITHMSDESIGN AND ANALYSIS OF ALGORITHMS
DESIGN AND ANALYSIS OF ALGORITHMS
 
Elliptic curve cryptography
Elliptic curve cryptographyElliptic curve cryptography
Elliptic curve cryptography
 
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSESTHE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
THE RESULT FOR THE GRUNDY NUMBER ON P4- CLASSES
 
Dixon Deep Learning
Dixon Deep LearningDixon Deep Learning
Dixon Deep Learning
 
01 graphical models
01 graphical models01 graphical models
01 graphical models
 
Ki2518101816
Ki2518101816Ki2518101816
Ki2518101816
 
Asymptotic Notation
Asymptotic NotationAsymptotic Notation
Asymptotic Notation
 
K031065069
K031065069K031065069
K031065069
 
Elliptic curve scalar multiplier using karatsuba
Elliptic curve scalar multiplier using karatsubaElliptic curve scalar multiplier using karatsuba
Elliptic curve scalar multiplier using karatsuba
 
Dynamicpgmming
DynamicpgmmingDynamicpgmming
Dynamicpgmming
 
04 maths
04 maths04 maths
04 maths
 
Elliptic Curve Cryptography for those who are afraid of maths
Elliptic Curve Cryptography for those who are afraid of mathsElliptic Curve Cryptography for those who are afraid of maths
Elliptic Curve Cryptography for those who are afraid of maths
 
Cs6402 design and analysis of algorithms may june 2016 answer key
Cs6402 design and analysis of algorithms may june 2016 answer keyCs6402 design and analysis of algorithms may june 2016 answer key
Cs6402 design and analysis of algorithms may june 2016 answer key
 
An ElGamal Encryption Scheme of Adjacency Matrix and Finite Machines
An ElGamal Encryption Scheme of Adjacency Matrix and Finite MachinesAn ElGamal Encryption Scheme of Adjacency Matrix and Finite Machines
An ElGamal Encryption Scheme of Adjacency Matrix and Finite Machines
 
Dynamic Programming - Part 1
Dynamic Programming - Part 1Dynamic Programming - Part 1
Dynamic Programming - Part 1
 
Graph Traversal Algorithms - Depth First Search Traversal
Graph Traversal Algorithms - Depth First Search TraversalGraph Traversal Algorithms - Depth First Search Traversal
Graph Traversal Algorithms - Depth First Search Traversal
 

Viewers also liked

Viewers also liked (8)

Aiim share point-2013-survey-infographic
Aiim share point-2013-survey-infographicAiim share point-2013-survey-infographic
Aiim share point-2013-survey-infographic
 
Presentation
PresentationPresentation
Presentation
 
Crime rate by: Sam Miller
Crime rate by: Sam MillerCrime rate by: Sam Miller
Crime rate by: Sam Miller
 
Liver cancer
Liver cancerLiver cancer
Liver cancer
 
Kony 2012
Kony 2012Kony 2012
Kony 2012
 
Pollution
PollutionPollution
Pollution
 
Panda jolene
Panda jolenePanda jolene
Panda jolene
 
cassandra children hunger in haiti :(
cassandra children hunger in haiti :( cassandra children hunger in haiti :(
cassandra children hunger in haiti :(
 

Recently uploaded

AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Quarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayQuarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayMakMakNepo
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxsqpmdrvczh
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 

Recently uploaded (20)

AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Quarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayQuarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up Friday
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 

A Game Theoretical Approach to Multi-Agent Synchronization

  • 1. A Game Theoretical Approach to Multi-Agent Synchronization Sofie Haesaert, DCSC co-authors: prof. dr. R. Babuˇka and prof. dr. F.L. Lewis s March 28, 2012
  • 2. Linear-Quadratic Discrete-Time Graphical Game Game-Theoretical Solution Example: Five Agent Synchronization Multi-Agent Synchronization Leader-Follower synchronization : Cooperative Game Theory Communication graph Multitude of applications in: Computer science Spacecraft Unmanned air vehicles 2 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 3. Linear-Quadratic Discrete-Time Graphical Game Game-Theoretical Solution Example: Five Agent Synchronization Outline Linear-Quadratic Discrete-Time Graphical Game Communication Graph Local Tracking Error Performance Indices Game-Theoretical Solution Global Nash Equilibrium Coupled Riccati Equations Example: Five Agent Synchronization 3 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 4. Linear-Quadratic Discrete-Time Graphical Game Communication Graph Game-Theoretical Solution Local Tracking Error Example: Five Agent Synchronization Performance Indices Leader-Follower Synchronization z0 State of Leader Agent z0 (k + 1) = Az0 (k) z1 z2 State of i-th Agent zi (k + 1) = Azi (k) + Bi ui (k) ∀i ∈ {1, ..., N} z3 Objective: z4 z5 zi (k) → z0 (k) ∀i ∈ {1, ..., N} [Wang, and Chen, 2002] 4 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 5. Linear-Quadratic Discrete-Time Graphical Game Communication Graph Game-Theoretical Solution Local Tracking Error Example: Five Agent Synchronization Performance Indices Communication Graph z0 Graph G Nodes V(G) = {z1 , z2 , . . . , zN } z1 z2 z3 z4 z5
  • 6. Linear-Quadratic Discrete-Time Graphical Game Communication Graph Game-Theoretical Solution Local Tracking Error Example: Five Agent Synchronization Performance Indices Communication Graph z0 Graph G Nodes V(G) = {z1 , z2 , . . . , zN } z1 z2 Edges E ⊆ V × V e13 e23 Edge weights z3 Edge weights eij e34 e35 z4 z5 e45 5 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 7. Linear-Quadratic Discrete-Time Graphical Game Communication Graph Game-Theoretical Solution Local Tracking Error Example: Five Agent Synchronization Performance Indices Communication Graph z0 Graph G g1 g2 Nodes V(G) = {z1 , z2 , . . . , zN } g3 z1 z2 Edges E ⊆ V × V Edge weights z3 Edge weights eij Pinning gains: gi z4 z5 5 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 8. Linear-Quadratic Discrete-Time Graphical Game Communication Graph Game-Theoretical Solution Local Tracking Error Example: Five Agent Synchronization Performance Indices Local Tracking Error Local Tracking Error: z0 xi (k) = eij (zi,k − zj,k ) + gi (zi,k − z0,k ) g1 j∈Ni z1 z2 Dynamics e13 z3 xi (k + 1) = Axi,k + eij + gi Bi ui,k j∈Ni z4 z5 − eij Bj uj,k j∈Ni 6 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 9. Linear-Quadratic Discrete-Time Graphical Game Communication Graph Game-Theoretical Solution Local Tracking Error Example: Five Agent Synchronization Performance Indices Local Tracking Error Local Tracking Error Dynamics: z0 xi (k + 1) = Axi,k + eij + gi Bi ui,k g1 j∈Ni x1 x2 − eij Bj uj,k j∈Ni x3 The states xi of the agents in the graph can be combined into the global state: x4 x5 x(k) = [x1 (k) x2 (k) . . . xN (k)]T T T T [Khoo, Xie, and Man, 2009] 7 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 10. Linear-Quadratic Discrete-Time Graphical Game Communication Graph Game-Theoretical Solution Local Tracking Error Example: Five Agent Synchronization Performance Indices Performance Index Each agent optimizes its own performance index, consisting of z0 g1 Local tracking error xi Cost for own actions ui,k x1 x2 Cost for actions of neighbors uj,k ∞ T T x3 Ji = (xi,k Qii xi,k ) + ui,k Rii ui,k k=0 x4 x5 T + uj,k Rij uj,k j∈Ni [Vamvoudakis, and Lewis, 2011] 8 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 11. Linear-Quadratic Discrete-Time Graphical Game Global Nash Equilibrium Game-Theoretical Solution Coupled Riccati Equations Example: Five Agent Synchronization z0 g1 x1 x2 All agents follow a policy πi such that: x3 ui (k) = πi x(k) ∀i ∈ {1, . . . , N} x4 x5 Definition (Global Nash Equilibrium) ∗ ∗ An N-tuple of policies Π = {π1 , . . . , πN } constitutes a global Nash equilibrium solution for an N-agent game if every agent is in its best response to all the other agents in the graph. [Basar and Olsder, 1999] 9 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 12. Linear-Quadratic Discrete-Time Graphical Game Global Nash Equilibrium Game-Theoretical Solution Coupled Riccati Equations Example: Five Agent Synchronization The expected cost the i-th agent in the global Nash equilibrium can be expressed as: VΠ,i (x(k)) = x T (k)Si x(k) with: x(k) The global tracking error state ∗ ∗ Π The N-tuple of policies {π1 , . . . , πN } Si The Riccati matrix for the i-th agent 10 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 13. Linear-Quadratic Discrete-Time Graphical Game Global Nash Equilibrium Game-Theoretical Solution Coupled Riccati Equations Example: Five Agent Synchronization The Coupled Riccati equations are ¯ Si = Qi + ΛT Si Λ + ∗T ∗ ∀i ∈ N j∈{Ni ,i} πj Rij πj With: ¯ Qi The state weighting is such that: ¯ x T (k)Qi x(k) = xiT (k)Qii xi (k) Λ The global closed loop matrix : −1 ¯ Λ = I + i∈N Bi Rii BiT Siq ¯ −1 ¯ A π ∗ The policies π ∗ = −R −1 B T S Λ ¯ i i i ii i 11 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 14. Linear-Quadratic Discrete-Time Graphical Game Global Nash Equilibrium Game-Theoretical Solution Coupled Riccati Equations Example: Five Agent Synchronization The Coupled Riccati equations are ¯ Si = Qi + ΛT Si Λ ¯ −1 −1 ¯ + j∈{Ni ,i} ΛT Sj Bj Rjj Rij Rjj BjT Sj Λ ∀i ∈ N The positive definite solution of Coupled Riccati Equations: Asymptotically Stabilizes the states xi ∀i ∈ {1, . . . , N} The Global Nash Equilibrium solution 12 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 15. Linear-Quadratic Discrete-Time Graphical Game Global Nash Equilibrium Game-Theoretical Solution Coupled Riccati Equations Example: Five Agent Synchronization Difference Coupled Riccati Equations Iterative solution of the Coupled Riccati Equations: Siq+1 =Qi + Λq ¯ T Siq Λq + Λq T Sjq Bj Rjj Rij Rjj BjT Sjq Λq ¯ −1 −1 ¯ ∀i ∈ N j∈{Ni ,i} The iteration exists if the inverse of I + i∈N Bi Rii BiT Siq ¯ −1 ¯ exists for all q. 13 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 16. Linear-Quadratic Discrete-Time Graphical Game Game-Theoretical Solution Example: Five Agent Synchronization Synchronization z0 1 1 Considering the following dynamics: z1 1 z2 z0 (k + 1) = Az0 (k) 1 1 z3 zi (k + 1) = Azi (k) + Bi ui (k) ∀i ∈ {1, ..., N} 1 1 z4 z5 with : 1 0.995 0.0998 2 2 A= , B1 = B2 = , B3 = , −0.0998 0.995 3 2 1 1 0 B4 = B5 , Qii = , Rij = 1, for all i, j ∈ {1, . . . , N} 2 0 1 14 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 17. Linear-Quadratic Discrete-Time Graphical Game Game-Theoretical Solution Example: Five Agent Synchronization Synchronization: z z0 z1 z2 z3 z4 z5 15 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 18. Linear-Quadratic Discrete-Time Graphical Game Game-Theoretical Solution Example: Five Agent Synchronization Synchronization: The local tracking error x z0 x1 x2 x3 x4 x5 16 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 19. Linear-Quadratic Discrete-Time Graphical Game Game-Theoretical Solution Example: Five Agent Synchronization Conclusion Exact solution of the linear-quadratic discrete-time graphical game. Future work: Use solution to quantify accuracy of approximative (learning) algorithms for linear-quadratic discrete-time graphical game. 17 / 18 A Game Theoretical Approach to Multi-Agent Synchronization
  • 20. Linear-Quadratic Discrete-Time Graphical Game Game-Theoretical Solution Example: Five Agent Synchronization Thank you for your time Are there any questions? 18 / 18 A Game Theoretical Approach to Multi-Agent Synchronization