SlideShare a Scribd company logo
December 18 Fri., 2015, 09:10-09:30, Regular Session: Modeling 1, Frb04.3 @ 802
Transition  Models  of  Equilibrium  
Assessment  in  Bayesian  Game
Kiminao Kogiso
University of Electro-Communications
Tokyo, Japan
The 54 Conference on Decision and Control
Osaka International Convention Center, Osaka, Japan
December 15 to 18, 2015
Supported by
JSPS Grant-in-Aid for Challenging Exploratory Research
2014 to 2016
Outline
2
Introduction  
Static  Bayesian  Game  
Novel  Form  in  Bayesian  Nash  Equilibrium  
Dynamics  in  Equilibrium  Assessment  
Simulation  
Conclusion
Introduction
3
Strategic game enabling to consider uncertainties in player’s decisions.
player: a reasonable decision maker
action: what a player chooses
utility: a player’s preference over the actions
type: a label of player’s private valuation (what the player really feels)
belief: a probability distribution over the types
(degree of feeling, tendency, proclivity,…)
Static Bayesian Game[1]
[1] Harsanyi, 1967. [2] Alpcan and Basar, et al., 2011, 2013. [3] Roy, et al., 2010. [4] Liu, et al., 2006. [5] Akkarajitsakul, et al., 2011.
A Bayesian game used in engineering problems to analyze a Bayesian
Nash equilibrium or to design a game mechanism.
network security[2,3], intrusion detection[4,5,6], belief learning[7]
electricity pricing[8,9], mechanism design[10]
[6] Sedjelmachi, et al., 2014, 2015. [7] Nachbar, 2008. [8] Li, et al., 2011, 2014. [9] Yang, et al., 2013. [10] Tao, et al., 2015.
Introduction
4
Insufficient tools and concepts[11]
Bayesian Nash equilibrium plays key roles in game analysis & design.
equilibrium analysis: for given belief, find a Bayesian Nash Equilibrium(BNE).
belief learning: for given BNE, find a corresponding belief.
mechanism design: for given utility, find rules to achieve a desired BNE.
Objective of this talk
Derive a dynamical state-space model whose state involves a BNE.
derive a novel condition related to the BNE,
discover a map (discrete-time system) defined by the novel condition,
confirm a time response of the map.
[11] Powell, 2011.
Challenge: prepare tools & concepts to apply our model-based fashion
to analysis and design of the game.
Bayesian Game
Player set
Action set
Type set
Utility
Strategy (mixed)
Belief
Static Bayesian Game: General
5
Two-player two-action Bayesian game w/ two types
G(N, A, ⇥, u, µ, S)
N := {1, 2}
A := A1 ⇥ A2
⇥ := ⇥1 ⇥ ⇥2
u := (u1, u2)
µ := (µ1, µ2)
S := (S1, S2)
ai 2 Ai := {a, ¯a} 8i 2 N
✓i 2 ⇥i := {✓, ¯✓} 8i 2 N
µi 2 ⇧(⇥i) 8i 2 N
Si : ⇥i ! ⇧(Ai) 8i 2 N
si 2 Si(⇥i) 8i 2 N
⇧(X) : a probability distribution over a finite set X
Ui(✓i, ✓ i) :=

ui(a, a, ✓i, ✓ i) ui(a, ¯a, ✓i, ✓ i)
ui(¯a, a, ✓i, ✓ i) ui(¯a, ¯a, ✓i, ✓ i)
: utility matrix8i 2 N, 8✓ 2 ⇥
ui : A ⇥ ⇥ ! < 8i 2 N
i 2 N
Static Bayesian Game: Example
6
Service of tennis
2, 2 0, 1
1, 21, 1
flat
spin
flat spin
0, 1 1, 2
0, 11, 2
flat
spin
flat spin
sideline
1, 0 1, 1
2, 00, 1
flat
spin
flat spin
1, 3 1, 2
0, 32, 2
flat
spin
flat spin
centerline
s1(a|✓)
s1(¯a|✓)
s1(¯a|¯✓)
s1(a|¯✓)
s2(a|¯✓) s2(¯a|¯✓)s2(¯a|✓)s2(a|✓)
center line ✓ side line ¯✓
✓¯✓
¯a
¯a ¯a
¯a
¯a ¯a
¯a¯aa
a a
a
µ1(✓)
µ1(¯✓)
µ2(¯✓)µ2(✓)
a
a a
a
type
belief
Bayesian Nash Equilibrium
7
Equilibrium assessment
definitions of Bayesian Nash Equilibrium(BNE)
using an ex-ante expected utility:
using a best response to opponent strategy:
EUi(si, s i) EUi(s0
, s i) 8s0
i 2 Si, s0
i 6= si
is denoted as the Bayesian Nash equilibrium.
A strategy profile , satisfying , is also a BNE.s = (si, s i)
Given a prior common probability , for any , the strategy satisfyingi 2 N sp(µ)
si 2 BRi(s i, µ) 8i 2 N
the pair of considered as key variables of the Bayesian game.
Equilibrium Assessment : a pair of a belief and the corresponding BNE.(ˆµ, ˆs)
(µ, s)
equilibrium analysis[10]: find a BNE .ˆs9 ˆµ,
[10] Y. Shoham and K. Leyton-Brown, Multiagent Systems, Cambridge University Press, 2009.
then the pair is an Equilibrium Assessment, where ,✏ :=
⇥
1 1
⇤
(ˆµ, ˆs)
Novel Form Satisfying BNE
8
If the game satisfies the following condition (simultaneous polynomial in ):
Sufficient condition to be BNE
Lemma
8✓i 2 ⇥i, 8i 2 N
G
✏⇣i(ˆs i, ✓i) (✓i)p(ˆµ) = 0
⇣i(ˆs i, ✓i) :=
⇥
Ui(✓i, ✓)ˆs i(✓) Ui(✓i, ¯✓)ˆs i(¯✓)
⇤
,
(✓) :=

1 0 0 0
0 1 0 0 , (¯✓) :=

0 0 1 0
0 0 0 1 .
idea: derived from KKT condition of BNE by cancelation of Lagrangian variables.
point: # of the polynomials: 4, # of the variables: 6; D.O.F. in determining their values.
note: a BNE (mixed strategy) holds the above equation, but some of pure strategy BNEs
do not hold it.
ˆµ
all of EAs
Discover Dynamics!
9
Map from EA to EA
Idea to derive dynamics in EA
all of EAs
⇥
EA
(ˆµ, ˆs)
satisfying
the Lemma
all of EAs
⇥
EA
(ˆµ + ˆµ, ˆs + ˆs)
satisfying
the Lemma
Given an initial EA, if there exists such that the game satisfies
the following condition w.r.t. utility matrices: ,
Dynamics in Equilibrium Assessment
10
Main result
Theorem
⇥
1 1
⇤
Ui(✓i, ✓)

1
1 1
= 0
⇥
1 1
⇤
Ui(✓i, ¯✓)

2
1 2
= 0
8✓i 2 ⇥i8i 2 N
ˆµ(k + 1) = diag(A1, A2)ˆµ(k)
ˆs(k + 1) = A (ci(k))ˆsi(k)
ci(k) :=
ˆµi(✓i, k + 1)
ˆµi(✓i, k)
, and is a row stochastic matrix.Ai 2 <2⇥2
8i 2 N
= [ 1 2]T
2 <2
then a nonlinear autonomous system in terms of the equilibrium assessment:
transfers from an EA to another EA , where(ˆµ(k), ˆs(k)) (ˆµ(k + 1), ˆs(k + 1))
ci(k) ! 1
A (1) = I
ci(k) :=
ˆµi(✓i, k + 1)
ˆµi(✓i, k)ˆµ(k + 1) = diag(A1, A2)ˆµ(k) ˆs(k + 1) = A (ci(k))ˆsi(k)
ˆµ(k)
ˆµ(k + 1)
stable linear system: time-varying system:
·
ˆs(k)
ˆµ(k)
Simulation
11
Trajectory of equilibrium assessment
0.3
0.4
0.5
0.6
0.7
10
0.2
0.4
0.6
0.8
probabilityprobability
step
0 1 2 3 4 5 6 7 8 9
100 1 2 3 4 5 6 7 8 9
s1( )a|θ−− s1( )a|θ−
−
s1( )a|θ
−
− s1( )a|θ
−−
s2( )a|θ−− s2( )a|θ−
−
s2( )a|θ
−
− s2( )a|θ
−−
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
the proposed model
the computation of
the best respose (4)
10
expectedutilityvalue
step
0 1 2 3 4 5 6 7 8 9
EU1
EU2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
10
step
0 1 2 3 4 5 6 7 8 9
valueofci()θi
c ( )1 θ−
c ( )1 θ
−
c ( )2 θ−
c ( )2 θ
−
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
10
probability
step
0 1 2 3 4 5 6 7 8 9
1( )η θ−
1( )η θ
−
2( )η θ−
2( )η θ
−
belief strategy
c expected
utility
Conclusion
12
Introduction
Static Bayesian Game
two-players two-actions game with two-types
New Form in Bayesian Nash Equilib.
polynomial conditions in equilibrium assessment
Dynamics in Equilibrium Assessment
discrete-time autonomous time-varying system
convergence of the EA (stability)
Simulation
confirms states updated become EA and converge.
Future works
estimate player’s belief for a given BNE, and
realize a control-theoretic mechanism design method.
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
10
probability
step
0 1 2 3 4 5 6 7 8 9
1( )η θ−
1( )η θ
−
2( )η θ−
2( )η θ
−
belief
0.3
0.4
0.5
0.6
0.7
10
0.2
0.4
0.6
0.8
probabilityprobability
step
0 1 2 3 4 5 6 7 8 9
100 1 2 3 4 5 6 7 8 9
s1( )a|θ−− s1( )a|θ−
−
s1( )a|θ
−
− s1( )a|θ
−−
s2( )a|θ−− s2( )a|θ−
−
s2( )a|θ
−
− s2( )a|θ
−−
strategy
ˆµ(k + 1) = diag(A1, A2)ˆµ(k)
ˆs(k + 1) = A (ci(k))ˆsi(k)
Dynamics in EA:

More Related Content

Similar to Transition Models of Equilibrium Assessment in Bayesian Game

Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Jie Bao
 
CAGT-IST Student Presentations
CAGT-IST Student Presentations CAGT-IST Student Presentations
CAGT-IST Student Presentations
Prithviraj (Raj) Dasgupta
 
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's ClassesIIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
SOURAV DAS
 
lect1207
lect1207lect1207
lect1207
webuploader
 
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Bertram Ludäscher
 
CCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdfCCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdf
JosephKariuki46
 
Influencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective PrimingInfluencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective Priming
Lane Harrison
 
nips-gg
nips-ggnips-gg
nips-gg
webuploader
 
Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...
Rory Bunker
 
Ubisoft
UbisoftUbisoft
Ubisoft
GIAF
 
Cooperative Game Theory
Cooperative Game TheoryCooperative Game Theory
Cooperative Game Theory
SSA KPI
 
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Feynman Liang
 
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
Long Beach City College
 
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
Premier Publishers
 
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
IJERA Editor
 
Operations Research Situations and Games
Operations Research Situations and GamesOperations Research Situations and Games
Operations Research Situations and Games
SSA KPI
 
PCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdfPCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdf
ssusera1eccd
 
An Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP GameAn Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP Game
Academia Sinica
 
STT802project-writeup-Final (1)
STT802project-writeup-Final (1)STT802project-writeup-Final (1)
STT802project-writeup-Final (1)
James P. Regan II
 
Introduction to Image Processing
Introduction to Image ProcessingIntroduction to Image Processing
Introduction to Image Processing
Israel Gbati
 

Similar to Transition Models of Equilibrium Assessment in Bayesian Game (20)

Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
Beyond Nash Equilibrium - Correlated Equilibrium and Evolutionary Equilibrium
 
CAGT-IST Student Presentations
CAGT-IST Student Presentations CAGT-IST Student Presentations
CAGT-IST Student Presentations
 
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's ClassesIIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
IIT JAM Mathematical Statistics - MS 2022 | Sourav Sir's Classes
 
lect1207
lect1207lect1207
lect1207
 
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
 
CCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdfCCS 3102 Lecture 2_ Mathematical foundations.pdf
CCS 3102 Lecture 2_ Mathematical foundations.pdf
 
Influencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective PrimingInfluencing Visual Judgment through Affective Priming
Influencing Visual Judgment through Affective Priming
 
nips-gg
nips-ggnips-gg
nips-gg
 
Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...Supervised sequential pattern mining for identifying important patterns of pl...
Supervised sequential pattern mining for identifying important patterns of pl...
 
Ubisoft
UbisoftUbisoft
Ubisoft
 
Cooperative Game Theory
Cooperative Game TheoryCooperative Game Theory
Cooperative Game Theory
 
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference Compilation
 
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
 
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
A System of Estimators of the Population Mean under Two-Phase Sampling in Pre...
 
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
 
Operations Research Situations and Games
Operations Research Situations and GamesOperations Research Situations and Games
Operations Research Situations and Games
 
PCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdfPCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdf
 
An Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP GameAn Analytical Study of Puzzle Selection Strategies for the ESP Game
An Analytical Study of Puzzle Selection Strategies for the ESP Game
 
STT802project-writeup-Final (1)
STT802project-writeup-Final (1)STT802project-writeup-Final (1)
STT802project-writeup-Final (1)
 
Introduction to Image Processing
Introduction to Image ProcessingIntroduction to Image Processing
Introduction to Image Processing
 

More from Kiminao Kogiso

Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Kiminao Kogiso
 
Player's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human SportsPlayer's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human Sports
Kiminao Kogiso
 
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Kiminao Kogiso
 
Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...
Kiminao Kogiso
 
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Kiminao Kogiso
 
Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...
Kiminao Kogiso
 
Estimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback ControlEstimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback ControlKiminao Kogiso
 
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...Kiminao Kogiso
 
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle ModelExperimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
Kiminao Kogiso
 
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle SystemsIdentification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
Kiminao Kogiso
 

More from Kiminao Kogiso (10)

Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
Cyber-Security Enhancements of Networked Control Systems Using Homomorphic En...
 
Player's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human SportsPlayer's Belief Estimation for Super Human Sports
Player's Belief Estimation for Super Human Sports
 
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
Hybrid Nonlinear Model of McKibben Pneumatic Artificial Muscle Systems Incorp...
 
Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...Controller encryption using RSA public-key encryption scheme (Asian Control C...
Controller encryption using RSA public-key encryption scheme (Asian Control C...
 
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...Application of ElGamal Encryption Scheme to Control System for Security Enhan...
Application of ElGamal Encryption Scheme to Control System for Security Enhan...
 
Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...Considerations on model predictive control of McKibben pneumatic artificial m...
Considerations on model predictive control of McKibben pneumatic artificial m...
 
Estimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback ControlEstimating Player's Belief in Bayesian Game by Feedback Control
Estimating Player's Belief in Bayesian Game by Feedback Control
 
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
Modeling of McKibben pneumatic artificial muscle system using pressure-depend...
 
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle ModelExperimental Validation of McKibben Pneumatic Artificial Muscle Model
Experimental Validation of McKibben Pneumatic Artificial Muscle Model
 
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle SystemsIdentification Procedure for McKibben Pneumatic Artificial Muscle Systems
Identification Procedure for McKibben Pneumatic Artificial Muscle Systems
 

Recently uploaded

bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
Divyam548318
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
MIGUELANGEL966976
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Christina Lin
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
kandramariana6
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
2. Operations Strategy in a Global Environment.ppt
2. Operations Strategy in a Global Environment.ppt2. Operations Strategy in a Global Environment.ppt
2. Operations Strategy in a Global Environment.ppt
PuktoonEngr
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
awadeshbabu
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
IJNSA Journal
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
mahammadsalmanmech
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
SUTEJAS
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
NidhalKahouli2
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
ihlasbinance2003
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
Swimming pool mechanical components design.pptx
Swimming pool  mechanical components design.pptxSwimming pool  mechanical components design.pptx
Swimming pool mechanical components design.pptx
yokeleetan1
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
University of Maribor
 

Recently uploaded (20)

bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdfBPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
BPV-GUI-01-Guide-for-ASME-Review-Teams-(General)-10-10-2023.pdf
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
2. Operations Strategy in a Global Environment.ppt
2. Operations Strategy in a Global Environment.ppt2. Operations Strategy in a Global Environment.ppt
2. Operations Strategy in a Global Environment.ppt
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
Swimming pool mechanical components design.pptx
Swimming pool  mechanical components design.pptxSwimming pool  mechanical components design.pptx
Swimming pool mechanical components design.pptx
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
Presentation of IEEE Slovenia CIS (Computational Intelligence Society) Chapte...
 

Transition Models of Equilibrium Assessment in Bayesian Game

  • 1. December 18 Fri., 2015, 09:10-09:30, Regular Session: Modeling 1, Frb04.3 @ 802 Transition  Models  of  Equilibrium   Assessment  in  Bayesian  Game Kiminao Kogiso University of Electro-Communications Tokyo, Japan The 54 Conference on Decision and Control Osaka International Convention Center, Osaka, Japan December 15 to 18, 2015 Supported by JSPS Grant-in-Aid for Challenging Exploratory Research 2014 to 2016
  • 2. Outline 2 Introduction   Static  Bayesian  Game   Novel  Form  in  Bayesian  Nash  Equilibrium   Dynamics  in  Equilibrium  Assessment   Simulation   Conclusion
  • 3. Introduction 3 Strategic game enabling to consider uncertainties in player’s decisions. player: a reasonable decision maker action: what a player chooses utility: a player’s preference over the actions type: a label of player’s private valuation (what the player really feels) belief: a probability distribution over the types (degree of feeling, tendency, proclivity,…) Static Bayesian Game[1] [1] Harsanyi, 1967. [2] Alpcan and Basar, et al., 2011, 2013. [3] Roy, et al., 2010. [4] Liu, et al., 2006. [5] Akkarajitsakul, et al., 2011. A Bayesian game used in engineering problems to analyze a Bayesian Nash equilibrium or to design a game mechanism. network security[2,3], intrusion detection[4,5,6], belief learning[7] electricity pricing[8,9], mechanism design[10] [6] Sedjelmachi, et al., 2014, 2015. [7] Nachbar, 2008. [8] Li, et al., 2011, 2014. [9] Yang, et al., 2013. [10] Tao, et al., 2015.
  • 4. Introduction 4 Insufficient tools and concepts[11] Bayesian Nash equilibrium plays key roles in game analysis & design. equilibrium analysis: for given belief, find a Bayesian Nash Equilibrium(BNE). belief learning: for given BNE, find a corresponding belief. mechanism design: for given utility, find rules to achieve a desired BNE. Objective of this talk Derive a dynamical state-space model whose state involves a BNE. derive a novel condition related to the BNE, discover a map (discrete-time system) defined by the novel condition, confirm a time response of the map. [11] Powell, 2011. Challenge: prepare tools & concepts to apply our model-based fashion to analysis and design of the game.
  • 5. Bayesian Game Player set Action set Type set Utility Strategy (mixed) Belief Static Bayesian Game: General 5 Two-player two-action Bayesian game w/ two types G(N, A, ⇥, u, µ, S) N := {1, 2} A := A1 ⇥ A2 ⇥ := ⇥1 ⇥ ⇥2 u := (u1, u2) µ := (µ1, µ2) S := (S1, S2) ai 2 Ai := {a, ¯a} 8i 2 N ✓i 2 ⇥i := {✓, ¯✓} 8i 2 N µi 2 ⇧(⇥i) 8i 2 N Si : ⇥i ! ⇧(Ai) 8i 2 N si 2 Si(⇥i) 8i 2 N ⇧(X) : a probability distribution over a finite set X Ui(✓i, ✓ i) :=  ui(a, a, ✓i, ✓ i) ui(a, ¯a, ✓i, ✓ i) ui(¯a, a, ✓i, ✓ i) ui(¯a, ¯a, ✓i, ✓ i) : utility matrix8i 2 N, 8✓ 2 ⇥ ui : A ⇥ ⇥ ! < 8i 2 N i 2 N
  • 6. Static Bayesian Game: Example 6 Service of tennis 2, 2 0, 1 1, 21, 1 flat spin flat spin 0, 1 1, 2 0, 11, 2 flat spin flat spin sideline 1, 0 1, 1 2, 00, 1 flat spin flat spin 1, 3 1, 2 0, 32, 2 flat spin flat spin centerline s1(a|✓) s1(¯a|✓) s1(¯a|¯✓) s1(a|¯✓) s2(a|¯✓) s2(¯a|¯✓)s2(¯a|✓)s2(a|✓) center line ✓ side line ¯✓ ✓¯✓ ¯a ¯a ¯a ¯a ¯a ¯a ¯a¯aa a a a µ1(✓) µ1(¯✓) µ2(¯✓)µ2(✓) a a a a type belief
  • 7. Bayesian Nash Equilibrium 7 Equilibrium assessment definitions of Bayesian Nash Equilibrium(BNE) using an ex-ante expected utility: using a best response to opponent strategy: EUi(si, s i) EUi(s0 , s i) 8s0 i 2 Si, s0 i 6= si is denoted as the Bayesian Nash equilibrium. A strategy profile , satisfying , is also a BNE.s = (si, s i) Given a prior common probability , for any , the strategy satisfyingi 2 N sp(µ) si 2 BRi(s i, µ) 8i 2 N the pair of considered as key variables of the Bayesian game. Equilibrium Assessment : a pair of a belief and the corresponding BNE.(ˆµ, ˆs) (µ, s) equilibrium analysis[10]: find a BNE .ˆs9 ˆµ, [10] Y. Shoham and K. Leyton-Brown, Multiagent Systems, Cambridge University Press, 2009.
  • 8. then the pair is an Equilibrium Assessment, where ,✏ := ⇥ 1 1 ⇤ (ˆµ, ˆs) Novel Form Satisfying BNE 8 If the game satisfies the following condition (simultaneous polynomial in ): Sufficient condition to be BNE Lemma 8✓i 2 ⇥i, 8i 2 N G ✏⇣i(ˆs i, ✓i) (✓i)p(ˆµ) = 0 ⇣i(ˆs i, ✓i) := ⇥ Ui(✓i, ✓)ˆs i(✓) Ui(✓i, ¯✓)ˆs i(¯✓) ⇤ , (✓) :=  1 0 0 0 0 1 0 0 , (¯✓) :=  0 0 1 0 0 0 0 1 . idea: derived from KKT condition of BNE by cancelation of Lagrangian variables. point: # of the polynomials: 4, # of the variables: 6; D.O.F. in determining their values. note: a BNE (mixed strategy) holds the above equation, but some of pure strategy BNEs do not hold it. ˆµ all of EAs
  • 9. Discover Dynamics! 9 Map from EA to EA Idea to derive dynamics in EA all of EAs ⇥ EA (ˆµ, ˆs) satisfying the Lemma all of EAs ⇥ EA (ˆµ + ˆµ, ˆs + ˆs) satisfying the Lemma
  • 10. Given an initial EA, if there exists such that the game satisfies the following condition w.r.t. utility matrices: , Dynamics in Equilibrium Assessment 10 Main result Theorem ⇥ 1 1 ⇤ Ui(✓i, ✓)  1 1 1 = 0 ⇥ 1 1 ⇤ Ui(✓i, ¯✓)  2 1 2 = 0 8✓i 2 ⇥i8i 2 N ˆµ(k + 1) = diag(A1, A2)ˆµ(k) ˆs(k + 1) = A (ci(k))ˆsi(k) ci(k) := ˆµi(✓i, k + 1) ˆµi(✓i, k) , and is a row stochastic matrix.Ai 2 <2⇥2 8i 2 N = [ 1 2]T 2 <2 then a nonlinear autonomous system in terms of the equilibrium assessment: transfers from an EA to another EA , where(ˆµ(k), ˆs(k)) (ˆµ(k + 1), ˆs(k + 1)) ci(k) ! 1 A (1) = I ci(k) := ˆµi(✓i, k + 1) ˆµi(✓i, k)ˆµ(k + 1) = diag(A1, A2)ˆµ(k) ˆs(k + 1) = A (ci(k))ˆsi(k) ˆµ(k) ˆµ(k + 1) stable linear system: time-varying system: · ˆs(k) ˆµ(k)
  • 11. Simulation 11 Trajectory of equilibrium assessment 0.3 0.4 0.5 0.6 0.7 10 0.2 0.4 0.6 0.8 probabilityprobability step 0 1 2 3 4 5 6 7 8 9 100 1 2 3 4 5 6 7 8 9 s1( )a|θ−− s1( )a|θ− − s1( )a|θ − − s1( )a|θ −− s2( )a|θ−− s2( )a|θ− − s2( )a|θ − − s2( )a|θ −− 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 the proposed model the computation of the best respose (4) 10 expectedutilityvalue step 0 1 2 3 4 5 6 7 8 9 EU1 EU2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 10 step 0 1 2 3 4 5 6 7 8 9 valueofci()θi c ( )1 θ− c ( )1 θ − c ( )2 θ− c ( )2 θ − 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10 probability step 0 1 2 3 4 5 6 7 8 9 1( )η θ− 1( )η θ − 2( )η θ− 2( )η θ − belief strategy c expected utility
  • 12. Conclusion 12 Introduction Static Bayesian Game two-players two-actions game with two-types New Form in Bayesian Nash Equilib. polynomial conditions in equilibrium assessment Dynamics in Equilibrium Assessment discrete-time autonomous time-varying system convergence of the EA (stability) Simulation confirms states updated become EA and converge. Future works estimate player’s belief for a given BNE, and realize a control-theoretic mechanism design method. 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10 probability step 0 1 2 3 4 5 6 7 8 9 1( )η θ− 1( )η θ − 2( )η θ− 2( )η θ − belief 0.3 0.4 0.5 0.6 0.7 10 0.2 0.4 0.6 0.8 probabilityprobability step 0 1 2 3 4 5 6 7 8 9 100 1 2 3 4 5 6 7 8 9 s1( )a|θ−− s1( )a|θ− − s1( )a|θ − − s1( )a|θ −− s2( )a|θ−− s2( )a|θ− − s2( )a|θ − − s2( )a|θ −− strategy ˆµ(k + 1) = diag(A1, A2)ˆµ(k) ˆs(k + 1) = A (ci(k))ˆsi(k) Dynamics in EA: