SlideShare a Scribd company logo
1 of 20
CPS 570: Artificial Intelligence
Game Theory
Instructor: Vincent Conitzer
Penalty kick example
probability .7
probability .3
probability .6
probability .4
probability 1
Is this a
“rational”
outcome?
If not, what
is?
action
action
Rock-paper-scissors
0, 0 -1, 1 1, -1
1, -1 0, 0 -1, 1
-1, 1 1, -1 0, 0
Row player
aka. player 1
chooses a row
Column player aka.
player 2
(simultaneously)
chooses a column
A row or column is
called an action or
(pure) strategy
Row player’s utility is always listed first, column player’s second
Zero-sum game: the utilities in each entry sum to 0 (or a constant)
Three-player game would be a 3D table with 3 utilities per entry, etc.
A poker-like game
1 gets King 1 gets Jack
raise raise
check check
call fold call fold call fold call fold
“nature”
player 1
player 1
player 2 player 2
2 1 1 1 -2 -1
1 1
0, 0 0, 0 1, -1 1, -1
.5, -.5 1.5, -1.5 0, 0 1, -1
-.5, .5 -.5, .5 1, -1 1, -1
0, 0 1, -1 0, 0 1, -1
cc cf fc ff
rr
cr
cc
rc
“Chicken”
0, 0 -1, 1
1, -1 -5, -5
D
S
D S
S
D
D
S
• Two players drive cars towards each other
• If one player goes straight, that player wins
• If both go straight, they both die
not zero-sum
“2/3 of the average” game
• Everyone writes down a number between 0 and 100
• Person closest to 2/3 of the average wins
• Example:
– A says 50
– B says 10
– C says 90
– Average(50, 10, 90) = 50
– 2/3 of average = 33.33
– A is closest (|50-33.33| = 16.67), so A wins
Rock-paper-scissors – Seinfeld variant
0, 0 1, -1 1, -1
-1, 1 0, 0 -1, 1
-1, 1 1, -1 0, 0
MICKEY: All right, rock beats paper!
(Mickey smacks Kramer's hand for losing)
KRAMER: I thought paper covered rock.
MICKEY: Nah, rock flies right through paper.
KRAMER: What beats rock?
MICKEY: (looks at hand) Nothing beats rock.
Dominance
• Player i’s strategy si strictly dominates si’ if
– for any s-i, ui(si , s-i) > ui(si’, s-i)
• si weakly dominates si’ if
– for any s-i, ui(si , s-i) ≥ ui(si’, s-i); and
– for some s-i, ui(si , s-i) > ui(si’, s-i)
0, 0 1, -1 1, -1
-1, 1 0, 0 -1, 1
-1, 1 1, -1 0, 0
strict dominance
weak dominance
-i = “the player(s)
other than i”
Prisoner’s Dilemma
-2, -2 0, -3
-3, 0 -1, -1
confess
• Pair of criminals has been caught
• District attorney has evidence to convict them of a
minor crime (1 year in jail); knows that they
committed a major crime together (3 years in jail)
but cannot prove it
• Offers them a deal:
– If both confess to the major crime, they each get a 1 year reduction
– If only one confesses, that one gets 3 years reduction
don’t confess
don’t confess
confess
“Should I buy an SUV?”
-10, -10 -7, -11
-11, -7 -8, -8
cost: 5
cost: 3
cost: 5 cost: 5
cost: 5 cost: 5
cost: 8 cost: 2
purchasing + gas cost accident cost
Back to the poker-like game
1 gets King 1 gets Jack
raise raise
check check
call fold call fold call fold call fold
“nature”
player 1
player 1
player 2 player 2
2 1 1 1 -2 -1
1 1
0, 0 0, 0 1, -1 1, -1
.5, -.5 1.5, -1.5 0, 0 1, -1
-.5, .5 -.5, .5 1, -1 1, -1
0, 0 1, -1 0, 0 1, -1
cc cf fc ff
rr
cr
cc
rc
Iterated dominance
• Iterated dominance: remove (strictly/weakly)
dominated strategy, repeat
• Iterated strict dominance on Seinfeld’s RPS:
0, 0 1, -1 1, -1
-1, 1 0, 0 -1, 1
-1, 1 1, -1 0, 0
0, 0 1, -1
-1, 1 0, 0
“2/3 of the average” game revisited
0
100
(2/3)*100
(2/3)*(2/3)*100
…
dominated
dominated after removal of
(originally) dominated strategies
Mixed strategies
• Mixed strategy for player i = probability
distribution over player i’s (pure) strategies
• E.g. 1/3 , 1/3 , 1/3
• Example of dominance by a mixed strategy:
3, 0 0, 0
0, 0 3, 0
1, 0 1, 0
1/2
1/2
Nash equilibrium [Nash 1950]
• A profile (= strategy for each player) so that no
player wants to deviate
0, 0 -1, 1
1, -1 -5, -5
D
S
D S
• This game has another Nash equilibrium in
mixed strategies…
Rock-paper-scissors
0, 0 -1, 1 1, -1
1, -1 0, 0 -1, 1
-1, 1 1, -1 0, 0
• Any pure-strategy Nash equilibria?
• But it has a mixed-strategy Nash equilibrium:
Both players put probability 1/3 on each action
• If the other player does this, every action will give you
expected utility 0
– Might as well randomize
Nash equilibria of “chicken”…
0, 0 -1, 1
1, -1 -5, -5
D
S
D S
• Is there a Nash equilibrium that uses mixed strategies? Say, where player 1
uses a mixed strategy?
• If a mixed strategy is a best response, then all of the pure strategies that it
randomizes over must also be best responses
• So we need to make player 1 indifferent between D and S
• Player 1’s utility for playing D = -pc
S
• Player 1’s utility for playing S = pc
D - 5pc
S = 1 - 6pc
S
• So we need -pc
S = 1 - 6pc
S which means pc
S = 1/5
• Then, player 2 needs to be indifferent as well
• Mixed-strategy Nash equilibrium: ((4/5 D, 1/5 S), (4/5 D, 1/5 S))
– People may die! Expected utility -1/5 for each player
The presentation
game
Pay attention
(A)
Do not pay
attention (NA)
Put effort into
presentation (E)
Do not put effort into
presentation (NE)
2, 2 -1, 0
-7, -8 0, 0
• Pure-strategy Nash equilibria: (E, A), (NE, NA)
• Mixed-strategy Nash equilibrium:
((4/5 E, 1/5 NE), (1/10 A, 9/10 NA))
– Utility -7/10 for presenter, 0 for audience
Back to the poker-like game, again
1 gets King 1 gets Jack
raise raise
check check
call fold call fold call fold call fold
“nature”
player 1
player 1
player 2 player 2
2 1 1 1 -2 -1
1 1
0, 0 0, 0 1, -1 1, -1
.5, -.5 1.5, -1.5 0, 0 1, -1
-.5, .5 -.5, .5 1, -1 1, -1
0, 0 1, -1 0, 0 1, -1
cc cf fc ff
rr
cr
cc
rc
2/3 1/3
1/3
2/3
• To make player 1 indifferent between rr and rc, we need:
utility for rr = 0*P(cc)+1*(1-P(cc)) = .5*P(cc)+0*(1-P(cc)) = utility for rc
That is, P(cc) = 2/3
• To make player 2 indifferent between cc and fc, we need:
utility for cc = 0*P(rr)+(-.5)*(1-P(rr)) = -1*P(rr)+0*(1-P(rr)) = utility for fc
That is, P(rr) = 1/3
Real-world security
applications
Airport security
Where should checkpoints, canine units,
etc. be deployed?
US Coast Guard
Which patrol routes should be followed?
Federal Air Marshals
Which flights get a FAM?
Milind Tambe’s TEAMCORE group (USC)
Wildlife Protection
Where to patrol to catch poachers or find
their snares?

More Related Content

Similar to AI subject - cps 570 _ game _ theory.ppt

ACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptx
ACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptxACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptx
ACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptx
ssuser1eba67
 
cvpr2011: game theory in CVPR part 1
cvpr2011: game theory in CVPR part 1cvpr2011: game theory in CVPR part 1
cvpr2011: game theory in CVPR part 1
zukun
 
Topic 3- Cooperation and Collective Action
Topic 3- Cooperation and Collective ActionTopic 3- Cooperation and Collective Action
Topic 3- Cooperation and Collective Action
John Bradford
 

Similar to AI subject - cps 570 _ game _ theory.ppt (20)

Ai
AiAi
Ai
 
Artificial intelligence games
Artificial intelligence gamesArtificial intelligence games
Artificial intelligence games
 
ACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptx
ACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptxACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptx
ACI-Webinar-3-MinMaxAlphaBetaPruning-TicTacToe.pptx
 
cai
caicai
cai
 
Games.4
Games.4Games.4
Games.4
 
file1
file1file1
file1
 
cvpr2011: game theory in CVPR part 1
cvpr2011: game theory in CVPR part 1cvpr2011: game theory in CVPR part 1
cvpr2011: game theory in CVPR part 1
 
Game theory 2
Game theory 2Game theory 2
Game theory 2
 
Topic 3- Cooperation and Collective Action
Topic 3- Cooperation and Collective ActionTopic 3- Cooperation and Collective Action
Topic 3- Cooperation and Collective Action
 
Game playing
Game playingGame playing
Game playing
 
Minimax
MinimaxMinimax
Minimax
 
Game Theory
Game TheoryGame Theory
Game Theory
 
Game theory 3
Game theory 3Game theory 3
Game theory 3
 
MINI-MAX ALGORITHM.pptx
MINI-MAX ALGORITHM.pptxMINI-MAX ALGORITHM.pptx
MINI-MAX ALGORITHM.pptx
 
General games
General gamesGeneral games
General games
 
Lenses for the masses - introducing Goggles
Lenses for the masses - introducing GogglesLenses for the masses - introducing Goggles
Lenses for the masses - introducing Goggles
 
Volleyball analytics: Modeling volleyball using Markov chains
Volleyball analytics: Modeling volleyball using Markov chainsVolleyball analytics: Modeling volleyball using Markov chains
Volleyball analytics: Modeling volleyball using Markov chains
 
AI Strategies for Solving Poker Texas Hold'em
AI Strategies for Solving Poker Texas Hold'emAI Strategies for Solving Poker Texas Hold'em
AI Strategies for Solving Poker Texas Hold'em
 
Lect04 slides
Lect04 slidesLect04 slides
Lect04 slides
 
9SearchAdversarial (1).pptx
9SearchAdversarial (1).pptx9SearchAdversarial (1).pptx
9SearchAdversarial (1).pptx
 

Recently uploaded

result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
rknatarajan
 

Recently uploaded (20)

Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 

AI subject - cps 570 _ game _ theory.ppt

  • 1. CPS 570: Artificial Intelligence Game Theory Instructor: Vincent Conitzer
  • 2. Penalty kick example probability .7 probability .3 probability .6 probability .4 probability 1 Is this a “rational” outcome? If not, what is? action action
  • 3. Rock-paper-scissors 0, 0 -1, 1 1, -1 1, -1 0, 0 -1, 1 -1, 1 1, -1 0, 0 Row player aka. player 1 chooses a row Column player aka. player 2 (simultaneously) chooses a column A row or column is called an action or (pure) strategy Row player’s utility is always listed first, column player’s second Zero-sum game: the utilities in each entry sum to 0 (or a constant) Three-player game would be a 3D table with 3 utilities per entry, etc.
  • 4. A poker-like game 1 gets King 1 gets Jack raise raise check check call fold call fold call fold call fold “nature” player 1 player 1 player 2 player 2 2 1 1 1 -2 -1 1 1 0, 0 0, 0 1, -1 1, -1 .5, -.5 1.5, -1.5 0, 0 1, -1 -.5, .5 -.5, .5 1, -1 1, -1 0, 0 1, -1 0, 0 1, -1 cc cf fc ff rr cr cc rc
  • 5. “Chicken” 0, 0 -1, 1 1, -1 -5, -5 D S D S S D D S • Two players drive cars towards each other • If one player goes straight, that player wins • If both go straight, they both die not zero-sum
  • 6. “2/3 of the average” game • Everyone writes down a number between 0 and 100 • Person closest to 2/3 of the average wins • Example: – A says 50 – B says 10 – C says 90 – Average(50, 10, 90) = 50 – 2/3 of average = 33.33 – A is closest (|50-33.33| = 16.67), so A wins
  • 7. Rock-paper-scissors – Seinfeld variant 0, 0 1, -1 1, -1 -1, 1 0, 0 -1, 1 -1, 1 1, -1 0, 0 MICKEY: All right, rock beats paper! (Mickey smacks Kramer's hand for losing) KRAMER: I thought paper covered rock. MICKEY: Nah, rock flies right through paper. KRAMER: What beats rock? MICKEY: (looks at hand) Nothing beats rock.
  • 8. Dominance • Player i’s strategy si strictly dominates si’ if – for any s-i, ui(si , s-i) > ui(si’, s-i) • si weakly dominates si’ if – for any s-i, ui(si , s-i) ≥ ui(si’, s-i); and – for some s-i, ui(si , s-i) > ui(si’, s-i) 0, 0 1, -1 1, -1 -1, 1 0, 0 -1, 1 -1, 1 1, -1 0, 0 strict dominance weak dominance -i = “the player(s) other than i”
  • 9. Prisoner’s Dilemma -2, -2 0, -3 -3, 0 -1, -1 confess • Pair of criminals has been caught • District attorney has evidence to convict them of a minor crime (1 year in jail); knows that they committed a major crime together (3 years in jail) but cannot prove it • Offers them a deal: – If both confess to the major crime, they each get a 1 year reduction – If only one confesses, that one gets 3 years reduction don’t confess don’t confess confess
  • 10. “Should I buy an SUV?” -10, -10 -7, -11 -11, -7 -8, -8 cost: 5 cost: 3 cost: 5 cost: 5 cost: 5 cost: 5 cost: 8 cost: 2 purchasing + gas cost accident cost
  • 11. Back to the poker-like game 1 gets King 1 gets Jack raise raise check check call fold call fold call fold call fold “nature” player 1 player 1 player 2 player 2 2 1 1 1 -2 -1 1 1 0, 0 0, 0 1, -1 1, -1 .5, -.5 1.5, -1.5 0, 0 1, -1 -.5, .5 -.5, .5 1, -1 1, -1 0, 0 1, -1 0, 0 1, -1 cc cf fc ff rr cr cc rc
  • 12. Iterated dominance • Iterated dominance: remove (strictly/weakly) dominated strategy, repeat • Iterated strict dominance on Seinfeld’s RPS: 0, 0 1, -1 1, -1 -1, 1 0, 0 -1, 1 -1, 1 1, -1 0, 0 0, 0 1, -1 -1, 1 0, 0
  • 13. “2/3 of the average” game revisited 0 100 (2/3)*100 (2/3)*(2/3)*100 … dominated dominated after removal of (originally) dominated strategies
  • 14. Mixed strategies • Mixed strategy for player i = probability distribution over player i’s (pure) strategies • E.g. 1/3 , 1/3 , 1/3 • Example of dominance by a mixed strategy: 3, 0 0, 0 0, 0 3, 0 1, 0 1, 0 1/2 1/2
  • 15. Nash equilibrium [Nash 1950] • A profile (= strategy for each player) so that no player wants to deviate 0, 0 -1, 1 1, -1 -5, -5 D S D S • This game has another Nash equilibrium in mixed strategies…
  • 16. Rock-paper-scissors 0, 0 -1, 1 1, -1 1, -1 0, 0 -1, 1 -1, 1 1, -1 0, 0 • Any pure-strategy Nash equilibria? • But it has a mixed-strategy Nash equilibrium: Both players put probability 1/3 on each action • If the other player does this, every action will give you expected utility 0 – Might as well randomize
  • 17. Nash equilibria of “chicken”… 0, 0 -1, 1 1, -1 -5, -5 D S D S • Is there a Nash equilibrium that uses mixed strategies? Say, where player 1 uses a mixed strategy? • If a mixed strategy is a best response, then all of the pure strategies that it randomizes over must also be best responses • So we need to make player 1 indifferent between D and S • Player 1’s utility for playing D = -pc S • Player 1’s utility for playing S = pc D - 5pc S = 1 - 6pc S • So we need -pc S = 1 - 6pc S which means pc S = 1/5 • Then, player 2 needs to be indifferent as well • Mixed-strategy Nash equilibrium: ((4/5 D, 1/5 S), (4/5 D, 1/5 S)) – People may die! Expected utility -1/5 for each player
  • 18. The presentation game Pay attention (A) Do not pay attention (NA) Put effort into presentation (E) Do not put effort into presentation (NE) 2, 2 -1, 0 -7, -8 0, 0 • Pure-strategy Nash equilibria: (E, A), (NE, NA) • Mixed-strategy Nash equilibrium: ((4/5 E, 1/5 NE), (1/10 A, 9/10 NA)) – Utility -7/10 for presenter, 0 for audience
  • 19. Back to the poker-like game, again 1 gets King 1 gets Jack raise raise check check call fold call fold call fold call fold “nature” player 1 player 1 player 2 player 2 2 1 1 1 -2 -1 1 1 0, 0 0, 0 1, -1 1, -1 .5, -.5 1.5, -1.5 0, 0 1, -1 -.5, .5 -.5, .5 1, -1 1, -1 0, 0 1, -1 0, 0 1, -1 cc cf fc ff rr cr cc rc 2/3 1/3 1/3 2/3 • To make player 1 indifferent between rr and rc, we need: utility for rr = 0*P(cc)+1*(1-P(cc)) = .5*P(cc)+0*(1-P(cc)) = utility for rc That is, P(cc) = 2/3 • To make player 2 indifferent between cc and fc, we need: utility for cc = 0*P(rr)+(-.5)*(1-P(rr)) = -1*P(rr)+0*(1-P(rr)) = utility for fc That is, P(rr) = 1/3
  • 20. Real-world security applications Airport security Where should checkpoints, canine units, etc. be deployed? US Coast Guard Which patrol routes should be followed? Federal Air Marshals Which flights get a FAM? Milind Tambe’s TEAMCORE group (USC) Wildlife Protection Where to patrol to catch poachers or find their snares?