SlideShare a Scribd company logo
1 of 32
Download to read offline
Sparse Binary Zero-Sum Games 
[ACML 2014] 
David Auger1 Jialin Liu2 Sylvie Ruette3 David L. St-Pierre4 
Olivier Teytaud2 
1AlCAAP, Laboratoire PRiSM, Universite de Versailles Saint Quentin-en-Yvelines, France 
2TAO, INRIA-CNRS-LRI, Universite Paris-Sud, France 
3Laboratoire de Mathematiques, CNRS, Universite Paris-Sud, France 
4Universite du Quebec a Trois-Rivieres, Canada 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 1 / 26
Thanks to reviewers for very fruitful comments. 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 2 / 26
Introduction 
Two-person zero-sum game MKK 
Nash Equilibrium ! O(K2) with   3 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
Introduction 
Two-person zero-sum game MKK 
Nash Equilibrium ! O(K2) with   3 
If the Nash is sparse ! k  k submatrix 
! O(k3kK log K) with probability 1   (provable) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
Zero-sum matrix games 
Game de
ned by matrix M 
I choose (privately) i 
Simultaneously, you choose j 
I earn Mi ;j 
You earn Mi ;j 
So this is zero-sum. 
Or you earn 1  Mi ;j (so this is 1-sum, equivalent). 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 4 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
Nash Equilibrium 
Nash Equilibrium (NE) 
Zero-sum matrix game M 
My strategy = probability distrib. on rows = x 
Your strategy = probability distrib. on cols = y 
Expected reward = xTMy 
There exists x; y such that 8x; y, 
xTMy  xTMy  xTMy: 
(x; y) is a Nash Equilibrium (no unicity). 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 6 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Nash: Ok I play i with probability x 
i 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Nash: Ok I play i with probability x 
i 
How to 
compute x*? 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
Solving Nash 
Solution 1: Linear Programming (LP) 
1 M   M + C so that it is positive (without loss of generality) 
2 LP:
nd 0  u minimizing 
P 
i 
ui such that (MT )  u  1 
P 
3 x = u= 
i 
ui 
=) classical, provably exact, polynomial time 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 8 / 26
Solving Nash 
Solution 2: Approximate Nash Equilibrium 
Approximate -NE 
(x; y) such that 
xTMy    xTMy  xTMy + : 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 9 / 26
Solution 1: LP (comp. expensive) 
Solution 2: Approximate Nash Equilibrium 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
Solution 1: LP (comp. expensive) 
Solution 2: Approximate Nash Equilibrium 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
Computing approximate Nash Equilibrium 
Assuming the matrix is of size K  K ... 
LP (see reduction from Nash to linear programming in 
[Von Stengel (2002)]): O(K2) with 3    4 
[Grigoriadis and Khachiyan(1995)]: 
-Nash with expected time O(K log(K) 
2 ), i.e. less than the size of the 
matrix! 
Parallel : O( log2(K) 
2 ) if using K 
log(K) processors 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
Computing approximate Nash Equilibrium 
Assuming the matrix is of size K  K ... 
LP (see reduction from Nash to linear programming in 
[Von Stengel (2002)]): O(K2) with 3    4 
[Grigoriadis and Khachiyan(1995)]: 
-Nash with expected time O(K log(K) 
2 ), i.e. less than the size of the 
matrix! 
Parallel : O( log2(K) 
2 ) if using K 
log(K) processors 
Other algorithms: similar complexity, approximate solution +
xed 
time with probability 1   
EXP3 ([Auer et al.(1995)]) 
Inf ([Audibert and Bubeck(2009)]) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
Other tools 1: Hadamard determinant 
Hadamard determinant bound 
([Hadamard(1893)], [Brenner and Cummings(1972)]) 
Given matrix Mkk with coecients in f1; 0; 1g, then M has 
determinant at most k 
k 
2 , i.e. 
j detMj  k 
k 
2 : 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 12 / 26
Other tools 2: Linear programming 
Solve 
min ax 
Mx  c 
x 2 Rd 
If there is a
nite optimum, then there is a
nite optimum x such 
that, for some E with jEj = d, 
8i 2 E, Mi x = ci 
the Mi for i in E are linear independent 
(=) i.e. d lin. indep. constraints are active) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 13 / 26
Why is this relevant ? 
Nash = solution of linear programming problem 
x: Nash Equilibrium of MKK 
Let us assume that x is unique and has at most k non-zero 
components (sparsity) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
Why is this relevant ? 
Nash = solution of linear programming problem 
x: Nash Equilibrium of MKK 
Let us assume that x is unique and has at most k non-zero 
components (sparsity) 
) x = also NE of a k  k submatrix: Mk 
) x = solution of LP in dimension k 
) x = solution of k lin. eq. with coecients in f1; 0; 1g 
) x = inv-matrix  vector 
) x = obtained by cofactors / det matrix 
x k 
) has denominator at most k 
2 
0k 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
How to realise ? 
Under assumption that the Nash is sparse 
x is rational with small denominator 
So let us compute an -Nash (sublinear time!) 
And let us compute its closest approximation with small 
denominator (Hadamard) 
variants for -Nash =) exact Nash 
Rounding: switch to closest approximation 
Truncation: remove small components and work on the remaining 
submatrix (exact solving) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 15 / 26
Evil in the details 
jjy  yjj1   does not imply V(y)  V(y) + ; 
indeed V(y)  V(y) + jjyyjj1 
k 
k 
2 
Results : (if Grigoriadis) 
For a K  K matrix with Nash k-sparse 
Exact solution in time O(poly (k) + (K log K)k3k ) with 
truncation-algorithm 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 16 / 26
Experimental results: two card games 
Previous results: ingaming of Urban Rivals 
New results: metagaming of Pokemon 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 17 / 26
Ingaming results (Urban Rivals) 
Previous work: [Flory and Teytaud(2011)], implementation of 
Truncated-EXP3, without proof 
Urban Rivals AI 
= Monte Carlo Tree Search 
([Coulom (2006)]), 
using zero-sum matrix games 
as a key component 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 18 / 26
Ingaming results (Urban Rivals) 
Previous work: [Flory and Teytaud(2011)], implementation of 
Truncated-EXP3, without proof 
Results don't look impressive ( 56%), but the game is highly 
randomized =) Reaching 55% is far from being negligible 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 19 / 26
New experiments 
Test on Pokemon Deck choice (metagaming) 
Based on EXP3+truncation 
Various versions of EXP3 (6= parameters) 
Code available https://www.lri.fr/~teytaud/games.html 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 20 / 26
New experiments 
With a poorly tuned EXP3 : truncation brings a huge improvement 
0 
0.9 
0.85 
0.8 
0.75 
0.7 
0.65 
0.6 
0.55 
0.5 
10 
1 
10 
2 
10 
3 
10 
4 
10 
0.45 
TEXP3 vs EXP3 
0 
0.9 
0.85 
0.8 
0.75 
0.7 
0.65 
0.6 
0.55 
0.5 
10 
1 
10 
2 
10 
3 
10 
4 
10 
0.45 
TEXP3 vs Uniform 
EXP3 vs Uniform 
Figure: Performance in terms of budget T with a poorly tuned EXP3 for the 
game of Pokeman using 2 cards. 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 21 / 26

More Related Content

What's hot

2012 mdsp pr13 support vector machine
2012 mdsp pr13 support vector machine2012 mdsp pr13 support vector machine
2012 mdsp pr13 support vector machinenozomuhamada
 
Convolutional neural network backpropagation derivation
Convolutional neural network backpropagation derivationConvolutional neural network backpropagation derivation
Convolutional neural network backpropagation derivationPunnoose A.K
 
Cnn backpropagation derivation
Cnn backpropagation derivationCnn backpropagation derivation
Cnn backpropagation derivationPunnoose A.K
 
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...Patrick Diehl
 
2012 mdsp pr12 k means mixture of gaussian
2012 mdsp pr12 k means mixture of gaussian2012 mdsp pr12 k means mixture of gaussian
2012 mdsp pr12 k means mixture of gaussiannozomuhamada
 
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...David Gleich
 
Statistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. PeixotoStatistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. PeixotoLake Como School of Advanced Studies
 
Neural Art (English Version)
Neural Art (English Version)Neural Art (English Version)
Neural Art (English Version)Mark Chang
 
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...MLconf
 
Computational Linguistics week 5
Computational Linguistics  week 5Computational Linguistics  week 5
Computational Linguistics week 5Mark Chang
 
Chapter 06 boolean algebra 2o-p
Chapter 06 boolean algebra 2o-pChapter 06 boolean algebra 2o-p
Chapter 06 boolean algebra 2o-pIIUI
 
Chapter 06 boolean algebra
Chapter 06 boolean algebraChapter 06 boolean algebra
Chapter 06 boolean algebraIIUI
 
Chapter 06 boolean algebra 3o-p
Chapter 06 boolean algebra 3o-pChapter 06 boolean algebra 3o-p
Chapter 06 boolean algebra 3o-pIIUI
 
2012 mdsp pr09 pca lda
2012 mdsp pr09 pca lda2012 mdsp pr09 pca lda
2012 mdsp pr09 pca ldanozomuhamada
 
2012 mdsp pr10 ica
2012 mdsp pr10 ica2012 mdsp pr10 ica
2012 mdsp pr10 icanozomuhamada
 
2012 mdsp pr11 ica part 2 face recognition
2012 mdsp pr11 ica part 2 face recognition2012 mdsp pr11 ica part 2 face recognition
2012 mdsp pr11 ica part 2 face recognitionnozomuhamada
 
NTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANsNTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANsMark Chang
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsDavid Gleich
 
Hierarchical matrix techniques for maximum likelihood covariance estimation
Hierarchical matrix techniques for maximum likelihood covariance estimationHierarchical matrix techniques for maximum likelihood covariance estimation
Hierarchical matrix techniques for maximum likelihood covariance estimationAlexander Litvinenko
 

What's hot (20)

2012 mdsp pr13 support vector machine
2012 mdsp pr13 support vector machine2012 mdsp pr13 support vector machine
2012 mdsp pr13 support vector machine
 
Convolutional neural network backpropagation derivation
Convolutional neural network backpropagation derivationConvolutional neural network backpropagation derivation
Convolutional neural network backpropagation derivation
 
Cnn backpropagation derivation
Cnn backpropagation derivationCnn backpropagation derivation
Cnn backpropagation derivation
 
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
 
2012 mdsp pr12 k means mixture of gaussian
2012 mdsp pr12 k means mixture of gaussian2012 mdsp pr12 k means mixture of gaussian
2012 mdsp pr12 k means mixture of gaussian
 
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
 
Statistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. PeixotoStatistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. Peixoto
 
Neural Art (English Version)
Neural Art (English Version)Neural Art (English Version)
Neural Art (English Version)
 
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
 
Computational Linguistics week 5
Computational Linguistics  week 5Computational Linguistics  week 5
Computational Linguistics week 5
 
Chapter 06 boolean algebra 2o-p
Chapter 06 boolean algebra 2o-pChapter 06 boolean algebra 2o-p
Chapter 06 boolean algebra 2o-p
 
Gwt sdm public
Gwt sdm publicGwt sdm public
Gwt sdm public
 
Chapter 06 boolean algebra
Chapter 06 boolean algebraChapter 06 boolean algebra
Chapter 06 boolean algebra
 
Chapter 06 boolean algebra 3o-p
Chapter 06 boolean algebra 3o-pChapter 06 boolean algebra 3o-p
Chapter 06 boolean algebra 3o-p
 
2012 mdsp pr09 pca lda
2012 mdsp pr09 pca lda2012 mdsp pr09 pca lda
2012 mdsp pr09 pca lda
 
2012 mdsp pr10 ica
2012 mdsp pr10 ica2012 mdsp pr10 ica
2012 mdsp pr10 ica
 
2012 mdsp pr11 ica part 2 face recognition
2012 mdsp pr11 ica part 2 face recognition2012 mdsp pr11 ica part 2 face recognition
2012 mdsp pr11 ica part 2 face recognition
 
NTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANsNTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANs
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphs
 
Hierarchical matrix techniques for maximum likelihood covariance estimation
Hierarchical matrix techniques for maximum likelihood covariance estimationHierarchical matrix techniques for maximum likelihood covariance estimation
Hierarchical matrix techniques for maximum likelihood covariance estimation
 

Viewers also liked (6)

Game theory
Game theoryGame theory
Game theory
 
Lesson 35: Game Theory and Linear Programming
Lesson 35: Game Theory and Linear ProgrammingLesson 35: Game Theory and Linear Programming
Lesson 35: Game Theory and Linear Programming
 
Game theory
Game theoryGame theory
Game theory
 
Game theory
Game theoryGame theory
Game theory
 
Game theory
Game theoryGame theory
Game theory
 
Linear Programming 1
Linear Programming 1Linear Programming 1
Linear Programming 1
 

Similar to Sparse Binary Zero Sum Games (ACML2014)

Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationFeynman Liang
 
MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsElvis DOHMATOB
 
Convolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernelsConvolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernelstuxette
 
Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...
Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...
Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...Taiji Suzuki
 
Convex Optimization Modelling with CVXOPT
Convex Optimization Modelling with CVXOPTConvex Optimization Modelling with CVXOPT
Convex Optimization Modelling with CVXOPTandrewmart11
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function홍배 김
 
Do's and Don'ts of using t-SNE.pdf
Do's and Don'ts of using t-SNE.pdfDo's and Don'ts of using t-SNE.pdf
Do's and Don'ts of using t-SNE.pdfFrankClat
 
ISI MSQE Entrance Question Paper (2008)
ISI MSQE Entrance Question Paper (2008)ISI MSQE Entrance Question Paper (2008)
ISI MSQE Entrance Question Paper (2008)CrackDSE
 
Variational Autoencoded Regression of Visual Data with Generative Adversarial...
Variational Autoencoded Regression of Visual Data with Generative Adversarial...Variational Autoencoded Regression of Visual Data with Generative Adversarial...
Variational Autoencoded Regression of Visual Data with Generative Adversarial...NAVER Engineering
 
SPDE presentation 2012
SPDE presentation 2012SPDE presentation 2012
SPDE presentation 2012Zheng Mengdi
 
Linear Cryptanalysis Lecture 線形解読法
Linear Cryptanalysis Lecture 線形解読法Linear Cryptanalysis Lecture 線形解読法
Linear Cryptanalysis Lecture 線形解読法Kai Katsumata
 
Computer organiztion3
Computer organiztion3Computer organiztion3
Computer organiztion3Umang Gupta
 
MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4arogozhnikov
 

Similar to Sparse Binary Zero Sum Games (ACML2014) (20)

Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference Compilation
 
MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priors
 
CompactDisc.ppt
CompactDisc.pptCompactDisc.ppt
CompactDisc.ppt
 
Convolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernelsConvolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernels
 
Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...
Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...
Minimax optimal alternating minimization \\ for kernel nonparametric tensor l...
 
Randomized algorithms ver 1.0
Randomized algorithms ver 1.0Randomized algorithms ver 1.0
Randomized algorithms ver 1.0
 
Convex Optimization Modelling with CVXOPT
Convex Optimization Modelling with CVXOPTConvex Optimization Modelling with CVXOPT
Convex Optimization Modelling with CVXOPT
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function
 
Do's and Don'ts of using t-SNE.pdf
Do's and Don'ts of using t-SNE.pdfDo's and Don'ts of using t-SNE.pdf
Do's and Don'ts of using t-SNE.pdf
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
Automatic bayesian cubature
Automatic bayesian cubatureAutomatic bayesian cubature
Automatic bayesian cubature
 
ISI MSQE Entrance Question Paper (2008)
ISI MSQE Entrance Question Paper (2008)ISI MSQE Entrance Question Paper (2008)
ISI MSQE Entrance Question Paper (2008)
 
Variational Autoencoded Regression of Visual Data with Generative Adversarial...
Variational Autoencoded Regression of Visual Data with Generative Adversarial...Variational Autoencoded Regression of Visual Data with Generative Adversarial...
Variational Autoencoded Regression of Visual Data with Generative Adversarial...
 
SPDE presentation 2012
SPDE presentation 2012SPDE presentation 2012
SPDE presentation 2012
 
Unit 3
Unit 3Unit 3
Unit 3
 
Unit 3
Unit 3Unit 3
Unit 3
 
Linear Cryptanalysis Lecture 線形解読法
Linear Cryptanalysis Lecture 線形解読法Linear Cryptanalysis Lecture 線形解読法
Linear Cryptanalysis Lecture 線形解読法
 
Computer organiztion3
Computer organiztion3Computer organiztion3
Computer organiztion3
 
MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4MLHEP 2015: Introductory Lecture #4
MLHEP 2015: Introductory Lecture #4
 
Bitwise
BitwiseBitwise
Bitwise
 

More from Jialin LIU

Slides banditrmhc
Slides banditrmhcSlides banditrmhc
Slides banditrmhcJialin LIU
 
GVGAI Single-Player Learning Competition at IEEE CIG17
GVGAI Single-Player Learning Competition at IEEE CIG17GVGAI Single-Player Learning Competition at IEEE CIG17
GVGAI Single-Player Learning Competition at IEEE CIG17Jialin LIU
 
My PhD defence
My PhD defenceMy PhD defence
My PhD defenceJialin LIU
 
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)Jialin LIU
 
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...Jialin LIU
 
Meta online learning: experiments on a unit commitment problem (ESANN2014)
Meta online learning: experiments on a unit commitment problem (ESANN2014)Meta online learning: experiments on a unit commitment problem (ESANN2014)
Meta online learning: experiments on a unit commitment problem (ESANN2014)Jialin LIU
 
Noisy Optimization Convergence Rates (GECCO2013)
Noisy Optimization Convergence Rates (GECCO2013)Noisy Optimization Convergence Rates (GECCO2013)
Noisy Optimization Convergence Rates (GECCO2013)Jialin LIU
 

More from Jialin LIU (7)

Slides banditrmhc
Slides banditrmhcSlides banditrmhc
Slides banditrmhc
 
GVGAI Single-Player Learning Competition at IEEE CIG17
GVGAI Single-Player Learning Competition at IEEE CIG17GVGAI Single-Player Learning Competition at IEEE CIG17
GVGAI Single-Player Learning Competition at IEEE CIG17
 
My PhD defence
My PhD defenceMy PhD defence
My PhD defence
 
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
 
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...
 
Meta online learning: experiments on a unit commitment problem (ESANN2014)
Meta online learning: experiments on a unit commitment problem (ESANN2014)Meta online learning: experiments on a unit commitment problem (ESANN2014)
Meta online learning: experiments on a unit commitment problem (ESANN2014)
 
Noisy Optimization Convergence Rates (GECCO2013)
Noisy Optimization Convergence Rates (GECCO2013)Noisy Optimization Convergence Rates (GECCO2013)
Noisy Optimization Convergence Rates (GECCO2013)
 

Recently uploaded

NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)
NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)
NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)Basil Achie
 
The Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationThe Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationNathan Young
 
Genshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxGenshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxJohnree4
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@vikas rana
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringSebastiano Panichella
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfhenrik385807
 
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...NETWAYS
 
Event 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxEvent 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxaryanv1753
 
Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸mathanramanathan2005
 
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSebastiano Panichella
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfhenrik385807
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...NETWAYS
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...henrik385807
 
Philippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptPhilippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptssuser319dad
 
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Salam Al-Karadaghi
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal
 
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...NETWAYS
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxmavinoikein
 
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...marjmae69
 

Recently uploaded (20)

NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)
NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)
NATIONAL ANTHEMS OF AFRICA (National Anthems of Africa)
 
The Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationThe Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism Presentation
 
Genshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxGenshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptx
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software Engineering
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
 
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
Open Source Camp Kubernetes 2024 | Running WebAssembly on Kubernetes by Alex ...
 
Event 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxEvent 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptx
 
Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸
 
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
 
Philippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptPhilippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.ppt
 
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
 
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptx
 
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
 

Sparse Binary Zero Sum Games (ACML2014)

  • 1. Sparse Binary Zero-Sum Games [ACML 2014] David Auger1 Jialin Liu2 Sylvie Ruette3 David L. St-Pierre4 Olivier Teytaud2 1AlCAAP, Laboratoire PRiSM, Universite de Versailles Saint Quentin-en-Yvelines, France 2TAO, INRIA-CNRS-LRI, Universite Paris-Sud, France 3Laboratoire de Mathematiques, CNRS, Universite Paris-Sud, France 4Universite du Quebec a Trois-Rivieres, Canada Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 1 / 26
  • 2. Thanks to reviewers for very fruitful comments. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 2 / 26
  • 3. Introduction Two-person zero-sum game MKK Nash Equilibrium ! O(K2) with 3 Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
  • 4. Introduction Two-person zero-sum game MKK Nash Equilibrium ! O(K2) with 3 If the Nash is sparse ! k k submatrix ! O(k3kK log K) with probability 1 (provable) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
  • 6. ned by matrix M I choose (privately) i Simultaneously, you choose j I earn Mi ;j You earn Mi ;j So this is zero-sum. Or you earn 1 Mi ;j (so this is 1-sum, equivalent). Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 4 / 26
  • 7. Ok, I earn Mi ;j , you earn Mi ;j Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
  • 8. Ok, I earn Mi ;j , you earn Mi ;j Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
  • 9. Nash Equilibrium Nash Equilibrium (NE) Zero-sum matrix game M My strategy = probability distrib. on rows = x Your strategy = probability distrib. on cols = y Expected reward = xTMy There exists x; y such that 8x; y, xTMy xTMy xTMy: (x; y) is a Nash Equilibrium (no unicity). Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 6 / 26
  • 10. Ok, I earn Mi ;j , you earn Mi ;j Nash: Ok I play i with probability x i Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
  • 11. Ok, I earn Mi ;j , you earn Mi ;j Nash: Ok I play i with probability x i How to compute x*? Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
  • 12. Solving Nash Solution 1: Linear Programming (LP) 1 M M + C so that it is positive (without loss of generality) 2 LP:
  • 13. nd 0 u minimizing P i ui such that (MT ) u 1 P 3 x = u= i ui =) classical, provably exact, polynomial time Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 8 / 26
  • 14. Solving Nash Solution 2: Approximate Nash Equilibrium Approximate -NE (x; y) such that xTMy xTMy xTMy + : Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 9 / 26
  • 15. Solution 1: LP (comp. expensive) Solution 2: Approximate Nash Equilibrium Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
  • 16. Solution 1: LP (comp. expensive) Solution 2: Approximate Nash Equilibrium Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
  • 17. Computing approximate Nash Equilibrium Assuming the matrix is of size K K ... LP (see reduction from Nash to linear programming in [Von Stengel (2002)]): O(K2) with 3 4 [Grigoriadis and Khachiyan(1995)]: -Nash with expected time O(K log(K) 2 ), i.e. less than the size of the matrix! Parallel : O( log2(K) 2 ) if using K log(K) processors Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
  • 18. Computing approximate Nash Equilibrium Assuming the matrix is of size K K ... LP (see reduction from Nash to linear programming in [Von Stengel (2002)]): O(K2) with 3 4 [Grigoriadis and Khachiyan(1995)]: -Nash with expected time O(K log(K) 2 ), i.e. less than the size of the matrix! Parallel : O( log2(K) 2 ) if using K log(K) processors Other algorithms: similar complexity, approximate solution +
  • 19. xed time with probability 1 EXP3 ([Auer et al.(1995)]) Inf ([Audibert and Bubeck(2009)]) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
  • 20. Other tools 1: Hadamard determinant Hadamard determinant bound ([Hadamard(1893)], [Brenner and Cummings(1972)]) Given matrix Mkk with coecients in f1; 0; 1g, then M has determinant at most k k 2 , i.e. j detMj k k 2 : Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 12 / 26
  • 21. Other tools 2: Linear programming Solve min ax Mx c x 2 Rd If there is a
  • 22. nite optimum, then there is a
  • 23. nite optimum x such that, for some E with jEj = d, 8i 2 E, Mi x = ci the Mi for i in E are linear independent (=) i.e. d lin. indep. constraints are active) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 13 / 26
  • 24. Why is this relevant ? Nash = solution of linear programming problem x: Nash Equilibrium of MKK Let us assume that x is unique and has at most k non-zero components (sparsity) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
  • 25. Why is this relevant ? Nash = solution of linear programming problem x: Nash Equilibrium of MKK Let us assume that x is unique and has at most k non-zero components (sparsity) ) x = also NE of a k k submatrix: Mk ) x = solution of LP in dimension k ) x = solution of k lin. eq. with coecients in f1; 0; 1g ) x = inv-matrix vector ) x = obtained by cofactors / det matrix x k ) has denominator at most k 2 0k Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
  • 26. How to realise ? Under assumption that the Nash is sparse x is rational with small denominator So let us compute an -Nash (sublinear time!) And let us compute its closest approximation with small denominator (Hadamard) variants for -Nash =) exact Nash Rounding: switch to closest approximation Truncation: remove small components and work on the remaining submatrix (exact solving) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 15 / 26
  • 27. Evil in the details jjy yjj1 does not imply V(y) V(y) + ; indeed V(y) V(y) + jjyyjj1 k k 2 Results : (if Grigoriadis) For a K K matrix with Nash k-sparse Exact solution in time O(poly (k) + (K log K)k3k ) with truncation-algorithm Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 16 / 26
  • 28. Experimental results: two card games Previous results: ingaming of Urban Rivals New results: metagaming of Pokemon Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 17 / 26
  • 29. Ingaming results (Urban Rivals) Previous work: [Flory and Teytaud(2011)], implementation of Truncated-EXP3, without proof Urban Rivals AI = Monte Carlo Tree Search ([Coulom (2006)]), using zero-sum matrix games as a key component Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 18 / 26
  • 30. Ingaming results (Urban Rivals) Previous work: [Flory and Teytaud(2011)], implementation of Truncated-EXP3, without proof Results don't look impressive ( 56%), but the game is highly randomized =) Reaching 55% is far from being negligible Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 19 / 26
  • 31. New experiments Test on Pokemon Deck choice (metagaming) Based on EXP3+truncation Various versions of EXP3 (6= parameters) Code available https://www.lri.fr/~teytaud/games.html Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 20 / 26
  • 32. New experiments With a poorly tuned EXP3 : truncation brings a huge improvement 0 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 10 1 10 2 10 3 10 4 10 0.45 TEXP3 vs EXP3 0 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 10 1 10 2 10 3 10 4 10 0.45 TEXP3 vs Uniform EXP3 vs Uniform Figure: Performance in terms of budget T with a poorly tuned EXP3 for the game of Pokeman using 2 cards. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 21 / 26
  • 33. New experiments With a well-tuned EXP3, truncation brings a signi
  • 34. cant improvement 0 0.58 0.57 0.56 0.55 0.54 0.53 0.52 0.51 10 1 10 2 10 3 10 4 10 0.5 TEXP3 vs EXP3 0 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 10 1 10 2 10 3 10 4 10 0.45 TEXP3 vs Uniform EXP3 vs Uniform Figure: Performance in terms of budget T with a well-tuned EXP3 for the game of Pokeman using 2 cards. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 22 / 26
  • 35. Conclusions further work Proved small improvement, experimentally big improvement. Improving the bound ? We don't know k (sparsity level). Adaptive algorithms ? Proved only with unique Nash (x; y). Necessary ? Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 23 / 26
  • 36. Jean-Yeves Audibert and Sebastien Bubeck. Minimax policies for adversarial and stochastic bandits. In 22th annual conference on learning theory, 2009. Peter Auer, Nicolo Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. Gambling in a rigged casino: the adversarial multi-armed bandit problem. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science. IEEE Computer Society Press, 1995. Remi Coulom (2006). Ecient selectivity and backup operators in Monte-Carlo tree search. In Computers and games, 2006. Joel Brenner and Larry Cummings. The Hadamard maximum determinant problem. In Amer. Math. Monthly, 1972. Sebastien Flory and Olivier Teytaud. Upper con
  • 37. dence trees with short term partial information. In Procedings of EvoGames, 2011. Michael D. Grigoriadis and Leonid G. Khachiyan. A sublinear-time randomized approximation algorithm for matrix games. In Operations Research Letters, 1995. Jacques Hadamard. Resolution d'une question relative aux determinants. In Bull. Sci. Math., 1893. Bernhard Von Stengel. Computing equilibria for two-person games. In Handbook of game theory with economic applications, 2002. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 24 / 26
  • 38. Thank you for your attention ! David Auger David L. St-Pierre Sylvie Ruette Olivier Teytaud Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 25 / 26
  • 39. [ACML 2014] Sparse Binary Zero-Sum Games D. Auger J. Liu S. Ruette D. L. St-Pierre O. Teytaud Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 26 / 26