Portfolios of Artificial Intelligences 
+ playing with random seeds 
1. What is a portfolio 
2. Offline portfolio 
3. Online portfolio 
4. Mathematics (sorry) 
5. Experiments 
J.-B. Hoock, D. L. St-Pierre, O. Teytaud
Portfolio 
● I have K algorithms for solving a given task : 
– Mcts 
– Alpha-Beta 
– Parametric script 
– Nested MC 
– … 
● I want to choose the best one
Two frameworks 
● Offline 
– I do some work before the competition 
– I combine all my algorithms into 1 
– Simple version : 
● Compute some probability vector p 
● For each game, use Algo(i) with probability p(i) 
● Online 
– For each game, 
● Use Algo(i) with probability p(i) 
● Update p when the game is over
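The "simple version" above (use Algo(i) with probability p(i)) can be sketched as follows. This is only an illustration: the function name `pick_algorithm` and the example vector `p` are assumptions, not taken from the slides.

```python
import random

def pick_algorithm(p, rng=random):
    """Return an index i drawn with probability p[i] (p is the portfolio vector)."""
    return rng.choices(range(len(p)), weights=p, k=1)[0]

# Hypothetical portfolio over K = 3 algorithms.
p = [0.2, 0.5, 0.3]
i = pick_algorithm(p, random.Random(0))  # index of the algorithm to use for this game
```

In the offline framework p stays fixed during the competition; in the online framework it is recomputed after each game.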
1. What is a portfolio 
2. Offline portfolio 
3. Online portfolio 
4. Mathematics (sorry) 
5. Experiments
Offline Nash portfolio 
● K algorithms for black BAI(1),..., BAI(K) 
● K' algorithms for white WAI(1),...,WAI(K') 
● Def : Mij=proba( BAI(i) beats WAI(j) ) 
● Define (p,q) = Nash equilibrium of M 
– p = best stochastic portfolio for Black (Nash sense) 
– q = best stochastic portfolio for White (Nash sense) 
● Portfolio : 
– Black : Play BAI(i) with probability p(i) 
– White : Play WAI(j) with probability q(j)
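One simple way to approximate (p,q) from M is fictitious play, sketched below. This is a swapped-in textbook method chosen for brevity, not necessarily the solver used by the authors; the function name and the iteration count are assumptions.

```python
def fictitious_play(M, iterations=20000):
    """Approximate a Nash equilibrium (p, q) of the zero-sum matrix game M.

    M[i][j] = probability that BAI(i) beats WAI(j).
    Black (rows) maximizes the winrate, White (columns) minimizes it.
    """
    K, K2 = len(M), len(M[0])
    black_counts = [0] * K
    white_counts = [0] * K2
    # Cumulative payoff of each pure choice against the opponent's past choices.
    black_payoff = [0.0] * K
    white_payoff = [0.0] * K2
    for _ in range(iterations):
        i = max(range(K), key=lambda a: black_payoff[a])   # Black's best response
        j = min(range(K2), key=lambda b: white_payoff[b])  # White's best response
        black_counts[i] += 1
        white_counts[j] += 1
        for a in range(K):
            black_payoff[a] += M[a][j]
        for b in range(K2):
            white_payoff[b] += M[i][b]
    # The empirical frequencies of play are the approximate equilibrium.
    p = [c / iterations for c in black_counts]
    q = [c / iterations for c in white_counts]
    return p, q

# Toy matrix (assumption): each Black algorithm beats exactly one White algorithm.
M = [[1.0, 0.0],
     [0.0, 1.0]]
p, q = fictitious_play(M)  # both should be close to the uniform distribution
```

Fictitious play converges slowly, so an exact linear-programming solver is preferable when one is available.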
Other offline portfolios 
● K algorithms for black BAI(1),..., BAI(K) 
● K' algorithms for white WAI(1),...,WAI(K') 
● Definitions : 
– Uniform portfolio : p(i) = 1/K, q(j) = 1/K' 
– Fixed seed : p(i)=1, q(j)=1 for some i,j 
– Best arm : fixed seed with i best row / j best column 
● Portfolio : 
– Black : Play BAI(i) with probability p(i) 
– White : Play WAI(j) with probability q(j)
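These definitions can be written down directly from the matrix M, where M[i][j] is the probability that BAI(i) beats WAI(j). The sketch below assumes that "best row" means the row with the highest mean winrate and "best column" the column with the lowest mean winrate for Black; the slides do not spell this out, and the function names are hypothetical.

```python
def uniform_portfolio(K):
    """p(i) = 1/K for every i."""
    return [1.0 / K] * K

def best_arm_black(M):
    """Fixed-seed portfolio on the row with the highest mean winrate (assumption)."""
    means = [sum(row) / len(row) for row in M]
    best = max(range(len(M)), key=lambda i: means[i])
    return [1.0 if i == best else 0.0 for i in range(len(M))]

def best_arm_white(M):
    """Fixed-seed portfolio on the column where Black's mean winrate is lowest (assumption)."""
    K, K2 = len(M), len(M[0])
    means = [sum(M[i][j] for i in range(K)) / K for j in range(K2)]
    best = min(range(K2), key=lambda j: means[j])
    return [1.0 if j == best else 0.0 for j in range(K2)]

# Hypothetical 2x2 winrate matrix.
M = [[0.2, 0.6],
     [0.9, 0.7]]
p = best_arm_black(M)  # all mass on row 1
q = best_arm_white(M)  # all mass on column 0
```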
1. What is a portfolio 
2. Offline portfolio 
3. Online portfolio 
4. Mathematics (sorry) 
5. Experiments
Online portfolio (for Black) 
● Just apply UCBT (or your favorite bandit) 
● Before playing a game : 
– p(i) = frequency of win for BAI(i) 
– n(i) =number of times BAI(i) was used 
– N= sum of the n(i) 
– sc(i) = p(i) + C log(N)/n(i) + C' sqrt( p(i)(1-p(i)) log(N)/n(i) ) 
– choose i* maximizing sc(i) 
● Play with BAI(i*)
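The score above can be computed as in the following sketch. The slides give no values for C and C', so the defaults below are placeholders; giving an untried algorithm an infinite score (so that it is played once first) is also an assumption.

```python
import math

def ucbt_scores(wins, plays, C=1.0, C_prime=1.0):
    """sc(i) = p(i) + C log(N)/n(i) + C' sqrt(p(i)(1-p(i)) log(N)/n(i))."""
    N = sum(plays)
    scores = []
    for w, n in zip(wins, plays):
        if n == 0:
            scores.append(math.inf)  # assumption: try every algorithm at least once
            continue
        p = w / n  # frequency of win for this algorithm
        exploration = C * math.log(N) / n
        variance_term = C_prime * math.sqrt(p * (1 - p) * math.log(N) / n)
        scores.append(p + exploration + variance_term)
    return scores

def choose_algorithm(wins, plays, C=1.0, C_prime=1.0):
    """Return i* maximizing sc(i); ties go to the lowest index."""
    scores = ucbt_scores(wins, plays, C, C_prime)
    return max(range(len(scores)), key=lambda i: scores[i])
```

After each game, increment `plays[i*]` (and `wins[i*]` if Black won) before choosing again.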
1. What is a portfolio 
2. Offline portfolio 
3. Online portfolio 
4. Mathematics (sorry) 
5. Experiments
Nash 
Computed 
● exactly in polynomial time. 
● with precision e in expected time O( (K+K') log(K+K')^2 / e^2 ) 
The best portfolio in terms of 
● Worst-case winrate against the WAI(j) 
● Worst-case winrate against WAI(j) for j ~ some probability distribution
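The first criterion, the worst-case winrate of a Black portfolio p against the WAI(j), is the minimum over j of sum_i p(i) M(i,j). A minimal sketch, with a hypothetical function name and a toy matrix that is not from the experiments:

```python
def worst_case_winrate(p, M):
    """Lowest expected winrate of portfolio p over all White algorithms j."""
    K, K2 = len(M), len(M[0])
    return min(sum(p[i] * M[i][j] for i in range(K)) for j in range(K2))

# Toy matrix (assumption): each Black algorithm beats exactly one White algorithm.
M = [[1.0, 0.0],
     [0.0, 1.0]]
mixed = worst_case_winrate([0.5, 0.5], M)  # 0.5: no White choice exploits the mixture
fixed = worst_case_winrate([1.0, 0.0], M)  # 0.0: White answers with WAI(2)
```

On this toy matrix any deterministic choice can be fully exploited, while the mixed portfolio cannot.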
UCBT for Black 
● Nearly zero computational overhead 
● Asymptotically optimal winning rate against a 
stationary opponent, among the BAI(i) 
● We did not try discounted Ucb
1. What is a portfolio 
2. Offline portfolio 
3. Online portfolio 
4. Mathematics (sorry) 
5. Experiments 
on 9x9 Go
First portfolio : random seeds 
● Pick a stochastic algorithm 
● Choose K random seeds 
● You get K algorithms 
Hint : the random seed has a significant impact. 
Yes, it's by rote learning (kind of opening book).
Performance of Nash portfolio 
(learnt offline), in generalization 
● Against « new » seeds 
● Vs uniform 
==> this means we outperform the default version (which uses randomized seeds). 
Here, a portfolio is a distribution over random seeds. 
We get an improved algorithm (winning rate 66%) just with that. 
[Figure: X-axis = K = K']
Remarks 
● Nash portfolio good 
● « Best Arm » seed very good 
● But we will see that « best arm » has 
weaknesses ==> it can be « overfitted » i.e. 
easily beaten by a « learning » opponent.
UCBT crushes fixedSeed and wins against uniform 
Dots decreasing to 0. 
Fixed seeds (deterministic algorithms) are overfitted after 64 games 
[Figure: X-axis = log2 (nb of games), max. 512 games]
Other experiments : variants of 
some algorithm 
● Gnugo with options (32 variants) 
● Nash-portfolio or UCBT portfolio : only a few percent improvement 
over a single ad hoc variant. 
==> less impressive than with random seeds
Conclusions 
● Nice application for Nash-portfolio: 
– Choose a stochastic algorithm 
– Build a matrix M of games randomSeed vs 
randomSeed 
– Compute the Nash equilibrium 
– You get a new probability distribution on random seeds 
– It should be stronger than the original algorithm. 
● Nice application for UCBT-portfolio 
– Play against it 
– As long as you lose, it will keep the same line of play
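The matrix-building step of the Nash-portfolio recipe can be sketched as below. `toy_game` is a hypothetical stand-in for one real game between two seeded copies of the stochastic algorithm (for instance a Go engine launched with those seeds); its outcome is arbitrary and only illustrates the data flow.

```python
import random

def toy_game(black_seed, white_seed):
    """Hypothetical stand-in for one game: returns 1 if Black wins, else 0.

    The result depends only on the two seeds, as it would for two
    deterministic (fixed-seed) players.
    """
    rng = random.Random(black_seed * 1_000_003 + white_seed)
    return 1 if rng.random() < 0.5 else 0

def build_matrix(black_seeds, white_seeds, play=toy_game):
    """M[i][j] = result of Black seed i against White seed j."""
    return [[play(b, w) for w in white_seeds] for b in black_seeds]

M = build_matrix(range(4), range(3))  # K = 4 Black seeds, K' = 3 White seeds
```

The Nash equilibrium of M then gives the new probability distribution over seeds.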
Conclusions 
● Further work 
– Better Nash approximation 
– Increase fun (should Ucbt explore more or less ? 
discount ?) 
– Bigger experiments (bigger games ? 19x19 ?) 
● Comments ? 
We forgot to cite your paper ? 
We did not try on your favorite game ? 
Our results are bullshit ? Please tell us:-)