SlideShare a Scribd company logo
MetaOnline Learning: Experimentsona Unit 
CommitmentProblem 
Jialin Liu, Olivier Teytaud 
liu@lri.fr,teytaud@lri.fr 
Black-box Noisy Optimization 
Objective function fitness : Rd ! R 
Optimum  = argmin 
2Rd 
fitness() 
Some NOAs 
 RSAES: Self-Adaptive Evolution Strategy 
with resampling; 
 Fabian’s algorithm: a first-order method 
using gradients estimated by finite 
differences[?, ?]; 
 Noisy Newton’s algorithm: a second-order 
method using a Hessian matrix approxi-mated 
also by finite differences[?]. 
Compare Solvers Early 
 kn  n: lag 
 Why this lag ? 
(i) comparing current recommendations 
! comparing good points 
! very close fitness 
! very expensive 
(ii) algorithms’ ranking is usually stable 
! let us save up time by comparing 
older recommendations 
Solvers and Notations 
: parent pop. size in ES 
: pop. size in ES 
d: search space dimension 
n: generation index 
n : stepsize at generation n 
rn: resampling number at generation n 
For all NOPA: 
kn = dn0:1e 
rn = n3 
sn = 15n 
Table 1: Solvers in experiments 
Notation Algorithm and parametrization 
RSAES  = 10d,  = 5d, rn = 10n2 
Fabian1 n = 10=n0:49, a = 100 
Fabian2 n = 10=n0:05, a = 100 
Newton1 n = 10=n, rn = n2 
Newton2 n = 100=n4, rn = n2 
P:12345 NOPA of 5 solvers above 
P:12345 + S: P:12345 with information sharing. 
P:22 NOPA of 2 (identical) Fabian1 
P:22 + S: P:22 with information sharing. 
P:222 NOPA of 3 (identical) Fabian1 
P:222 + S: P:222 with information sharing. 
Some References 
Abstract 
 Online learning = real time machine learning “on the fly” 
 Meta online learning = combining several online learning algorithms from a given set (termed 
portfolio) of algorithms ' combining Noisy Optimization Algorithms (NOPA=noisy optimiza-tion 
portfolio algorithm). 
 Goals: (i) mitigating the effect of a bad choice of online learning algorithms (ii) parallelization 
(iii) combining the strengths of different algorithms. 
 This paper: 
- Portfolio = classical for combinatorial optimization: we test portfolios for noisy optimization. 
- Recently, a methodology termed lag has been proposed for NOPA. We test experimentally 
the lag methodology for various problems. 
Noisy Optimization Portfolio Algorithm (NOPA) 
Iteration n of the portfolio fS1; : : : ; SMg containing M NOAs: 
 Initialization module: If n = 0 initialize all Si, i 2 f1; : : : ;Mg. 
 For i 2 f1; : : : ;Mg: 
– Update module: Apply an iteration of solver Si until it has received at least n data samples. 
– Let i;n be the current recommendation by solver Si. 
 Comparison module: If n = rm for some m, then 
– For i 2 f1; : : : ;Mg, perform sm evaluations of the (stochastic) reward R(i;kn) and define yi the average reward. 
– Define i   arg mini2f1;:::;Mg yi. 
 Recommendation module: ~ = i;n 
Experiments 
Table 2: Artificial problem R() = jj  jj2 + jj  jjz  Gaussian. n: evaluation number. z = rate 
at which the variance decreases around the optimum. 
z Comparison of log(R(~n))= log(n) for d = 2 Comparison of log(R(~n))= log(n) for d = 5 
0 Newton1  RSAES ' P:12345  : : : Newton1  RSAES ' P:12345  : : : 
1 P:12345  Fabian1 ' P:22  : : : Fabian1  P:22  P:222  : : : 
2 P:12345  Fabian1 ' P:22  : : : Fabian1  P:12345  P:22  : : : 
Discussion: NOPAs are usually not far from the best of their NOAs. In small dimension with noise 
variance decreazing quickly to 0 around optimum (z = 2), NOPA outperforms all its NOAs. 
Table 3: Stochastic Unit Commitment problems, conformant planning. St: number of stocks. 
Problem size Considered NOA or NOPA 
St, T, d P:22 P:22 + S: P:222 P:222 + S: Best NOA Worst NOA 
3, 21, 63 0.61  0.07 0.63  0.03 0.63  0.05 0.63  0.07 0.49  0.08 0.81  0.05 
4, 21, 84 0.75  0.02 0.75  0.03 0.79  0.05 0.76  0.03 0.69  0.06 1.27  0.06 
5, 21, 105 0.53  0.04 0.58  0.08 0.58  0.03 0.52  0.05 0.58  0.04 1.44  0.16 
6, 15, 90 0.40  0.05 0.39  0.06 0.37  0.06 0.39  0.06 0.38  0.06 0.96  0.13 
6, 21, 126 0.53  0.08 0.54  0.08 0.55  0.07 0.54  0.07 0.54  0.07 1.78  0.37 
8, 15, 120 0.53  0.03 0.50  0.05 0.53  0.02 0.51  0.05 0.51  0.04 1.70  0.10 
8, 21, 168 0.69  0.04 0.77  0.09 0.73  0.06 0.71  0.04 0.71  0.06 2.68  0.02 
7, 21, 147 0.70  0.07 0.70  0.05 0.70  0.07 0.70  0.07 0.69  0;06 2.28  0.08 
Discussion: Given a same budget, a NOPA of identical solvers can outperform its NOAs. RSAES 
is usually the best NOA for small dimensions and variants of Fabian for large dimension. 
Table 4: Approximate convergence rates log(R(~n))= log(n) for Cart-Pole, a multimodal problem, using 
NN. n: evaluation number. 
Solver 2 neurons, d = 9 4 neurons, d = 17 8 neurons, d = 33 
1 (RSAES) -0.4580330.045014 -0.4215350.045643 -0.3517260.051705 
2 (Fabian1) 0.0022265.29923e-05 0.0020891.57766e-04 0.002218.14518e-05 
3 (Fabian2) 0.0023189.80792e-05 0.0022381.14289e-04 0.002361.51244e-04 
4 (Newton1) 0.0022296.08973e-05 -0.0307310.111294 0.0022471.19829e-04 
5 (Newton2) 0.002275.2989e-05 0.0022177.80888e-05 0.0023079.96404e-05 
6 (P:12345) -0.4087050.068428 -0.39170.071791 -0.3203990.050338 
7 (P:12345 + S:) -0.427430.05709 -0.4037070.056173 -0.3540430.069576 
Discussion: Fabian and Newton can’t solve this multimodal problem ) one solver is much better 
than others ) easy for NOPA. 
Conclusion 
 Main conclusion: 
Usual: Portfolio of Algorithms for Combinatorial Optimization; 
New: Portfolio of Algorithms for Noisy Optimization. 
 “Sharing” not that good. 
 NOPA sometimes better than NOA even if all NOA equal! 
 We show mathematically[?] and empirically a log(M) shift when using M solvers, when working 
on the log-log scale (usual scale in noisy optimization). 
 Portfolio = approximately as efficient as the best - except when one iteration of one algorithm 
monopolizes most of the budget - as RSAES in the unit commitment problem.

More Related Content

What's hot

Aaex7 group2(中英夾雜)
Aaex7 group2(中英夾雜)Aaex7 group2(中英夾雜)
Aaex7 group2(中英夾雜)
Shiang-Yun Yang
 
Scalable membership management
Scalable membership management Scalable membership management
Scalable membership management
Vinay Setty
 
Kmeans with canopy clustering
Kmeans with canopy clusteringKmeans with canopy clustering
Kmeans with canopy clustering
SeongHyun Jeong
 
Siphon
SiphonSiphon
Optimal Budget Allocation: Theoretical Guarantee and Efficient Algorithm
Optimal Budget Allocation: Theoretical Guarantee and Efficient AlgorithmOptimal Budget Allocation: Theoretical Guarantee and Efficient Algorithm
Optimal Budget Allocation: Theoretical Guarantee and Efficient Algorithm
Tasuku Soma
 
lecture 4
lecture 4lecture 4
lecture 4
sajinsc
 
Metodo Monte Carlo -Wang Landau
Metodo Monte Carlo -Wang LandauMetodo Monte Carlo -Wang Landau
Metodo Monte Carlo -Wang Landau
angely alcendra
 
A study of the worst case ratio of a simple algorithm for simple assembly lin...
A study of the worst case ratio of a simple algorithm for simple assembly lin...A study of the worst case ratio of a simple algorithm for simple assembly lin...
A study of the worst case ratio of a simple algorithm for simple assembly lin...
narmo
 
Graph Kernelpdf
Graph KernelpdfGraph Kernelpdf
Graph Kernelpdf
pratik shukla
 
Maximizing Submodular Function over the Integer Lattice
Maximizing Submodular Function over the Integer LatticeMaximizing Submodular Function over the Integer Lattice
Maximizing Submodular Function over the Integer Lattice
Tasuku Soma
 
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean OptimizationEfficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
jfrchicanog
 
Scaling out logistic regression with Spark
Scaling out logistic regression with SparkScaling out logistic regression with Spark
Scaling out logistic regression with Spark
Barak Gitsis
 
Gradient descent optimizer
Gradient descent optimizerGradient descent optimizer
Gradient descent optimizer
Hojin Yang
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for Images
David Bařina
 
Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.
Ian Sa
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
Chiamin Hsu
 
Pseudo and Quasi Random Number Generation
Pseudo and Quasi Random Number GenerationPseudo and Quasi Random Number Generation
Pseudo and Quasi Random Number Generation
Ashwin Rao
 
Ultrasound Modular Architecture
Ultrasound Modular ArchitectureUltrasound Modular Architecture
Ultrasound Modular Architecture
Jose Miguel Moreno
 
presentation
presentationpresentation
presentation
jie ren
 
Computer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC AlgorithmComputer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC Algorithm
allyn joy calcaben
 

What's hot (20)

Aaex7 group2(中英夾雜)
Aaex7 group2(中英夾雜)Aaex7 group2(中英夾雜)
Aaex7 group2(中英夾雜)
 
Scalable membership management
Scalable membership management Scalable membership management
Scalable membership management
 
Kmeans with canopy clustering
Kmeans with canopy clusteringKmeans with canopy clustering
Kmeans with canopy clustering
 
Siphon
SiphonSiphon
Siphon
 
Optimal Budget Allocation: Theoretical Guarantee and Efficient Algorithm
Optimal Budget Allocation: Theoretical Guarantee and Efficient AlgorithmOptimal Budget Allocation: Theoretical Guarantee and Efficient Algorithm
Optimal Budget Allocation: Theoretical Guarantee and Efficient Algorithm
 
lecture 4
lecture 4lecture 4
lecture 4
 
Metodo Monte Carlo -Wang Landau
Metodo Monte Carlo -Wang LandauMetodo Monte Carlo -Wang Landau
Metodo Monte Carlo -Wang Landau
 
A study of the worst case ratio of a simple algorithm for simple assembly lin...
A study of the worst case ratio of a simple algorithm for simple assembly lin...A study of the worst case ratio of a simple algorithm for simple assembly lin...
A study of the worst case ratio of a simple algorithm for simple assembly lin...
 
Graph Kernelpdf
Graph KernelpdfGraph Kernelpdf
Graph Kernelpdf
 
Maximizing Submodular Function over the Integer Lattice
Maximizing Submodular Function over the Integer LatticeMaximizing Submodular Function over the Integer Lattice
Maximizing Submodular Function over the Integer Lattice
 
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean OptimizationEfficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
Efficient Hill Climber for Multi-Objective Pseudo-Boolean Optimization
 
Scaling out logistic regression with Spark
Scaling out logistic regression with SparkScaling out logistic regression with Spark
Scaling out logistic regression with Spark
 
Gradient descent optimizer
Gradient descent optimizerGradient descent optimizer
Gradient descent optimizer
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for Images
 
Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.Preemptive RANSAC by David Nister.
Preemptive RANSAC by David Nister.
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
 
Pseudo and Quasi Random Number Generation
Pseudo and Quasi Random Number GenerationPseudo and Quasi Random Number Generation
Pseudo and Quasi Random Number Generation
 
Ultrasound Modular Architecture
Ultrasound Modular ArchitectureUltrasound Modular Architecture
Ultrasound Modular Architecture
 
presentation
presentationpresentation
presentation
 
Computer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC AlgorithmComputer Vision: Feature matching with RANSAC Algorithm
Computer Vision: Feature matching with RANSAC Algorithm
 

Similar to Meta online learning: experiments on a unit commitment problem (ESANN2014)

Introduction to Max-SAT and Max-SAT Evaluation
Introduction to Max-SAT and Max-SAT EvaluationIntroduction to Max-SAT and Max-SAT Evaluation
Introduction to Max-SAT and Max-SAT Evaluation
Masahiro Sakai
 
Confidence interval two tail tests-lower bounds upperbounds
Confidence interval two tail tests-lower bounds upperboundsConfidence interval two tail tests-lower bounds upperbounds
Confidence interval two tail tests-lower bounds upperbounds
ssuser3c3f88
 
Unit 5
Unit 5Unit 5
Unit 5
Unit 5Unit 5
Unit 5
guna287176
 
dynamic programming Rod cutting class
dynamic programming Rod cutting classdynamic programming Rod cutting class
dynamic programming Rod cutting class
giridaroori
 
dynamic-programming
dynamic-programmingdynamic-programming
dynamic-programming
MuhammadSheraz836877
 
Sparsenet
SparsenetSparsenet
Sparsenet
ndronen
 
lecture 1
lecture 1lecture 1
lecture 1
sajinsc
 
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data SetsMethods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
Ryan B Harvey, CSDP, CSM
 
1 d,2d laplace inversion of lr nmr
1 d,2d laplace inversion of lr nmr1 d,2d laplace inversion of lr nmr
1 d,2d laplace inversion of lr nmr
Adam Lee Perelman
 
Parallelising Dynamic Programming
Parallelising Dynamic ProgrammingParallelising Dynamic Programming
Parallelising Dynamic Programming
Raphael Reitzig
 
Computer Science Exam Help
Computer Science Exam Help Computer Science Exam Help
Computer Science Exam Help
Programming Exam Help
 
Druinsky_SIAMCSE15
Druinsky_SIAMCSE15Druinsky_SIAMCSE15
Druinsky_SIAMCSE15
Karen Pao
 
lecture_09.pptx
lecture_09.pptxlecture_09.pptx
lecture_09.pptx
PeruruFamidaNajumun
 
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Jialin LIU
 
lecture01_lecture01_lecture0001_ceva.pdf
lecture01_lecture01_lecture0001_ceva.pdflecture01_lecture01_lecture0001_ceva.pdf
lecture01_lecture01_lecture0001_ceva.pdf
AnaNeacsu5
 
Linear Programming- Leacture-16-lp1.pptx
Linear Programming- Leacture-16-lp1.pptxLinear Programming- Leacture-16-lp1.pptx
Linear Programming- Leacture-16-lp1.pptx
SarahKoech1
 
Algorithm review
Algorithm reviewAlgorithm review
Algorithm review
chidabdu
 
UNIT-II.pptx
UNIT-II.pptxUNIT-II.pptx
UNIT-II.pptx
JyoReddy9
 
My PhD defence
My PhD defenceMy PhD defence
My PhD defence
Jialin LIU
 

Similar to Meta online learning: experiments on a unit commitment problem (ESANN2014) (20)

Introduction to Max-SAT and Max-SAT Evaluation
Introduction to Max-SAT and Max-SAT EvaluationIntroduction to Max-SAT and Max-SAT Evaluation
Introduction to Max-SAT and Max-SAT Evaluation
 
Confidence interval two tail tests-lower bounds upperbounds
Confidence interval two tail tests-lower bounds upperboundsConfidence interval two tail tests-lower bounds upperbounds
Confidence interval two tail tests-lower bounds upperbounds
 
Unit 5
Unit 5Unit 5
Unit 5
 
Unit 5
Unit 5Unit 5
Unit 5
 
dynamic programming Rod cutting class
dynamic programming Rod cutting classdynamic programming Rod cutting class
dynamic programming Rod cutting class
 
dynamic-programming
dynamic-programmingdynamic-programming
dynamic-programming
 
Sparsenet
SparsenetSparsenet
Sparsenet
 
lecture 1
lecture 1lecture 1
lecture 1
 
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data SetsMethods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
 
1 d,2d laplace inversion of lr nmr
1 d,2d laplace inversion of lr nmr1 d,2d laplace inversion of lr nmr
1 d,2d laplace inversion of lr nmr
 
Parallelising Dynamic Programming
Parallelising Dynamic ProgrammingParallelising Dynamic Programming
Parallelising Dynamic Programming
 
Computer Science Exam Help
Computer Science Exam Help Computer Science Exam Help
Computer Science Exam Help
 
Druinsky_SIAMCSE15
Druinsky_SIAMCSE15Druinsky_SIAMCSE15
Druinsky_SIAMCSE15
 
lecture_09.pptx
lecture_09.pptxlecture_09.pptx
lecture_09.pptx
 
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
 
lecture01_lecture01_lecture0001_ceva.pdf
lecture01_lecture01_lecture0001_ceva.pdflecture01_lecture01_lecture0001_ceva.pdf
lecture01_lecture01_lecture0001_ceva.pdf
 
Linear Programming- Leacture-16-lp1.pptx
Linear Programming- Leacture-16-lp1.pptxLinear Programming- Leacture-16-lp1.pptx
Linear Programming- Leacture-16-lp1.pptx
 
Algorithm review
Algorithm reviewAlgorithm review
Algorithm review
 
UNIT-II.pptx
UNIT-II.pptxUNIT-II.pptx
UNIT-II.pptx
 
My PhD defence
My PhD defenceMy PhD defence
My PhD defence
 

Recently uploaded

The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
OECD Directorate for Financial and Enterprise Affairs
 
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussionPro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
OECD Directorate for Financial and Enterprise Affairs
 
Prsentation for VIVA Welike project 1semester.pptx
Prsentation for VIVA Welike project 1semester.pptxPrsentation for VIVA Welike project 1semester.pptx
Prsentation for VIVA Welike project 1semester.pptx
prafulpawar29
 
2 December UAE National Day - United Arab Emirates
2 December UAE National Day - United Arab Emirates2 December UAE National Day - United Arab Emirates
2 December UAE National Day - United Arab Emirates
UAE Ppt
 
怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样
怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样
怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样
kekzed
 
The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...
The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...
The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...
OECD Directorate for Financial and Enterprise Affairs
 
Pro-competitive Industrial Policy – LANE – June 2024 OECD discussion
Pro-competitive Industrial Policy – LANE – June 2024 OECD discussionPro-competitive Industrial Policy – LANE – June 2024 OECD discussion
Pro-competitive Industrial Policy – LANE – June 2024 OECD discussion
OECD Directorate for Financial and Enterprise Affairs
 
Gamify it until you make it Improving Agile Development and Operations with ...
Gamify it until you make it  Improving Agile Development and Operations with ...Gamify it until you make it  Improving Agile Development and Operations with ...
Gamify it until you make it Improving Agile Development and Operations with ...
Ben Linders
 
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
gpww3sf4
 
ServiceNow CIS-ITSM Exam Dumps & Questions [2024]
ServiceNow CIS-ITSM Exam Dumps & Questions [2024]ServiceNow CIS-ITSM Exam Dumps & Questions [2024]
ServiceNow CIS-ITSM Exam Dumps & Questions [2024]
SkillCertProExams
 
ACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPE
ACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPEACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPE
ACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPE
Charmi13
 
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
OECD Directorate for Financial and Enterprise Affairs
 
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
OECD Directorate for Financial and Enterprise Affairs
 
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
OECD Directorate for Financial and Enterprise Affairs
 
Legislation And Regulations For Import, Manufacture,.pptx
Legislation And Regulations For Import, Manufacture,.pptxLegislation And Regulations For Import, Manufacture,.pptx
Legislation And Regulations For Import, Manufacture,.pptx
Charmi13
 
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdfBRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
Robin Haunschild
 
IEEE CIS Webinar Sustainable futures.pdf
IEEE CIS Webinar Sustainable futures.pdfIEEE CIS Webinar Sustainable futures.pdf
IEEE CIS Webinar Sustainable futures.pdf
Claudio Gallicchio
 
Proposal: The Ark Project and The BEEP Inc
Proposal: The Ark Project and The BEEP IncProposal: The Ark Project and The BEEP Inc
Proposal: The Ark Project and The BEEP Inc
Raheem Muhammad
 
Disaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other usesDisaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other uses
RIDHIMAGARG21
 
Genesis chapter 3 Isaiah Scudder.pptx
Genesis    chapter 3 Isaiah Scudder.pptxGenesis    chapter 3 Isaiah Scudder.pptx
Genesis chapter 3 Isaiah Scudder.pptx
FamilyWorshipCenterD
 

Recently uploaded (20)

The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
 
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussionPro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
 
Prsentation for VIVA Welike project 1semester.pptx
Prsentation for VIVA Welike project 1semester.pptxPrsentation for VIVA Welike project 1semester.pptx
Prsentation for VIVA Welike project 1semester.pptx
 
2 December UAE National Day - United Arab Emirates
2 December UAE National Day - United Arab Emirates2 December UAE National Day - United Arab Emirates
2 December UAE National Day - United Arab Emirates
 
怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样
怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样
怎么办理(lincoln学位证书)英国林肯大学毕业证文凭学位证书原版一模一样
 
The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...
The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...
The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...
 
Pro-competitive Industrial Policy – LANE – June 2024 OECD discussion
Pro-competitive Industrial Policy – LANE – June 2024 OECD discussionPro-competitive Industrial Policy – LANE – June 2024 OECD discussion
Pro-competitive Industrial Policy – LANE – June 2024 OECD discussion
 
Gamify it until you make it Improving Agile Development and Operations with ...
Gamify it until you make it  Improving Agile Development and Operations with ...Gamify it until you make it  Improving Agile Development and Operations with ...
Gamify it until you make it Improving Agile Development and Operations with ...
 
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
 
ServiceNow CIS-ITSM Exam Dumps & Questions [2024]
ServiceNow CIS-ITSM Exam Dumps & Questions [2024]ServiceNow CIS-ITSM Exam Dumps & Questions [2024]
ServiceNow CIS-ITSM Exam Dumps & Questions [2024]
 
ACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPE
ACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPEACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPE
ACTIVE IMPLANTABLE MEDICAL DEVICE IN EUROPE
 
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
 
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
 
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
 
Legislation And Regulations For Import, Manufacture,.pptx
Legislation And Regulations For Import, Manufacture,.pptxLegislation And Regulations For Import, Manufacture,.pptx
Legislation And Regulations For Import, Manufacture,.pptx
 
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdfBRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
 
IEEE CIS Webinar Sustainable futures.pdf
IEEE CIS Webinar Sustainable futures.pdfIEEE CIS Webinar Sustainable futures.pdf
IEEE CIS Webinar Sustainable futures.pdf
 
Proposal: The Ark Project and The BEEP Inc
Proposal: The Ark Project and The BEEP IncProposal: The Ark Project and The BEEP Inc
Proposal: The Ark Project and The BEEP Inc
 
Disaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other usesDisaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other uses
 
Genesis chapter 3 Isaiah Scudder.pptx
Genesis    chapter 3 Isaiah Scudder.pptxGenesis    chapter 3 Isaiah Scudder.pptx
Genesis chapter 3 Isaiah Scudder.pptx
 

Meta online learning: experiments on a unit commitment problem (ESANN2014)

  • 1. MetaOnline Learning: Experimentsona Unit CommitmentProblem Jialin Liu, Olivier Teytaud liu@lri.fr,teytaud@lri.fr Black-box Noisy Optimization Objective function fitness : Rd ! R Optimum = argmin 2Rd fitness() Some NOAs RSAES: Self-Adaptive Evolution Strategy with resampling; Fabian’s algorithm: a first-order method using gradients estimated by finite differences[?, ?]; Noisy Newton’s algorithm: a second-order method using a Hessian matrix approxi-mated also by finite differences[?]. Compare Solvers Early kn n: lag Why this lag ? (i) comparing current recommendations ! comparing good points ! very close fitness ! very expensive (ii) algorithms’ ranking is usually stable ! let us save up time by comparing older recommendations Solvers and Notations : parent pop. size in ES : pop. size in ES d: search space dimension n: generation index n : stepsize at generation n rn: resampling number at generation n For all NOPA: kn = dn0:1e rn = n3 sn = 15n Table 1: Solvers in experiments Notation Algorithm and parametrization RSAES = 10d, = 5d, rn = 10n2 Fabian1 n = 10=n0:49, a = 100 Fabian2 n = 10=n0:05, a = 100 Newton1 n = 10=n, rn = n2 Newton2 n = 100=n4, rn = n2 P:12345 NOPA of 5 solvers above P:12345 + S: P:12345 with information sharing. P:22 NOPA of 2 (identical) Fabian1 P:22 + S: P:22 with information sharing. P:222 NOPA of 3 (identical) Fabian1 P:222 + S: P:222 with information sharing. Some References Abstract Online learning = real time machine learning “on the fly” Meta online learning = combining several online learning algorithms from a given set (termed portfolio) of algorithms ' combining Noisy Optimization Algorithms (NOPA=noisy optimiza-tion portfolio algorithm). Goals: (i) mitigating the effect of a bad choice of online learning algorithms (ii) parallelization (iii) combining the strengths of different algorithms. This paper: - Portfolio = classical for combinatorial optimization: we test portfolios for noisy optimization. - Recently, a methodology termed lag has been proposed for NOPA. We test experimentally the lag methodology for various problems. Noisy Optimization Portfolio Algorithm (NOPA) Iteration n of the portfolio fS1; : : : ; SMg containing M NOAs: Initialization module: If n = 0 initialize all Si, i 2 f1; : : : ;Mg. For i 2 f1; : : : ;Mg: – Update module: Apply an iteration of solver Si until it has received at least n data samples. – Let i;n be the current recommendation by solver Si. Comparison module: If n = rm for some m, then – For i 2 f1; : : : ;Mg, perform sm evaluations of the (stochastic) reward R(i;kn) and define yi the average reward. – Define i arg mini2f1;:::;Mg yi. Recommendation module: ~ = i;n Experiments Table 2: Artificial problem R() = jj jj2 + jj jjz Gaussian. n: evaluation number. z = rate at which the variance decreases around the optimum. z Comparison of log(R(~n))= log(n) for d = 2 Comparison of log(R(~n))= log(n) for d = 5 0 Newton1 RSAES ' P:12345 : : : Newton1 RSAES ' P:12345 : : : 1 P:12345 Fabian1 ' P:22 : : : Fabian1 P:22 P:222 : : : 2 P:12345 Fabian1 ' P:22 : : : Fabian1 P:12345 P:22 : : : Discussion: NOPAs are usually not far from the best of their NOAs. In small dimension with noise variance decreazing quickly to 0 around optimum (z = 2), NOPA outperforms all its NOAs. Table 3: Stochastic Unit Commitment problems, conformant planning. St: number of stocks. Problem size Considered NOA or NOPA St, T, d P:22 P:22 + S: P:222 P:222 + S: Best NOA Worst NOA 3, 21, 63 0.61 0.07 0.63 0.03 0.63 0.05 0.63 0.07 0.49 0.08 0.81 0.05 4, 21, 84 0.75 0.02 0.75 0.03 0.79 0.05 0.76 0.03 0.69 0.06 1.27 0.06 5, 21, 105 0.53 0.04 0.58 0.08 0.58 0.03 0.52 0.05 0.58 0.04 1.44 0.16 6, 15, 90 0.40 0.05 0.39 0.06 0.37 0.06 0.39 0.06 0.38 0.06 0.96 0.13 6, 21, 126 0.53 0.08 0.54 0.08 0.55 0.07 0.54 0.07 0.54 0.07 1.78 0.37 8, 15, 120 0.53 0.03 0.50 0.05 0.53 0.02 0.51 0.05 0.51 0.04 1.70 0.10 8, 21, 168 0.69 0.04 0.77 0.09 0.73 0.06 0.71 0.04 0.71 0.06 2.68 0.02 7, 21, 147 0.70 0.07 0.70 0.05 0.70 0.07 0.70 0.07 0.69 0;06 2.28 0.08 Discussion: Given a same budget, a NOPA of identical solvers can outperform its NOAs. RSAES is usually the best NOA for small dimensions and variants of Fabian for large dimension. Table 4: Approximate convergence rates log(R(~n))= log(n) for Cart-Pole, a multimodal problem, using NN. n: evaluation number. Solver 2 neurons, d = 9 4 neurons, d = 17 8 neurons, d = 33 1 (RSAES) -0.4580330.045014 -0.4215350.045643 -0.3517260.051705 2 (Fabian1) 0.0022265.29923e-05 0.0020891.57766e-04 0.002218.14518e-05 3 (Fabian2) 0.0023189.80792e-05 0.0022381.14289e-04 0.002361.51244e-04 4 (Newton1) 0.0022296.08973e-05 -0.0307310.111294 0.0022471.19829e-04 5 (Newton2) 0.002275.2989e-05 0.0022177.80888e-05 0.0023079.96404e-05 6 (P:12345) -0.4087050.068428 -0.39170.071791 -0.3203990.050338 7 (P:12345 + S:) -0.427430.05709 -0.4037070.056173 -0.3540430.069576 Discussion: Fabian and Newton can’t solve this multimodal problem ) one solver is much better than others ) easy for NOPA. Conclusion Main conclusion: Usual: Portfolio of Algorithms for Combinatorial Optimization; New: Portfolio of Algorithms for Noisy Optimization. “Sharing” not that good. NOPA sometimes better than NOA even if all NOA equal! We show mathematically[?] and empirically a log(M) shift when using M solvers, when working on the log-log scale (usual scale in noisy optimization). Portfolio = approximately as efficient as the best - except when one iteration of one algorithm monopolizes most of the budget - as RSAES in the unit commitment problem.