SlideShare a Scribd company logo
IFAC 
2014 
CAPE 
TOWN 
-­‐ZA 
Universal approximators for Direct Policy Search 
in multi-purpose water reservoir management: A 
comparative analysis 
Matteo Giuliani, Emanuele Mason, Andrea Castelletti, Francesca Pianosi, 
Rodolfo Soncini-Sessa 
Dipartimento di Elettronica, Informazione, e Bioingegneria, Politecnico di Milano, Milano, Italy 
Hydroinformatics Lab, Como Campus, Politecnico di Milano, Italy 
Department of Civil and Environmental Engineering, University of Bristol, Bristol, UK 
Modelling 
and 
Control 
of 
Water 
Systems
Controlling hydro-environmental systems 
The long-term optimal operation of hydro-environmental systems can be 
formulated as a q-objective stochastic optimal control problem 
min 
μt(·) 
J = |J1 J2...J q| 
xt+1 = ft(xt, ut, t+1) 
Ji = lim 
h⇤⌅ 
E 
⇥h1 
⇥ 
 
h⇤−1 
t=0 
i-th immediate cost 
i = 1, t(xt, ut, ⇥t+1) 
!tgi 
⇥ 
i-th objective discount factor 
state control disturbance 
subject to 
! 
ut = μt(xt) 
t+1 ⇠ (·) 
xt 2 Rnx 
ut 2 Rnu 
t 2 Rn
SDP and the 3 curses 
Stochastic Dynamic Programming is - in principle - the best approach 
to solve the problem - in practice - it suffers from 3 major shortcomings 
1) Curse of dimensionality: computational cost grows exponentially with 
state, control and disturbance dimension [Bellman, 1967]; 
ut 
Qt 
ut 
xt 
Look-up table 
Q-function 
unknown 
Q-function 
computations are numerically 
performed on a discretized variable 
domain
SDP and the 3 curses 
Stochastic Dynamic Programming is - in principle - the best approach 
to solve the problem - in practice - it suffers from 3 major shortcomings 
1) Curse of dimensionality: computational cost grows exponentially with 
state, control and disturbance dimension [Bellman, 1967]; 
ut 
Qt 
ut 
xt 
Look-up table 
Q-function 
unknown 
Q-function 
computations are numerically 
performed on a discretized variable 
domain 
2) Curse of modelling: any variable considered among the operating rule’s 
arguments has to be modelled [Bertsekas and Tsitsiklis, 1996]; 
t t+1 time 
xt 
ut, t+1 
models are use in a multiple one-step- 
ahead-simulation mode
SDP and the 3 curses 
Stochastic Dynamic Programming is - in principle - the best approach 
to solve the problem - in practice - it suffers from 3 major shortcomings 
3) Curse of multiple objectives: computational cost grows exponentially 
with the number of objectives considered [Powell, 2011]. 
PARETO frontier 
multi-objective problems are solved 
by reiteratively solving single 
objective problems 
J1 
J2 
J3
Beyond SDP: ADP and RL 
Approximate Dynamic Programming and Reinforcement Learning 
provide a framework to overcome some or all the SDP’s curses. 
[Powell, 2007; Busoniu et al. 2011 
VALUE FUNCTION-BASED APPROCHES: 
• Approximate value iteration 
• Approximate policy iteration 
• Approximate policy evaluation 
Model-free or model-based // parametric or non-parametric 
POLICY SEARCH-BASED APPROACHES: 
• Direct policy search 
Simulation-based optimization // parametric
Beyond SDP: ADP and RL 
Approximate Dynamic Programming and Reinforcement Learning 
provide a framework to overcome some or all the SDP’s curses. 
[Powell, 2007; Busoniu et al. 2011 
VALUE FUNCTION-BASED APPROCHES: 
• Approximate value iteration 
• Approximate policy iteration 
• Approximate policy evaluation 
Model-free or model-based // parametric or non-parametric 
POLICY SEARCH-BASED APPROACHES: 
• Direct policy search 
Simulation-based optimization // parametricv
Multi-objective Direct Policy Search (MODPS) 
Assuming the operating rule belong to a given family of functions and 
search the optimal solution in the policy’s parameter space 
ORIGINAL PROBLEM POLICY SEARCH PROBLEM 
min 
μt(·) 
J = |J1 J2...J q| 
subject to 
! 
xt+1 = ft(xt, ut, t+1) 
ut = μt(xt) 
t+1 ⇠ (·) 
xt 2 Rnx 
ut 2 Rnu 
t 2 Rn 
ut = μt(xt, ✓t) 
min 
μt(·) 
J = |J1 J2...J q| 
✓t 
subject to 
! 
xt+1 = ft(xt, ut, t+1) 
ut = μt(xt,✓t) 
t+1 ⇠ (·) 
xt 2 Rnx 
ut 2 Rnu 
t 2 Rn 
✓t 2 ⇥t2 Rn✓
Selecting the policy approximation: Ad hoc/Empirism 
WHEN 
1. The system is already operated 
500! 
450! 
400! 
350! 
300! 
250! 
200! 
1 
150! 
100! 
3 
2 4 
0 25 50 75 100 125 150 175 200 225 
250! 
50! 
0! 
release [m3/s] 
storage [Mm3] 
5 
Identify existing regularities in a 
sample of the operator behaviour 
2. the system is simple (i.e. one reservoir) AND/OR the systems has 
one single objective (e.g. water supply) 
• NEW York City rule [Clark, 1950] 
• Space rule [Clark, 1956] 
• Standard Operating Policy [Draper, 2004] 
• ….. 
Empirical rules identified in 
the past
Selecting the policy approximation: Universal Approx. 
Provided that some conditions are met, an Universal Approximator is 
approximate arbitrarily closely every continuous function. 
ARTIFICIAL NEURAL NETWORKS [Cybenko 1989, Funahashi 1989, Hornik et al. 1989] 
Parameter dimension 
n✓ = nu(N(nx + 2) + 1) 
GAUSSIAN RADIAL BASIS FUNCTIONS [Busoniu et al. 2011] 
Number of NEURONS 
Parameter dimension 
n✓ = N(2nx + nu) 
Number of BASES 
x1 
x2 
x3 
u1 
x1 
x2 
x3 
u1
Selecting the optimization algorithm 
Key problem features 
• High dimensional search spaces (rich parameterizations) 
• Complex search spaces (many local minima) 
• Sensitivity to parameter initialization (no-preconditioning) 
• Multiple objectives 
• Non differentiable objective functions 
• Sensitivity to noise
Selecting the optimization algorithm 
Key problem features 
• High dimensional search spaces (rich parameterizations) 
• Complex search spaces (many local minima) 
• Sensitivity to parameter initialization (no-preconditioning) 
• Multiple objectives 
• Non differentiable objective functions 
• Sensitivity to noise 
BORG [Hadka and Reed 2012; Reed et al. 2013] 
a MULTI-OBJECTIVE EVOLUTIONARY ALGORITHM 
BORG is self-adaptive and employs 
• multiple search operators adaptively selected during the optimization 
• e-dominance archiving with internal operators to detect search stagnation 
• randomized restarts to escape local optima
CASE STUDY
Red-Thai Binh River System - Vietnam 
HaGiang BaoLac 
Hanoi 
BacMe 
Thao Lo 
HoaBinh 
TamDuong 
NamGiang 
TaBu 
MuongTe 
LaiChau 
YenBai VuQuang 
VIETNAM 
CHINA 
LAOS 
THAILAND 
CAMBODIA 
Da 
Integrated Management of Red-Thai Binh Rivers System (IMRR) funded by the Italian 
Ministry of Foreign Affairs http://www.imrr.info/
Hoa Binh reservoir - Vietnam 
Main characteristics 
• Catchment area 52,000 km2 
• Active capacity 6 x 109 m3 
• 8 penstocks 2,360 m3/s (240 MW) 
• 12 bottom gates 22,000 m3/s 
• 6 spillways 14,000 m3/s 
• 15% national energy (7,800 GWh) 
source: IWRP2008 
Operating objectives 
• Hydropower production 
• Flood control (Hanoi) 
RESERVOIR 
CATCHMENT 
POWER PLANT 
DIVERSION DAM 
COMSUMPTIVE USE THAO 
LO 
DA 
HOA 
BINH
Experimental Setting: ANN vs RBF 
STATE VECTOR (n_x=5) 
• 2 time indexes (sin, cosin) 
• Storage 
• Previous day inflow to reservoir 
• Previous day lateral inflow 
CONTROL VECTOR (n_u=1) 
• release from the reservoir 
RESERVOIR 
CATCHMENT 
POWER PLANT 
DIVERSION DAM 
COMSUMPTIVE USE THAO 
LO 
DA 
HOA 
BINH 
ALGORITHM SETTING and RUNNING 
• Default Borg MOEA parameterization [Hadka and Reed 2013] 
• NFE = 500,000 per replication 
• 20 replications to avoid dependence on randomness 
• Historical horizon 1962-1969, which comprises normal, wet and dry years
Policy perfomance – operating objectives 
2.6 x 107 
2.5 
2.4 
2.3 
2.2 
2.1 
2 
Legend: ANN - RBF 
number of neurons/basis 
1.80 100 200 300 400 500 600 700 
(b) Generational distance (c) Additive ε-indicator (d) Hypervolume 
0.35 
0.3 
0.25 
0.2 
0.15 
0.1 
0.05 
0 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 
4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 
# of neurons/basis # of neurons/basis # of neurons/basis 
1.9 
Jflo 
Jhyd 
468 
10 
12 
14 
16 
(a) Policy Performance with different ANN and RBF architectures 
ANN 
RBF 
Hydropower 
-­‐ 
kWh/d 
Floods 
– 
cm2/d
Policy perfomance – front approximation quality 
2.6 x 107 
2.5 
2.4 
2.3 
2.2 
2.1 
2 
Legend: ANN - RBF 
number of neurons/basis 
1.80 100 200 300 400 500 600 700 
(b) Generational distance (c) Additive ε-indicator (d) Hypervolume 
0.35 
0.3 
0.25 
0.2 
0.15 
0.1 
0.05 
0 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 
4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 
# of neurons/basis # of neurons/basis # of neurons/basis 
1.9 
Jflo 
Jhyd 
468 
10 
12 
14 
16 
(a) Policy Performance with different ANN and RBF architectures 
ANN 
RBF 
Hydropower 
-­‐ 
kWh/d 
Floods 
– 
cm2/d 
CONVERGENCE CONSISTENCY DIVERSITY
Policy reliability 
(a) 75% best metric value (b) 95% best metric value 
4 6 8 101214 16 4 6 8 10 121416 
1 
0.8 
0.6 
0.4 
0.2 
0 
1 
0.8 
0.6 
0.4 
0.2 
1 
0.8 
0.6 
0.4 
0.2 
0 
4 6 8 101214 16 4 6 8 10 121416 
0 
4 6 8 101214 16 4 6 8 10 121416 
1 
0.8 
0.6 
0.4 
0.2 
0 
4 6 8 101214 16 4 6 8 10 121416 
hypervolume additive ε-indicator generational distance 
4 6 8 101214 16 4 6 8 10 121416 
4 6 8 101214 16 4 6 8 10 121416 
1 
0.8 
0.6 
0.4 
0.2 
0 
1 
0.8 
0.6 
0.4 
0.2 
0 
# of neurons/basis # of neurons/basis 
ANN 
RBF 
CONVERGENCE 
CONSISTENCY 
DIVERSITY 
FIG. 3. Probability of attainment with a threshold equal to 75% (a) and to 95% (b) of
Run time search dynamics (NFA = 2M) 
(a) Generational distance (b) Additive ε−indicator (c) Hypervolume 
ANN ( 6 neurons ) 
RBF (6 bases) 
0 0.5 1 1.5 2 
1.4 
1.2 
1 
0.8 
0.6 
0.4 
0.2 
0 
0 0.5 1 1.5 2 
1.4 
1.2 
1 
0.8 
0.6 
0.4 
0.2 
0 
0 0.5 1 1.5 2 
0.9 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 
NFA (x106) NFA (x106) NFA (x106) 
FIG. 4. Analysis CONVERGENCE of runtime search dynamics CONSISTENCY for ANN (red lines) and DIVERSITY 
RBF (blue lines) 
operating policy optimization in terms of generational distance (a), additive -indicator 
(b), and hypervolume (c).
Policy validation 
2.6 x 107 
2.4 
2.2 
2 
1.80 100 200 300 400 500 600 700 800 900 
Jflo 
Jhyd 
2.6 x 107 
2.4 
2.2 
2 
1.80 100 200 300 400 500 600 700 800 900 
Jflo 
Jhyd 
ANN 
RBF 
ANN 
RBF 
(a) Results over the optimization horizon (1962-1969) 
(b) Results over the validation horizon (1995-2004) 
ANN 
RBF 
Hydropower 
-­‐ 
kWh/d 
Floods 
– 
cm2/d 
Hydropower 
-­‐ 
kWh/d 
Floods 
– 
cm2/d
Conclusions 
§ MODPS is an interesting alternative to SDP familiy methods for a number of 
good reasons 
1. No discretization required: NO curse of dimensionality; 
2. Does not require separability in time of constraints and objective 
functions (e.g. duration curves): NO curse of dimensionality; 
3. Can easily include any model-free information as long as this is control-indipendent: 
NO curse of modelling; 
4. Can be combined with any simulation model (also high fidelity ones): NO 
curse of modelling; 
5. Can be easily combined with truly multi-objective optimization 
algorithms: NO curse of the multiple objectives.
Conclusions 
§ RBFs and ANNs seem to perform comparatively well when evaluated 
in terms of policy performance 
§ RBFs outperform ANNs in terms of quality of the Pareto front 
approximation, reliability and run time search dynamics 
§ Future works will focus on exploring multiple output policies (e.g. 
network of reservoirs)
THANKS

More Related Content

What's hot

Manual de sams 2009
Manual de sams 2009Manual de sams 2009
Manual de sams 2009
Juan Torres Ñaccha
 
Tyler A - UEI Day 1 - Kochi Jan18
Tyler A - UEI Day 1 - Kochi Jan18Tyler A - UEI Day 1 - Kochi Jan18
Tyler A - UEI Day 1 - Kochi Jan18
India UK Water Centre (IUKWC)
 
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
India UK Water Centre (IUKWC)
 
Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...
AngelosAlamanos
 
2004 jh cypruspp
2004 jh cypruspp2004 jh cypruspp
2004 jh cypruspp
Alberto Villalobos Silva
 
Adam Lewis–SPEDDEXES 2014
Adam Lewis–SPEDDEXES 2014Adam Lewis–SPEDDEXES 2014
Adam Lewis–SPEDDEXES 2014
aceas13tern
 
Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...
Alexander Decker
 
APCBEE
APCBEEAPCBEE
Unit cost of water
Unit cost of waterUnit cost of water
Unit cost of water
Ezekiel Adelere Adeniran
 
Developing best practice for infilling daily river flow data
Developing best practice for infilling daily river flow dataDeveloping best practice for infilling daily river flow data
Developing best practice for infilling daily river flow data
hydrologywebsite1
 
DEM-based Methods for Flood Risk Mapping at Large Scale
DEM-based Methods for Flood Risk Mapping at Large ScaleDEM-based Methods for Flood Risk Mapping at Large Scale
DEM-based Methods for Flood Risk Mapping at Large Scale
Salvatore Manfreda
 
StrataNY_2011_ferrari
StrataNY_2011_ferrariStrataNY_2011_ferrari
StrataNY_2011_ferrari
Michael Ferrari
 
Flood risk modelling and assessment for community resilience
Flood risk modelling and assessment for community resilienceFlood risk modelling and assessment for community resilience
Flood risk modelling and assessment for community resilience
Albert Chen
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...
DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...
DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...
Deltares
 
AAS National Conference 2008: Gary Davis
AAS National Conference 2008: Gary DavisAAS National Conference 2008: Gary Davis
AAS National Conference 2008: Gary Davis
American Astronautical Society
 
IMWA2011_Turner_311
IMWA2011_Turner_311IMWA2011_Turner_311
IMWA2011_Turner_311
Alison Turner
 
Comparison of actual release schedule and optimal
Comparison of actual release schedule and optimalComparison of actual release schedule and optimal
Comparison of actual release schedule and optimal
IAEME Publication
 
3 sites selection and technology options for a marine energy system
3 sites selection and technology options for a marine energy system3 sites selection and technology options for a marine energy system
3 sites selection and technology options for a marine energy system
02shubhank1997
 
Water resource management at catchment scales using lightweight UAVs: current...
Water resource management at catchment scales using lightweight UAVs: current...Water resource management at catchment scales using lightweight UAVs: current...
Water resource management at catchment scales using lightweight UAVs: current...
Judit Urquijo
 

What's hot (20)

Manual de sams 2009
Manual de sams 2009Manual de sams 2009
Manual de sams 2009
 
Tyler A - UEI Day 1 - Kochi Jan18
Tyler A - UEI Day 1 - Kochi Jan18Tyler A - UEI Day 1 - Kochi Jan18
Tyler A - UEI Day 1 - Kochi Jan18
 
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
 
Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...
 
2004 jh cypruspp
2004 jh cypruspp2004 jh cypruspp
2004 jh cypruspp
 
Adam Lewis–SPEDDEXES 2014
Adam Lewis–SPEDDEXES 2014Adam Lewis–SPEDDEXES 2014
Adam Lewis–SPEDDEXES 2014
 
Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...
 
APCBEE
APCBEEAPCBEE
APCBEE
 
Unit cost of water
Unit cost of waterUnit cost of water
Unit cost of water
 
Developing best practice for infilling daily river flow data
Developing best practice for infilling daily river flow dataDeveloping best practice for infilling daily river flow data
Developing best practice for infilling daily river flow data
 
DEM-based Methods for Flood Risk Mapping at Large Scale
DEM-based Methods for Flood Risk Mapping at Large ScaleDEM-based Methods for Flood Risk Mapping at Large Scale
DEM-based Methods for Flood Risk Mapping at Large Scale
 
StrataNY_2011_ferrari
StrataNY_2011_ferrariStrataNY_2011_ferrari
StrataNY_2011_ferrari
 
Flood risk modelling and assessment for community resilience
Flood risk modelling and assessment for community resilienceFlood risk modelling and assessment for community resilience
Flood risk modelling and assessment for community resilience
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...
DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...
DSD-INT 2015 - Developing an operational hydrologic forecast system using EPS...
 
AAS National Conference 2008: Gary Davis
AAS National Conference 2008: Gary DavisAAS National Conference 2008: Gary Davis
AAS National Conference 2008: Gary Davis
 
IMWA2011_Turner_311
IMWA2011_Turner_311IMWA2011_Turner_311
IMWA2011_Turner_311
 
Comparison of actual release schedule and optimal
Comparison of actual release schedule and optimalComparison of actual release schedule and optimal
Comparison of actual release schedule and optimal
 
3 sites selection and technology options for a marine energy system
3 sites selection and technology options for a marine energy system3 sites selection and technology options for a marine energy system
3 sites selection and technology options for a marine energy system
 
Water resource management at catchment scales using lightweight UAVs: current...
Water resource management at catchment scales using lightweight UAVs: current...Water resource management at catchment scales using lightweight UAVs: current...
Water resource management at catchment scales using lightweight UAVs: current...
 

Similar to Universal approximators for Direct Policy Search in multi-purpose water reservoir management

Particle Learning in Online Tool Wear Diagnosis and Prognosis
Particle Learning in Online Tool Wear Diagnosis and PrognosisParticle Learning in Online Tool Wear Diagnosis and Prognosis
Particle Learning in Online Tool Wear Diagnosis and Prognosis
Jianlei Zhang, PhD
 
Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...
Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...
Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...
IFPRI-EPTD
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Chih-Chuan Cheng
 
presentation_btp
presentation_btppresentation_btp
presentation_btp
Anshul Goyal, EIT
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer Chemistry
Preferred Networks
 
Efficient analytical and hybrid simulations using OpenSees
Efficient analytical and hybrid simulations using OpenSeesEfficient analytical and hybrid simulations using OpenSees
Efficient analytical and hybrid simulations using OpenSees
openseesdays
 
Implementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitationsImplementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitations
Luis Úbeda Medina
 
Industrial plant optimization in reduced dimensional spaces
Industrial plant optimization in reduced dimensional spacesIndustrial plant optimization in reduced dimensional spaces
Industrial plant optimization in reduced dimensional spaces
Capstone
 
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Globus
 
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
Salford Systems
 
ODVSML_Presentation
ODVSML_PresentationODVSML_Presentation
ODVSML_Presentation
Shounak Mitra
 
IEEE International Conference Presentation
IEEE International Conference PresentationIEEE International Conference Presentation
IEEE International Conference Presentation
Anmol Dwivedi
 
Modeling full scale-data(2)
Modeling full scale-data(2)Modeling full scale-data(2)
Modeling full scale-data(2)
John B. Cook, PE, CEO
 
MOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_WeiyangMOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_Weiyang
MDO_Lab
 
Hairong Qi V Swaminathan
Hairong Qi V SwaminathanHairong Qi V Swaminathan
Hairong Qi V Swaminathan
FNian
 
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
Aminullah Assagaf
 
Application of Artificial Neural Network (Ann) In Operation of Reservoirs
Application of Artificial Neural Network (Ann) In Operation of ReservoirsApplication of Artificial Neural Network (Ann) In Operation of Reservoirs
Application of Artificial Neural Network (Ann) In Operation of Reservoirs
IOSR Journals
 
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum DataAutomated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
aimsnist
 
BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...
IJAEMSJORNAL
 
Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul...
 Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul... Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul...
Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul...
Murilo Camargos
 

Similar to Universal approximators for Direct Policy Search in multi-purpose water reservoir management (20)

Particle Learning in Online Tool Wear Diagnosis and Prognosis
Particle Learning in Online Tool Wear Diagnosis and PrognosisParticle Learning in Online Tool Wear Diagnosis and Prognosis
Particle Learning in Online Tool Wear Diagnosis and Prognosis
 
Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...
Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...
Biosight: Quantitative Methods for Policy Analysis: Stochastic Dynamic Progra...
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
 
presentation_btp
presentation_btppresentation_btp
presentation_btp
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer Chemistry
 
Efficient analytical and hybrid simulations using OpenSees
Efficient analytical and hybrid simulations using OpenSeesEfficient analytical and hybrid simulations using OpenSees
Efficient analytical and hybrid simulations using OpenSees
 
Implementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitationsImplementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitations
 
Industrial plant optimization in reduced dimensional spaces
Industrial plant optimization in reduced dimensional spacesIndustrial plant optimization in reduced dimensional spaces
Industrial plant optimization in reduced dimensional spaces
 
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
Real-Time Analysis of Streaming Synchotron Data: SCinet SC19 Technology Chall...
 
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...
 
ODVSML_Presentation
ODVSML_PresentationODVSML_Presentation
ODVSML_Presentation
 
IEEE International Conference Presentation
IEEE International Conference PresentationIEEE International Conference Presentation
IEEE International Conference Presentation
 
Modeling full scale-data(2)
Modeling full scale-data(2)Modeling full scale-data(2)
Modeling full scale-data(2)
 
MOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_WeiyangMOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_Weiyang
 
Hairong Qi V Swaminathan
Hairong Qi V SwaminathanHairong Qi V Swaminathan
Hairong Qi V Swaminathan
 
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
 
Application of Artificial Neural Network (Ann) In Operation of Reservoirs
Application of Artificial Neural Network (Ann) In Operation of ReservoirsApplication of Artificial Neural Network (Ann) In Operation of Reservoirs
Application of Artificial Neural Network (Ann) In Operation of Reservoirs
 
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum DataAutomated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
 
BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...BPSO&1-NN algorithm-based variable selection for power system stability ident...
BPSO&1-NN algorithm-based variable selection for power system stability ident...
 
Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul...
 Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul... Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul...
Evolving Fuzzy System Applied to Battery Charge Capacity Prediction for Faul...
 

Recently uploaded

LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
Anant Corporation
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
riddhimaagrawal986
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
shadow0702a
 
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
KrishnaveniKrishnara1
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
ramrag33
 
Mechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdfMechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdf
21UME003TUSHARDEB
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
IJECEIAES
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
PKavitha10
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
ElakkiaU
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
bjmsejournal
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
Madan Karki
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
RamonNovais6
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
gowrishankartb2005
 

Recently uploaded (20)

LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
 
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
 
Mechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdfMechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdf
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
 

Universal approximators for Direct Policy Search in multi-purpose water reservoir management

  • 1. IFAC 2014 CAPE TOWN -­‐ZA Universal approximators for Direct Policy Search in multi-purpose water reservoir management: A comparative analysis Matteo Giuliani, Emanuele Mason, Andrea Castelletti, Francesca Pianosi, Rodolfo Soncini-Sessa Dipartimento di Elettronica, Informazione, e Bioingegneria, Politecnico di Milano, Milano, Italy Hydroinformatics Lab, Como Campus, Politecnico di Milano, Italy Department of Civil and Environmental Engineering, University of Bristol, Bristol, UK Modelling and Control of Water Systems
  • 2. Controlling hydro-environmental systems The long-term optimal operation of hydro-environmental systems can be formulated as a q-objective stochastic optimal control problem min μt(·) J = |J1 J2...J q| xt+1 = ft(xt, ut, t+1) Ji = lim h⇤⌅ E ⇥h1 ⇥ h⇤−1 t=0 i-th immediate cost i = 1, t(xt, ut, ⇥t+1) !tgi ⇥ i-th objective discount factor state control disturbance subject to ! ut = μt(xt) t+1 ⇠ (·) xt 2 Rnx ut 2 Rnu t 2 Rn
  • 3. SDP and the 3 curses Stochastic Dynamic Programming is - in principle - the best approach to solve the problem - in practice - it suffers from 3 major shortcomings 1) Curse of dimensionality: computational cost grows exponentially with state, control and disturbance dimension [Bellman, 1967]; ut Qt ut xt Look-up table Q-function unknown Q-function computations are numerically performed on a discretized variable domain
  • 4. SDP and the 3 curses Stochastic Dynamic Programming is - in principle - the best approach to solve the problem - in practice - it suffers from 3 major shortcomings 1) Curse of dimensionality: computational cost grows exponentially with state, control and disturbance dimension [Bellman, 1967]; ut Qt ut xt Look-up table Q-function unknown Q-function computations are numerically performed on a discretized variable domain 2) Curse of modelling: any variable considered among the operating rule’s arguments has to be modelled [Bertsekas and Tsitsiklis, 1996]; t t+1 time xt ut, t+1 models are use in a multiple one-step- ahead-simulation mode
  • 5. SDP and the 3 curses Stochastic Dynamic Programming is - in principle - the best approach to solve the problem - in practice - it suffers from 3 major shortcomings 3) Curse of multiple objectives: computational cost grows exponentially with the number of objectives considered [Powell, 2011]. PARETO frontier multi-objective problems are solved by reiteratively solving single objective problems J1 J2 J3
  • 6. Beyond SDP: ADP and RL Approximate Dynamic Programming and Reinforcement Learning provide a framework to overcome some or all the SDP’s curses. [Powell, 2007; Busoniu et al. 2011 VALUE FUNCTION-BASED APPROCHES: • Approximate value iteration • Approximate policy iteration • Approximate policy evaluation Model-free or model-based // parametric or non-parametric POLICY SEARCH-BASED APPROACHES: • Direct policy search Simulation-based optimization // parametric
  • 7. Beyond SDP: ADP and RL Approximate Dynamic Programming and Reinforcement Learning provide a framework to overcome some or all the SDP’s curses. [Powell, 2007; Busoniu et al. 2011 VALUE FUNCTION-BASED APPROCHES: • Approximate value iteration • Approximate policy iteration • Approximate policy evaluation Model-free or model-based // parametric or non-parametric POLICY SEARCH-BASED APPROACHES: • Direct policy search Simulation-based optimization // parametricv
  • 8. Multi-objective Direct Policy Search (MODPS) Assuming the operating rule belong to a given family of functions and search the optimal solution in the policy’s parameter space ORIGINAL PROBLEM POLICY SEARCH PROBLEM min μt(·) J = |J1 J2...J q| subject to ! xt+1 = ft(xt, ut, t+1) ut = μt(xt) t+1 ⇠ (·) xt 2 Rnx ut 2 Rnu t 2 Rn ut = μt(xt, ✓t) min μt(·) J = |J1 J2...J q| ✓t subject to ! xt+1 = ft(xt, ut, t+1) ut = μt(xt,✓t) t+1 ⇠ (·) xt 2 Rnx ut 2 Rnu t 2 Rn ✓t 2 ⇥t2 Rn✓
  • 9. Selecting the policy approximation: Ad hoc/Empirism WHEN 1. The system is already operated 500! 450! 400! 350! 300! 250! 200! 1 150! 100! 3 2 4 0 25 50 75 100 125 150 175 200 225 250! 50! 0! release [m3/s] storage [Mm3] 5 Identify existing regularities in a sample of the operator behaviour 2. the system is simple (i.e. one reservoir) AND/OR the systems has one single objective (e.g. water supply) • NEW York City rule [Clark, 1950] • Space rule [Clark, 1956] • Standard Operating Policy [Draper, 2004] • ….. Empirical rules identified in the past
  • 10. Selecting the policy approximation: Universal Approx. Provided that some conditions are met, an Universal Approximator is approximate arbitrarily closely every continuous function. ARTIFICIAL NEURAL NETWORKS [Cybenko 1989, Funahashi 1989, Hornik et al. 1989] Parameter dimension n✓ = nu(N(nx + 2) + 1) GAUSSIAN RADIAL BASIS FUNCTIONS [Busoniu et al. 2011] Number of NEURONS Parameter dimension n✓ = N(2nx + nu) Number of BASES x1 x2 x3 u1 x1 x2 x3 u1
  • 11. Selecting the optimization algorithm Key problem features • High dimensional search spaces (rich parameterizations) • Complex search spaces (many local minima) • Sensitivity to parameter initialization (no-preconditioning) • Multiple objectives • Non differentiable objective functions • Sensitivity to noise
  • 12. Selecting the optimization algorithm Key problem features • High dimensional search spaces (rich parameterizations) • Complex search spaces (many local minima) • Sensitivity to parameter initialization (no-preconditioning) • Multiple objectives • Non differentiable objective functions • Sensitivity to noise BORG [Hadka and Reed 2012; Reed et al. 2013] a MULTI-OBJECTIVE EVOLUTIONARY ALGORITHM BORG is self-adaptive and employs • multiple search operators adaptively selected during the optimization • e-dominance archiving with internal operators to detect search stagnation • randomized restarts to escape local optima
  • 14. Red-Thai Binh River System - Vietnam HaGiang BaoLac Hanoi BacMe Thao Lo HoaBinh TamDuong NamGiang TaBu MuongTe LaiChau YenBai VuQuang VIETNAM CHINA LAOS THAILAND CAMBODIA Da Integrated Management of Red-Thai Binh Rivers System (IMRR) funded by the Italian Ministry of Foreign Affairs http://www.imrr.info/
  • 15. Hoa Binh reservoir - Vietnam Main characteristics • Catchment area 52,000 km2 • Active capacity 6 x 109 m3 • 8 penstocks 2,360 m3/s (240 MW) • 12 bottom gates 22,000 m3/s • 6 spillways 14,000 m3/s • 15% national energy (7,800 GWh) source: IWRP2008 Operating objectives • Hydropower production • Flood control (Hanoi) RESERVOIR CATCHMENT POWER PLANT DIVERSION DAM COMSUMPTIVE USE THAO LO DA HOA BINH
  • 16. Experimental Setting: ANN vs RBF STATE VECTOR (n_x=5) • 2 time indexes (sin, cosin) • Storage • Previous day inflow to reservoir • Previous day lateral inflow CONTROL VECTOR (n_u=1) • release from the reservoir RESERVOIR CATCHMENT POWER PLANT DIVERSION DAM COMSUMPTIVE USE THAO LO DA HOA BINH ALGORITHM SETTING and RUNNING • Default Borg MOEA parameterization [Hadka and Reed 2013] • NFE = 500,000 per replication • 20 replications to avoid dependence on randomness • Historical horizon 1962-1969, which comprises normal, wet and dry years
  • 17. Policy perfomance – operating objectives 2.6 x 107 2.5 2.4 2.3 2.2 2.1 2 Legend: ANN - RBF number of neurons/basis 1.80 100 200 300 400 500 600 700 (b) Generational distance (c) Additive ε-indicator (d) Hypervolume 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 # of neurons/basis # of neurons/basis # of neurons/basis 1.9 Jflo Jhyd 468 10 12 14 16 (a) Policy Performance with different ANN and RBF architectures ANN RBF Hydropower -­‐ kWh/d Floods – cm2/d
  • 18. Policy perfomance – front approximation quality 2.6 x 107 2.5 2.4 2.3 2.2 2.1 2 Legend: ANN - RBF number of neurons/basis 1.80 100 200 300 400 500 600 700 (b) Generational distance (c) Additive ε-indicator (d) Hypervolume 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 # of neurons/basis # of neurons/basis # of neurons/basis 1.9 Jflo Jhyd 468 10 12 14 16 (a) Policy Performance with different ANN and RBF architectures ANN RBF Hydropower -­‐ kWh/d Floods – cm2/d CONVERGENCE CONSISTENCY DIVERSITY
  • 19. Policy reliability (a) 75% best metric value (b) 95% best metric value 4 6 8 101214 16 4 6 8 10 121416 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 1 0.8 0.6 0.4 0.2 0 4 6 8 101214 16 4 6 8 10 121416 0 4 6 8 101214 16 4 6 8 10 121416 1 0.8 0.6 0.4 0.2 0 4 6 8 101214 16 4 6 8 10 121416 hypervolume additive ε-indicator generational distance 4 6 8 101214 16 4 6 8 10 121416 4 6 8 101214 16 4 6 8 10 121416 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 # of neurons/basis # of neurons/basis ANN RBF CONVERGENCE CONSISTENCY DIVERSITY FIG. 3. Probability of attainment with a threshold equal to 75% (a) and to 95% (b) of
  • 20. Run time search dynamics (NFA = 2M) (a) Generational distance (b) Additive ε−indicator (c) Hypervolume ANN ( 6 neurons ) RBF (6 bases) 0 0.5 1 1.5 2 1.4 1.2 1 0.8 0.6 0.4 0.2 0 0 0.5 1 1.5 2 1.4 1.2 1 0.8 0.6 0.4 0.2 0 0 0.5 1 1.5 2 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 NFA (x106) NFA (x106) NFA (x106) FIG. 4. Analysis CONVERGENCE of runtime search dynamics CONSISTENCY for ANN (red lines) and DIVERSITY RBF (blue lines) operating policy optimization in terms of generational distance (a), additive -indicator (b), and hypervolume (c).
  • 21. Policy validation 2.6 x 107 2.4 2.2 2 1.80 100 200 300 400 500 600 700 800 900 Jflo Jhyd 2.6 x 107 2.4 2.2 2 1.80 100 200 300 400 500 600 700 800 900 Jflo Jhyd ANN RBF ANN RBF (a) Results over the optimization horizon (1962-1969) (b) Results over the validation horizon (1995-2004) ANN RBF Hydropower -­‐ kWh/d Floods – cm2/d Hydropower -­‐ kWh/d Floods – cm2/d
  • 22. Conclusions § MODPS is an interesting alternative to SDP familiy methods for a number of good reasons 1. No discretization required: NO curse of dimensionality; 2. Does not require separability in time of constraints and objective functions (e.g. duration curves): NO curse of dimensionality; 3. Can easily include any model-free information as long as this is control-indipendent: NO curse of modelling; 4. Can be combined with any simulation model (also high fidelity ones): NO curse of modelling; 5. Can be easily combined with truly multi-objective optimization algorithms: NO curse of the multiple objectives.
  • 23. Conclusions § RBFs and ANNs seem to perform comparatively well when evaluated in terms of policy performance § RBFs outperform ANNs in terms of quality of the Pareto front approximation, reliability and run time search dynamics § Future works will focus on exploring multiple output policies (e.g. network of reservoirs)