SlideShare a Scribd company logo
1 of 27
ROBUSTNESS METRICS
How are they calculated and when should they be used?
C. McPhail, H.R. Maier, J.H. Kwakkel, M. Giuliani, A. Castelletti and S.Westra
How do we plan for an uncertain future?
How do we plan for an uncertain future?
Estimated
distribution of
future states
(Scenario #1)
Estimated
distribution of
future states
(Scenario #2)
System
state
Today Future
How do we quantify system performance
under uncertainty?
System performance
System performance
System
state
Today Future
Relative likelihood of
occurrence is unknown
How do we quantify system performance
under uncertainty?
System performance
System performance
Robustness
System
state
Today Future
Relative likelihood of
occurrence is unknown
Which metric should we use for
calculating robustness?
Maximin
Maximax
Hurwicz optimism-
pessimism rule
Laplace’s principle of
insufficient reason
Minimax regret
90th percentile
minimax regret
Mean-variance
Undesirable
deviations
Percentile-
based
skewnessPercentile-based
peakedness
Starr’s domain
criterion
Which metric should we use for
calculating robustness?
Maximin
Maximax
Hurwicz optimism-
pessimism rule
Laplace’s principle of
insufficient reason
Minimax regret
90th percentile
minimax regret
Mean-variance
Undesirable
deviations
Percentile-
based
skewnessPercentile-based
peakedness
Starr’s domain
criterion
Which metric should we use for
calculating robustness?
Maximin
Maximax
Hurwicz optimism-
pessimism rule
Laplace’s principle of
insufficient reason
Minimax regret
90th percentile
minimax regret
Mean-variance
Undesirable
deviations
Percentile-
based
skewnessPercentile-based
peakedness
Starr’s domain
criterion
Which metric should we use for
calculating robustness?
Maximin
Maximax
Hurwicz optimism-
pessimism rule
Laplace’s principle of
insufficient reason
Minimax regret
90th percentile
minimax regret
Mean-variance
Undesirable
deviations
Percentile-
based
skewnessPercentile-based
peakedness
Starr’s domain
criterion
Contributions of the research
1. A unified framework for the calculation of a wide range of robustness metrics.
Enabling a comparison of robustness metrics.
2. A taxonomy of robustness metrics.
Providing guidance to decision-makers.
Robustness value
Performance
metric (e.g.
cost,
reliability)
Decision
alternatives
(e.g. Policy
options, plans,
solutions)
Plausible
future
conditions
(Scenarios)
Robustness metric
Probabilityofoccurrencein
selectedscenarios
Transformed Performance
1 2 3 4 … n
Performance
Scenario #
1 2 4 …
TransformedPerformance
Scenario #
Performance over all
scenarios
Transformed
performance values over
all scenarios
Transformed
performance values over
selected scenarios
Robustness value
Mean
T1: Performance
value transformation
T2: Scenario subset
selection
T3: Robustness
metric calculation
1 2 3 4 … nTransformedPerformance
Scenario #
1 2 3 4 … n
TransformedPerformance
Scenario #
T1: Performance value transformation
Performance (identity)Cost of making wrong
decision (regret)
1 2 3 4 … n
Performance
Scenario #
1 2 3 4 … n
TransformedPerformance
Scenario #
1 2 3 4 … n
Performance
Scenario #
Performance values Pass/fail (satisficing)
OR OR
Threshold
T1: Performance value transformation
Robustness calculation based on
relative performance values
Robustness calculation based on
absolute performance values
Indication of whether system
performance is satisfactory or
not
- (MORE)
- (POMORE)
- (Decision Scaling)
- Starr’s domain criterion
- (Info Gap)
Indication of actual system
performance
- Minimax regret
- 90th percentile minimax regret
- Undesirable deviations
- Maximin (minimax)
- Maximax
- Hurwicz’s optimism-pessimism rule
- Laplace’s principle of insufficient
reason
- Mean-variance
- Percentile-based skewness
- Percentile-based peakedness
T2: Scenario subset selection
Worst half of scenariosBest-case scenario
1 2 3 4 … n
Performance
Scenario #
Transformed
performance values
Worst-case scenario
OR OR
1 2 3 4 … n
Performance
Scenario #
1 2 3 4 … n
Performance
Scenario #
1 2 3 4 … n
Performance
Scenario #
T2: Scenario subset selection
Less risk averse
More risk averse
Probabilityofoccurrencein
selectedscenarios
Transformed Performance
T3: Robustness metric calculation
SkewVariance
Transformed
performance values in
selected scenarios
Expected value
(e.g. mean)
OR OR
Mean
Probabilityofoccurrencein
selectedscenarios
Transformed Performance
Probabilityofoccurrencein
selectedscenarios
Transformed Performance
Variance
Probabilityofoccurrencein
selectedscenarios
Transformed Performance
Mean
Median
T3: Robustness metric calculation
Robustness metric
Robustness metric calculation
None Sum Mean Weighted mean Variance Skew Kurtosis
Maximin ✓
Maximax ✓
Hurwicz optimism-pessimism rule ✓
Laplace’s principle of insufficient reason ✓
Minimax regret ✓
90th percentile minimax regret ✓
Mean-variance ✓ ✓
Undesirable deviations ✓
Percentile-based skewness ✓
Percentile-based peakedness ✓
Starr’s domain criterion ✓
Probabilityofoccurrencein
selectedscenarios
Transformed Performance
1 2 3 4 … n
Performance
Scenario #
1 2 4 …
TransformedPerformance
Scenario #
Performance over all
scenarios
Transformed
performance values over
all scenarios
Transformed
performance values over
selected scenarios
Robustness value
Mean
T1: Performance
value transformation
T2: Scenario subset
selection
T3: Robustness
metric calculation
1 2 3 4 … nTransformedPerformance
Scenario #
Maximin
Maximax
Hurwicz optimism-pessimism rule
Laplace’s principle of insufficient
reason
Minimax regret
90th percentile minimax regret
Mean-variance
Undesirable deviations
Percentile-based skewness
Percentile-based peakedness
Starr’s domain criterion
Metric
T1: Performance value
transformation
T2: Scenario subset
selection
T3: Robustness
metric calculation
Maximin Identity Worst-case Identity
Maximax Identity Best-case Identity
Hurwicz optimism-pessimism rule Identity Worst- and best-cases Weighted mean
Laplace’s principle of insufficient
reason
Identity All Mean
Minimax regret
Regret from best decision
alternative
Worst-case Identity
90th percentile minimax regret
Regret from best decision
alternative
90th percentile Identity
Mean-variance Identity All Mean-variance
Undesirable deviations
Regret from median
performance
Worst-half Sum
Percentile-based skewness Identity
10th, 50th and 90th
percentiles
Skew
Percentile-based peakedness Identity
10th, 25th, 75th and 90th
percentiles
Kurtosis
Starr’s domain criterion Satisfaction of constraints All Mean
Maximin
Maximax
Hurwicz optimism-pessimism rule
Laplace’s principle of insufficient
reason
Minimax regret
90th percentile minimax regret
Mean-variance
Undesirable deviations
Percentile-based skewness
Percentile-based peakedness
Starr’s domain criterion
Metric
T1: Performance value
transformation
T2: Scenario subset
selection
T3: Robustness
metric calculation
Maximin Identity Worst-case Identity
Maximax Identity Best-case Identity
Hurwicz optimism-pessimism rule Identity Worst- and best-cases Weighted mean
Laplace’s principle of insufficient
reason
Identity All Mean
Minimax regret
Regret from best decision
alternative
Worst-case Identity
90th percentile minimax regret
Regret from best decision
alternative
90th percentile Identity
Mean-variance Identity All Mean-variance
Undesirable deviations
Regret from median
performance
Worst-half Sum
Percentile-based skewness Identity
10th, 50th and 90th
percentiles
Skew
Percentile-based peakedness Identity
10th, 25th, 75th and 90th
percentiles
Kurtosis
Starr’s domain criterion Satisfaction of constraints All Mean
Future work
A conceptual framework for understanding when robustness metrics agree or
disagree.
Paper under revision
C. McPhail, H.R. Maier, J.H. Kwakkel, M. Giuliani, A. Castelletti and S.Westra
(under revision), Robustness metrics: How are they calculated, when should
they be used and why do they give different results?, Earth’s Future.
Contact
Cameron McPhail
University of Adelaide
cameron.mcphail@adelaide.edu.au
T1: Performance value transformation
Description Equation
Identity transformation 𝑓′ 𝑥𝑖, 𝑠𝑗 = 𝑓 𝑥𝑖, 𝑠𝑗
Regret from best decision
alternative
𝑓′ 𝑥𝑖, 𝑠𝑗 =
max
𝑥
𝑓 𝑥, 𝑠𝑗 − 𝑓 𝑥𝑖, 𝑠𝑗 , maximisation
𝑓 𝑥𝑖, 𝑠𝑗 − min
𝑥
𝑓 𝑥, 𝑠𝑗 , minimisation
Regret from median
𝑓′ 𝑥𝑖, 𝑠𝑗 =
𝑞50 − 𝑓 𝑥𝑖, 𝑠𝑗 , maximisation
𝑓 𝑥𝑖, 𝑠𝑗 − 𝑞50, minimisation
where 𝑞50 is the median performance for decision alternative 𝑥𝑖. i.e.
𝑃 𝑓 𝑥𝑖, 𝑆 ≤ 𝑞50 =
1
2
Satisfaction of constraints
𝑓′ 𝑥𝑖, 𝑠𝑗 =
1 if 𝑓 𝑥𝑖, 𝑠𝑗 ≥ 𝑐
0 if 𝑓 𝑥𝑖, 𝑠𝑗 < 𝑐
, maximisation
1 if 𝑓 𝑥𝑖, 𝑠𝑗 ≤ 𝑐
0 if 𝑓 𝑥𝑖, 𝑠𝑗 > 𝑐
, minimisation
where 𝑐 is a constraint
T2: Scenario subset selection
Description Equation
Worst-case 𝑆′
=
arg min
𝑠
𝑓′ 𝑥𝑖, 𝑠 , maximisation
arg max
𝑠
𝑓′ 𝑥𝑖, 𝑠 , minimisation
Best-case 𝑆′
=
arg max
𝑠
𝑓′ 𝑥𝑖, 𝑠 , maximisation
arg min
𝑠
𝑓′ 𝑥𝑖, 𝑠 , minimisation
Worst- and best-cases 𝑆′
= arg max
𝑠
𝑓′ 𝑥𝑖, 𝑠 , arg min
𝑠
𝑓′ 𝑥𝑖, 𝑠
All 𝑆′
= 𝑆
Worst-half
𝑆′
=
𝑠 ∈ 𝑆: 𝑓′ 𝑥𝑖, 𝑠 ≤ 𝑞50 , maximisation
𝑠 ∈ 𝑆: 𝑓′ 𝑥𝑖, 𝑠 ≥ 𝑞50 , minimisation
where 𝑞50 is the 50th percentile (median) value of 𝑓′ 𝑥𝑖, 𝑆
Percentile
𝑆′
= 𝑓′ 𝑥𝑖, 𝑠 = 𝑞 𝑘
where 𝑞 𝑘 is the kth percentile value of 𝑓′ 𝑥𝑖, 𝑆
Note that the scenario 𝑠 that produces the value of 𝑓′ 𝑥𝑖, 𝑠 closest to 𝑞 𝑘 is the scenario that
is used.
T3: Robustness metric calculation
Description Equation
Identity
transformation
𝑅 𝑥𝑖, 𝑆 = 𝑓′ 𝑥𝑖, 𝑆′
Mean
𝑅 𝑥𝑖, 𝑆 =
1
𝑛′
𝑗=1
𝑛′
𝑓′ 𝑥𝑖, 𝑠𝑗
where 𝑛′ is the number of scenarios in 𝑆′
Sum 𝑅 𝑥𝑖, 𝑆 =
𝑗=1
𝑛′
𝑓′ 𝑥𝑖, 𝑠𝑗
Weighted mean (two
scenarios)
𝑅 𝑥𝑖, 𝑆 = 𝛼𝑓′ 𝑥𝑖, 𝑠 𝑎 + 1 − 𝛼 𝑓′ 𝑥𝑖, 𝑠 𝑏
where 𝑠 𝑎 and 𝑠 𝑏 are two scenarios and 𝛼 is the preference of the decision maker towards using 𝑠 𝑎 and 0 <
𝛼 < 1
(Also see next slide…)
T3: Robustness metric calculation
Description Equation
Variance-based (i.e.
the standard
deviation)
𝑅 𝑥𝑖, 𝑆 =
1
𝑛′ − 1
𝑗=1
𝑛′
𝑓′ 𝑥𝑖, 𝑠𝑗 − 𝜇
2
where 𝜇 is the mean (see the equation earlier in this table)
Mean-variance
𝑅 𝑥𝑖, 𝑆 =
𝜇 + 1 𝜎 + 1 , maximisation
− 𝜇 + 1 𝜎 + 1 , minimisation
where 𝜇 is the mean and 𝜎 is the standard deviation (given by equations above)
Skew
𝑅 𝑥𝑖, 𝑆 =
𝑓′ 𝑥𝑖, 𝑠90 + 𝑓′ 𝑥𝑖, 𝑠10 2 − 𝑓′ 𝑥𝑖, 𝑠50
𝑓′ 𝑥𝑖, 𝑠90 − 𝑓′ 𝑥𝑖, 𝑠10 2
, maximisation
−
𝑓′ 𝑥𝑖, 𝑠90 + 𝑓′ 𝑥𝑖, 𝑠10 2 − 𝑓′ 𝑥𝑖, 𝑠50
𝑓′ 𝑥𝑖, 𝑠90 − 𝑓′ 𝑥𝑖, 𝑠10 2
, minimisation
where 𝑠10, 𝑠50 and 𝑠90 are scenarios that represent the 10th, 50th and 90th percentiles for 𝑓′ 𝑥𝑖, 𝑆
Kurtosis
𝑅 𝑥𝑖, 𝑆 =
𝑓′ 𝑥𝑖, 𝑠90 − 𝑓′ 𝑥𝑖, 𝑠10
𝑓′ 𝑥𝑖, 𝑠75 − 𝑓′ 𝑥𝑖, 𝑠25
where 𝑠10, 𝑠25, 𝑠75 and 𝑠90 are scenarios that represent the 10th, 25th, 75th and 90th percentiles for
𝑓′ 𝑥𝑖, 𝑆

More Related Content

What's hot

Thesis proposal defense 2011.06.01
Thesis proposal defense 2011.06.01Thesis proposal defense 2011.06.01
Thesis proposal defense 2011.06.01
Alison Tsai
 
Thesis Power Point Presentation
Thesis Power Point PresentationThesis Power Point Presentation
Thesis Power Point Presentation
riddhikapandya1985
 
Dissertation oral defense presentation
Dissertation   oral defense presentationDissertation   oral defense presentation
Dissertation oral defense presentation
Dr. Naomi Mangatu
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis Defence
Catie Chase
 
Master Thesis Presentation
Master Thesis PresentationMaster Thesis Presentation
Master Thesis Presentation
Wishofnight13
 
A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...
A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...
A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...
Donna Madison-Bell
 
3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION
3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION
3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION
Wisconsin Land Information Association
 

What's hot (20)

Thesis proposal defense 2011.06.01
Thesis proposal defense 2011.06.01Thesis proposal defense 2011.06.01
Thesis proposal defense 2011.06.01
 
Thesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risksThesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risks
 
Oral proposaldefense
Oral proposaldefenseOral proposaldefense
Oral proposaldefense
 
Thesis Power Point Presentation
Thesis Power Point PresentationThesis Power Point Presentation
Thesis Power Point Presentation
 
Dissertation oral defense presentation
Dissertation   oral defense presentationDissertation   oral defense presentation
Dissertation oral defense presentation
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis Defence
 
BEHAILU DEFENCE PPT (final 4)
BEHAILU DEFENCE PPT (final 4)BEHAILU DEFENCE PPT (final 4)
BEHAILU DEFENCE PPT (final 4)
 
Dissertation defense ppt
Dissertation defense ppt Dissertation defense ppt
Dissertation defense ppt
 
Master Thesis Presentation
Master Thesis PresentationMaster Thesis Presentation
Master Thesis Presentation
 
Seminar human impact on rivers
Seminar human impact on riversSeminar human impact on rivers
Seminar human impact on rivers
 
Decision Making Using the Analytic Hierarchy Process (AHP); A Step by Step A...
Decision Making Using the Analytic Hierarchy Process (AHP);  A Step by Step A...Decision Making Using the Analytic Hierarchy Process (AHP);  A Step by Step A...
Decision Making Using the Analytic Hierarchy Process (AHP); A Step by Step A...
 
A PROJECT REPORT ON THE STUDY OF FACTORS INFLUENCING THE BUYING OF MOBILE HAN...
A PROJECT REPORT ON THE STUDY OF FACTORS INFLUENCING THE BUYING OF MOBILE HAN...A PROJECT REPORT ON THE STUDY OF FACTORS INFLUENCING THE BUYING OF MOBILE HAN...
A PROJECT REPORT ON THE STUDY OF FACTORS INFLUENCING THE BUYING OF MOBILE HAN...
 
Colloquium Presentation_Ph.D Research Proposal
Colloquium Presentation_Ph.D Research ProposalColloquium Presentation_Ph.D Research Proposal
Colloquium Presentation_Ph.D Research Proposal
 
Two Means, Independent Samples
Two Means, Independent SamplesTwo Means, Independent Samples
Two Means, Independent Samples
 
A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...
A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...
A Qualitative Phenomenological Study on Prison Volunteers in California’s Cor...
 
Thesis Defense Presentation
Thesis Defense PresentationThesis Defense Presentation
Thesis Defense Presentation
 
Msc Thesis - Presentation
Msc Thesis - PresentationMsc Thesis - Presentation
Msc Thesis - Presentation
 
Master's course defense presentation in Water Resource Management and GIS
Master's course defense presentation in Water Resource Management and GIS  Master's course defense presentation in Water Resource Management and GIS
Master's course defense presentation in Water Resource Management and GIS
 
3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION
3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION
3B - GIS-BASED ANALYSIS OF AQUACULTURE SITE SELECTION
 
Test for equal variances
Test for equal variancesTest for equal variances
Test for equal variances
 

Similar to Robustness metrics: How are they calculated and when should they be used?

L1 statistics
L1 statisticsL1 statistics
L1 statistics
dapdai
 
2008 implementation of va r in financial institutions
2008   implementation of va r in financial institutions2008   implementation of va r in financial institutions
2008 implementation of va r in financial institutions
crmbasel
 
The Dark Art of Production Alerting
The Dark Art of Production AlertingThe Dark Art of Production Alerting
The Dark Art of Production Alerting
Alois Reitbauer
 
SAMPLING MEAN DEFINITION The term sampling mean .docx
SAMPLING MEAN DEFINITION The term sampling mean .docxSAMPLING MEAN DEFINITION The term sampling mean .docx
SAMPLING MEAN DEFINITION The term sampling mean .docx
anhlodge
 
Ch3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdfCh3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdf
Vamshi962726
 

Similar to Robustness metrics: How are they calculated and when should they be used? (20)

L1 statistics
L1 statisticsL1 statistics
L1 statistics
 
hypothesis.pptx
hypothesis.pptxhypothesis.pptx
hypothesis.pptx
 
QT1 - 07 - Estimation
QT1 - 07 - EstimationQT1 - 07 - Estimation
QT1 - 07 - Estimation
 
Intro to Quant Trading Strategies (Lecture 10 of 10)
Intro to Quant Trading Strategies (Lecture 10 of 10)Intro to Quant Trading Strategies (Lecture 10 of 10)
Intro to Quant Trading Strategies (Lecture 10 of 10)
 
Catching the most with high-throughput screening
Catching the most with high-throughput screeningCatching the most with high-throughput screening
Catching the most with high-throughput screening
 
Chapter_9.pptx
Chapter_9.pptxChapter_9.pptx
Chapter_9.pptx
 
BIIntro.ppt
BIIntro.pptBIIntro.ppt
BIIntro.ppt
 
2008 implementation of va r in financial institutions
2008   implementation of va r in financial institutions2008   implementation of va r in financial institutions
2008 implementation of va r in financial institutions
 
The Dark Art of Production Alerting
The Dark Art of Production AlertingThe Dark Art of Production Alerting
The Dark Art of Production Alerting
 
SAMPLING MEAN DEFINITION The term sampling mean .docx
SAMPLING MEAN DEFINITION The term sampling mean .docxSAMPLING MEAN DEFINITION The term sampling mean .docx
SAMPLING MEAN DEFINITION The term sampling mean .docx
 
Sriram seminar on introduction to statistics
Sriram seminar on introduction to statisticsSriram seminar on introduction to statistics
Sriram seminar on introduction to statistics
 
Intro to Quant Trading Strategies (Lecture 2 of 10)
Intro to Quant Trading Strategies (Lecture 2 of 10)Intro to Quant Trading Strategies (Lecture 2 of 10)
Intro to Quant Trading Strategies (Lecture 2 of 10)
 
Statistics excellent
Statistics excellentStatistics excellent
Statistics excellent
 
Six Sigma Confidence Interval Analysis (CIA) Training Module
Six Sigma Confidence Interval Analysis (CIA) Training ModuleSix Sigma Confidence Interval Analysis (CIA) Training Module
Six Sigma Confidence Interval Analysis (CIA) Training Module
 
Ch3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdfCh3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdf
 
Or ppt,new
Or ppt,newOr ppt,new
Or ppt,new
 
Classification modelling review
Classification modelling reviewClassification modelling review
Classification modelling review
 
Basic statistics 1
Basic statistics  1Basic statistics  1
Basic statistics 1
 
Quantitative Methods for Lawyers - Class #7 - Probability & Basic Statistics ...
Quantitative Methods for Lawyers - Class #7 - Probability & Basic Statistics ...Quantitative Methods for Lawyers - Class #7 - Probability & Basic Statistics ...
Quantitative Methods for Lawyers - Class #7 - Probability & Basic Statistics ...
 
Statistics
StatisticsStatistics
Statistics
 

Recently uploaded

result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 

Recently uploaded (20)

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 

Robustness metrics: How are they calculated and when should they be used?

  • 1. ROBUSTNESS METRICS How are they calculated and when should they be used? C. McPhail, H.R. Maier, J.H. Kwakkel, M. Giuliani, A. Castelletti and S.Westra
  • 2. How do we plan for an uncertain future?
  • 3. How do we plan for an uncertain future? Estimated distribution of future states (Scenario #1) Estimated distribution of future states (Scenario #2) System state Today Future
  • 4. How do we quantify system performance under uncertainty? System performance System performance System state Today Future Relative likelihood of occurrence is unknown
  • 5. How do we quantify system performance under uncertainty? System performance System performance Robustness System state Today Future Relative likelihood of occurrence is unknown
  • 6. Which metric should we use for calculating robustness? Maximin Maximax Hurwicz optimism- pessimism rule Laplace’s principle of insufficient reason Minimax regret 90th percentile minimax regret Mean-variance Undesirable deviations Percentile- based skewnessPercentile-based peakedness Starr’s domain criterion
  • 7. Which metric should we use for calculating robustness? Maximin Maximax Hurwicz optimism- pessimism rule Laplace’s principle of insufficient reason Minimax regret 90th percentile minimax regret Mean-variance Undesirable deviations Percentile- based skewnessPercentile-based peakedness Starr’s domain criterion
  • 8. Which metric should we use for calculating robustness? Maximin Maximax Hurwicz optimism- pessimism rule Laplace’s principle of insufficient reason Minimax regret 90th percentile minimax regret Mean-variance Undesirable deviations Percentile- based skewnessPercentile-based peakedness Starr’s domain criterion
  • 9. Which metric should we use for calculating robustness? Maximin Maximax Hurwicz optimism- pessimism rule Laplace’s principle of insufficient reason Minimax regret 90th percentile minimax regret Mean-variance Undesirable deviations Percentile- based skewnessPercentile-based peakedness Starr’s domain criterion
  • 10. Contributions of the research 1. A unified framework for the calculation of a wide range of robustness metrics. Enabling a comparison of robustness metrics. 2. A taxonomy of robustness metrics. Providing guidance to decision-makers.
  • 11. Robustness value Performance metric (e.g. cost, reliability) Decision alternatives (e.g. Policy options, plans, solutions) Plausible future conditions (Scenarios) Robustness metric
  • 12. Probabilityofoccurrencein selectedscenarios Transformed Performance 1 2 3 4 … n Performance Scenario # 1 2 4 … TransformedPerformance Scenario # Performance over all scenarios Transformed performance values over all scenarios Transformed performance values over selected scenarios Robustness value Mean T1: Performance value transformation T2: Scenario subset selection T3: Robustness metric calculation 1 2 3 4 … nTransformedPerformance Scenario #
  • 13. 1 2 3 4 … n TransformedPerformance Scenario # T1: Performance value transformation Performance (identity)Cost of making wrong decision (regret) 1 2 3 4 … n Performance Scenario # 1 2 3 4 … n TransformedPerformance Scenario # 1 2 3 4 … n Performance Scenario # Performance values Pass/fail (satisficing) OR OR Threshold
  • 14. T1: Performance value transformation Robustness calculation based on relative performance values Robustness calculation based on absolute performance values Indication of whether system performance is satisfactory or not - (MORE) - (POMORE) - (Decision Scaling) - Starr’s domain criterion - (Info Gap) Indication of actual system performance - Minimax regret - 90th percentile minimax regret - Undesirable deviations - Maximin (minimax) - Maximax - Hurwicz’s optimism-pessimism rule - Laplace’s principle of insufficient reason - Mean-variance - Percentile-based skewness - Percentile-based peakedness
  • 15. T2: Scenario subset selection Worst half of scenariosBest-case scenario 1 2 3 4 … n Performance Scenario # Transformed performance values Worst-case scenario OR OR 1 2 3 4 … n Performance Scenario # 1 2 3 4 … n Performance Scenario # 1 2 3 4 … n Performance Scenario #
  • 16. T2: Scenario subset selection Less risk averse More risk averse
  • 17. Probabilityofoccurrencein selectedscenarios Transformed Performance T3: Robustness metric calculation SkewVariance Transformed performance values in selected scenarios Expected value (e.g. mean) OR OR Mean Probabilityofoccurrencein selectedscenarios Transformed Performance Probabilityofoccurrencein selectedscenarios Transformed Performance Variance Probabilityofoccurrencein selectedscenarios Transformed Performance Mean Median
  • 18. T3: Robustness metric calculation Robustness metric Robustness metric calculation None Sum Mean Weighted mean Variance Skew Kurtosis Maximin ✓ Maximax ✓ Hurwicz optimism-pessimism rule ✓ Laplace’s principle of insufficient reason ✓ Minimax regret ✓ 90th percentile minimax regret ✓ Mean-variance ✓ ✓ Undesirable deviations ✓ Percentile-based skewness ✓ Percentile-based peakedness ✓ Starr’s domain criterion ✓
  • 19. Probabilityofoccurrencein selectedscenarios Transformed Performance 1 2 3 4 … n Performance Scenario # 1 2 4 … TransformedPerformance Scenario # Performance over all scenarios Transformed performance values over all scenarios Transformed performance values over selected scenarios Robustness value Mean T1: Performance value transformation T2: Scenario subset selection T3: Robustness metric calculation 1 2 3 4 … nTransformedPerformance Scenario #
  • 20. Maximin Maximax Hurwicz optimism-pessimism rule Laplace’s principle of insufficient reason Minimax regret 90th percentile minimax regret Mean-variance Undesirable deviations Percentile-based skewness Percentile-based peakedness Starr’s domain criterion
  • 21. Metric T1: Performance value transformation T2: Scenario subset selection T3: Robustness metric calculation Maximin Identity Worst-case Identity Maximax Identity Best-case Identity Hurwicz optimism-pessimism rule Identity Worst- and best-cases Weighted mean Laplace’s principle of insufficient reason Identity All Mean Minimax regret Regret from best decision alternative Worst-case Identity 90th percentile minimax regret Regret from best decision alternative 90th percentile Identity Mean-variance Identity All Mean-variance Undesirable deviations Regret from median performance Worst-half Sum Percentile-based skewness Identity 10th, 50th and 90th percentiles Skew Percentile-based peakedness Identity 10th, 25th, 75th and 90th percentiles Kurtosis Starr’s domain criterion Satisfaction of constraints All Mean Maximin Maximax Hurwicz optimism-pessimism rule Laplace’s principle of insufficient reason Minimax regret 90th percentile minimax regret Mean-variance Undesirable deviations Percentile-based skewness Percentile-based peakedness Starr’s domain criterion
  • 22. Metric T1: Performance value transformation T2: Scenario subset selection T3: Robustness metric calculation Maximin Identity Worst-case Identity Maximax Identity Best-case Identity Hurwicz optimism-pessimism rule Identity Worst- and best-cases Weighted mean Laplace’s principle of insufficient reason Identity All Mean Minimax regret Regret from best decision alternative Worst-case Identity 90th percentile minimax regret Regret from best decision alternative 90th percentile Identity Mean-variance Identity All Mean-variance Undesirable deviations Regret from median performance Worst-half Sum Percentile-based skewness Identity 10th, 50th and 90th percentiles Skew Percentile-based peakedness Identity 10th, 25th, 75th and 90th percentiles Kurtosis Starr’s domain criterion Satisfaction of constraints All Mean
  • 23. Future work A conceptual framework for understanding when robustness metrics agree or disagree. Paper under revision C. McPhail, H.R. Maier, J.H. Kwakkel, M. Giuliani, A. Castelletti and S.Westra (under revision), Robustness metrics: How are they calculated, when should they be used and why do they give different results?, Earth’s Future. Contact Cameron McPhail University of Adelaide cameron.mcphail@adelaide.edu.au
  • 24. T1: Performance value transformation Description Equation Identity transformation 𝑓′ 𝑥𝑖, 𝑠𝑗 = 𝑓 𝑥𝑖, 𝑠𝑗 Regret from best decision alternative 𝑓′ 𝑥𝑖, 𝑠𝑗 = max 𝑥 𝑓 𝑥, 𝑠𝑗 − 𝑓 𝑥𝑖, 𝑠𝑗 , maximisation 𝑓 𝑥𝑖, 𝑠𝑗 − min 𝑥 𝑓 𝑥, 𝑠𝑗 , minimisation Regret from median 𝑓′ 𝑥𝑖, 𝑠𝑗 = 𝑞50 − 𝑓 𝑥𝑖, 𝑠𝑗 , maximisation 𝑓 𝑥𝑖, 𝑠𝑗 − 𝑞50, minimisation where 𝑞50 is the median performance for decision alternative 𝑥𝑖. i.e. 𝑃 𝑓 𝑥𝑖, 𝑆 ≤ 𝑞50 = 1 2 Satisfaction of constraints 𝑓′ 𝑥𝑖, 𝑠𝑗 = 1 if 𝑓 𝑥𝑖, 𝑠𝑗 ≥ 𝑐 0 if 𝑓 𝑥𝑖, 𝑠𝑗 < 𝑐 , maximisation 1 if 𝑓 𝑥𝑖, 𝑠𝑗 ≤ 𝑐 0 if 𝑓 𝑥𝑖, 𝑠𝑗 > 𝑐 , minimisation where 𝑐 is a constraint
  • 25. T2: Scenario subset selection Description Equation Worst-case 𝑆′ = arg min 𝑠 𝑓′ 𝑥𝑖, 𝑠 , maximisation arg max 𝑠 𝑓′ 𝑥𝑖, 𝑠 , minimisation Best-case 𝑆′ = arg max 𝑠 𝑓′ 𝑥𝑖, 𝑠 , maximisation arg min 𝑠 𝑓′ 𝑥𝑖, 𝑠 , minimisation Worst- and best-cases 𝑆′ = arg max 𝑠 𝑓′ 𝑥𝑖, 𝑠 , arg min 𝑠 𝑓′ 𝑥𝑖, 𝑠 All 𝑆′ = 𝑆 Worst-half 𝑆′ = 𝑠 ∈ 𝑆: 𝑓′ 𝑥𝑖, 𝑠 ≤ 𝑞50 , maximisation 𝑠 ∈ 𝑆: 𝑓′ 𝑥𝑖, 𝑠 ≥ 𝑞50 , minimisation where 𝑞50 is the 50th percentile (median) value of 𝑓′ 𝑥𝑖, 𝑆 Percentile 𝑆′ = 𝑓′ 𝑥𝑖, 𝑠 = 𝑞 𝑘 where 𝑞 𝑘 is the kth percentile value of 𝑓′ 𝑥𝑖, 𝑆 Note that the scenario 𝑠 that produces the value of 𝑓′ 𝑥𝑖, 𝑠 closest to 𝑞 𝑘 is the scenario that is used.
  • 26. T3: Robustness metric calculation Description Equation Identity transformation 𝑅 𝑥𝑖, 𝑆 = 𝑓′ 𝑥𝑖, 𝑆′ Mean 𝑅 𝑥𝑖, 𝑆 = 1 𝑛′ 𝑗=1 𝑛′ 𝑓′ 𝑥𝑖, 𝑠𝑗 where 𝑛′ is the number of scenarios in 𝑆′ Sum 𝑅 𝑥𝑖, 𝑆 = 𝑗=1 𝑛′ 𝑓′ 𝑥𝑖, 𝑠𝑗 Weighted mean (two scenarios) 𝑅 𝑥𝑖, 𝑆 = 𝛼𝑓′ 𝑥𝑖, 𝑠 𝑎 + 1 − 𝛼 𝑓′ 𝑥𝑖, 𝑠 𝑏 where 𝑠 𝑎 and 𝑠 𝑏 are two scenarios and 𝛼 is the preference of the decision maker towards using 𝑠 𝑎 and 0 < 𝛼 < 1 (Also see next slide…)
  • 27. T3: Robustness metric calculation Description Equation Variance-based (i.e. the standard deviation) 𝑅 𝑥𝑖, 𝑆 = 1 𝑛′ − 1 𝑗=1 𝑛′ 𝑓′ 𝑥𝑖, 𝑠𝑗 − 𝜇 2 where 𝜇 is the mean (see the equation earlier in this table) Mean-variance 𝑅 𝑥𝑖, 𝑆 = 𝜇 + 1 𝜎 + 1 , maximisation − 𝜇 + 1 𝜎 + 1 , minimisation where 𝜇 is the mean and 𝜎 is the standard deviation (given by equations above) Skew 𝑅 𝑥𝑖, 𝑆 = 𝑓′ 𝑥𝑖, 𝑠90 + 𝑓′ 𝑥𝑖, 𝑠10 2 − 𝑓′ 𝑥𝑖, 𝑠50 𝑓′ 𝑥𝑖, 𝑠90 − 𝑓′ 𝑥𝑖, 𝑠10 2 , maximisation − 𝑓′ 𝑥𝑖, 𝑠90 + 𝑓′ 𝑥𝑖, 𝑠10 2 − 𝑓′ 𝑥𝑖, 𝑠50 𝑓′ 𝑥𝑖, 𝑠90 − 𝑓′ 𝑥𝑖, 𝑠10 2 , minimisation where 𝑠10, 𝑠50 and 𝑠90 are scenarios that represent the 10th, 50th and 90th percentiles for 𝑓′ 𝑥𝑖, 𝑆 Kurtosis 𝑅 𝑥𝑖, 𝑆 = 𝑓′ 𝑥𝑖, 𝑠90 − 𝑓′ 𝑥𝑖, 𝑠10 𝑓′ 𝑥𝑖, 𝑠75 − 𝑓′ 𝑥𝑖, 𝑠25 where 𝑠10, 𝑠25, 𝑠75 and 𝑠90 are scenarios that represent the 10th, 25th, 75th and 90th percentiles for 𝑓′ 𝑥𝑖, 𝑆

Editor's Notes

  1. Environmental decision-making has long required planning for an uncertain future An example is a water supply system: large-scale infrastructure, large costs, long planning period (decades)
  2. DMs are increasingly considering multiple plausible futures (scenarios) to represent futures where the relative likelihoods are unknown (example could be RCP4.5 and RPC8.5)
  3. For each of the projected futures (scenarios) we can calculate the system performance as per before (e.g. reliability) But we don’t know the likelihood of each of these futures
  4. So we consider the concept of robustness: the greatest performance, across as many futures as possible
  5. The literature for robustness is full of these different metrics But they are all reflecting different aspects of what it means to have the greatest performance over as many scenarios as possible Some focus on maximising performance for some scenarios Others focus on having consistent performance across as many scenarios as possible So it’s difficult for DMs to know which one to use, because they are all telling you different things
  6. One of the oldest metrics is the Maximin metric Wald, 1950 Economics Looking at maximising the worst-case performance
  7. Starr, 1963
  8. Kwakkel, 2016
  9. To begin the unifying framework, need to look at what all metrics have in common They all take the decision alternatives, …, …, and they filter this through the robustness metric to produce the robustness value
  10. The unifying framework allows the categorisation of the metrics according to 3 transformations, T1, T2, T3 All of the metrics use these transformations to get from the performance in each scenario, to the value of robustness We start with all of the performance values and apply the performance value transformation From these transformed values, we select a subset of scenarios From the subset, we calculate some value based on the distribution of performance values, and this gives us the robustness
  11. The performance values are transformed depending on the needs of the DM If there is an important threshold (e.g. supply > demand, see dotted line) then it may make sense to use a satisficing metric, where we only care about whether the decision alternative passes or fails. Otherwise you might be most interested in minimising the regret (cost of making the wrong choice). E.g. in a water supply system you want to minimise unnecessary overexpenditure You might just want to maximise the performance itself, in which case the Identity transform is fine. E.g. if you’re looking at a stream restoration with a fixed budget, it might be best to just consider maximising performance
  12. Second transformation is the scenario subset selection This is asking which performance values should be used? The worst-case? The best-case? Some subset? Or all values?
  13. This transformation is related to the level of risk aversion that the DM has At the bottom you have the maximin metric which represents a very risk averse metric, assuming that the worst-case scenario will happen This might be most relevant when considering the design of a dam, where failure could kill 1000s of residents At the top you have the maximax metric which represents a lower level of risk aversion This might be more relevant for something such as the design of a stormwater system, which will be unlikely to kill 1000s of people
  14. We have a distribution of transformed performance values in the selected scenarios Generally DMs are interested in the expected value. E.g. cost or reliability. And therefore use the sum or mean of the performance values For supplemental info, a DM might use variance or some higher-order statistic from the distribution to understand the difference between decision alternatives with similar expected value: Variance – spread of performance Skew – difference in performance under extreme conditions Kurtosis – consistency in performance
  15. We’ve developed this unifying framework with three transformations T1: Is there some important threshold such as supply > demand? Are you interested in minimising the cost of making a wrong decision? Are you interested just in maximising performance? T2: How risk averse do you need to be for this problem? This effects the scenarios that are selected. T3: Are you interested in the expected value of performance? Or are you interested in looking at how the performance varies across scenarios?