SlideShare a Scribd company logo
1 of 47
Download to read offline
Inexact computer model calibration
Matthew Plumlee
Industrial Engineering and Management Sciences
Northwestern University
Matthew Plumlee SAMSI, 2018 1 / 17
History Matching
In a case study on aligning oil models with real data:
[Craig, Goldstien, et al. (1997)] In many cases, a fully satis-
factory match is not obtained and usually it is hard to judge
whether this was due to a problem with the underlying model
or due to an inadequate search over the space of possible inputs.
Key idea: Views the discrepancy itself as a random object (perhaps
with some prior).
This is the first (but maybe not first?) use of
yO
pxi q  yM
pxi , uq bi pxi q  i .
Matthew Plumlee SAMSI, 2018 2 / 17
History Matching
In a case study on aligning oil models with real data:
[Craig, Goldstien, et al. (1997)] In many cases, a fully satis-
factory match is not obtained and usually it is hard to judge
whether this was due to a problem with the underlying model
or due to an inadequate search over the space of possible inputs.
Key idea: Views the discrepancy itself as a random object (perhaps
with some prior).
This is the first (but maybe not first?) use of
yO
pxi q  yM
pxi , uq bi pxi q  i .
Matthew Plumlee SAMSI, 2018 2 / 17
Kennedy and O’Hagan (2001)
[Kennedy and O’Hagan, 2001] Even if there is no parameter
uncertainty , is that we know the values of all the inputs required
to make a particular prediction of the process being modelled,
the predicted value will not equal the true value of the process.
Key idea: Modelling the discrepancy uncertainty with a prior.
The prior choice that was advocated was a Gaussian process, possibly
because it fits nicely into computation.
Discussion article, so we have some comments:
Peter Diggle and Keith Beven had some concerns about Gaussian
process priors, as they are complex and difficult to understand.
Many concerns about parameters.
Matthew Plumlee SAMSI, 2018 3 / 17
Kennedy and O’Hagan (2001)
[Kennedy and O’Hagan, 2001] Even if there is no parameter
uncertainty , is that we know the values of all the inputs required
to make a particular prediction of the process being modelled,
the predicted value will not equal the true value of the process.
Key idea: Modelling the discrepancy uncertainty with a prior.
The prior choice that was advocated was a Gaussian process, possibly
because it fits nicely into computation.
Discussion article, so we have some comments:
Peter Diggle and Keith Beven had some concerns about Gaussian
process priors, as they are complex and difficult to understand.
Many concerns about parameters.
Matthew Plumlee SAMSI, 2018 3 / 17
Application issues and some
abandonment
Farah et al. (2014), “if the BEA simulator were to be substantially
biased, the form of the bias would affect the posterior distribution
of the calibration parameters.”
Gramacy et al. (2015), “identification issues known to plague
[KO]-style calibration.”
It seems odd to define the relationship
yR
pxq  yM
px, uq bpxq
without fixing u. After all, how could this be true for all u?
Matthew Plumlee SAMSI, 2018 4 / 17
Application issues and some
abandonment
Farah et al. (2014), “if the BEA simulator were to be substantially
biased, the form of the bias would affect the posterior distribution
of the calibration parameters.”
Gramacy et al. (2015), “identification issues known to plague
[KO]-style calibration.”
It seems odd to define the relationship
yR
pxq  yM
px, uq bpxq
without fixing u. After all, how could this be true for all u?
Matthew Plumlee SAMSI, 2018 4 / 17
Application issues and some
abandonment
Farah et al. (2014), “if the BEA simulator were to be substantially
biased, the form of the bias would affect the posterior distribution
of the calibration parameters.”
Gramacy et al. (2015), “identification issues known to plague
[KO]-style calibration.”
It seems odd to define the relationship
yR
pxq  yM
px, uq bpxq
without fixing u. After all, how could this be true for all u?
Matthew Plumlee SAMSI, 2018 4 / 17
Application issues and some
abandonment
Farah et al. (2014), “if the BEA simulator were to be substantially
biased, the form of the bias would affect the posterior distribution
of the calibration parameters.”
Gramacy et al. (2015), “identification issues known to plague
[KO]-style calibration.”
It seems odd to define the relationship
yR
pxq  yM
px, uq bpxq
without fixing u. After all, how could this be true for all u?
Matthew Plumlee SAMSI, 2018 4 / 17
Back to the parameter controversy
Two philosophical remedies:
yR
pxq  yM
px, u¦q bpxq or yR
pxq  yM
px, uq bpx, uq.
Higdon, Kennedy et al. (2004) seem to imply some true u.
Loeppky, Bingham and Welch (2006) give some discussion and note
“Even if one could estimate the [parameter] exactly, it may be the case
that this value leads to a worse prediction of the process mean ...”
Bayarri et al. (2007) says that u should be in each term and should be
u¦.
However, it left ambiguous what, exactly, u¦ ought to be, aside from
the “true generating parameter”.
Matthew Plumlee SAMSI, 2018 5 / 17
Back to the parameter controversy
Two philosophical remedies:
yR
pxq  yM
px, u¦q bpxq or yR
pxq  yM
px, uq bpx, uq.
Higdon, Kennedy et al. (2004) seem to imply some true u.
Loeppky, Bingham and Welch (2006) give some discussion and note
“Even if one could estimate the [parameter] exactly, it may be the case
that this value leads to a worse prediction of the process mean ...”
Bayarri et al. (2007) says that u should be in each term and should be
u¦.
However, it left ambiguous what, exactly, u¦ ought to be, aside from
the “true generating parameter”.
Matthew Plumlee SAMSI, 2018 5 / 17
Back to the parameter controversy
Two philosophical remedies:
yR
pxq  yM
px, u¦q bpxq or yR
pxq  yM
px, uq bpx, uq.
Higdon, Kennedy et al. (2004) seem to imply some true u.
Loeppky, Bingham and Welch (2006) give some discussion and note
“Even if one could estimate the [parameter] exactly, it may be the case
that this value leads to a worse prediction of the process mean ...”
Bayarri et al. (2007) says that u should be in each term and should be
u¦.
However, it left ambiguous what, exactly, u¦ ought to be, aside from
the “true generating parameter”.
Matthew Plumlee SAMSI, 2018 5 / 17
Back to the parameter controversy
Two philosophical remedies:
yR
pxq  yM
px, u¦q bpxq or yR
pxq  yM
px, uq bpx, uq.
Higdon, Kennedy et al. (2004) seem to imply some true u.
Loeppky, Bingham and Welch (2006) give some discussion and note
“Even if one could estimate the [parameter] exactly, it may be the case
that this value leads to a worse prediction of the process mean ...”
Bayarri et al. (2007) says that u should be in each term and should be
u¦.
However, it left ambiguous what, exactly, u¦ ought to be, aside from
the “true generating parameter”.
Matthew Plumlee SAMSI, 2018 5 / 17
Back to the parameter controversy
Two philosophical remedies:
yR
pxq  yM
px, u¦q bpxq or yR
pxq  yM
px, uq bpx, uq.
Higdon, Kennedy et al. (2004) seem to imply some true u.
Loeppky, Bingham and Welch (2006) give some discussion and note
“Even if one could estimate the [parameter] exactly, it may be the case
that this value leads to a worse prediction of the process mean ...”
Bayarri et al. (2007) says that u should be in each term and should be
u¦.
However, it left ambiguous what, exactly, u¦ ought to be, aside from
the “true generating parameter”.
Matthew Plumlee SAMSI, 2018 5 / 17
An opinion
Parameters be outlined based some “optimal” form of the discrepancy
that is well-defined.
Some rational:
There is no recovery of the true parameter: “Abandon all hope, ye
who enter here.”
If you change the model, the parameter should change.
It should not depend on the data collected.
It should not be based on a prior.
We need to make clear what we are estimating for users of our
methods.
Matthew Plumlee SAMSI, 2018 6 / 17
An opinion
Parameters be outlined based some “optimal” form of the discrepancy
that is well-defined.
Some rational:
There is no recovery of the true parameter: “Abandon all hope, ye
who enter here.”
If you change the model, the parameter should change.
It should not depend on the data collected.
It should not be based on a prior.
We need to make clear what we are estimating for users of our
methods.
Matthew Plumlee SAMSI, 2018 6 / 17
An opinion
Parameters be outlined based some “optimal” form of the discrepancy
that is well-defined.
Some rational:
There is no recovery of the true parameter: “Abandon all hope, ye
who enter here.”
If you change the model, the parameter should change.
It should not depend on the data collected.
It should not be based on a prior.
We need to make clear what we are estimating for users of our
methods.
Matthew Plumlee SAMSI, 2018 6 / 17
An opinion
Parameters be outlined based some “optimal” form of the discrepancy
that is well-defined.
Some rational:
There is no recovery of the true parameter: “Abandon all hope, ye
who enter here.”
If you change the model, the parameter should change.
It should not depend on the data collected.
It should not be based on a prior.
We need to make clear what we are estimating for users of our
methods.
Matthew Plumlee SAMSI, 2018 6 / 17
An opinion
Parameters be outlined based some “optimal” form of the discrepancy
that is well-defined.
Some rational:
There is no recovery of the true parameter: “Abandon all hope, ye
who enter here.”
If you change the model, the parameter should change.
It should not depend on the data collected.
It should not be based on a prior.
We need to make clear what we are estimating for users of our
methods.
Matthew Plumlee SAMSI, 2018 6 / 17
An opinion
Parameters be outlined based some “optimal” form of the discrepancy
that is well-defined.
Some rational:
There is no recovery of the true parameter: “Abandon all hope, ye
who enter here.”
If you change the model, the parameter should change.
It should not depend on the data collected.
It should not be based on a prior.
We need to make clear what we are estimating for users of our
methods.
Matthew Plumlee SAMSI, 2018 6 / 17
An opinion
Parameters be outlined based some “optimal” form of the discrepancy
that is well-defined.
Some rational:
There is no recovery of the true parameter: “Abandon all hope, ye
who enter here.”
If you change the model, the parameter should change.
It should not depend on the data collected.
It should not be based on a prior.
We need to make clear what we are estimating for users of our
methods.
Matthew Plumlee SAMSI, 2018 6 / 17
Opinion, cont.
My personal affinity lies with a weighted L2 definition
u¦  arg min
»
X
pyR
pxq¡yM
px, uqq2
dµpxq.
Why?
Users understand this really well
It gives flexiblity on where the user finds the inputs the most
important.
But almost any norm will do (Sobolov, RKHS, etc), just make it clear!
Others have used it to define “tuning parameters”.
Matthew Plumlee SAMSI, 2018 7 / 17
Opinion, cont.
My personal affinity lies with a weighted L2 definition
u¦  arg min
»
X
pyR
pxq¡yM
px, uqq2
dµpxq.
Why?
Users understand this really well
It gives flexiblity on where the user finds the inputs the most
important.
But almost any norm will do (Sobolov, RKHS, etc), just make it clear!
Others have used it to define “tuning parameters”.
Matthew Plumlee SAMSI, 2018 7 / 17
Opinion, cont.
My personal affinity lies with a weighted L2 definition
u¦  arg min
»
X
pyR
pxq¡yM
px, uqq2
dµpxq.
Why?
Users understand this really well
It gives flexiblity on where the user finds the inputs the most
important.
But almost any norm will do (Sobolov, RKHS, etc), just make it clear!
Others have used it to define “tuning parameters”.
Matthew Plumlee SAMSI, 2018 7 / 17
History Matching (1997ish)
Matthew Plumlee SAMSI, 2018 8 / 17
Kennedy and O’Hagan (2001)
Matthew Plumlee SAMSI, 2018 8 / 17
L2 Calibration
Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the
premise that the MAP is not a good estimator of the L2 minimizer.
Key idea: Take our data and come up with some estimate of the true
function ˆyRp¤q. Then
ˆuL2
 arg min }ˆyR
¡yM
p¤, uq}L2 .
Under a litany of conditions, they show ˆuL2 , behaves exactly like if our
model was exactly correct!
This means that we just need to “re-center” our least squares
confidence interval, but same width.
Matthew Plumlee SAMSI, 2018 9 / 17
L2 Calibration
Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the
premise that the MAP is not a good estimator of the L2 minimizer.
Key idea: Take our data and come up with some estimate of the true
function ˆyRp¤q. Then
ˆuL2
 arg min }ˆyR
¡yM
p¤, uq}L2 .
Under a litany of conditions, they show ˆuL2 , behaves exactly like if our
model was exactly correct!
This means that we just need to “re-center” our least squares
confidence interval, but same width.
Matthew Plumlee SAMSI, 2018 9 / 17
L2 Calibration
Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the
premise that the MAP is not a good estimator of the L2 minimizer.
Key idea: Take our data and come up with some estimate of the true
function ˆyRp¤q. Then
ˆuL2
 arg min }ˆyR
¡yM
p¤, uq}L2 .
Under a litany of conditions, they show ˆuL2 , behaves exactly like if our
model was exactly correct!
This means that we just need to “re-center” our least squares
confidence interval, but same width.
Matthew Plumlee SAMSI, 2018 9 / 17
L2 Calibration
Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the
premise that the MAP is not a good estimator of the L2 minimizer.
Key idea: Take our data and come up with some estimate of the true
function ˆyRp¤q. Then
ˆuL2
 arg min }ˆyR
¡yM
p¤, uq}L2 .
Under a litany of conditions, they show ˆuL2 , behaves exactly like if our
model was exactly correct!
This means that we just need to “re-center” our least squares
confidence interval, but same width.
Matthew Plumlee SAMSI, 2018 9 / 17
L2 Calibration
Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the
premise that the MAP is not a good estimator of the L2 minimizer.
Key idea: Take our data and come up with some estimate of the true
function ˆyRp¤q. Then
ˆuL2
 arg min }ˆyR
¡yM
p¤, uq}L2 .
Under a litany of conditions, they show ˆuL2 , behaves exactly like if our
model was exactly correct!
This means that we just need to “re-center” our least squares
confidence interval, but same width.
Matthew Plumlee SAMSI, 2018 9 / 17
Matthew Plumlee SAMSI, 2018 10 / 17
Where are we again?
Trying to construct intervals with non-linear regression for inexact
models.
1960-1995 Na¨ıve approaches, in general, will not work unless your xi s
are exactly generated from µ or your model is exact.
1995-2013 History Matching produce huge intervals. Kennedy and
O’Hagan method produces large intervals, MAP away
from L2 parameter, often does not cover L2 parameter.
2013-Today L2 and like were proposed to basically work in large
samples.
Matthew Plumlee SAMSI, 2018 11 / 17
Where are we again?
Trying to construct intervals with non-linear regression for inexact
models.
1960-1995 Na¨ıve approaches, in general, will not work unless your xi s
are exactly generated from µ or your model is exact.
1995-2013 History Matching produce huge intervals. Kennedy and
O’Hagan method produces large intervals, MAP away
from L2 parameter, often does not cover L2 parameter.
2013-Today L2 and like were proposed to basically work in large
samples.
Matthew Plumlee SAMSI, 2018 11 / 17
Orthogonal approach
Key idea:
Theorem (Plumlee, 2017)
Suppose that u is the minimizer of L2 located in the interior of u € Rp.
Then bp¤q  must be such that
»
X
yM
px, uqpyR
pxq¡yM
px, uqqlooooooooooomooooooooooon
bpxq
dµpxq  0. p¦q
Simply use KO with
yR
pxq  yM
px, uq bupxq,
with orthogonal bu.
Theorem (Plumlee and Joseph, 2018)
Under reasonable conditions, we can construct a prior on a Gaussian
process such that p¦q is met.
Matthew Plumlee SAMSI, 2018 12 / 17
Orthogonal Process Approach
Matthew Plumlee SAMSI, 2018 13 / 17
Frequency of coverage of the L2 minimizer
Type Poor Exp Rich Exp Poor Obs Rich Obs
HM 1.00 1.00 1.00 1.00
KO 1.00 1.00 1.00 1.00
L2 0.87 0.92 0.58 0.98
OGP 1.00 1.00 1.00 1.00
Average Interval Width
Type Poor Exp Rich Exp Poor Obs Rich Obs
HM 3.18 3.54 3.62 3.91
KO 2.64 2.61 2.67 2.88
L2 0.61 0.37 1.15 0.63
OGP 2.55 0.94 3.75 1.23
Matthew Plumlee SAMSI, 2018 14 / 17
On discrepancy correction
One of the key features of Kennedy and O’Hagan was the concept of
bias correction.
That is, instead of using yMpx, u¦q as a predictor of the best model,
why not go straight to yMpx, u¦q bpxq?
Practitioners are sometimes unwilling to use the predictive discrepancy
function in making inferences because...
It rarely has the behavior they expect.
If their discrepancy is doing the heavy lifting, they want to fix their
model.
It is hard to communicate/transfer the model to colleagues.
So I argue the best model itself is something we want to conjecture on.
Matthew Plumlee SAMSI, 2018 15 / 17
On discrepancy correction
One of the key features of Kennedy and O’Hagan was the concept of
bias correction.
That is, instead of using yMpx, u¦q as a predictor of the best model,
why not go straight to yMpx, u¦q bpxq?
Practitioners are sometimes unwilling to use the predictive discrepancy
function in making inferences because...
It rarely has the behavior they expect.
If their discrepancy is doing the heavy lifting, they want to fix their
model.
It is hard to communicate/transfer the model to colleagues.
So I argue the best model itself is something we want to conjecture on.
Matthew Plumlee SAMSI, 2018 15 / 17
On discrepancy correction
One of the key features of Kennedy and O’Hagan was the concept of
bias correction.
That is, instead of using yMpx, u¦q as a predictor of the best model,
why not go straight to yMpx, u¦q bpxq?
Practitioners are sometimes unwilling to use the predictive discrepancy
function in making inferences because...
It rarely has the behavior they expect.
If their discrepancy is doing the heavy lifting, they want to fix their
model.
It is hard to communicate/transfer the model to colleagues.
So I argue the best model itself is something we want to conjecture on.
Matthew Plumlee SAMSI, 2018 15 / 17
On discrepancy correction
One of the key features of Kennedy and O’Hagan was the concept of
bias correction.
That is, instead of using yMpx, u¦q as a predictor of the best model,
why not go straight to yMpx, u¦q bpxq?
Practitioners are sometimes unwilling to use the predictive discrepancy
function in making inferences because...
It rarely has the behavior they expect.
If their discrepancy is doing the heavy lifting, they want to fix their
model.
It is hard to communicate/transfer the model to colleagues.
So I argue the best model itself is something we want to conjecture on.
Matthew Plumlee SAMSI, 2018 15 / 17
Cardiac example
time
0 5 10 15
normalizedcurrent
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
KO
time
0 5 10 15
OGP
Matthew Plumlee SAMSI, 2018 16 / 17
Cardiac example
time
0 5 10 15
normalizedcurrent
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
KO
time
0 5 10 15
OGP
Another side benefit: faster MCMC mixing.
KO: ESS about 6.5 / 1000 samples
OGP: ESS about 78.2 / 1000 samples
Matthew Plumlee SAMSI, 2018 16 / 17
Some future comments
When working on calibration, we should focus on getting
uncertainty right, not asymptotic value.
Measured relative to some parameter based on yR
p¤q and yM
p¤, ¤q.
We should not claim that we are covering nature’s parameter. This
fact depends on the state-of-the-art science and nature.
“Right” by inverse price-is-right rule.
We need to work from definitions of parameters, but L2 is not the
only one.
What is that weight function?
What about L1 or other measures of difference?
Smoothness?
So many bells and whistles were designed for KO. What needs to
be extended?
Functional Responses
Emulator interface
Multi-model cases
Matthew Plumlee SAMSI, 2018 17 / 17
Some future comments
When working on calibration, we should focus on getting
uncertainty right, not asymptotic value.
Measured relative to some parameter based on yR
p¤q and yM
p¤, ¤q.
We should not claim that we are covering nature’s parameter. This
fact depends on the state-of-the-art science and nature.
“Right” by inverse price-is-right rule.
We need to work from definitions of parameters, but L2 is not the
only one.
What is that weight function?
What about L1 or other measures of difference?
Smoothness?
So many bells and whistles were designed for KO. What needs to
be extended?
Functional Responses
Emulator interface
Multi-model cases
Matthew Plumlee SAMSI, 2018 17 / 17
Some future comments
When working on calibration, we should focus on getting
uncertainty right, not asymptotic value.
Measured relative to some parameter based on yR
p¤q and yM
p¤, ¤q.
We should not claim that we are covering nature’s parameter. This
fact depends on the state-of-the-art science and nature.
“Right” by inverse price-is-right rule.
We need to work from definitions of parameters, but L2 is not the
only one.
What is that weight function?
What about L1 or other measures of difference?
Smoothness?
So many bells and whistles were designed for KO. What needs to
be extended?
Functional Responses
Emulator interface
Multi-model cases
Matthew Plumlee SAMSI, 2018 17 / 17
Some future comments
When working on calibration, we should focus on getting
uncertainty right, not asymptotic value.
Measured relative to some parameter based on yR
p¤q and yM
p¤, ¤q.
We should not claim that we are covering nature’s parameter. This
fact depends on the state-of-the-art science and nature.
“Right” by inverse price-is-right rule.
We need to work from definitions of parameters, but L2 is not the
only one.
What is that weight function?
What about L1 or other measures of difference?
Smoothness?
So many bells and whistles were designed for KO. What needs to
be extended?
Functional Responses
Emulator interface
Multi-model cases
Matthew Plumlee SAMSI, 2018 17 / 17

More Related Content

Similar to MUMS Opening Workshop - Panel Discussion: Inexact Computer Model Calibration, Mathhew Plumlee, August 21, 2018

Ejbrm volume6-issue1-article183
Ejbrm volume6-issue1-article183Ejbrm volume6-issue1-article183
Ejbrm volume6-issue1-article183Soma Sinha Roy
 
discussion on Bayesian restricted likelihood
discussion on Bayesian restricted likelihooddiscussion on Bayesian restricted likelihood
discussion on Bayesian restricted likelihoodChristian Robert
 
Alexandr Honchar. Financial ML != ML and Finance
Alexandr Honchar. Financial ML != ML and FinanceAlexandr Honchar. Financial ML != ML and Finance
Alexandr Honchar. Financial ML != ML and FinanceLviv Startup Club
 
What If? Demystifying AI Decisions with Counterfactuals
What If? Demystifying AI Decisions with CounterfactualsWhat If? Demystifying AI Decisions with Counterfactuals
What If? Demystifying AI Decisions with Counterfactualsdavidmartens007
 
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...jemille6
 
Design Analysis - Planning for Your Next SOLIDWORKS Simulation Project
Design Analysis - Planning for Your Next SOLIDWORKS Simulation ProjectDesign Analysis - Planning for Your Next SOLIDWORKS Simulation Project
Design Analysis - Planning for Your Next SOLIDWORKS Simulation ProjectHawk Ridge Systems
 
Matching Criteria
Matching CriteriaMatching Criteria
Matching CriteriaHugh Knight
 
Is it important to explain a theorem? A case study in UML and ALCQI
Is it important to explain a theorem? A case study in UML and ALCQIIs it important to explain a theorem? A case study in UML and ALCQI
Is it important to explain a theorem? A case study in UML and ALCQIAlexandre Rademaker
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AIBill Liu
 
Why are data transformations a bad choice in statistics
Why are data transformations a bad choice in statisticsWhy are data transformations a bad choice in statistics
Why are data transformations a bad choice in statisticsAdrian Olszewski
 
Explore, Explain, and Debug aka Interpretable Machine Learning
Explore, Explain, and Debug aka Interpretable Machine LearningExplore, Explain, and Debug aka Interpretable Machine Learning
Explore, Explain, and Debug aka Interpretable Machine LearningPrzemek Biecek
 
Thinking Outside The Box Florida Final[1]
Thinking Outside The Box Florida Final[1]Thinking Outside The Box Florida Final[1]
Thinking Outside The Box Florida Final[1]bohammer
 
Probabilistic Programming in Python
Probabilistic Programming in PythonProbabilistic Programming in Python
Probabilistic Programming in PythonPeadar Coyle
 
STAT 778 Project Proposal - Jonathan Poon
STAT 778 Project Proposal - Jonathan PoonSTAT 778 Project Proposal - Jonathan Poon
STAT 778 Project Proposal - Jonathan PoonJonathan Poon
 
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...Daniel Katz
 
Hexawise Soap Opera Testing
Hexawise Soap Opera TestingHexawise Soap Opera Testing
Hexawise Soap Opera TestingTyler Klose
 

Similar to MUMS Opening Workshop - Panel Discussion: Inexact Computer Model Calibration, Mathhew Plumlee, August 21, 2018 (20)

Ejbrm volume6-issue1-article183
Ejbrm volume6-issue1-article183Ejbrm volume6-issue1-article183
Ejbrm volume6-issue1-article183
 
50134 09
50134 0950134 09
50134 09
 
discussion on Bayesian restricted likelihood
discussion on Bayesian restricted likelihooddiscussion on Bayesian restricted likelihood
discussion on Bayesian restricted likelihood
 
Alexandr Honchar. Financial ML != ML and Finance
Alexandr Honchar. Financial ML != ML and FinanceAlexandr Honchar. Financial ML != ML and Finance
Alexandr Honchar. Financial ML != ML and Finance
 
What If? Demystifying AI Decisions with Counterfactuals
What If? Demystifying AI Decisions with CounterfactualsWhat If? Demystifying AI Decisions with Counterfactuals
What If? Demystifying AI Decisions with Counterfactuals
 
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
 
Design Analysis - Planning for Your Next SOLIDWORKS Simulation Project
Design Analysis - Planning for Your Next SOLIDWORKS Simulation ProjectDesign Analysis - Planning for Your Next SOLIDWORKS Simulation Project
Design Analysis - Planning for Your Next SOLIDWORKS Simulation Project
 
Matching Criteria
Matching CriteriaMatching Criteria
Matching Criteria
 
Is it important to explain a theorem? A case study in UML and ALCQI
Is it important to explain a theorem? A case study in UML and ALCQIIs it important to explain a theorem? A case study in UML and ALCQI
Is it important to explain a theorem? A case study in UML and ALCQI
 
StatsModelling
StatsModellingStatsModelling
StatsModelling
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AI
 
Why are data transformations a bad choice in statistics
Why are data transformations a bad choice in statisticsWhy are data transformations a bad choice in statistics
Why are data transformations a bad choice in statistics
 
Explore, Explain, and Debug aka Interpretable Machine Learning
Explore, Explain, and Debug aka Interpretable Machine LearningExplore, Explain, and Debug aka Interpretable Machine Learning
Explore, Explain, and Debug aka Interpretable Machine Learning
 
Statsci
StatsciStatsci
Statsci
 
Thinking Outside The Box Florida Final[1]
Thinking Outside The Box Florida Final[1]Thinking Outside The Box Florida Final[1]
Thinking Outside The Box Florida Final[1]
 
Probabilistic Programming in Python
Probabilistic Programming in PythonProbabilistic Programming in Python
Probabilistic Programming in Python
 
STAT 778 Project Proposal - Jonathan Poon
STAT 778 Project Proposal - Jonathan PoonSTAT 778 Project Proposal - Jonathan Poon
STAT 778 Project Proposal - Jonathan Poon
 
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
 
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
MUMS Opening Workshop -On the Impact(s) of Structural Model Error on Simulati...
 
Hexawise Soap Opera Testing
Hexawise Soap Opera TestingHexawise Soap Opera Testing
Hexawise Soap Opera Testing
 

More from The Statistical and Applied Mathematical Sciences Institute

More from The Statistical and Applied Mathematical Sciences Institute (20)

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
 
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
 
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
 
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
 
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
 
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
 
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
 
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
 
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
 
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
 
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
 
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
 
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
 
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
 
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
 
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
 
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
 
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
 
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 

Recently uploaded

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesShubhangi Sonawane
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIShubhangi Sonawane
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 

Recently uploaded (20)

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 

MUMS Opening Workshop - Panel Discussion: Inexact Computer Model Calibration, Mathhew Plumlee, August 21, 2018

  • 1. Inexact computer model calibration Matthew Plumlee Industrial Engineering and Management Sciences Northwestern University Matthew Plumlee SAMSI, 2018 1 / 17
  • 2. History Matching In a case study on aligning oil models with real data: [Craig, Goldstien, et al. (1997)] In many cases, a fully satis- factory match is not obtained and usually it is hard to judge whether this was due to a problem with the underlying model or due to an inadequate search over the space of possible inputs. Key idea: Views the discrepancy itself as a random object (perhaps with some prior). This is the first (but maybe not first?) use of yO pxi q yM pxi , uq bi pxi q  i . Matthew Plumlee SAMSI, 2018 2 / 17
  • 3. History Matching In a case study on aligning oil models with real data: [Craig, Goldstien, et al. (1997)] In many cases, a fully satis- factory match is not obtained and usually it is hard to judge whether this was due to a problem with the underlying model or due to an inadequate search over the space of possible inputs. Key idea: Views the discrepancy itself as a random object (perhaps with some prior). This is the first (but maybe not first?) use of yO pxi q yM pxi , uq bi pxi q  i . Matthew Plumlee SAMSI, 2018 2 / 17
  • 4. Kennedy and O’Hagan (2001) [Kennedy and O’Hagan, 2001] Even if there is no parameter uncertainty , is that we know the values of all the inputs required to make a particular prediction of the process being modelled, the predicted value will not equal the true value of the process. Key idea: Modelling the discrepancy uncertainty with a prior. The prior choice that was advocated was a Gaussian process, possibly because it fits nicely into computation. Discussion article, so we have some comments: Peter Diggle and Keith Beven had some concerns about Gaussian process priors, as they are complex and difficult to understand. Many concerns about parameters. Matthew Plumlee SAMSI, 2018 3 / 17
  • 5. Kennedy and O’Hagan (2001) [Kennedy and O’Hagan, 2001] Even if there is no parameter uncertainty , is that we know the values of all the inputs required to make a particular prediction of the process being modelled, the predicted value will not equal the true value of the process. Key idea: Modelling the discrepancy uncertainty with a prior. The prior choice that was advocated was a Gaussian process, possibly because it fits nicely into computation. Discussion article, so we have some comments: Peter Diggle and Keith Beven had some concerns about Gaussian process priors, as they are complex and difficult to understand. Many concerns about parameters. Matthew Plumlee SAMSI, 2018 3 / 17
  • 6. Application issues and some abandonment Farah et al. (2014), “if the BEA simulator were to be substantially biased, the form of the bias would affect the posterior distribution of the calibration parameters.” Gramacy et al. (2015), “identification issues known to plague [KO]-style calibration.” It seems odd to define the relationship yR pxq yM px, uq bpxq without fixing u. After all, how could this be true for all u? Matthew Plumlee SAMSI, 2018 4 / 17
  • 7. Application issues and some abandonment Farah et al. (2014), “if the BEA simulator were to be substantially biased, the form of the bias would affect the posterior distribution of the calibration parameters.” Gramacy et al. (2015), “identification issues known to plague [KO]-style calibration.” It seems odd to define the relationship yR pxq yM px, uq bpxq without fixing u. After all, how could this be true for all u? Matthew Plumlee SAMSI, 2018 4 / 17
  • 8. Application issues and some abandonment Farah et al. (2014), “if the BEA simulator were to be substantially biased, the form of the bias would affect the posterior distribution of the calibration parameters.” Gramacy et al. (2015), “identification issues known to plague [KO]-style calibration.” It seems odd to define the relationship yR pxq yM px, uq bpxq without fixing u. After all, how could this be true for all u? Matthew Plumlee SAMSI, 2018 4 / 17
  • 9. Application issues and some abandonment Farah et al. (2014), “if the BEA simulator were to be substantially biased, the form of the bias would affect the posterior distribution of the calibration parameters.” Gramacy et al. (2015), “identification issues known to plague [KO]-style calibration.” It seems odd to define the relationship yR pxq yM px, uq bpxq without fixing u. After all, how could this be true for all u? Matthew Plumlee SAMSI, 2018 4 / 17
  • 10. Back to the parameter controversy Two philosophical remedies: yR pxq yM px, u¦q bpxq or yR pxq yM px, uq bpx, uq. Higdon, Kennedy et al. (2004) seem to imply some true u. Loeppky, Bingham and Welch (2006) give some discussion and note “Even if one could estimate the [parameter] exactly, it may be the case that this value leads to a worse prediction of the process mean ...” Bayarri et al. (2007) says that u should be in each term and should be u¦. However, it left ambiguous what, exactly, u¦ ought to be, aside from the “true generating parameter”. Matthew Plumlee SAMSI, 2018 5 / 17
  • 11. Back to the parameter controversy Two philosophical remedies: yR pxq yM px, u¦q bpxq or yR pxq yM px, uq bpx, uq. Higdon, Kennedy et al. (2004) seem to imply some true u. Loeppky, Bingham and Welch (2006) give some discussion and note “Even if one could estimate the [parameter] exactly, it may be the case that this value leads to a worse prediction of the process mean ...” Bayarri et al. (2007) says that u should be in each term and should be u¦. However, it left ambiguous what, exactly, u¦ ought to be, aside from the “true generating parameter”. Matthew Plumlee SAMSI, 2018 5 / 17
  • 12. Back to the parameter controversy Two philosophical remedies: yR pxq yM px, u¦q bpxq or yR pxq yM px, uq bpx, uq. Higdon, Kennedy et al. (2004) seem to imply some true u. Loeppky, Bingham and Welch (2006) give some discussion and note “Even if one could estimate the [parameter] exactly, it may be the case that this value leads to a worse prediction of the process mean ...” Bayarri et al. (2007) says that u should be in each term and should be u¦. However, it left ambiguous what, exactly, u¦ ought to be, aside from the “true generating parameter”. Matthew Plumlee SAMSI, 2018 5 / 17
  • 13. Back to the parameter controversy Two philosophical remedies: yR pxq yM px, u¦q bpxq or yR pxq yM px, uq bpx, uq. Higdon, Kennedy et al. (2004) seem to imply some true u. Loeppky, Bingham and Welch (2006) give some discussion and note “Even if one could estimate the [parameter] exactly, it may be the case that this value leads to a worse prediction of the process mean ...” Bayarri et al. (2007) says that u should be in each term and should be u¦. However, it left ambiguous what, exactly, u¦ ought to be, aside from the “true generating parameter”. Matthew Plumlee SAMSI, 2018 5 / 17
  • 14. Back to the parameter controversy Two philosophical remedies: yR pxq yM px, u¦q bpxq or yR pxq yM px, uq bpx, uq. Higdon, Kennedy et al. (2004) seem to imply some true u. Loeppky, Bingham and Welch (2006) give some discussion and note “Even if one could estimate the [parameter] exactly, it may be the case that this value leads to a worse prediction of the process mean ...” Bayarri et al. (2007) says that u should be in each term and should be u¦. However, it left ambiguous what, exactly, u¦ ought to be, aside from the “true generating parameter”. Matthew Plumlee SAMSI, 2018 5 / 17
  • 15. An opinion Parameters be outlined based some “optimal” form of the discrepancy that is well-defined. Some rational: There is no recovery of the true parameter: “Abandon all hope, ye who enter here.” If you change the model, the parameter should change. It should not depend on the data collected. It should not be based on a prior. We need to make clear what we are estimating for users of our methods. Matthew Plumlee SAMSI, 2018 6 / 17
  • 16. An opinion Parameters be outlined based some “optimal” form of the discrepancy that is well-defined. Some rational: There is no recovery of the true parameter: “Abandon all hope, ye who enter here.” If you change the model, the parameter should change. It should not depend on the data collected. It should not be based on a prior. We need to make clear what we are estimating for users of our methods. Matthew Plumlee SAMSI, 2018 6 / 17
  • 17. An opinion Parameters be outlined based some “optimal” form of the discrepancy that is well-defined. Some rational: There is no recovery of the true parameter: “Abandon all hope, ye who enter here.” If you change the model, the parameter should change. It should not depend on the data collected. It should not be based on a prior. We need to make clear what we are estimating for users of our methods. Matthew Plumlee SAMSI, 2018 6 / 17
  • 18. An opinion Parameters be outlined based some “optimal” form of the discrepancy that is well-defined. Some rational: There is no recovery of the true parameter: “Abandon all hope, ye who enter here.” If you change the model, the parameter should change. It should not depend on the data collected. It should not be based on a prior. We need to make clear what we are estimating for users of our methods. Matthew Plumlee SAMSI, 2018 6 / 17
  • 19. An opinion Parameters be outlined based some “optimal” form of the discrepancy that is well-defined. Some rational: There is no recovery of the true parameter: “Abandon all hope, ye who enter here.” If you change the model, the parameter should change. It should not depend on the data collected. It should not be based on a prior. We need to make clear what we are estimating for users of our methods. Matthew Plumlee SAMSI, 2018 6 / 17
  • 20. An opinion Parameters be outlined based some “optimal” form of the discrepancy that is well-defined. Some rational: There is no recovery of the true parameter: “Abandon all hope, ye who enter here.” If you change the model, the parameter should change. It should not depend on the data collected. It should not be based on a prior. We need to make clear what we are estimating for users of our methods. Matthew Plumlee SAMSI, 2018 6 / 17
  • 21. An opinion Parameters be outlined based some “optimal” form of the discrepancy that is well-defined. Some rational: There is no recovery of the true parameter: “Abandon all hope, ye who enter here.” If you change the model, the parameter should change. It should not depend on the data collected. It should not be based on a prior. We need to make clear what we are estimating for users of our methods. Matthew Plumlee SAMSI, 2018 6 / 17
  • 22. Opinion, cont. My personal affinity lies with a weighted L2 definition u¦ arg min » X pyR pxq¡yM px, uqq2 dµpxq. Why? Users understand this really well It gives flexiblity on where the user finds the inputs the most important. But almost any norm will do (Sobolov, RKHS, etc), just make it clear! Others have used it to define “tuning parameters”. Matthew Plumlee SAMSI, 2018 7 / 17
  • 23. Opinion, cont. My personal affinity lies with a weighted L2 definition u¦ arg min » X pyR pxq¡yM px, uqq2 dµpxq. Why? Users understand this really well It gives flexiblity on where the user finds the inputs the most important. But almost any norm will do (Sobolov, RKHS, etc), just make it clear! Others have used it to define “tuning parameters”. Matthew Plumlee SAMSI, 2018 7 / 17
  • 24. Opinion, cont. My personal affinity lies with a weighted L2 definition u¦ arg min » X pyR pxq¡yM px, uqq2 dµpxq. Why? Users understand this really well It gives flexiblity on where the user finds the inputs the most important. But almost any norm will do (Sobolov, RKHS, etc), just make it clear! Others have used it to define “tuning parameters”. Matthew Plumlee SAMSI, 2018 7 / 17
  • 25. History Matching (1997ish) Matthew Plumlee SAMSI, 2018 8 / 17
  • 26. Kennedy and O’Hagan (2001) Matthew Plumlee SAMSI, 2018 8 / 17
  • 27. L2 Calibration Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the premise that the MAP is not a good estimator of the L2 minimizer. Key idea: Take our data and come up with some estimate of the true function ˆyRp¤q. Then ˆuL2 arg min }ˆyR ¡yM p¤, uq}L2 . Under a litany of conditions, they show ˆuL2 , behaves exactly like if our model was exactly correct! This means that we just need to “re-center” our least squares confidence interval, but same width. Matthew Plumlee SAMSI, 2018 9 / 17
  • 28. L2 Calibration Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the premise that the MAP is not a good estimator of the L2 minimizer. Key idea: Take our data and come up with some estimate of the true function ˆyRp¤q. Then ˆuL2 arg min }ˆyR ¡yM p¤, uq}L2 . Under a litany of conditions, they show ˆuL2 , behaves exactly like if our model was exactly correct! This means that we just need to “re-center” our least squares confidence interval, but same width. Matthew Plumlee SAMSI, 2018 9 / 17
  • 29. L2 Calibration Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the premise that the MAP is not a good estimator of the L2 minimizer. Key idea: Take our data and come up with some estimate of the true function ˆyRp¤q. Then ˆuL2 arg min }ˆyR ¡yM p¤, uq}L2 . Under a litany of conditions, they show ˆuL2 , behaves exactly like if our model was exactly correct! This means that we just need to “re-center” our least squares confidence interval, but same width. Matthew Plumlee SAMSI, 2018 9 / 17
  • 30. L2 Calibration Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the premise that the MAP is not a good estimator of the L2 minimizer. Key idea: Take our data and come up with some estimate of the true function ˆyRp¤q. Then ˆuL2 arg min }ˆyR ¡yM p¤, uq}L2 . Under a litany of conditions, they show ˆuL2 , behaves exactly like if our model was exactly correct! This means that we just need to “re-center” our least squares confidence interval, but same width. Matthew Plumlee SAMSI, 2018 9 / 17
  • 31. L2 Calibration Tuo and Wu (2015, Annals) criticize Kennedy and O’Hagan under the premise that the MAP is not a good estimator of the L2 minimizer. Key idea: Take our data and come up with some estimate of the true function ˆyRp¤q. Then ˆuL2 arg min }ˆyR ¡yM p¤, uq}L2 . Under a litany of conditions, they show ˆuL2 , behaves exactly like if our model was exactly correct! This means that we just need to “re-center” our least squares confidence interval, but same width. Matthew Plumlee SAMSI, 2018 9 / 17
  • 32. Matthew Plumlee SAMSI, 2018 10 / 17
  • 33. Where are we again? Trying to construct intervals with non-linear regression for inexact models. 1960-1995 Na¨ıve approaches, in general, will not work unless your xi s are exactly generated from µ or your model is exact. 1995-2013 History Matching produce huge intervals. Kennedy and O’Hagan method produces large intervals, MAP away from L2 parameter, often does not cover L2 parameter. 2013-Today L2 and like were proposed to basically work in large samples. Matthew Plumlee SAMSI, 2018 11 / 17
  • 34. Where are we again? Trying to construct intervals with non-linear regression for inexact models. 1960-1995 Na¨ıve approaches, in general, will not work unless your xi s are exactly generated from µ or your model is exact. 1995-2013 History Matching produce huge intervals. Kennedy and O’Hagan method produces large intervals, MAP away from L2 parameter, often does not cover L2 parameter. 2013-Today L2 and like were proposed to basically work in large samples. Matthew Plumlee SAMSI, 2018 11 / 17
  • 35. Orthogonal approach Key idea: Theorem (Plumlee, 2017) Suppose that u is the minimizer of L2 located in the interior of u € Rp. Then bp¤q must be such that » X yM px, uqpyR pxq¡yM px, uqqlooooooooooomooooooooooon bpxq dµpxq 0. p¦q Simply use KO with yR pxq yM px, uq bupxq, with orthogonal bu. Theorem (Plumlee and Joseph, 2018) Under reasonable conditions, we can construct a prior on a Gaussian process such that p¦q is met. Matthew Plumlee SAMSI, 2018 12 / 17
  • 36. Orthogonal Process Approach Matthew Plumlee SAMSI, 2018 13 / 17
  • 37. Frequency of coverage of the L2 minimizer Type Poor Exp Rich Exp Poor Obs Rich Obs HM 1.00 1.00 1.00 1.00 KO 1.00 1.00 1.00 1.00 L2 0.87 0.92 0.58 0.98 OGP 1.00 1.00 1.00 1.00 Average Interval Width Type Poor Exp Rich Exp Poor Obs Rich Obs HM 3.18 3.54 3.62 3.91 KO 2.64 2.61 2.67 2.88 L2 0.61 0.37 1.15 0.63 OGP 2.55 0.94 3.75 1.23 Matthew Plumlee SAMSI, 2018 14 / 17
  • 38. On discrepancy correction One of the key features of Kennedy and O’Hagan was the concept of bias correction. That is, instead of using yMpx, u¦q as a predictor of the best model, why not go straight to yMpx, u¦q bpxq? Practitioners are sometimes unwilling to use the predictive discrepancy function in making inferences because... It rarely has the behavior they expect. If their discrepancy is doing the heavy lifting, they want to fix their model. It is hard to communicate/transfer the model to colleagues. So I argue the best model itself is something we want to conjecture on. Matthew Plumlee SAMSI, 2018 15 / 17
  • 39. On discrepancy correction One of the key features of Kennedy and O’Hagan was the concept of bias correction. That is, instead of using yMpx, u¦q as a predictor of the best model, why not go straight to yMpx, u¦q bpxq? Practitioners are sometimes unwilling to use the predictive discrepancy function in making inferences because... It rarely has the behavior they expect. If their discrepancy is doing the heavy lifting, they want to fix their model. It is hard to communicate/transfer the model to colleagues. So I argue the best model itself is something we want to conjecture on. Matthew Plumlee SAMSI, 2018 15 / 17
  • 40. On discrepancy correction One of the key features of Kennedy and O’Hagan was the concept of bias correction. That is, instead of using yMpx, u¦q as a predictor of the best model, why not go straight to yMpx, u¦q bpxq? Practitioners are sometimes unwilling to use the predictive discrepancy function in making inferences because... It rarely has the behavior they expect. If their discrepancy is doing the heavy lifting, they want to fix their model. It is hard to communicate/transfer the model to colleagues. So I argue the best model itself is something we want to conjecture on. Matthew Plumlee SAMSI, 2018 15 / 17
  • 41. On discrepancy correction One of the key features of Kennedy and O’Hagan was the concept of bias correction. That is, instead of using yMpx, u¦q as a predictor of the best model, why not go straight to yMpx, u¦q bpxq? Practitioners are sometimes unwilling to use the predictive discrepancy function in making inferences because... It rarely has the behavior they expect. If their discrepancy is doing the heavy lifting, they want to fix their model. It is hard to communicate/transfer the model to colleagues. So I argue the best model itself is something we want to conjecture on. Matthew Plumlee SAMSI, 2018 15 / 17
  • 42. Cardiac example time 0 5 10 15 normalizedcurrent 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 KO time 0 5 10 15 OGP Matthew Plumlee SAMSI, 2018 16 / 17
  • 43. Cardiac example time 0 5 10 15 normalizedcurrent 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 KO time 0 5 10 15 OGP Another side benefit: faster MCMC mixing. KO: ESS about 6.5 / 1000 samples OGP: ESS about 78.2 / 1000 samples Matthew Plumlee SAMSI, 2018 16 / 17
  • 44. Some future comments When working on calibration, we should focus on getting uncertainty right, not asymptotic value. Measured relative to some parameter based on yR p¤q and yM p¤, ¤q. We should not claim that we are covering nature’s parameter. This fact depends on the state-of-the-art science and nature. “Right” by inverse price-is-right rule. We need to work from definitions of parameters, but L2 is not the only one. What is that weight function? What about L1 or other measures of difference? Smoothness? So many bells and whistles were designed for KO. What needs to be extended? Functional Responses Emulator interface Multi-model cases Matthew Plumlee SAMSI, 2018 17 / 17
  • 45. Some future comments When working on calibration, we should focus on getting uncertainty right, not asymptotic value. Measured relative to some parameter based on yR p¤q and yM p¤, ¤q. We should not claim that we are covering nature’s parameter. This fact depends on the state-of-the-art science and nature. “Right” by inverse price-is-right rule. We need to work from definitions of parameters, but L2 is not the only one. What is that weight function? What about L1 or other measures of difference? Smoothness? So many bells and whistles were designed for KO. What needs to be extended? Functional Responses Emulator interface Multi-model cases Matthew Plumlee SAMSI, 2018 17 / 17
  • 46. Some future comments When working on calibration, we should focus on getting uncertainty right, not asymptotic value. Measured relative to some parameter based on yR p¤q and yM p¤, ¤q. We should not claim that we are covering nature’s parameter. This fact depends on the state-of-the-art science and nature. “Right” by inverse price-is-right rule. We need to work from definitions of parameters, but L2 is not the only one. What is that weight function? What about L1 or other measures of difference? Smoothness? So many bells and whistles were designed for KO. What needs to be extended? Functional Responses Emulator interface Multi-model cases Matthew Plumlee SAMSI, 2018 17 / 17
  • 47. Some future comments When working on calibration, we should focus on getting uncertainty right, not asymptotic value. Measured relative to some parameter based on yR p¤q and yM p¤, ¤q. We should not claim that we are covering nature’s parameter. This fact depends on the state-of-the-art science and nature. “Right” by inverse price-is-right rule. We need to work from definitions of parameters, but L2 is not the only one. What is that weight function? What about L1 or other measures of difference? Smoothness? So many bells and whistles were designed for KO. What needs to be extended? Functional Responses Emulator interface Multi-model cases Matthew Plumlee SAMSI, 2018 17 / 17