SlideShare a Scribd company logo
1 of 72
Download to read offline
The importance of foundations
Nancy Reid
University of Toronto
June 8 2023
Introduction
What are the foundations of statistics? Savage 1954
It is unanimously agreed that
statistics depends somehow on
probability. But, as to what prob-
ability is and how it is connected
with statistics, there has seldom
been such complete disagreement
and breakdown of communication
since the Tower of Babel. Doubt-
less, much of the disagreement is
merely terminological and would
disappear under sufficiently sharp
analysis.
JSM August 2023 2
... What are the foundations of statistics? R & Cox, 2015
• Statistics needs a healthy interplay between theory and applications
• theory meaning foundations, rather than theoretical analysis of specific techniques
• must be continually tested against new applications
• “the practical application of general theorems is a different art
from their establishment by mathematical proof” Fisher 1958 SMRW
JSM August 2023 3
... What are the foundations of statistics?
• probability, analysis, applied mathematics modelling
• Bayes, Neyman, Fisher approaches to inference
• nature of uncertainty epistemic, empirical
• nature of induction belief functions, inferential models
• interpretation of p-values, confidence regions, credibility intervals, likelihood ratios
• role of sufficiency, ancillarity, conditioning, asymptotic theory
• sparsity, causality, assumption-free/lean inference, stability, prediction, decisions
JSM August 2023 4
... What are the foundations of statistics? Cox 2006
I’m fairly cautious about the im-
pact of the book in that it really
is very cryptic indeed on key issues
but we will see. In particular quite
apart from the Bayesian stuff I have
essentially discarded (not rejected)
the Neyman-Pearson machinery in
favour of Fisher’s original approach
and I am sure this is the right route.
Thanks to AWF Edwards
JSM August 2023 5
... What are the foundations of statistics? Cox 2006
I’m fairly cautious about the im-
pact of the book in that it really
is very cryptic indeed on key issues
but we will see. In particular quite
apart from the Bayesian stuff I have
essentially discarded (not rejected)
the Neyman-Pearson machinery in
favour of Fisher’s original approach
and I am sure this is the right route.
Thanks to AWF Edwards
“dispassionate assessment, frail attempt at, 1–196”
JSM August 2023 6
What use are foundations?
• provide a rigorous basis for the development of techniques
• provide a common language for particular classes of problems
• help to clarify the nature of uncertainty in scientific conclusions
• highlight aspects of data analysis which are likely to raise difficult issues
• suggest strategies for tackling highly complex problems
• avoid ‘re-inventing the wheel’ for each new application
JSM August 2023 7
Applications
Theory and Applications Example 1
NY Times, April 27
World Weather Attribution, April 27
JSM August 2023 8
Applications Example 2
Applications Example 2
Applications Example 2
Applications Example 3
Applications Example 3
Statistics is Everywhere Example 4
Statistics is Everywhere Example 4
Linking theory with practice
1. Drought Climate change attribution
climate models, subgroup analyses, predictions
2. Covid Treatment with Ivermectin
randomized trial, proportional hazards regression, Bayes/frequentist
3. Women Co-authorship and gender
linear regression, binary outcome, confounding
4. Faces Human perception
sign test, regression, computational modelling
JSM August 2023 15
Applications
Drought
Drought World Weather Attribution 2023
JSM August 2023 16
The source Kimutai et al. 2023
Link
JSM August 2023 17
The data Kimutai et al. 2023
• observational data, 3 sources
1. global daily rainfall & temperature 0.5◦
× 0.5◦
, 1979 –
2. daily rainfall infra-red, “SoA”, 1981 –
3. monthly rainfall 1981–2014
• 4-year smoothed mean surface temperature proxy for anthropogenic climate change
• climate modelling data, 4 sources
1. combine 12 global and 8 climate models: resolution 0.44◦
29 sims
2. combine 5 global and 4 climate models: resolution 0.22◦
10 sims
3. atmosphere-ocean coupled GCMs (two) 10/3 simulations
4. sea-surface temperature forced ensemble, high resolution 11 simulations
• thanks to Whitney Huang for many clarifications
JSM August 2023 18
The analysis Kimutai et al. 2023
• response is log10(monthly rainfall) in 2021 and 2022
and log10(PET) — potential evapotranspiration
• covariates are global temperature anomaly, and ENSO index
El Nino-Southern Oscillation
• “As a measure of anthropogenic climate change we use smoothed GMST”
Global Mean Surface Temperature
• “Methods for observational and model analysis ... and synthesis are used according
to the World Weather Attribution Protocol” Philip et al. 2020
1. trend using observational data
2. find climate models consistent with 1.
3. compare predictions from 1. and 2.
4. synthesize results in 3. to provide conclusions
JSM August 2023 19
The results Kimutai et al. 2023
• rainfall decreasing with
increasing temperature
but not much
• 2022 rainfall is about a
1 in 20 year event
• 2022 drought about
2 times more likely under
climate change
• uncertainty 0.1 — 360
JSM August 2023 20
2 times more likely? Kimutai et al. 2023
• change the response to SPEI
rainfall adjusted for evaporation
• 2022 drought now 5500 times
more likely
uncertainty 32 to 4 × 108
• consider ‘long rains’ and ‘short
rains’ separately MAM, OND
• combine model simulation
results with observational
data
JSM August 2023 21
Combining climate simulations and data Kimutai et al. 2023
precipitation
adjusted
for evaporation
JSM August 2023 22
forest plots
The theory Whitney Huang
• extrapolation beyond observations Stein Statist. Sci. 2019
extreme value modelling
• assigning uncertainty to combined results
sources of uncertainty Senn 2020
• ratios of estimated probabilities
nearly unbounded confidence intervals CH 1974
• “the whole real line as a confidence interval
does not mean that a vacuous statment is being made”
• joint modelling of precipitation and evapotranspiration
tail copula modelling
JSM August 2023 24
Aside: joint modelling
JSM August 2023 25
Summer 2023 World Weather Attribution
JSM August 2023 26
Applications
Medicine
Covid
The source Naggie et al 2022
JSM August 2023 28
The data Naggie et al 2022
• randomized controlled trial
• platform trial of 6 potential treatments multicenter
• analysis of each treatment uses the same placebo group
• 817 patients in treatment arm; 774 in control arm
• primary outcome: time to recovery
• explanatory variables: treatment, age, sex, prior symptoms, calendar time,
vaccination status, geographic region, center, baseline severity and others
JSM August 2023 29
The analysis Naggie et al 2022
• Bayesian proportional hazards model
λ(t; x) = λ0(t) exp(β0 + β1x1 + β2x2 + · · · + βpxp)
• some covariates fit with splines e.g. age
• underlying hazard modelled parametrically e.g. Weibull, or splines
• prior distributions:
• for parameters of hazard function
• for coefficients for explanatory variables β0, β2, . . . , βp
• for β1 – treatment skeptical, noninformative, none
• likelihood × prior −→ posterior −→ marginal posterior for β1 or exp(β1)
JSM August 2023 30
The results Naggie et al 2022
JSM August 2023 31
The results Naggie et al 2022
“Among outpatients with mild to moderate
COVID-19, treatment with ivermectin,
compared with placebo, did not
significantly improve time to recovery.”
Hazard ratio estimated at 1.07, with
posterior probability that HR > 1 = 0.91
“This did not meet the prespecified
threshold of posterior probability greater
than 0.95”
JSM August 2023 32
The results Naggie et al 2022
JSM August 2023 33
The theory: modelling and conditional inference
• Cox 1972: On regression models and life tables
• sets out proportional hazards regression and non-proportional
λ(t; x) = λ0(t) exp(xT
β)
• proposes analysis via partial likelihood eliminates hazard function
• uses point process modelling + conditional inference
• full likelihood function L(β, λ) ∝
n
!
j=1
{λ0(tj) exp(xT
j β)}δj
exp{− exp(xT
j β)Λ0(tj)}
• partial likelihood function Lpart(β) ∝
!
failures
exp(xT
j β)
"
k∈Rj
exp(xT
kβ)
JSM August 2023 34
The theory: Bayes and frequentist
Skeptical prior 1.07 (0.96 − 1.17) 0.91
Noninformative prior 1.09 (0.97 − 1.22) 0.93
No prior 1.09 (0.98 − 1.22)
JSM August 2023 35
The theory: Bayes and frequentist
• both methods often lead to the same conclusions but not always
• Wasserman 2015; 2022 very helpful overviews
• Stein 1959
• Stone 1970
• Robins and Ritov 1997
• ...
• the nature of the conclusions is different
• probability representing degree of uncertainty epistemic
• probability representing long-run frequency aleatory
• calibration of Bayesian inference assesses the first on the basis of the second
Cox 1958, BFF
JSM August 2023 36
Applications
Women
Women Example 3
The source Ross et al 2022 Nature
JSM August 2023 38
The data Ross et al 2022 Nature
• “finding ‘what isn’t there’ from ‘what is there’ is a fundamental problem in statistics”
• analytic data
• 118 campuses send de-identified data to U Michigan
+ survey + qualitative analysis
• tracks spending on personnel for each research project
payroll, all funding sources
• 57 campuses with complete data 2013–2016
• identify teams: PI, faculty, PDF, PhD, UGrad, Research Staff
• identify publications: Web of Science
• identify gender, job titles, scientific fields, patents, ...
• 9800 teams with 129,000 team members
• 39,000 articles; 18m ‘potential authorships’; 367,000 actual authorships
scientific articles
JSM August 2023 39
The analysis Ross et al 2022 Nature
• response attribution rate =
# actual authorships
# potential authorships
= pr( attribution )
• covariates date of publication, number of days worked in the team, calendar time,
position in the team, team’s PI
• model
P( named ) = β0 + β1woman + βT
covariates + error
JSM August 2023 40
The results Ross et al 2022 Nature
overall attribution rate 3.1%;
attribution rate for men 4.23%; attribution rate for women 2.12% includes patents
difference smaller when covariates included but still statistically significant
JSM August 2023 41
Figure 1 Figure 2
The theory Ross et al 2022 Nature
• linear regression with response a proportion
• logistic function is pretty linear for p ∈ (0.2, 0.8) but these p’s ∈ (0.01, 0.04)
• there’s a paper for that!
On the linear in probability model for binary data Battey, Cox & Jackson 2019
• possibly more concerning: what is the unit of observation? calculation of standard
errors
• conditional inference relevant subsets; adequate variability
JSM August 2023 43
Applications
Fun
Faces Example 4
The source Wardle et al. PNAS 2021
JSM August 2023 45
... The source Wardle et al. PNAS 2021
JSM August 2023 46
Anthropomorphizing Globe & Mail, June 5
JSM August 2023 47
Anthropomorphizing Globe & Mail, June 5
The reason we are astonished by their output is because, as a species, we’re
gullible. We tend to read human characteristics into any pattern that even mildly
resembles a human.
JSM August 2023 48
Back to Foundations
Linking theory with practice
1. Drought Climate change attribution
climate models, subgroup analyses, predictions
2. Covid Treatment with Ivermectin
randomized trial, proportional hazards regression, Bayes/frequentist
3. Women Co-authorship and gender
linear regression, binary outcome, confounding
4. Faces Human perception
sign test, regression, computational modelling
JSM August 2023 49
What did I learn
• statistical “workflows” seem to be emerging in different disciplines
• Drought — “A Protocol for probabilistic extreme event attribution analysis ”
Philip et al 2020, Adv. Stat. Clim. Met. Ocean
• “Writing statistical methods for ecologists” Davis & Kay 2023, Ecosphere
• tutorial-type articles in scientific journals
• Annals of Thoracic Surgery — the statistician’s page
• J Am Medical Association — Guide to Statistics and Methods
• Nature Methods — Points of Significance
• “open data” observed in the breach
• Drought — “Almost all the data are available via the KNMI Climate Explorer”
• Women — “datasets ... are available at the Virtual Data Enclave Repository”
• Covid — “... the data will be made publicly available”
• Communication sleuthing is hard
JSM August 2023 50
“US Life expectancy has been
declining for decades’’
Communication Lost in Translation
“US Life expectancy has been
declining for decades’’
Communication Lost in Translation
“US Life expectancy has been
declining for decades’’
“Increases in US life expectancy slowed”
Communication Lost in Translation
Communication Lost in Translation
Communication Lost in Translation
In mice
and in cells
Communication Lost in Translation
Back to foundations
• probability, analysis, applied mathematics modelling
• Bayes, Neyman, Fisher approaches to inference
• nature of uncertainty epistemic, empirical
• nature of induction belief functions, inferential models
• interpretation of p-values, confidence regions, credibility intervals, likelihood ratios
• role of sufficiency, ancillarity, conditioning, asymptotic theory
• sparsity, causality, assumption-free/lean inference, stability, prediction
JSM August 2023 57
... Back to foundations
• probability, analysis, applied mathematics modelling
• Bayes, Neyman, Fisher approaches to inference
• nature of uncertainty epistemic, empirical
• nature of induction belief functions, inferential models
• interpretation of p-values, confidence regions, credibility intervals, likelihood ratios
• realistic estimates of precision, complex dependencies, subgroup analyses
• sparsity, causality, assumption-free/lean inference, stability, prediction
JSM August 2023 58
... Back to foundations
JSM August 2023 59
References i
Battey, H.S., Cox, D.R. and Jackson, M. (2019). On the linear in probability model for binary data.
Roy. Soc. Open Sci. 6, 190067.
Cox, D.R. (1958). Some problems connected with statistical inference. Ann. Math. Statist. 29,
357–372.
Cox, D.R. (1972). Regression models and life-tables (with discussion). J. R. Statist. Soc. B 34, 187–220.
Cox, D.R. (1978). Foundations of statistical inference: the case for eclecticism. Austral. J. Statist. 20,
43–59.
Cox, D.R. (2006). Principles of Statistical Inference. Cambridge, Cambridge University Press.
Cox, D.R. and Hinkley, D.V. (1974). Theoretical Statistics. London, Chapman & Hall.
Davis, A.J. and Kay, S. (2023). Writing statistical methods for ecologists. Ecosphere 14:e4539.
doi:10.1002/ecs2.4539
JSM August 2023
References ii
Erdenesanaa, D. (2023). “Some July heat: ‘virtually impossible’ without climate change, analysis
finds.” New York Times, July 25, 2023.
https://www.nytimes.com/2023/07/25/climate/us-europe-heat-waves-climate-change.html
accessed August 13, 2023.
Fisher, R.A. (1958) Statistical Methods for Research Workers. 13th ed. Edinburgh, Oliver & Boyd.
Kimball, S. (2022). “Ivermectin — a drug once touted by conservatives — doesn’t improve recovery
much”. CNBC News, October 24, 2022. https://www.cnbc.com/2022/10/24/
ivermectin-once-touted-as-a-covid-treatment-by-conservatives-doesnt-improve-recovery-much-
html accessed August 13, 2023.
Kumutai, J. et al. (2023). Human induced climate change increased drought severity in Horn of
Africa. World Weather Attribution Report. https://www.worldweatherattribution.org/
human-induced-climate-change-increased-drought-severity-in-southern-horn-of-africa/
accessed August 23, 2023.
JSM August 2023
References iii
McShane, J. “Female scientists don’t get the credit they deserve. A study proves it.” Washington
Post, June 22, 2022. https://www.washingtonpost.com/business/2022/06/22/
women-scientists-authorship-credit-study/ accessed August 13, 2023.
Naggie, S. et al. (2023). Effect of Ivermectin vs placebo on time to sustained recovery in outpatients
with mild to moderate COVID-19. J. Am. Med. Assoc. 328, 1595–1603. doi:10.1001/jama.2022.18590
Philip, S. et al. (2020). A protocol for probabilistic extreme event analysis.
Adv. Stat. Clim. Meteorol. Oceanogr. 6, 177–203.
Reid, N. and Cox, D.R. (2015). On some principles of statistical inference. Int. Statist. Rev. 83,
298–308. doi:10.1111/insr.12067
Robins, J.M. and Ritov, Y. (1997). Toward a curse of dimensionality appropriate (CODA) asymptotic
theory for semi-parametric models. Statist. Med. 16, 285–319.
JSM August 2023
References iv
Ross, M.B. et al. (2022). Women are credited less in science than men. Nature 608, 135–145.
doi:10.1038/s41586-022-04966-w
Savage, L. J. (1954). The Foundations of Statistics. New York, Dover.
Senn, S. (2020). “Error point: the importance of knowing how much you don’t know” (Guest Post).
January, 20, 2020. https://errorstatistics.com/2020/01/20/
s-senn-error-point-the-importance-of-knowing-how-much-you-dont-know-guest-post/
Stein, C. (1959). An example of wide discrepancy between fiducial and confidence intervals.
Ann. Math. Statist. 30, 877–880.
Stein, M.L. (2019). Some statistical issues in climate science. Statist. Sci. 35, 31–41.
Stone, M. (1970). Necessary and sufficient condition for convergence in probability to invariant
posterior distributions. Ann. Math. Statist. 41, 1349–1353.
JSM August 2023
References v
Temming, M. (2022). “Americans tend to assume imaginary faces are male”. Science News January
27, 2022.
https://www.sciencenews.org/article/faces-objects-imaginary-male-female-perception
Wardle, S.G. (2022). Illusory faces are more likely to be perceived as male than female.
Proc. Nat. Acad. Sci. 119 (5) e2117413119. doi:10.1073/pnas.2117413119
Wasserman, L.A. (2015). Bayes versus frequentist.
https://www.stat.cmu.edu/~larry/=sml/BayesVersusFrequentist.pdf accessed August 13,
2023.
Wasserman, L.A. (2022). “The foundations of statistical inference”. Information Universe, July 29,
2022. https://www.youtube.com/watch?v=Z-YvWyM6dRQ accessed August 13, 2023.
Wilson, J. (2023). “Will AI really change everything? Not likely”. Globe & Mail, June 5, 2023.
https://www.theglobeandmail.com/opinion/
article-will-ai-really-change-everything-not-likely/ accessed August 13, 2023.
JSM August 2023
References vi
Zhong, R. (2023). “Climate change made East African drought 100 times as likely, study says”. New
York Times April 22, 2023.
https://www.nytimes.com/2023/04/27/climate/horn-of-africa-somalia-drought.html
accessed August 13, 2023.
JSM August 2023

More Related Content

Similar to reid-postJSM-DRC.pdf

MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataHerbert Van de Sompel
 
Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...Mark Rubin
 
The effects of specialization on research careers
The effects of specialization on research careersThe effects of specialization on research careers
The effects of specialization on research careersNicolas Robinson-Garcia
 
Large-Scale Inverse Problems
Large-Scale Inverse ProblemsLarge-Scale Inverse Problems
Large-Scale Inverse ProblemsArvind Krishna
 
Joint Contributions of Martin B. Wilk and Ram Gnanadesikan
Joint Contributions of Martin B. Wilk  and Ram GnanadesikanJoint Contributions of Martin B. Wilk  and Ram Gnanadesikan
Joint Contributions of Martin B. Wilk and Ram GnanadesikanSunitha Flowerhill
 
Forest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic HealthForest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic HealthJun Steed Huang
 
httpwww.jstor.orgOn the Mathematical Foundations of The.docx
httpwww.jstor.orgOn the Mathematical Foundations of The.docxhttpwww.jstor.orgOn the Mathematical Foundations of The.docx
httpwww.jstor.orgOn the Mathematical Foundations of The.docxwellesleyterresa
 
Sequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time SeriesSequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time SeriesArun Kejariwal
 
XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...
XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...
XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...SGS
 
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...Global Earthquake Model Foundation
 
AI for automated materials discovery via learning to represent, predict, gene...
AI for automated materials discovery via learning to represent, predict, gene...AI for automated materials discovery via learning to represent, predict, gene...
AI for automated materials discovery via learning to represent, predict, gene...Deakin University
 
Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)tm1966
 

Similar to reid-postJSM-DRC.pdf (20)

tjosullivanThesis
tjosullivanThesistjosullivanThesis
tjosullivanThesis
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage data
 
FORECASTING MODELS
FORECASTING MODELSFORECASTING MODELS
FORECASTING MODELS
 
Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...
 
CLIM: Transition Workshop - Extremes and Networks: a review of activities -...
CLIM: Transition Workshop - Extremes and  Networks: a  review of activities -...CLIM: Transition Workshop - Extremes and  Networks: a  review of activities -...
CLIM: Transition Workshop - Extremes and Networks: a review of activities -...
 
The effects of specialization on research careers
The effects of specialization on research careersThe effects of specialization on research careers
The effects of specialization on research careers
 
Large-Scale Inverse Problems
Large-Scale Inverse ProblemsLarge-Scale Inverse Problems
Large-Scale Inverse Problems
 
Joint Contributions of Martin B. Wilk and Ram Gnanadesikan
Joint Contributions of Martin B. Wilk  and Ram GnanadesikanJoint Contributions of Martin B. Wilk  and Ram Gnanadesikan
Joint Contributions of Martin B. Wilk and Ram Gnanadesikan
 
Forest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic HealthForest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic Health
 
httpwww.jstor.orgOn the Mathematical Foundations of The.docx
httpwww.jstor.orgOn the Mathematical Foundations of The.docxhttpwww.jstor.orgOn the Mathematical Foundations of The.docx
httpwww.jstor.orgOn the Mathematical Foundations of The.docx
 
MathModels.ppt
MathModels.pptMathModels.ppt
MathModels.ppt
 
MUMS Opening Workshop - Principles of Predictive Computational Science: Predi...
MUMS Opening Workshop - Principles of Predictive Computational Science: Predi...MUMS Opening Workshop - Principles of Predictive Computational Science: Predi...
MUMS Opening Workshop - Principles of Predictive Computational Science: Predi...
 
Sequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time SeriesSequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time Series
 
XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...
XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...
XploreIQ: Machine Learning and Big Data The Successful Use of Algorithms in E...
 
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...
 
AI for automated materials discovery via learning to represent, predict, gene...
AI for automated materials discovery via learning to represent, predict, gene...AI for automated materials discovery via learning to represent, predict, gene...
AI for automated materials discovery via learning to represent, predict, gene...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
MUMS: Transition & SPUQ Workshop - Uncertainty Quantification in Materials Wo...
MUMS: Transition & SPUQ Workshop - Uncertainty Quantification in Materials Wo...MUMS: Transition & SPUQ Workshop - Uncertainty Quantification in Materials Wo...
MUMS: Transition & SPUQ Workshop - Uncertainty Quantification in Materials Wo...
 
Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)
 
Signal and noise
Signal and noiseSignal and noise
Signal and noise
 

More from jemille6

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”jemille6
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilismjemille6
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022jemille6
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inferencejemille6
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?jemille6
 
What's the question?
What's the question? What's the question?
What's the question? jemille6
 
The neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metasciencejemille6
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...jemille6
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Twojemille6
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...jemille6
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testingjemille6
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredgingjemille6
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probabilityjemille6
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severityjemille6
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)jemille6
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)jemille6
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...jemille6
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (jemille6
 
The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...jemille6
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...jemille6
 

More from jemille6 (20)

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inference
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
 
What's the question?
What's the question? What's the question?
What's the question?
 
The neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metascience
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredging
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
 
The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
 

Recently uploaded

KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 

Recently uploaded (20)

KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 

reid-postJSM-DRC.pdf

  • 1. The importance of foundations Nancy Reid University of Toronto June 8 2023
  • 3. What are the foundations of statistics? Savage 1954 It is unanimously agreed that statistics depends somehow on probability. But, as to what prob- ability is and how it is connected with statistics, there has seldom been such complete disagreement and breakdown of communication since the Tower of Babel. Doubt- less, much of the disagreement is merely terminological and would disappear under sufficiently sharp analysis. JSM August 2023 2
  • 4. ... What are the foundations of statistics? R & Cox, 2015 • Statistics needs a healthy interplay between theory and applications • theory meaning foundations, rather than theoretical analysis of specific techniques • must be continually tested against new applications • “the practical application of general theorems is a different art from their establishment by mathematical proof” Fisher 1958 SMRW JSM August 2023 3
  • 5. ... What are the foundations of statistics? • probability, analysis, applied mathematics modelling • Bayes, Neyman, Fisher approaches to inference • nature of uncertainty epistemic, empirical • nature of induction belief functions, inferential models • interpretation of p-values, confidence regions, credibility intervals, likelihood ratios • role of sufficiency, ancillarity, conditioning, asymptotic theory • sparsity, causality, assumption-free/lean inference, stability, prediction, decisions JSM August 2023 4
  • 6. ... What are the foundations of statistics? Cox 2006 I’m fairly cautious about the im- pact of the book in that it really is very cryptic indeed on key issues but we will see. In particular quite apart from the Bayesian stuff I have essentially discarded (not rejected) the Neyman-Pearson machinery in favour of Fisher’s original approach and I am sure this is the right route. Thanks to AWF Edwards JSM August 2023 5
  • 7. ... What are the foundations of statistics? Cox 2006 I’m fairly cautious about the im- pact of the book in that it really is very cryptic indeed on key issues but we will see. In particular quite apart from the Bayesian stuff I have essentially discarded (not rejected) the Neyman-Pearson machinery in favour of Fisher’s original approach and I am sure this is the right route. Thanks to AWF Edwards “dispassionate assessment, frail attempt at, 1–196” JSM August 2023 6
  • 8. What use are foundations? • provide a rigorous basis for the development of techniques • provide a common language for particular classes of problems • help to clarify the nature of uncertainty in scientific conclusions • highlight aspects of data analysis which are likely to raise difficult issues • suggest strategies for tackling highly complex problems • avoid ‘re-inventing the wheel’ for each new application JSM August 2023 7
  • 10. Theory and Applications Example 1 NY Times, April 27 World Weather Attribution, April 27 JSM August 2023 8
  • 17. Linking theory with practice 1. Drought Climate change attribution climate models, subgroup analyses, predictions 2. Covid Treatment with Ivermectin randomized trial, proportional hazards regression, Bayes/frequentist 3. Women Co-authorship and gender linear regression, binary outcome, confounding 4. Faces Human perception sign test, regression, computational modelling JSM August 2023 15
  • 19. Drought World Weather Attribution 2023 JSM August 2023 16
  • 20. The source Kimutai et al. 2023 Link JSM August 2023 17
  • 21. The data Kimutai et al. 2023 • observational data, 3 sources 1. global daily rainfall & temperature 0.5◦ × 0.5◦ , 1979 – 2. daily rainfall infra-red, “SoA”, 1981 – 3. monthly rainfall 1981–2014 • 4-year smoothed mean surface temperature proxy for anthropogenic climate change • climate modelling data, 4 sources 1. combine 12 global and 8 climate models: resolution 0.44◦ 29 sims 2. combine 5 global and 4 climate models: resolution 0.22◦ 10 sims 3. atmosphere-ocean coupled GCMs (two) 10/3 simulations 4. sea-surface temperature forced ensemble, high resolution 11 simulations • thanks to Whitney Huang for many clarifications JSM August 2023 18
  • 22. The analysis Kimutai et al. 2023 • response is log10(monthly rainfall) in 2021 and 2022 and log10(PET) — potential evapotranspiration • covariates are global temperature anomaly, and ENSO index El Nino-Southern Oscillation • “As a measure of anthropogenic climate change we use smoothed GMST” Global Mean Surface Temperature • “Methods for observational and model analysis ... and synthesis are used according to the World Weather Attribution Protocol” Philip et al. 2020 1. trend using observational data 2. find climate models consistent with 1. 3. compare predictions from 1. and 2. 4. synthesize results in 3. to provide conclusions JSM August 2023 19
  • 23. The results Kimutai et al. 2023 • rainfall decreasing with increasing temperature but not much • 2022 rainfall is about a 1 in 20 year event • 2022 drought about 2 times more likely under climate change • uncertainty 0.1 — 360 JSM August 2023 20
  • 24. 2 times more likely? Kimutai et al. 2023 • change the response to SPEI rainfall adjusted for evaporation • 2022 drought now 5500 times more likely uncertainty 32 to 4 × 108 • consider ‘long rains’ and ‘short rains’ separately MAM, OND • combine model simulation results with observational data JSM August 2023 21
  • 25. Combining climate simulations and data Kimutai et al. 2023 precipitation adjusted for evaporation JSM August 2023 22
  • 27. The theory Whitney Huang • extrapolation beyond observations Stein Statist. Sci. 2019 extreme value modelling • assigning uncertainty to combined results sources of uncertainty Senn 2020 • ratios of estimated probabilities nearly unbounded confidence intervals CH 1974 • “the whole real line as a confidence interval does not mean that a vacuous statment is being made” • joint modelling of precipitation and evapotranspiration tail copula modelling JSM August 2023 24
  • 28. Aside: joint modelling JSM August 2023 25
  • 29. Summer 2023 World Weather Attribution JSM August 2023 26
  • 31. Covid
  • 32. The source Naggie et al 2022 JSM August 2023 28
  • 33. The data Naggie et al 2022 • randomized controlled trial • platform trial of 6 potential treatments multicenter • analysis of each treatment uses the same placebo group • 817 patients in treatment arm; 774 in control arm • primary outcome: time to recovery • explanatory variables: treatment, age, sex, prior symptoms, calendar time, vaccination status, geographic region, center, baseline severity and others JSM August 2023 29
  • 34. The analysis Naggie et al 2022 • Bayesian proportional hazards model λ(t; x) = λ0(t) exp(β0 + β1x1 + β2x2 + · · · + βpxp) • some covariates fit with splines e.g. age • underlying hazard modelled parametrically e.g. Weibull, or splines • prior distributions: • for parameters of hazard function • for coefficients for explanatory variables β0, β2, . . . , βp • for β1 – treatment skeptical, noninformative, none • likelihood × prior −→ posterior −→ marginal posterior for β1 or exp(β1) JSM August 2023 30
  • 35. The results Naggie et al 2022 JSM August 2023 31
  • 36. The results Naggie et al 2022 “Among outpatients with mild to moderate COVID-19, treatment with ivermectin, compared with placebo, did not significantly improve time to recovery.” Hazard ratio estimated at 1.07, with posterior probability that HR > 1 = 0.91 “This did not meet the prespecified threshold of posterior probability greater than 0.95” JSM August 2023 32
  • 37. The results Naggie et al 2022 JSM August 2023 33
  • 38. The theory: modelling and conditional inference • Cox 1972: On regression models and life tables • sets out proportional hazards regression and non-proportional λ(t; x) = λ0(t) exp(xT β) • proposes analysis via partial likelihood eliminates hazard function • uses point process modelling + conditional inference • full likelihood function L(β, λ) ∝ n ! j=1 {λ0(tj) exp(xT j β)}δj exp{− exp(xT j β)Λ0(tj)} • partial likelihood function Lpart(β) ∝ ! failures exp(xT j β) " k∈Rj exp(xT kβ) JSM August 2023 34
  • 39. The theory: Bayes and frequentist Skeptical prior 1.07 (0.96 − 1.17) 0.91 Noninformative prior 1.09 (0.97 − 1.22) 0.93 No prior 1.09 (0.98 − 1.22) JSM August 2023 35
  • 40. The theory: Bayes and frequentist • both methods often lead to the same conclusions but not always • Wasserman 2015; 2022 very helpful overviews • Stein 1959 • Stone 1970 • Robins and Ritov 1997 • ... • the nature of the conclusions is different • probability representing degree of uncertainty epistemic • probability representing long-run frequency aleatory • calibration of Bayesian inference assesses the first on the basis of the second Cox 1958, BFF JSM August 2023 36
  • 43. The source Ross et al 2022 Nature JSM August 2023 38
  • 44. The data Ross et al 2022 Nature • “finding ‘what isn’t there’ from ‘what is there’ is a fundamental problem in statistics” • analytic data • 118 campuses send de-identified data to U Michigan + survey + qualitative analysis • tracks spending on personnel for each research project payroll, all funding sources • 57 campuses with complete data 2013–2016 • identify teams: PI, faculty, PDF, PhD, UGrad, Research Staff • identify publications: Web of Science • identify gender, job titles, scientific fields, patents, ... • 9800 teams with 129,000 team members • 39,000 articles; 18m ‘potential authorships’; 367,000 actual authorships scientific articles JSM August 2023 39
  • 45. The analysis Ross et al 2022 Nature • response attribution rate = # actual authorships # potential authorships = pr( attribution ) • covariates date of publication, number of days worked in the team, calendar time, position in the team, team’s PI • model P( named ) = β0 + β1woman + βT covariates + error JSM August 2023 40
  • 46. The results Ross et al 2022 Nature overall attribution rate 3.1%; attribution rate for men 4.23%; attribution rate for women 2.12% includes patents difference smaller when covariates included but still statistically significant JSM August 2023 41
  • 48. The theory Ross et al 2022 Nature • linear regression with response a proportion • logistic function is pretty linear for p ∈ (0.2, 0.8) but these p’s ∈ (0.01, 0.04) • there’s a paper for that! On the linear in probability model for binary data Battey, Cox & Jackson 2019 • possibly more concerning: what is the unit of observation? calculation of standard errors • conditional inference relevant subsets; adequate variability JSM August 2023 43
  • 51. The source Wardle et al. PNAS 2021 JSM August 2023 45
  • 52. ... The source Wardle et al. PNAS 2021 JSM August 2023 46
  • 53. Anthropomorphizing Globe & Mail, June 5 JSM August 2023 47
  • 54. Anthropomorphizing Globe & Mail, June 5 The reason we are astonished by their output is because, as a species, we’re gullible. We tend to read human characteristics into any pattern that even mildly resembles a human. JSM August 2023 48
  • 56. Linking theory with practice 1. Drought Climate change attribution climate models, subgroup analyses, predictions 2. Covid Treatment with Ivermectin randomized trial, proportional hazards regression, Bayes/frequentist 3. Women Co-authorship and gender linear regression, binary outcome, confounding 4. Faces Human perception sign test, regression, computational modelling JSM August 2023 49
  • 57. What did I learn • statistical “workflows” seem to be emerging in different disciplines • Drought — “A Protocol for probabilistic extreme event attribution analysis ” Philip et al 2020, Adv. Stat. Clim. Met. Ocean • “Writing statistical methods for ecologists” Davis & Kay 2023, Ecosphere • tutorial-type articles in scientific journals • Annals of Thoracic Surgery — the statistician’s page • J Am Medical Association — Guide to Statistics and Methods • Nature Methods — Points of Significance • “open data” observed in the breach • Drought — “Almost all the data are available via the KNMI Climate Explorer” • Women — “datasets ... are available at the Virtual Data Enclave Repository” • Covid — “... the data will be made publicly available” • Communication sleuthing is hard JSM August 2023 50
  • 58. “US Life expectancy has been declining for decades’’ Communication Lost in Translation
  • 59. “US Life expectancy has been declining for decades’’ Communication Lost in Translation
  • 60. “US Life expectancy has been declining for decades’’ “Increases in US life expectancy slowed” Communication Lost in Translation
  • 61. Communication Lost in Translation
  • 62. Communication Lost in Translation
  • 63. In mice and in cells Communication Lost in Translation
  • 64. Back to foundations • probability, analysis, applied mathematics modelling • Bayes, Neyman, Fisher approaches to inference • nature of uncertainty epistemic, empirical • nature of induction belief functions, inferential models • interpretation of p-values, confidence regions, credibility intervals, likelihood ratios • role of sufficiency, ancillarity, conditioning, asymptotic theory • sparsity, causality, assumption-free/lean inference, stability, prediction JSM August 2023 57
  • 65. ... Back to foundations • probability, analysis, applied mathematics modelling • Bayes, Neyman, Fisher approaches to inference • nature of uncertainty epistemic, empirical • nature of induction belief functions, inferential models • interpretation of p-values, confidence regions, credibility intervals, likelihood ratios • realistic estimates of precision, complex dependencies, subgroup analyses • sparsity, causality, assumption-free/lean inference, stability, prediction JSM August 2023 58
  • 66. ... Back to foundations JSM August 2023 59
  • 67. References i Battey, H.S., Cox, D.R. and Jackson, M. (2019). On the linear in probability model for binary data. Roy. Soc. Open Sci. 6, 190067. Cox, D.R. (1958). Some problems connected with statistical inference. Ann. Math. Statist. 29, 357–372. Cox, D.R. (1972). Regression models and life-tables (with discussion). J. R. Statist. Soc. B 34, 187–220. Cox, D.R. (1978). Foundations of statistical inference: the case for eclecticism. Austral. J. Statist. 20, 43–59. Cox, D.R. (2006). Principles of Statistical Inference. Cambridge, Cambridge University Press. Cox, D.R. and Hinkley, D.V. (1974). Theoretical Statistics. London, Chapman & Hall. Davis, A.J. and Kay, S. (2023). Writing statistical methods for ecologists. Ecosphere 14:e4539. doi:10.1002/ecs2.4539 JSM August 2023
  • 68. References ii Erdenesanaa, D. (2023). “Some July heat: ‘virtually impossible’ without climate change, analysis finds.” New York Times, July 25, 2023. https://www.nytimes.com/2023/07/25/climate/us-europe-heat-waves-climate-change.html accessed August 13, 2023. Fisher, R.A. (1958) Statistical Methods for Research Workers. 13th ed. Edinburgh, Oliver & Boyd. Kimball, S. (2022). “Ivermectin — a drug once touted by conservatives — doesn’t improve recovery much”. CNBC News, October 24, 2022. https://www.cnbc.com/2022/10/24/ ivermectin-once-touted-as-a-covid-treatment-by-conservatives-doesnt-improve-recovery-much- html accessed August 13, 2023. Kumutai, J. et al. (2023). Human induced climate change increased drought severity in Horn of Africa. World Weather Attribution Report. https://www.worldweatherattribution.org/ human-induced-climate-change-increased-drought-severity-in-southern-horn-of-africa/ accessed August 23, 2023. JSM August 2023
  • 69. References iii McShane, J. “Female scientists don’t get the credit they deserve. A study proves it.” Washington Post, June 22, 2022. https://www.washingtonpost.com/business/2022/06/22/ women-scientists-authorship-credit-study/ accessed August 13, 2023. Naggie, S. et al. (2023). Effect of Ivermectin vs placebo on time to sustained recovery in outpatients with mild to moderate COVID-19. J. Am. Med. Assoc. 328, 1595–1603. doi:10.1001/jama.2022.18590 Philip, S. et al. (2020). A protocol for probabilistic extreme event analysis. Adv. Stat. Clim. Meteorol. Oceanogr. 6, 177–203. Reid, N. and Cox, D.R. (2015). On some principles of statistical inference. Int. Statist. Rev. 83, 298–308. doi:10.1111/insr.12067 Robins, J.M. and Ritov, Y. (1997). Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. Statist. Med. 16, 285–319. JSM August 2023
  • 70. References iv Ross, M.B. et al. (2022). Women are credited less in science than men. Nature 608, 135–145. doi:10.1038/s41586-022-04966-w Savage, L. J. (1954). The Foundations of Statistics. New York, Dover. Senn, S. (2020). “Error point: the importance of knowing how much you don’t know” (Guest Post). January, 20, 2020. https://errorstatistics.com/2020/01/20/ s-senn-error-point-the-importance-of-knowing-how-much-you-dont-know-guest-post/ Stein, C. (1959). An example of wide discrepancy between fiducial and confidence intervals. Ann. Math. Statist. 30, 877–880. Stein, M.L. (2019). Some statistical issues in climate science. Statist. Sci. 35, 31–41. Stone, M. (1970). Necessary and sufficient condition for convergence in probability to invariant posterior distributions. Ann. Math. Statist. 41, 1349–1353. JSM August 2023
  • 71. References v Temming, M. (2022). “Americans tend to assume imaginary faces are male”. Science News January 27, 2022. https://www.sciencenews.org/article/faces-objects-imaginary-male-female-perception Wardle, S.G. (2022). Illusory faces are more likely to be perceived as male than female. Proc. Nat. Acad. Sci. 119 (5) e2117413119. doi:10.1073/pnas.2117413119 Wasserman, L.A. (2015). Bayes versus frequentist. https://www.stat.cmu.edu/~larry/=sml/BayesVersusFrequentist.pdf accessed August 13, 2023. Wasserman, L.A. (2022). “The foundations of statistical inference”. Information Universe, July 29, 2022. https://www.youtube.com/watch?v=Z-YvWyM6dRQ accessed August 13, 2023. Wilson, J. (2023). “Will AI really change everything? Not likely”. Globe & Mail, June 5, 2023. https://www.theglobeandmail.com/opinion/ article-will-ai-really-change-everything-not-likely/ accessed August 13, 2023. JSM August 2023
  • 72. References vi Zhong, R. (2023). “Climate change made East African drought 100 times as likely, study says”. New York Times April 22, 2023. https://www.nytimes.com/2023/04/27/climate/horn-of-africa-somalia-drought.html accessed August 13, 2023. JSM August 2023