SlideShare a Scribd company logo
1 of 45
Download to read offline
The neglected importance
of complexity
in statistics and Metascience

Daniele Fanelli
In this talk:
1) what’s missing in the current paradigm
2) what a new paradigm might look like
3) evidence in support of this proposal
The elephant in the room of:
1) metascience
●
e.g. reproducibility
2) statistics
●
e.g. model “complexity”
●
[see SW seminar 2021, Fanelli 2019, 2022]
What is “complex”?
Level of complexity
Many, diverse, interacting parts.
Long to describe, difficult to predict.
example 1) complexity deflates the
“reproducibility crisis”.
year project discipline N result
2014 Many labs 1 psychology,
misc.
13, 36 labs 77%
2016 COS social+cognitive
psychology
100 36-68%
2016 Camerer et al. experimental
economics
18 61-78%
2018 Many labs 2 social+cognitive
psychology
28, 62 samples,
36 countries
54%
2018 Camerer et al. social studies in
Nature, Science
21 57-67%
2021 RPCB cancer biology 188, 50 exper.,
23 papers
3-82%
Lower reproducib:
1) complex phenomena
2) complex methods
●
not just random noise
●
structured, systematic
diff.
example 2) complexity confuses
statistical results
AIC:−2log( L)+2k
models vary by “fitting propensity”
(“complexity” beyond n. of parameters)
(Bonifay and Cai 2017)
universe of possible data
When is a theory actually supported?
How might the elephant appear?
1) integrate complexity of phenomena & methods in measuring, forecasting,
correcting reproducibility
2) penalize statistical models for complexity beyond number of parameters
integrating “complexity” ~ severity of testing
https://www.wallpaperflare.com/elephant-during-daytime-grayscale-photo-of-elephant-portrait-wallpaper-zhdat
Part 2: A candidate alternative
K theory in a nutshell
(Fanelli 2019, Fanelli 2022)
K =
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
+
K = consilience
K =
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
explain/predict/control
more/diverse phenomena
with fewer/simpler theories/methods
+
what the variables represent
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
explain/predict/control
more/diverse phenomena
with fewer/simpler theories/methods
+
more information Y makes K GROW
(Fanelli 2019, Fanelli 2022)
K=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
+
(Fanelli 2019, Fanelli 2022)
=
- H(Y|X, τ))
H(X) D(τ))
nX
H(Y1
) + H(Y2
) + H(Y3
)
nY
H(Y1
) + H(Y2
) + H(Y3
) +
K
more information Y makes K GROW
+
K theory in a nutshell
(Fanelli 2019, Fanelli 2022)
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
more info. Y|X, X, τ makes K small
(Fanelli 2019, Fanelli 2022)
K
=
H(Y) - H(Y|X, τ))
H(Y)
H(X) D(τ))
+
nX
nY
+
(Fanelli 2019, Fanelli 2022)
K
=
H(Y) -
H(Y)
H(X)
+
nX
nY
H(Y1
|X, τ))+H(Y2
|X, τ))+H(Y3
|X, τ))
D(τ)1
)+D(τ)2
)+D(τ)3
)
+
more info. Y|X, X, τ makes K small
K theory in a nutshell
(Fanelli 2019, Fanelli 2022)
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
=
≈1, full consilience
=0, no knowledge
<0, wrong
K theory in a nutshell
(Fanelli 2019, Fanelli 2022)
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
K of a regression model
(Fanelli 2019, Fanelli 2022)
Y = α + β X + error
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
(Fanelli 2019, Fanelli 2022)
Y = α + β X + error
this has been said before
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
(Fanelli 2019, Fanelli 2022)
key theoretical innovations
Y = α + β X + error
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
(Fanelli 2019, Fanelli 2022)
key methodological innovations
Y = α + β X + error
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
(Fanelli 2019, Fanelli 2022)
graphs are everywhere in science
methodologies
theories
(Fanelli 2019, Fanelli 2022)
www.protocols.io/view/an-optimized-protocol-
for-in-vivo-analysis-of-tumo-3byl471m2lo5/v1
=
H(Y) - H(Y|X, τ))
H(Y) H(X) D(τ))
+ nX
nY
K
+
(Mueller 2015, ICSS)
https://www.wallpaperflare.com/elephant-in-black-and-white-elephant-photo-animal-grey-wallpaper-tkqgk
Part 3: supporting evidence
1) K predicts perceived
and actual reproducibility
“tau” of biological experiments
(Fanelli, Tan, Amaral & Neves, 2022, MetaArxiv)
K vs. perceived reproducibility
(Fanelli, Tan, Amaral & Neves, 2022, MetaArxiv)
K vs. actual reproducibility
Kr=K o 2−λ⋅d
kr hr=ko ho 2−λ⋅d
log
kr
ko
=log
ho
hr
−λ⋅d
R≡log
H (Y )−H (Y∣X , τr)
H (Y )−H (Y∣X , τo)
=α+βlog
1
D (τr)/ N
(Fanelli, Tan, Amaral & Neves, in prep)
1) K predicts actual reproducibility
(part of collaboration with Brazilian Reproducibility Initiative)
(Fanelli, Tan, Amaral & Neves, in prep)
independent new predictor
multiple regression, Y=reproducibility
(Fanelli, Tan, Amaral & Neves, in prep)
multiple regression, Y=reproducibility
(here D(τ) based on sentences in replication protocol!)
very easy to measure, automatize
better than just P-values and N
(Fanelli, Tan, Amaral & Neves, in prep)
multiple regression, Y=reproducibility R
leads to progress in metascience
(Fanelli, Tan, Amaral & Neves, in prep)
multiple regression, Y=reproducibility
2) D(τ) might
predict fitting propensity
preregistered test
(Fanelli & Bonifay, in prep)
1) generated N=20 models, all with 36 parameters
2) derived D(τ)), predicted their fitting propensity
3) tested them on 20,000 random covariance matrices
(Fanelli & Bonifay, in prep)
preregistered test
D(τ) uniquely reflects model complexity
(pre-registered test)
(Fanelli & Bonifay, in progress)
(Fanelli & Bonifay, in progress)
NO alternative theory explains this!
(pre-registered test)
multiple regression: Y=fitting propensity
Spearman’s ρ = 0.64,
P<0.002
Spearman’s ρ = 0.73,
P<0.001
(Bonifay and Cai 2017)
Bifactor
confirmatory
(theoretical)
20 parameters
EIFA:
exploratory
(a-theoretical)
20 parameters
(Fanelli & Bonifay, in progress)
how K improves theory testing
example of application
theories invoking
general
factor
(e.g. IQ, stress,
psychosis)
Bifactor widely used to “test” theories
(Bonifay and Cai 2017)
Bifactor
confirmatory
(theoretical)
20 parameters
EIFA:
exploratory
(a-theoretical)
20 parameters
K(Bifactor) ≥ K(EIFA)
“The [bifactor-encoded] theory
is specifically supported
by the data”
(Fanelli & Bonifay, in progress)
P<0.05
when is a theory actually supported?
example of application
Summary of this talk:
1) what’s missing in the current paradigm?
●
we pretend complexity is irrelevant
2) what might a new paradigm look like?
●
measuring D(τ), integrating/penalizing with K
3) evidence in support of this proposal?
●
increasingly promising
●
any alternative, better suggestions?

More Related Content

Similar to The neglected importance of complexity in statistics and Metascience

La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...tuxette
 
Slides econometrics-2018-graduate-4
Slides econometrics-2018-graduate-4Slides econometrics-2018-graduate-4
Slides econometrics-2018-graduate-4Arthur Charpentier
 
better together? statistical learning in models made of modules
better together? statistical learning in models made of modulesbetter together? statistical learning in models made of modules
better together? statistical learning in models made of modulesChristian Robert
 
Module - 2 Discrete Mathematics and Graph Theory
Module - 2 Discrete Mathematics and Graph TheoryModule - 2 Discrete Mathematics and Graph Theory
Module - 2 Discrete Mathematics and Graph TheoryAdhiyaman Manickam
 
Comparison of the optimal design
Comparison of the optimal designComparison of the optimal design
Comparison of the optimal designAlexander Decker
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsJulyan Arbel
 
Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...
Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...
Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...Facultad de Ciencias Económicas UdeA
 
Introduction to FDA and linear models
 Introduction to FDA and linear models Introduction to FDA and linear models
Introduction to FDA and linear modelstuxette
 
SOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMS
SOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMSSOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMS
SOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMSTahia ZERIZER
 
Can we estimate a constant?
Can we estimate a constant?Can we estimate a constant?
Can we estimate a constant?Christian Robert
 
6.04218.062J Mathematics for Computer Science September 9, 20.docx
6.04218.062J Mathematics for Computer Science September 9, 20.docx6.04218.062J Mathematics for Computer Science September 9, 20.docx
6.04218.062J Mathematics for Computer Science September 9, 20.docxtroutmanboris
 
On the interplay between intergenerational transfers and natural resources
On the interplay between intergenerational transfers and natural resourcesOn the interplay between intergenerational transfers and natural resources
On the interplay between intergenerational transfers and natural resourcesRoberto Iacono
 
prior selection for mixture estimation
prior selection for mixture estimationprior selection for mixture estimation
prior selection for mixture estimationChristian Robert
 
Slides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometrySlides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometryFrank Nielsen
 
Dimensionality Reduction Techniques In Response Surface Designs
Dimensionality Reduction Techniques In Response Surface DesignsDimensionality Reduction Techniques In Response Surface Designs
Dimensionality Reduction Techniques In Response Surface Designsinventionjournals
 

Similar to The neglected importance of complexity in statistics and Metascience (20)

La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...
 
Recursion (in Python)
Recursion (in Python)Recursion (in Python)
Recursion (in Python)
 
Slides econometrics-2018-graduate-4
Slides econometrics-2018-graduate-4Slides econometrics-2018-graduate-4
Slides econometrics-2018-graduate-4
 
better together? statistical learning in models made of modules
better together? statistical learning in models made of modulesbetter together? statistical learning in models made of modules
better together? statistical learning in models made of modules
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Module - 2 Discrete Mathematics and Graph Theory
Module - 2 Discrete Mathematics and Graph TheoryModule - 2 Discrete Mathematics and Graph Theory
Module - 2 Discrete Mathematics and Graph Theory
 
Comparison of the optimal design
Comparison of the optimal designComparison of the optimal design
Comparison of the optimal design
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian Nonparametrics
 
Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...
Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...
Modeling Heterogeneity by Structural Varying Coefficients Models in Presence of...
 
Introduction to FDA and linear models
 Introduction to FDA and linear models Introduction to FDA and linear models
Introduction to FDA and linear models
 
QMC: Operator Splitting Workshop, Composite Infimal Convolutions - Zev Woodst...
QMC: Operator Splitting Workshop, Composite Infimal Convolutions - Zev Woodst...QMC: Operator Splitting Workshop, Composite Infimal Convolutions - Zev Woodst...
QMC: Operator Splitting Workshop, Composite Infimal Convolutions - Zev Woodst...
 
SOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMS
SOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMSSOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMS
SOLVING BVPs OF SINGULARLY PERTURBED DISCRETE SYSTEMS
 
Can we estimate a constant?
Can we estimate a constant?Can we estimate a constant?
Can we estimate a constant?
 
6.04218.062J Mathematics for Computer Science September 9, 20.docx
6.04218.062J Mathematics for Computer Science September 9, 20.docx6.04218.062J Mathematics for Computer Science September 9, 20.docx
6.04218.062J Mathematics for Computer Science September 9, 20.docx
 
On the interplay between intergenerational transfers and natural resources
On the interplay between intergenerational transfers and natural resourcesOn the interplay between intergenerational transfers and natural resources
On the interplay between intergenerational transfers and natural resources
 
prior selection for mixture estimation
prior selection for mixture estimationprior selection for mixture estimation
prior selection for mixture estimation
 
Slides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometrySlides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometry
 
F0742328
F0742328F0742328
F0742328
 
Unit 6: All
Unit 6: AllUnit 6: All
Unit 6: All
 
Dimensionality Reduction Techniques In Response Surface Designs
Dimensionality Reduction Techniques In Response Surface DesignsDimensionality Reduction Techniques In Response Surface Designs
Dimensionality Reduction Techniques In Response Surface Designs
 

More from jemille6

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”jemille6
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilismjemille6
 
D. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfD. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfjemille6
 
reid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfjemille6
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022jemille6
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inferencejemille6
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?jemille6
 
What's the question?
What's the question? What's the question?
What's the question? jemille6
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...jemille6
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Twojemille6
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...jemille6
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testingjemille6
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredgingjemille6
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probabilityjemille6
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severityjemille6
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)jemille6
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)jemille6
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...jemille6
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (jemille6
 
The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...jemille6
 

More from jemille6 (20)

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
 
D. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfD. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdf
 
reid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdf
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inference
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
 
What's the question?
What's the question? What's the question?
What's the question?
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredging
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
 
The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...
 

Recently uploaded

CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 

Recently uploaded (20)

CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 

The neglected importance of complexity in statistics and Metascience

  • 1. The neglected importance of complexity in statistics and Metascience  Daniele Fanelli
  • 2. In this talk: 1) what’s missing in the current paradigm 2) what a new paradigm might look like 3) evidence in support of this proposal
  • 3. The elephant in the room of: 1) metascience ● e.g. reproducibility 2) statistics ● e.g. model “complexity” ● [see SW seminar 2021, Fanelli 2019, 2022]
  • 4. What is “complex”? Level of complexity Many, diverse, interacting parts. Long to describe, difficult to predict.
  • 5. example 1) complexity deflates the “reproducibility crisis”. year project discipline N result 2014 Many labs 1 psychology, misc. 13, 36 labs 77% 2016 COS social+cognitive psychology 100 36-68% 2016 Camerer et al. experimental economics 18 61-78% 2018 Many labs 2 social+cognitive psychology 28, 62 samples, 36 countries 54% 2018 Camerer et al. social studies in Nature, Science 21 57-67% 2021 RPCB cancer biology 188, 50 exper., 23 papers 3-82% Lower reproducib: 1) complex phenomena 2) complex methods ● not just random noise ● structured, systematic diff.
  • 6. example 2) complexity confuses statistical results AIC:−2log( L)+2k
  • 7. models vary by “fitting propensity” (“complexity” beyond n. of parameters) (Bonifay and Cai 2017) universe of possible data When is a theory actually supported?
  • 8. How might the elephant appear? 1) integrate complexity of phenomena & methods in measuring, forecasting, correcting reproducibility 2) penalize statistical models for complexity beyond number of parameters integrating “complexity” ~ severity of testing
  • 10. K theory in a nutshell (Fanelli 2019, Fanelli 2022) K = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY +
  • 11. K = consilience K = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY explain/predict/control more/diverse phenomena with fewer/simpler theories/methods +
  • 12. what the variables represent = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K explain/predict/control more/diverse phenomena with fewer/simpler theories/methods +
  • 13. more information Y makes K GROW (Fanelli 2019, Fanelli 2022) K= H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY +
  • 14. (Fanelli 2019, Fanelli 2022) = - H(Y|X, τ)) H(X) D(τ)) nX H(Y1 ) + H(Y2 ) + H(Y3 ) nY H(Y1 ) + H(Y2 ) + H(Y3 ) + K more information Y makes K GROW +
  • 15. K theory in a nutshell (Fanelli 2019, Fanelli 2022) = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K +
  • 16. more info. Y|X, X, τ makes K small (Fanelli 2019, Fanelli 2022) K = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY +
  • 17. (Fanelli 2019, Fanelli 2022) K = H(Y) - H(Y) H(X) + nX nY H(Y1 |X, τ))+H(Y2 |X, τ))+H(Y3 |X, τ)) D(τ)1 )+D(τ)2 )+D(τ)3 ) + more info. Y|X, X, τ makes K small
  • 18. K theory in a nutshell (Fanelli 2019, Fanelli 2022) = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K + = ≈1, full consilience =0, no knowledge <0, wrong
  • 19. K theory in a nutshell (Fanelli 2019, Fanelli 2022) = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K +
  • 20. K of a regression model (Fanelli 2019, Fanelli 2022) Y = α + β X + error = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K +
  • 21. (Fanelli 2019, Fanelli 2022) Y = α + β X + error this has been said before = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K +
  • 22. (Fanelli 2019, Fanelli 2022) key theoretical innovations Y = α + β X + error = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K +
  • 23. (Fanelli 2019, Fanelli 2022) key methodological innovations Y = α + β X + error = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K +
  • 24. (Fanelli 2019, Fanelli 2022) graphs are everywhere in science methodologies theories (Fanelli 2019, Fanelli 2022) www.protocols.io/view/an-optimized-protocol- for-in-vivo-analysis-of-tumo-3byl471m2lo5/v1 = H(Y) - H(Y|X, τ)) H(Y) H(X) D(τ)) + nX nY K + (Mueller 2015, ICSS)
  • 26. 1) K predicts perceived and actual reproducibility
  • 27. “tau” of biological experiments (Fanelli, Tan, Amaral & Neves, 2022, MetaArxiv)
  • 28. K vs. perceived reproducibility (Fanelli, Tan, Amaral & Neves, 2022, MetaArxiv)
  • 29. K vs. actual reproducibility Kr=K o 2−λ⋅d kr hr=ko ho 2−λ⋅d log kr ko =log ho hr −λ⋅d R≡log H (Y )−H (Y∣X , τr) H (Y )−H (Y∣X , τo) =α+βlog 1 D (τr)/ N
  • 30. (Fanelli, Tan, Amaral & Neves, in prep) 1) K predicts actual reproducibility (part of collaboration with Brazilian Reproducibility Initiative)
  • 31. (Fanelli, Tan, Amaral & Neves, in prep) independent new predictor multiple regression, Y=reproducibility
  • 32. (Fanelli, Tan, Amaral & Neves, in prep) multiple regression, Y=reproducibility (here D(τ) based on sentences in replication protocol!) very easy to measure, automatize
  • 33. better than just P-values and N (Fanelli, Tan, Amaral & Neves, in prep) multiple regression, Y=reproducibility R
  • 34. leads to progress in metascience (Fanelli, Tan, Amaral & Neves, in prep) multiple regression, Y=reproducibility
  • 35. 2) D(τ) might predict fitting propensity
  • 36. preregistered test (Fanelli & Bonifay, in prep)
  • 37. 1) generated N=20 models, all with 36 parameters 2) derived D(τ)), predicted their fitting propensity 3) tested them on 20,000 random covariance matrices (Fanelli & Bonifay, in prep) preregistered test
  • 38. D(τ) uniquely reflects model complexity (pre-registered test) (Fanelli & Bonifay, in progress)
  • 39. (Fanelli & Bonifay, in progress) NO alternative theory explains this! (pre-registered test) multiple regression: Y=fitting propensity
  • 40.
  • 41. Spearman’s ρ = 0.64, P<0.002 Spearman’s ρ = 0.73, P<0.001
  • 42. (Bonifay and Cai 2017) Bifactor confirmatory (theoretical) 20 parameters EIFA: exploratory (a-theoretical) 20 parameters (Fanelli & Bonifay, in progress) how K improves theory testing example of application
  • 43. theories invoking general factor (e.g. IQ, stress, psychosis) Bifactor widely used to “test” theories
  • 44. (Bonifay and Cai 2017) Bifactor confirmatory (theoretical) 20 parameters EIFA: exploratory (a-theoretical) 20 parameters K(Bifactor) ≥ K(EIFA) “The [bifactor-encoded] theory is specifically supported by the data” (Fanelli & Bonifay, in progress) P<0.05 when is a theory actually supported? example of application
  • 45. Summary of this talk: 1) what’s missing in the current paradigm? ● we pretend complexity is irrelevant 2) what might a new paradigm look like? ● measuring D(τ), integrating/penalizing with K 3) evidence in support of this proposal? ● increasingly promising ● any alternative, better suggestions?