Preregistration as a Tool to
Evaluate the Severity of a Test
Daniël Lakens
Bakan, 1966
Good research practices
Bem, 2011
Do you trust that Daryl Bem
predicted an effect of
erotic pictures? And if
not, why should anyone
trust you?
Who cares if the
effect was predicted?
Severe testers care.
“The main way a theory gets
money in the bank is by
predicting facts that, absent
the theory, would be
antecedently improbable.”
Meehl, 1990
Why do we care about
prediction? Do we want
temporal novelty?
No.
Mayo, 2018
De Groot, 1969
Why do we care about
prediction? Do we want
theoretical novelty?
No.
Mayo, 2018
Why do we care about
prediction? Do we want
use novelty? Maybe,
but it’s a bit vague.
Mayo, 2018
The real underlying
question is: Is the
severity of the test
violated?
Mayo, 2018
Preregistration has a long
history in science – but
the replication crisis and
rise of the internet created
momentum in psych.
Wiseman, Watt, & Kornbrot, 2019
Preregistration
Preregistration
formalizes error
control.
Often raised in the context
of Type 1 errors, but just as
relevant for Type 2 errors
(e.g., not finding effects if
they exist).
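As a rough illustration of the two error rates, the sketch below computes the approximate power of a two-sided, two-sample test under a normal approximation. The effect size (0.5), the sample size (64 per group), and the function name `two_sample_power` are assumptions chosen for illustration; they do not come from the slides.

```python
from math import sqrt
from statistics import NormalDist


def two_sample_power(effect_size: float, n_per_group: int, alpha: float = 0.05) -> float:
    """Approximate power of a two-sided, two-sample z-test.

    Uses a normal approximation with a standardized effect size,
    so it is a sketch rather than an exact t-test power analysis.
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic when the effect is real.
    shift = effect_size * sqrt(n_per_group / 2)
    upper_tail = 1 - std_normal.cdf(z_crit - shift)
    lower_tail = std_normal.cdf(-z_crit - shift)
    return upper_tail + lower_tail


alpha = 0.05  # Type 1 error rate: claiming an effect when there is none
power = two_sample_power(effect_size=0.5, n_per_group=64, alpha=alpha)
type_2_error_rate = 1 - power  # Type 2 error rate: missing an effect that exists

print(f"Type 1 error rate (alpha): {alpha:.2f}")
print(f"Power: {power:.2f}, Type 2 error rate (beta): {type_2_error_rate:.2f}")
```

Both numbers only describe the test if the sample size and analysis were fixed in advance, which is what a preregistration lets others check.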
Prior to 2000, 17/30 large
National Heart, Lung, and Blood
Institute funded trials showed
an effect. After preregistration
became required, only 2/25.
Kaplan & Irvin, 2015
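The two fractions on this slide are easier to compare as percentages. A minimal sketch of the arithmetic, using only the counts quoted above (the variable names are illustrative):

```python
from fractions import Fraction

# Counts quoted on the slide (Kaplan & Irvin, 2015).
before_2000 = Fraction(17, 30)  # trials showing an effect before 2000
after_2000 = Fraction(2, 25)    # trials showing an effect after registration was required

print(f"Before 2000: {float(before_2000):.1%}")
print(f"After 2000:  {float(after_2000):.1%}")
print(f"Difference:  {float(before_2000 - after_2000):.1%}")
```

This only restates the reported counts; it says nothing about why the proportion dropped.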
De Groot, 1969
“In order to provide hard evidence for or against an
empirical proposition, one has to resort to strictly
confirmatory studies. The degree to which the scientific
community will accept semiconfirmatory studies as
evidence depends partly on the plausibility of the claim
under scrutiny.”
Wagenmakers, Wetzels, Borsboom, van der Maas, 2012
Nosek & Lakens, 2014
So initially, preregistration was presented
purely as an approach to make it possible to
‘audit’ p-values, and check that they did not
lose their “error probing capacity” (Mayo,
2018).
However, researchers also too easily assumed preregistered
studies are always more compelling:
“This is particularly important if one wants to convince a
skeptical audience of a controversial claim: After all,
confirmatory studies are much more compelling than
exploratory studies.”
Wagenmakers, Wetzels, Borsboom, van der Maas, 2012
Taken together, these practices [reducing p-hacking and
publication bias, and power analysis] will ensure that articles
published as Registered Reports have a substantially higher
truth value than regular studies. Such articles can therefore
be expected to be more replicable and have a greater impact
on the field.
Chambers, NeuroChambers blog, 2012
Preregistration clarifies the distinction between
planned and unplanned research by reducing
unnoticed flexibility. This improves credibility
of findings and calibration of uncertainty.
Nosek, Beck, Campbell, Flake, Hardwicke, Mellor, van ‘t Veer, Vazire, 2019
In practice, confirmatory studies might be
much more compelling, have improved
credibility of findings, and a higher truth
value. They also might not.
By not presenting this as an empirical
possibility on average, people promoting
registered reports upset some of their
peers. Multiple people have made a point
along the lines of:
“Thus, the danger in focusing on the achievement of
methodological precision is that we forget to consider
critically whether we have actually examined the issues
at hand in the best possible way, or whether we even are
asking the appropriate questions. To the extent that this is
the case, we are at risk of becoming methodological
fetishists”
Ellemers, 2013
The discussion about costs and
benefits of preregistration has
been hindered by a lack of a
conceptual analysis of what
preregistration aims to
accomplish.
So what is
preregistration for?
Preregistration adds value for people who,
based on their philosophy of science,
increase their trust in claims that are
supported by severe tests and predictive
successes.
Lakens, 2020
Preregistration has the goal to allow
others to transparently evaluate the
capacity of a test to support or
falsify a prediction.
Lakens, 2020
Preregistration itself does not make
a study better or worse compared to
a non-preregistered study.
Lakens, 2020
For example, preregistering a
tautological prediction (the theory of
reasoned action predicts reasoned
action) is not a severe test.
Fiedler, 2004
Let alone that many preregistrations
are of very low quality (which social
norms currently do not allow you to
say out loud, because “they tried”).
However, preregistration makes it
relatively more likely that tests have
desired long run error rates, and thus
makes it relatively more likely that
tests are more severe.
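To make the point about long run error rates concrete, here is a minimal sketch of what happens to the Type 1 error rate when a researcher can choose, after seeing the data, which of several tests to report. It assumes every null hypothesis is true, the tests are independent, and p-values are uniformly distributed under the null. The function names and the choice of five tests are illustrative assumptions, not something from the slides.

```python
import random


def familywise_error_rate(alpha: float, n_tests: int) -> float:
    """Chance of at least one p < alpha among n independent tests of true nulls."""
    return 1 - (1 - alpha) ** n_tests


def simulate_flexible_analysis(alpha: float, n_tests: int, n_studies: int, seed: int = 1) -> float:
    """Simulate studies in which only the smallest of n_tests p-values is reported."""
    rng = random.Random(seed)
    false_positives = 0
    for _ in range(n_studies):
        # Under a true null, each p-value is a draw from Uniform(0, 1).
        p_values = [rng.random() for _ in range(n_tests)]
        if min(p_values) < alpha:
            false_positives += 1
    return false_positives / n_studies


planned = familywise_error_rate(alpha=0.05, n_tests=1)   # one preregistered test
flexible = familywise_error_rate(alpha=0.05, n_tests=5)  # best of five unplanned tests
simulated = simulate_flexible_analysis(alpha=0.05, n_tests=5, n_studies=20_000)

print(f"One planned test:      {planned:.3f}")
print(f"Best of five (theory): {flexible:.3f}")
print(f"Best of five (sim):    {simulated:.3f}")
```

A reader cannot tell from the reported p-value alone which of these situations produced it. A preregistration does not prevent the flexible analysis, but it makes it visible.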
But Daniël, surely that
is not the only thing
preregistration is for?
Yes, that is all it does. Preregistration
only has the goal to allow others to
transparently evaluate the capacity of
a test to support or falsify a prediction.
Sure, completing a preregistration
has other consequences. Thinking
through your study might improve it.
But there is no need to publicly post
the preregistration to reap that
benefit. They are simply positive
externalities.
Why does this distinction matter? If
the use of a tool is detached from a
philosophy of science it risks
becoming a heuristic.
Researchers should not choose to
preregister because it has become a
new norm, they should preregister
because it supports their goal: severe
tests.
Thank you
@Lakens

More Related Content

What's hot

Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist PerformanceProbing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
jemille6
 
Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...
Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...
Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...
jemille6
 
Replication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden ControversiesReplication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden Controversies
jemille6
 
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
jemille6
 
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
jemille6
 
Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13
jemille6
 
Controversy Over the Significance Test Controversy
Controversy Over the Significance Test ControversyControversy Over the Significance Test Controversy
Controversy Over the Significance Test Controversy
jemille6
 
D. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severelyD. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severely
jemille6
 
Severe Testing: The Key to Error Correction
Severe Testing: The Key to Error CorrectionSevere Testing: The Key to Error Correction
Severe Testing: The Key to Error Correction
jemille6
 
Gelman psych crisis_2
Gelman psych crisis_2Gelman psych crisis_2
Gelman psych crisis_2
jemille6
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
jemille6
 
Final mayo's aps_talk
Final mayo's aps_talkFinal mayo's aps_talk
Final mayo's aps_talk
jemille6
 
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
jemille6
 
Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma
jemille6
 
D. Mayo: Philosophy of Statistics & the Replication Crisis in Science
D. Mayo: Philosophy of Statistics & the Replication Crisis in ScienceD. Mayo: Philosophy of Statistics & the Replication Crisis in Science
D. Mayo: Philosophy of Statistics & the Replication Crisis in Science
jemille6
 
Feb21 mayobostonpaper
Feb21 mayobostonpaperFeb21 mayobostonpaper
Feb21 mayobostonpaper
jemille6
 
Mayo & parker spsp 2016 june 16
Mayo & parker   spsp 2016 june 16Mayo & parker   spsp 2016 june 16
Mayo & parker spsp 2016 june 16
jemille6
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
jemille6
 
April 3 2014 slides mayo
April 3 2014 slides mayoApril 3 2014 slides mayo
April 3 2014 slides mayo
jemille6
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
jemille6
 

What's hot (20)

Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist PerformanceProbing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
 
Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...
Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...
Surrogate Science: How Fisher, Neyman-Pearson, and Bayes Were Transformed int...
 
Replication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden ControversiesReplication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden Controversies
 
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
 
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
 
Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13
 
Controversy Over the Significance Test Controversy
Controversy Over the Significance Test ControversyControversy Over the Significance Test Controversy
Controversy Over the Significance Test Controversy
 
D. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severelyD. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severely
 
Severe Testing: The Key to Error Correction
Severe Testing: The Key to Error CorrectionSevere Testing: The Key to Error Correction
Severe Testing: The Key to Error Correction
 
Gelman psych crisis_2
Gelman psych crisis_2Gelman psych crisis_2
Gelman psych crisis_2
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
 
Final mayo's aps_talk
Final mayo's aps_talkFinal mayo's aps_talk
Final mayo's aps_talk
 
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
 
Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma
 
D. Mayo: Philosophy of Statistics & the Replication Crisis in Science
D. Mayo: Philosophy of Statistics & the Replication Crisis in ScienceD. Mayo: Philosophy of Statistics & the Replication Crisis in Science
D. Mayo: Philosophy of Statistics & the Replication Crisis in Science
 
Feb21 mayobostonpaper
Feb21 mayobostonpaperFeb21 mayobostonpaper
Feb21 mayobostonpaper
 
Mayo & parker spsp 2016 june 16
Mayo & parker   spsp 2016 june 16Mayo & parker   spsp 2016 june 16
Mayo & parker spsp 2016 june 16
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
April 3 2014 slides mayo
April 3 2014 slides mayoApril 3 2014 slides mayo
April 3 2014 slides mayo
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
 

Similar to D. Lakens: Preregistration as a Tool to Evaluate the Severity of a Test

The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
jemille6
 
SAC360 Chapter 7 statistics
SAC360 Chapter 7 statisticsSAC360 Chapter 7 statistics
SAC360 Chapter 7 statistics
BealCollegeOnline
 
Jsm big-data
Jsm big-dataJsm big-data
Jsm big-data
Sean Taylor
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
jemille6
 
Measuring Risk - What Doesn’t Work and What Does
Measuring Risk - What Doesn’t Work and What DoesMeasuring Risk - What Doesn’t Work and What Does
Measuring Risk - What Doesn’t Work and What Does
Jody Keyser
 
Answer these questions in a paragraph or more. Use teh prompts below
Answer these questions in a paragraph or more. Use teh prompts belowAnswer these questions in a paragraph or more. Use teh prompts below
Answer these questions in a paragraph or more. Use teh prompts below
brockdebroah
 
Book Summary : Everybody Lies
Book Summary : Everybody LiesBook Summary : Everybody Lies
Book Summary : Everybody Lies
Rahul Rishi
 
AI & VR for Academy of Medical Educators.pptx
AI & VR for Academy of Medical Educators.pptxAI & VR for Academy of Medical Educators.pptx
AI & VR for Academy of Medical Educators.pptx
Janet Corral
 
تحليل البيانات وتفسير المعطيات
تحليل البيانات وتفسير المعطياتتحليل البيانات وتفسير المعطيات
تحليل البيانات وتفسير المعطيات
مركز البحوث الأقسام العلمية
 
Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)jemille6
 
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
jemille6
 
May Newsletter
May NewsletterMay Newsletter
May Newsletter
mjcunny
 
David Hand:Trustworthiness of statistical analysis
David Hand:Trustworthiness of statistical analysisDavid Hand:Trustworthiness of statistical analysis
David Hand:Trustworthiness of statistical analysis
jemille6
 
Causality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact AnalysisCausality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact Analysis
Bayesia USA
 
Evidence-Based Decision Making for Hospital Administrators
Evidence-Based Decision Making for Hospital AdministratorsEvidence-Based Decision Making for Hospital Administrators
Evidence-Based Decision Making for Hospital Administrators
Center for Evidence-Based Management
 
Jerait PDF.pdf
Jerait PDF.pdfJerait PDF.pdf
Jerait PDF.pdf
JeraitServices
 
10 Ways Dial Testing Will Improve Your Research
10 Ways Dial Testing Will Improve Your Research10 Ways Dial Testing Will Improve Your Research
10 Ways Dial Testing Will Improve Your Research
Brian Izenson
 
Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...
Mark Rubin
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
jemille6
 

Similar to D. Lakens: Preregistration as a Tool to Evaluate the Severity of a Test (20)

The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
 
SAC360 Chapter 7 statistics
SAC360 Chapter 7 statisticsSAC360 Chapter 7 statistics
SAC360 Chapter 7 statistics
 
Jsm big-data
Jsm big-dataJsm big-data
Jsm big-data
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
 
Measuring Risk - What Doesn’t Work and What Does
Measuring Risk - What Doesn’t Work and What DoesMeasuring Risk - What Doesn’t Work and What Does
Measuring Risk - What Doesn’t Work and What Does
 
Answer these questions in a paragraph or more. Use teh prompts below
Answer these questions in a paragraph or more. Use teh prompts belowAnswer these questions in a paragraph or more. Use teh prompts below
Answer these questions in a paragraph or more. Use teh prompts below
 
Book Summary : Everybody Lies
Book Summary : Everybody LiesBook Summary : Everybody Lies
Book Summary : Everybody Lies
 
AI & VR for Academy of Medical Educators.pptx
AI & VR for Academy of Medical Educators.pptxAI & VR for Academy of Medical Educators.pptx
AI & VR for Academy of Medical Educators.pptx
 
تحليل البيانات وتفسير المعطيات
تحليل البيانات وتفسير المعطياتتحليل البيانات وتفسير المعطيات
تحليل البيانات وتفسير المعطيات
 
Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)Mayo O&M slides (4-28-13)
Mayo O&M slides (4-28-13)
 
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
 
May Newsletter
May NewsletterMay Newsletter
May Newsletter
 
David Hand:Trustworthiness of statistical analysis
David Hand:Trustworthiness of statistical analysisDavid Hand:Trustworthiness of statistical analysis
David Hand:Trustworthiness of statistical analysis
 
Causality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact AnalysisCausality for Policy Assessment and 
Impact Analysis
Causality for Policy Assessment and 
Impact Analysis
 
Synopsis b2 b3 and b4
Synopsis   b2 b3 and b4Synopsis   b2 b3 and b4
Synopsis b2 b3 and b4
 
Evidence-Based Decision Making for Hospital Administrators
Evidence-Based Decision Making for Hospital AdministratorsEvidence-Based Decision Making for Hospital Administrators
Evidence-Based Decision Making for Hospital Administrators
 
Jerait PDF.pdf
Jerait PDF.pdfJerait PDF.pdf
Jerait PDF.pdf
 
10 Ways Dial Testing Will Improve Your Research
10 Ways Dial Testing Will Improve Your Research10 Ways Dial Testing Will Improve Your Research
10 Ways Dial Testing Will Improve Your Research
 
Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
 

More from jemille6

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”
jemille6
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
jemille6
 
D. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfD. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdf
jemille6
 
reid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdf
jemille6
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
jemille6
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inference
jemille6
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
jemille6
 
What's the question?
What's the question? What's the question?
What's the question?
jemille6
 
The neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metascience
jemille6
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
jemille6
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
jemille6
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
jemille6
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
jemille6
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredging
jemille6
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
jemille6
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)
jemille6
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
jemille6
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...
jemille6
 
The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...
jemille6
 
The Statistics Wars and Their Casualties
The Statistics Wars and Their CasualtiesThe Statistics Wars and Their Casualties
The Statistics Wars and Their Casualties
jemille6
 

More from jemille6 (20)

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
 
D. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfD. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdf
 
reid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdf
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inference
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
 
What's the question?
What's the question? What's the question?
What's the question?
 
The neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metascience
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredging
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...
 
The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...
 
The Statistics Wars and Their Casualties
The Statistics Wars and Their CasualtiesThe Statistics Wars and Their Casualties
The Statistics Wars and Their Casualties
 

D. Lakens: Preregistration as a Tool to Evaluate the Severity of a Test

  • 1. Preregistration as a Tool to Evaluate the Severity of a Test Daniël Lakens
  • 5. Do you trust Daryl Bem predicted an effect of erotic pictures? And if not, why should anyone trust you?
  • 6. Who cares if the effect was predicted? Severe testers care.
  • 7. “The main way a theory gets money in the bank is by predicting facts that, absent the theory, would be antecedently improbable.” Meehl, 1990
  • 8. Why do we care about prediction? Do we want temporal novelty? No. Mayo, 2018
  • 10. Why do we care about prediction? Do we want theoretical novelty? No. Mayo, 2018
  • 11. Why do we care about prediction? Do we want use novelty? Maybe, but it’s a bit vague. Mayo, 2018
  • 12. The real underlying question is: Is the severity of the test violated? Mayo, 2018
  • 13. Preregistration has a long history in science – but the replication crisis and rise of the internet created momentum in psych. Wiseman, Watt, & Kornbrot, 2019
  • 16. Often raised in the context of Type 1 errors, but just as relevant for Type 2 errors (e.g., not finding effects if they exist).
  • 17. Prior to 2000, 17 of 30 large trials funded by the National Heart, Lung, and Blood Institute showed an effect. After preregistration became required, only 2 of 25 did. Kaplan & Irvin, 2015
  • 19. “In order to provide hard evidence for or against an empirical proposition, one has to resort to strictly confirmatory studies. The degree to which the scientific community will accept semiconfirmatory studies as evidence depends partly on the plausibility of the claim under scrutiny.” Wagenmakers, Wetzels, Borsboom, van der Maas, 2012
  • 22. So initially, preregistration was presented purely as an approach to make it possible to ‘audit’ a p-value, and check that it did not lose its “error probing capacity” (Mayo, 2018).
  • 23. However, researchers also too easily assumed preregistered studies are always more compelling: “This is particularly important if one wants to convince a skeptical audience of a controversial claim: After all, confirmatory studies are much more compelling than exploratory studies.” Wagenmakers, Wetzels, Borsboom, van der Maas, 2012
  • 24. Taken together, these practices [reducing p-hacking and publication bias, and power analysis] will ensure that articles published as Registered Reports have a substantially higher truth value than regular studies. Such articles can therefore be expected to be more replicable and have a greater impact on the field. Chambers, NeuroChambers blog, 2012
  • 25. Preregistration clarifies the distinction between planned and unplanned research by reducing unnoticed flexibility. This improves credibility of findings and calibration of uncertainty. Nosek, Beck, Campbell, Flake, Hardwicke, Mellor, van ‘t Veer, Vazire, 2019
  • 26. In practice, confirmatory studies might be much more compelling, have improved credibility of findings, and have a higher truth value. They also might not.
  • 27. By presenting these benefits as a given, rather than as something that may hold on average, people promoting Registered Reports upset some of their peers. Multiple people have made a point along the lines of:
  • 28. “Thus, the danger in focusing on the achievement of methodological precision is that we forget to consider critically whether we have actually examined the issues at hand in the best possible way, or whether we even are asking the appropriate questions. To the extent that this is the case, we are at risk of becoming methodological fetishists” Ellemers, 2013
  • 29. The discussion about costs and benefits of preregistration has been hindered by a lack of a conceptual analysis of what preregistration aims to accomplish.
  • 31. Preregistration adds value for people who, based on their philosophy of science, increase their trust in claims that are supported by severe tests and predictive successes. Lakens, 2020
  • 32. Preregistration has the goal to allow others to transparently evaluate the capacity of a test to support or falsify a prediction. Lakens, 2020
  • 33. Preregistration itself does not make a study better or worse compared to a non-preregistered study. Lakens, 2020
  • 34. For example, preregistering a tautological prediction (the theory of reasoned action predicts reasoned action) is not a severe test. Fiedler, 2004
  • 35. Let alone that many preregistrations are of very low quality (which social norms currently do not allow you to say out loud, because “they tried”).
  • 37. However, preregistration makes it relatively more likely that tests have desired long run error rates, and thus makes it relatively more likely that tests are more severe.
  • 38. But Daniël, surely that is not the only thing preregistration is for?
  • 39. Yes, that is all it does. Preregistration only has the goal to allow others to transparently evaluate the capacity of a test to support or falsify a prediction.
  • 40. Sure, completing a preregistration has other consequences. Thinking through your study might improve it.
  • 41. But there is no need to publicly post the preregistration to reap that benefit. They are simply positive externalities.
  • 43. Why does this distinction matter? If the use of a tool is detached from a philosophy of science it risks becoming a heuristic.
  • 44. Researchers should not choose to preregister because it has become a new norm, they should preregister because it supports their goal: severe tests.
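
The claim that preregistration makes it more likely that tests have their desired long-run error rates can be illustrated with a small calculation. The sketch below is not from the slides: it assumes a researcher has several independent analyses available, all testing true null hypotheses, and reports whichever one reaches significance. The function name and the numbers of tests are hypothetical, chosen only for illustration.

```python
def familywise_error_rate(alpha: float, n_tests: int) -> float:
    """Probability of at least one false positive across n_tests
    independent tests of true null hypotheses, each run at level alpha."""
    return 1.0 - (1.0 - alpha) ** n_tests


if __name__ == "__main__":
    # Illustrative values only: alpha = 0.05 and up to 10 undisclosed analyses.
    for k in (1, 2, 5, 10):
        rate = familywise_error_rate(0.05, k)
        print(f"{k:2d} analyses -> long-run Type 1 error rate {rate:.3f}")
```

With a single planned analysis the error rate stays at the nominal alpha. With unplanned flexibility it grows, and without a preregistration a reader cannot tell which of the two situations produced the reported p-value.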

Editor's Notes

  1. https://www.youtube.com/watch?v=0Tdiu5kwjKs
  2. What if the effect would have been present only for negative pictures? Or romantic and erotic pictures? P-value (one-sided) is 0.013. Bonferroni correction: alpha = 0.01. Not significant.
  3. http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0132382 Large randomized controlled trials registered at ClinicalTrials.gov are more likely to report null results.
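
The Bonferroni reasoning in the note on the Bem example can be sketched as follows. The one-sided p-value of 0.013 and the corrected alpha of 0.01 come from the note. The overall alpha of 0.05 and the count of 5 candidate tests are assumptions, chosen because together they reproduce the corrected alpha of 0.01. The function names are hypothetical.

```python
def bonferroni_alpha(alpha: float, n_tests: int) -> float:
    """Per-test alpha level after a Bonferroni correction."""
    return alpha / n_tests


def is_significant(p_value: float, alpha: float, n_tests: int) -> bool:
    """True if p_value is at or below the Bonferroni-corrected alpha."""
    return p_value <= bonferroni_alpha(alpha, n_tests)


P_ONE_SIDED = 0.013  # from the editor's note
ALPHA = 0.05         # assumed overall alpha level
N_TESTS = 5          # assumed number of candidate tests (gives alpha = 0.01)

if __name__ == "__main__":
    corrected = bonferroni_alpha(ALPHA, N_TESTS)
    verdict = is_significant(P_ONE_SIDED, ALPHA, N_TESTS)
    print(f"corrected alpha = {corrected:.3f}, significant = {verdict}")
```

Under these assumptions the same p-value counts as significant when only one test was planned and as non-significant when five were possible, which is why it matters whether the prediction was specified in advance.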