SlideShare a Scribd company logo
1 of 22
Download to read offline
Data-dependent analytical choices
relying on NHST should not be trusted!
Jesper W. Schneider
Danish Centre for Studies in Research and Research Policy,
Aarhus University, Denmark
jws@ps.au.dk
NHST = “null hypothesis significance test”
• A standard mode of inference in social and behavioral science is to establish
stylized facts using statistical significance in quantitative studies
• NHST is the practice of selection on statistical significance using an arbitrary
threshold (p < .05) to decide whether something is “true” or “false”
• NHST is when we try to knock down a strawman hypothesis to be able to say
something about our preferred hypothesis which we do not test
My claim
• Findings in Scientometrics relying on NHST are not to be trusted!
• … or we should be sceptical of claims from individual studies, especially when “decisions”
are based on NHST
• Why?
• At best they are vey susceptible to “garden of forking paths”
• At worst they have been deliberately p-hacked
• So my arguments are primarily in the statistical realm
But is this really relevant for
our “field”?
Knowledge production modes in
Scientometrics
• The field is diverse when it comes to methodology and methods
• Its core and boundaries are somewhat elusive
• The quantitative methodological domain is seemingly descriptive (exploratory) and often
case based
• … at least on the surface, as many studies are tacitly confirmatory studying explicit or
implicit hypotheses relying on NHST
• … and here we should watch out!
‘‘… the data did not show a significant relationship between the proportion of female
authors and the number of citations received, controlled by the number of authors who
signed each paper and the journal impact factor (r = -0.085, p = 0.052)”
So what’s the fuzz about?
Checked observed data
Log-transform skewed
data
X 1 is correlated with y
include as covariate
Use OLS regression P < .05
This is usually what is presented … the straight way to
the “truth”, and seemingly the only way to the “truth”
- A pre-determined path!!
Researcher-degrees-
of-freedom
Seemingly an endless number of choices,
so go ahead and pick what you want and
make your own recipe
Checked observed data
Log-transform skewed
data
X 1 is correlated with y
include as covariate
Use OLS regression P < .05
… and your recipe gave you this!
But if our recipe or the choices
we made under way were
slightly different
Checked observed data
Log-transform skewed
data
X 1 is correlated with y
include as covariate
Use OLS regression P < .05
Data are not skewed
X 1 is correlated with y
include as covariate
Use OLS regresion P < .??
We chose a slightly different path?
Checked observed data
Log-transform skewed
data
X 1 is correlated with y
include as covariate
Use OLS regression P < .05
Data are not skewed
X 1 is correlated with y
include as covariate
Use OLS regresion P < .??
X 1 is not correlated
with y and is excluded
Use OLS regresion P < .??
Or yet another?
Same data, many potential
paths leading to potentially
different outcomes!
The “garden of forking paths”
Is this a problem?
• Yes
• Because selection on significance (confirmatory)
• Multi-comparisons problem leading to many false-positive claims
• No
• If NHST is dropped = exploratory analysis
• If pre-registered
• If all data and choices are open and transparent
• Take away message:
• Different stories can be spun from the same data
• A p-value is specific for the path you chose and thus variable with choices
But worse!
P-hacking
• Looking at data and changing
paths a long the way chasing
significance
• Not exactly a pre-determined
path
• Leads to many problems, not
least abundant false-positive
claims
• P-values are uninterpretable
Average power ≈ 24%
in social and
behavioural sciences,
but 90% of studies
find an “effect”
Everybody is a p-hacker but some do it unintentionally
So what is the relation to
reproducibility?
Reproducibility
• It is no wonder that “effects” or “findings” do not “replicate”, or go in all directions when
reproducibility is judged on a statistical basis
• The basic fact is that “effects” vary across conditions – “we cannot step into the same
river twice”
• We do not have they required “controlled settings”
• We usually have indirect and very noisy measurements
• Effects are generally small, “the low hanging fruits have been grabbed”
• Researcher-degrees-of-freedom are countless
• Data analytical choices are many and arbitrary
• A main culprit is the labelling of “replications” as success or failures!
• … this is NHST with its arbitrary thresholds for success or failure and it leads to “false-
positives”
My overall message
• No replication is truly direct, and I recommend moving away from the classification of
replications as “direct” or “conceptual” to a framework in which we accept that effects
vary across conditions
• Relatedly, we should stop labeling replications as successes or failures and instead use
continuous measures to compare different studies
• But the general impression of “replication” or the “potential of replication” is quite clear
and central
• For example, if effects can vary by context, this provides more reason why
“replication” is necessary for scientific progress
• Full transparency of data collection and processing procedures, of actual data, of all
analyses done and how they are done, providing code are essential
• … and stop ‘over-selling’ your selective results
My overall message
• A statistically significant result does not mean that we have found a “true” or “real”
effect
• Few studies provide definitive evidence … although the stories they tell are often framed
in that way
• We need many studies of the same phenomenon, call it “replication” but we should
express the difference between old and new studies in terms of the expected variation
in the effect between conditions, not whether it is a success or failure
Quantitative studies in Scientometrics
relying on NHST is no exception to this!!
Thank you very much!

More Related Content

What's hot

Surviving statistics lecture 1
Surviving statistics lecture 1Surviving statistics lecture 1
Surviving statistics lecture 1MikeBlyth
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingPenn State University
 
Analysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. DavenportAnalysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. DavenportVaibhav Srivastav
 
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."jemille6
 
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxMaarten van Smeden
 
Chapter 1 (continued)
Chapter 1 (continued)Chapter 1 (continued)
Chapter 1 (continued)Robert Carels
 
Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...
Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...
Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...PhD Assistance
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”jemille6
 
Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011Sandra Nicks
 
sience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studysience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studywolf vanpaemel
 
Senn repligate
Senn repligateSenn repligate
Senn repligatejemille6
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy jemille6
 
Insights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityInsights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityDorothy Bishop
 
Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG researchDorothy Bishop
 
Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyondMaarten van Smeden
 

What's hot (20)

Surviving statistics lecture 1
Surviving statistics lecture 1Surviving statistics lecture 1
Surviving statistics lecture 1
 
Bayes rpp bristol
Bayes rpp bristolBayes rpp bristol
Bayes rpp bristol
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-Making
 
Analysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. DavenportAnalysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
Analysis of the article "A Predictive Analytics Primer" by Thomas H. Davenport
 
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
 
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradox
 
Chapter 1 (continued)
Chapter 1 (continued)Chapter 1 (continued)
Chapter 1 (continued)
 
Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...
Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...
Mention The Steps That Econometricians Take To Carry Out An Empirical Study -...
 
P-values in crisis
P-values in crisisP-values in crisis
P-values in crisis
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
 
Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011
 
sience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studysience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real study
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
 
Insights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityInsights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibility
 
Research Method
Research MethodResearch Method
Research Method
 
Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG research
 
Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyond
 

Similar to Sti2018 jws

Presentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalPresentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalIstiqlalEid
 
P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...David Pratap
 
Tests of significance Periodontology
Tests of significance PeriodontologyTests of significance Periodontology
Tests of significance PeriodontologySaiLakshmi128
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingpraveen3030
 
Hypothesis....Phd in Management, HR, HRM, HRD, Management
Hypothesis....Phd in Management, HR, HRM, HRD, ManagementHypothesis....Phd in Management, HR, HRM, HRD, Management
Hypothesis....Phd in Management, HR, HRM, HRD, Managementdr m m bagali, phd in hr
 
De-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statisticsDe-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statisticsGillian Byrne
 
CourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDFCourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDFJasmine Tamanaha
 
321423152 e-0016087606-session39134-201012122352 (1)
321423152 e-0016087606-session39134-201012122352 (1)321423152 e-0016087606-session39134-201012122352 (1)
321423152 e-0016087606-session39134-201012122352 (1)Iin Angriyani
 
Statistics basics for oncologist kiran
Statistics basics for oncologist kiranStatistics basics for oncologist kiran
Statistics basics for oncologist kiranKiran Ramakrishna
 
Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2John Labrador
 
Statistics for IB Biology
Statistics for IB BiologyStatistics for IB Biology
Statistics for IB BiologyEran Earland
 
Qualitative Research and CEHP (Turell & Howson)
Qualitative Research and CEHP (Turell & Howson)Qualitative Research and CEHP (Turell & Howson)
Qualitative Research and CEHP (Turell & Howson)D. Warnick Consulting
 
Inferential include one or more of the inferential statistical procedures.docx
Inferential include one or more of the inferential statistical procedures.docxInferential include one or more of the inferential statistical procedures.docx
Inferential include one or more of the inferential statistical procedures.docxwrite4
 
When to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxWhen to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxAsokan R
 
Principles of Diagnostic Testing and ROC 2016
Principles of Diagnostic Testing and ROC 2016Principles of Diagnostic Testing and ROC 2016
Principles of Diagnostic Testing and ROC 2016evadew1
 
Bio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchBio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchShinjan Patra
 

Similar to Sti2018 jws (20)

Presentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalPresentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlal
 
P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...
 
Tests of significance Periodontology
Tests of significance PeriodontologyTests of significance Periodontology
Tests of significance Periodontology
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Hypothesis....Phd in Management, HR, HRM, HRD, Management
Hypothesis....Phd in Management, HR, HRM, HRD, ManagementHypothesis....Phd in Management, HR, HRM, HRD, Management
Hypothesis....Phd in Management, HR, HRM, HRD, Management
 
De-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statisticsDe-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statistics
 
CourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDFCourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDF
 
321423152 e-0016087606-session39134-201012122352 (1)
321423152 e-0016087606-session39134-201012122352 (1)321423152 e-0016087606-session39134-201012122352 (1)
321423152 e-0016087606-session39134-201012122352 (1)
 
Module7_RamdomError.pptx
Module7_RamdomError.pptxModule7_RamdomError.pptx
Module7_RamdomError.pptx
 
What is research
What is researchWhat is research
What is research
 
Statistics basics for oncologist kiran
Statistics basics for oncologist kiranStatistics basics for oncologist kiran
Statistics basics for oncologist kiran
 
Data in science
Data in science Data in science
Data in science
 
Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2
 
Statistics for IB Biology
Statistics for IB BiologyStatistics for IB Biology
Statistics for IB Biology
 
Qualitative Research and CEHP (Turell & Howson)
Qualitative Research and CEHP (Turell & Howson)Qualitative Research and CEHP (Turell & Howson)
Qualitative Research and CEHP (Turell & Howson)
 
Hypotheses
HypothesesHypotheses
Hypotheses
 
Inferential include one or more of the inferential statistical procedures.docx
Inferential include one or more of the inferential statistical procedures.docxInferential include one or more of the inferential statistical procedures.docx
Inferential include one or more of the inferential statistical procedures.docx
 
When to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxWhen to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptx
 
Principles of Diagnostic Testing and ROC 2016
Principles of Diagnostic Testing and ROC 2016Principles of Diagnostic Testing and ROC 2016
Principles of Diagnostic Testing and ROC 2016
 
Bio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchBio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical research
 

Recently uploaded

Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoKayode Fayemi
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfSenaatti-kiinteistöt
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxraffaeleoman
 
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...David Celestin
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Baileyhlharris
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfSkillCertProExams
 
Digital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of DrupalDigital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of DrupalFabian de Rijk
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIINhPhngng3
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar TrainingKylaCullinane
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaKayode Fayemi
 
lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lodhisaajjda
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatmentnswingard
 
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdfSOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdfMahamudul Hasan
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...amilabibi1
 

Recently uploaded (15)

Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
 
ICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdfICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdf
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
Proofreading- Basics to Artificial Intelligence Integration - Presentation:Sl...
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Bailey
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
 
Digital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of DrupalDigital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of Drupal
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio III
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatment
 
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdfSOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
SOLID WASTE MANAGEMENT SYSTEM OF FENI PAURASHAVA, BANGLADESH.pdf
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
 

Sti2018 jws

  • 1. Data-dependent analytical choices relying on NHST should not be trusted! Jesper W. Schneider Danish Centre for Studies in Research and Research Policy, Aarhus University, Denmark jws@ps.au.dk
  • 2. NHST = “null hypothesis significance test” • A standard mode of inference in social and behavioral science is to establish stylized facts using statistical significance in quantitative studies • NHST is the practice of selection on statistical significance using an arbitrary threshold (p < .05) to decide whether something is “true” or “false” • NHST is when we try to knock down a strawman hypothesis to be able to say something about our preferred hypothesis which we do not test
  • 3. My claim • Findings in Scientometrics relying on NHST are not to be trusted! • … or we should be sceptical of claims from individual studies, especially when “decisions” are based on NHST • Why? • At best they are vey susceptible to “garden of forking paths” • At worst they have been deliberately p-hacked • So my arguments are primarily in the statistical realm
  • 4. But is this really relevant for our “field”?
  • 5. Knowledge production modes in Scientometrics • The field is diverse when it comes to methodology and methods • Its core and boundaries are somewhat elusive • The quantitative methodological domain is seemingly descriptive (exploratory) and often case based • … at least on the surface, as many studies are tacitly confirmatory studying explicit or implicit hypotheses relying on NHST • … and here we should watch out! ‘‘… the data did not show a significant relationship between the proportion of female authors and the number of citations received, controlled by the number of authors who signed each paper and the journal impact factor (r = -0.085, p = 0.052)”
  • 6. So what’s the fuzz about?
  • 7. Checked observed data Log-transform skewed data X 1 is correlated with y include as covariate Use OLS regression P < .05 This is usually what is presented … the straight way to the “truth”, and seemingly the only way to the “truth” - A pre-determined path!!
  • 8. Researcher-degrees- of-freedom Seemingly an endless number of choices, so go ahead and pick what you want and make your own recipe
  • 9. Checked observed data Log-transform skewed data X 1 is correlated with y include as covariate Use OLS regression P < .05 … and your recipe gave you this!
  • 10. But if our recipe or the choices we made under way were slightly different
  • 11. Checked observed data Log-transform skewed data X 1 is correlated with y include as covariate Use OLS regression P < .05 Data are not skewed X 1 is correlated with y include as covariate Use OLS regresion P < .?? We chose a slightly different path?
  • 12. Checked observed data Log-transform skewed data X 1 is correlated with y include as covariate Use OLS regression P < .05 Data are not skewed X 1 is correlated with y include as covariate Use OLS regresion P < .?? X 1 is not correlated with y and is excluded Use OLS regresion P < .?? Or yet another?
  • 13. Same data, many potential paths leading to potentially different outcomes!
  • 14. The “garden of forking paths”
  • 15. Is this a problem? • Yes • Because selection on significance (confirmatory) • Multi-comparisons problem leading to many false-positive claims • No • If NHST is dropped = exploratory analysis • If pre-registered • If all data and choices are open and transparent • Take away message: • Different stories can be spun from the same data • A p-value is specific for the path you chose and thus variable with choices
  • 17. P-hacking • Looking at data and changing paths a long the way chasing significance • Not exactly a pre-determined path • Leads to many problems, not least abundant false-positive claims • P-values are uninterpretable Average power ≈ 24% in social and behavioural sciences, but 90% of studies find an “effect” Everybody is a p-hacker but some do it unintentionally
  • 18. So what is the relation to reproducibility?
  • 19. Reproducibility • It is no wonder that “effects” or “findings” do not “replicate”, or go in all directions when reproducibility is judged on a statistical basis • The basic fact is that “effects” vary across conditions – “we cannot step into the same river twice” • We do not have they required “controlled settings” • We usually have indirect and very noisy measurements • Effects are generally small, “the low hanging fruits have been grabbed” • Researcher-degrees-of-freedom are countless • Data analytical choices are many and arbitrary • A main culprit is the labelling of “replications” as success or failures! • … this is NHST with its arbitrary thresholds for success or failure and it leads to “false- positives”
  • 20. My overall message • No replication is truly direct, and I recommend moving away from the classification of replications as “direct” or “conceptual” to a framework in which we accept that effects vary across conditions • Relatedly, we should stop labeling replications as successes or failures and instead use continuous measures to compare different studies • But the general impression of “replication” or the “potential of replication” is quite clear and central • For example, if effects can vary by context, this provides more reason why “replication” is necessary for scientific progress • Full transparency of data collection and processing procedures, of actual data, of all analyses done and how they are done, providing code are essential • … and stop ‘over-selling’ your selective results
  • 21. My overall message • A statistically significant result does not mean that we have found a “true” or “real” effect • Few studies provide definitive evidence … although the stories they tell are often framed in that way • We need many studies of the same phenomenon, call it “replication” but we should express the difference between old and new studies in terms of the expected variation in the effect between conditions, not whether it is a success or failure Quantitative studies in Scientometrics relying on NHST is no exception to this!!
  • 22. Thank you very much!