SlideShare a Scribd company logo
1 of 1
Download to read offline
ISSN 1806-3713© 2015 Sociedade Brasileira de Pneumologia e Tisiologia
http://dx.doi.org/10.1590/S1806-37132015000000215
J Bras Pneumol. 2015;41(5):485-485 Continuing Education:
Scientific Methodology
485
What does the p value really mean?
Juliana Carvalho Ferreira1,3
, Cecilia Maria Patino2,3
Why calculate a p value?
Consider an experiment in which 10 subjects receive a
placebo, and another 10 receive an experimental diuretic.
After 8 h, the average urine output in the placebo group is
769 mL, versus 814 mL in the diuretic group—a difference
of 45 mL (Figure 1). How do we know if that difference
means the drug works and is not just a result of chance?
of 45 mL in the average urine output between groups
under the null hypothesis. Because this is a very small
probability, we reject the null hypothesis. It does not
mean that the drug is a diuretic, nor that there is 97%
chance of the drug being a diuretic.
Misconceptions about the p value
Clinical versus statistical significance of the
effect size
There is a misconception that a very small p value
means the difference between groups is highly relevant.
Looking at the p value alone deviates our attention from
the effect size. In our example, the p value is significant
but a drug that increases urine output by 45 mL has no
clinical relevance.
Nonsignificant p values
Another misconception is that if the p value is greater
than 5%, the new treatment has no effect. The p value
indicates the probability of observing a difference as
large or larger than what was observed, under the
null hypothesis. But if the new treatment has an effect
of smaller size, a study with a small sample may be
underpowered to detect it.
Overinterpreting a nonsignificant p value that is
close to 5%
Yet another misconception is that if the p value is close
to 5%, there is a trend towards a group difference. It is
inappropriate to interpret a p value of, say, 0.06, as a
trend towards a difference. A p value of 0.06 means
that there is a probability of 6% of obtaining that result
by chance when the treatment has no real effect. Because
we set the significance level at 5%, the null hypothesis
should not be rejected.
Effect sizes versus p values
Many researchers believe that the p value is the most
important number to report. However, we should focus
on the effect size. Avoid reporting the p value alone
and preferably report the mean values for each group,
the difference, and the 95% confidence interval—then
the p value.
Recommended Literature
1.	 Glantz SA. Primer in Biostatistics, 5th
ed. New York: McGraw-Hill; 2002.
Figure 1. Urine output (mL) for each subject in the placebo
(squares) and new drug groups (diamonds).
1000
900
800
700
600
Placebo New drug
Placebo
New drug
1. Divisão de Pneumologia, Instituto do Coração – InCor – Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brasil.
2. Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
3. Methods in Epidemiologic, Clinical and Operations Research–MECOR–program, American Thoracic Society/Asociación Latinoamericana del Tórax.
The most common way to approach this problem is
to use statistical hypothesis testing. First, we state the
null hypothesis of no statistical difference between the
groups and the alternative hypothesis of a statistical
difference. Then we select a statistical test to compute
a test statistic, which is a standardized numerical
measure of the between-group difference. Under the
null hypothesis, we expect the test statistic value to be
small, but there is a small probability that it is large,
just by chance. Once we calculate the test statistic, we
use it to calculate the p-value.
The p value is defined as the probability of observing
the given value of the test statistic, or greater, under the
null hypothesis. Traditionally, the cut-off value to reject
the null hypothesis is 0.05, which means that when no
difference exists, such an extreme value for the test
statistic is expected less than 5% of the time.
Now let us go back to our case: we are comparing means
and assuming that the data is normally distributed, so
we use a t-test and compute a t-statistic of 2.34, with
a p value of 0.031. Because we use a 0.05 cutoff for
the p value, we reject the null hypothesis and conclude
that there is a statistically significant difference between
groups. So what does “p = 0.031” mean? It means that
there is only a 3% probability of observing a difference

More Related Content

What's hot

Minimally important differences
Minimally important differencesMinimally important differences
Minimally important differencesStephen Senn
 
Numbers needed to mislead
Numbers needed to misleadNumbers needed to mislead
Numbers needed to misleadStephen Senn
 
Experimental design part 4 groups
Experimental design part 4 groupsExperimental design part 4 groups
Experimental design part 4 groupsKevin Hamill
 
Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Stephen Senn
 
What is your question
What is your questionWhat is your question
What is your questionStephenSenn2
 
Personalised medicine a sceptical view
Personalised medicine a sceptical viewPersonalised medicine a sceptical view
Personalised medicine a sceptical viewStephen Senn
 
Experimental design part 1
Experimental design  part 1Experimental design  part 1
Experimental design part 1Kevin Hamill
 
NNTs, responder analysis & overlap measures
NNTs, responder analysis & overlap measuresNNTs, responder analysis & overlap measures
NNTs, responder analysis & overlap measuresStephen Senn
 
Experimental design part 2 measurements
Experimental design part 2 measurements Experimental design part 2 measurements
Experimental design part 2 measurements Kevin Hamill
 
Hypothesis testing in statistics
Hypothesis testing in statisticsHypothesis testing in statistics
Hypothesis testing in statisticsMuhammadFardeen4
 
Bio statistical analysis in clinical research
Bio statistical analysis  in clinical research  Bio statistical analysis  in clinical research
Bio statistical analysis in clinical research Helwan University
 
Experimental design part 3 controls
Experimental design part 3 controlsExperimental design part 3 controls
Experimental design part 3 controlsKevin Hamill
 
Sample size & meta analysis
Sample size & meta analysisSample size & meta analysis
Sample size & meta analysisdrsrb
 
Eugm 2011 pocock - dm cs-and-adaptive-trials
Eugm 2011   pocock - dm cs-and-adaptive-trialsEugm 2011   pocock - dm cs-and-adaptive-trials
Eugm 2011 pocock - dm cs-and-adaptive-trialsCytel USA
 
Michael Festing - MedicReS World Congress 2011
Michael Festing - MedicReS World Congress 2011Michael Festing - MedicReS World Congress 2011
Michael Festing - MedicReS World Congress 2011MedicReS
 
Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG researchDorothy Bishop
 
Ruminations on replication
Ruminations on replicationRuminations on replication
Ruminations on replicationUlrich Dirnagl
 

What's hot (20)

Minimally important differences
Minimally important differencesMinimally important differences
Minimally important differences
 
Statistics basics
Statistics basicsStatistics basics
Statistics basics
 
Numbers needed to mislead
Numbers needed to misleadNumbers needed to mislead
Numbers needed to mislead
 
Experimental design part 4 groups
Experimental design part 4 groupsExperimental design part 4 groups
Experimental design part 4 groups
 
What is the importance of calculating sample size?
What is the importance of calculating sample size?What is the importance of calculating sample size?
What is the importance of calculating sample size?
 
Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?
 
What is your question
What is your questionWhat is your question
What is your question
 
Personalised medicine a sceptical view
Personalised medicine a sceptical viewPersonalised medicine a sceptical view
Personalised medicine a sceptical view
 
Experimental design part 1
Experimental design  part 1Experimental design  part 1
Experimental design part 1
 
NNTs, responder analysis & overlap measures
NNTs, responder analysis & overlap measuresNNTs, responder analysis & overlap measures
NNTs, responder analysis & overlap measures
 
Ocw Statistical Analysis
Ocw Statistical AnalysisOcw Statistical Analysis
Ocw Statistical Analysis
 
Experimental design part 2 measurements
Experimental design part 2 measurements Experimental design part 2 measurements
Experimental design part 2 measurements
 
Hypothesis testing in statistics
Hypothesis testing in statisticsHypothesis testing in statistics
Hypothesis testing in statistics
 
Bio statistical analysis in clinical research
Bio statistical analysis  in clinical research  Bio statistical analysis  in clinical research
Bio statistical analysis in clinical research
 
Experimental design part 3 controls
Experimental design part 3 controlsExperimental design part 3 controls
Experimental design part 3 controls
 
Sample size & meta analysis
Sample size & meta analysisSample size & meta analysis
Sample size & meta analysis
 
Eugm 2011 pocock - dm cs-and-adaptive-trials
Eugm 2011   pocock - dm cs-and-adaptive-trialsEugm 2011   pocock - dm cs-and-adaptive-trials
Eugm 2011 pocock - dm cs-and-adaptive-trials
 
Michael Festing - MedicReS World Congress 2011
Michael Festing - MedicReS World Congress 2011Michael Festing - MedicReS World Congress 2011
Michael Festing - MedicReS World Congress 2011
 
Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG research
 
Ruminations on replication
Ruminations on replicationRuminations on replication
Ruminations on replication
 

Viewers also liked

Logistic regression
Logistic regressionLogistic regression
Logistic regression緯鈞 沈
 
If you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or notIf you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or notKrzysztof Gorgolewski
 
MAD Konsep P value dan Confidence Interval
MAD Konsep P value dan Confidence IntervalMAD Konsep P value dan Confidence Interval
MAD Konsep P value dan Confidence IntervalNajMah Usman
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionsaba khan
 
Multiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IMultiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IJames Neill
 
Present and Past Modals of Deduction with Sherlock Holmes
Present and Past Modals of Deduction with Sherlock HolmesPresent and Past Modals of Deduction with Sherlock Holmes
Present and Past Modals of Deduction with Sherlock HolmesDavid Mainwood
 

Viewers also liked (8)

Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
If you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or notIf you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or not
 
MAD Konsep P value dan Confidence Interval
MAD Konsep P value dan Confidence IntervalMAD Konsep P value dan Confidence Interval
MAD Konsep P value dan Confidence Interval
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Multiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IMultiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA I
 
Two sample t-test
Two sample t-testTwo sample t-test
Two sample t-test
 
Logistic Regression Analysis
Logistic Regression AnalysisLogistic Regression Analysis
Logistic Regression Analysis
 
Present and Past Modals of Deduction with Sherlock Holmes
Present and Past Modals of Deduction with Sherlock HolmesPresent and Past Modals of Deduction with Sherlock Holmes
Present and Past Modals of Deduction with Sherlock Holmes
 

Similar to What does the p value really mean?

Research Sample size by Dr Allah Yar Malik
Research Sample size by Dr Allah Yar MalikResearch Sample size by Dr Allah Yar Malik
Research Sample size by Dr Allah Yar Malikhuraismalik
 
Common Statistical Concerns in Clinical Trials
Common Statistical Concerns in Clinical TrialsCommon Statistical Concerns in Clinical Trials
Common Statistical Concerns in Clinical TrialsClin Plus
 
P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...David Pratap
 
Introduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detailIntroduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detailShriramKargaonkar
 
Hypothesis testing, error and bias
Hypothesis testing, error and biasHypothesis testing, error and bias
Hypothesis testing, error and biasDr.Jatin Chhaya
 
Basics of Sample Size Estimation
Basics of Sample Size EstimationBasics of Sample Size Estimation
Basics of Sample Size EstimationMandar Baviskar
 
Statistics in clinical and translational research common pitfalls
Statistics in clinical and translational research  common pitfallsStatistics in clinical and translational research  common pitfalls
Statistics in clinical and translational research common pitfallsPavlos Msaouel, MD, PhD
 
Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2John Labrador
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxkarlhennesey
 
Understanding clinical trial's statistics
Understanding clinical trial's statisticsUnderstanding clinical trial's statistics
Understanding clinical trial's statisticsMagdy Khames Aly
 
Statistical significance vs Clinical significance
Statistical significance vs Clinical significanceStatistical significance vs Clinical significance
Statistical significance vs Clinical significanceVini Mehta
 
20 OCT-Hypothesis Testing.ppt
20 OCT-Hypothesis Testing.ppt20 OCT-Hypothesis Testing.ppt
20 OCT-Hypothesis Testing.pptShivraj Nile
 
Critical Appriaisal Skills Basic 1 | May 4th 2011
Critical Appriaisal Skills Basic 1 | May 4th 2011Critical Appriaisal Skills Basic 1 | May 4th 2011
Critical Appriaisal Skills Basic 1 | May 4th 2011NES
 
How to read a paper
How to read a paperHow to read a paper
How to read a paperfaheta
 
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...Musfera Nara Vadia
 
Hypothesis testing and p values 06
Hypothesis testing and p values  06Hypothesis testing and p values  06
Hypothesis testing and p values 06DrZahid Khan
 

Similar to What does the p value really mean? (20)

Research Sample size by Dr Allah Yar Malik
Research Sample size by Dr Allah Yar MalikResearch Sample size by Dr Allah Yar Malik
Research Sample size by Dr Allah Yar Malik
 
Common Statistical Concerns in Clinical Trials
Common Statistical Concerns in Clinical TrialsCommon Statistical Concerns in Clinical Trials
Common Statistical Concerns in Clinical Trials
 
P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...P-values the gold measure of statistical validity are not as reliable as many...
P-values the gold measure of statistical validity are not as reliable as many...
 
Introduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detailIntroduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detail
 
Hypothesis testing, error and bias
Hypothesis testing, error and biasHypothesis testing, error and bias
Hypothesis testing, error and bias
 
Basics of Sample Size Estimation
Basics of Sample Size EstimationBasics of Sample Size Estimation
Basics of Sample Size Estimation
 
Statistical significance
Statistical significanceStatistical significance
Statistical significance
 
Statistics in clinical and translational research common pitfalls
Statistics in clinical and translational research  common pitfallsStatistics in clinical and translational research  common pitfalls
Statistics in clinical and translational research common pitfalls
 
Berd 5-6
Berd 5-6Berd 5-6
Berd 5-6
 
Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2Hypothesis Testing. Inferential Statistics pt. 2
Hypothesis Testing. Inferential Statistics pt. 2
 
Brmedj00047 0038
Brmedj00047 0038Brmedj00047 0038
Brmedj00047 0038
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
 
Understanding clinical trial's statistics
Understanding clinical trial's statisticsUnderstanding clinical trial's statistics
Understanding clinical trial's statistics
 
Statistical significance vs Clinical significance
Statistical significance vs Clinical significanceStatistical significance vs Clinical significance
Statistical significance vs Clinical significance
 
20 OCT-Hypothesis Testing.ppt
20 OCT-Hypothesis Testing.ppt20 OCT-Hypothesis Testing.ppt
20 OCT-Hypothesis Testing.ppt
 
Critical Appriaisal Skills Basic 1 | May 4th 2011
Critical Appriaisal Skills Basic 1 | May 4th 2011Critical Appriaisal Skills Basic 1 | May 4th 2011
Critical Appriaisal Skills Basic 1 | May 4th 2011
 
How to read a paper
How to read a paperHow to read a paper
How to read a paper
 
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
STATISTICS : Changing the way we do: Hypothesis testing, effect size, power, ...
 
Hypothesis testing and p values 06
Hypothesis testing and p values  06Hypothesis testing and p values  06
Hypothesis testing and p values 06
 
Hypothesis testing 1.0
Hypothesis testing 1.0Hypothesis testing 1.0
Hypothesis testing 1.0
 

Recently uploaded

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 

Recently uploaded (20)

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 

What does the p value really mean?

  • 1. ISSN 1806-3713© 2015 Sociedade Brasileira de Pneumologia e Tisiologia http://dx.doi.org/10.1590/S1806-37132015000000215 J Bras Pneumol. 2015;41(5):485-485 Continuing Education: Scientific Methodology 485 What does the p value really mean? Juliana Carvalho Ferreira1,3 , Cecilia Maria Patino2,3 Why calculate a p value? Consider an experiment in which 10 subjects receive a placebo, and another 10 receive an experimental diuretic. After 8 h, the average urine output in the placebo group is 769 mL, versus 814 mL in the diuretic group—a difference of 45 mL (Figure 1). How do we know if that difference means the drug works and is not just a result of chance? of 45 mL in the average urine output between groups under the null hypothesis. Because this is a very small probability, we reject the null hypothesis. It does not mean that the drug is a diuretic, nor that there is 97% chance of the drug being a diuretic. Misconceptions about the p value Clinical versus statistical significance of the effect size There is a misconception that a very small p value means the difference between groups is highly relevant. Looking at the p value alone deviates our attention from the effect size. In our example, the p value is significant but a drug that increases urine output by 45 mL has no clinical relevance. Nonsignificant p values Another misconception is that if the p value is greater than 5%, the new treatment has no effect. The p value indicates the probability of observing a difference as large or larger than what was observed, under the null hypothesis. But if the new treatment has an effect of smaller size, a study with a small sample may be underpowered to detect it. Overinterpreting a nonsignificant p value that is close to 5% Yet another misconception is that if the p value is close to 5%, there is a trend towards a group difference. It is inappropriate to interpret a p value of, say, 0.06, as a trend towards a difference. A p value of 0.06 means that there is a probability of 6% of obtaining that result by chance when the treatment has no real effect. Because we set the significance level at 5%, the null hypothesis should not be rejected. Effect sizes versus p values Many researchers believe that the p value is the most important number to report. However, we should focus on the effect size. Avoid reporting the p value alone and preferably report the mean values for each group, the difference, and the 95% confidence interval—then the p value. Recommended Literature 1. Glantz SA. Primer in Biostatistics, 5th ed. New York: McGraw-Hill; 2002. Figure 1. Urine output (mL) for each subject in the placebo (squares) and new drug groups (diamonds). 1000 900 800 700 600 Placebo New drug Placebo New drug 1. Divisão de Pneumologia, Instituto do Coração – InCor – Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brasil. 2. Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA. 3. Methods in Epidemiologic, Clinical and Operations Research–MECOR–program, American Thoracic Society/Asociación Latinoamericana del Tórax. The most common way to approach this problem is to use statistical hypothesis testing. First, we state the null hypothesis of no statistical difference between the groups and the alternative hypothesis of a statistical difference. Then we select a statistical test to compute a test statistic, which is a standardized numerical measure of the between-group difference. Under the null hypothesis, we expect the test statistic value to be small, but there is a small probability that it is large, just by chance. Once we calculate the test statistic, we use it to calculate the p-value. The p value is defined as the probability of observing the given value of the test statistic, or greater, under the null hypothesis. Traditionally, the cut-off value to reject the null hypothesis is 0.05, which means that when no difference exists, such an extreme value for the test statistic is expected less than 5% of the time. Now let us go back to our case: we are comparing means and assuming that the data is normally distributed, so we use a t-test and compute a t-statistic of 2.34, with a p value of 0.031. Because we use a 0.05 cutoff for the p value, we reject the null hypothesis and conclude that there is a statistically significant difference between groups. So what does “p = 0.031” mean? It means that there is only a 3% probability of observing a difference