SlideShare a Scribd company logo
statspecialist.com http://www.statspecialist.com/blog/a-p-value-what-does-it-mean-really/ 
A p-value; what does it mean really? 
Statistics 
Specialist 
On several occasions I have been approached by students and researchers to “help 
them get a p-value for their results”. They believed that the purpose of an analysis was 
to dig this value from the data otherwise the analysis would be incomplete. 
Sir Ronald Fisher, a British statistician and geneticist, introduced the p-value in 1925. 
This was around the time when he was developing computational algorithms for 
analyzing data from his balanced experimental designs. He wrapped up his work in his 
first book, Statistical Methods for Research Workers. The book went through many editions and translation over 
time, and later became the standard reference work for scientists in many disciplines. 
At the time he assumed 0.05 as a reference point for rejecting a null hypothesis but not as a sharp cut off. 
Fisher’s philosophy of significance testing interpreted the p-value as a measure of evidence from a single 
experiment. As a measure of evidence, the p-value was meant to be combined with other sources of information. 
Thus, there was no set threshold for “significance” (Fisher, 1973). 
P-values have since been widely misunderstood in a lot of circles where it is reported. Goodman’s article on the 
misinterpretation of p-values lists some misconceptions. In brief the article puts across that a p-value of say 0.05 
does not mean that: there is only a 5% chance that the null hypothesis is true, there is a 5% chance of a Type I 
error (i.e. false positive), there is a 95% chance that the results would replicate if the study were repeated, there 
is no difference between groups or that you have proved your experimental hypothesis. 
A p-value should be interpreted as: The probability of getting the results you have observed or more 
extreme results given that the null hypothesis is true. This might still not be clear, so let’s have usual coin 
toss examples frequent in introduction to probability lessons. 
Suppose we toss a fair coin 20 times and observe the number of heads that come up, we would expect to obtain 
10 heads in our experiment. This is because, for a fair coin, the probability of turning heads is 0.5 and so the 
expected number of heads will be 20*0.5=10. 
Now let’s experiment with a coin with an unknown probability of turning heads. Our aim in the experiment is to 
quantify the evidence against our null hypothesis that the coin is fair. In our experiment the coin lands heads on 
16 out of 20 tosses. 
How do we interpret this result? Is it unusual given that we were expecting about 10 heads? Let’s calculate a p-value. 
Remember that a p-value was the probability of getting the observed results (16 heads) or more extreme 
results (17, 18, 19, or 20 heads) if our null hypothesis is true- the coin is fair. Considering each toss as a Bernoulli 
experiment we can easily obtain the probability that in 20 trials we get x (x=16,.., 20) number of heads using the 
binomial function: 
Where is the probability of success in each trial. 
The p-value obtained is 0.0059. This could mean that an unlikely event occurred i.e. a fair coin landing 
heads 16 times or that the coin is not fair! However, the p-value does not tell us which is which. Many people 
conclude that such an unlikely event suggests that the coin is not fair; rejecting the null hypothesis that the coin is 
fair, but do not recognize that there is a second possibility in the circumstance. So, I have heard statements like 
‘the p-value was <0.05 which proves that the null hypothesis is true’ or ‘the p-value was <0.05, therefore, we 
accept the null hypothesis’. This is where the misinterpretation comes.
What are your views or questions on the p-value?

More Related Content

What's hot

Theoretical Probability
Theoretical ProbabilityTheoretical Probability
Theoretical Probability
Lindsey Brown
 
Probability Overview
Probability OverviewProbability Overview
Probability Overview
mmeddin
 
PROBABILITY AND IT'S TYPES WITH RULES
PROBABILITY AND IT'S TYPES WITH RULESPROBABILITY AND IT'S TYPES WITH RULES
PROBABILITY AND IT'S TYPES WITH RULES
Bhargavi Bhanu
 

What's hot (20)

Theoretical Probability
Theoretical ProbabilityTheoretical Probability
Theoretical Probability
 
Probability Overview
Probability OverviewProbability Overview
Probability Overview
 
Probability
ProbabilityProbability
Probability
 
Basic probability concept
Basic probability conceptBasic probability concept
Basic probability concept
 
Basic concept of probability
Basic concept of probabilityBasic concept of probability
Basic concept of probability
 
different kinds of probability
different kinds of probabilitydifferent kinds of probability
different kinds of probability
 
PROBABILITY
PROBABILITYPROBABILITY
PROBABILITY
 
Probability
ProbabilityProbability
Probability
 
probability
probabilityprobability
probability
 
Nossi ch 10
Nossi ch 10Nossi ch 10
Nossi ch 10
 
Probability
ProbabilityProbability
Probability
 
Achint probability powerpoint
Achint probability powerpointAchint probability powerpoint
Achint probability powerpoint
 
PROBABILITY AND IT'S TYPES WITH RULES
PROBABILITY AND IT'S TYPES WITH RULESPROBABILITY AND IT'S TYPES WITH RULES
PROBABILITY AND IT'S TYPES WITH RULES
 
Probability 10th class
Probability 10th classProbability 10th class
Probability 10th class
 
Probability theory
Probability theory Probability theory
Probability theory
 
probability
probabilityprobability
probability
 
Probability
ProbabilityProbability
Probability
 
Probability
ProbabilityProbability
Probability
 
Basic concepts of probability
Basic concepts of probability Basic concepts of probability
Basic concepts of probability
 
Probability
ProbabilityProbability
Probability
 

Similar to A p-value: what does it mean, really?

Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01
mandrewmartin
 
35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx
35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx
35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx
rhetttrevannion
 
35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx
35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx
35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx
rhetttrevannion
 
35812 Topic discussion1Number of Pages 1 (Double Spaced).docx
35812 Topic discussion1Number of Pages 1 (Double Spaced).docx35812 Topic discussion1Number of Pages 1 (Double Spaced).docx
35812 Topic discussion1Number of Pages 1 (Double Spaced).docx
rhetttrevannion
 
El espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiplesEl espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiples
afgallegos1997
 

Similar to A p-value: what does it mean, really? (20)

P value part 1
P value part 1P value part 1
P value part 1
 
Reporting Results of Statistical Analysis
Reporting Results of Statistical Analysis Reporting Results of Statistical Analysis
Reporting Results of Statistical Analysis
 
Ncert class-10-mathematics-chapter-15
Ncert class-10-mathematics-chapter-15Ncert class-10-mathematics-chapter-15
Ncert class-10-mathematics-chapter-15
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Stats chapter 11
Stats chapter 11Stats chapter 11
Stats chapter 11
 
Probability in daily life
Probability in daily lifeProbability in daily life
Probability in daily life
 
Test of significance
Test of significanceTest of significance
Test of significance
 
Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01
 
35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx
35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx
35818 Topic Discussion7Number of Pages 1 (Double Spaced).docx
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx
35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx
35819 Topic Discussion8Number of Pages 1 (Double Spaced).docx
 
Stats chapter 11
Stats chapter 11Stats chapter 11
Stats chapter 11
 
vinayjoshi-131204045346-phpapp02.pdf
vinayjoshi-131204045346-phpapp02.pdfvinayjoshi-131204045346-phpapp02.pdf
vinayjoshi-131204045346-phpapp02.pdf
 
Null hypothesis AND ALTERNAT HYPOTHESIS
Null hypothesis AND ALTERNAT HYPOTHESISNull hypothesis AND ALTERNAT HYPOTHESIS
Null hypothesis AND ALTERNAT HYPOTHESIS
 
35812 Topic discussion1Number of Pages 1 (Double Spaced).docx
35812 Topic discussion1Number of Pages 1 (Double Spaced).docx35812 Topic discussion1Number of Pages 1 (Double Spaced).docx
35812 Topic discussion1Number of Pages 1 (Double Spaced).docx
 
Fundamentals Probability 08072009
Fundamentals Probability 08072009Fundamentals Probability 08072009
Fundamentals Probability 08072009
 
More Statistics
More StatisticsMore Statistics
More Statistics
 
Probability theory
Probability theoryProbability theory
Probability theory
 
El espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiplesEl espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiples
 
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
 

Recently uploaded

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Domenico Conte
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 

Recently uploaded (20)

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 

A p-value: what does it mean, really?

  • 1. statspecialist.com http://www.statspecialist.com/blog/a-p-value-what-does-it-mean-really/ A p-value; what does it mean really? Statistics Specialist On several occasions I have been approached by students and researchers to “help them get a p-value for their results”. They believed that the purpose of an analysis was to dig this value from the data otherwise the analysis would be incomplete. Sir Ronald Fisher, a British statistician and geneticist, introduced the p-value in 1925. This was around the time when he was developing computational algorithms for analyzing data from his balanced experimental designs. He wrapped up his work in his first book, Statistical Methods for Research Workers. The book went through many editions and translation over time, and later became the standard reference work for scientists in many disciplines. At the time he assumed 0.05 as a reference point for rejecting a null hypothesis but not as a sharp cut off. Fisher’s philosophy of significance testing interpreted the p-value as a measure of evidence from a single experiment. As a measure of evidence, the p-value was meant to be combined with other sources of information. Thus, there was no set threshold for “significance” (Fisher, 1973). P-values have since been widely misunderstood in a lot of circles where it is reported. Goodman’s article on the misinterpretation of p-values lists some misconceptions. In brief the article puts across that a p-value of say 0.05 does not mean that: there is only a 5% chance that the null hypothesis is true, there is a 5% chance of a Type I error (i.e. false positive), there is a 95% chance that the results would replicate if the study were repeated, there is no difference between groups or that you have proved your experimental hypothesis. A p-value should be interpreted as: The probability of getting the results you have observed or more extreme results given that the null hypothesis is true. This might still not be clear, so let’s have usual coin toss examples frequent in introduction to probability lessons. Suppose we toss a fair coin 20 times and observe the number of heads that come up, we would expect to obtain 10 heads in our experiment. This is because, for a fair coin, the probability of turning heads is 0.5 and so the expected number of heads will be 20*0.5=10. Now let’s experiment with a coin with an unknown probability of turning heads. Our aim in the experiment is to quantify the evidence against our null hypothesis that the coin is fair. In our experiment the coin lands heads on 16 out of 20 tosses. How do we interpret this result? Is it unusual given that we were expecting about 10 heads? Let’s calculate a p-value. Remember that a p-value was the probability of getting the observed results (16 heads) or more extreme results (17, 18, 19, or 20 heads) if our null hypothesis is true- the coin is fair. Considering each toss as a Bernoulli experiment we can easily obtain the probability that in 20 trials we get x (x=16,.., 20) number of heads using the binomial function: Where is the probability of success in each trial. The p-value obtained is 0.0059. This could mean that an unlikely event occurred i.e. a fair coin landing heads 16 times or that the coin is not fair! However, the p-value does not tell us which is which. Many people conclude that such an unlikely event suggests that the coin is not fair; rejecting the null hypothesis that the coin is fair, but do not recognize that there is a second possibility in the circumstance. So, I have heard statements like ‘the p-value was <0.05 which proves that the null hypothesis is true’ or ‘the p-value was <0.05, therefore, we accept the null hypothesis’. This is where the misinterpretation comes.
  • 2. What are your views or questions on the p-value?