SlideShare a Scribd company logo
1
Quantitative Science I
Hypothesis Testing
Hamdy F. F. Mahmoud, PhD
Collegiate Assistant Professor
Statistics Department @ VT
o Hypothesis testing for a population mean
o Steps of hypotheses testing:
▪ Null Hypothesis and Alternative Hypothesis
▪ Test statistic
▪ P-value
▪ Conclusion
o Relationship between Confidence Interval and Hypothesis Testing
o Read Chapter 6.1,6.2
This lecture note covers
Statistical Inference
❑There are two common types of statistical inference:
▪ Confidence interval is used when your goal is to estimate a
population parameter.
▪ Tests of significance is used to assess evidence in the data
about some claim.
❑A test of significance is a formal procedure for comparing
observed data with a claim (also called a hypothesis) whose truth
we want to assess.
▪ The claim is a statement about a parameter, like the population
proportion p or the population mean µ.
❑ We express the results of a significance test in terms of a
probability, called the P-value, that measures how well the data and
the claim agree.
The Reasoning of Tests of Significance
❑ Assume that you have been told that the average grade in a certain
course is 60/100 (claimed value).
▪ You take a group of students taking that course and collect the
grades of all of them.
▪ You calculate the statistic: sample mean and obtain ഥX = 90/100.
This looks like a high grade!!!
▪ We see that ഥX > 60. We would like to know just how certain we can
be that μ > 60.
▪ A confidence interval is not quite what we need. For example, if
we construct the CI that, with 95% CI, μ is between [58, 90]. It
does not directly tell us how confident we can be that μ > 60 and
how strong the evidence against the claim.
Tests of Significance
▪ Our aim will be to infer µ, the value of the mean for the
population.
▪ We are going to start with a very unrealistic situation:
assuming we know 𝜎, the standard deviation of the
distribution for the population.
Steps in Significance Tests
1. State the null and alternative hypothesis.
2. Calculate a test statistic to measure the compatibility between
the null hypothesis and the data.
‐ Test statistic =
estimate from data − 𝑡ℎ𝑒 𝑐𝑙𝑎𝑖𝑚𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑢𝑛𝑑𝑒𝑟𝐻0
𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒
3. Calculate the probability of the estimate (the statistic you
measured) under the null hypothesis - P-value.
4. State a conclusion regarding evidence against the null
hypothesis.
Step 1: Null and alternative hypotheses
▪ The null hypothesis is the claim which is initially
favored or believed to be true. Often default or
uninteresting situation of “no effect” or “no difference”.
▪ THEN, we usually need to determine if there is strong
enough evidence against it.
▪ The test of significance is designed to assess the
strength of the evidence against the null hypothesis.
Back to our motivating example
Claimed value = 60/100, actually obtain ഥ𝒙 = 90/100.
1) Assuming that μ=60, is it just a rare case?
2) How rare is it? Is there some evidence that maybe the
average grade is greater than 60?
▪ The statement being tested is that the mean of the population
(the value of the parameter µ) is 60 – Null Hypothesis, 𝐇 𝟎.
‐ The test of significance is designed to assess the strength
of evidence against the null hypothesis.
▪ The alternate statement is that the mean of the population
(the value of the parameter µ) is > 60 – Alternative
Hypothesis, 𝐇 𝒂.
‐ The test of significance is designed to assess the strength
of evidence to support the alternative hypothesis.
Practice on null and alternative hypotheses formulation
Specifications for a water pipe call for a mean breaking strength μ
of more than 2000 lb per linear foot. Engineers will perform a
hypothesis test to decide whether to use a certain kind of pipe.
They will select a random sample of 1 ft sections of pipe, measure
their breaking strengths, and perform a hypothesis test. The pipe
will not be used unless the engineers can conclude that μ > 2000.
▪ How to set up the null hypothesis and the alternative hypothesis?
H0: μ = 60 vs. Ha: μ <60
Suspect the average grade is lower. One-sided Ha.
H0: μ = 60 vs. Ha: μ >60
Suspect the average grade is higher. One-sided Ha.
H0: μ =60 vs. Ha: μ ≠60
Suspect the average grade is different. Two-sided Ha.
Note:
you must decide on the setting, based on general knowledge,
before you see the data or other measurements.
Hypotheses Possibilities
The Basic Idea
Every time we perform a hypothesis test, this is the basic
procedure that we will follow:
1.We'll make an initial assumption about the population
parameter.
2.We'll collect evidence or else use somebody else's
evidence (in either case, our evidence will come in the
form of data).
3.Based on the available evidence (data), we'll decide
whether to "reject" or "not reject" our initial assumption.
Step 2: Test Statistic: Z Test for 𝛍
▪ We want to test whether we have evidence that the
mean of the population has a certain value μ0.
H0: 𝜇 = 𝜇0
▪ From the data (sample size n) we measure the sample
mean ത𝑋.
Z = Test Statistic =
ത𝑋 − 𝜇0
𝜎
√𝑛
Based on the CLT, ഥ𝒙 comes from a distribution N(µ0,
𝝈
𝒏
)
We know that under 𝑯 𝟎 the mean value for the population is µ0.
Step 3: P-value
❑In performing a hypothesis test, we
essentially put the null hypothesis
on trial. We begin by assuming that
H0 is true, just as we begin a trial by
assuming a defendant to be
innocent.
❑The hypothesis test involves
measuring the strength of the
disagreement between the sample
and H0 to produce a number
between 0 and 1, called a P value.
❑P-value is a probability, computed
assuming that H0 is true, that the
test statistics would take as
extreme or more extreme values as
the one actually observed.
More about P-value…
When the P-value is small, there are 2 choices:
1. The null hypothesis is true, and our observed effect is
extremely rare!
OR more likely…
2. The null hypothesis is false, and our data is telling us this
by the small P-value!
Significance Level
▪ We need a cut-off point (decisive value) that we can compare our
P-value to and draw a conclusion or make a decision. In other
words, how much evidence do we need to reject H0 ?
▪ This cut-off point is the significance level. It is announced in
advance and serves as a standard on how much evidence against
H0 we need to reject H0. Usually denoted α.
▪ Typical values of α: 0.05, 0.01.
▪ If not stated otherwise, assume α=0.05.
Step 4: The conclusion/decision
▪ If the P-value is smaller than a fixed significance level α, then
we reject the null hypothesis (in favor of the alternative).
▪ Otherwise we don’t have enough evidence to reject the null.
‐ If we don’t reject the null, do we accept it?
▪ Note: Should always report a P-value with your conclusion
and write the conclusion in terms of the problem.
Step 4: The conclusion/decision
Statistical Significance
The final step in performing a significance test is to draw a
conclusion ―reject H0 or fail to reject H0.
▪ If our sample result is too unlikely to have happened by
chance assuming H0 is true, then we will reject H0.
▪ Otherwise, we will fail to reject H0.
• Note: A fail-to-reject H0 decision in a significance test
does not mean that H0 is true. For that reason, you
should never “accept H0” or use language implying
that you believe H0 is true.
Why “fail to reject” H0 vs. “accept” H0?
❑ 𝐻0 Hypothesis: There are NO racoons in the backyard.
• Observation 1: I randomly go out and do not see racoons.
• Conclusion: 𝑯 𝟎 hypothesis “seems” to be correct now.
• Observation 2 at a later time: I see racoons in the yard...
• Conclusion: 𝑯 𝟎 hypothesis is incorrect!!!
Why not “accept null hypothesis”?
Can NOT “prove truth”, only “disprove truth”
▪ We fail to reject 𝐻0 Hypothesis based on Observation 1 may be
DUE to bad sample or small sample size.
▪ Only rejection is significant, that is, if reject 𝐻0, we have
significant conclusion that 𝝁 = 𝝁 𝟎 is untrue.
Tests for a Population Mean
Example 1: [Two-sided test]
• A scale is to be calibrated by weighing a 1000 g test weight 60 times.
The 60 scale readings have mean 1000.6 g and standard deviation 2 g.
• Find the P-value for testing 𝐻0: μ = 1000 versus 𝐻1 : μ ≠ 1000.
1000 1000.6
2.32
-2.32
Example 2 [One-sided Test]
▪ The article “Wear in Boundary Lubrication” (S. Hsu, R. Munro, and M.
Shen, Journal of Engineering Tribology, 2002:427–441) discusses
several experiments involving various lubricants. In one experiment, 45
steel balls lubricated with purified paraffin were subjected to a 40 kg
load at 600 rpm for 60 minutes. The average wear, measured by the
reduction in diameter, was 673.2 μm, and the standard deviation was
14.9 μm. Assume that the specification for a lubricant is that the mean
wear be less than 675 μm.
▪ Find the P-value for the testing 𝐻0 : μ ≥ 675 versus 𝐻1 : μ < 675.
Tests for a Population Mean
One-sided vs. two-sided
▪ If, based on previous data or experience, we expect “increase”,
“more”, “better”, etc. (“decrease”, “less”, “worse”, etc.), then
we can use a one-sided test.
▪ Otherwise, by default, we use two-sided. Key words:
“different”, “departures”, “changed”…
The Relationship between Hypothesis Tests and Confidence Interval
❑In a hypothesis test for a population mean μ, we specify a
particular value of μ (the null hypothesis) and determine
whether that value is plausible.
❑In contrast, a confidence interval for a population mean μ
can be thought of as the collection of all values for μ that
meet a certain criterion of plausibility, specified by the
confidence level 100(1 − α)%.
A level α two-sided significance test rejects H0: µ=µ0 exactly when
µ0 falls outside a level 1- α confidence interval for µ.
Conclusions after using a Confidence Interval to do a Hypothesis Testing
Claimed value from null hypothesis fits
inside the CI?
Yes No
Fail to reject H0. Reject H0.
Relationship between C.I. and H.T. – recall example 1
• A scale is to be calibrated by weighing a 1000 g test weight 60
times. The 60 scale readings have mean 1000.6 g and standard
deviation 2 g.
• Find the 90% C.I. for the mean weight of the scale readings.
C=90% → z*=1.645
margin of error = 1.645×
2
√60
= 0.425
C.I. = (1000.6-0.425, 1000.6+0.425) = (1000.175, 1001.025).
• At 𝜶 = 𝟎. 𝟏, since 𝜇0 = 1000 is outside the above C.I. We
reject H0. We have significant evidence that the population
mean is different from 1000 minutes.
Choosing the level of significance
• α=0.05 is accepted standard, but…
• if the conclusion that Ha is true has “costly” implications,
smaller α may be appropriate
• not always need to make a decision: describing the evidence by
P-value may be enough
• no sharp border between statistically significant and
insignificant
Statistical vs. practical significance
• Statistically significant effect may be small:
Example (“Executive” blood pressure):
• µ0 = 128
• σ = 15
• n = 1000 obs.
• sample mean = 127
‐ Z = (127-128)/ (15/sqrt(1000)) = -2.11
‐ P-value for two-sided Ha = 2*0.0174=0.0348
‐ Significant??
▪ Stat. significance is not necessarily practical significance.
▪ Outliers may produce or destroy statistical significance.

More Related Content

What's hot

hypothesis testing-tests of proportions and variances in six sigma
hypothesis testing-tests of proportions and variances in six sigmahypothesis testing-tests of proportions and variances in six sigma
hypothesis testing-tests of proportions and variances in six sigma
vdheerajk
 
Introduction to Maximum Likelihood Estimator
Introduction to Maximum Likelihood EstimatorIntroduction to Maximum Likelihood Estimator
Introduction to Maximum Likelihood Estimator
Amir Al-Ansary
 
Hypothesis testing ppt final
Hypothesis testing ppt finalHypothesis testing ppt final
Hypothesis testing ppt final
piyushdhaker
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
CarlAchilles28
 
Lecture2 hypothesis testing
Lecture2 hypothesis testingLecture2 hypothesis testing
Lecture2 hypothesis testing
o_devinyak
 
Research method ch07 statistical methods 1
Research method ch07 statistical methods 1Research method ch07 statistical methods 1
Research method ch07 statistical methods 1
naranbatn
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
Shameer P Hamsa
 
Confidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overviewConfidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overview
Rizwan S A
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
Ken Plummer
 
TOPIC- HYPOTHESIS TESTING RMS.pptx
TOPIC- HYPOTHESIS TESTING  RMS.pptxTOPIC- HYPOTHESIS TESTING  RMS.pptx
TOPIC- HYPOTHESIS TESTING RMS.pptx
anamikamishra29
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
kellula
 
8.1
8.18.1
Simple linear regressionn and Correlation
Simple linear regressionn and CorrelationSimple linear regressionn and Correlation
Simple linear regressionn and Correlation
Southern Range, Berhampur, Odisha
 
The Method Of Maximum Likelihood
The Method Of Maximum LikelihoodThe Method Of Maximum Likelihood
The Method Of Maximum Likelihood
Max Chipulu
 
Statistical inference 2
Statistical inference 2Statistical inference 2
Statistical inference 2
safi Ullah
 
Normality tests
Normality testsNormality tests
Normality tests
Dr Lipilekha Patnaik
 
Maximum likelihood estimation
Maximum likelihood estimationMaximum likelihood estimation
Maximum likelihood estimation
zihad164
 
Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis
Vijay Hemmadi
 
Unit 4 Tests of Significance
Unit 4 Tests of SignificanceUnit 4 Tests of Significance
Unit 4 Tests of Significance
Rai University
 
Correlation
CorrelationCorrelation

What's hot (20)

hypothesis testing-tests of proportions and variances in six sigma
hypothesis testing-tests of proportions and variances in six sigmahypothesis testing-tests of proportions and variances in six sigma
hypothesis testing-tests of proportions and variances in six sigma
 
Introduction to Maximum Likelihood Estimator
Introduction to Maximum Likelihood EstimatorIntroduction to Maximum Likelihood Estimator
Introduction to Maximum Likelihood Estimator
 
Hypothesis testing ppt final
Hypothesis testing ppt finalHypothesis testing ppt final
Hypothesis testing ppt final
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Lecture2 hypothesis testing
Lecture2 hypothesis testingLecture2 hypothesis testing
Lecture2 hypothesis testing
 
Research method ch07 statistical methods 1
Research method ch07 statistical methods 1Research method ch07 statistical methods 1
Research method ch07 statistical methods 1
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Confidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overviewConfidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overview
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
TOPIC- HYPOTHESIS TESTING RMS.pptx
TOPIC- HYPOTHESIS TESTING  RMS.pptxTOPIC- HYPOTHESIS TESTING  RMS.pptx
TOPIC- HYPOTHESIS TESTING RMS.pptx
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
 
8.1
8.18.1
8.1
 
Simple linear regressionn and Correlation
Simple linear regressionn and CorrelationSimple linear regressionn and Correlation
Simple linear regressionn and Correlation
 
The Method Of Maximum Likelihood
The Method Of Maximum LikelihoodThe Method Of Maximum Likelihood
The Method Of Maximum Likelihood
 
Statistical inference 2
Statistical inference 2Statistical inference 2
Statistical inference 2
 
Normality tests
Normality testsNormality tests
Normality tests
 
Maximum likelihood estimation
Maximum likelihood estimationMaximum likelihood estimation
Maximum likelihood estimation
 
Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis
 
Unit 4 Tests of Significance
Unit 4 Tests of SignificanceUnit 4 Tests of Significance
Unit 4 Tests of Significance
 
Correlation
CorrelationCorrelation
Correlation
 

Similar to Test of hypotheses part i

Chapter 20 and 21 combined testing hypotheses about proportions 2013
Chapter 20 and 21 combined testing hypotheses about proportions 2013Chapter 20 and 21 combined testing hypotheses about proportions 2013
Chapter 20 and 21 combined testing hypotheses about proportions 2013
calculistictt
 
20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd
HimanshuSharma723273
 
Tests of significance
Tests of significanceTests of significance
Tests of significance
Shubhanshu Gupta
 
Hypothesis testing1
Hypothesis testing1Hypothesis testing1
Hypothesis testing1
HanaaBayomy
 
hypothesis test
 hypothesis test hypothesis test
hypothesis test
Unsa Shakir
 
Basics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for PharmacyBasics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for Pharmacy
Parag Shah
 
RESEARCH METHODS LESSON 3
RESEARCH METHODS LESSON 3RESEARCH METHODS LESSON 3
RESEARCH METHODS LESSON 3
DR. TIRIMBA IBRAHIM
 
7 hypothesis testing
7 hypothesis testing7 hypothesis testing
7 hypothesis testing
AASHISHSHRIVASTAV1
 
Review Z Test Ci 1
Review Z Test Ci 1Review Z Test Ci 1
Review Z Test Ci 1
shoffma5
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
Nirajan Bam
 
Hypothesis
HypothesisHypothesis
Hypothesis Testing.pptx
Hypothesis Testing.pptxHypothesis Testing.pptx
Hypothesis Testing.pptx
heencomm
 
Hypothesis
HypothesisHypothesis
Hypothesis
Dr. Priyanka Jain
 
Chapter 9 Fundamental of Hypothesis Testing.ppt
Chapter 9 Fundamental of Hypothesis Testing.pptChapter 9 Fundamental of Hypothesis Testing.ppt
Chapter 9 Fundamental of Hypothesis Testing.ppt
HasanGilani3
 
Hipotesis y muestreo estadístico
Hipotesis y muestreo estadísticoHipotesis y muestreo estadístico
Hipotesis y muestreo estadístico
Universidad Nacional Agraria de la Selva
 
Testing of hypothesis
Testing of hypothesisTesting of hypothesis
Testing of hypothesis
Sanjay Basukala
 
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
Professional Training Academy
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
Sathish Rajamani
 
Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)
Harve Abella
 
Hypothesis Testing Assignment Help
Hypothesis Testing Assignment HelpHypothesis Testing Assignment Help
Hypothesis Testing Assignment Help
HelpWithAssignment.com
 

Similar to Test of hypotheses part i (20)

Chapter 20 and 21 combined testing hypotheses about proportions 2013
Chapter 20 and 21 combined testing hypotheses about proportions 2013Chapter 20 and 21 combined testing hypotheses about proportions 2013
Chapter 20 and 21 combined testing hypotheses about proportions 2013
 
20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd
 
Tests of significance
Tests of significanceTests of significance
Tests of significance
 
Hypothesis testing1
Hypothesis testing1Hypothesis testing1
Hypothesis testing1
 
hypothesis test
 hypothesis test hypothesis test
hypothesis test
 
Basics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for PharmacyBasics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for Pharmacy
 
RESEARCH METHODS LESSON 3
RESEARCH METHODS LESSON 3RESEARCH METHODS LESSON 3
RESEARCH METHODS LESSON 3
 
7 hypothesis testing
7 hypothesis testing7 hypothesis testing
7 hypothesis testing
 
Review Z Test Ci 1
Review Z Test Ci 1Review Z Test Ci 1
Review Z Test Ci 1
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Hypothesis Testing.pptx
Hypothesis Testing.pptxHypothesis Testing.pptx
Hypothesis Testing.pptx
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Chapter 9 Fundamental of Hypothesis Testing.ppt
Chapter 9 Fundamental of Hypothesis Testing.pptChapter 9 Fundamental of Hypothesis Testing.ppt
Chapter 9 Fundamental of Hypothesis Testing.ppt
 
Hipotesis y muestreo estadístico
Hipotesis y muestreo estadísticoHipotesis y muestreo estadístico
Hipotesis y muestreo estadístico
 
Testing of hypothesis
Testing of hypothesisTesting of hypothesis
Testing of hypothesis
 
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
Risk Management - CH 7 - Hypothesis Tests and Confidence | CMT Level 3 | Char...
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)
 
Hypothesis Testing Assignment Help
Hypothesis Testing Assignment HelpHypothesis Testing Assignment Help
Hypothesis Testing Assignment Help
 

Recently uploaded

The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
Nguyen Thanh Tu Collection
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
imrankhan141184
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
giancarloi8888
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
Steve Thomason
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
National Information Standards Organization (NISO)
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Fajar Baskoro
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
Himanshu Rai
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
S. Raj Kumar
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) CurriculumPhilippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
MJDuyan
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
سمير بسيوني
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
iammrhaywood
 
BIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptx
BIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptxBIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptx
BIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptx
RidwanHassanYusuf
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
siemaillard
 

Recently uploaded (20)

The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) CurriculumPhilippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) Curriculum
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 
BIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptx
BIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptxBIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptx
BIOLOGY NATIONAL EXAMINATION COUNCIL (NECO) 2024 PRACTICAL MANUAL.pptx
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 

Test of hypotheses part i

  • 1. 1 Quantitative Science I Hypothesis Testing Hamdy F. F. Mahmoud, PhD Collegiate Assistant Professor Statistics Department @ VT
  • 2. o Hypothesis testing for a population mean o Steps of hypotheses testing: ▪ Null Hypothesis and Alternative Hypothesis ▪ Test statistic ▪ P-value ▪ Conclusion o Relationship between Confidence Interval and Hypothesis Testing o Read Chapter 6.1,6.2 This lecture note covers
  • 3. Statistical Inference ❑There are two common types of statistical inference: ▪ Confidence interval is used when your goal is to estimate a population parameter. ▪ Tests of significance is used to assess evidence in the data about some claim. ❑A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to assess. ▪ The claim is a statement about a parameter, like the population proportion p or the population mean µ. ❑ We express the results of a significance test in terms of a probability, called the P-value, that measures how well the data and the claim agree.
  • 4. The Reasoning of Tests of Significance ❑ Assume that you have been told that the average grade in a certain course is 60/100 (claimed value). ▪ You take a group of students taking that course and collect the grades of all of them. ▪ You calculate the statistic: sample mean and obtain ഥX = 90/100. This looks like a high grade!!! ▪ We see that ഥX > 60. We would like to know just how certain we can be that μ > 60. ▪ A confidence interval is not quite what we need. For example, if we construct the CI that, with 95% CI, μ is between [58, 90]. It does not directly tell us how confident we can be that μ > 60 and how strong the evidence against the claim.
  • 5. Tests of Significance ▪ Our aim will be to infer µ, the value of the mean for the population. ▪ We are going to start with a very unrealistic situation: assuming we know 𝜎, the standard deviation of the distribution for the population.
  • 6. Steps in Significance Tests 1. State the null and alternative hypothesis. 2. Calculate a test statistic to measure the compatibility between the null hypothesis and the data. ‐ Test statistic = estimate from data − 𝑡ℎ𝑒 𝑐𝑙𝑎𝑖𝑚𝑒𝑑 𝑣𝑎𝑙𝑢𝑒 𝑢𝑛𝑑𝑒𝑟𝐻0 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒 3. Calculate the probability of the estimate (the statistic you measured) under the null hypothesis - P-value. 4. State a conclusion regarding evidence against the null hypothesis.
  • 7. Step 1: Null and alternative hypotheses ▪ The null hypothesis is the claim which is initially favored or believed to be true. Often default or uninteresting situation of “no effect” or “no difference”. ▪ THEN, we usually need to determine if there is strong enough evidence against it. ▪ The test of significance is designed to assess the strength of the evidence against the null hypothesis.
  • 8. Back to our motivating example Claimed value = 60/100, actually obtain ഥ𝒙 = 90/100. 1) Assuming that μ=60, is it just a rare case? 2) How rare is it? Is there some evidence that maybe the average grade is greater than 60? ▪ The statement being tested is that the mean of the population (the value of the parameter µ) is 60 – Null Hypothesis, 𝐇 𝟎. ‐ The test of significance is designed to assess the strength of evidence against the null hypothesis. ▪ The alternate statement is that the mean of the population (the value of the parameter µ) is > 60 – Alternative Hypothesis, 𝐇 𝒂. ‐ The test of significance is designed to assess the strength of evidence to support the alternative hypothesis.
  • 9. Practice on null and alternative hypotheses formulation Specifications for a water pipe call for a mean breaking strength μ of more than 2000 lb per linear foot. Engineers will perform a hypothesis test to decide whether to use a certain kind of pipe. They will select a random sample of 1 ft sections of pipe, measure their breaking strengths, and perform a hypothesis test. The pipe will not be used unless the engineers can conclude that μ > 2000. ▪ How to set up the null hypothesis and the alternative hypothesis?
  • 10. H0: μ = 60 vs. Ha: μ <60 Suspect the average grade is lower. One-sided Ha. H0: μ = 60 vs. Ha: μ >60 Suspect the average grade is higher. One-sided Ha. H0: μ =60 vs. Ha: μ ≠60 Suspect the average grade is different. Two-sided Ha. Note: you must decide on the setting, based on general knowledge, before you see the data or other measurements. Hypotheses Possibilities
  • 11. The Basic Idea Every time we perform a hypothesis test, this is the basic procedure that we will follow: 1.We'll make an initial assumption about the population parameter. 2.We'll collect evidence or else use somebody else's evidence (in either case, our evidence will come in the form of data). 3.Based on the available evidence (data), we'll decide whether to "reject" or "not reject" our initial assumption.
  • 12. Step 2: Test Statistic: Z Test for 𝛍 ▪ We want to test whether we have evidence that the mean of the population has a certain value μ0. H0: 𝜇 = 𝜇0 ▪ From the data (sample size n) we measure the sample mean ത𝑋. Z = Test Statistic = ത𝑋 − 𝜇0 𝜎 √𝑛 Based on the CLT, ഥ𝒙 comes from a distribution N(µ0, 𝝈 𝒏 ) We know that under 𝑯 𝟎 the mean value for the population is µ0.
  • 13. Step 3: P-value ❑In performing a hypothesis test, we essentially put the null hypothesis on trial. We begin by assuming that H0 is true, just as we begin a trial by assuming a defendant to be innocent. ❑The hypothesis test involves measuring the strength of the disagreement between the sample and H0 to produce a number between 0 and 1, called a P value. ❑P-value is a probability, computed assuming that H0 is true, that the test statistics would take as extreme or more extreme values as the one actually observed.
  • 14. More about P-value… When the P-value is small, there are 2 choices: 1. The null hypothesis is true, and our observed effect is extremely rare! OR more likely… 2. The null hypothesis is false, and our data is telling us this by the small P-value!
  • 15. Significance Level ▪ We need a cut-off point (decisive value) that we can compare our P-value to and draw a conclusion or make a decision. In other words, how much evidence do we need to reject H0 ? ▪ This cut-off point is the significance level. It is announced in advance and serves as a standard on how much evidence against H0 we need to reject H0. Usually denoted α. ▪ Typical values of α: 0.05, 0.01. ▪ If not stated otherwise, assume α=0.05.
  • 16. Step 4: The conclusion/decision ▪ If the P-value is smaller than a fixed significance level α, then we reject the null hypothesis (in favor of the alternative). ▪ Otherwise we don’t have enough evidence to reject the null. ‐ If we don’t reject the null, do we accept it? ▪ Note: Should always report a P-value with your conclusion and write the conclusion in terms of the problem.
  • 17. Step 4: The conclusion/decision
  • 18. Statistical Significance The final step in performing a significance test is to draw a conclusion ―reject H0 or fail to reject H0. ▪ If our sample result is too unlikely to have happened by chance assuming H0 is true, then we will reject H0. ▪ Otherwise, we will fail to reject H0. • Note: A fail-to-reject H0 decision in a significance test does not mean that H0 is true. For that reason, you should never “accept H0” or use language implying that you believe H0 is true.
  • 19. Why “fail to reject” H0 vs. “accept” H0? ❑ 𝐻0 Hypothesis: There are NO racoons in the backyard. • Observation 1: I randomly go out and do not see racoons. • Conclusion: 𝑯 𝟎 hypothesis “seems” to be correct now. • Observation 2 at a later time: I see racoons in the yard... • Conclusion: 𝑯 𝟎 hypothesis is incorrect!!! Why not “accept null hypothesis”? Can NOT “prove truth”, only “disprove truth” ▪ We fail to reject 𝐻0 Hypothesis based on Observation 1 may be DUE to bad sample or small sample size. ▪ Only rejection is significant, that is, if reject 𝐻0, we have significant conclusion that 𝝁 = 𝝁 𝟎 is untrue.
  • 20.
  • 21. Tests for a Population Mean Example 1: [Two-sided test] • A scale is to be calibrated by weighing a 1000 g test weight 60 times. The 60 scale readings have mean 1000.6 g and standard deviation 2 g. • Find the P-value for testing 𝐻0: μ = 1000 versus 𝐻1 : μ ≠ 1000.
  • 23. Example 2 [One-sided Test] ▪ The article “Wear in Boundary Lubrication” (S. Hsu, R. Munro, and M. Shen, Journal of Engineering Tribology, 2002:427–441) discusses several experiments involving various lubricants. In one experiment, 45 steel balls lubricated with purified paraffin were subjected to a 40 kg load at 600 rpm for 60 minutes. The average wear, measured by the reduction in diameter, was 673.2 μm, and the standard deviation was 14.9 μm. Assume that the specification for a lubricant is that the mean wear be less than 675 μm. ▪ Find the P-value for the testing 𝐻0 : μ ≥ 675 versus 𝐻1 : μ < 675. Tests for a Population Mean
  • 24.
  • 25. One-sided vs. two-sided ▪ If, based on previous data or experience, we expect “increase”, “more”, “better”, etc. (“decrease”, “less”, “worse”, etc.), then we can use a one-sided test. ▪ Otherwise, by default, we use two-sided. Key words: “different”, “departures”, “changed”…
  • 26. The Relationship between Hypothesis Tests and Confidence Interval ❑In a hypothesis test for a population mean μ, we specify a particular value of μ (the null hypothesis) and determine whether that value is plausible. ❑In contrast, a confidence interval for a population mean μ can be thought of as the collection of all values for μ that meet a certain criterion of plausibility, specified by the confidence level 100(1 − α)%. A level α two-sided significance test rejects H0: µ=µ0 exactly when µ0 falls outside a level 1- α confidence interval for µ.
  • 27. Conclusions after using a Confidence Interval to do a Hypothesis Testing Claimed value from null hypothesis fits inside the CI? Yes No Fail to reject H0. Reject H0.
  • 28. Relationship between C.I. and H.T. – recall example 1 • A scale is to be calibrated by weighing a 1000 g test weight 60 times. The 60 scale readings have mean 1000.6 g and standard deviation 2 g. • Find the 90% C.I. for the mean weight of the scale readings. C=90% → z*=1.645 margin of error = 1.645× 2 √60 = 0.425 C.I. = (1000.6-0.425, 1000.6+0.425) = (1000.175, 1001.025). • At 𝜶 = 𝟎. 𝟏, since 𝜇0 = 1000 is outside the above C.I. We reject H0. We have significant evidence that the population mean is different from 1000 minutes.
  • 29. Choosing the level of significance • α=0.05 is accepted standard, but… • if the conclusion that Ha is true has “costly” implications, smaller α may be appropriate • not always need to make a decision: describing the evidence by P-value may be enough • no sharp border between statistically significant and insignificant
  • 30. Statistical vs. practical significance • Statistically significant effect may be small: Example (“Executive” blood pressure): • µ0 = 128 • σ = 15 • n = 1000 obs. • sample mean = 127 ‐ Z = (127-128)/ (15/sqrt(1000)) = -2.11 ‐ P-value for two-sided Ha = 2*0.0174=0.0348 ‐ Significant?? ▪ Stat. significance is not necessarily practical significance. ▪ Outliers may produce or destroy statistical significance.