SlideShare a Scribd company logo
Introduction to Probability and Statistics
                          12th Week (5/31)



     Hypothesis Testing (II)
중간고사
중간고사
Statistical Hypotheses and
                    Null Hypotheses
•Statistical hypotheses: Assumptions or guesses about the populations
 involved. (Such assumptions, which may or may not be true)


•Null hypotheses (H0): Hypothesis that there is no difference between the
 procedures. We formulate it if we want to decide whether one procedure is
 better than another.


•Alternative hypotheses (H1): Any hypothesis that differs from a given null
 hypothesis

Example 1. For example, if the null hypothesis is p = 0.5, possible
 alternative hypotheses are p =0.7, or p ≠ 0.5.
Table 7-2     Type I and Type II Errors
                                      True State of Nature
                                   The null           The null
                                 hypothesis is      hypothesis is
                                     true              false
              We decide to       Type I error
                                                       Correct
                reject the     (rejecting a true
                                                       decision
             null hypothesis   null hypothesis)
  Decision
                We fail to                           Type II error
                                   Correct
                reject the                         (failing to reject
                                   decision
             null hypothesis                          a false null
                                                      hypothesis)
P Value


•Small P values provide evidence for rejecting the null hypothesis in favor of
 the alternative hypothesis, and large P values provide evidence for not
 rejecting the null hypothesis in favor of the alternative hypothesis.

•The P value and the level of significance do not provide criteria for
 rejecting or not rejecting the null hypothesis by itself, but for rejecting or
 not rejecting the null hypothesis in favor of the alternative hypothesis.

• When the test statistic S is the standard normal random variable, the
 table in Appendix C is sufficient to compute the P value, but when S is one
 of the t, F, or chi-square random variables, all of which have different
 distributions depending on their degrees of freedom, either computer
 software or more extensive tables will be needed to compute the P value.
Concepts of Hypothesis Testing…
    For example, if we’re trying to decide whether the mean is
    not equal to 350, a large value of (say, 600) would provide
    enough evidence.

    If is close to 350 (say, 355) we could not say that this
    provides a great deal of evidence to infer that the population
    mean is different than 350.




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.7
Concepts of Hypothesis Testing (4)…
    The two possible decisions that can be made:

    Conclude that there is enough evidence to support the alternative
    hypothesis
    (also stated as: reject the null hypothesis in favor of the alternative)

    Conclude that there is not enough evidence to support the
    alternative hypothesis
    (also stated as: failing to reject the null hypothesis in favor of the
    alternative)
    NOTE: we do not say that we accept the null hypothesis if a
    statistician is around…


Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.           11.8
Concepts of Hypothesis Testing (2)…
    The testing procedure begins with the assumption that the
    null hypothesis is true.

    Thus, until we have further statistical evidence, we will
    assume:

                      H0: = 350 (assumed to be TRUE)
    The next step will be to determine the sampling distribution
    of the sample mean assuming the true mean is 350.
         is normal with              350
                                           75/SQRT(25) = 15
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.9
Is the Sample Mean in the Guts of the Sampling Distribution??




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.10
Three ways to determine this: First way
    1. Unstandardized test statistic: Is in the guts of the
       sampling distribution? Depends on what you define as
       the “guts” of the sampling distribution.

    If we define the guts as the center 95% of the distribution
        [this means α = 0.05], then the critical values that define
        the guts will be 1.96 standard deviations of X-Bar on
        either side of the mean of the sampling distribution
        [350], or
        UCV = 350 + 1.96*15 = 350 + 29.4 = 379.4
        LCV = 350 – 1.96*15 = 350 – 29.4 = 320.6

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.11
1. Unstandardized Test Statistic Approach




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.12
Three ways to determine this: Second way
    2. Standardized test statistic: Since we defined the “guts” of
    the sampling distribution to be the center 95% [α = 0.05],
           If the Z-Score for the sample mean is greater than
    1.96, we know that will be in the reject region on the right
    side or
            If the Z-Score for the sample mean is less than -1.97,
    we know that will be in the reject region on the left side.

    Z=(                   -           )/             = (370.16 – 350)/15 = 1.344

    Is this Z-Score in the guts of the sampling distribution???
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.                 11.13
2. Standardized Test Statistic Approach




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.14
Three ways to determine this: Third way
    3. The p-value approach (which is generally used with a computer and
    statistical software): Increase the “Rejection Region” until it
    “captures” the sample mean.

    For this example, since is to the right of the mean, calculate
     P( > 370.16) = P(Z > 1.344) = 0.0901
    Since this is a two tailed test, you must double this area for the p-value.
            p-value = 2*(0.0901) = 0.1802
    Since we defined the guts as the center 95% [α = 0.05], the reject
    region is the other 5%. Since our sample mean, , is in the 18.02%
    region, it cannot be in our 5% rejection region [α = 0.05].



Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.         11.15
3. p-value approach




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.16
Statistical Conclusions:
    Unstandardized Test Statistic:
       Since LCV (320.6) < (370.16) < UCV (379.4), we
       reject the null hypothesis at a 5% level of significance.

    Standardized Test Statistic:
       Since -Zα/2(-1.96) < Z(1.344) < Zα/2 (1.96), we fail to reject
       the null hypothesis at a 5% level of significance.

    P-value:
       Since p-value (0.1802) > 0.05 [α], we fail to reject the
       hull hypothesis at a 5% level of significance.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.17
Example 11.1…
    A department store manager determines that a new billing
    system will be cost-effective only if the mean monthly
    account is more than $170.

    A random sample of 400 monthly accounts is drawn, for
    which the sample mean is $178. The accounts are
    approximately normally distributed with a standard deviation
    of $65.

     Can we conclude that the new system will be cost-effective?


Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.18
Example 11.1…
    The system will be cost effective if the mean account
    balance for all customers is greater than $170.

    We express this belief as a our research hypothesis, that is:

                      H1: > 170 (this is what we want to determine)


    Thus, our null hypothesis becomes:

         H0: = 170 (this specifies a single value for the
    parameter of interest) – Actually H0: μ < 170
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.    11.19
Example 11.1…
    What we want to show:
         H1: > 170
                      H0: < 170 (we’ll assume this is true)
    Normally we put Ho first.
    We know:
         n = 400,
           = 178, and
           = 65
           = 65/SQRT(400) = 3.25
         α = 0.05
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.20
Example 11.1… Rejection Region…
    The rejection region is a range of values such that if the test
    statistic falls into that range, we decide to reject the null
    hypothesis in favor of the alternative hypothesis.




                               is the critical value of              to reject H0.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.                   11.21
Example 11.1…
    At a 5% significance level (i.e.                                 =0.05), we get [all α in one tail]
                      Zα = Z0.05 = 1.645
    Therefore, UCV = 170 + 1.645*3.25 = 175.35
    Since our sample mean (178) is greater than the critical value we
    calculated (175.35), we reject the null hypothesis in favor of H1
    OR
                                                                                  (>1.645) Reject null

    OR
    p-value = P(                          > 178) = P(Z > 2.46) = 0.0069 < 0.05 Reject null



Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.                                        11.22
Example 11.1… The Big Picture…




          H1: > 170                                                  =175.34
          H0: = 170
                                                                               =178
                   Reject H0 in favor of
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.                    11.23
Conclusions of a Test of Hypothesis…
    If we reject the null hypothesis, we conclude that there is
    enough evidence to infer that the alternative hypothesis is
    true.

    If we fail to reject the null hypothesis, we conclude that there
    is not enough statistical evidence to infer that the alternative
    hypothesis is true. This does not mean that we have proven
    that the null hypothesis is true!

    Keep in mind that committing a Type I error OR a Type II
    error can be VERY bad depending on the problem.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.24
One tail test with rejection region on right
    The last example was a one tail test, because the rejection
    region is located in only one tail of the sampling distribution:




    More correctly, this was an example of a right tail test.
         H1: μ > 170
                      H0: μ < 170

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.25
One tail test with rejection region on left
    The rejection region will be in the left tail.




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.26
Two tail test with rejection region in both tails
    The rejection region is split equally between the two tails.




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.27
Example 11.2… Students work
    AT&T’s argues that its rates are such that customers won’t
    see a difference in their phone bills between them and their
    competitors. They calculate the mean and standard deviation
    for all their customers at $17.09 and $3.87 (respectively).
    Note: Don’t know the true value for σ, so we estimate σ from
    the data [σ ~ s = 3.87] – large sample so don’t worry.
    They then sample 100 customers at random and recalculate a
    monthly phone bill based on competitor’s rates.
    Our null and alternative hypotheses are
    H1: ≠ 17.09. We do this by assuming that:
    H0: = 17.09

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.28
Example 11.2…
    The rejection region is set up so we can reject the null
    hypothesis when the test statistic is large or when it is small.




                   stat is “small”                                   stat is “large”

    That is, we set up a two-tail rejection region. The total area
    in the rejection region must sum to , so we divide α by 2.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.                     11.29
Example 11.2…
    At a 5% significance level (i.e. = .05), we have
       /2 = .025. Thus, z.025 = 1.96 and our rejection region is:


                                         z < –1.96                   -or-        z > 1.96




                                                          -z.025            +z.025     z
                                                                      0

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.                          11.30
Example 11.2…
    From the data, we calculate                                      = 17.55

    Using our standardized test statistic:



    We find that:

    Since z = 1.19 is not greater than 1.96, nor less than –1.96
                  we cannot reject the null hypothesis in favor of
                  H1. That is “there is insufficient evidence to
                  infer that there is a difference between the
                  bills of AT&T and the competitor.”             11.31
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Probability of a Type II Error –
    A Type II error occurs when a false null hypothesis is not
    rejected or “you accept the null when it is not true” but don’t
    say it this way if a statistician is around.

    In practice, this is by far the most serious error you can make
    in most cases, especially in the “quality field”.




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.   11.32
Probability you ship pills whose mean amount of medication is 7 mg approximately 67%




Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.                         11.33
Special Tests of Significance for Large
           Samples: Means
Special Tests of Significance for Large
           Samples: Means
Special Tests of Significance for Large
           Samples: Means
Special Tests of Significance for Large
           Samples: Means
Special Tests of Significance for Large
        Samples: Proportions
Special Tests of Significance for Large
        Samples: Proportions
Special Tests of Significance for Large
   Samples: Difference of Means
Special Tests of Significance for Large
   Samples: Difference of Means
Special Tests of Significance for Large
   Samples: Difference of Means
Special Tests of Significance for Large
   Samples: Difference of Means
Special Tests of Significance for Large
   Samples: Difference of Means
Special Tests of Significance for Large
   Samples: Difference of Means
Special Tests of Significance for Large
 Samples: Difference of Proportions
Special Tests of Significance for Large
 Samples: Difference of Proportions
Special Tests of Significance for Large
 Samples: Difference of Proportions
Special Tests of Significance for Small
           Samples: Means
Special Tests of Significance for Small
           Samples: Means
Special Tests of Significance for Small
           Samples: Means
Special Tests of Significance for Small
           Samples: Means
Using the Student’s t Distribution for
Small Samples (One Sample T-Test)
   When the sample size is small
    (approximately < 100) then the Student’s t
    distribution should be used (see Appendix B)
   The test statistic is known as “t”.
   The curve of the t distribution is flatter than
    that of the Z distribution but as the sample
    size increases, the t-curve starts to resemble
    the Z-curve (see text p. 230 for illustration)
Degrees of Freedom

   The curve of the t distribution varies with
    sample size (the smaller the size, the flatter
    the curve)
   In using the t-table, we use “degrees of
    freedom” based on the sample size.
   For a one-sample test, df = N – 1.
   When looking at the table, find the t-value for
    the appropriate df = N-1. This will be the
    cutoff point for your critical region.
Formula for one sample t-test:


                Χ−µ
   t=
            S
                   N −1
Example

   A random sample of 26 sociology
    graduates scored 458 on the GRE
    advanced sociology test with a standard
    deviation of 20. Is this significantly
    different from the population average
    (µ = 440)?
Solution (using five step model)

   Step 1: Make Assumptions and Meet Test
    Requirements:

   1. Random sample
   2. Level of measurement is interval-ratio
   3. The sample is small (<100)
Solution (cont.)

Step 2: State the null and alternate hypotheses.

H0: µ = 440 (or H0:    = μ)

H1: µ ≠ 440
Solution (cont.)
    Step 3: Select Sampling Distribution and
     Establish the Critical Region

1.   Small sample, I-R level, so use t
     distribution.
2.   Alpha (α) = .05
3.   Degrees of Freedom = N-1 = 26-1 = 25
4.   Critical t = ±2.060
Solution (cont.)
   Step 4: Use Formula to Compute the Test Statistic




           Χ−µ
            458 − 440
t=        =            = 4.5
   S        20
     N −1       26 − 1
Looking at the curve for the t distribution
Alpha (α) = .05




          t= -2.060         t = +2.060


               c             c           t= +4.50
                                              I
Step 5 Make a Decision and Interpret
    Results
   The obtained t score fell in the Critical Region, so
    we reject the H0 (t (obtained) > t (critical)
       If the H0 were true, a sample outcome of 458
        would be unlikely.
       Therefore, the H0 is false and must be rejected.

   Sociology graduates have a GRE score that is
    significantly different from the general student body
    (t = 4.5, df = 25, α = .05).
Testing Sample Proportions:

   When your variable is at the nominal (or
    ordinal) level the one sample z-test for
    proportions should be used.
   If the data are in % format, convert to a
    proportion first.
   The method is the same as the one sample
    Z-test for means (see above)
Special Tests of Significance for Small
          Samples: Variance
Special Tests of Significance for Small
          Samples: Variance
Special Tests of Significance for Small
          Samples: Variance
Special Tests of Significance for Small
   Samples: Difference of Means
Special Tests of Significance for Small
   Samples: Difference of Means
Special Tests of Significance for Small
    Samples: Ratios of Variances
Special Tests of Significance for Small
    Samples: Ratios of Variances
Special Tests of Significance for Small
    Samples: Ratios of Variances
Summary
Example #1
Example #1 - Answer
Example #2
Example #2 - Answer
Example #3
Example #3 - Answer
Example #4
Example #4
Example #5
Example #5 – Answer
Example #6
Ex18) A test of the breaking strengths of 6 ropes manufactured by a company showed a
mean breaking strength of 7750 lb and a standard deviation of 145 lb, whereas the
manufacturer claimed a mean breaking strength of 8000 lb. Can we support the
manufacturér’s claim at a level of significance of (a) 0.05, (b) 0.01? (c) W hat is the P value
of the test?
Example #6 - Answer

More Related Content

What's hot

Chapter10
Chapter10Chapter10
Chapter10
Richard Ferreria
 
Probability and basic statistics with R
Probability and basic statistics with RProbability and basic statistics with R
Probability and basic statistics with R
Alberto Labarga
 
Introduction to hypothesis testing ppt @ bec doms
Introduction to hypothesis testing ppt @ bec domsIntroduction to hypothesis testing ppt @ bec doms
Introduction to hypothesis testing ppt @ bec doms
Babasab Patil
 
Chapter 12
Chapter 12Chapter 12
Chapter 12
Tuul Tuul
 
Chapter 10
Chapter 10Chapter 10
Chapter 10bmcfad01
 
Chapter 06
Chapter 06 Chapter 06
Chapter 06
Tuul Tuul
 
hypothesis test
 hypothesis test hypothesis test
hypothesis test
Unsa Shakir
 
Presentation on Hypothesis Test by Ashik Amin Prem
Presentation on Hypothesis Test by Ashik Amin PremPresentation on Hypothesis Test by Ashik Amin Prem
Presentation on Hypothesis Test by Ashik Amin Prem
AshikAminPrem
 
Les5e ppt 07
Les5e ppt 07Les5e ppt 07
Les5e ppt 07
Subas Nandy
 
Chapter 07
Chapter 07Chapter 07
Chapter 07bmcfad01
 
Chapter 05
Chapter 05 Chapter 05
Chapter 05
Tuul Tuul
 
Testing a claim about a proportion
Testing a claim about a proportion  Testing a claim about a proportion
Testing a claim about a proportion
Long Beach City College
 
IN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COM
IN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COMIN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COM
IN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COM
jorge0050
 
Test hypothesis
Test hypothesisTest hypothesis
Test hypothesis
Homework Guru
 
Testing a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or VarianceTesting a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or Variance
Long Beach City College
 
Chapter 11
Chapter 11Chapter 11
Chapter 11bmcfad01
 
Les5e ppt 10
Les5e ppt 10Les5e ppt 10
Les5e ppt 10
Subas Nandy
 
Chapter 12
Chapter 12Chapter 12
Chapter 12bmcfad01
 
V. pacáková, d. brebera
V. pacáková, d. breberaV. pacáková, d. brebera
V. pacáková, d. brebera
logyalaa
 

What's hot (20)

Chapter10
Chapter10Chapter10
Chapter10
 
Probability and basic statistics with R
Probability and basic statistics with RProbability and basic statistics with R
Probability and basic statistics with R
 
Introduction to hypothesis testing ppt @ bec doms
Introduction to hypothesis testing ppt @ bec domsIntroduction to hypothesis testing ppt @ bec doms
Introduction to hypothesis testing ppt @ bec doms
 
Chapter 12
Chapter 12Chapter 12
Chapter 12
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Chapter 06
Chapter 06 Chapter 06
Chapter 06
 
hypothesis test
 hypothesis test hypothesis test
hypothesis test
 
Presentation on Hypothesis Test by Ashik Amin Prem
Presentation on Hypothesis Test by Ashik Amin PremPresentation on Hypothesis Test by Ashik Amin Prem
Presentation on Hypothesis Test by Ashik Amin Prem
 
Les5e ppt 07
Les5e ppt 07Les5e ppt 07
Les5e ppt 07
 
Chapter 07
Chapter 07Chapter 07
Chapter 07
 
Chapter 05
Chapter 05 Chapter 05
Chapter 05
 
Testing a claim about a proportion
Testing a claim about a proportion  Testing a claim about a proportion
Testing a claim about a proportion
 
IN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COM
IN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COMIN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COM
IN ORDER TO IMPLEMENT A SET OF RULES / TUTORIALOUTLET DOT COM
 
Test hypothesis
Test hypothesisTest hypothesis
Test hypothesis
 
Testing a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or VarianceTesting a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or Variance
 
Ch09(1)
Ch09(1)Ch09(1)
Ch09(1)
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
Les5e ppt 10
Les5e ppt 10Les5e ppt 10
Les5e ppt 10
 
Chapter 12
Chapter 12Chapter 12
Chapter 12
 
V. pacáková, d. brebera
V. pacáková, d. breberaV. pacáková, d. brebera
V. pacáková, d. brebera
 

Viewers also liked

The Doppler Effect
The Doppler EffectThe Doppler Effect
The Doppler Effect
katieliw
 
parametric test of difference z test f test one-way_two-way_anova
parametric test of difference z test f test one-way_two-way_anova parametric test of difference z test f test one-way_two-way_anova
parametric test of difference z test f test one-way_two-way_anova
Tess Anoza
 
Kabanata 4
Kabanata 4Kabanata 4
Kabanata 4
Atty Infact
 
Research paper in filipino
Research paper in filipinoResearch paper in filipino
Research paper in filipino
SFYC
 
THESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHON
THESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHONTHESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHON
THESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHON
Mi L
 
THESIS (Pananaliksik) Tagalog
THESIS (Pananaliksik) TagalogTHESIS (Pananaliksik) Tagalog
THESIS (Pananaliksik) Tagalog
hm alumia
 

Viewers also liked (6)

The Doppler Effect
The Doppler EffectThe Doppler Effect
The Doppler Effect
 
parametric test of difference z test f test one-way_two-way_anova
parametric test of difference z test f test one-way_two-way_anova parametric test of difference z test f test one-way_two-way_anova
parametric test of difference z test f test one-way_two-way_anova
 
Kabanata 4
Kabanata 4Kabanata 4
Kabanata 4
 
Research paper in filipino
Research paper in filipinoResearch paper in filipino
Research paper in filipino
 
THESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHON
THESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHONTHESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHON
THESIS - WIKANG FILIPINO, SA MAKABAGONG PANAHON
 
THESIS (Pananaliksik) Tagalog
THESIS (Pananaliksik) TagalogTHESIS (Pananaliksik) Tagalog
THESIS (Pananaliksik) Tagalog
 

Similar to 12주차

chapter11.ppt
chapter11.pptchapter11.ppt
chapter11.ppt
HasanGilani3
 
Hypothesis
HypothesisHypothesis
Hypothesis
Abhishek Gupta
 
Math3010 week 4
Math3010 week 4Math3010 week 4
Math3010 week 4
stanbridge
 
Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01mandrewmartin
 
hypothesis_testing-ch9-39-14402.pdf
hypothesis_testing-ch9-39-14402.pdfhypothesis_testing-ch9-39-14402.pdf
hypothesis_testing-ch9-39-14402.pdf
Jorge Vega Rodríguez
 
Testing of hypotheses
Testing of hypothesesTesting of hypotheses
Testing of hypotheses
RajThakuri
 
Hypothesis Testing (Statistical Significance)1Hypo.docx
Hypothesis Testing (Statistical Significance)1Hypo.docxHypothesis Testing (Statistical Significance)1Hypo.docx
Hypothesis Testing (Statistical Significance)1Hypo.docx
adampcarr67227
 
Chapter8 Introduction to Estimation Hypothesis Testing.pdf
Chapter8 Introduction to Estimation Hypothesis Testing.pdfChapter8 Introduction to Estimation Hypothesis Testing.pdf
Chapter8 Introduction to Estimation Hypothesis Testing.pdf
mekkimekki5
 
Hypothesis testing Part1
Hypothesis testing Part1Hypothesis testing Part1
Hypothesis testing Part1
Akhila Prabhakaran
 
Business Statistics Chapter 9
Business Statistics Chapter 9Business Statistics Chapter 9
Business Statistics Chapter 9
Lux PP
 
B.tech admission in india
B.tech admission in indiaB.tech admission in india
B.tech admission in india
Edhole.com
 
Chap010.ppt
Chap010.pptChap010.ppt
Chap010.ppt
ManoloTaquire
 
Int 150 The Moral Instinct”1. Most cultures agree that abus.docx
Int 150 The Moral Instinct”1.   Most cultures agree that abus.docxInt 150 The Moral Instinct”1.   Most cultures agree that abus.docx
Int 150 The Moral Instinct”1. Most cultures agree that abus.docx
mariuse18nolet
 
Testing of Hypothesis, p-value, Gaussian distribution, null hypothesis
Testing of Hypothesis, p-value, Gaussian distribution, null hypothesisTesting of Hypothesis, p-value, Gaussian distribution, null hypothesis
Testing of Hypothesis, p-value, Gaussian distribution, null hypothesis
svmmcradonco1
 
Study unit 4 (oct 2014)
Study unit 4 (oct 2014)Study unit 4 (oct 2014)
Study unit 4 (oct 2014)
vfvfx
 
Study unit 4 (oct 2014)
Study unit 4 (oct 2014)Study unit 4 (oct 2014)
Study unit 4 (oct 2014)
vfvfx
 
Lesson05_new
Lesson05_newLesson05_new
Lesson05_newshengvn
 
testingofhypothesis-150108150611-conversion-gate02.pptx
testingofhypothesis-150108150611-conversion-gate02.pptxtestingofhypothesis-150108150611-conversion-gate02.pptx
testingofhypothesis-150108150611-conversion-gate02.pptx
wigatox256
 

Similar to 12주차 (20)

Chapter11 (1)
Chapter11 (1)Chapter11 (1)
Chapter11 (1)
 
chapter11.ppt
chapter11.pptchapter11.ppt
chapter11.ppt
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Math3010 week 4
Math3010 week 4Math3010 week 4
Math3010 week 4
 
More Statistics
More StatisticsMore Statistics
More Statistics
 
Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01
 
hypothesis_testing-ch9-39-14402.pdf
hypothesis_testing-ch9-39-14402.pdfhypothesis_testing-ch9-39-14402.pdf
hypothesis_testing-ch9-39-14402.pdf
 
Testing of hypotheses
Testing of hypothesesTesting of hypotheses
Testing of hypotheses
 
Hypothesis Testing (Statistical Significance)1Hypo.docx
Hypothesis Testing (Statistical Significance)1Hypo.docxHypothesis Testing (Statistical Significance)1Hypo.docx
Hypothesis Testing (Statistical Significance)1Hypo.docx
 
Chapter8 Introduction to Estimation Hypothesis Testing.pdf
Chapter8 Introduction to Estimation Hypothesis Testing.pdfChapter8 Introduction to Estimation Hypothesis Testing.pdf
Chapter8 Introduction to Estimation Hypothesis Testing.pdf
 
Hypothesis testing Part1
Hypothesis testing Part1Hypothesis testing Part1
Hypothesis testing Part1
 
Business Statistics Chapter 9
Business Statistics Chapter 9Business Statistics Chapter 9
Business Statistics Chapter 9
 
B.tech admission in india
B.tech admission in indiaB.tech admission in india
B.tech admission in india
 
Chap010.ppt
Chap010.pptChap010.ppt
Chap010.ppt
 
Int 150 The Moral Instinct”1. Most cultures agree that abus.docx
Int 150 The Moral Instinct”1.   Most cultures agree that abus.docxInt 150 The Moral Instinct”1.   Most cultures agree that abus.docx
Int 150 The Moral Instinct”1. Most cultures agree that abus.docx
 
Testing of Hypothesis, p-value, Gaussian distribution, null hypothesis
Testing of Hypothesis, p-value, Gaussian distribution, null hypothesisTesting of Hypothesis, p-value, Gaussian distribution, null hypothesis
Testing of Hypothesis, p-value, Gaussian distribution, null hypothesis
 
Study unit 4 (oct 2014)
Study unit 4 (oct 2014)Study unit 4 (oct 2014)
Study unit 4 (oct 2014)
 
Study unit 4 (oct 2014)
Study unit 4 (oct 2014)Study unit 4 (oct 2014)
Study unit 4 (oct 2014)
 
Lesson05_new
Lesson05_newLesson05_new
Lesson05_new
 
testingofhypothesis-150108150611-conversion-gate02.pptx
testingofhypothesis-150108150611-conversion-gate02.pptxtestingofhypothesis-150108150611-conversion-gate02.pptx
testingofhypothesis-150108150611-conversion-gate02.pptx
 

More from Kookmin University (9)

13주차
13주차13주차
13주차
 
11주차
11주차11주차
11주차
 
10주차
10주차10주차
10주차
 
9주차
9주차9주차
9주차
 
7주차
7주차7주차
7주차
 
6주차
6주차6주차
6주차
 
5주차
5주차5주차
5주차
 
4주차
4주차4주차
4주차
 
2주차
2주차2주차
2주차
 

12주차

  • 1. Introduction to Probability and Statistics 12th Week (5/31) Hypothesis Testing (II)
  • 4. Statistical Hypotheses and Null Hypotheses •Statistical hypotheses: Assumptions or guesses about the populations involved. (Such assumptions, which may or may not be true) •Null hypotheses (H0): Hypothesis that there is no difference between the procedures. We formulate it if we want to decide whether one procedure is better than another. •Alternative hypotheses (H1): Any hypothesis that differs from a given null hypothesis Example 1. For example, if the null hypothesis is p = 0.5, possible alternative hypotheses are p =0.7, or p ≠ 0.5.
  • 5. Table 7-2 Type I and Type II Errors True State of Nature The null The null hypothesis is hypothesis is true false We decide to Type I error Correct reject the (rejecting a true decision null hypothesis null hypothesis) Decision We fail to Type II error Correct reject the (failing to reject decision null hypothesis a false null hypothesis)
  • 6. P Value •Small P values provide evidence for rejecting the null hypothesis in favor of the alternative hypothesis, and large P values provide evidence for not rejecting the null hypothesis in favor of the alternative hypothesis. •The P value and the level of significance do not provide criteria for rejecting or not rejecting the null hypothesis by itself, but for rejecting or not rejecting the null hypothesis in favor of the alternative hypothesis. • When the test statistic S is the standard normal random variable, the table in Appendix C is sufficient to compute the P value, but when S is one of the t, F, or chi-square random variables, all of which have different distributions depending on their degrees of freedom, either computer software or more extensive tables will be needed to compute the P value.
  • 7. Concepts of Hypothesis Testing… For example, if we’re trying to decide whether the mean is not equal to 350, a large value of (say, 600) would provide enough evidence. If is close to 350 (say, 355) we could not say that this provides a great deal of evidence to infer that the population mean is different than 350. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.7
  • 8. Concepts of Hypothesis Testing (4)… The two possible decisions that can be made: Conclude that there is enough evidence to support the alternative hypothesis (also stated as: reject the null hypothesis in favor of the alternative) Conclude that there is not enough evidence to support the alternative hypothesis (also stated as: failing to reject the null hypothesis in favor of the alternative) NOTE: we do not say that we accept the null hypothesis if a statistician is around… Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.8
  • 9. Concepts of Hypothesis Testing (2)… The testing procedure begins with the assumption that the null hypothesis is true. Thus, until we have further statistical evidence, we will assume: H0: = 350 (assumed to be TRUE) The next step will be to determine the sampling distribution of the sample mean assuming the true mean is 350. is normal with 350 75/SQRT(25) = 15 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.9
  • 10. Is the Sample Mean in the Guts of the Sampling Distribution?? Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.10
  • 11. Three ways to determine this: First way 1. Unstandardized test statistic: Is in the guts of the sampling distribution? Depends on what you define as the “guts” of the sampling distribution. If we define the guts as the center 95% of the distribution [this means α = 0.05], then the critical values that define the guts will be 1.96 standard deviations of X-Bar on either side of the mean of the sampling distribution [350], or UCV = 350 + 1.96*15 = 350 + 29.4 = 379.4 LCV = 350 – 1.96*15 = 350 – 29.4 = 320.6 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.11
  • 12. 1. Unstandardized Test Statistic Approach Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.12
  • 13. Three ways to determine this: Second way 2. Standardized test statistic: Since we defined the “guts” of the sampling distribution to be the center 95% [α = 0.05], If the Z-Score for the sample mean is greater than 1.96, we know that will be in the reject region on the right side or If the Z-Score for the sample mean is less than -1.97, we know that will be in the reject region on the left side. Z=( - )/ = (370.16 – 350)/15 = 1.344 Is this Z-Score in the guts of the sampling distribution??? Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.13
  • 14. 2. Standardized Test Statistic Approach Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.14
  • 15. Three ways to determine this: Third way 3. The p-value approach (which is generally used with a computer and statistical software): Increase the “Rejection Region” until it “captures” the sample mean. For this example, since is to the right of the mean, calculate P( > 370.16) = P(Z > 1.344) = 0.0901 Since this is a two tailed test, you must double this area for the p-value. p-value = 2*(0.0901) = 0.1802 Since we defined the guts as the center 95% [α = 0.05], the reject region is the other 5%. Since our sample mean, , is in the 18.02% region, it cannot be in our 5% rejection region [α = 0.05]. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.15
  • 16. 3. p-value approach Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.16
  • 17. Statistical Conclusions: Unstandardized Test Statistic: Since LCV (320.6) < (370.16) < UCV (379.4), we reject the null hypothesis at a 5% level of significance. Standardized Test Statistic: Since -Zα/2(-1.96) < Z(1.344) < Zα/2 (1.96), we fail to reject the null hypothesis at a 5% level of significance. P-value: Since p-value (0.1802) > 0.05 [α], we fail to reject the hull hypothesis at a 5% level of significance. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.17
  • 18. Example 11.1… A department store manager determines that a new billing system will be cost-effective only if the mean monthly account is more than $170. A random sample of 400 monthly accounts is drawn, for which the sample mean is $178. The accounts are approximately normally distributed with a standard deviation of $65. Can we conclude that the new system will be cost-effective? Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.18
  • 19. Example 11.1… The system will be cost effective if the mean account balance for all customers is greater than $170. We express this belief as a our research hypothesis, that is: H1: > 170 (this is what we want to determine) Thus, our null hypothesis becomes: H0: = 170 (this specifies a single value for the parameter of interest) – Actually H0: μ < 170 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.19
  • 20. Example 11.1… What we want to show: H1: > 170 H0: < 170 (we’ll assume this is true) Normally we put Ho first. We know: n = 400, = 178, and = 65 = 65/SQRT(400) = 3.25 α = 0.05 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.20
  • 21. Example 11.1… Rejection Region… The rejection region is a range of values such that if the test statistic falls into that range, we decide to reject the null hypothesis in favor of the alternative hypothesis. is the critical value of to reject H0. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.21
  • 22. Example 11.1… At a 5% significance level (i.e. =0.05), we get [all α in one tail] Zα = Z0.05 = 1.645 Therefore, UCV = 170 + 1.645*3.25 = 175.35 Since our sample mean (178) is greater than the critical value we calculated (175.35), we reject the null hypothesis in favor of H1 OR (>1.645) Reject null OR p-value = P( > 178) = P(Z > 2.46) = 0.0069 < 0.05 Reject null Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.22
  • 23. Example 11.1… The Big Picture… H1: > 170 =175.34 H0: = 170 =178 Reject H0 in favor of Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.23
  • 24. Conclusions of a Test of Hypothesis… If we reject the null hypothesis, we conclude that there is enough evidence to infer that the alternative hypothesis is true. If we fail to reject the null hypothesis, we conclude that there is not enough statistical evidence to infer that the alternative hypothesis is true. This does not mean that we have proven that the null hypothesis is true! Keep in mind that committing a Type I error OR a Type II error can be VERY bad depending on the problem. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.24
  • 25. One tail test with rejection region on right The last example was a one tail test, because the rejection region is located in only one tail of the sampling distribution: More correctly, this was an example of a right tail test. H1: μ > 170 H0: μ < 170 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.25
  • 26. One tail test with rejection region on left The rejection region will be in the left tail. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.26
  • 27. Two tail test with rejection region in both tails The rejection region is split equally between the two tails. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.27
  • 28. Example 11.2… Students work AT&T’s argues that its rates are such that customers won’t see a difference in their phone bills between them and their competitors. They calculate the mean and standard deviation for all their customers at $17.09 and $3.87 (respectively). Note: Don’t know the true value for σ, so we estimate σ from the data [σ ~ s = 3.87] – large sample so don’t worry. They then sample 100 customers at random and recalculate a monthly phone bill based on competitor’s rates. Our null and alternative hypotheses are H1: ≠ 17.09. We do this by assuming that: H0: = 17.09 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.28
  • 29. Example 11.2… The rejection region is set up so we can reject the null hypothesis when the test statistic is large or when it is small. stat is “small” stat is “large” That is, we set up a two-tail rejection region. The total area in the rejection region must sum to , so we divide α by 2. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.29
  • 30. Example 11.2… At a 5% significance level (i.e. = .05), we have /2 = .025. Thus, z.025 = 1.96 and our rejection region is: z < –1.96 -or- z > 1.96 -z.025 +z.025 z 0 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.30
  • 31. Example 11.2… From the data, we calculate = 17.55 Using our standardized test statistic: We find that: Since z = 1.19 is not greater than 1.96, nor less than –1.96 we cannot reject the null hypothesis in favor of H1. That is “there is insufficient evidence to infer that there is a difference between the bills of AT&T and the competitor.” 11.31 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
  • 32. Probability of a Type II Error – A Type II error occurs when a false null hypothesis is not rejected or “you accept the null when it is not true” but don’t say it this way if a statistician is around. In practice, this is by far the most serious error you can make in most cases, especially in the “quality field”. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.32
  • 33. Probability you ship pills whose mean amount of medication is 7 mg approximately 67% Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 11.33
  • 34. Special Tests of Significance for Large Samples: Means
  • 35. Special Tests of Significance for Large Samples: Means
  • 36. Special Tests of Significance for Large Samples: Means
  • 37. Special Tests of Significance for Large Samples: Means
  • 38. Special Tests of Significance for Large Samples: Proportions
  • 39. Special Tests of Significance for Large Samples: Proportions
  • 40. Special Tests of Significance for Large Samples: Difference of Means
  • 41. Special Tests of Significance for Large Samples: Difference of Means
  • 42. Special Tests of Significance for Large Samples: Difference of Means
  • 43. Special Tests of Significance for Large Samples: Difference of Means
  • 44. Special Tests of Significance for Large Samples: Difference of Means
  • 45. Special Tests of Significance for Large Samples: Difference of Means
  • 46. Special Tests of Significance for Large Samples: Difference of Proportions
  • 47. Special Tests of Significance for Large Samples: Difference of Proportions
  • 48. Special Tests of Significance for Large Samples: Difference of Proportions
  • 49. Special Tests of Significance for Small Samples: Means
  • 50. Special Tests of Significance for Small Samples: Means
  • 51. Special Tests of Significance for Small Samples: Means
  • 52. Special Tests of Significance for Small Samples: Means
  • 53. Using the Student’s t Distribution for Small Samples (One Sample T-Test)  When the sample size is small (approximately < 100) then the Student’s t distribution should be used (see Appendix B)  The test statistic is known as “t”.  The curve of the t distribution is flatter than that of the Z distribution but as the sample size increases, the t-curve starts to resemble the Z-curve (see text p. 230 for illustration)
  • 54. Degrees of Freedom  The curve of the t distribution varies with sample size (the smaller the size, the flatter the curve)  In using the t-table, we use “degrees of freedom” based on the sample size.  For a one-sample test, df = N – 1.  When looking at the table, find the t-value for the appropriate df = N-1. This will be the cutoff point for your critical region.
  • 55. Formula for one sample t-test: Χ−µ t= S N −1
  • 56. Example  A random sample of 26 sociology graduates scored 458 on the GRE advanced sociology test with a standard deviation of 20. Is this significantly different from the population average (µ = 440)?
  • 57. Solution (using five step model)  Step 1: Make Assumptions and Meet Test Requirements:  1. Random sample  2. Level of measurement is interval-ratio  3. The sample is small (<100)
  • 58. Solution (cont.) Step 2: State the null and alternate hypotheses. H0: µ = 440 (or H0: = μ) H1: µ ≠ 440
  • 59. Solution (cont.)  Step 3: Select Sampling Distribution and Establish the Critical Region 1. Small sample, I-R level, so use t distribution. 2. Alpha (α) = .05 3. Degrees of Freedom = N-1 = 26-1 = 25 4. Critical t = ±2.060
  • 60. Solution (cont.)  Step 4: Use Formula to Compute the Test Statistic Χ−µ 458 − 440 t= = = 4.5 S 20 N −1 26 − 1
  • 61. Looking at the curve for the t distribution Alpha (α) = .05 t= -2.060 t = +2.060 c c t= +4.50 I
  • 62. Step 5 Make a Decision and Interpret Results  The obtained t score fell in the Critical Region, so we reject the H0 (t (obtained) > t (critical)  If the H0 were true, a sample outcome of 458 would be unlikely.  Therefore, the H0 is false and must be rejected.  Sociology graduates have a GRE score that is significantly different from the general student body (t = 4.5, df = 25, α = .05).
  • 63. Testing Sample Proportions:  When your variable is at the nominal (or ordinal) level the one sample z-test for proportions should be used.  If the data are in % format, convert to a proportion first.  The method is the same as the one sample Z-test for means (see above)
  • 64. Special Tests of Significance for Small Samples: Variance
  • 65. Special Tests of Significance for Small Samples: Variance
  • 66. Special Tests of Significance for Small Samples: Variance
  • 67. Special Tests of Significance for Small Samples: Difference of Means
  • 68. Special Tests of Significance for Small Samples: Difference of Means
  • 69. Special Tests of Significance for Small Samples: Ratios of Variances
  • 70. Special Tests of Significance for Small Samples: Ratios of Variances
  • 71. Special Tests of Significance for Small Samples: Ratios of Variances
  • 74. Example #1 - Answer
  • 76. Example #2 - Answer
  • 78. Example #3 - Answer
  • 82. Example #5 – Answer
  • 83. Example #6 Ex18) A test of the breaking strengths of 6 ropes manufactured by a company showed a mean breaking strength of 7750 lb and a standard deviation of 145 lb, whereas the manufacturer claimed a mean breaking strength of 8000 lb. Can we support the manufacturér’s claim at a level of significance of (a) 0.05, (b) 0.01? (c) W hat is the P value of the test?
  • 84. Example #6 - Answer