Elementary Statistics
Chapter 7: Estimating
Parameters and
Determining Sample
Sizes
7.3 Estimating a
Population Standard
Deviation or Variance
1
7.1 Estimating a Population Proportion
7.2 Estimating a Population Mean
7.3 Estimating a Population Standard Deviation or Variance
7.4 Bootstrapping: Using Technology for Estimates
2
Chapter 7:
Estimating Parameters and Determining Sample Sizes
Objectives:
• Find the confidence interval for a proportion.
• Determine the minimum sample size for finding a confidence interval for a proportion.
• Find the confidence interval for the mean when  is known.
• Determine the minimum sample size for finding a confidence interval for the mean.
• Find the confidence interval for the mean when  is unknown.
• Find a confidence interval for a variance and a standard deviation.
Key Concept:
Use sample standard deviation s (or a sample variance s²) to estimate the value
of the corresponding population standard deviation σ (or population variance σ²).
The sample variance s² is the best point estimate of the population variance σ².
To Construct a confidence interval estimate of a population standard deviation, use χ²
distribution.
When products that fit together (such as pipes) are manufactured, it is important to
keep the variations of the diameters of the products as small as possible; otherwise,
they will not fit together properly and will have to be scrapped.
In the manufacture of medicines, the variance and standard deviation of the medication
in the pills play an important role in making sure patients receive the proper dosage.
For these reasons, confidence intervals for variances and standard deviations are
necessary.
3
7.3 Estimating a Population Standard Deviation or Variance
Key points about the χ² (chi-square or chi-squared) distribution:
The chi-square distribution must be used to calculate confidence intervals for variances and standard deviations.
The chi-square variable is similar to the t variable in that its distribution is a family of curves based on the number of
degrees of freedom. The symbol for chi-square is χ² (Greek letter chi, pronounced “ki”).
A chi-square variable cannot be negative, and the distributions are skewed to the right.
At about 100 degrees of freedom, the chi-square distribution becomes somewhat symmetric.
The area under each chi-square distribution is equal to 1.00, or 100%.
In a normally distributed population with variance σ2 , assume that we
randomly select independent samples of size n and, for each sample,
compute the sample variance s2 (which is the square of the sample
standard deviation s).
The sample statistic χ2 (pronounced chi-square)
χ2 =
(𝑛−1)𝑠2
𝜎2 has a sampling distribution called the chi-square distribution.
4
7.3 Estimating a Population Standard Deviation or Variance, Chi-Square Distribution
χ2 =
(𝑛−1)𝑠2
𝜎2
Chi-Square Distribution & Confidence Interval
Critical Values of χ² We denote a right-tailed critical value by χR
² and we denote a left-tailed
critical value by χL
². Those critical values can be found by using technology or χ² -Table.
Degrees of Freedom: df = n − 1
As the number of degrees of freedom increases, the chi-square distribution approaches a
normal distribution.
Not symmetric: Confidence interval estimate of σ² does not fit a format of
s² − E < σ² < s² + E, so we must do separate calculations for the upper and lower confidence
interval limits.
Critical value of χ² in the body of the table corresponds to an area given in the top row of the
table, and each area in that top row is a cumulative area to the right of the critical value.
5
(𝑛 − 1)𝑠2
𝜒right
2 < 𝜎2
<
(𝑛 − 1)𝑠2
𝜒left
2 , d.f. = 𝑛 − 1
(𝑛 − 1)𝑠2
𝜒right
2 < 𝜎 <
(𝑛 − 1)𝑠2
𝜒left
2 , d.f. = 𝑛 − 1
Example 1: Finding Critical Value of 𝝌𝟐
Find the values for χ²
R and χ²
L for a
90% confidence interval when n = 25.
6
df = n – 1 =24
χ²
R: 1 – 0.90 = 0.10.
Divide by 2 to get 𝐴𝑅 = 0.05
χ²
L: 𝐴𝐿 =1 – 0.05 = 0.95.
𝜒R
2
=36.415, 𝜒L
2
=13.848
TI Calculator
𝝌𝟐
− Distribution Value
1. Math
2. Solver (at the bottom):
3. Put 𝝌𝟐
CDF (2nd , Vars, Down
to 𝝌𝟐
CDF): (Lower, Upper, df)
4. (Lower = 0, Upper = x, df),
Enter
5. Subtract
𝜶
𝟐
, and Enter
6. Click on Alpha key and Enter
Statistics calculator:
https://www.danielsoper.com/statcalc/default.aspx
Example 2: Finding Critical Value of χ²
Find the values for χ²
R and χ²
L for a 95%
confidence interval when n = 22.
7
𝜒R
2
=35.479
𝜒L
2
=10.283
df = n – 1 = 21
χ²
R: 1 – 0.95 = 0.05. Divide by 2 to get 𝐴𝑅 = 0.025
χ²
L: 𝐴𝐿 = 1 – 0.025 = 0.975
Confidence Interval for Estimating a Population Standard Deviation or Variance
σ = population standard deviation,
σ² = population variance
s = sample standard deviation,
s² = sample variance
n = number of sample values
E = margin of error χ²
χ²
L = left-tailed critical value of χ²
χ²
R = left-tailed critical value of χ²
1. The sample is a simple random sample.
2. The population must have normally distributed
values. The requirement of a normal distribution
is much stricter here than in earlier sections, so
large departures from normal distributions can
result in large errors.
8
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎2 <
(𝑛 − 1)𝑠2
𝜒L
2
d.f. = 𝑛 − 1
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎 <
(𝑛 − 1)𝑠2
𝜒L
2
Find the 95% confidence interval for the variance and standard deviation
of the Vitamin B1 content of cartoon of milks if a sample of 20 milk
cartoons has a standard deviation of 1.6 milligrams. (Assume ND)
Example 3
Given: Random sample and n = 20, s = 1.6 mg, 95% CI = ?
9
df = n – 1 = 19
χ²
R: 1 – 0.95 = 0.05 → 𝐴𝑅 = 0.025
χ²
L: 𝐴𝐿 =1 – 0.025 = 0.975
χ²
R = 32.852 and χ²
L = 8.907
(19)(1.6)2
32.852
< 𝜎2
<
(19)(1.6)2
8.907
1.4806 < 𝜎2 < 5.4609
1.4806 < 𝜎 < 5.4609
1.2168 < 𝜎 < 2.3369
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎2 <
(𝑛 − 1)𝑠2
𝜒L
2 →
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎 <
(𝑛 − 1)𝑠2
𝜒L
2
Find the 90% confidence interval for the variance and standard deviation for the price
in dollars of an adult single-day Theme park ticket. The data represent a selected
sample of local Theme parks. Assume the variable is normally distributed.
59 54 53 52 51 39 49 46 49 48
Example 4
Given: ND Random sample and n = 10, 90% CI = ?
10
df = n – 1 = 9
χ²
R: 1 – 0.90 = 0.1 → 𝐴𝑅 = 0.05
χ²
L: 𝐴𝐿 = 1 – 0.05 = 0.95
χ²
R = 16.919, χ²
L = 3.325
(9)(28.2)
16.919
< 𝜎2 <
(9)(28.2)
3.325
15.0127 < 𝜎2 < 76.3910
15.0127 < 𝜎 < 76.3910
3.8746 < 𝜎 < 8.7402
Technology: s = 5.3125, s2 = 28.2
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎2 <
(𝑛 − 1)𝑠2
𝜒L
2 →
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎 <
(𝑛 − 1)𝑠2
𝜒L
2
TI Calculator:
How to enter data:
1. Stat
2. Edit
3. ClrList 𝑳𝟏
4. Or Highlight & Clear
5. Type in your data in L1, ..
Mean, SD, 5-number summary
1. Stat
2. Calc
3. Select 1 for 1 variable
4. Type: L1 (second 1)
5. Enter (or Calculate)
6. Scroll down for 5-number summary
A group of 22 subjects took an IQ test. The subjects had a standard
deviation IQ score of 14.3. Construct a 95% confidence interval estimate
of σ, the standard deviation of the population from which the sample was
obtained. (Assume Normally distributed data)
Example 5
Given: ND, Random sample and n = 22, s = 14.3, 95% CI = ?
11
df = n – 1 = 21
χ²
R: 1 – 0.95 = 0.05 → 𝐴𝑅 = 0.025
χ²
L: 𝐴𝐿 = 1 – 0.025 = 0.975
𝜒R
2
=35.479
𝜒L
2
=10.283
(22 − 1)(14.3)2
35.479
< 𝜎2 <
(22 − 1)(14.3)2
10.283
121.0375 < 𝜎2 < 417.6106
11.0017 < 𝜎 < 20.4355
Based on this result, we have 95% confidence that the
limits of 11.0 and 20.4 contain the true value of σ.
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎2 <
(𝑛 − 1)𝑠2
𝜒L
2 →
(𝑛 − 1)𝑠2
𝜒R
2 < 𝜎 <
(𝑛 − 1)𝑠2
𝜒L
2
Determining Sample Sizes
The procedures for finding the sample size necessary to estimate s are much more complex than the
procedures given earlier for means and proportions. For normally distributed populations, the table on
the following slide, or the following formula can be used:
12
To be 95%
confident that s
is within …
Of the value of σ,
the sample size n
should be at least
1% 19,205
5% 768
10% 192
20% 48
30% 21
40% 12
50% 8
To be 99%
confident that s
is within …
Of the value of σ,
the sample size n
should be at least
1% 33,218
5% 1,338
10% 336
20% 85
30% 38
40% 22
50% 14
Example 6: We want to estimate the standard deviation σ of all IQ scores of
people with exposure to lead. We want to be 99% confident that our estimate
is within 5% of the true value of σ. How large should the sample be? Assume
that the population is normally distributed.
Obtain a simple
random sample of
1338 IQ scores
from the
population of
subjects exposed
to lead.
𝑛 =
1
2
𝑍𝛼/2
𝑑
2
𝑑 = 𝑑𝑒𝑐𝑖𝑚𝑎𝑙 𝑓𝑜𝑟𝑚 𝑜𝑓
𝑡ℎ𝑒 𝑝𝑒𝑟𝑐𝑒𝑛𝑡𝑎𝑔𝑒 𝑒𝑟𝑟𝑜𝑟
Example: to be 95%
confident that s is within 15%
of 𝜎,
𝑛 =
1
2
1.96
0.15
2
= 85.37
𝑛 = 86

Estimating a Population Standard Deviation or Variance

  • 1.
    Elementary Statistics Chapter 7:Estimating Parameters and Determining Sample Sizes 7.3 Estimating a Population Standard Deviation or Variance 1
  • 2.
    7.1 Estimating aPopulation Proportion 7.2 Estimating a Population Mean 7.3 Estimating a Population Standard Deviation or Variance 7.4 Bootstrapping: Using Technology for Estimates 2 Chapter 7: Estimating Parameters and Determining Sample Sizes Objectives: • Find the confidence interval for a proportion. • Determine the minimum sample size for finding a confidence interval for a proportion. • Find the confidence interval for the mean when  is known. • Determine the minimum sample size for finding a confidence interval for the mean. • Find the confidence interval for the mean when  is unknown. • Find a confidence interval for a variance and a standard deviation.
  • 3.
    Key Concept: Use samplestandard deviation s (or a sample variance s²) to estimate the value of the corresponding population standard deviation σ (or population variance σ²). The sample variance s² is the best point estimate of the population variance σ². To Construct a confidence interval estimate of a population standard deviation, use χ² distribution. When products that fit together (such as pipes) are manufactured, it is important to keep the variations of the diameters of the products as small as possible; otherwise, they will not fit together properly and will have to be scrapped. In the manufacture of medicines, the variance and standard deviation of the medication in the pills play an important role in making sure patients receive the proper dosage. For these reasons, confidence intervals for variances and standard deviations are necessary. 3 7.3 Estimating a Population Standard Deviation or Variance
  • 4.
    Key points aboutthe χ² (chi-square or chi-squared) distribution: The chi-square distribution must be used to calculate confidence intervals for variances and standard deviations. The chi-square variable is similar to the t variable in that its distribution is a family of curves based on the number of degrees of freedom. The symbol for chi-square is χ² (Greek letter chi, pronounced “ki”). A chi-square variable cannot be negative, and the distributions are skewed to the right. At about 100 degrees of freedom, the chi-square distribution becomes somewhat symmetric. The area under each chi-square distribution is equal to 1.00, or 100%. In a normally distributed population with variance σ2 , assume that we randomly select independent samples of size n and, for each sample, compute the sample variance s2 (which is the square of the sample standard deviation s). The sample statistic χ2 (pronounced chi-square) χ2 = (𝑛−1)𝑠2 𝜎2 has a sampling distribution called the chi-square distribution. 4 7.3 Estimating a Population Standard Deviation or Variance, Chi-Square Distribution χ2 = (𝑛−1)𝑠2 𝜎2
  • 5.
    Chi-Square Distribution &Confidence Interval Critical Values of χ² We denote a right-tailed critical value by χR ² and we denote a left-tailed critical value by χL ². Those critical values can be found by using technology or χ² -Table. Degrees of Freedom: df = n − 1 As the number of degrees of freedom increases, the chi-square distribution approaches a normal distribution. Not symmetric: Confidence interval estimate of σ² does not fit a format of s² − E < σ² < s² + E, so we must do separate calculations for the upper and lower confidence interval limits. Critical value of χ² in the body of the table corresponds to an area given in the top row of the table, and each area in that top row is a cumulative area to the right of the critical value. 5 (𝑛 − 1)𝑠2 𝜒right 2 < 𝜎2 < (𝑛 − 1)𝑠2 𝜒left 2 , d.f. = 𝑛 − 1 (𝑛 − 1)𝑠2 𝜒right 2 < 𝜎 < (𝑛 − 1)𝑠2 𝜒left 2 , d.f. = 𝑛 − 1
  • 6.
    Example 1: FindingCritical Value of 𝝌𝟐 Find the values for χ² R and χ² L for a 90% confidence interval when n = 25. 6 df = n – 1 =24 χ² R: 1 – 0.90 = 0.10. Divide by 2 to get 𝐴𝑅 = 0.05 χ² L: 𝐴𝐿 =1 – 0.05 = 0.95. 𝜒R 2 =36.415, 𝜒L 2 =13.848 TI Calculator 𝝌𝟐 − Distribution Value 1. Math 2. Solver (at the bottom): 3. Put 𝝌𝟐 CDF (2nd , Vars, Down to 𝝌𝟐 CDF): (Lower, Upper, df) 4. (Lower = 0, Upper = x, df), Enter 5. Subtract 𝜶 𝟐 , and Enter 6. Click on Alpha key and Enter Statistics calculator: https://www.danielsoper.com/statcalc/default.aspx
  • 7.
    Example 2: FindingCritical Value of χ² Find the values for χ² R and χ² L for a 95% confidence interval when n = 22. 7 𝜒R 2 =35.479 𝜒L 2 =10.283 df = n – 1 = 21 χ² R: 1 – 0.95 = 0.05. Divide by 2 to get 𝐴𝑅 = 0.025 χ² L: 𝐴𝐿 = 1 – 0.025 = 0.975
  • 8.
    Confidence Interval forEstimating a Population Standard Deviation or Variance σ = population standard deviation, σ² = population variance s = sample standard deviation, s² = sample variance n = number of sample values E = margin of error χ² χ² L = left-tailed critical value of χ² χ² R = left-tailed critical value of χ² 1. The sample is a simple random sample. 2. The population must have normally distributed values. The requirement of a normal distribution is much stricter here than in earlier sections, so large departures from normal distributions can result in large errors. 8 (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎2 < (𝑛 − 1)𝑠2 𝜒L 2 d.f. = 𝑛 − 1 (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎 < (𝑛 − 1)𝑠2 𝜒L 2
  • 9.
    Find the 95%confidence interval for the variance and standard deviation of the Vitamin B1 content of cartoon of milks if a sample of 20 milk cartoons has a standard deviation of 1.6 milligrams. (Assume ND) Example 3 Given: Random sample and n = 20, s = 1.6 mg, 95% CI = ? 9 df = n – 1 = 19 χ² R: 1 – 0.95 = 0.05 → 𝐴𝑅 = 0.025 χ² L: 𝐴𝐿 =1 – 0.025 = 0.975 χ² R = 32.852 and χ² L = 8.907 (19)(1.6)2 32.852 < 𝜎2 < (19)(1.6)2 8.907 1.4806 < 𝜎2 < 5.4609 1.4806 < 𝜎 < 5.4609 1.2168 < 𝜎 < 2.3369 (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎2 < (𝑛 − 1)𝑠2 𝜒L 2 → (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎 < (𝑛 − 1)𝑠2 𝜒L 2
  • 10.
    Find the 90%confidence interval for the variance and standard deviation for the price in dollars of an adult single-day Theme park ticket. The data represent a selected sample of local Theme parks. Assume the variable is normally distributed. 59 54 53 52 51 39 49 46 49 48 Example 4 Given: ND Random sample and n = 10, 90% CI = ? 10 df = n – 1 = 9 χ² R: 1 – 0.90 = 0.1 → 𝐴𝑅 = 0.05 χ² L: 𝐴𝐿 = 1 – 0.05 = 0.95 χ² R = 16.919, χ² L = 3.325 (9)(28.2) 16.919 < 𝜎2 < (9)(28.2) 3.325 15.0127 < 𝜎2 < 76.3910 15.0127 < 𝜎 < 76.3910 3.8746 < 𝜎 < 8.7402 Technology: s = 5.3125, s2 = 28.2 (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎2 < (𝑛 − 1)𝑠2 𝜒L 2 → (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎 < (𝑛 − 1)𝑠2 𝜒L 2 TI Calculator: How to enter data: 1. Stat 2. Edit 3. ClrList 𝑳𝟏 4. Or Highlight & Clear 5. Type in your data in L1, .. Mean, SD, 5-number summary 1. Stat 2. Calc 3. Select 1 for 1 variable 4. Type: L1 (second 1) 5. Enter (or Calculate) 6. Scroll down for 5-number summary
  • 11.
    A group of22 subjects took an IQ test. The subjects had a standard deviation IQ score of 14.3. Construct a 95% confidence interval estimate of σ, the standard deviation of the population from which the sample was obtained. (Assume Normally distributed data) Example 5 Given: ND, Random sample and n = 22, s = 14.3, 95% CI = ? 11 df = n – 1 = 21 χ² R: 1 – 0.95 = 0.05 → 𝐴𝑅 = 0.025 χ² L: 𝐴𝐿 = 1 – 0.025 = 0.975 𝜒R 2 =35.479 𝜒L 2 =10.283 (22 − 1)(14.3)2 35.479 < 𝜎2 < (22 − 1)(14.3)2 10.283 121.0375 < 𝜎2 < 417.6106 11.0017 < 𝜎 < 20.4355 Based on this result, we have 95% confidence that the limits of 11.0 and 20.4 contain the true value of σ. (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎2 < (𝑛 − 1)𝑠2 𝜒L 2 → (𝑛 − 1)𝑠2 𝜒R 2 < 𝜎 < (𝑛 − 1)𝑠2 𝜒L 2
  • 12.
    Determining Sample Sizes Theprocedures for finding the sample size necessary to estimate s are much more complex than the procedures given earlier for means and proportions. For normally distributed populations, the table on the following slide, or the following formula can be used: 12 To be 95% confident that s is within … Of the value of σ, the sample size n should be at least 1% 19,205 5% 768 10% 192 20% 48 30% 21 40% 12 50% 8 To be 99% confident that s is within … Of the value of σ, the sample size n should be at least 1% 33,218 5% 1,338 10% 336 20% 85 30% 38 40% 22 50% 14 Example 6: We want to estimate the standard deviation σ of all IQ scores of people with exposure to lead. We want to be 99% confident that our estimate is within 5% of the true value of σ. How large should the sample be? Assume that the population is normally distributed. Obtain a simple random sample of 1338 IQ scores from the population of subjects exposed to lead. 𝑛 = 1 2 𝑍𝛼/2 𝑑 2 𝑑 = 𝑑𝑒𝑐𝑖𝑚𝑎𝑙 𝑓𝑜𝑟𝑚 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑒𝑟𝑐𝑒𝑛𝑡𝑎𝑔𝑒 𝑒𝑟𝑟𝑜𝑟 Example: to be 95% confident that s is within 15% of 𝜎, 𝑛 = 1 2 1.96 0.15 2 = 85.37 𝑛 = 86