SlideShare a Scribd company logo
1 of 14
Confidence Interval Module
One of the key concepts of statistics enabling statisticians to
make incredibly accurate predictions is called the Central Limit
Theorem. The Central Limit Theorem is defined in this way:
· For samples of a sufficiently large size, the real distribution of
means is almost always approximately normal.
· The distribution of means gets closer and closer to normal as
the sample size gets larger and larger, regardless of what the
original variable looks like (positively or negatively skewed).
· In other words, the original variable does not have to be
normally distributed.
· This is because, if we as eccentric researchers, drew an almost
infinite number of random samples from a single population
(such as the student body of NMSU), the means calculated from
the many samples of that population will be normally
distributed and the mean calculated from all of those samples
would be a very close approximation to the true population
mean. It is this very characteristic that makes it possible for us,
using sound probability based sampling techniques, to make
highly accurate statements about characteristics of a population
based upon the statistics calculated on a sample drawn from that
population.
· Furthermore, we can calculate a statistic known as the
standard error of the mean (abbreviated s.e.) that describes the
variability of the distribution of all possible sample means in
the same way that we used the standard deviation to describe
the variability of a single sample. We will use the standard
error of the mean (s.e.) to calculate the statistic that is the topic
of this module, the confidence interval.
The formula that we use to calculate the standard error of the
mean is:
s.e. = s / √N – 1
where s = the standard deviation calculated from the sample;
and
N = the sample size.
So the formula tells us that the standard error of the mean is
equal to the
standard deviation divided by the square root of the sample size
minus 1.
This is the preferred formula for practicing professionals as it
accounts for errors that may be a function of the particular
sample we have selected.
THE CONFIDENCE INTERVAL (CI)
The formula for the CI is a function of the sample size (N).
For samples sizes ≥ 100, the formula for the CI is:
CI = (the sample mean) + & - Z(s.e.).
Let’s look at an example to see how this formula works.
* Please use a pdf doc. “how to solve the problem”, I have
provided for you under the “notes” link.
Example 1
Suppose that we conducted interviews with 140 randomly
selected individuals (N = 140) in a large metropolitan area. We
assured these individuals that their answers would remain
confidential, and we asked them about their law-breaking
behavior. Among other questions the individuals were asked to
self-report the number of times per month they exceeded the
speed limit. One of the objectives of the study was to estimate
(make an inference about) the average number of times per
month residents in all metropolitan areas across the country
exceeded the speed limit. The sample statistics we obtained
were as follows:
Mean = 12.4 times
S = 3.2 times
N = 140
Let’s construct a 95% CI around our estimate of the mean drawn
from this sample.
The sample mean of 12.4 times tells us that, on average, the
individuals from our sample exceed the speed limit about 12.4
times a month. This sample mean estimate is our best point
estimate of the true population mean. We know full well that
12.4 times is not the true population mean and that repeated
samples will yield different means. What does our sample mean
tell us about the mean of the entire population of metropolitan
residents? This is the question we are really trying to answer.
We want to make our point estimate of 12.4 more reliable and at
the same time, give ourselves the ability to make a probability
statement about the confidence we have in our estimate. To do
this, we use the CI equation above to construct a 95%
confidence interval around the sample mean estimate of 12.4.
We have all the information we need to fill in the information
for the formula except for the Z score. The Z score for a 95%
CI is 1.96. From the Z Table, we can find the correct Z score
corresponding to 95 %. Remember that the total area under the
normal distribution/curve equals 100% and that half of that
area, 50%, is above and below the mean. If we are looking for
the Z score corresponding to 95% we first divide 95% in half
leaving a total of 47.5% above and below the mean with 2.5% in
the tail above and below our 95% confidence interval on either
side of the mean. Next we look inside the Z Table (the numbers
corresponding to areas under the normal curve) for the number
that comes closest to .4750 (47.5%) without going under .4750
and identify the corresponding Z score. The correct Z score is
1.96 where the area is .4750.
Now we can solve the equation.
95% CI = 12.4 + & - 1.96 (3.2 / √140 – 1)
= 12.4 + & - 1.96 (3.2 / √139)
= 12.4 + & - 1.96 (3.2 / 11.79)
= 12.4 + & - 1.96 (.27)
= 12.4 + & - .53
12.4 - .53 = 11.87
12.4 + .53 = 12.93
95% CI = 11.87 to 12.93
So what does this interval tell us? It tells us that based on our
sample data; we can be 95 percent confident that the mean
number of self-admitted speeding violations among all residents
of metropolitan areas lies between 11.87 and 12.93 times per
month. That is, theoretically speaking, if we had taken a large
number of random samples from this sample population and
calculated 95% confidence intervals around the means obtained
from each sample, approximately 95% of these intervals would
include the true population mean and 5 percent would not.
Example 2
Let’s say for the sake of argument that we only wanted a 90%
CI about our sample mean, rather than a 95% CI for our point
estimate of 12.4. From the Z Table, we can find the correct Z
score corresponding to 90%. Remember that the total area
under the normal distribution/curve equals 100% and that half
of that area, 50%, is above and below the mean. If we are
looking for the Z score corresponding to 90% we first divide
90% in half leaving a total of 45% above and below the mean
with 5% in the tail above and below our 90% confidence
interval on either side of the mean. Next we look inside the Z
Table (the numbers corresponding to areas under the normal
curve) for the number that comes closest to .4500 (45%) without
going under .4500 and identify the corresponding Z score. The
correct Z score is 1.65 where the area is .4505. A Z score of
1.64 would be incorrect because the area of .4495 is less than
45 percent and thus our CI estimate would not truly be a 90%
confidence level estimate.
As in example 1 we will insert 1.65 into the CI equation and
solve.
90% CI = 12.4 + & - 1.65 (3.2 / √140 – 1)
= 12.4 + & - 1.65 (3.2 / √139)
= 12.4 + & - 1.65 (3.2 / 11.79)
= 12.4 + & - 1.65 (.27)
= 12.4 + & - .44
12.4 - .44 = 11.96
12.4 + .44 = 12.84
90% CI = 11.96 – 12.84
The interval indicates that we are 90 percent confident that the
true population mean speeding violation score falls between
11.96 and 12.84 times per month. Notice that the interval for a
90% confidence interval is narrower than for a 95% confidence
interval. You can see, then, that we are less confident (90
percent vs. 95 percent confident) that our true population means
falls into this interval. By lowering our level of confidence, we
gained some precision in our estimate. We could reduce the
width of our confidence interval even more, but we would pay
the price in levels of confidence.
Example 3
Let’s say that we took a new sample only this time we randomly
select and interview 901 individuals, asking the same questions.
Our sample data for this sample are:
Sample mean = 12.4 times
S = 3.2 times
N = 901
Now lets recalculate our 90% CI.
90% CI = 12.4 + & - 1.65 (3.2 / √901 – 1)
= 12.4 + & - 1.65 (3.2 / √900)
= 12.4 + & - 1.65 (3.2 / 30)
= 12.4 + & - 1.65 ( .11)
= 12.4 + & - .18
12.4 - .18 = 12.22
12.4 + .18 = 12.58
90% CI = 12.22 – 12.58
The interval indicates that we are 90 percent confident that the
true population mean speeding violation score falls between
12.22 and 12.58 times per month. Notice that the interval is
considerably smaller than in Example 2 where the sample size is
140. Why is this? By increasing the sample size, the s.e.
became smaller. We can see this mathematically, but what is
the theoretical reasoning for this change? As our sample size
increased, we captured a greater proportion of the variability in
self-reported speeding violations that exists in the total
population. Consequently, our confidence interval estimate is
more precise. The lesson learned is that whenever you have a
choice between a smaller or a larger sample, choose the larger
sample as your estimates (inferences) about the population will
be more accurate.
Example 4
We have been calculating the confidence interval for samples
where N ≥ 100. What if the sample size is less than 100, N <
100?
In this situation, we must use the two-tailed “T” distribution,
from the Table of T Values. I have provided to you as a pdf
doc. under the “notes” link. We use the two-tailed T
distribution because we are working with a confidence interval
and are concerned with the area between two points on either
side of the mean. This means that we will use the column
headings beneath the label “Level of Significance for Two-
Tailed Test.”
Let’s continue with our effort to estimate the number of self-
reported speeding violations and construct a confidence interval
using the T distribution.
Let’s say we are short on research funds and we are only able to
randomly select and interview 17 individuals and we want to
construct a 90%CI around our estimate of the population mean.
From our sample we obtained the following statistics:
Sample mean = 12.4 times
S = 3.2 times
N = 17
The formula we use is the same as that for samples where N ≥
100 except instead of using Z, we use T. The only trick is to
determine which value of T from the Table of T Values we will
use. The first task is to determine the correct column. For a
90% confidence level we will select the column labeled “.10”.
If we wanted a confidence level of 95% we would select the
column labeled “.05”. If we wanted a confidence level of 98%
we would select the column labeled “.02”. If we wanted a
confidence level of 99% we would select the column labeled
“.01”. These levels (.10, .05, .02, .01) represent the total area
remaining in the two tails of the curve that are outside of our
confidence interval. For example, when we construct a 90%
confidence interval 10% of the area under the curve lies outside
the confidence interval boundaries (100 – 90 = 10) and that
remaining 10% is split equally on either side of the boundary
such that 5% remains below the lower boundary of the
confidence interval and 5% remains above the upper boundary
of the confidence interval. The same logic holds true for any
given level of confidence when we are constructing a
confidence interval.
The second task is to select the correct row. To do this we must
calculate something called the degrees of freedom (abbreviated
df). The degrees of freedom (df) = N -1. In this example, df =
17 – 1 = 16. Now we are able to find the appropriate value for
T to insert into our confidence interval formula. The degrees of
freedom are located in the very first column and begin with 1
and go sequentially through 30 and then moves to 40, 60, 120,
and infinity. Go down the column for df until you arrive at 16.
Go across the row for 16 until you are in the column for .10.
That number is 1.746. Now we are ready to construct our 90%
CI.
90% CI = 12.4 + & - 1.746 ( 3.2 / √17 – 1)
= 12.4 + & - 1.746 (3.2 / √16)
= 12.4 + & - 1.746 (3.2 / 4)
= 12.4 + & - 1.746 (0.8)
= 12.4 + & - 1.397
12.4 – 1.397 = 11.003
12.4 + 1.397 = 13.797
90% CI = 11.003 – 13.797
The interval indicates that we are 90 percent confident that the
true population mean speeding violation score falls between
11.003 to 13.797. Notice that the interval is considerably larger
than the intervals in any of the prior examples. This difference
is due to the same phenomenon I discussed in example 3 above
regarding the effect of sample size on the accuracy of our
estimates of the true population mean.

More Related Content

Similar to Confidence Interval ModuleOne of the key concepts of statist.docx

Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciSelvin Hadi
 
RMH Concise Revision Guide - the Basics of EBM
RMH Concise Revision Guide -  the Basics of EBMRMH Concise Revision Guide -  the Basics of EBM
RMH Concise Revision Guide - the Basics of EBMAyselTuracli
 
Mca admission in india
Mca admission in indiaMca admission in india
Mca admission in indiaEdhole.com
 
Introduction to Statistics - Part 2
Introduction to Statistics - Part 2Introduction to Statistics - Part 2
Introduction to Statistics - Part 2Damian T. Gordon
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervalsTanay Tandon
 
Answer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docxAnswer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docxboyfieldhouse
 
Statistical inference with Python
Statistical inference with PythonStatistical inference with Python
Statistical inference with PythonJohnson Ubah
 
Point and Interval Estimation
Point and Interval EstimationPoint and Interval Estimation
Point and Interval EstimationShubham Mehta
 
Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)Syed Muhammad Danish
 
101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.ppt101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.pptAndrei33323
 
WEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docx
WEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docxWEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docx
WEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docxcockekeshia
 
Sampling methods theory and practice
Sampling methods theory and practice Sampling methods theory and practice
Sampling methods theory and practice Ravindra Sharma
 
Bca admission in india
Bca admission in indiaBca admission in india
Bca admission in indiaEdhole.com
 
Section 7 Analyzing our Marketing Test, Survey Results .docx
Section 7 Analyzing our Marketing Test, Survey Results .docxSection 7 Analyzing our Marketing Test, Survey Results .docx
Section 7 Analyzing our Marketing Test, Survey Results .docxkenjordan97598
 

Similar to Confidence Interval ModuleOne of the key concepts of statist.docx (20)

Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ci
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
RMH Concise Revision Guide - the Basics of EBM
RMH Concise Revision Guide -  the Basics of EBMRMH Concise Revision Guide -  the Basics of EBM
RMH Concise Revision Guide - the Basics of EBM
 
Mca admission in india
Mca admission in indiaMca admission in india
Mca admission in india
 
6. point and interval estimation
6. point and interval estimation6. point and interval estimation
6. point and interval estimation
 
Introduction to Statistics - Part 2
Introduction to Statistics - Part 2Introduction to Statistics - Part 2
Introduction to Statistics - Part 2
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervals
 
Answer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docxAnswer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docx
 
Statistical inference with Python
Statistical inference with PythonStatistical inference with Python
Statistical inference with Python
 
Point and Interval Estimation
Point and Interval EstimationPoint and Interval Estimation
Point and Interval Estimation
 
Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)Basic statistics for pharmaceutical (Part 1)
Basic statistics for pharmaceutical (Part 1)
 
101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.ppt101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.ppt
 
WEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docx
WEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docxWEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docx
WEEK 5 HOMEWORK 5THIS WEEK INVOLVES READING NEW TABLES, THE t-TA.docx
 
Inorganic CHEMISTRY
Inorganic CHEMISTRYInorganic CHEMISTRY
Inorganic CHEMISTRY
 
Sampling methods theory and practice
Sampling methods theory and practice Sampling methods theory and practice
Sampling methods theory and practice
 
Bca admission in india
Bca admission in indiaBca admission in india
Bca admission in india
 
Section 7 Analyzing our Marketing Test, Survey Results .docx
Section 7 Analyzing our Marketing Test, Survey Results .docxSection 7 Analyzing our Marketing Test, Survey Results .docx
Section 7 Analyzing our Marketing Test, Survey Results .docx
 
Estimating a Population Proportion
Estimating a Population ProportionEstimating a Population Proportion
Estimating a Population Proportion
 
Estimating a Population Proportion
Estimating a Population ProportionEstimating a Population Proportion
Estimating a Population Proportion
 
Applied statistics part 1
Applied statistics part 1Applied statistics part 1
Applied statistics part 1
 

More from maxinesmith73660

You have been chosen to present in front of your local governing boa.docx
You have been chosen to present in front of your local governing boa.docxYou have been chosen to present in front of your local governing boa.docx
You have been chosen to present in front of your local governing boa.docxmaxinesmith73660
 
You have been charged with overseeing the implementation of cybersec.docx
You have been charged with overseeing the implementation of cybersec.docxYou have been charged with overseeing the implementation of cybersec.docx
You have been charged with overseeing the implementation of cybersec.docxmaxinesmith73660
 
You have been commissioned to create a manual covering the installat.docx
You have been commissioned to create a manual covering the installat.docxYou have been commissioned to create a manual covering the installat.docx
You have been commissioned to create a manual covering the installat.docxmaxinesmith73660
 
You have been challenged by a mentor you respect and admire to demon.docx
You have been challenged by a mentor you respect and admire to demon.docxYou have been challenged by a mentor you respect and admire to demon.docx
You have been challenged by a mentor you respect and admire to demon.docxmaxinesmith73660
 
You have been chosen as the consultant group to assess the organizat.docx
You have been chosen as the consultant group to assess the organizat.docxYou have been chosen as the consultant group to assess the organizat.docx
You have been chosen as the consultant group to assess the organizat.docxmaxinesmith73660
 
You have been assigned a reading by WMF Petrie; Diospolis Parva (.docx
You have been assigned a reading by WMF Petrie; Diospolis Parva (.docxYou have been assigned a reading by WMF Petrie; Diospolis Parva (.docx
You have been assigned a reading by WMF Petrie; Diospolis Parva (.docxmaxinesmith73660
 
You have been asked to speak to city, municipal, and state elected a.docx
You have been asked to speak to city, municipal, and state elected a.docxYou have been asked to speak to city, municipal, and state elected a.docx
You have been asked to speak to city, municipal, and state elected a.docxmaxinesmith73660
 
You have been asked to provide a presentation, covering the history .docx
You have been asked to provide a presentation, covering the history .docxYou have been asked to provide a presentation, covering the history .docx
You have been asked to provide a presentation, covering the history .docxmaxinesmith73660
 
You have been asked to organize a community health fair at a loc.docx
You have been asked to organize a community health fair at a loc.docxYou have been asked to organize a community health fair at a loc.docx
You have been asked to organize a community health fair at a loc.docxmaxinesmith73660
 
You have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docxYou have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docxmaxinesmith73660
 
You have been asked to evaluate a 3-year-old child in your clinic.  .docx
You have been asked to evaluate a 3-year-old child in your clinic.  .docxYou have been asked to evaluate a 3-year-old child in your clinic.  .docx
You have been asked to evaluate a 3-year-old child in your clinic.  .docxmaxinesmith73660
 
You have been asked to develop UML diagrams to graphically depict .docx
You have been asked to develop UML diagrams to graphically depict .docxYou have been asked to develop UML diagrams to graphically depict .docx
You have been asked to develop UML diagrams to graphically depict .docxmaxinesmith73660
 
You have been asked to develop UML diagrams to graphically depict an.docx
You have been asked to develop UML diagrams to graphically depict an.docxYou have been asked to develop UML diagrams to graphically depict an.docx
You have been asked to develop UML diagrams to graphically depict an.docxmaxinesmith73660
 
You have been asked to develop a quality improvement (QI) process fo.docx
You have been asked to develop a quality improvement (QI) process fo.docxYou have been asked to develop a quality improvement (QI) process fo.docx
You have been asked to develop a quality improvement (QI) process fo.docxmaxinesmith73660
 
You have been asked to design and deliver a Microsoft PowerPoint pre.docx
You have been asked to design and deliver a Microsoft PowerPoint pre.docxYou have been asked to design and deliver a Microsoft PowerPoint pre.docx
You have been asked to design and deliver a Microsoft PowerPoint pre.docxmaxinesmith73660
 
You have been asked to be the project manager for the development of.docx
You have been asked to be the project manager for the development of.docxYou have been asked to be the project manager for the development of.docx
You have been asked to be the project manager for the development of.docxmaxinesmith73660
 
You have been asked to conduct research on a past forensic case to a.docx
You have been asked to conduct research on a past forensic case to a.docxYou have been asked to conduct research on a past forensic case to a.docx
You have been asked to conduct research on a past forensic case to a.docxmaxinesmith73660
 
You have been asked for the summary to include the following compone.docx
You have been asked for the summary to include the following compone.docxYou have been asked for the summary to include the following compone.docx
You have been asked for the summary to include the following compone.docxmaxinesmith73660
 
You have been asked to be the project manager for the developmen.docx
You have been asked to be the project manager for the developmen.docxYou have been asked to be the project manager for the developmen.docx
You have been asked to be the project manager for the developmen.docxmaxinesmith73660
 
You have been asked by management, as a senior member of your co.docx
You have been asked by management, as a senior member of your co.docxYou have been asked by management, as a senior member of your co.docx
You have been asked by management, as a senior member of your co.docxmaxinesmith73660
 

More from maxinesmith73660 (20)

You have been chosen to present in front of your local governing boa.docx
You have been chosen to present in front of your local governing boa.docxYou have been chosen to present in front of your local governing boa.docx
You have been chosen to present in front of your local governing boa.docx
 
You have been charged with overseeing the implementation of cybersec.docx
You have been charged with overseeing the implementation of cybersec.docxYou have been charged with overseeing the implementation of cybersec.docx
You have been charged with overseeing the implementation of cybersec.docx
 
You have been commissioned to create a manual covering the installat.docx
You have been commissioned to create a manual covering the installat.docxYou have been commissioned to create a manual covering the installat.docx
You have been commissioned to create a manual covering the installat.docx
 
You have been challenged by a mentor you respect and admire to demon.docx
You have been challenged by a mentor you respect and admire to demon.docxYou have been challenged by a mentor you respect and admire to demon.docx
You have been challenged by a mentor you respect and admire to demon.docx
 
You have been chosen as the consultant group to assess the organizat.docx
You have been chosen as the consultant group to assess the organizat.docxYou have been chosen as the consultant group to assess the organizat.docx
You have been chosen as the consultant group to assess the organizat.docx
 
You have been assigned a reading by WMF Petrie; Diospolis Parva (.docx
You have been assigned a reading by WMF Petrie; Diospolis Parva (.docxYou have been assigned a reading by WMF Petrie; Diospolis Parva (.docx
You have been assigned a reading by WMF Petrie; Diospolis Parva (.docx
 
You have been asked to speak to city, municipal, and state elected a.docx
You have been asked to speak to city, municipal, and state elected a.docxYou have been asked to speak to city, municipal, and state elected a.docx
You have been asked to speak to city, municipal, and state elected a.docx
 
You have been asked to provide a presentation, covering the history .docx
You have been asked to provide a presentation, covering the history .docxYou have been asked to provide a presentation, covering the history .docx
You have been asked to provide a presentation, covering the history .docx
 
You have been asked to organize a community health fair at a loc.docx
You have been asked to organize a community health fair at a loc.docxYou have been asked to organize a community health fair at a loc.docx
You have been asked to organize a community health fair at a loc.docx
 
You have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docxYou have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docx
 
You have been asked to evaluate a 3-year-old child in your clinic.  .docx
You have been asked to evaluate a 3-year-old child in your clinic.  .docxYou have been asked to evaluate a 3-year-old child in your clinic.  .docx
You have been asked to evaluate a 3-year-old child in your clinic.  .docx
 
You have been asked to develop UML diagrams to graphically depict .docx
You have been asked to develop UML diagrams to graphically depict .docxYou have been asked to develop UML diagrams to graphically depict .docx
You have been asked to develop UML diagrams to graphically depict .docx
 
You have been asked to develop UML diagrams to graphically depict an.docx
You have been asked to develop UML diagrams to graphically depict an.docxYou have been asked to develop UML diagrams to graphically depict an.docx
You have been asked to develop UML diagrams to graphically depict an.docx
 
You have been asked to develop a quality improvement (QI) process fo.docx
You have been asked to develop a quality improvement (QI) process fo.docxYou have been asked to develop a quality improvement (QI) process fo.docx
You have been asked to develop a quality improvement (QI) process fo.docx
 
You have been asked to design and deliver a Microsoft PowerPoint pre.docx
You have been asked to design and deliver a Microsoft PowerPoint pre.docxYou have been asked to design and deliver a Microsoft PowerPoint pre.docx
You have been asked to design and deliver a Microsoft PowerPoint pre.docx
 
You have been asked to be the project manager for the development of.docx
You have been asked to be the project manager for the development of.docxYou have been asked to be the project manager for the development of.docx
You have been asked to be the project manager for the development of.docx
 
You have been asked to conduct research on a past forensic case to a.docx
You have been asked to conduct research on a past forensic case to a.docxYou have been asked to conduct research on a past forensic case to a.docx
You have been asked to conduct research on a past forensic case to a.docx
 
You have been asked for the summary to include the following compone.docx
You have been asked for the summary to include the following compone.docxYou have been asked for the summary to include the following compone.docx
You have been asked for the summary to include the following compone.docx
 
You have been asked to be the project manager for the developmen.docx
You have been asked to be the project manager for the developmen.docxYou have been asked to be the project manager for the developmen.docx
You have been asked to be the project manager for the developmen.docx
 
You have been asked by management, as a senior member of your co.docx
You have been asked by management, as a senior member of your co.docxYou have been asked by management, as a senior member of your co.docx
You have been asked by management, as a senior member of your co.docx
 

Recently uploaded

Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 

Recently uploaded (20)

Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 

Confidence Interval ModuleOne of the key concepts of statist.docx

  • 1. Confidence Interval Module One of the key concepts of statistics enabling statisticians to make incredibly accurate predictions is called the Central Limit Theorem. The Central Limit Theorem is defined in this way: · For samples of a sufficiently large size, the real distribution of means is almost always approximately normal. · The distribution of means gets closer and closer to normal as the sample size gets larger and larger, regardless of what the original variable looks like (positively or negatively skewed). · In other words, the original variable does not have to be normally distributed. · This is because, if we as eccentric researchers, drew an almost infinite number of random samples from a single population (such as the student body of NMSU), the means calculated from the many samples of that population will be normally distributed and the mean calculated from all of those samples would be a very close approximation to the true population mean. It is this very characteristic that makes it possible for us, using sound probability based sampling techniques, to make highly accurate statements about characteristics of a population based upon the statistics calculated on a sample drawn from that population. · Furthermore, we can calculate a statistic known as the standard error of the mean (abbreviated s.e.) that describes the variability of the distribution of all possible sample means in the same way that we used the standard deviation to describe
  • 2. the variability of a single sample. We will use the standard error of the mean (s.e.) to calculate the statistic that is the topic of this module, the confidence interval. The formula that we use to calculate the standard error of the mean is: s.e. = s / √N – 1 where s = the standard deviation calculated from the sample; and N = the sample size. So the formula tells us that the standard error of the mean is equal to the standard deviation divided by the square root of the sample size minus 1.
  • 3. This is the preferred formula for practicing professionals as it accounts for errors that may be a function of the particular sample we have selected. THE CONFIDENCE INTERVAL (CI) The formula for the CI is a function of the sample size (N). For samples sizes ≥ 100, the formula for the CI is: CI = (the sample mean) + & - Z(s.e.). Let’s look at an example to see how this formula works. * Please use a pdf doc. “how to solve the problem”, I have provided for you under the “notes” link. Example 1 Suppose that we conducted interviews with 140 randomly selected individuals (N = 140) in a large metropolitan area. We assured these individuals that their answers would remain confidential, and we asked them about their law-breaking behavior. Among other questions the individuals were asked to self-report the number of times per month they exceeded the speed limit. One of the objectives of the study was to estimate (make an inference about) the average number of times per month residents in all metropolitan areas across the country exceeded the speed limit. The sample statistics we obtained were as follows: Mean = 12.4 times
  • 4. S = 3.2 times N = 140 Let’s construct a 95% CI around our estimate of the mean drawn from this sample. The sample mean of 12.4 times tells us that, on average, the individuals from our sample exceed the speed limit about 12.4 times a month. This sample mean estimate is our best point estimate of the true population mean. We know full well that 12.4 times is not the true population mean and that repeated samples will yield different means. What does our sample mean tell us about the mean of the entire population of metropolitan residents? This is the question we are really trying to answer. We want to make our point estimate of 12.4 more reliable and at the same time, give ourselves the ability to make a probability statement about the confidence we have in our estimate. To do this, we use the CI equation above to construct a 95% confidence interval around the sample mean estimate of 12.4. We have all the information we need to fill in the information for the formula except for the Z score. The Z score for a 95% CI is 1.96. From the Z Table, we can find the correct Z score corresponding to 95 %. Remember that the total area under the normal distribution/curve equals 100% and that half of that area, 50%, is above and below the mean. If we are looking for the Z score corresponding to 95% we first divide 95% in half leaving a total of 47.5% above and below the mean with 2.5% in the tail above and below our 95% confidence interval on either side of the mean. Next we look inside the Z Table (the numbers
  • 5. corresponding to areas under the normal curve) for the number that comes closest to .4750 (47.5%) without going under .4750 and identify the corresponding Z score. The correct Z score is 1.96 where the area is .4750. Now we can solve the equation. 95% CI = 12.4 + & - 1.96 (3.2 / √140 – 1) = 12.4 + & - 1.96 (3.2 / √139) = 12.4 + & - 1.96 (3.2 / 11.79) = 12.4 + & - 1.96 (.27) = 12.4 + & - .53 12.4 - .53 = 11.87
  • 6. 12.4 + .53 = 12.93 95% CI = 11.87 to 12.93 So what does this interval tell us? It tells us that based on our sample data; we can be 95 percent confident that the mean number of self-admitted speeding violations among all residents of metropolitan areas lies between 11.87 and 12.93 times per month. That is, theoretically speaking, if we had taken a large number of random samples from this sample population and calculated 95% confidence intervals around the means obtained from each sample, approximately 95% of these intervals would include the true population mean and 5 percent would not. Example 2 Let’s say for the sake of argument that we only wanted a 90% CI about our sample mean, rather than a 95% CI for our point estimate of 12.4. From the Z Table, we can find the correct Z score corresponding to 90%. Remember that the total area under the normal distribution/curve equals 100% and that half of that area, 50%, is above and below the mean. If we are looking for the Z score corresponding to 90% we first divide 90% in half leaving a total of 45% above and below the mean with 5% in the tail above and below our 90% confidence interval on either side of the mean. Next we look inside the Z Table (the numbers corresponding to areas under the normal curve) for the number that comes closest to .4500 (45%) without going under .4500 and identify the corresponding Z score. The correct Z score is 1.65 where the area is .4505. A Z score of 1.64 would be incorrect because the area of .4495 is less than 45 percent and thus our CI estimate would not truly be a 90%
  • 7. confidence level estimate. As in example 1 we will insert 1.65 into the CI equation and solve. 90% CI = 12.4 + & - 1.65 (3.2 / √140 – 1) = 12.4 + & - 1.65 (3.2 / √139) = 12.4 + & - 1.65 (3.2 / 11.79) = 12.4 + & - 1.65 (.27) = 12.4 + & - .44
  • 8. 12.4 - .44 = 11.96 12.4 + .44 = 12.84 90% CI = 11.96 – 12.84 The interval indicates that we are 90 percent confident that the true population mean speeding violation score falls between 11.96 and 12.84 times per month. Notice that the interval for a 90% confidence interval is narrower than for a 95% confidence interval. You can see, then, that we are less confident (90 percent vs. 95 percent confident) that our true population means falls into this interval. By lowering our level of confidence, we gained some precision in our estimate. We could reduce the width of our confidence interval even more, but we would pay the price in levels of confidence. Example 3 Let’s say that we took a new sample only this time we randomly select and interview 901 individuals, asking the same questions. Our sample data for this sample are: Sample mean = 12.4 times
  • 9. S = 3.2 times N = 901 Now lets recalculate our 90% CI. 90% CI = 12.4 + & - 1.65 (3.2 / √901 – 1) = 12.4 + & - 1.65 (3.2 / √900) = 12.4 + & - 1.65 (3.2 / 30) = 12.4 + & - 1.65 ( .11) = 12.4 + & - .18
  • 10. 12.4 - .18 = 12.22 12.4 + .18 = 12.58 90% CI = 12.22 – 12.58 The interval indicates that we are 90 percent confident that the true population mean speeding violation score falls between 12.22 and 12.58 times per month. Notice that the interval is considerably smaller than in Example 2 where the sample size is 140. Why is this? By increasing the sample size, the s.e. became smaller. We can see this mathematically, but what is the theoretical reasoning for this change? As our sample size increased, we captured a greater proportion of the variability in self-reported speeding violations that exists in the total population. Consequently, our confidence interval estimate is more precise. The lesson learned is that whenever you have a choice between a smaller or a larger sample, choose the larger sample as your estimates (inferences) about the population will be more accurate. Example 4 We have been calculating the confidence interval for samples where N ≥ 100. What if the sample size is less than 100, N <
  • 11. 100? In this situation, we must use the two-tailed “T” distribution, from the Table of T Values. I have provided to you as a pdf doc. under the “notes” link. We use the two-tailed T distribution because we are working with a confidence interval and are concerned with the area between two points on either side of the mean. This means that we will use the column headings beneath the label “Level of Significance for Two- Tailed Test.” Let’s continue with our effort to estimate the number of self- reported speeding violations and construct a confidence interval using the T distribution. Let’s say we are short on research funds and we are only able to randomly select and interview 17 individuals and we want to construct a 90%CI around our estimate of the population mean. From our sample we obtained the following statistics: Sample mean = 12.4 times S = 3.2 times N = 17
  • 12. The formula we use is the same as that for samples where N ≥ 100 except instead of using Z, we use T. The only trick is to determine which value of T from the Table of T Values we will use. The first task is to determine the correct column. For a 90% confidence level we will select the column labeled “.10”. If we wanted a confidence level of 95% we would select the column labeled “.05”. If we wanted a confidence level of 98% we would select the column labeled “.02”. If we wanted a confidence level of 99% we would select the column labeled “.01”. These levels (.10, .05, .02, .01) represent the total area remaining in the two tails of the curve that are outside of our confidence interval. For example, when we construct a 90% confidence interval 10% of the area under the curve lies outside the confidence interval boundaries (100 – 90 = 10) and that remaining 10% is split equally on either side of the boundary such that 5% remains below the lower boundary of the confidence interval and 5% remains above the upper boundary of the confidence interval. The same logic holds true for any given level of confidence when we are constructing a confidence interval. The second task is to select the correct row. To do this we must calculate something called the degrees of freedom (abbreviated df). The degrees of freedom (df) = N -1. In this example, df = 17 – 1 = 16. Now we are able to find the appropriate value for T to insert into our confidence interval formula. The degrees of freedom are located in the very first column and begin with 1 and go sequentially through 30 and then moves to 40, 60, 120, and infinity. Go down the column for df until you arrive at 16. Go across the row for 16 until you are in the column for .10. That number is 1.746. Now we are ready to construct our 90% CI. 90% CI = 12.4 + & - 1.746 ( 3.2 / √17 – 1)
  • 13. = 12.4 + & - 1.746 (3.2 / √16) = 12.4 + & - 1.746 (3.2 / 4) = 12.4 + & - 1.746 (0.8) = 12.4 + & - 1.397 12.4 – 1.397 = 11.003 12.4 + 1.397 = 13.797
  • 14. 90% CI = 11.003 – 13.797 The interval indicates that we are 90 percent confident that the true population mean speeding violation score falls between 11.003 to 13.797. Notice that the interval is considerably larger than the intervals in any of the prior examples. This difference is due to the same phenomenon I discussed in example 3 above regarding the effect of sample size on the accuracy of our estimates of the true population mean.