SlideShare a Scribd company logo
1 of 27
Download to read offline
Data Analysis and Statistical Methods
Statistics 651
http://www.stat.tamu.edu/~suhasini/teaching.html
Lecture 13 (MWF) Designing the experiment: Margin of Error
Suhasini Subba Rao
Lecture 13 (MWF) Confidence intervals and Margin of Error
Terminology: The population and sample mean
• The population mean µ is the mean of the entire population. The
population can be (and often is) infinite.
• Suppose that X1, . . . , Xn are numbers drawn from the population. The
sample mean is the average of X1, . . . , Xn.
1
Lecture 13 (MWF) Confidence intervals and Margin of Error
Terminology: Standard deviations and errors
• The standard deviation is a measure of variation/spread of a variable (in
the population). This is typically denoted as σ. See Lecture 4.
• The standard error is a measure of variation/spread of the sample mean.
The standard error of the sample mean is
σ
√
n
.
See Lecture 12.
• Usually, σ is unknown. To get some idea of the spread, we estimate it
2
Lecture 13 (MWF) Confidence intervals and Margin of Error
from the sample {Xi}n
i=1 using the formula
s =
v
u
u
t 1
n − 1
n
X
i=1
(Xi − X̄)2.
We call s the sample standard deviation. It is an estimator of the
standard deviation σ. Usually s 6= σ. Often s < σ (especially, when the
sample size is not large).
• Since σ is usually unknown, the standard error σ/
√
n is usually unknown.
Instead we estimate it using the sample standard error is
s
√
n
.
3
Lecture 13 (MWF) Confidence intervals and Margin of Error
Margin of Error
• According to a recent survey, Americans walk on average 18 miles a week
with a margin of error of 2.5 miles.
• What does this mean in terms of confidence intervals?
• 18 miles corresponds to the (sample mean) average number of miles
walked in the sample, the margin of error is the plus and minus in the
confidence interval.
In other words for a 95% confidence interval
Margin of Error = 1.96 ×
σ
√
n
.
• The smaller the margin of error the more precisely we can pin point the
population mean. Of course it is worth bearing in mind that we can
4
Lecture 13 (MWF) Confidence intervals and Margin of Error
never be sure that our confidence interval contains the mean, which is
why we prescribe a level (such as a 95%) to the interval. In other words,
we can never be sure that the population mean is within the prescribed
margin of error of the sample mean.
• Of course, it is clear a small margin error gives a better level of precision
than a large one.
• In the next few slides, we recall factors which influence the MoE.
5
Lecture 13 (MWF) Confidence intervals and Margin of Error
Relationships: Sample size and MoE
• We compare the 95% confidence intervals for n = 9 and n = 25. We see
n=9

X̄ − 1.96
σ
√
9
, X̄ + 1.96
σ
√
9

n=25

X̄ − 1.96
σ
√
25
, X̄ + 1.96
σ
√
25

What are the lengths of the above intervals?
– For n=9 the margin of error is 1.96 × σ
√
9
.
– For n=25 the margin of error is 1.96 × σ
√
25
.
Important The margin of error does not depend on X̄.
6
Lecture 13 (MWF) Confidence intervals and Margin of Error
Example: X̄ = 10.38, σ =
√
33.
n=9 [10.38 ± 1.96
√
33
√
9
] = [6.63, 14.13] MoE=3.75
n=25 [10.38 ± 1.96
√
33
√
25
] = [8.12, 12.63] MoE = 2.255
• The second interval has a smaller margin of error.
• When the sample size is large the estimator “tends” to be closer to the
true parameter. Thus the confidence interval will be narrower; since the
margin of error is smaller.
7
Lecture 13 (MWF) Confidence intervals and Margin of Error
Relationships: the standard deviation and MoE
• The variability in the sample, measured by σ will have an impact of the
reliability of an estimator and its margin of error.
• Example: Suppose X̄ = 10.38, n = 9, but the variability in the two
populations are different:
σ = 5.7

10.38 ± 1.96
5.7
√
9

= [6.63, 14.13], MoE = 3.75
σ = 10

10.38 ± 1.96
10
√
9

= [3.38, 16.91], MoE = 6.8
• The more variablility within the population (as measure by the standard
deviation) the more variability in the sample mean (as measure by the
standard error). Usually we cannot do anything about this.
8
Lecture 13 (MWF) Confidence intervals and Margin of Error
How large an interval to use?
• You read in a newspaper that “the proportion of the public that support
same-sex marriage is 55% ± 15%.”
• This means a survey was done, the proportion in the survey who said
they supported same-sex marriage was 55% and the confidence interval
for the population proportion is [55 − 15, 55 + 15]% = [40, 70]%.
• This is an extremely large interval, it is so wide, that it is uninformative
about the majority opinion of the public.
• The reason it is too wide is that the sample size is too small.
• Conclusion: This experiment was not designed well.
9
Lecture 13 (MWF) Confidence intervals and Margin of Error
• Typically, before data is calculated, we need to decide how large a sample
to collect.
• This is usually done by deciding how much “above and below” the
estimator is acceptable. For example, an interval of the type [55-
3,55+3]% = [52,58]% tells us that the majority appear to support
same-sex marriage.
• The 3% is is the margin of error. Given a margin of error we can then
determine the sample size to collect.
10
Lecture 13 (MWF) Confidence intervals and Margin of Error
Choosing the sample size for estimating µ
• In an ideal world we would have a large sample size. A large sample size
gives a small standard error, which in turn yields a small margin of error.
• However, obtaining very large samples can be impossible for many
different reasons.
• How can one determine the number of observations to be included in a
sample? How to choose the sample size n?
Answer: Usually we have a margin of error in mind. We can accept the
reliability of a estimator up to a certain margin or error. Once we know
what margin of error is acceptable we can then choose the sample size.
11
Lecture 13 (MWF) Confidence intervals and Margin of Error
Formula for choosing the sample size
• To choose the sample size according to the margin of error, we need
to know (or guess apriori) the standard deviation σ (if we don’t know
what it is, then we err on the cautious side and use a value that seems
reasonable but large).
• We recall that in the confidence interval:

X̄ − 1.96
σ
√
n
, X̄ + 1.96
σ
√
n

the margin of error is MoE = 1.96 σ
√
n
.
• Therefore, if we want to choose the sample size such that the margin of
12
Lecture 13 (MWF) Confidence intervals and Margin of Error
error for a given E we need to solve for
MoE = 1.96
σ
√
n
solving for n gives n =

1.96σ
E
2
.
13
Lecture 13 (MWF) Confidence intervals and Margin of Error
• Example: Suppose we guess that σ =
√
3 and we want the margin of
error MoE = 0.25. The confidence interval is

X̄ − 1.96 ×
√
3
√
n
, X̄ + 1.96 ×
√
3
√
n
#
and we solve 1.96 ×
√
3
√
n
= 0.25. This gives
n =
2 × 1.96
√
3
0.5
2
= 184.4
• Of course, a larger value of n will give a smaller margin of error, so we
round up and use n = 185. In other words, for this experiment we need
to choose a sample size of at least 185 to be sure that the margin error
is at most 0.25.
14
Lecture 13 (MWF) Confidence intervals and Margin of Error
The general formula
• Changing the confidence level simply changes 1.96 to another number.
• If the objective is to use a 99% CI, we first look up 0.5% in the z-tables,
z0.5% = 2.57. Then we solve MoE = 2.57 σ
√
n
n =

2.57 × σ
MoE
2
.
• In general, for the (100 − α)% CI use the formula MoE = |zα/2| σ
√
n
and
solve it to give
n =

|zα/2| × σ
MoE
2
.
15
Lecture 13 (MWF) Confidence intervals and Margin of Error
Example 4: Heights
• Researchers want to estimate the mean height of students at a university
(in meters) with a margin of error of 0.04 (using a 95% CI level). The
sample standard deviation from a small sample taken previously is 0.113.
How many students must they sample to achieve their specifications?
• Solution 4:
– Since the true population standard deviation is unknown, they use the
sample standard deviation in the calculation.
– Use the formula with MoE= 0.04 gives the solution
n =
(1.96)2
(0.113)2
(0.04)2
= 30.65.
– They must sample 31 people such that a 95% confidence interval has
length 2 × 0.04 = 0.08.
16
Lecture 13 (MWF) Confidence intervals and Margin of Error
Example 5 (more practice): Caffine content
• The caffine content is coffee is being analysed and it is known that
standard deviation of a randomly selected coffee is 7.1mg.
Suppose 100 cups of coffee are analysed, and the total weight of caffine
in all the cups is
P100
i=1 Xi = 11000mg, construct a 95% CI for the mean
caffine content.
Construct an 80% CI for the mean caffine.
Find the minimum number of coffees which must be analysed for the
80% CI to have MoE 0.45mg?
• Solution 5:
• The total weight of caffine for the 100 cups is 11000 mg. Therefore the
sample average of caffine per cup is x̄ = 11000/100 = 110.
17
Lecture 13 (MWF) Confidence intervals and Margin of Error
• Calculating the 95% CI (use the formala or calculate yourself): z0.05/2 =
z0.025 = 1.96, n = 100, σ = 7.1 and X̄ = 110.
• The CI is:

110 − 1.96 ×
7.1
√
100
, 110 + 1.96 ×
7.1
√
100

= [108.6, 111.4] .
• To construct an 80% CI only one thing has to change, that is we only
have to replace the 1.96 above with another number. To find this value
go to the normal table an look inside it for 0.1, you should see −1.28.
Replace 1.96 with 1.28 to give

110 − 1.28 ×
7.1
√
100
, 110 + 1.28 ×
7.1
√
100

= [109.1, 110.9].
18
Lecture 13 (MWF) Confidence intervals and Margin of Error
• If we want the MoE to have length 0.45, then the interval

X̄ − 1.28 ×
7.1
√
n
, X̄ + 1.28 ×
7.1
√
n

must have length 1. This means that
MoE = 0.5 = 1.28 ×
7.1
√
n
.
Solve this (or use the formula) to give
n =

1.28 × 7.1
0.45
2
= 400.
• Hence we need to sample at least 400 cups to obtain a margin of error
which is 0.45 (half of what existed previously).
19
Lecture 13 (MWF) Confidence intervals and Margin of Error
Example 6 (when standard deviation is unknown)
How large a sample size do we require such that the margin of error
for a 95% confidence interval for the mean of human heights is maximum
0.25 inch. The standard deviation is unknown, but it is believed that σ lies
somewhere between 2-5 inches.
• Why this question matters: In general the standard deviation will be
unknown. But we can ‘guess’ limits on how large or small it is based on
own expertize.
20
Lecture 13 (MWF) Confidence intervals and Margin of Error
Solution 6
The more variable the data the larger the confidence interval. Therefore,
when given a range of standard deviations and our aim is that the margin
of error should be no larger than 0.25 (i.e. 0.25 or less), then we need to
use the largest standard deviation in the given range in the calculation
• In other words
n =

1.96 × σlargest
0.25
2
=

1.96 × 5
0.25
2
= 1537.
• For any other σ  5, using n = 1537 will lead to a Margin of Error which
is less than 0.25. To see why, recall
MoE = 1.96 ×
σ
√
n
= 1.96 ×
σ
1.96×5
0.25
 = 0.25 ×
σ
5
.
21
Lecture 13 (MWF) Confidence intervals and Margin of Error
Thus we see that if the true σ  5, then the MoE will be less than 0.25,
since σ/5  1.
• If we use σ = 2 in the margin of error calculation, then n = 246.
However, if the true σ  2 using n = 246 will lead to a Margin of Error
which is larger than 0.25.
22
Lecture 13 (MWF) Confidence intervals and Margin of Error
Margin of Error calculations using software
• There are various software tools on the web that will do margin of error
calculations. For example,
https://www.emathhelp.net/calculators/probability-statistics/margin-of-error-calculator/
• Here is one by survey monkey
https://www.surveymonkey.com/mp/margin-of-error-calculator/. This
calculator is specifically designed for calculating the MoE of proportions;
where the standard deviation need not be specified. Here is another one
http://www.raosoft.com/samplesize.html, which which can give smaller
sample sizes if a proportion is specified. We cover this later on in the
course.
• The calculations done in class assume that the population size is infinite
(or that the sample is a SRS, that is the same person is sampled again).
23
Lecture 13 (MWF) Confidence intervals and Margin of Error
However, when surveying certain populations the population size will be
finite.
• Therefore, some calculators will also ask for the population size. Using
the finite population size they make what is called a finite sample
correction. You can read more about it here:
https://en.wikipedia.org/wiki/Margin_of_error#Effect_of_population_size.
24
Lecture 13 (MWF) Confidence intervals and Margin of Error
Example 7
A confidence interval for the length of parrots is [4,10] inches. It is
based on a sample size n. By what factor should the sample size increase
such that the margin of error reduces to 1?
25
Lecture 13 (MWF) Confidence intervals and Margin of Error
Solution 7
• The original margin of error is 3. Thus 1.96 × σ/
√
n = 3.
• We want to increase the sample size such that it decreases to 1.
1.96 ×
σ
√
Factor × n
= 1
⇒ 1.96 ×
σ
√
Factor ×
√
n
=
3
√
Factor
= 1.
Solving for this we see that we need to increase the sample size by factor
9 in order to decrease the margin of error by a factor 3.
• Conclusion An extremely large increase in sample size has to be made
for a moderate reduction in margin of error.
26

More Related Content

Similar to lecture13MWF.pdflecture13MWF.pdflecture13MWF.pdf

Tbs910 sampling hypothesis regression
Tbs910 sampling hypothesis regressionTbs910 sampling hypothesis regression
Tbs910 sampling hypothesis regressionStephen Ong
 
Sampling
SamplingSampling
Samplingal amin
 
Quantitative MethodsChoosing a Sample.pptxChoosing a Samp.docx
Quantitative MethodsChoosing a Sample.pptxChoosing a Samp.docxQuantitative MethodsChoosing a Sample.pptxChoosing a Samp.docx
Quantitative MethodsChoosing a Sample.pptxChoosing a Samp.docxamrit47
 
Module 7 Interval estimatorsMaster for Business Statistics.docx
Module 7 Interval estimatorsMaster for Business Statistics.docxModule 7 Interval estimatorsMaster for Business Statistics.docx
Module 7 Interval estimatorsMaster for Business Statistics.docxgilpinleeanna
 
Lesson04_new
Lesson04_newLesson04_new
Lesson04_newshengvn
 
Answer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docxAnswer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docxboyfieldhouse
 
Confidence Level and Sample Size
Confidence Level and Sample SizeConfidence Level and Sample Size
Confidence Level and Sample SizeEmilio Fer Villa
 
Lecture-3 inferential stastistics.ppt
Lecture-3 inferential stastistics.pptLecture-3 inferential stastistics.ppt
Lecture-3 inferential stastistics.pptfantahungedamu
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervalsTanay Tandon
 
Findings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport WritingFindings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport WritingShainaBoling829
 
Lesson04_Static11
Lesson04_Static11Lesson04_Static11
Lesson04_Static11thangv
 
How to compute for sample size.pptx
How to compute for sample size.pptxHow to compute for sample size.pptx
How to compute for sample size.pptxnoelmartinez003
 
101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.ppt101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.pptAndrei33323
 
Chapter 8 review
Chapter 8 reviewChapter 8 review
Chapter 8 reviewdrahkos1
 
Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciSelvin Hadi
 

Similar to lecture13MWF.pdflecture13MWF.pdflecture13MWF.pdf (20)

Estimation
EstimationEstimation
Estimation
 
Chapter09
Chapter09Chapter09
Chapter09
 
Tbs910 sampling hypothesis regression
Tbs910 sampling hypothesis regressionTbs910 sampling hypothesis regression
Tbs910 sampling hypothesis regression
 
Sampling
SamplingSampling
Sampling
 
Quantitative MethodsChoosing a Sample.pptxChoosing a Samp.docx
Quantitative MethodsChoosing a Sample.pptxChoosing a Samp.docxQuantitative MethodsChoosing a Sample.pptxChoosing a Samp.docx
Quantitative MethodsChoosing a Sample.pptxChoosing a Samp.docx
 
Taxi for Professor Evans
Taxi for Professor EvansTaxi for Professor Evans
Taxi for Professor Evans
 
Module 7 Interval estimatorsMaster for Business Statistics.docx
Module 7 Interval estimatorsMaster for Business Statistics.docxModule 7 Interval estimatorsMaster for Business Statistics.docx
Module 7 Interval estimatorsMaster for Business Statistics.docx
 
Lesson04_new
Lesson04_newLesson04_new
Lesson04_new
 
Estimating a Population Mean
Estimating a Population MeanEstimating a Population Mean
Estimating a Population Mean
 
Answer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docxAnswer the questions in one paragraph 4-5 sentences. · Why did t.docx
Answer the questions in one paragraph 4-5 sentences. · Why did t.docx
 
Confidence Level and Sample Size
Confidence Level and Sample SizeConfidence Level and Sample Size
Confidence Level and Sample Size
 
Lecture-3 inferential stastistics.ppt
Lecture-3 inferential stastistics.pptLecture-3 inferential stastistics.ppt
Lecture-3 inferential stastistics.ppt
 
Practice Test 3B Solution
Practice Test 3B SolutionPractice Test 3B Solution
Practice Test 3B Solution
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervals
 
Findings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport WritingFindings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport Writing
 
Lesson04_Static11
Lesson04_Static11Lesson04_Static11
Lesson04_Static11
 
How to compute for sample size.pptx
How to compute for sample size.pptxHow to compute for sample size.pptx
How to compute for sample size.pptx
 
101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.ppt101_sampling__population_Sept_2020.ppt
101_sampling__population_Sept_2020.ppt
 
Chapter 8 review
Chapter 8 reviewChapter 8 review
Chapter 8 review
 
Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ci
 

Recently uploaded

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 

lecture13MWF.pdflecture13MWF.pdflecture13MWF.pdf

  • 1. Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 13 (MWF) Designing the experiment: Margin of Error Suhasini Subba Rao
  • 2. Lecture 13 (MWF) Confidence intervals and Margin of Error Terminology: The population and sample mean • The population mean µ is the mean of the entire population. The population can be (and often is) infinite. • Suppose that X1, . . . , Xn are numbers drawn from the population. The sample mean is the average of X1, . . . , Xn. 1
  • 3. Lecture 13 (MWF) Confidence intervals and Margin of Error Terminology: Standard deviations and errors • The standard deviation is a measure of variation/spread of a variable (in the population). This is typically denoted as σ. See Lecture 4. • The standard error is a measure of variation/spread of the sample mean. The standard error of the sample mean is σ √ n . See Lecture 12. • Usually, σ is unknown. To get some idea of the spread, we estimate it 2
  • 4. Lecture 13 (MWF) Confidence intervals and Margin of Error from the sample {Xi}n i=1 using the formula s = v u u t 1 n − 1 n X i=1 (Xi − X̄)2. We call s the sample standard deviation. It is an estimator of the standard deviation σ. Usually s 6= σ. Often s < σ (especially, when the sample size is not large). • Since σ is usually unknown, the standard error σ/ √ n is usually unknown. Instead we estimate it using the sample standard error is s √ n . 3
  • 5. Lecture 13 (MWF) Confidence intervals and Margin of Error Margin of Error • According to a recent survey, Americans walk on average 18 miles a week with a margin of error of 2.5 miles. • What does this mean in terms of confidence intervals? • 18 miles corresponds to the (sample mean) average number of miles walked in the sample, the margin of error is the plus and minus in the confidence interval. In other words for a 95% confidence interval Margin of Error = 1.96 × σ √ n . • The smaller the margin of error the more precisely we can pin point the population mean. Of course it is worth bearing in mind that we can 4
  • 6. Lecture 13 (MWF) Confidence intervals and Margin of Error never be sure that our confidence interval contains the mean, which is why we prescribe a level (such as a 95%) to the interval. In other words, we can never be sure that the population mean is within the prescribed margin of error of the sample mean. • Of course, it is clear a small margin error gives a better level of precision than a large one. • In the next few slides, we recall factors which influence the MoE. 5
  • 7. Lecture 13 (MWF) Confidence intervals and Margin of Error Relationships: Sample size and MoE • We compare the 95% confidence intervals for n = 9 and n = 25. We see n=9 X̄ − 1.96 σ √ 9 , X̄ + 1.96 σ √ 9 n=25 X̄ − 1.96 σ √ 25 , X̄ + 1.96 σ √ 25 What are the lengths of the above intervals? – For n=9 the margin of error is 1.96 × σ √ 9 . – For n=25 the margin of error is 1.96 × σ √ 25 . Important The margin of error does not depend on X̄. 6
  • 8. Lecture 13 (MWF) Confidence intervals and Margin of Error Example: X̄ = 10.38, σ = √ 33. n=9 [10.38 ± 1.96 √ 33 √ 9 ] = [6.63, 14.13] MoE=3.75 n=25 [10.38 ± 1.96 √ 33 √ 25 ] = [8.12, 12.63] MoE = 2.255 • The second interval has a smaller margin of error. • When the sample size is large the estimator “tends” to be closer to the true parameter. Thus the confidence interval will be narrower; since the margin of error is smaller. 7
  • 9. Lecture 13 (MWF) Confidence intervals and Margin of Error Relationships: the standard deviation and MoE • The variability in the sample, measured by σ will have an impact of the reliability of an estimator and its margin of error. • Example: Suppose X̄ = 10.38, n = 9, but the variability in the two populations are different: σ = 5.7 10.38 ± 1.96 5.7 √ 9 = [6.63, 14.13], MoE = 3.75 σ = 10 10.38 ± 1.96 10 √ 9 = [3.38, 16.91], MoE = 6.8 • The more variablility within the population (as measure by the standard deviation) the more variability in the sample mean (as measure by the standard error). Usually we cannot do anything about this. 8
  • 10. Lecture 13 (MWF) Confidence intervals and Margin of Error How large an interval to use? • You read in a newspaper that “the proportion of the public that support same-sex marriage is 55% ± 15%.” • This means a survey was done, the proportion in the survey who said they supported same-sex marriage was 55% and the confidence interval for the population proportion is [55 − 15, 55 + 15]% = [40, 70]%. • This is an extremely large interval, it is so wide, that it is uninformative about the majority opinion of the public. • The reason it is too wide is that the sample size is too small. • Conclusion: This experiment was not designed well. 9
  • 11. Lecture 13 (MWF) Confidence intervals and Margin of Error • Typically, before data is calculated, we need to decide how large a sample to collect. • This is usually done by deciding how much “above and below” the estimator is acceptable. For example, an interval of the type [55- 3,55+3]% = [52,58]% tells us that the majority appear to support same-sex marriage. • The 3% is is the margin of error. Given a margin of error we can then determine the sample size to collect. 10
  • 12. Lecture 13 (MWF) Confidence intervals and Margin of Error Choosing the sample size for estimating µ • In an ideal world we would have a large sample size. A large sample size gives a small standard error, which in turn yields a small margin of error. • However, obtaining very large samples can be impossible for many different reasons. • How can one determine the number of observations to be included in a sample? How to choose the sample size n? Answer: Usually we have a margin of error in mind. We can accept the reliability of a estimator up to a certain margin or error. Once we know what margin of error is acceptable we can then choose the sample size. 11
  • 13. Lecture 13 (MWF) Confidence intervals and Margin of Error Formula for choosing the sample size • To choose the sample size according to the margin of error, we need to know (or guess apriori) the standard deviation σ (if we don’t know what it is, then we err on the cautious side and use a value that seems reasonable but large). • We recall that in the confidence interval: X̄ − 1.96 σ √ n , X̄ + 1.96 σ √ n the margin of error is MoE = 1.96 σ √ n . • Therefore, if we want to choose the sample size such that the margin of 12
  • 14. Lecture 13 (MWF) Confidence intervals and Margin of Error error for a given E we need to solve for MoE = 1.96 σ √ n solving for n gives n = 1.96σ E 2 . 13
  • 15. Lecture 13 (MWF) Confidence intervals and Margin of Error • Example: Suppose we guess that σ = √ 3 and we want the margin of error MoE = 0.25. The confidence interval is X̄ − 1.96 × √ 3 √ n , X̄ + 1.96 × √ 3 √ n # and we solve 1.96 × √ 3 √ n = 0.25. This gives n = 2 × 1.96 √ 3 0.5 2 = 184.4 • Of course, a larger value of n will give a smaller margin of error, so we round up and use n = 185. In other words, for this experiment we need to choose a sample size of at least 185 to be sure that the margin error is at most 0.25. 14
  • 16. Lecture 13 (MWF) Confidence intervals and Margin of Error The general formula • Changing the confidence level simply changes 1.96 to another number. • If the objective is to use a 99% CI, we first look up 0.5% in the z-tables, z0.5% = 2.57. Then we solve MoE = 2.57 σ √ n n = 2.57 × σ MoE 2 . • In general, for the (100 − α)% CI use the formula MoE = |zα/2| σ √ n and solve it to give n = |zα/2| × σ MoE 2 . 15
  • 17. Lecture 13 (MWF) Confidence intervals and Margin of Error Example 4: Heights • Researchers want to estimate the mean height of students at a university (in meters) with a margin of error of 0.04 (using a 95% CI level). The sample standard deviation from a small sample taken previously is 0.113. How many students must they sample to achieve their specifications? • Solution 4: – Since the true population standard deviation is unknown, they use the sample standard deviation in the calculation. – Use the formula with MoE= 0.04 gives the solution n = (1.96)2 (0.113)2 (0.04)2 = 30.65. – They must sample 31 people such that a 95% confidence interval has length 2 × 0.04 = 0.08. 16
  • 18. Lecture 13 (MWF) Confidence intervals and Margin of Error Example 5 (more practice): Caffine content • The caffine content is coffee is being analysed and it is known that standard deviation of a randomly selected coffee is 7.1mg. Suppose 100 cups of coffee are analysed, and the total weight of caffine in all the cups is P100 i=1 Xi = 11000mg, construct a 95% CI for the mean caffine content. Construct an 80% CI for the mean caffine. Find the minimum number of coffees which must be analysed for the 80% CI to have MoE 0.45mg? • Solution 5: • The total weight of caffine for the 100 cups is 11000 mg. Therefore the sample average of caffine per cup is x̄ = 11000/100 = 110. 17
  • 19. Lecture 13 (MWF) Confidence intervals and Margin of Error • Calculating the 95% CI (use the formala or calculate yourself): z0.05/2 = z0.025 = 1.96, n = 100, σ = 7.1 and X̄ = 110. • The CI is: 110 − 1.96 × 7.1 √ 100 , 110 + 1.96 × 7.1 √ 100 = [108.6, 111.4] . • To construct an 80% CI only one thing has to change, that is we only have to replace the 1.96 above with another number. To find this value go to the normal table an look inside it for 0.1, you should see −1.28. Replace 1.96 with 1.28 to give 110 − 1.28 × 7.1 √ 100 , 110 + 1.28 × 7.1 √ 100 = [109.1, 110.9]. 18
  • 20. Lecture 13 (MWF) Confidence intervals and Margin of Error • If we want the MoE to have length 0.45, then the interval X̄ − 1.28 × 7.1 √ n , X̄ + 1.28 × 7.1 √ n must have length 1. This means that MoE = 0.5 = 1.28 × 7.1 √ n . Solve this (or use the formula) to give n = 1.28 × 7.1 0.45 2 = 400. • Hence we need to sample at least 400 cups to obtain a margin of error which is 0.45 (half of what existed previously). 19
  • 21. Lecture 13 (MWF) Confidence intervals and Margin of Error Example 6 (when standard deviation is unknown) How large a sample size do we require such that the margin of error for a 95% confidence interval for the mean of human heights is maximum 0.25 inch. The standard deviation is unknown, but it is believed that σ lies somewhere between 2-5 inches. • Why this question matters: In general the standard deviation will be unknown. But we can ‘guess’ limits on how large or small it is based on own expertize. 20
  • 22. Lecture 13 (MWF) Confidence intervals and Margin of Error Solution 6 The more variable the data the larger the confidence interval. Therefore, when given a range of standard deviations and our aim is that the margin of error should be no larger than 0.25 (i.e. 0.25 or less), then we need to use the largest standard deviation in the given range in the calculation • In other words n = 1.96 × σlargest 0.25 2 = 1.96 × 5 0.25 2 = 1537. • For any other σ 5, using n = 1537 will lead to a Margin of Error which is less than 0.25. To see why, recall MoE = 1.96 × σ √ n = 1.96 × σ 1.96×5 0.25 = 0.25 × σ 5 . 21
  • 23. Lecture 13 (MWF) Confidence intervals and Margin of Error Thus we see that if the true σ 5, then the MoE will be less than 0.25, since σ/5 1. • If we use σ = 2 in the margin of error calculation, then n = 246. However, if the true σ 2 using n = 246 will lead to a Margin of Error which is larger than 0.25. 22
  • 24. Lecture 13 (MWF) Confidence intervals and Margin of Error Margin of Error calculations using software • There are various software tools on the web that will do margin of error calculations. For example, https://www.emathhelp.net/calculators/probability-statistics/margin-of-error-calculator/ • Here is one by survey monkey https://www.surveymonkey.com/mp/margin-of-error-calculator/. This calculator is specifically designed for calculating the MoE of proportions; where the standard deviation need not be specified. Here is another one http://www.raosoft.com/samplesize.html, which which can give smaller sample sizes if a proportion is specified. We cover this later on in the course. • The calculations done in class assume that the population size is infinite (or that the sample is a SRS, that is the same person is sampled again). 23
  • 25. Lecture 13 (MWF) Confidence intervals and Margin of Error However, when surveying certain populations the population size will be finite. • Therefore, some calculators will also ask for the population size. Using the finite population size they make what is called a finite sample correction. You can read more about it here: https://en.wikipedia.org/wiki/Margin_of_error#Effect_of_population_size. 24
  • 26. Lecture 13 (MWF) Confidence intervals and Margin of Error Example 7 A confidence interval for the length of parrots is [4,10] inches. It is based on a sample size n. By what factor should the sample size increase such that the margin of error reduces to 1? 25
  • 27. Lecture 13 (MWF) Confidence intervals and Margin of Error Solution 7 • The original margin of error is 3. Thus 1.96 × σ/ √ n = 3. • We want to increase the sample size such that it decreases to 1. 1.96 × σ √ Factor × n = 1 ⇒ 1.96 × σ √ Factor × √ n = 3 √ Factor = 1. Solving for this we see that we need to increase the sample size by factor 9 in order to decrease the margin of error by a factor 3. • Conclusion An extremely large increase in sample size has to be made for a moderate reduction in margin of error. 26