SlideShare a Scribd company logo
1 of 31
By: 
admission.edhole.com
 6.6: Small-sample inference for a proportion 
 7.1: Large sample comparisons for two 
independent sample means. 
 7.2: Difference between two large sample 
proportions. 
2 
admission.edhole.com
 So far, we have been making estimates and 
inferences about a single sample statistic 
 Now, we will begin making estimates and inferences 
for two sample statistics at once. 
• many real-life problems involve such comparisons 
• two-group problems often serve as a starting point for 
more involved statistics, as we shall see in this class. 
3 
admission.edhole.com
 Two independent random samples: 
• Two subsamples, each with a mean score for some other 
variable 
• example: Comparisons of work hours by race or sex 
• example: Comparison of earnings by marital status 
 Two dependent random samples: 
• Two observations are being compared for each “unit” in 
the sample 
• example: before-and-after measurements of the same 
person at two time points 
• example: earnings before and after marriage 
• husband-wife differences 
4 
admission.edhole.com
Hypothesis testing as we have done it so far: 
 Test statistic: z = (Ybar - mo) / (s /SQRT(n)) 
 What can we do when we make inferences about a 
difference between population means (m2 - m1)? 
• Treat one sample mean as if it were mo ? 
• (NO: too much type I error) 
• Calculate a confidence interval for each sample mean 
and see if they overlap? 
• (NO: too much type II error) 
5 
admission.edhole.com
Is Y2 –Y1an appropriate way to evaluate m2 - m1? 
• Answer: Yes. We can appropriately define (m2 - m1) as a 
parameter of interest and estimate it in an unbiased way 
with (Y2 – Y1) just as we would estimate m with Y. 
• This line of argument may seem trivial, but it becomes 
important when we work with variance and standard 
deviations. 
6 
admission.edhole.com
Comparing standard errors: 
 A&F 213: formula without derivation 
 Is s2 
Ybar2 - s2 
Ybar1an appropriate way to estimate s2 
(Ybar2-Ybar1)? 
• No! 
"s2 
(Ybar2-Ybar1)= s2 
(Ybar2) - 2s(Ybar2,Ybar1) + s2 
(Ybar1) 
• Where 2s(Ybar2,Ybar1) reflects how much the observations for the 
two groups are dependent. 
• For independent groups, 2s(Ybar2,Ybar1) = 0, 
so s2 
(Ybar2-Ybar1)= s2 
(Ybar2) + s2 
(Ybar1) 
7 
admission.edhole.com
 The parameter of interest is m2 - m1 
 Assumptions: 
• the sample is drawn from a random sample of some sort, 
• the parameter of interest is a variable with an interval 
scale, 
• the sample size is large enough that the sampling 
distribution of Ybar2 – Ybar1 is approximately normal. 
• The two samples are drawn independently 
8 
admission.edhole.com
 The null hypothesis will be that there is no 
difference between the population means. This 
means that any difference we observe is due to 
random chance. 
 Ho: m2 - m1 = 0 
• (We can specify an alpha level now if we want) 
 Q: Would it matter if we used 
Ho: m1 - m2 = 0 ? 
Ho: m1 = m2 ? 
9 
admission.edhole.com
 The test statistic has a standard form: 
• z = (estimate of parameter – Ho value of parameter) 
standard error of parameter 
z Y Y 
= - - 
2 1 ( ) 0 
2 
2 
2 
1 
s 
s 
 Q: If the null hypothesis is that the means are the 
same, why do we estimate two different standard 
deviations? 
10 
2 
1 
n 
n 
+ 
admission.edhole.com
P-value of calculated z: 
• Table A 
• Stata: display 2 * (1 – normal(z) ) 
• Stata: testi (no data, just parameters) 
• Stata: ttest (if data file in memory) 
11 
admission.edhole.com
Step 5: Conclusion. 
 Compare the p-value from step 4 to the alpha level 
in step 1. 
If p < α, reject H0 If p ≥ α, do not reject H0 
 State a conclusion about the statistical significance 
of the test. 
 Briefly discuss the substantive importance of your 
findings. 
12 
admission.edhole.com
 Do women spend more time on housework than 
men? 
 Data from the 1988 National Survey of Families 
and Households: 
• sex sample size mean hours s.d 
• men 4252 18.1 12.9 
• women 6764 32.6 18.2 
 The parameter of interest is m2 - m1 
13 
admission.edhole.com
1. Assumptions: random sample, interval-scale variable, 
sample size large enough that the sampling distribution of 
m2 - m1is approximately normal, independent groups 
2. Hypothesis: Ho: m2 - m1= 0 
3. Test statistic: 
z = ((32.6 – 18.1) – 0) / SQRT((12.9)2/4252 + (18.2)2/6764) = 48.8 
1. p-value: p<.001 
2. conclusion: 
a. reject H0: these sample differences are very unlikely to occur if men 
and women do the same number of hours of housework. 
b. furthermore, the observed difference of 14.5 hours per week is a 
substantively important difference in the amount of housework. 
14 
admission.edhole.com
2 
2 
2 
1 
s 
c i = Y -Y ± z s + 
 housework example with 99% interval: 
 c.i…. 
= (32.6 – 18.1) +/- 2.58*( √((12.9)2/4252 + (18.2)2/6764)) 
= 14.5 +/- 2.58*.30 
= 14.5 +/- .8, or (13.7,15.3) 
 By this analysis, the 99% confidence interval for the 
difference in housework is 13.7 to 15.3 hours. 
15 
( ) 
2 
1 
2 1 . . 
n 
n 
admission.edhole.com
 Immediate (no data, just parameters) 
• ttesti 4252 18.1 12.9 6764 32.6 18.2, unequal 
• Q: why ttesti with large samples? 
 For the immediate command, you need the following: 
• sample size for group 1 (n = 4252) 
• mean for group 1 
• standard deviation for group 1 
• sample size for group 2 
• mean for group 2 
• standard deviation for group 2 
• instructions to not assume equal variance (, unequal) 
16 
admission.edhole.com
. ttesti 4252 18.1 12.9 6764 32.6 18.2, unequal 
Two-sample t test with unequal variances 
------------------------------------------------------------------------------ 
| Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] 
---------+-------------------------------------------------------------------- 
x | 4252 18.1 .1978304 12.9 17.71215 18.48785 
y | 6764 32.6 .221294 18.2 32.16619 33.03381 
---------+-------------------------------------------------------------------- 
combined | 11016 27.00323 .1697512 17.8166 26.67049 27.33597 
---------+-------------------------------------------------------------------- 
diff | -14.5 .2968297 -15.08184 -13.91816 
------------------------------------------------------------------------------ 
Satterthwaite's degrees of freedom: 10858.6 
Ho: mean(x) - mean(y) = diff = 0 
Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 
t = -48.8496 t = -48.8496 t = -48.8496 
P < t = 0.0000 P > |t| = 0.0000 P > t = 1.0000 
17 
admission.edhole.com
. ttest YEARSJOB, by(nonstandard) unequal 
Two-sample t test with unequal variances 
------------------------------------------------------------------------------ 
Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] 
---------+-------------------------------------------------------------------- 
0 | 980 9.430612 .2788544 8.729523 8.883391 9.977833 
1 | 379 7.907652 .3880947 7.555398 7.144557 8.670747 
---------+-------------------------------------------------------------------- 
combined | 1359 9.005887 .2290413 8.443521 8.556573 9.4552 
---------+-------------------------------------------------------------------- 
diff | 1.522961 .4778884 .5848756 2.461045 
------------------------------------------------------------------------------ 
diff = mean(0) - mean(1) t = 3.1869 
Ho: diff = 0 Satterthwaite's degrees of freedom = 787.963 
Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 
Pr(T < t) = 0.9993 Pr(|T| > |t|) = 0.0015 Pr(T > t) = 0.0007 
18 
admission.edhole.com
. ttest conrinc if wrkstat==1, by(wrkslf) unequal 
Two-sample t test with unequal variances 
------------------------------------------------------------------------------ 
Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] 
---------+-------------------------------------------------------------------- 
self-emp | 190 48514.62 2406.263 33168.05 43768.03 53261.2 
someone | 1263 34417.11 636.9954 22638 33167.43 35666.8 
---------+-------------------------------------------------------------------- 
combined | 1453 36260.56 648.5844 24722.9 34988.3 37532.82 
---------+-------------------------------------------------------------------- 
diff | 14097.5 2489.15 9191.402 19003.6 
------------------------------------------------------------------------------ 
diff = mean(self-emp) - mean(someone) t = 5.6636 
Ho: diff = 0 Satterthwaite's degrees of freedom = 216.259 
Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 
Pr(T < t) = 1.0000 Pr(|T| > |t|) = 0.0000 Pr(T > t) = 0.0000 
19 
admission.edhole.com
 In 1982 and 1994, respondents in the General Social Survey 
were asked: “Do you agree or disagree with this statement? 
‘Women should take care of running their homes and leave 
running the country up to men.’” 
• Year Agree Disagree Total 
• 1982 122 223 345 
• 1994 268 1632 1900 
• Total 390 1855 2245 
 Do a formal test to decide whether opinions differed in the 
two years. 
20 
admission.edhole.com
 The parameter of interest is π2 - π1 
 Assumptions: 
• the sample is drawn from a random sample of some sort, 
• the parameter of interest is a variable with an interval 
scale, 
• the sample size is large enough that the sampling 
distribution of Pihat2 – Pihat1 is approximately normal. 
• The two samples are drawn independently 
21 
admission.edhole.com
The null hypothesis will be that there is no 
difference between the population proportions. This 
means that any difference we observe is due to 
random chance. 
Ho: π2 - π1 = 0 
(State an alpha here if you want to.) 
22 
admission.edhole.com
The test statistic has a standard form: 
 z = (estimate of parameter – Ho value of parameter) 
standard error of parameter 
ö 
( ˆ ˆ ) 
= - 
p p 
2 1 
æ 
ˆ 1 ˆ 1 1 
n n 
 Where pihat is the overall weighted average 
• This means we are assuming equal variance in the two 
populations. 
• Q: why do we use an assumption of equal variance to 
estimate the standard error for the t-test? 
23 
( ) ÷ ÷ø 
ç çè 
- + 
1 2 
z 
p p 
admission.edhole.com
P-value of calculated z: 
• Table A, or 
• Stata: display 2 * (1 – normal(z) ), or 
• Stata: testi (no data, just parameters) 
• Stata: ttest (if data file in memory) 
24 
admission.edhole.com
Conclusion: 
 Compare the p-value from step 4 to the alpha level 
in step 1. 
If p < α, reject H0 If p ≥ α, do not reject H0 
 State a conclusion about the statistical significance 
of the test. 
 Briefly discuss the substantive importance of your 
findings. 
25 
admission.edhole.com
1. Assumptions: random sample, interval-scale variable, 
sample size large enough that the sampling distribution of 
m2 - m1is approximately normal, independent groups 
2. Hypothesis: Ho: π2 - π1= 0 
3. Test statistic: 
z = (122/345 – 268/1900) / 
SQRT[(390/2245)*(1 - 390/2245)*(1/345 + 1/1900)] 
= 9.59 
1. p-value: p<<.001 
2. conclusion: 
a. reject H0: attitudes were clearly different in 1994 than in 1982. 
b. furthermore, the observed difference of .21 is a substantively 
important change in attitudes. 
26 
admission.edhole.com
 confidence interval: 
c i = P - P ± z P - P + - 
P P 
. . (1 ) (1 ) 
1 1 
2 2 
 Notice that there is no overall weighted average Pihat, 
as there is in a significance test for proportions. 
• Instead, we estimate two separate variances from the 
separate proportions. 
• Why? 
27 
( ) 
2 
1 
2 1 
n 
n 
admission.edhole.com
. prtesti 345 .3536 1900 .1411 
 STATA needs the following information: 
• sample size for group 1 (n = 345) 
• proportion for group 1 (p = 122/345) 
• sample size for group 2 (n = 1900) 
• proportion for group 2 (p = 268/1900) 
28 
admission.edhole.com
. prtesti 345 .3536 1900 .1411 
Two-sample test of proportion x: Number of obs = 345 
y: Number of obs = 1900 
------------------------------------------------------------------------------ 
Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] 
-------------+---------------------------------------------------------------- 
x | .3536 .0257393 .3031518 .4040482 
y | .1411 .0079865 .1254467 .1567533 
-------------+---------------------------------------------------------------- 
diff | .2125 .0269499 .1596791 .2653209 
| under Ho: .0221741 9.58 0.000 
------------------------------------------------------------------------------ 
Ho: proportion(x) - proportion(y) = diff = 0 
Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 
z = 9.583 z = 9.583 z = 9.583 
P < z = 1.0000 P > |z| = 0.0000 P > z = 0.0000 
Note the use of one standard error (unequal variance) for the 
confidence interval, and another (equal variance) for the 
significance test. 
29 
admission.edhole.com
. prtest nonstandard if (RACECEN1==1 | RACECEN1==2), by(RACECEN1) 
Two-sample test of proportion 1: Number of obs = 1389 
2: Number of obs = 260 
------------------------------------------------------------------------------ 
Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] 
-------------+---------------------------------------------------------------- 
1 | .2800576 .0120482 .2564436 .3036716 
2 | .3538462 .0296544 .2957247 .4119676 
-------------+---------------------------------------------------------------- 
diff | -.0737886 .0320084 -.1365239 -.0110532 
| under Ho: .0307147 -2.40 0.016 
------------------------------------------------------------------------------ 
diff = prop(1) - prop(2) z = -2.4024 
Ho: diff = 0 
Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 
Pr(Z < z) = 0.0081 Pr(|Z| < |z|) = 0.0163 Pr(Z > z) = 0.9919 
30 
admission.edhole.com
. gen byte wrkslf0=wrkslf-1 
(152 missing values generated) 
. prtest wrkslf0 if wrkstat==1, by(sex) 
Two-sample test of proportion male: Number of obs = 874 
female: Number of obs = 743 
------------------------------------------------------------------------------ 
Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] 
-------------+---------------------------------------------------------------- 
male | .8272311 .0127876 .8021678 .8522944 
female | .9044415 .0107853 .8833027 .9255802 
-------------+---------------------------------------------------------------- 
diff | -.0772103 .0167286 -.1099978 -.0444229 
| under Ho: .0171735 -4.50 0.000 
------------------------------------------------------------------------------ 
diff = prop(male) - prop(female) z = -4.4959 
Ho: diff = 0 
Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 
Pr(Z < z) = 0.0000 Pr(|Z| < |z|) = 0.0000 Pr(Z > z) = 1.0000 
31 
admission.edhole.com

More Related Content

What's hot

OR Linear Programming
OR Linear ProgrammingOR Linear Programming
OR Linear Programming
chaitu87
 
Str t-test1
Str   t-test1Str   t-test1
Str t-test1
iamkim
 

What's hot (19)

Regression vs Neural Net
Regression vs Neural NetRegression vs Neural Net
Regression vs Neural Net
 
Seattle.Slides.7
Seattle.Slides.7Seattle.Slides.7
Seattle.Slides.7
 
Two-sample Hypothesis Tests
Two-sample Hypothesis Tests Two-sample Hypothesis Tests
Two-sample Hypothesis Tests
 
Normal probability distribution
Normal probability distributionNormal probability distribution
Normal probability distribution
 
Statistics
StatisticsStatistics
Statistics
 
Matlab lab manual
Matlab lab manualMatlab lab manual
Matlab lab manual
 
Normal distribution slide share
Normal distribution slide shareNormal distribution slide share
Normal distribution slide share
 
OR Linear Programming
OR Linear ProgrammingOR Linear Programming
OR Linear Programming
 
MCA_UNIT-2_Computer Oriented Numerical Statistical Methods
MCA_UNIT-2_Computer Oriented Numerical Statistical MethodsMCA_UNIT-2_Computer Oriented Numerical Statistical Methods
MCA_UNIT-2_Computer Oriented Numerical Statistical Methods
 
Linear Rules
Linear RulesLinear Rules
Linear Rules
 
Some study materials
Some study materialsSome study materials
Some study materials
 
Normal distri
Normal distriNormal distri
Normal distri
 
The siegel-tukey-test-for-equal-variability
The siegel-tukey-test-for-equal-variabilityThe siegel-tukey-test-for-equal-variability
The siegel-tukey-test-for-equal-variability
 
Str t-test1
Str   t-test1Str   t-test1
Str t-test1
 
Differential calculus maxima minima
Differential calculus  maxima minimaDifferential calculus  maxima minima
Differential calculus maxima minima
 
Pattern Discovery - part I
Pattern Discovery - part IPattern Discovery - part I
Pattern Discovery - part I
 
Standard normal distribution
Standard normal distributionStandard normal distribution
Standard normal distribution
 
Lesson 27 using statistical techniques in analyzing data
Lesson 27 using statistical techniques in analyzing dataLesson 27 using statistical techniques in analyzing data
Lesson 27 using statistical techniques in analyzing data
 
Lesson 23 planning data analyses using statistics
Lesson 23 planning data analyses using statisticsLesson 23 planning data analyses using statistics
Lesson 23 planning data analyses using statistics
 

Viewers also liked (9)

Audience Presentation
Audience PresentationAudience Presentation
Audience Presentation
 
Centre haas
Centre haasCentre haas
Centre haas
 
prezentacja
 				prezentacja 				prezentacja
prezentacja
 
Latihan Dalam Perkhidmatan Skc
Latihan Dalam  Perkhidmatan SkcLatihan Dalam  Perkhidmatan Skc
Latihan Dalam Perkhidmatan Skc
 
Ch07
Ch07Ch07
Ch07
 
The churches in judea
The churches in judeaThe churches in judea
The churches in judea
 
Becas
BecasBecas
Becas
 
Toner Recycling Program - PrintersPlus Print Green
Toner Recycling Program - PrintersPlus Print GreenToner Recycling Program - PrintersPlus Print Green
Toner Recycling Program - PrintersPlus Print Green
 
Actm usermanual
Actm usermanualActm usermanual
Actm usermanual
 

Similar to Admission in India

การสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุขการสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุข
Ultraman Taro
 
Lesson06_new
Lesson06_newLesson06_new
Lesson06_new
shengvn
 
09 test of hypothesis small sample.ppt
09 test of hypothesis small sample.ppt09 test of hypothesis small sample.ppt
09 test of hypothesis small sample.ppt
Pooja Sakhla
 

Similar to Admission in India (20)

Top schools in noida
Top schools in noidaTop schools in noida
Top schools in noida
 
Video lectures for b.tech
Video lectures for b.techVideo lectures for b.tech
Video lectures for b.tech
 
WSDM2019tutorial
WSDM2019tutorialWSDM2019tutorial
WSDM2019tutorial
 
ecir2019tutorial
ecir2019tutorialecir2019tutorial
ecir2019tutorial
 
ecir2019tutorial-finalised
ecir2019tutorial-finalisedecir2019tutorial-finalised
ecir2019tutorial-finalised
 
Chapter07.pdf
Chapter07.pdfChapter07.pdf
Chapter07.pdf
 
การสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุขการสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุข
 
Statr session 17 and 18
Statr session 17 and 18Statr session 17 and 18
Statr session 17 and 18
 
Statr session 17 and 18 (ASTR)
Statr session 17 and 18 (ASTR)Statr session 17 and 18 (ASTR)
Statr session 17 and 18 (ASTR)
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Two variances or standard deviations
Two variances or standard deviations  Two variances or standard deviations
Two variances or standard deviations
 
Two Means, Independent Samples
Two Means, Independent SamplesTwo Means, Independent Samples
Two Means, Independent Samples
 
Topic 1 part 2
Topic 1 part 2Topic 1 part 2
Topic 1 part 2
 
hypothesis testing-tests of proportions and variances in six sigma
hypothesis testing-tests of proportions and variances in six sigmahypothesis testing-tests of proportions and variances in six sigma
hypothesis testing-tests of proportions and variances in six sigma
 
Advanced Statistics And Probability (MSC 615
Advanced Statistics And Probability (MSC 615Advanced Statistics And Probability (MSC 615
Advanced Statistics And Probability (MSC 615
 
Lesson06_new
Lesson06_newLesson06_new
Lesson06_new
 
Module 6: Outlier Detection for Two Sample Case
Module 6: Outlier Detection for Two Sample CaseModule 6: Outlier Detection for Two Sample Case
Module 6: Outlier Detection for Two Sample Case
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
sigir2018tutorial
sigir2018tutorialsigir2018tutorial
sigir2018tutorial
 
09 test of hypothesis small sample.ppt
09 test of hypothesis small sample.ppt09 test of hypothesis small sample.ppt
09 test of hypothesis small sample.ppt
 

More from Edhole.com

More from Edhole.com (20)

Ca in patna
Ca in patnaCa in patna
Ca in patna
 
Chartered accountant in dwarka
Chartered accountant in dwarkaChartered accountant in dwarka
Chartered accountant in dwarka
 
Ca in dwarka
Ca in dwarkaCa in dwarka
Ca in dwarka
 
Ca firm in dwarka
Ca firm in dwarkaCa firm in dwarka
Ca firm in dwarka
 
Website development company surat
Website development company suratWebsite development company surat
Website development company surat
 
Website designing company in surat
Website designing company in suratWebsite designing company in surat
Website designing company in surat
 
Website dsigning company in india
Website dsigning company in indiaWebsite dsigning company in india
Website dsigning company in india
 
Website designing company in delhi
Website designing company in delhiWebsite designing company in delhi
Website designing company in delhi
 
Ca in patna
Ca in patnaCa in patna
Ca in patna
 
Chartered accountant in dwarka
Chartered accountant in dwarkaChartered accountant in dwarka
Chartered accountant in dwarka
 
Ca firm in dwarka
Ca firm in dwarkaCa firm in dwarka
Ca firm in dwarka
 
Ca in dwarka
Ca in dwarkaCa in dwarka
Ca in dwarka
 
Website development company surat
Website development company suratWebsite development company surat
Website development company surat
 
Website designing company in surat
Website designing company in suratWebsite designing company in surat
Website designing company in surat
 
Website designing company in india
Website designing company in indiaWebsite designing company in india
Website designing company in india
 
Website designing company in delhi
Website designing company in delhiWebsite designing company in delhi
Website designing company in delhi
 
Website designing company in mumbai
Website designing company in mumbaiWebsite designing company in mumbai
Website designing company in mumbai
 
Website development company surat
Website development company suratWebsite development company surat
Website development company surat
 
Website desinging company in surat
Website desinging company in suratWebsite desinging company in surat
Website desinging company in surat
 
Website designing company in india
Website designing company in indiaWebsite designing company in india
Website designing company in india
 

Recently uploaded

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 

Admission in India

  • 2.  6.6: Small-sample inference for a proportion  7.1: Large sample comparisons for two independent sample means.  7.2: Difference between two large sample proportions. 2 admission.edhole.com
  • 3.  So far, we have been making estimates and inferences about a single sample statistic  Now, we will begin making estimates and inferences for two sample statistics at once. • many real-life problems involve such comparisons • two-group problems often serve as a starting point for more involved statistics, as we shall see in this class. 3 admission.edhole.com
  • 4.  Two independent random samples: • Two subsamples, each with a mean score for some other variable • example: Comparisons of work hours by race or sex • example: Comparison of earnings by marital status  Two dependent random samples: • Two observations are being compared for each “unit” in the sample • example: before-and-after measurements of the same person at two time points • example: earnings before and after marriage • husband-wife differences 4 admission.edhole.com
  • 5. Hypothesis testing as we have done it so far:  Test statistic: z = (Ybar - mo) / (s /SQRT(n))  What can we do when we make inferences about a difference between population means (m2 - m1)? • Treat one sample mean as if it were mo ? • (NO: too much type I error) • Calculate a confidence interval for each sample mean and see if they overlap? • (NO: too much type II error) 5 admission.edhole.com
  • 6. Is Y2 –Y1an appropriate way to evaluate m2 - m1? • Answer: Yes. We can appropriately define (m2 - m1) as a parameter of interest and estimate it in an unbiased way with (Y2 – Y1) just as we would estimate m with Y. • This line of argument may seem trivial, but it becomes important when we work with variance and standard deviations. 6 admission.edhole.com
  • 7. Comparing standard errors:  A&F 213: formula without derivation  Is s2 Ybar2 - s2 Ybar1an appropriate way to estimate s2 (Ybar2-Ybar1)? • No! "s2 (Ybar2-Ybar1)= s2 (Ybar2) - 2s(Ybar2,Ybar1) + s2 (Ybar1) • Where 2s(Ybar2,Ybar1) reflects how much the observations for the two groups are dependent. • For independent groups, 2s(Ybar2,Ybar1) = 0, so s2 (Ybar2-Ybar1)= s2 (Ybar2) + s2 (Ybar1) 7 admission.edhole.com
  • 8.  The parameter of interest is m2 - m1  Assumptions: • the sample is drawn from a random sample of some sort, • the parameter of interest is a variable with an interval scale, • the sample size is large enough that the sampling distribution of Ybar2 – Ybar1 is approximately normal. • The two samples are drawn independently 8 admission.edhole.com
  • 9.  The null hypothesis will be that there is no difference between the population means. This means that any difference we observe is due to random chance.  Ho: m2 - m1 = 0 • (We can specify an alpha level now if we want)  Q: Would it matter if we used Ho: m1 - m2 = 0 ? Ho: m1 = m2 ? 9 admission.edhole.com
  • 10.  The test statistic has a standard form: • z = (estimate of parameter – Ho value of parameter) standard error of parameter z Y Y = - - 2 1 ( ) 0 2 2 2 1 s s  Q: If the null hypothesis is that the means are the same, why do we estimate two different standard deviations? 10 2 1 n n + admission.edhole.com
  • 11. P-value of calculated z: • Table A • Stata: display 2 * (1 – normal(z) ) • Stata: testi (no data, just parameters) • Stata: ttest (if data file in memory) 11 admission.edhole.com
  • 12. Step 5: Conclusion.  Compare the p-value from step 4 to the alpha level in step 1. If p < α, reject H0 If p ≥ α, do not reject H0  State a conclusion about the statistical significance of the test.  Briefly discuss the substantive importance of your findings. 12 admission.edhole.com
  • 13.  Do women spend more time on housework than men?  Data from the 1988 National Survey of Families and Households: • sex sample size mean hours s.d • men 4252 18.1 12.9 • women 6764 32.6 18.2  The parameter of interest is m2 - m1 13 admission.edhole.com
  • 14. 1. Assumptions: random sample, interval-scale variable, sample size large enough that the sampling distribution of m2 - m1is approximately normal, independent groups 2. Hypothesis: Ho: m2 - m1= 0 3. Test statistic: z = ((32.6 – 18.1) – 0) / SQRT((12.9)2/4252 + (18.2)2/6764) = 48.8 1. p-value: p<.001 2. conclusion: a. reject H0: these sample differences are very unlikely to occur if men and women do the same number of hours of housework. b. furthermore, the observed difference of 14.5 hours per week is a substantively important difference in the amount of housework. 14 admission.edhole.com
  • 15. 2 2 2 1 s c i = Y -Y ± z s +  housework example with 99% interval:  c.i…. = (32.6 – 18.1) +/- 2.58*( √((12.9)2/4252 + (18.2)2/6764)) = 14.5 +/- 2.58*.30 = 14.5 +/- .8, or (13.7,15.3)  By this analysis, the 99% confidence interval for the difference in housework is 13.7 to 15.3 hours. 15 ( ) 2 1 2 1 . . n n admission.edhole.com
  • 16.  Immediate (no data, just parameters) • ttesti 4252 18.1 12.9 6764 32.6 18.2, unequal • Q: why ttesti with large samples?  For the immediate command, you need the following: • sample size for group 1 (n = 4252) • mean for group 1 • standard deviation for group 1 • sample size for group 2 • mean for group 2 • standard deviation for group 2 • instructions to not assume equal variance (, unequal) 16 admission.edhole.com
  • 17. . ttesti 4252 18.1 12.9 6764 32.6 18.2, unequal Two-sample t test with unequal variances ------------------------------------------------------------------------------ | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] ---------+-------------------------------------------------------------------- x | 4252 18.1 .1978304 12.9 17.71215 18.48785 y | 6764 32.6 .221294 18.2 32.16619 33.03381 ---------+-------------------------------------------------------------------- combined | 11016 27.00323 .1697512 17.8166 26.67049 27.33597 ---------+-------------------------------------------------------------------- diff | -14.5 .2968297 -15.08184 -13.91816 ------------------------------------------------------------------------------ Satterthwaite's degrees of freedom: 10858.6 Ho: mean(x) - mean(y) = diff = 0 Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 t = -48.8496 t = -48.8496 t = -48.8496 P < t = 0.0000 P > |t| = 0.0000 P > t = 1.0000 17 admission.edhole.com
  • 18. . ttest YEARSJOB, by(nonstandard) unequal Two-sample t test with unequal variances ------------------------------------------------------------------------------ Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] ---------+-------------------------------------------------------------------- 0 | 980 9.430612 .2788544 8.729523 8.883391 9.977833 1 | 379 7.907652 .3880947 7.555398 7.144557 8.670747 ---------+-------------------------------------------------------------------- combined | 1359 9.005887 .2290413 8.443521 8.556573 9.4552 ---------+-------------------------------------------------------------------- diff | 1.522961 .4778884 .5848756 2.461045 ------------------------------------------------------------------------------ diff = mean(0) - mean(1) t = 3.1869 Ho: diff = 0 Satterthwaite's degrees of freedom = 787.963 Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 Pr(T < t) = 0.9993 Pr(|T| > |t|) = 0.0015 Pr(T > t) = 0.0007 18 admission.edhole.com
  • 19. . ttest conrinc if wrkstat==1, by(wrkslf) unequal Two-sample t test with unequal variances ------------------------------------------------------------------------------ Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] ---------+-------------------------------------------------------------------- self-emp | 190 48514.62 2406.263 33168.05 43768.03 53261.2 someone | 1263 34417.11 636.9954 22638 33167.43 35666.8 ---------+-------------------------------------------------------------------- combined | 1453 36260.56 648.5844 24722.9 34988.3 37532.82 ---------+-------------------------------------------------------------------- diff | 14097.5 2489.15 9191.402 19003.6 ------------------------------------------------------------------------------ diff = mean(self-emp) - mean(someone) t = 5.6636 Ho: diff = 0 Satterthwaite's degrees of freedom = 216.259 Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 Pr(T < t) = 1.0000 Pr(|T| > |t|) = 0.0000 Pr(T > t) = 0.0000 19 admission.edhole.com
  • 20.  In 1982 and 1994, respondents in the General Social Survey were asked: “Do you agree or disagree with this statement? ‘Women should take care of running their homes and leave running the country up to men.’” • Year Agree Disagree Total • 1982 122 223 345 • 1994 268 1632 1900 • Total 390 1855 2245  Do a formal test to decide whether opinions differed in the two years. 20 admission.edhole.com
  • 21.  The parameter of interest is π2 - π1  Assumptions: • the sample is drawn from a random sample of some sort, • the parameter of interest is a variable with an interval scale, • the sample size is large enough that the sampling distribution of Pihat2 – Pihat1 is approximately normal. • The two samples are drawn independently 21 admission.edhole.com
  • 22. The null hypothesis will be that there is no difference between the population proportions. This means that any difference we observe is due to random chance. Ho: π2 - π1 = 0 (State an alpha here if you want to.) 22 admission.edhole.com
  • 23. The test statistic has a standard form:  z = (estimate of parameter – Ho value of parameter) standard error of parameter ö ( ˆ ˆ ) = - p p 2 1 æ ˆ 1 ˆ 1 1 n n  Where pihat is the overall weighted average • This means we are assuming equal variance in the two populations. • Q: why do we use an assumption of equal variance to estimate the standard error for the t-test? 23 ( ) ÷ ÷ø ç çè - + 1 2 z p p admission.edhole.com
  • 24. P-value of calculated z: • Table A, or • Stata: display 2 * (1 – normal(z) ), or • Stata: testi (no data, just parameters) • Stata: ttest (if data file in memory) 24 admission.edhole.com
  • 25. Conclusion:  Compare the p-value from step 4 to the alpha level in step 1. If p < α, reject H0 If p ≥ α, do not reject H0  State a conclusion about the statistical significance of the test.  Briefly discuss the substantive importance of your findings. 25 admission.edhole.com
  • 26. 1. Assumptions: random sample, interval-scale variable, sample size large enough that the sampling distribution of m2 - m1is approximately normal, independent groups 2. Hypothesis: Ho: π2 - π1= 0 3. Test statistic: z = (122/345 – 268/1900) / SQRT[(390/2245)*(1 - 390/2245)*(1/345 + 1/1900)] = 9.59 1. p-value: p<<.001 2. conclusion: a. reject H0: attitudes were clearly different in 1994 than in 1982. b. furthermore, the observed difference of .21 is a substantively important change in attitudes. 26 admission.edhole.com
  • 27.  confidence interval: c i = P - P ± z P - P + - P P . . (1 ) (1 ) 1 1 2 2  Notice that there is no overall weighted average Pihat, as there is in a significance test for proportions. • Instead, we estimate two separate variances from the separate proportions. • Why? 27 ( ) 2 1 2 1 n n admission.edhole.com
  • 28. . prtesti 345 .3536 1900 .1411  STATA needs the following information: • sample size for group 1 (n = 345) • proportion for group 1 (p = 122/345) • sample size for group 2 (n = 1900) • proportion for group 2 (p = 268/1900) 28 admission.edhole.com
  • 29. . prtesti 345 .3536 1900 .1411 Two-sample test of proportion x: Number of obs = 345 y: Number of obs = 1900 ------------------------------------------------------------------------------ Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- x | .3536 .0257393 .3031518 .4040482 y | .1411 .0079865 .1254467 .1567533 -------------+---------------------------------------------------------------- diff | .2125 .0269499 .1596791 .2653209 | under Ho: .0221741 9.58 0.000 ------------------------------------------------------------------------------ Ho: proportion(x) - proportion(y) = diff = 0 Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 z = 9.583 z = 9.583 z = 9.583 P < z = 1.0000 P > |z| = 0.0000 P > z = 0.0000 Note the use of one standard error (unequal variance) for the confidence interval, and another (equal variance) for the significance test. 29 admission.edhole.com
  • 30. . prtest nonstandard if (RACECEN1==1 | RACECEN1==2), by(RACECEN1) Two-sample test of proportion 1: Number of obs = 1389 2: Number of obs = 260 ------------------------------------------------------------------------------ Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- 1 | .2800576 .0120482 .2564436 .3036716 2 | .3538462 .0296544 .2957247 .4119676 -------------+---------------------------------------------------------------- diff | -.0737886 .0320084 -.1365239 -.0110532 | under Ho: .0307147 -2.40 0.016 ------------------------------------------------------------------------------ diff = prop(1) - prop(2) z = -2.4024 Ho: diff = 0 Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 Pr(Z < z) = 0.0081 Pr(|Z| < |z|) = 0.0163 Pr(Z > z) = 0.9919 30 admission.edhole.com
  • 31. . gen byte wrkslf0=wrkslf-1 (152 missing values generated) . prtest wrkslf0 if wrkstat==1, by(sex) Two-sample test of proportion male: Number of obs = 874 female: Number of obs = 743 ------------------------------------------------------------------------------ Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- male | .8272311 .0127876 .8021678 .8522944 female | .9044415 .0107853 .8833027 .9255802 -------------+---------------------------------------------------------------- diff | -.0772103 .0167286 -.1099978 -.0444229 | under Ho: .0171735 -4.50 0.000 ------------------------------------------------------------------------------ diff = prop(male) - prop(female) z = -4.4959 Ho: diff = 0 Ha: diff < 0 Ha: diff != 0 Ha: diff > 0 Pr(Z < z) = 0.0000 Pr(|Z| < |z|) = 0.0000 Pr(Z > z) = 1.0000 31 admission.edhole.com