SlideShare a Scribd company logo
1 of 20
HYPOTHESIS TESTING
DO DISTRICTS WITH SMALLER
CLASSES HAVE HIGHER TEST SCORES?
Scatterplot of test score v. student-teacher ratio
What does this figure show?
WE NEED TO GET SOME NUMERICAL EVIDENCE
ON WHETHER DISTRICTS WITH LOW STRS HAVE
HIGHER TEST SCORES – BUT HOW?
1. Compare average test scores in districts with low STRs to those with high STRs
(“estimation”)
2. Test the “null” hypothesis that the mean test scores in the two types of districts are
the same, against the “alternative” hypothesis that they differ (“hypothesis testing”)
3. Estimate an interval for the difference in the mean test scores, high v. low STR districts
(“confidence interval”)
INITIAL DATA ANALYSIS: COMPARE
DISTRICTS WITH “SMALL” (STR < 20)
AND “LARGE” (STR ≥ 20) CLASS SIZES
Class Size Average score Standard deviation
(s2)
n
Small 657.4 19.4 238
Large 650.0 17.9 182
Y
1. Estimation of Δ = difference between group means
2. Test the hypothesis that Δ = 0
3. Construct a confidence interval for Δ
1. ESTIMATION
largesmall
small large
1 1small large
1 1
657.4 650.0
7.4
nn
i i
i i
Y Y Y Y
n n 
  
 

 
Is this a large difference in a real-world sense?
• Standard deviation across districts = 19.1
• Difference between 60th and 75th percentiles of test score distribution
is 667.6 – 659.4 = 8.2
• This is a big enough difference to be important for school reform
discussions, for parents, or for a school committee?
2. HYPOTHESIS TESTING
2 2
Difference-in-means test: compute the -statistic,
(remember this?)
( )s l
s l
s l s l
s s
s l
n n
t
Y Y Y Y
t
SE Y Y
 
 

2 2
1
where ( ) is the “standard error” of , the subscripts
and/refer to “small” and “large” STR districts, and
1
( ) (etc.)
1
s
s l s l
n
s i s
is
SE Y Y Y Y s
s Y Y
n 
 
 


•
2. HYPOTHESIS TESTING (2 OF 2)
Compute the difference-of-means t-statistic:
Size s2 n
small 657.4 19.4 238
large 650.0 17.9 182
Y
2 2 2 2
19.4 17.9
238 182
657.4 650.0 7.4
4.05
1.83s l
s l
s l
s s
n n
Y Y
t
 
   

so reject (at the 5% significance level) the null hypothesis that the two
means are the same.
1.96,t
3. CONFIDENCE INTERVAL
A 95% confidence interval for the difference between the means is,
1.96
7.4
( )
1.96 1.83 (3.8, 11.0)
( )S l S lY Y Y YSE  
   
Two equivalent statements:
1. The 95% confidence interval for Δ doesn’t include 0;
2. The hypothesis that Δ = 0 is rejected at the 5% level.
HYPOTHESIS TESTING – OLS REGRESSION
HYPOTHESIS TESTING AND THE STANDARD ERROR OF Β1
The objective is to test a hypothesis, like β1 = 0, using data – to reach a
tentative conclusion whether the (null) hypothesis is correct or incorrect.
General setup
Null hypothesis and two-sided alternative:
H0: β1 = β1,0 vs. H1: β1 ≠ β1,0
where β1,0 is the hypothesized value under the null.
Null hypothesis and one-sided alternative:
H0: β1 = β1,0 vs. H1: β1 < β1,0
GENERAL APPROACH: CONSTRUCT T-
STATISTIC, AND COMPUTE P-VALUE (OR
COMPARE TO THE N(0,1) CRITICAL VALUE)
:• In general
estimator - hypothesized value
standard error of the estimator
t 
where the SE of the estimator is the square root of an
estimator of the variance of the estimator.
,0
:
/
Y
Y
Y
t
S n

• For testing the mean of Y
,• 1For testing 1 1,0
1
ˆ
,
ˆ( )
t
SE
 



1
1
ˆwhere ( ) the square root of an estimator of the variance
ˆof the sampling distribution of
SE 


1
ˆ (1of 2)( )Formula for SE 
Recall the expression for the variance of (large n):
2
1 2 2 2 2
var[( ) ]ˆvar( ) , where ( ) .
( ) ( )
i x i v
i i X i
X X
X u
v X u
n n
 
 
 

   
1
2 2
ˆThe estimator of the variance of replaces the unknown population
values of and by estimators constructed from the data:v X

 
1
2
2
2 1
ˆ 22 2
2
1
1
ˆ
estimator of1 1 2
ˆ
(estimator of ) 1
( )
ˆ ˆwhere ( ) .
n
i
v i
n
X
i
i
i i i
v
n
n n
X X
n
v X X u







   
 
 
 
 


1
ˆ (2 of 2)( )Formula for SE 
1
1
2
2 1
ˆ 2
2
1
2
ˆ1 1
1
ˆ
1 2
ˆ ˆ ˆ, where ( ) .
1
( )
ˆ ˆˆ( ) the standard error of
n
i
i
i i i
n
i
i
v
n
v X X u
n
X X
n
SE



  



   
 
 
 
 


This is a bit nasty, but:
• It is less complicated than it seems. The numerator estimates
var(v), the denominator estimates [var(X)]2.
• Why the degrees-of-freedom adjustment n – 2? Because two
coefficients have been estimated (β0 and β1).
1
ˆ( ) is computed by regression softwareSE •
• Your regression software has memorized this formula so you
don’t need to.
SUMMARY:
TO TEST H0: Β1 = Β1,0 V. H1: Β1 ≠
Β1,0
• Construct the t-statistic
1
1 1,0 1 1,0
2
1 ˆ
ˆ ˆ
ˆ( ) ˆ
t
SE

   
 
 
 
• Reject at 5% significance level if
• The p-value is probability in tails of normal
outside |tact|; you reject at the 5% significance level if the p-
value is < 5%.
t 1.96
[ ]Pr act
t tp  
1
ˆThis procedure relies on the large- approximation that is normally
distributed; typically = 50 is large enough for the approximation
to be excellent.
n
n
•
EXAMPLE: TEST SCORES AND STR,
CALIFORNIA DATA (1 OF 2)
Regression software reports the standard errors:
0 1
1 1,0
1,0
1
ˆ ˆ( ) 10.4 ( ) 0.52
ˆ 2.28 0
-statistic testing 0 4.38
ˆ 0.52( )
SE SE
t
SE
 
 


 
  
    
• The 1% 2-sided significance level is 2.58, so we reject the null
at the 1% significance level.
• Alternatively, we can compute the p-value…
Estimated regression line: Test Score = 698.9 – 2.28 x STR
EXAMPLE: TEST SCORES AND STR,
CALIFORNIA DATA (2 OF 2)
The p-value based on the large-n standard normal
approximation to the t-statistic is 0.00001 (10–5)
CONFIDENCE INTERVALS FOR Β1
(SECTION 5.2)
Recall that a 95% confidence is, equivalently:
• The set of points that cannot be rejected at the 5% significance
level;
• A set-valued function of the data (an interval that is a function
of the data) that contains the true parameter value 95% of the
time in repeated samples.
Because the t-statistic for β1 is N(0,1) in large samples, construction of a 95%
confidence for β1 is just like the case of the sample mean:
1 1 1
ˆ ˆ95% confidence interval for { 1.96 ( )}SE    
0 1
1
1 1
ˆ ˆ( ) 10.4 ( ) 0.52
ˆ95% confidence interval for :
ˆ ˆ{ 1.96 ( )} { 2.28 1.96 0.52}
( 3.30, 1.26)
SE SE
SE
 

 
 
     
  
CONFIDENCE INTERVAL EXAMPLE
The following two statements are equivalent (why?)
• The 95% confidence interval does not include zero;
• The hypothesis β1 = 0 is rejected at the 5% level
OLS REGRESSION: READING STATA
OUTPUT
regress testscr str, robust
Regression with robust standard errors Number of obs = 420
F( 1, 418) = 19.26
Prob > F = 0.0000
R-squared = 0.0512
Root MSE = 18.581
-------------------------------------------------------------------------
| Robust
testscr | Coef. Std. Err. t P>|t| [95% Conf. Interval]
--------+----------------------------------------------------------------
str | -2.279808 .5194892 -4.38 0.000 -3.300945 -1.258671
_cons | 698.933 10.36436 67.44 0.000 678.5602 719.3057
-------------------------------------------------------------------------
so:
· 2
1
1
698.9 2.28 , , ,
( ) ( )
( 0) , -value (2-sided)
395% 2-sided conf. .interval
10.4 0.52
.
30for is
0
1.2, 6
5
( )

  
    
  
TestScore STR R SER
t p
1
4.38 0.00
8
0
.6

SUMMARY OF STATISTICAL
INFERENCE ABOUT Β0 AND Β1
Estimation:
0 1
0 1
ˆ ˆOLS estimators and
ˆ ˆand have approximately normal sampling distributions in large samples
 
 
•
•
Testing:
• H0: β1 = β1,0 v. β1 ≠ β1,0 (β1,0 is the value of β1 under H0)
1 1,0 1
ˆ ˆ ˆ( )/ ( )t SE   •
• p-value = area under standard normal outside tact (large n)
Confidence Intervals:
• This is the set of β1 that is not rejected at the 5% level
• The 95% CI contains the true β1 in 95% of all samples.
1 1 1
ˆ ˆ95% confidence interval for is { 1.96 ( )}SE   •

More Related Content

What's hot

Propteties of Standard Deviation
Propteties of Standard DeviationPropteties of Standard Deviation
Propteties of Standard DeviationSahil Jindal
 
The Normal Distribution and Other Continuous Distributions
The Normal Distribution and Other Continuous DistributionsThe Normal Distribution and Other Continuous Distributions
The Normal Distribution and Other Continuous DistributionsYesica Adicondro
 
Standard deviation (3)
Standard deviation (3)Standard deviation (3)
Standard deviation (3)Sonali Prasad
 
Variance and standard deviation
Variance and standard deviationVariance and standard deviation
Variance and standard deviationAmrit Swaroop
 
Measure of Dispersion
Measure of DispersionMeasure of Dispersion
Measure of Dispersionelly_gaa
 
Standard deviation
Standard deviationStandard deviation
Standard deviationMai Ngoc Duc
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceLong Beach City College
 
Standard Deviation
Standard DeviationStandard Deviation
Standard Deviationpwheeles
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Long Beach City College
 
Standard Deviation and Variance
Standard Deviation and VarianceStandard Deviation and Variance
Standard Deviation and VarianceJufil Hombria
 
Business Statistics Chapter 6
Business Statistics Chapter 6Business Statistics Chapter 6
Business Statistics Chapter 6Lux PP
 

What's hot (19)

Z-Scores
Z-ScoresZ-Scores
Z-Scores
 
coefficient variation
coefficient variationcoefficient variation
coefficient variation
 
Propteties of Standard Deviation
Propteties of Standard DeviationPropteties of Standard Deviation
Propteties of Standard Deviation
 
The Normal Distribution and Other Continuous Distributions
The Normal Distribution and Other Continuous DistributionsThe Normal Distribution and Other Continuous Distributions
The Normal Distribution and Other Continuous Distributions
 
z-scores
z-scoresz-scores
z-scores
 
Standard deviation (3)
Standard deviation (3)Standard deviation (3)
Standard deviation (3)
 
Static Models of Continuous Variables
Static Models of Continuous VariablesStatic Models of Continuous Variables
Static Models of Continuous Variables
 
Variance and standard deviation
Variance and standard deviationVariance and standard deviation
Variance and standard deviation
 
Measure of Dispersion
Measure of DispersionMeasure of Dispersion
Measure of Dispersion
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
Sd
SdSd
Sd
 
Z scores
Z scoresZ scores
Z scores
 
ERF Training Workshop Panel Data 3
ERF Training WorkshopPanel Data 3ERF Training WorkshopPanel Data 3
ERF Training Workshop Panel Data 3
 
Chapter11
Chapter11Chapter11
Chapter11
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Standard Deviation
Standard DeviationStandard Deviation
Standard Deviation
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Standard Deviation and Variance
Standard Deviation and VarianceStandard Deviation and Variance
Standard Deviation and Variance
 
Business Statistics Chapter 6
Business Statistics Chapter 6Business Statistics Chapter 6
Business Statistics Chapter 6
 

Similar to Does Class Size Impact Test Scores

Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceLong Beach City College
 
Chi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarChi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarAzmi Mohd Tamil
 
Lec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing dataLec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing dataMohamadKharseh1
 
Hypothesis testing - II.pptx
Hypothesis testing - II.pptxHypothesis testing - II.pptx
Hypothesis testing - II.pptxShashvatSingh12
 
Statistics
StatisticsStatistics
Statisticsmegamsma
 
02 PSBE3_PPT.Ch01_2_Examining Distribution.ppt
02 PSBE3_PPT.Ch01_2_Examining Distribution.ppt02 PSBE3_PPT.Ch01_2_Examining Distribution.ppt
02 PSBE3_PPT.Ch01_2_Examining Distribution.pptBishoyRomani
 
The standard normal curve & its application in biomedical sciences
The standard normal curve & its application in biomedical sciencesThe standard normal curve & its application in biomedical sciences
The standard normal curve & its application in biomedical sciencesAbhi Manu
 
Ali, Redescending M-estimator
Ali, Redescending M-estimator Ali, Redescending M-estimator
Ali, Redescending M-estimator Muhammad Ali
 
Admission in India
Admission in IndiaAdmission in India
Admission in IndiaEdhole.com
 
BUS173 Lecture 5.pdf
BUS173 Lecture 5.pdfBUS173 Lecture 5.pdf
BUS173 Lecture 5.pdfSusantoSaha1
 
simple linear regression - brief introduction
simple linear regression - brief introductionsimple linear regression - brief introduction
simple linear regression - brief introductionedinyoka
 
Advanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptxAdvanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptxakashayosha
 

Similar to Does Class Size Impact Test Scores (20)

Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Chi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarChi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemar
 
Lec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing dataLec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing data
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Two Means, Independent Samples
Two Means, Independent SamplesTwo Means, Independent Samples
Two Means, Independent Samples
 
Cairo 02 Stat Inference
Cairo 02 Stat InferenceCairo 02 Stat Inference
Cairo 02 Stat Inference
 
Errors2
Errors2Errors2
Errors2
 
Hypothesis testing - II.pptx
Hypothesis testing - II.pptxHypothesis testing - II.pptx
Hypothesis testing - II.pptx
 
Statistics
StatisticsStatistics
Statistics
 
Medical statistics2
Medical statistics2Medical statistics2
Medical statistics2
 
02 PSBE3_PPT.Ch01_2_Examining Distribution.ppt
02 PSBE3_PPT.Ch01_2_Examining Distribution.ppt02 PSBE3_PPT.Ch01_2_Examining Distribution.ppt
02 PSBE3_PPT.Ch01_2_Examining Distribution.ppt
 
The standard normal curve & its application in biomedical sciences
The standard normal curve & its application in biomedical sciencesThe standard normal curve & its application in biomedical sciences
The standard normal curve & its application in biomedical sciences
 
Ali, Redescending M-estimator
Ali, Redescending M-estimator Ali, Redescending M-estimator
Ali, Redescending M-estimator
 
Admission in India
Admission in IndiaAdmission in India
Admission in India
 
BUS173 Lecture 5.pdf
BUS173 Lecture 5.pdfBUS173 Lecture 5.pdf
BUS173 Lecture 5.pdf
 
Statistics chm 235
Statistics chm 235Statistics chm 235
Statistics chm 235
 
simple linear regression - brief introduction
simple linear regression - brief introductionsimple linear regression - brief introduction
simple linear regression - brief introduction
 
L estimation
L estimationL estimation
L estimation
 
Advanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptxAdvanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptx
 
Chapter07.pdf
Chapter07.pdfChapter07.pdf
Chapter07.pdf
 

More from Ryan Herzog

Chapter 14 - Great Recession
Chapter 14 - Great RecessionChapter 14 - Great Recession
Chapter 14 - Great RecessionRyan Herzog
 
Chapter 13 - AD/AS
Chapter 13 - AD/ASChapter 13 - AD/AS
Chapter 13 - AD/ASRyan Herzog
 
Chapter 12 - Monetary Policy
Chapter 12 - Monetary PolicyChapter 12 - Monetary Policy
Chapter 12 - Monetary PolicyRyan Herzog
 
Chapter 11 - IS Curve
Chapter 11 - IS CurveChapter 11 - IS Curve
Chapter 11 - IS CurveRyan Herzog
 
Chapter 10 - Great Recession
Chapter 10 - Great RecessionChapter 10 - Great Recession
Chapter 10 - Great RecessionRyan Herzog
 
Chapter 9 - Short Run
Chapter 9 - Short RunChapter 9 - Short Run
Chapter 9 - Short RunRyan Herzog
 
Chapter 8 - Inflation
Chapter 8 - InflationChapter 8 - Inflation
Chapter 8 - InflationRyan Herzog
 
Chapter 7 - Labor Market
Chapter 7 - Labor MarketChapter 7 - Labor Market
Chapter 7 - Labor MarketRyan Herzog
 
Chapter 6 - Romer Model
Chapter 6 - Romer Model Chapter 6 - Romer Model
Chapter 6 - Romer Model Ryan Herzog
 
Chapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for GrowthChapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for GrowthRyan Herzog
 
Chapter 4 - Model of Production
Chapter 4 - Model of ProductionChapter 4 - Model of Production
Chapter 4 - Model of ProductionRyan Herzog
 
Chapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic GrowthChapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic GrowthRyan Herzog
 
Chapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the MacroeconomyChapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the MacroeconomyRyan Herzog
 
Topic 7 (questions)
Topic 7 (questions)Topic 7 (questions)
Topic 7 (questions)Ryan Herzog
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)Ryan Herzog
 
Topic 6 (model specification)
Topic 6 (model specification)Topic 6 (model specification)
Topic 6 (model specification)Ryan Herzog
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)Ryan Herzog
 
Topic 4 (binary)
Topic 4 (binary)Topic 4 (binary)
Topic 4 (binary)Ryan Herzog
 

More from Ryan Herzog (20)

Chapter 14 - Great Recession
Chapter 14 - Great RecessionChapter 14 - Great Recession
Chapter 14 - Great Recession
 
Chapter 13 - AD/AS
Chapter 13 - AD/ASChapter 13 - AD/AS
Chapter 13 - AD/AS
 
Chapter 12 - Monetary Policy
Chapter 12 - Monetary PolicyChapter 12 - Monetary Policy
Chapter 12 - Monetary Policy
 
Chapter 11 - IS Curve
Chapter 11 - IS CurveChapter 11 - IS Curve
Chapter 11 - IS Curve
 
Chapter 10 - Great Recession
Chapter 10 - Great RecessionChapter 10 - Great Recession
Chapter 10 - Great Recession
 
Chapter 9 - Short Run
Chapter 9 - Short RunChapter 9 - Short Run
Chapter 9 - Short Run
 
Chapter 8 - Inflation
Chapter 8 - InflationChapter 8 - Inflation
Chapter 8 - Inflation
 
Chapter 7 - Labor Market
Chapter 7 - Labor MarketChapter 7 - Labor Market
Chapter 7 - Labor Market
 
Chapter 6 - Romer Model
Chapter 6 - Romer Model Chapter 6 - Romer Model
Chapter 6 - Romer Model
 
Chapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for GrowthChapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for Growth
 
Chapter 4 - Model of Production
Chapter 4 - Model of ProductionChapter 4 - Model of Production
Chapter 4 - Model of Production
 
Chapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic GrowthChapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic Growth
 
Chapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the MacroeconomyChapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the Macroeconomy
 
Topic 7 (data)
Topic 7 (data)Topic 7 (data)
Topic 7 (data)
 
Inequality
InequalityInequality
Inequality
 
Topic 7 (questions)
Topic 7 (questions)Topic 7 (questions)
Topic 7 (questions)
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)
 
Topic 6 (model specification)
Topic 6 (model specification)Topic 6 (model specification)
Topic 6 (model specification)
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)
 
Topic 4 (binary)
Topic 4 (binary)Topic 4 (binary)
Topic 4 (binary)
 

Recently uploaded

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 

Recently uploaded (20)

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 

Does Class Size Impact Test Scores

  • 2. DO DISTRICTS WITH SMALLER CLASSES HAVE HIGHER TEST SCORES? Scatterplot of test score v. student-teacher ratio What does this figure show?
  • 3. WE NEED TO GET SOME NUMERICAL EVIDENCE ON WHETHER DISTRICTS WITH LOW STRS HAVE HIGHER TEST SCORES – BUT HOW? 1. Compare average test scores in districts with low STRs to those with high STRs (“estimation”) 2. Test the “null” hypothesis that the mean test scores in the two types of districts are the same, against the “alternative” hypothesis that they differ (“hypothesis testing”) 3. Estimate an interval for the difference in the mean test scores, high v. low STR districts (“confidence interval”)
  • 4. INITIAL DATA ANALYSIS: COMPARE DISTRICTS WITH “SMALL” (STR < 20) AND “LARGE” (STR ≥ 20) CLASS SIZES Class Size Average score Standard deviation (s2) n Small 657.4 19.4 238 Large 650.0 17.9 182 Y 1. Estimation of Δ = difference between group means 2. Test the hypothesis that Δ = 0 3. Construct a confidence interval for Δ
  • 5. 1. ESTIMATION largesmall small large 1 1small large 1 1 657.4 650.0 7.4 nn i i i i Y Y Y Y n n          Is this a large difference in a real-world sense? • Standard deviation across districts = 19.1 • Difference between 60th and 75th percentiles of test score distribution is 667.6 – 659.4 = 8.2 • This is a big enough difference to be important for school reform discussions, for parents, or for a school committee?
  • 6. 2. HYPOTHESIS TESTING 2 2 Difference-in-means test: compute the -statistic, (remember this?) ( )s l s l s l s l s s s l n n t Y Y Y Y t SE Y Y      2 2 1 where ( ) is the “standard error” of , the subscripts and/refer to “small” and “large” STR districts, and 1 ( ) (etc.) 1 s s l s l n s i s is SE Y Y Y Y s s Y Y n        •
  • 7. 2. HYPOTHESIS TESTING (2 OF 2) Compute the difference-of-means t-statistic: Size s2 n small 657.4 19.4 238 large 650.0 17.9 182 Y 2 2 2 2 19.4 17.9 238 182 657.4 650.0 7.4 4.05 1.83s l s l s l s s n n Y Y t        so reject (at the 5% significance level) the null hypothesis that the two means are the same. 1.96,t
  • 8. 3. CONFIDENCE INTERVAL A 95% confidence interval for the difference between the means is, 1.96 7.4 ( ) 1.96 1.83 (3.8, 11.0) ( )S l S lY Y Y YSE       Two equivalent statements: 1. The 95% confidence interval for Δ doesn’t include 0; 2. The hypothesis that Δ = 0 is rejected at the 5% level.
  • 9. HYPOTHESIS TESTING – OLS REGRESSION
  • 10. HYPOTHESIS TESTING AND THE STANDARD ERROR OF Β1 The objective is to test a hypothesis, like β1 = 0, using data – to reach a tentative conclusion whether the (null) hypothesis is correct or incorrect. General setup Null hypothesis and two-sided alternative: H0: β1 = β1,0 vs. H1: β1 ≠ β1,0 where β1,0 is the hypothesized value under the null. Null hypothesis and one-sided alternative: H0: β1 = β1,0 vs. H1: β1 < β1,0
  • 11. GENERAL APPROACH: CONSTRUCT T- STATISTIC, AND COMPUTE P-VALUE (OR COMPARE TO THE N(0,1) CRITICAL VALUE) :• In general estimator - hypothesized value standard error of the estimator t  where the SE of the estimator is the square root of an estimator of the variance of the estimator. ,0 : / Y Y Y t S n  • For testing the mean of Y ,• 1For testing 1 1,0 1 ˆ , ˆ( ) t SE      1 1 ˆwhere ( ) the square root of an estimator of the variance ˆof the sampling distribution of SE   
  • 12. 1 ˆ (1of 2)( )Formula for SE  Recall the expression for the variance of (large n): 2 1 2 2 2 2 var[( ) ]ˆvar( ) , where ( ) . ( ) ( ) i x i v i i X i X X X u v X u n n            1 2 2 ˆThe estimator of the variance of replaces the unknown population values of and by estimators constructed from the data:v X    1 2 2 2 1 ˆ 22 2 2 1 1 ˆ estimator of1 1 2 ˆ (estimator of ) 1 ( ) ˆ ˆwhere ( ) . n i v i n X i i i i i v n n n X X n v X X u                     
  • 13. 1 ˆ (2 of 2)( )Formula for SE  1 1 2 2 1 ˆ 2 2 1 2 ˆ1 1 1 ˆ 1 2 ˆ ˆ ˆ, where ( ) . 1 ( ) ˆ ˆˆ( ) the standard error of n i i i i i n i i v n v X X u n X X n SE                        This is a bit nasty, but: • It is less complicated than it seems. The numerator estimates var(v), the denominator estimates [var(X)]2. • Why the degrees-of-freedom adjustment n – 2? Because two coefficients have been estimated (β0 and β1). 1 ˆ( ) is computed by regression softwareSE • • Your regression software has memorized this formula so you don’t need to.
  • 14. SUMMARY: TO TEST H0: Β1 = Β1,0 V. H1: Β1 ≠ Β1,0 • Construct the t-statistic 1 1 1,0 1 1,0 2 1 ˆ ˆ ˆ ˆ( ) ˆ t SE            • Reject at 5% significance level if • The p-value is probability in tails of normal outside |tact|; you reject at the 5% significance level if the p- value is < 5%. t 1.96 [ ]Pr act t tp   1 ˆThis procedure relies on the large- approximation that is normally distributed; typically = 50 is large enough for the approximation to be excellent. n n •
  • 15. EXAMPLE: TEST SCORES AND STR, CALIFORNIA DATA (1 OF 2) Regression software reports the standard errors: 0 1 1 1,0 1,0 1 ˆ ˆ( ) 10.4 ( ) 0.52 ˆ 2.28 0 -statistic testing 0 4.38 ˆ 0.52( ) SE SE t SE                 • The 1% 2-sided significance level is 2.58, so we reject the null at the 1% significance level. • Alternatively, we can compute the p-value… Estimated regression line: Test Score = 698.9 – 2.28 x STR
  • 16. EXAMPLE: TEST SCORES AND STR, CALIFORNIA DATA (2 OF 2) The p-value based on the large-n standard normal approximation to the t-statistic is 0.00001 (10–5)
  • 17. CONFIDENCE INTERVALS FOR Β1 (SECTION 5.2) Recall that a 95% confidence is, equivalently: • The set of points that cannot be rejected at the 5% significance level; • A set-valued function of the data (an interval that is a function of the data) that contains the true parameter value 95% of the time in repeated samples. Because the t-statistic for β1 is N(0,1) in large samples, construction of a 95% confidence for β1 is just like the case of the sample mean: 1 1 1 ˆ ˆ95% confidence interval for { 1.96 ( )}SE    
  • 18. 0 1 1 1 1 ˆ ˆ( ) 10.4 ( ) 0.52 ˆ95% confidence interval for : ˆ ˆ{ 1.96 ( )} { 2.28 1.96 0.52} ( 3.30, 1.26) SE SE SE                 CONFIDENCE INTERVAL EXAMPLE The following two statements are equivalent (why?) • The 95% confidence interval does not include zero; • The hypothesis β1 = 0 is rejected at the 5% level
  • 19. OLS REGRESSION: READING STATA OUTPUT regress testscr str, robust Regression with robust standard errors Number of obs = 420 F( 1, 418) = 19.26 Prob > F = 0.0000 R-squared = 0.0512 Root MSE = 18.581 ------------------------------------------------------------------------- | Robust testscr | Coef. Std. Err. t P>|t| [95% Conf. Interval] --------+---------------------------------------------------------------- str | -2.279808 .5194892 -4.38 0.000 -3.300945 -1.258671 _cons | 698.933 10.36436 67.44 0.000 678.5602 719.3057 ------------------------------------------------------------------------- so: · 2 1 1 698.9 2.28 , , , ( ) ( ) ( 0) , -value (2-sided) 395% 2-sided conf. .interval 10.4 0.52 . 30for is 0 1.2, 6 5 ( )             TestScore STR R SER t p 1 4.38 0.00 8 0 .6 
  • 20. SUMMARY OF STATISTICAL INFERENCE ABOUT Β0 AND Β1 Estimation: 0 1 0 1 ˆ ˆOLS estimators and ˆ ˆand have approximately normal sampling distributions in large samples     • • Testing: • H0: β1 = β1,0 v. β1 ≠ β1,0 (β1,0 is the value of β1 under H0) 1 1,0 1 ˆ ˆ ˆ( )/ ( )t SE   • • p-value = area under standard normal outside tact (large n) Confidence Intervals: • This is the set of β1 that is not rejected at the 5% level • The 95% CI contains the true β1 in 95% of all samples. 1 1 1 ˆ ˆ95% confidence interval for is { 1.96 ( )}SE   •

Editor's Notes

  1. gen less_20 = 0 replace less_20 = 1 if str<20 bysort less_20: sum testscr
  2. sum testscr if less_20==1 scalar mean_less20 = r(mean) sum testscr if less_20==0 scalar mean_more20 = r(mean)