SlideShare a Scribd company logo
1 of 108
Download to read offline
Learning more with every year:
School year productivity and international
learning divergence
Abhijeet Singh
University of Oxford
Presentation at RISE Summer Meeting
CGD June 2015
Introduction
What we know about learning outcomes in developing countries
Learning outcomes very poor in many developing countries,
especially in South Asia and SSA
Less known in comparative settings (under-represented in
PISA, TIMSS)
But available results suggest differences in performance within
developing countries is big
Vietnam-Peru gap in math is 1.4 SD in PISA 2012
US-Finland gap - 0.38 SD
Introduction
What we know about learning outcomes in developing countries
Learning outcomes very poor in many developing countries,
especially in South Asia and SSA
Less known in comparative settings (under-represented in
PISA, TIMSS)
But available results suggest differences in performance within
developing countries is big
Vietnam-Peru gap in math is 1.4 SD in PISA 2012
US-Finland gap - 0.38 SD
Introduction
What we know about learning outcomes in developing countries
Learning outcomes very poor in many developing countries,
especially in South Asia and SSA
Less known in comparative settings (under-represented in
PISA, TIMSS)
But available results suggest differences in performance within
developing countries is big
Vietnam-Peru gap in math is 1.4 SD in PISA 2012
US-Finland gap - 0.38 SD
Introduction
What we know about learning outcomes in developing countries
Learning outcomes very poor in many developing countries,
especially in South Asia and SSA
Less known in comparative settings (under-represented in
PISA, TIMSS)
But available results suggest differences in performance within
developing countries is big
Vietnam-Peru gap in math is 1.4 SD in PISA 2012
US-Finland gap - 0.38 SD
Introduction
What we know about learning outcomes in developing countries
Learning outcomes very poor in many developing countries,
especially in South Asia and SSA
Less known in comparative settings (under-represented in
PISA, TIMSS)
But available results suggest differences in performance within
developing countries is big
Vietnam-Peru gap in math is 1.4 SD in PISA 2012
US-Finland gap - 0.38 SD
What is this paper about?
Two key questions
At what age are gaps in human capital evident across
countries?
PISA, or even TIMSS, only study gaps well into schooling
How much does the differential productivity of primary
schooling affect international gaps in human capital?
Can we say anything about the relative effectiveness of
schooling systems?
What is this paper about?
Two key questions
At what age are gaps in human capital evident across
countries?
PISA, or even TIMSS, only study gaps well into schooling
How much does the differential productivity of primary
schooling affect international gaps in human capital?
Can we say anything about the relative effectiveness of
schooling systems?
What is this paper about?
Two key questions
At what age are gaps in human capital evident across
countries?
PISA, or even TIMSS, only study gaps well into schooling
How much does the differential productivity of primary
schooling affect international gaps in human capital?
Can we say anything about the relative effectiveness of
schooling systems?
What is this paper about?
Two key questions
At what age are gaps in human capital evident across
countries?
PISA, or even TIMSS, only study gaps well into schooling
How much does the differential productivity of primary
schooling affect international gaps in human capital?
Can we say anything about the relative effectiveness of
schooling systems?
Why this matters
Knowing when and how learning gaps evolve is informative for
understanding when policy interventions might work:
Effectiveness of interventions varies importantly across the age
of children
Understanding sources of divergence useful for identifying
domains in which intervention necessary
we don’t just want a league table.
Important differences between educational systems may have
important information for policy
But most economics of education in developing countries is
focused on specific interventions within a given institutional
setting
little work on ‘business-as-usual’ productivity of time spent in
school
Why this matters
Knowing when and how learning gaps evolve is informative for
understanding when policy interventions might work:
Effectiveness of interventions varies importantly across the age
of children
Understanding sources of divergence useful for identifying
domains in which intervention necessary
we don’t just want a league table.
Important differences between educational systems may have
important information for policy
But most economics of education in developing countries is
focused on specific interventions within a given institutional
setting
little work on ‘business-as-usual’ productivity of time spent in
school
Why this matters
Knowing when and how learning gaps evolve is informative for
understanding when policy interventions might work:
Effectiveness of interventions varies importantly across the age
of children
Understanding sources of divergence useful for identifying
domains in which intervention necessary
we don’t just want a league table.
Important differences between educational systems may have
important information for policy
But most economics of education in developing countries is
focused on specific interventions within a given institutional
setting
little work on ‘business-as-usual’ productivity of time spent in
school
What I do
Use child level panel data for Ethiopia, India, Peru and Vietnam to:
Compare distributions of achievement of children at 5 and 8
years across four Young Lives countries
Examine how the gap evolves over the age group of the
children
Is there growth between 5-8 years?
Do rankings change across ages?
Estimate value-added (VA) models examining sources of the
gap
Causally identify differential productivity of schooling with VA
and IV estimates using enrollment discontinuities
What I do
Use child level panel data for Ethiopia, India, Peru and Vietnam to:
Compare distributions of achievement of children at 5 and 8
years across four Young Lives countries
Examine how the gap evolves over the age group of the
children
Is there growth between 5-8 years?
Do rankings change across ages?
Estimate value-added (VA) models examining sources of the
gap
Causally identify differential productivity of schooling with VA
and IV estimates using enrollment discontinuities
What I do
Use child level panel data for Ethiopia, India, Peru and Vietnam to:
Compare distributions of achievement of children at 5 and 8
years across four Young Lives countries
Examine how the gap evolves over the age group of the
children
Is there growth between 5-8 years?
Do rankings change across ages?
Estimate value-added (VA) models examining sources of the
gap
Causally identify differential productivity of schooling with VA
and IV estimates using enrollment discontinuities
What I do
Use child level panel data for Ethiopia, India, Peru and Vietnam to:
Compare distributions of achievement of children at 5 and 8
years across four Young Lives countries
Examine how the gap evolves over the age group of the
children
Is there growth between 5-8 years?
Do rankings change across ages?
Estimate value-added (VA) models examining sources of the
gap
Causally identify differential productivity of schooling with VA
and IV estimates using enrollment discontinuities
What I do
Use child level panel data for Ethiopia, India, Peru and Vietnam to:
Compare distributions of achievement of children at 5 and 8
years across four Young Lives countries
Examine how the gap evolves over the age group of the
children
Is there growth between 5-8 years?
Do rankings change across ages?
Estimate value-added (VA) models examining sources of the
gap
Causally identify differential productivity of schooling with VA
and IV estimates using enrollment discontinuities
Contribution
First analysis of the emergence and evolution of gaps in
cognitive achievement across countries, using internationally
comparable child-level panel data, at a critical age for skill
formation
similar work on racial gaps in US, socio-economic gaps in the
UK etc. but nothing across countries
no studies of comparable age range in developing countries
Causal identification of learning-gains-per-year in different
countries using micro panel and RD-based identification
Contribution
First analysis of the emergence and evolution of gaps in
cognitive achievement across countries, using internationally
comparable child-level panel data, at a critical age for skill
formation
similar work on racial gaps in US, socio-economic gaps in the
UK etc. but nothing across countries
no studies of comparable age range in developing countries
Causal identification of learning-gains-per-year in different
countries using micro panel and RD-based identification
Contribution
First analysis of the emergence and evolution of gaps in
cognitive achievement across countries, using internationally
comparable child-level panel data, at a critical age for skill
formation
similar work on racial gaps in US, socio-economic gaps in the
UK etc. but nothing across countries
no studies of comparable age range in developing countries
Causal identification of learning-gains-per-year in different
countries using micro panel and RD-based identification
What this relates to
Literature
Methodologically:
Racial and gender test score gaps: e.g. Fryer and Levitt
(2004, 2006, 2010, 2013), Todd and Wolpin (2007)
Value-added models using HH panel data: Todd and Wolpin
(2003, 2007), Fiorini and Keane (2014)
Results:
Growth decompositions with varying school quality:
Hanushek and Kimko (2000), Hanushek and Woessmann
(various), Schoellmann (2012), Kaarsen (2014)
Cross-country productivity differences in diff sectors: Hall
and Jones (1996), Bloom and Van Reenen (2007), Gollin et.
al. (2014)
Micro studies on learning in developing countries: e.g.
Glewwe and Kremer (2006), Kremer et. al. (2013), McEwan
(2013)
What this relates to
Literature
Methodologically:
Racial and gender test score gaps: e.g. Fryer and Levitt
(2004, 2006, 2010, 2013), Todd and Wolpin (2007)
Value-added models using HH panel data: Todd and Wolpin
(2003, 2007), Fiorini and Keane (2014)
Results:
Growth decompositions with varying school quality:
Hanushek and Kimko (2000), Hanushek and Woessmann
(various), Schoellmann (2012), Kaarsen (2014)
Cross-country productivity differences in diff sectors: Hall
and Jones (1996), Bloom and Van Reenen (2007), Gollin et.
al. (2014)
Micro studies on learning in developing countries: e.g.
Glewwe and Kremer (2006), Kremer et. al. (2013), McEwan
(2013)
What this relates to
Literature
Methodologically:
Racial and gender test score gaps: e.g. Fryer and Levitt
(2004, 2006, 2010, 2013), Todd and Wolpin (2007)
Value-added models using HH panel data: Todd and Wolpin
(2003, 2007), Fiorini and Keane (2014)
Results:
Growth decompositions with varying school quality:
Hanushek and Kimko (2000), Hanushek and Woessmann
(various), Schoellmann (2012), Kaarsen (2014)
Cross-country productivity differences in diff sectors: Hall
and Jones (1996), Bloom and Van Reenen (2007), Gollin et.
al. (2014)
Micro studies on learning in developing countries: e.g.
Glewwe and Kremer (2006), Kremer et. al. (2013), McEwan
(2013)
Data
Young Lives survey structure
Round 1 Round 2 Round 3
051015
Ageinyears
O
ct2002
D
ec
2006
N
ov
2009
Time
Younger cohort Older cohort
Graph shows median age of children and time of interview across countries
By age of children
Timing of survey rounds
Data
Young Lives survey test data
Use data from the 2006/7 and 2009 rounds on quantitative
proficiency
Cognitive Development Assessment Quant. sub-scale for 5
year old sample
Mathematics tests for 8 year old children
Identical tests administered across all four countries in each
round
can be linked within round across the four countries using Item
Response Theory
Data
Young Lives survey test data
Use data from the 2006/7 and 2009 rounds on quantitative
proficiency
Cognitive Development Assessment Quant. sub-scale for 5
year old sample
Mathematics tests for 8 year old children
Identical tests administered across all four countries in each
round
can be linked within round across the four countries using Item
Response Theory
Data
Table : Descriptives on age and school progression
Cohort Variable Statistics Ethiopia India Peru Vietnam
YC 2006 (5-years) Enrolment Mean 0.04 0.45 0.01 0.01
YC 2009 (8-years) Enrolment Mean 0.77 0.99 0.98 0.98
OC 2006 (12-years) Enrolment Mean 0.95 0.89 0.99 0.97
OC 2009 (15-years) Enrolment Mean 0.89 0.77 0.92 0.77
Older cohort Age of entry Mean 7.19 5.04 5.88 6.07
SD 1.52 0.71 0.57 0.48
YC 2009 (8-years) Grade Mean 0.64 1.63 1.31 1.71
SD 0.77 1 0.58 0.57
Grade refers to highest grade completed
Learning differences at 5 and 8
Table : Linked test scores at 5,and 8 years
Age group Statistics Countries
Ethiopia India Peru Vietnam
5 years
Mean 454 498.3 520.4 524.7
N 1846 1904 1893 1935
8 years
Mean 419.1 495.9 518.2 563.6
N 1885 1930 1943 1964
Scores are IRT test scores generated within an age sample,pooling data from
all countries, and normalized to have a mean of 500 and an SD of 100 in the
pooled sample. Scores are comparable across countries but not across age
groups.
Do rankings change across age groups?
0.2.4.6.81
200 400 600 800 1000
CDA Scores, 2006
5 years
0.2.4.6.81
200 400 600 800 1000
Math Scores, 2009
8 years
Empirical CDFs
Distribution of achievement
Ethiopia India
Peru Vietnam
Rankings are unchanged but are the gaps growing?
Between 5 and 8 years of age
Rankings are unchanged but are the gaps growing?
Between 5 and 8 years of age
p10 p90400450500550600
Mathscores(2009)
300 400 500 600 700
CDA scores (2006)
Ethiopia India
Peru Vietnam
Where are the gaps coming from?
Knowing differences in levels and trends between countries
informative but not enough.
Even trend differences need not imply differential effectiveness
of schools across countries
endowments differ - e.g. parental education, home inputs,
nutrition, other environmental differences
but differential effectiveness, and malleable environmental
sources of learning divergence, are where policy might make a
difference
Where are the gaps coming from?
Knowing differences in levels and trends between countries
informative but not enough.
Even trend differences need not imply differential effectiveness
of schools across countries
endowments differ - e.g. parental education, home inputs,
nutrition, other environmental differences
but differential effectiveness, and malleable environmental
sources of learning divergence, are where policy might make a
difference
Where are the gaps coming from?
Knowing differences in levels and trends between countries
informative but not enough.
Even trend differences need not imply differential effectiveness
of schools across countries
endowments differ - e.g. parental education, home inputs,
nutrition, other environmental differences
but differential effectiveness, and malleable environmental
sources of learning divergence, are where policy might make a
difference
Where are the gaps coming from?
Knowing differences in levels and trends between countries
informative but not enough.
Even trend differences need not imply differential effectiveness
of schools across countries
endowments differ - e.g. parental education, home inputs,
nutrition, other environmental differences
but differential effectiveness, and malleable environmental
sources of learning divergence, are where policy might make a
difference
Do child-specific endowments explain divergence?
Value-added models with common coefficients: Specifications
Yic,2009 = φc (1)
+β1.Yic,2006 (2)
+β2.Xi (3)
+β3.TUica + ϵica (4)
Xi (Background) - male, eldest child, wealth index, age,
caregiver’s education, height-for-age in 2009
TUica (time use) - time use on different activities
Yic,2006 (lagged achievement) - 2006 quantitative achievement
measures
A lot differs across samples
At 8 years of age
Ethiopia India Peru Vietnam
Mean SD N Mean SD N Mean SD N Mean SD N
Child and background characteristics (Xic)
Male 0.53 0.5 1881 0.53 0.5 1903 0.5 0.5 1892 0.51 0.5 1916
First born 0.23 0.42 1881 0.39 0.49 1903 0.37 0.48 1892 0.46 0.5 1916
Caregiver’s Education 2.95 3.73 1874 3.7 4.44 1900 7.75 4.64 1892 6.88 3.83 1908
Age in months 97.48 4.05 1879 96.03 3.92 1903 95.35 3.63 1890 97.09 3.75 1915
Height-for-age z-score -1.21 1.05 1877 -1.44 1.03 1898 -1.14 1.03 1890 -1.07 1.05 1900
Wealth index (2006) 0.28 0.18 1881 0.46 0.2 1902 0.47 0.23 1892 0.51 0.2 1914
Time use (hours spent on a typical day; TUic,a)
— Doing domestic tasks 1.66 1.37 1881 0.33 0.58 1903 0.87 0.7 1887 0.54 0.66 1899
— Tasks on family farm/business etc. 1.5 2.22 1880 0.01 0.1 1903 0.25 0.66 1886 0.09 0.48 1897
— Paid work outside household 0.01 0.28 1880 0.01 0.2 1903 0 0.08 1887 0 0.07 1897
— At school 4.91 2.54 1881 7.72 0.95 1903 6.02 0.9 1887 5.04 1.31 1898
— Studying outside school time 0.99 0.89 1881 1.86 1.09 1903 1.87 0.83 1886 2.82 1.49 1897
— General leisure etc. 4.44 2.39 1881 4.71 1.54 1903 4.13 1.65 1887 5.55 1.65 1898
— Caring for others 0.83 1.21 1881 0.21 0.5 1903 0.48 0.88 1886 0.24 0.66 1878
Do child-specific endowments explain divergence?
Value-added models with common coefficients: Results
(1) (2) (3) (4)
Dep var: Mathematics score (2009)
VARIABLES 8-years old
Country dummies
India 76.3*** 64.5*** 61.6*** 16.3***
(3.01) (2.92) (2.97) (3.58)
Peru 96.7*** 79.1*** 65.2*** 48.2***
(2.75) (2.71) (2.69) (2.85)
Vietnam 146*** 127*** 108*** 92.2***
(3.04) (3.06) (2.97) (3.45)
Lagged test scores Y Y Y
Background vars (Xic) Y Y
Time use (TUic,a) Y
Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1.
Does differential productivity of home inputs explain
divergence?
Country-specific production function estimates
Previous specification had a very strong implicit assumption:
the effect of inputs on achievement is the same across
countries
So I run the same specifications separately for each country
sample
allows for each input parameter to be different across countries
but makes interpretation difficult since four sets of input
coefficients
Key result: Between 5-8 years, divergence with Vietnam not
explained by levels of inputs
Does differential productivity of home inputs explain
divergence?
Country-specific production function estimates
Previous specification had a very strong implicit assumption:
the effect of inputs on achievement is the same across
countries
So I run the same specifications separately for each country
sample
allows for each input parameter to be different across countries
but makes interpretation difficult since four sets of input
coefficients
Key result: Between 5-8 years, divergence with Vietnam not
explained by levels of inputs
Does differential productivity of home inputs explain
divergence?
Country-specific production function estimates
Previous specification had a very strong implicit assumption:
the effect of inputs on achievement is the same across
countries
So I run the same specifications separately for each country
sample
allows for each input parameter to be different across countries
but makes interpretation difficult since four sets of input
coefficients
Key result: Between 5-8 years, divergence with Vietnam not
explained by levels of inputs
Does differential productivity of home inputs explain
divergence?
Country-specific production function estimates
Previous specification had a very strong implicit assumption:
the effect of inputs on achievement is the same across
countries
So I run the same specifications separately for each country
sample
allows for each input parameter to be different across countries
but makes interpretation difficult since four sets of input
coefficients
Key result: Between 5-8 years, divergence with Vietnam not
explained by levels of inputs
Does differential productivity of home inputs explain
divergence?
Country-specific production function estimates
Previous specification had a very strong implicit assumption:
the effect of inputs on achievement is the same across
countries
So I run the same specifications separately for each country
sample
allows for each input parameter to be different across countries
but makes interpretation difficult since four sets of input
coefficients
Key result: Between 5-8 years, divergence with Vietnam not
explained by levels of inputs
Predicted mean scores under counterfactual scenarios
8-year olds
Coefficients (βc)
Without time use With time use
Ethiopia India Peru Vietnam Ethiopia India Peru Vietnam
Ethiopia 420.79 485.28 495.47 523.15 420.75 390.94 486.66 488.38
(9.87) (10.64) (5.49) (13.48) (10.85) (16.72) (9.62) (19.19)
Inputs India 450.36 497.32 503.74 539.9 487.38 497.32 516.86 563.24
(Xic; TUica) (11.54) (9.59) (4.97) (11.02) (10.39) (9.87) (7.99) (14.79)
Yic,a−1 Peru 470.66 514.64 517.73 559.32 479.48 468.87 517.74 557.66
(11.35) (10.7) (4.65) (10.53) (10.93) (10.96) (5.65) (11.68)
Vietnam 478.69 518.05 522.35 567.03 492.1 476.78 520.84 568.22
(11.08) (9.76) (4.51) (9.16) (12.06) (13.14) (7.09) (11.43)
Cells contain linear predictions of test scores using combinations of country-specific production function
parameters (βc ) with country-specific input levels (Xic and TUic). Standard errors of predictions
in parentheses.
Estimating the quality of schooling
Specifications above include no schooling measures
But we know exposure of schooling differs, esp. in Ethiopia
Suspect that quality of schooling differs too
What I do: include highest grade completed in the
specifications and re-estimate
Identification reliant on relevant unobserved heterogeneity
being absorbed by controls and lag
Will show RD-type IV estimates
These are the most ‘complete’ VA specifications in the paper
Estimating the quality of schooling
Specifications above include no schooling measures
But we know exposure of schooling differs, esp. in Ethiopia
Suspect that quality of schooling differs too
What I do: include highest grade completed in the
specifications and re-estimate
Identification reliant on relevant unobserved heterogeneity
being absorbed by controls and lag
Will show RD-type IV estimates
These are the most ‘complete’ VA specifications in the paper
Estimating the quality of schooling
Specifications above include no schooling measures
But we know exposure of schooling differs, esp. in Ethiopia
Suspect that quality of schooling differs too
What I do: include highest grade completed in the
specifications and re-estimate
Identification reliant on relevant unobserved heterogeneity
being absorbed by controls and lag
Will show RD-type IV estimates
These are the most ‘complete’ VA specifications in the paper
VAMs with grade effectiveness
8-year olds
(1) (2) (3) (4) (5) (6) (7) (8)
VARIABLES Dep var: Mathematics score (2009)
Without time use With time use
Ethiopia India Peru Vietnam Ethiopia India Peru Vietnam
Highest grade completed 40.9*** 27.4*** 33.6*** 60.9*** 28.4*** 25.4*** 32.6*** 55.2***
(4.67) (2.03) (3.60) (14.6) (4.48) (1.62) (3.55) (10.9)
Male 3.26 12.7*** 8.73*** 1.65 4.44 11.6*** 8.92*** 1.62
(5.61) (3.05) (2.22) (2.39) (4.82) (3.13) (2.47) (2.66)
Caregiver’s education level 3.76*** 2.40*** 2.23*** 3.16*** 2.74*** 1.86*** 2.10*** 2.18***
(0.66) (0.70) (0.49) (0.80) (0.52) (0.49) (0.48) (0.72)
Age in months 1.26** 0.51 -0.067 0.18 1.30** 0.60 0.0079 0.69
(0.53) (0.45) (0.30) (1.10) (0.56) (0.41) (0.30) (0.87)
Height-for-age (2009) 9.31*** 5.38** 5.22** 7.14*** 5.30** 4.79** 4.82** 4.81***
(2.64) (2.21) (1.92) (1.78) (2.33) (1.85) (1.73) (1.56)
Wealth index (2006) 151*** 53.6** 17.6* 78.3*** 105*** 31.0* 18.1* 59.0***
(25.9) (23.8) (8.80) (20.9) (18.8) (17.8) (8.91) (19.0)
Lagged CDA scores (2006) 0.067*** 0.13*** 0.100*** 0.065* 0.045* 0.12*** 0.100*** 0.049
(0.023) (0.027) (0.021) (0.032) (0.022) (0.027) (0.020) (0.030)
Constant 196*** 306*** 401*** 354*** 129* 97.6* 313*** 333***
(49.2) (45.5) (29.5) (74.1) (72.0) (53.8) (38.8) (65.5)
Observations 1,835 1,892 1,888 1,907 1,834 1,892 1,881 1,858
R-squared 0.340 0.276 0.343 0.437 0.410 0.365 0.370 0.458
Robust standard errors in parentheses. Standard errors are clustered at site level. *** p<0.01, ** p<0.05, * p<0.1
Can you trust VA estimates
Comparing with IV results
What if you don’t believe that grades completed are
conditionally exogenous?
Identification reliant on relevant unobserved heterogeneity
being absorbed by controls and lag
Way out: try looking for an IV which affects the highest grade
completed at a particular age
but does not directly determine learning, conditional on
controls
Solution: Plausibly exogenous variation coming from
enrolment thresholds
Creates discontinuity in the number of grades completed at
particular calendar months
Conditional on age and previous learning, should be excludable
Can you trust VA estimates
Comparing with IV results
What if you don’t believe that grades completed are
conditionally exogenous?
Identification reliant on relevant unobserved heterogeneity
being absorbed by controls and lag
Way out: try looking for an IV which affects the highest grade
completed at a particular age
but does not directly determine learning, conditional on
controls
Solution: Plausibly exogenous variation coming from
enrolment thresholds
Creates discontinuity in the number of grades completed at
particular calendar months
Conditional on age and previous learning, should be excludable
Can you trust VA estimates
Comparing with IV results
What if you don’t believe that grades completed are
conditionally exogenous?
Identification reliant on relevant unobserved heterogeneity
being absorbed by controls and lag
Way out: try looking for an IV which affects the highest grade
completed at a particular age
but does not directly determine learning, conditional on
controls
Solution: Plausibly exogenous variation coming from
enrolment thresholds
Creates discontinuity in the number of grades completed at
particular calendar months
Conditional on age and previous learning, should be excludable
Can you trust VA estimates
Comparing with IV results
What if you don’t believe that grades completed are
conditionally exogenous?
Identification reliant on relevant unobserved heterogeneity
being absorbed by controls and lag
Way out: try looking for an IV which affects the highest grade
completed at a particular age
but does not directly determine learning, conditional on
controls
Solution: Plausibly exogenous variation coming from
enrolment thresholds
Creates discontinuity in the number of grades completed at
particular calendar months
Conditional on age and previous learning, should be excludable
Enrolment threshold based discontinuities in grade
completion
0.511.520.511.52
Jan
01Feb
01M
ar01Apr01M
ay
01Jun
01Jul01Aug
01Sept01O
ct01N
ov
01D
ec
01Jan
02Feb
02M
ar02Apr02M
ay
02Jun
02Jul02Aug
02Sep
02O
ct02O
ct02
Jan
01Feb
01M
ar01Apr01M
ay
01Jun
01Jul01Aug
01Sept01O
ct01N
ov
01D
ec
01Jan
02Feb
02M
ar02Apr02M
ay
02Jun
02Jul02Aug
02Sep
02O
ct02O
ct02
Jan
01Feb
01M
ar01Apr01M
ay
01Jun
01Jul01Aug
01Sept01O
ct01N
ov
01D
ec
01Jan
02Feb
02M
ar02Apr02M
ay
02Jun
02Jul02Aug
02Sep
02O
ct02O
ct02
Jan
01Feb
01M
ar01Apr01M
ay
01Jun
01Jul01Aug
01Sept01O
ct01N
ov
01D
ec
01Jan
02Feb
02M
ar02Apr02M
ay
02Jun
02Jul02Aug
02Sep
02O
ct02O
ct02
Ethiopia India
Peru Vietnam
Averagegradeattained
By month of birth
Mean grade completed by 2009
IV specifications
First stage:
gradesi,2009 = µ + γ1.Thresholdi + γ2.Xi + γ3.sitei + ϵica (5)
Second stage:
Yic,a = αc + β1.Yic,a−1 + β2.Xic + β3.gradeica + γ.sitei + ϵica
+β4.TUic,a
Same as old VAM but for inclusion of site fixed effects
OK here because not comparing constant terms
IV specifications
First stage:
gradesi,2009 = µ + γ1.Thresholdi + γ2.Xi + γ3.sitei + ϵica (5)
Second stage:
Yic,a = αc + β1.Yic,a−1 + β2.Xic + β3.gradeica + γ.sitei + ϵica
+β4.TUic,a
Same as old VAM but for inclusion of site fixed effects
OK here because not comparing constant terms
Discontinuity based results on grade effectiveness
Peru and Vietnam
(1) (2) (3) (4)
VARIABLES Dep var: Math scores (2009)
Peru Vietnam
Highest grade completed 20.1*** 20.9*** 47.3*** 46.3***
(7.61) (7.96) (7.49) (7.16)
Male 9.43*** 9.96*** 1.34 1.56
(2.39) (2.63) (2.36) (2.46)
Caregiver’s education level 2.31*** 2.14*** 3.05*** 2.41***
(0.40) (0.37) (0.61) (0.55)
Age in months 0.94 0.87 0.41 0.64
(0.66) (0.71) (0.57) (0.53)
Height-for-age (2009) 6.15*** 5.59*** 6.00*** 4.18***
(2.20) (2.00) (1.96) (1.44)
Wealth index (2006) 29.7*** 29.0*** 40.2** 28.6**
(7.67) (7.84) (16.2) (13.4)
Lagged CDA scores (2006) 0.13*** 0.12*** 0.11*** 0.088***
(0.020) (0.020) (0.031) (0.027)
Constant 290*** 227*** 375*** 316***
(58.2) (69.2) (55.5) (60.2)
Observations 1,888 1,881 1,907 1,858
R-squared 0.366 0.393 0.481 0.504
Kleibergen-Paap F-statistic 108 110 113 152
Robust standard errors in parentheses. Standard errors are clustered at site level. *** p<0.01, ** p<0.05, * p<0.1
Test scores are IRT scores normalized to have a mean of 500 and SD of 100 in the pooled four-country
sample at each age. Estimation includes a vector of site fixed effects and other covariates, coefficients for which are not reported.
Robustness checks
Flexible lags: Possibility (even suggestion) of non-linearity in
the effect of lag on current achievement
estimate everything with third-order polynomial of lag / bins of
achievement
Measurement error in the lag
instrument lag with vocabulary test in the 8-year old cohort
assumes independent measurement error across tests
Overall: Persistence parameter might be off but basic story
stays.
Robustness checks
Flexible lags: Possibility (even suggestion) of non-linearity in
the effect of lag on current achievement
estimate everything with third-order polynomial of lag / bins of
achievement
Measurement error in the lag
instrument lag with vocabulary test in the 8-year old cohort
assumes independent measurement error across tests
Overall: Persistence parameter might be off but basic story
stays.
Robustness checks
Flexible lags: Possibility (even suggestion) of non-linearity in
the effect of lag on current achievement
estimate everything with third-order polynomial of lag / bins of
achievement
Measurement error in the lag
instrument lag with vocabulary test in the 8-year old cohort
assumes independent measurement error across tests
Overall: Persistence parameter might be off but basic story
stays.
Robustness checks
Flexible lags: Possibility (even suggestion) of non-linearity in
the effect of lag on current achievement
estimate everything with third-order polynomial of lag / bins of
achievement
Measurement error in the lag
instrument lag with vocabulary test in the 8-year old cohort
assumes independent measurement error across tests
Overall: Persistence parameter might be off but basic story
stays.
Robustness checks
Flexible lags: Possibility (even suggestion) of non-linearity in
the effect of lag on current achievement
estimate everything with third-order polynomial of lag / bins of
achievement
Measurement error in the lag
instrument lag with vocabulary test in the 8-year old cohort
assumes independent measurement error across tests
Overall: Persistence parameter might be off but basic story
stays.
Robustness checks
Flexible lags: Possibility (even suggestion) of non-linearity in
the effect of lag on current achievement
estimate everything with third-order polynomial of lag / bins of
achievement
Measurement error in the lag
instrument lag with vocabulary test in the 8-year old cohort
assumes independent measurement error across tests
Overall: Persistence parameter might be off but basic story
stays.
Pulling it all together
Levels of learning are low except for Vietnam
Differences start early by 5 and grow further later
Between 5-8, divergence with Vietnam reflects differential
effectiveness of a school year
School productivity differences are huge!
These productivity differences are causally identified
Pulling it all together
Levels of learning are low except for Vietnam
Differences start early by 5 and grow further later
Between 5-8, divergence with Vietnam reflects differential
effectiveness of a school year
School productivity differences are huge!
These productivity differences are causally identified
Pulling it all together
Levels of learning are low except for Vietnam
Differences start early by 5 and grow further later
Between 5-8, divergence with Vietnam reflects differential
effectiveness of a school year
School productivity differences are huge!
These productivity differences are causally identified
Pulling it all together
Levels of learning are low except for Vietnam
Differences start early by 5 and grow further later
Between 5-8, divergence with Vietnam reflects differential
effectiveness of a school year
School productivity differences are huge!
These productivity differences are causally identified
Pulling it all together
Levels of learning are low except for Vietnam
Differences start early by 5 and grow further later
Between 5-8, divergence with Vietnam reflects differential
effectiveness of a school year
School productivity differences are huge!
These productivity differences are causally identified
What these results imply
Early divergence provides suggestive support for preschool
interventions
Evidence (except on nutrition) usually based on OECD or LAC
But major divergence after 5 is due to differences in school
productivity at primary school level
It isn’t all over by 5. School productivity is a variable policy
can affect!
Differences in school productivity across countries raise an
important question:
why is productivity so much higher in some countries?
This is not the focus of most of the work in education in dev
econ (but still important)
What these results imply
Early divergence provides suggestive support for preschool
interventions
Evidence (except on nutrition) usually based on OECD or LAC
But major divergence after 5 is due to differences in school
productivity at primary school level
It isn’t all over by 5. School productivity is a variable policy
can affect!
Differences in school productivity across countries raise an
important question:
why is productivity so much higher in some countries?
This is not the focus of most of the work in education in dev
econ (but still important)
What these results imply
Early divergence provides suggestive support for preschool
interventions
Evidence (except on nutrition) usually based on OECD or LAC
But major divergence after 5 is due to differences in school
productivity at primary school level
It isn’t all over by 5. School productivity is a variable policy
can affect!
Differences in school productivity across countries raise an
important question:
why is productivity so much higher in some countries?
This is not the focus of most of the work in education in dev
econ (but still important)
What these results imply
Early divergence provides suggestive support for preschool
interventions
Evidence (except on nutrition) usually based on OECD or LAC
But major divergence after 5 is due to differences in school
productivity at primary school level
It isn’t all over by 5. School productivity is a variable policy
can affect!
Differences in school productivity across countries raise an
important question:
why is productivity so much higher in some countries?
This is not the focus of most of the work in education in dev
econ (but still important)
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
How should we comparably measure school productivity
A pitch for linked micro panels
Linked micro panel data has important advantages for this
question:
PISA, TIMSS go to very selective non-OECD countries
and the divergence has already happened by 15 years of age
and selection on enrollment and attendance
Returns to schooling for migrants in the US
differential selection of migrants makes individual country
estimates unreliable
Schoellman (2012) rankings put India way ahead of Vietnam,
Ethiopia and Peru
the exact opposite of what I find! (or levels in PISA!)
A lot could be done if educational interventions linked tests...
Comments/Questions/Feedback
Appendix: Item Response Theory
How I link scores
Decades long history in education and psychometrics – GRE,
GMAT, SAT, NAEP, TIMSS
The basic idea:The focus of IRT is at the item level.
Models the probability that an individual with given ability will
get an item right
The overall ability estimate (test score) generated by analyzing
an individual’s response to different items each defined by their
own characteristics
Many advantages (see e.g. Das and Zajonc, 2010):
Most importantly (for me) the ability to link
But also much better diagnostics for cross-cultural comparisons
Less arbitrary than summing up correct responses
Caveat: Linking requires common items across samples
can’t directly compare across age groups
Appendix: Item Response Theory
How I link scores
Decades long history in education and psychometrics – GRE,
GMAT, SAT, NAEP, TIMSS
The basic idea:The focus of IRT is at the item level.
Models the probability that an individual with given ability will
get an item right
The overall ability estimate (test score) generated by analyzing
an individual’s response to different items each defined by their
own characteristics
Many advantages (see e.g. Das and Zajonc, 2010):
Most importantly (for me) the ability to link
But also much better diagnostics for cross-cultural comparisons
Less arbitrary than summing up correct responses
Caveat: Linking requires common items across samples
can’t directly compare across age groups
Appendix: Item Response Theory
How I link scores
Decades long history in education and psychometrics – GRE,
GMAT, SAT, NAEP, TIMSS
The basic idea:The focus of IRT is at the item level.
Models the probability that an individual with given ability will
get an item right
The overall ability estimate (test score) generated by analyzing
an individual’s response to different items each defined by their
own characteristics
Many advantages (see e.g. Das and Zajonc, 2010):
Most importantly (for me) the ability to link
But also much better diagnostics for cross-cultural comparisons
Less arbitrary than summing up correct responses
Caveat: Linking requires common items across samples
can’t directly compare across age groups
Appendix: Item Response Theory
How I link scores
Decades long history in education and psychometrics – GRE,
GMAT, SAT, NAEP, TIMSS
The basic idea:The focus of IRT is at the item level.
Models the probability that an individual with given ability will
get an item right
The overall ability estimate (test score) generated by analyzing
an individual’s response to different items each defined by their
own characteristics
Many advantages (see e.g. Das and Zajonc, 2010):
Most importantly (for me) the ability to link
But also much better diagnostics for cross-cultural comparisons
Less arbitrary than summing up correct responses
Caveat: Linking requires common items across samples
can’t directly compare across age groups
Appendix: Item Response Theory
Item Characteristic Curve
Appendix: Item Response Theory
3 Parameter Logistic (3PL) Model
Item Response Function:
Pg (θi ) = cg +
1 − cg
1 + exp(−1.7.ag .(θi − bg))
(6)
cg is the pseudo-guessing parameter - with multiple choice
questions, even the lowest ability can get some answers right.
Set to zero for non-MCQ to get 2PL model
bg is the difficulty parameter - the level at which the
probability of getting item right is 0.5 in 2 PL
ag is the discrimination parameter - slope of the ICC at b –
how quickly the likelihood of success changes with respect to
ability.
Appendix: Item Response Theory
3 Parameter Logistic (3PL) Model
Item Response Function:
Pg (θi ) = cg +
1 − cg
1 + exp(−1.7.ag .(θi − bg))
(6)
cg is the pseudo-guessing parameter - with multiple choice
questions, even the lowest ability can get some answers right.
Set to zero for non-MCQ to get 2PL model
bg is the difficulty parameter - the level at which the
probability of getting item right is 0.5 in 2 PL
ag is the discrimination parameter - slope of the ICC at b –
how quickly the likelihood of success changes with respect to
ability.
Appendix: Item Response Theory
3 Parameter Logistic (3PL) Model
Item Response Function:
Pg (θi ) = cg +
1 − cg
1 + exp(−1.7.ag .(θi − bg))
(6)
cg is the pseudo-guessing parameter - with multiple choice
questions, even the lowest ability can get some answers right.
Set to zero for non-MCQ to get 2PL model
bg is the difficulty parameter - the level at which the
probability of getting item right is 0.5 in 2 PL
ag is the discrimination parameter - slope of the ICC at b –
how quickly the likelihood of success changes with respect to
ability.
Appendix: Item Response Theory
Core Assumptions
1. Unidimensionality - A single latent individual-specific trait
determines performance on the test
2. No Differential Item Functioning: Implicit in ICC, item
characteristics are person-invariant
2.1 particularly important in cross-cultural settings
3. (Conditional) local independence:
3.1 Item responses are independent across individuals (no
cheating!)
3.2 Conditional on ability, item responses are locally independent
across questions for the same individual
Under these assumptions, can recover estimates of ability and item
characteristics given matrix of responses by individuals
Appendix: Item Response Theory
Core Assumptions
1. Unidimensionality - A single latent individual-specific trait
determines performance on the test
2. No Differential Item Functioning: Implicit in ICC, item
characteristics are person-invariant
2.1 particularly important in cross-cultural settings
3. (Conditional) local independence:
3.1 Item responses are independent across individuals (no
cheating!)
3.2 Conditional on ability, item responses are locally independent
across questions for the same individual
Under these assumptions, can recover estimates of ability and item
characteristics given matrix of responses by individuals
Appendix: Item Response Theory
Core Assumptions
1. Unidimensionality - A single latent individual-specific trait
determines performance on the test
2. No Differential Item Functioning: Implicit in ICC, item
characteristics are person-invariant
2.1 particularly important in cross-cultural settings
3. (Conditional) local independence:
3.1 Item responses are independent across individuals (no
cheating!)
3.2 Conditional on ability, item responses are locally independent
across questions for the same individual
Under these assumptions, can recover estimates of ability and item
characteristics given matrix of responses by individuals
Appendix: Item Response Theory
Core Assumptions
1. Unidimensionality - A single latent individual-specific trait
determines performance on the test
2. No Differential Item Functioning: Implicit in ICC, item
characteristics are person-invariant
2.1 particularly important in cross-cultural settings
3. (Conditional) local independence:
3.1 Item responses are independent across individuals (no
cheating!)
3.2 Conditional on ability, item responses are locally independent
across questions for the same individual
Under these assumptions, can recover estimates of ability and item
characteristics given matrix of responses by individuals
Appendix: Item Response Theory
Core Assumptions
1. Unidimensionality - A single latent individual-specific trait
determines performance on the test
2. No Differential Item Functioning: Implicit in ICC, item
characteristics are person-invariant
2.1 particularly important in cross-cultural settings
3. (Conditional) local independence:
3.1 Item responses are independent across individuals (no
cheating!)
3.2 Conditional on ability, item responses are locally independent
across questions for the same individual
Under these assumptions, can recover estimates of ability and item
characteristics given matrix of responses by individuals
Appendix: Item Response Theory
Core Assumptions
1. Unidimensionality - A single latent individual-specific trait
determines performance on the test
2. No Differential Item Functioning: Implicit in ICC, item
characteristics are person-invariant
2.1 particularly important in cross-cultural settings
3. (Conditional) local independence:
3.1 Item responses are independent across individuals (no
cheating!)
3.2 Conditional on ability, item responses are locally independent
across questions for the same individual
Under these assumptions, can recover estimates of ability and item
characteristics given matrix of responses by individuals
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Item Response Theory
How does linking work?
IRT only identifies the latent ability up to a linear
transformation
need to fix the scale somewhere
e.g. fix min and max. GRE used to be from 200 to 800
(130-170 now)
or fix mean and SD. PISA and TIMSS have mean of 500 and
SD of 100
Item characteristics are fixed and can be used to link across
samples
common items serve as ‘anchors’ which bring two assessments
on a common scale
only a subset of items need to be common
Without sufficient common items:
Still can do IRT but scores not on comparable scales
important because then you can’t use panel methods such as
differencing or fixed effects without very strong assumptions
about the two distributions
Appendix: Differential Item Functioning
When it’s not a problem
C:/Users/pemb2850/Dropbox/Learning levels -
Appendix: Differential Item Functioning
When it IS a problem
C:/Users/pemb2850/Dropbox/Learning levels -

More Related Content

What's hot

Need for change presentation
Need for change presentationNeed for change presentation
Need for change presentationCharlene Clarke
 
Varkey Foundation Global Parent´s Survey
Varkey Foundation Global Parent´s SurveyVarkey Foundation Global Parent´s Survey
Varkey Foundation Global Parent´s SurveyFundación Varkey
 
Are our children learning? Literacy and Numeracy Across East Africa - 2013 r...
Are our children learning? Literacy and Numeracy Across East Africa -  2013 r...Are our children learning? Literacy and Numeracy Across East Africa -  2013 r...
Are our children learning? Literacy and Numeracy Across East Africa - 2013 r...KenyaSchoolReport.com
 
Career readiness during COVID: How schools can help students enter the labour...
Career readiness during COVID: How schools can help students enter the labour...Career readiness during COVID: How schools can help students enter the labour...
Career readiness during COVID: How schools can help students enter the labour...EduSkills OECD
 
Future of education - Austria
Future of education - AustriaFuture of education - Austria
Future of education - AustriaOECD
 
ELC Exxon Mobile Case Competition Winner Emory University
ELC Exxon Mobile Case Competition Winner Emory UniversityELC Exxon Mobile Case Competition Winner Emory University
ELC Exxon Mobile Case Competition Winner Emory UniversityIesha Scott
 
Enhancing Young Hispanic DLLs' Achievement
Enhancing Young Hispanic DLLs' AchievementEnhancing Young Hispanic DLLs' Achievement
Enhancing Young Hispanic DLLs' AchievementDebra Ackerman
 
LTS Exploring parental influence
LTS Exploring parental influenceLTS Exploring parental influence
LTS Exploring parental influenceDave Doucette
 
Gifted Educators Conference_ CF RESA, 12, 2010
Gifted Educators Conference_ CF RESA, 12, 2010Gifted Educators Conference_ CF RESA, 12, 2010
Gifted Educators Conference_ CF RESA, 12, 2010Beth Holmes
 
Nhsc transitions report
Nhsc transitions reportNhsc transitions report
Nhsc transitions reportHank Maine
 
William symonds frankfort sept 21
William symonds frankfort sept 21William symonds frankfort sept 21
William symonds frankfort sept 21KYTech
 
Research Paper Draft TPOLS496 Internship
Research Paper Draft TPOLS496 InternshipResearch Paper Draft TPOLS496 Internship
Research Paper Draft TPOLS496 InternshipJ. Bradon Rothschild
 
A Right Denied - The Critical Need For Genuine School Reform
A Right Denied - The Critical Need For Genuine School ReformA Right Denied - The Critical Need For Genuine School Reform
A Right Denied - The Critical Need For Genuine School ReformLeila Jerusalem
 
Achievement in america
Achievement in americaAchievement in america
Achievement in americankyec
 
Converge 2014: The Next Generation - Harms
Converge 2014: The Next Generation - HarmsConverge 2014: The Next Generation - Harms
Converge 2014: The Next Generation - HarmsConverge Consulting
 
The State of the Charter School Movement
The State of the Charter School Movement The State of the Charter School Movement
The State of the Charter School Movement Heather Buchheim
 
HLEG thematic workshop on "Inequality of Opportunity", Dirk van Damme
HLEG thematic workshop on "Inequality of Opportunity", Dirk van DammeHLEG thematic workshop on "Inequality of Opportunity", Dirk van Damme
HLEG thematic workshop on "Inequality of Opportunity", Dirk van DammeStatsCommunications
 

What's hot (18)

Need for change presentation
Need for change presentationNeed for change presentation
Need for change presentation
 
Varkey Foundation Global Parent´s Survey
Varkey Foundation Global Parent´s SurveyVarkey Foundation Global Parent´s Survey
Varkey Foundation Global Parent´s Survey
 
Are our children learning? Literacy and Numeracy Across East Africa - 2013 r...
Are our children learning? Literacy and Numeracy Across East Africa -  2013 r...Are our children learning? Literacy and Numeracy Across East Africa -  2013 r...
Are our children learning? Literacy and Numeracy Across East Africa - 2013 r...
 
Career readiness during COVID: How schools can help students enter the labour...
Career readiness during COVID: How schools can help students enter the labour...Career readiness during COVID: How schools can help students enter the labour...
Career readiness during COVID: How schools can help students enter the labour...
 
Future of education - Austria
Future of education - AustriaFuture of education - Austria
Future of education - Austria
 
ELC Exxon Mobile Case Competition Winner Emory University
ELC Exxon Mobile Case Competition Winner Emory UniversityELC Exxon Mobile Case Competition Winner Emory University
ELC Exxon Mobile Case Competition Winner Emory University
 
Enhancing Young Hispanic DLLs' Achievement
Enhancing Young Hispanic DLLs' AchievementEnhancing Young Hispanic DLLs' Achievement
Enhancing Young Hispanic DLLs' Achievement
 
LTS Exploring parental influence
LTS Exploring parental influenceLTS Exploring parental influence
LTS Exploring parental influence
 
Gifted Educators Conference_ CF RESA, 12, 2010
Gifted Educators Conference_ CF RESA, 12, 2010Gifted Educators Conference_ CF RESA, 12, 2010
Gifted Educators Conference_ CF RESA, 12, 2010
 
Nhsc transitions report
Nhsc transitions reportNhsc transitions report
Nhsc transitions report
 
William symonds frankfort sept 21
William symonds frankfort sept 21William symonds frankfort sept 21
William symonds frankfort sept 21
 
Wasb 2008
Wasb 2008Wasb 2008
Wasb 2008
 
Research Paper Draft TPOLS496 Internship
Research Paper Draft TPOLS496 InternshipResearch Paper Draft TPOLS496 Internship
Research Paper Draft TPOLS496 Internship
 
A Right Denied - The Critical Need For Genuine School Reform
A Right Denied - The Critical Need For Genuine School ReformA Right Denied - The Critical Need For Genuine School Reform
A Right Denied - The Critical Need For Genuine School Reform
 
Achievement in america
Achievement in americaAchievement in america
Achievement in america
 
Converge 2014: The Next Generation - Harms
Converge 2014: The Next Generation - HarmsConverge 2014: The Next Generation - Harms
Converge 2014: The Next Generation - Harms
 
The State of the Charter School Movement
The State of the Charter School Movement The State of the Charter School Movement
The State of the Charter School Movement
 
HLEG thematic workshop on "Inequality of Opportunity", Dirk van Damme
HLEG thematic workshop on "Inequality of Opportunity", Dirk van DammeHLEG thematic workshop on "Inequality of Opportunity", Dirk van Damme
HLEG thematic workshop on "Inequality of Opportunity", Dirk van Damme
 

Viewers also liked

Global citizenship
Global citizenshipGlobal citizenship
Global citizenshipFelix Dodds
 
Designing Lessons for Global Citizenship
Designing Lessons for Global CitizenshipDesigning Lessons for Global Citizenship
Designing Lessons for Global CitizenshipHonor Moorman
 
Intercultural learning sardegna
Intercultural  learning sardegnaIntercultural  learning sardegna
Intercultural learning sardegnasocialactiveyouth
 
Assessment Of Intercultural Learning Joe Van Dalen
Assessment Of Intercultural Learning Joe Van DalenAssessment Of Intercultural Learning Joe Van Dalen
Assessment Of Intercultural Learning Joe Van DalenMLTA of NSW
 
Going Global: Preparing Students for Global Citizenship
Going Global: Preparing Students for Global CitizenshipGoing Global: Preparing Students for Global Citizenship
Going Global: Preparing Students for Global CitizenshipLucy Gray
 
Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...
Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...
Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...Jack Chan
 
Global citizenship PCTHE presentation team john 2014
Global citizenship PCTHE presentation team john 2014Global citizenship PCTHE presentation team john 2014
Global citizenship PCTHE presentation team john 2014i_am_emma
 
Interc lote 2010
Interc lote 2010Interc lote 2010
Interc lote 2010Andrew Lian
 
Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...
Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...
Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...CIEE
 
Student Perspectives on Intercultural Learning from an Online Teacher Educati...
Student Perspectives on Intercultural Learning from an Online Teacher Educati...Student Perspectives on Intercultural Learning from an Online Teacher Educati...
Student Perspectives on Intercultural Learning from an Online Teacher Educati...Shannon Sauro
 
What is intercultural education?
What is intercultural education?What is intercultural education?
What is intercultural education?James Penstone
 
Developing Critical Thinking in Our Youngest Learners
Developing Critical Thinking in Our Youngest LearnersDeveloping Critical Thinking in Our Youngest Learners
Developing Critical Thinking in Our Youngest LearnersJennifer Jones
 

Viewers also liked (14)

Global citizenship
Global citizenshipGlobal citizenship
Global citizenship
 
Intercultural learning in brief
Intercultural learning in briefIntercultural learning in brief
Intercultural learning in brief
 
Designing Lessons for Global Citizenship
Designing Lessons for Global CitizenshipDesigning Lessons for Global Citizenship
Designing Lessons for Global Citizenship
 
Intercultural learning sardegna
Intercultural  learning sardegnaIntercultural  learning sardegna
Intercultural learning sardegna
 
Assessment Of Intercultural Learning Joe Van Dalen
Assessment Of Intercultural Learning Joe Van DalenAssessment Of Intercultural Learning Joe Van Dalen
Assessment Of Intercultural Learning Joe Van Dalen
 
Going Global: Preparing Students for Global Citizenship
Going Global: Preparing Students for Global CitizenshipGoing Global: Preparing Students for Global Citizenship
Going Global: Preparing Students for Global Citizenship
 
Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...
Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...
Study Tour in Boston, U.S.A -- Communities of Intercultural Learning Boston (...
 
Global citizenship PCTHE presentation team john 2014
Global citizenship PCTHE presentation team john 2014Global citizenship PCTHE presentation team john 2014
Global citizenship PCTHE presentation team john 2014
 
Interc lote 2010
Interc lote 2010Interc lote 2010
Interc lote 2010
 
INTERCULTURAL COMMUNICATION COMPETENCE
INTERCULTURAL COMMUNICATION COMPETENCEINTERCULTURAL COMMUNICATION COMPETENCE
INTERCULTURAL COMMUNICATION COMPETENCE
 
Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...
Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...
Intercultural Learning from the Inside Out: Supporting Faculty, Staff, and St...
 
Student Perspectives on Intercultural Learning from an Online Teacher Educati...
Student Perspectives on Intercultural Learning from an Online Teacher Educati...Student Perspectives on Intercultural Learning from an Online Teacher Educati...
Student Perspectives on Intercultural Learning from an Online Teacher Educati...
 
What is intercultural education?
What is intercultural education?What is intercultural education?
What is intercultural education?
 
Developing Critical Thinking in Our Youngest Learners
Developing Critical Thinking in Our Youngest LearnersDeveloping Critical Thinking in Our Youngest Learners
Developing Critical Thinking in Our Youngest Learners
 

Similar to Learning more with every year: School year productivity and international learning divergence

Unraveling a Secret: Vietnam's Outstanding Performance on the PISA test
Unraveling a Secret: Vietnam's Outstanding Performance on the PISA testUnraveling a Secret: Vietnam's Outstanding Performance on the PISA test
Unraveling a Secret: Vietnam's Outstanding Performance on the PISA testJame Quintina
 
Inequality is the Problem.pptx
Inequality is the Problem.pptxInequality is the Problem.pptx
Inequality is the Problem.pptxLucaMariaPesando1
 
Singer_CLASP_November_2015.pptx
Singer_CLASP_November_2015.pptxSinger_CLASP_November_2015.pptx
Singer_CLASP_November_2015.pptxKumarNatarajan24
 
Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022
Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022
Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022EduSkills OECD
 
Lifespan psychology lecture - 5.2
Lifespan psychology   lecture - 5.2Lifespan psychology   lecture - 5.2
Lifespan psychology lecture - 5.2kclancy
 
C Nofo Action Research Pt 2 2009
C Nofo Action Research Pt 2 2009C Nofo Action Research Pt 2 2009
C Nofo Action Research Pt 2 2009guest3f82a4
 
How Inequalities Develop Through Childhood (ISCI)
How Inequalities Develop Through Childhood (ISCI)How Inequalities Develop Through Childhood (ISCI)
How Inequalities Develop Through Childhood (ISCI)Young Lives Oxford
 
Measuring What Matters for Child Well-being and Policies - Key messages in a ...
Measuring What Matters for Child Well-being and Policies - Key messages in a ...Measuring What Matters for Child Well-being and Policies - Key messages in a ...
Measuring What Matters for Child Well-being and Policies - Key messages in a ...StatsCommunications
 
The High Cost of Low Educational Performance
The High Cost of Low Educational PerformanceThe High Cost of Low Educational Performance
The High Cost of Low Educational PerformanceMike Blamires
 
Starting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readinessStarting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readinessalester1025
 
3 Solutions to Support Greater Educational Equity Right Now
3 Solutions to Support Greater Educational Equity Right Now3 Solutions to Support Greater Educational Equity Right Now
3 Solutions to Support Greater Educational Equity Right NowDreamBox Learning
 
Starting On Track To Career And College Readiness
Starting On Track To Career And College ReadinessStarting On Track To Career And College Readiness
Starting On Track To Career And College Readinessalester1025
 
Starting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readinessStarting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readinessalester1025
 
2011 g-ottawa (pisa) - rev 1.1
2011 g-ottawa (pisa) - rev 1.12011 g-ottawa (pisa) - rev 1.1
2011 g-ottawa (pisa) - rev 1.1OECD
 
Webb_Dicaire_774Final
Webb_Dicaire_774FinalWebb_Dicaire_774Final
Webb_Dicaire_774FinalSimon Webb
 
Achieving Equitable Outcomes with Results-Based Accountability
Achieving Equitable Outcomes with Results-Based Accountability Achieving Equitable Outcomes with Results-Based Accountability
Achieving Equitable Outcomes with Results-Based Accountability Clear Impact
 

Similar to Learning more with every year: School year productivity and international learning divergence (20)

Singh a gender_gaps_9sept2016
Singh a gender_gaps_9sept2016Singh a gender_gaps_9sept2016
Singh a gender_gaps_9sept2016
 
Unraveling a Secret: Vietnam's Outstanding Performance on the PISA test
Unraveling a Secret: Vietnam's Outstanding Performance on the PISA testUnraveling a Secret: Vietnam's Outstanding Performance on the PISA test
Unraveling a Secret: Vietnam's Outstanding Performance on the PISA test
 
Inequality is the Problem.pptx
Inequality is the Problem.pptxInequality is the Problem.pptx
Inequality is the Problem.pptx
 
Singer_CLASP_November_2015.pptx
Singer_CLASP_November_2015.pptxSinger_CLASP_November_2015.pptx
Singer_CLASP_November_2015.pptx
 
Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022
Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022
Improving Early Equity: From evidence to action PPT from Webinar 26 October 2022
 
Favara 2016-espe
Favara 2016-espeFavara 2016-espe
Favara 2016-espe
 
Lifespan psychology lecture - 5.2
Lifespan psychology   lecture - 5.2Lifespan psychology   lecture - 5.2
Lifespan psychology lecture - 5.2
 
C Nofo Action Research Pt 2 2009
C Nofo Action Research Pt 2 2009C Nofo Action Research Pt 2 2009
C Nofo Action Research Pt 2 2009
 
How Inequalities Develop Through Childhood (ISCI)
How Inequalities Develop Through Childhood (ISCI)How Inequalities Develop Through Childhood (ISCI)
How Inequalities Develop Through Childhood (ISCI)
 
Measuring What Matters for Child Well-being and Policies - Key messages in a ...
Measuring What Matters for Child Well-being and Policies - Key messages in a ...Measuring What Matters for Child Well-being and Policies - Key messages in a ...
Measuring What Matters for Child Well-being and Policies - Key messages in a ...
 
2013 globalteacherstatusindex
2013 globalteacherstatusindex2013 globalteacherstatusindex
2013 globalteacherstatusindex
 
The High Cost of Low Educational Performance
The High Cost of Low Educational PerformanceThe High Cost of Low Educational Performance
The High Cost of Low Educational Performance
 
Starting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readinessStarting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readiness
 
3 Solutions to Support Greater Educational Equity Right Now
3 Solutions to Support Greater Educational Equity Right Now3 Solutions to Support Greater Educational Equity Right Now
3 Solutions to Support Greater Educational Equity Right Now
 
Starting On Track To Career And College Readiness
Starting On Track To Career And College ReadinessStarting On Track To Career And College Readiness
Starting On Track To Career And College Readiness
 
Starting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readinessStarting on track_to_career_and_college_readiness
Starting on track_to_career_and_college_readiness
 
LUSARDI Annamaria - 2014 Symposium to Advance Financial Literacy
LUSARDI Annamaria - 2014 Symposium to Advance Financial LiteracyLUSARDI Annamaria - 2014 Symposium to Advance Financial Literacy
LUSARDI Annamaria - 2014 Symposium to Advance Financial Literacy
 
2011 g-ottawa (pisa) - rev 1.1
2011 g-ottawa (pisa) - rev 1.12011 g-ottawa (pisa) - rev 1.1
2011 g-ottawa (pisa) - rev 1.1
 
Webb_Dicaire_774Final
Webb_Dicaire_774FinalWebb_Dicaire_774Final
Webb_Dicaire_774Final
 
Achieving Equitable Outcomes with Results-Based Accountability
Achieving Equitable Outcomes with Results-Based Accountability Achieving Equitable Outcomes with Results-Based Accountability
Achieving Equitable Outcomes with Results-Based Accountability
 

More from Young Lives Oxford

Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...
Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...
Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...Young Lives Oxford
 
Promoting Equitable Learning: Changing Teachers and Systems
Promoting Equitable Learning: Changing Teachers and SystemsPromoting Equitable Learning: Changing Teachers and Systems
Promoting Equitable Learning: Changing Teachers and SystemsYoung Lives Oxford
 
"Unlocking the black box: what's happening in 'more effective' classrooms in ...
"Unlocking the black box: what's happening in 'more effective' classrooms in ..."Unlocking the black box: what's happening in 'more effective' classrooms in ...
"Unlocking the black box: what's happening in 'more effective' classrooms in ...Young Lives Oxford
 
Challenges and Priorities - Child protection and use of evidence to inform po...
Challenges and Priorities - Child protection and use of evidence to inform po...Challenges and Priorities - Child protection and use of evidence to inform po...
Challenges and Priorities - Child protection and use of evidence to inform po...Young Lives Oxford
 
Ensure strong beginnings and support for development from conception to adole...
Ensure strong beginnings and support for development from conception to adole...Ensure strong beginnings and support for development from conception to adole...
Ensure strong beginnings and support for development from conception to adole...Young Lives Oxford
 
'How can we best support young people in situations of adversity?'
'How can we best support young people in situations of adversity?''How can we best support young people in situations of adversity?'
'How can we best support young people in situations of adversity?'Young Lives Oxford
 
Intersecting inequalities: Evidence from Young Lives India
Intersecting inequalities: Evidence from Young Lives IndiaIntersecting inequalities: Evidence from Young Lives India
Intersecting inequalities: Evidence from Young Lives IndiaYoung Lives Oxford
 
Young Lives 2016-17 School Survey: Value-added analysis and school effectiveness
Young Lives 2016-17 School Survey: Value-added analysis and school effectivenessYoung Lives 2016-17 School Survey: Value-added analysis and school effectiveness
Young Lives 2016-17 School Survey: Value-added analysis and school effectivenessYoung Lives Oxford
 
Inequalities in educational opportunities and outcomes in secondary schools i...
Inequalities in educational opportunities and outcomes in secondary schools i...Inequalities in educational opportunities and outcomes in secondary schools i...
Inequalities in educational opportunities and outcomes in secondary schools i...Young Lives Oxford
 
Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...
Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...
Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...Young Lives Oxford
 
System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...
System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...
System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...Young Lives Oxford
 
Beyond the basics: Access and equity in the expansion of post-compulsory scho...
Beyond the basics: Access and equity in the expansion of post-compulsory scho...Beyond the basics: Access and equity in the expansion of post-compulsory scho...
Beyond the basics: Access and equity in the expansion of post-compulsory scho...Young Lives Oxford
 
Private Schools in India: More Learning, More Inequality
Private Schools in India: More Learning, More InequalityPrivate Schools in India: More Learning, More Inequality
Private Schools in India: More Learning, More InequalityYoung Lives Oxford
 
Learn, Grow and Thrive CSW presentation
Learn, Grow and Thrive CSW presentationLearn, Grow and Thrive CSW presentation
Learn, Grow and Thrive CSW presentationYoung Lives Oxford
 
Key findings from the 2016-17 Young Lives School Survey in Vietnam
Key findings from the 2016-17 Young Lives School Survey in VietnamKey findings from the 2016-17 Young Lives School Survey in Vietnam
Key findings from the 2016-17 Young Lives School Survey in VietnamYoung Lives Oxford
 
Beating the Odds: Why have some children fared well despite growing up in pov...
Beating the Odds: Why have some children fared well despite growing up in pov...Beating the Odds: Why have some children fared well despite growing up in pov...
Beating the Odds: Why have some children fared well despite growing up in pov...Young Lives Oxford
 
Social determinants of wellbeing in early adolescence
Social determinants of wellbeing in early adolescenceSocial determinants of wellbeing in early adolescence
Social determinants of wellbeing in early adolescenceYoung Lives Oxford
 
Unequal opportunities: Inequalities in secondary education in India, Vietnam ...
Unequal opportunities: Inequalities in secondary education in India, Vietnam ...Unequal opportunities: Inequalities in secondary education in India, Vietnam ...
Unequal opportunities: Inequalities in secondary education in India, Vietnam ...Young Lives Oxford
 
Beyond the Basics: Access and equity in the expansion of post-compulsory scho...
Beyond the Basics: Access and equity in the expansion of post-compulsory scho...Beyond the Basics: Access and equity in the expansion of post-compulsory scho...
Beyond the Basics: Access and equity in the expansion of post-compulsory scho...Young Lives Oxford
 

More from Young Lives Oxford (20)

Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...
Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...
Marriage and Divorce among Adolescents: Before and After COVID19, why we can'...
 
Promoting Equitable Learning: Changing Teachers and Systems
Promoting Equitable Learning: Changing Teachers and SystemsPromoting Equitable Learning: Changing Teachers and Systems
Promoting Equitable Learning: Changing Teachers and Systems
 
"Unlocking the black box: what's happening in 'more effective' classrooms in ...
"Unlocking the black box: what's happening in 'more effective' classrooms in ..."Unlocking the black box: what's happening in 'more effective' classrooms in ...
"Unlocking the black box: what's happening in 'more effective' classrooms in ...
 
Gender and Violence
Gender and ViolenceGender and Violence
Gender and Violence
 
Challenges and Priorities - Child protection and use of evidence to inform po...
Challenges and Priorities - Child protection and use of evidence to inform po...Challenges and Priorities - Child protection and use of evidence to inform po...
Challenges and Priorities - Child protection and use of evidence to inform po...
 
Ensure strong beginnings and support for development from conception to adole...
Ensure strong beginnings and support for development from conception to adole...Ensure strong beginnings and support for development from conception to adole...
Ensure strong beginnings and support for development from conception to adole...
 
'How can we best support young people in situations of adversity?'
'How can we best support young people in situations of adversity?''How can we best support young people in situations of adversity?'
'How can we best support young people in situations of adversity?'
 
Intersecting inequalities: Evidence from Young Lives India
Intersecting inequalities: Evidence from Young Lives IndiaIntersecting inequalities: Evidence from Young Lives India
Intersecting inequalities: Evidence from Young Lives India
 
Young Lives 2016-17 School Survey: Value-added analysis and school effectiveness
Young Lives 2016-17 School Survey: Value-added analysis and school effectivenessYoung Lives 2016-17 School Survey: Value-added analysis and school effectiveness
Young Lives 2016-17 School Survey: Value-added analysis and school effectiveness
 
Inequalities in educational opportunities and outcomes in secondary schools i...
Inequalities in educational opportunities and outcomes in secondary schools i...Inequalities in educational opportunities and outcomes in secondary schools i...
Inequalities in educational opportunities and outcomes in secondary schools i...
 
Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...
Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...
Early-Life Undernourishment in Developing Countries: Prevalence, Impacts over...
 
System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...
System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...
System Expansion Step Three: Capitalising on Student Talents for a Middle-Inc...
 
Beyond the basics: Access and equity in the expansion of post-compulsory scho...
Beyond the basics: Access and equity in the expansion of post-compulsory scho...Beyond the basics: Access and equity in the expansion of post-compulsory scho...
Beyond the basics: Access and equity in the expansion of post-compulsory scho...
 
Private Schools in India: More Learning, More Inequality
Private Schools in India: More Learning, More InequalityPrivate Schools in India: More Learning, More Inequality
Private Schools in India: More Learning, More Inequality
 
Learn, Grow and Thrive CSW presentation
Learn, Grow and Thrive CSW presentationLearn, Grow and Thrive CSW presentation
Learn, Grow and Thrive CSW presentation
 
Key findings from the 2016-17 Young Lives School Survey in Vietnam
Key findings from the 2016-17 Young Lives School Survey in VietnamKey findings from the 2016-17 Young Lives School Survey in Vietnam
Key findings from the 2016-17 Young Lives School Survey in Vietnam
 
Beating the Odds: Why have some children fared well despite growing up in pov...
Beating the Odds: Why have some children fared well despite growing up in pov...Beating the Odds: Why have some children fared well despite growing up in pov...
Beating the Odds: Why have some children fared well despite growing up in pov...
 
Social determinants of wellbeing in early adolescence
Social determinants of wellbeing in early adolescenceSocial determinants of wellbeing in early adolescence
Social determinants of wellbeing in early adolescence
 
Unequal opportunities: Inequalities in secondary education in India, Vietnam ...
Unequal opportunities: Inequalities in secondary education in India, Vietnam ...Unequal opportunities: Inequalities in secondary education in India, Vietnam ...
Unequal opportunities: Inequalities in secondary education in India, Vietnam ...
 
Beyond the Basics: Access and equity in the expansion of post-compulsory scho...
Beyond the Basics: Access and equity in the expansion of post-compulsory scho...Beyond the Basics: Access and equity in the expansion of post-compulsory scho...
Beyond the Basics: Access and equity in the expansion of post-compulsory scho...
 

Recently uploaded

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 

Recently uploaded (20)

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 

Learning more with every year: School year productivity and international learning divergence

  • 1. Learning more with every year: School year productivity and international learning divergence Abhijeet Singh University of Oxford Presentation at RISE Summer Meeting CGD June 2015
  • 2. Introduction What we know about learning outcomes in developing countries Learning outcomes very poor in many developing countries, especially in South Asia and SSA Less known in comparative settings (under-represented in PISA, TIMSS) But available results suggest differences in performance within developing countries is big Vietnam-Peru gap in math is 1.4 SD in PISA 2012 US-Finland gap - 0.38 SD
  • 3. Introduction What we know about learning outcomes in developing countries Learning outcomes very poor in many developing countries, especially in South Asia and SSA Less known in comparative settings (under-represented in PISA, TIMSS) But available results suggest differences in performance within developing countries is big Vietnam-Peru gap in math is 1.4 SD in PISA 2012 US-Finland gap - 0.38 SD
  • 4. Introduction What we know about learning outcomes in developing countries Learning outcomes very poor in many developing countries, especially in South Asia and SSA Less known in comparative settings (under-represented in PISA, TIMSS) But available results suggest differences in performance within developing countries is big Vietnam-Peru gap in math is 1.4 SD in PISA 2012 US-Finland gap - 0.38 SD
  • 5. Introduction What we know about learning outcomes in developing countries Learning outcomes very poor in many developing countries, especially in South Asia and SSA Less known in comparative settings (under-represented in PISA, TIMSS) But available results suggest differences in performance within developing countries is big Vietnam-Peru gap in math is 1.4 SD in PISA 2012 US-Finland gap - 0.38 SD
  • 6. Introduction What we know about learning outcomes in developing countries Learning outcomes very poor in many developing countries, especially in South Asia and SSA Less known in comparative settings (under-represented in PISA, TIMSS) But available results suggest differences in performance within developing countries is big Vietnam-Peru gap in math is 1.4 SD in PISA 2012 US-Finland gap - 0.38 SD
  • 7. What is this paper about? Two key questions At what age are gaps in human capital evident across countries? PISA, or even TIMSS, only study gaps well into schooling How much does the differential productivity of primary schooling affect international gaps in human capital? Can we say anything about the relative effectiveness of schooling systems?
  • 8. What is this paper about? Two key questions At what age are gaps in human capital evident across countries? PISA, or even TIMSS, only study gaps well into schooling How much does the differential productivity of primary schooling affect international gaps in human capital? Can we say anything about the relative effectiveness of schooling systems?
  • 9. What is this paper about? Two key questions At what age are gaps in human capital evident across countries? PISA, or even TIMSS, only study gaps well into schooling How much does the differential productivity of primary schooling affect international gaps in human capital? Can we say anything about the relative effectiveness of schooling systems?
  • 10. What is this paper about? Two key questions At what age are gaps in human capital evident across countries? PISA, or even TIMSS, only study gaps well into schooling How much does the differential productivity of primary schooling affect international gaps in human capital? Can we say anything about the relative effectiveness of schooling systems?
  • 11. Why this matters Knowing when and how learning gaps evolve is informative for understanding when policy interventions might work: Effectiveness of interventions varies importantly across the age of children Understanding sources of divergence useful for identifying domains in which intervention necessary we don’t just want a league table. Important differences between educational systems may have important information for policy But most economics of education in developing countries is focused on specific interventions within a given institutional setting little work on ‘business-as-usual’ productivity of time spent in school
  • 12. Why this matters Knowing when and how learning gaps evolve is informative for understanding when policy interventions might work: Effectiveness of interventions varies importantly across the age of children Understanding sources of divergence useful for identifying domains in which intervention necessary we don’t just want a league table. Important differences between educational systems may have important information for policy But most economics of education in developing countries is focused on specific interventions within a given institutional setting little work on ‘business-as-usual’ productivity of time spent in school
  • 13. Why this matters Knowing when and how learning gaps evolve is informative for understanding when policy interventions might work: Effectiveness of interventions varies importantly across the age of children Understanding sources of divergence useful for identifying domains in which intervention necessary we don’t just want a league table. Important differences between educational systems may have important information for policy But most economics of education in developing countries is focused on specific interventions within a given institutional setting little work on ‘business-as-usual’ productivity of time spent in school
  • 14. What I do Use child level panel data for Ethiopia, India, Peru and Vietnam to: Compare distributions of achievement of children at 5 and 8 years across four Young Lives countries Examine how the gap evolves over the age group of the children Is there growth between 5-8 years? Do rankings change across ages? Estimate value-added (VA) models examining sources of the gap Causally identify differential productivity of schooling with VA and IV estimates using enrollment discontinuities
  • 15. What I do Use child level panel data for Ethiopia, India, Peru and Vietnam to: Compare distributions of achievement of children at 5 and 8 years across four Young Lives countries Examine how the gap evolves over the age group of the children Is there growth between 5-8 years? Do rankings change across ages? Estimate value-added (VA) models examining sources of the gap Causally identify differential productivity of schooling with VA and IV estimates using enrollment discontinuities
  • 16. What I do Use child level panel data for Ethiopia, India, Peru and Vietnam to: Compare distributions of achievement of children at 5 and 8 years across four Young Lives countries Examine how the gap evolves over the age group of the children Is there growth between 5-8 years? Do rankings change across ages? Estimate value-added (VA) models examining sources of the gap Causally identify differential productivity of schooling with VA and IV estimates using enrollment discontinuities
  • 17. What I do Use child level panel data for Ethiopia, India, Peru and Vietnam to: Compare distributions of achievement of children at 5 and 8 years across four Young Lives countries Examine how the gap evolves over the age group of the children Is there growth between 5-8 years? Do rankings change across ages? Estimate value-added (VA) models examining sources of the gap Causally identify differential productivity of schooling with VA and IV estimates using enrollment discontinuities
  • 18. What I do Use child level panel data for Ethiopia, India, Peru and Vietnam to: Compare distributions of achievement of children at 5 and 8 years across four Young Lives countries Examine how the gap evolves over the age group of the children Is there growth between 5-8 years? Do rankings change across ages? Estimate value-added (VA) models examining sources of the gap Causally identify differential productivity of schooling with VA and IV estimates using enrollment discontinuities
  • 19. Contribution First analysis of the emergence and evolution of gaps in cognitive achievement across countries, using internationally comparable child-level panel data, at a critical age for skill formation similar work on racial gaps in US, socio-economic gaps in the UK etc. but nothing across countries no studies of comparable age range in developing countries Causal identification of learning-gains-per-year in different countries using micro panel and RD-based identification
  • 20. Contribution First analysis of the emergence and evolution of gaps in cognitive achievement across countries, using internationally comparable child-level panel data, at a critical age for skill formation similar work on racial gaps in US, socio-economic gaps in the UK etc. but nothing across countries no studies of comparable age range in developing countries Causal identification of learning-gains-per-year in different countries using micro panel and RD-based identification
  • 21. Contribution First analysis of the emergence and evolution of gaps in cognitive achievement across countries, using internationally comparable child-level panel data, at a critical age for skill formation similar work on racial gaps in US, socio-economic gaps in the UK etc. but nothing across countries no studies of comparable age range in developing countries Causal identification of learning-gains-per-year in different countries using micro panel and RD-based identification
  • 22. What this relates to Literature Methodologically: Racial and gender test score gaps: e.g. Fryer and Levitt (2004, 2006, 2010, 2013), Todd and Wolpin (2007) Value-added models using HH panel data: Todd and Wolpin (2003, 2007), Fiorini and Keane (2014) Results: Growth decompositions with varying school quality: Hanushek and Kimko (2000), Hanushek and Woessmann (various), Schoellmann (2012), Kaarsen (2014) Cross-country productivity differences in diff sectors: Hall and Jones (1996), Bloom and Van Reenen (2007), Gollin et. al. (2014) Micro studies on learning in developing countries: e.g. Glewwe and Kremer (2006), Kremer et. al. (2013), McEwan (2013)
  • 23. What this relates to Literature Methodologically: Racial and gender test score gaps: e.g. Fryer and Levitt (2004, 2006, 2010, 2013), Todd and Wolpin (2007) Value-added models using HH panel data: Todd and Wolpin (2003, 2007), Fiorini and Keane (2014) Results: Growth decompositions with varying school quality: Hanushek and Kimko (2000), Hanushek and Woessmann (various), Schoellmann (2012), Kaarsen (2014) Cross-country productivity differences in diff sectors: Hall and Jones (1996), Bloom and Van Reenen (2007), Gollin et. al. (2014) Micro studies on learning in developing countries: e.g. Glewwe and Kremer (2006), Kremer et. al. (2013), McEwan (2013)
  • 24. What this relates to Literature Methodologically: Racial and gender test score gaps: e.g. Fryer and Levitt (2004, 2006, 2010, 2013), Todd and Wolpin (2007) Value-added models using HH panel data: Todd and Wolpin (2003, 2007), Fiorini and Keane (2014) Results: Growth decompositions with varying school quality: Hanushek and Kimko (2000), Hanushek and Woessmann (various), Schoellmann (2012), Kaarsen (2014) Cross-country productivity differences in diff sectors: Hall and Jones (1996), Bloom and Van Reenen (2007), Gollin et. al. (2014) Micro studies on learning in developing countries: e.g. Glewwe and Kremer (2006), Kremer et. al. (2013), McEwan (2013)
  • 25. Data Young Lives survey structure Round 1 Round 2 Round 3 051015 Ageinyears O ct2002 D ec 2006 N ov 2009 Time Younger cohort Older cohort Graph shows median age of children and time of interview across countries By age of children Timing of survey rounds
  • 26. Data Young Lives survey test data Use data from the 2006/7 and 2009 rounds on quantitative proficiency Cognitive Development Assessment Quant. sub-scale for 5 year old sample Mathematics tests for 8 year old children Identical tests administered across all four countries in each round can be linked within round across the four countries using Item Response Theory
  • 27. Data Young Lives survey test data Use data from the 2006/7 and 2009 rounds on quantitative proficiency Cognitive Development Assessment Quant. sub-scale for 5 year old sample Mathematics tests for 8 year old children Identical tests administered across all four countries in each round can be linked within round across the four countries using Item Response Theory
  • 28. Data Table : Descriptives on age and school progression Cohort Variable Statistics Ethiopia India Peru Vietnam YC 2006 (5-years) Enrolment Mean 0.04 0.45 0.01 0.01 YC 2009 (8-years) Enrolment Mean 0.77 0.99 0.98 0.98 OC 2006 (12-years) Enrolment Mean 0.95 0.89 0.99 0.97 OC 2009 (15-years) Enrolment Mean 0.89 0.77 0.92 0.77 Older cohort Age of entry Mean 7.19 5.04 5.88 6.07 SD 1.52 0.71 0.57 0.48 YC 2009 (8-years) Grade Mean 0.64 1.63 1.31 1.71 SD 0.77 1 0.58 0.57 Grade refers to highest grade completed
  • 29. Learning differences at 5 and 8 Table : Linked test scores at 5,and 8 years Age group Statistics Countries Ethiopia India Peru Vietnam 5 years Mean 454 498.3 520.4 524.7 N 1846 1904 1893 1935 8 years Mean 419.1 495.9 518.2 563.6 N 1885 1930 1943 1964 Scores are IRT test scores generated within an age sample,pooling data from all countries, and normalized to have a mean of 500 and an SD of 100 in the pooled sample. Scores are comparable across countries but not across age groups.
  • 30. Do rankings change across age groups? 0.2.4.6.81 200 400 600 800 1000 CDA Scores, 2006 5 years 0.2.4.6.81 200 400 600 800 1000 Math Scores, 2009 8 years Empirical CDFs Distribution of achievement Ethiopia India Peru Vietnam
  • 31. Rankings are unchanged but are the gaps growing? Between 5 and 8 years of age
  • 32. Rankings are unchanged but are the gaps growing? Between 5 and 8 years of age p10 p90400450500550600 Mathscores(2009) 300 400 500 600 700 CDA scores (2006) Ethiopia India Peru Vietnam
  • 33. Where are the gaps coming from? Knowing differences in levels and trends between countries informative but not enough. Even trend differences need not imply differential effectiveness of schools across countries endowments differ - e.g. parental education, home inputs, nutrition, other environmental differences but differential effectiveness, and malleable environmental sources of learning divergence, are where policy might make a difference
  • 34. Where are the gaps coming from? Knowing differences in levels and trends between countries informative but not enough. Even trend differences need not imply differential effectiveness of schools across countries endowments differ - e.g. parental education, home inputs, nutrition, other environmental differences but differential effectiveness, and malleable environmental sources of learning divergence, are where policy might make a difference
  • 35. Where are the gaps coming from? Knowing differences in levels and trends between countries informative but not enough. Even trend differences need not imply differential effectiveness of schools across countries endowments differ - e.g. parental education, home inputs, nutrition, other environmental differences but differential effectiveness, and malleable environmental sources of learning divergence, are where policy might make a difference
  • 36. Where are the gaps coming from? Knowing differences in levels and trends between countries informative but not enough. Even trend differences need not imply differential effectiveness of schools across countries endowments differ - e.g. parental education, home inputs, nutrition, other environmental differences but differential effectiveness, and malleable environmental sources of learning divergence, are where policy might make a difference
  • 37. Do child-specific endowments explain divergence? Value-added models with common coefficients: Specifications Yic,2009 = φc (1) +β1.Yic,2006 (2) +β2.Xi (3) +β3.TUica + ϵica (4) Xi (Background) - male, eldest child, wealth index, age, caregiver’s education, height-for-age in 2009 TUica (time use) - time use on different activities Yic,2006 (lagged achievement) - 2006 quantitative achievement measures
  • 38. A lot differs across samples At 8 years of age Ethiopia India Peru Vietnam Mean SD N Mean SD N Mean SD N Mean SD N Child and background characteristics (Xic) Male 0.53 0.5 1881 0.53 0.5 1903 0.5 0.5 1892 0.51 0.5 1916 First born 0.23 0.42 1881 0.39 0.49 1903 0.37 0.48 1892 0.46 0.5 1916 Caregiver’s Education 2.95 3.73 1874 3.7 4.44 1900 7.75 4.64 1892 6.88 3.83 1908 Age in months 97.48 4.05 1879 96.03 3.92 1903 95.35 3.63 1890 97.09 3.75 1915 Height-for-age z-score -1.21 1.05 1877 -1.44 1.03 1898 -1.14 1.03 1890 -1.07 1.05 1900 Wealth index (2006) 0.28 0.18 1881 0.46 0.2 1902 0.47 0.23 1892 0.51 0.2 1914 Time use (hours spent on a typical day; TUic,a) — Doing domestic tasks 1.66 1.37 1881 0.33 0.58 1903 0.87 0.7 1887 0.54 0.66 1899 — Tasks on family farm/business etc. 1.5 2.22 1880 0.01 0.1 1903 0.25 0.66 1886 0.09 0.48 1897 — Paid work outside household 0.01 0.28 1880 0.01 0.2 1903 0 0.08 1887 0 0.07 1897 — At school 4.91 2.54 1881 7.72 0.95 1903 6.02 0.9 1887 5.04 1.31 1898 — Studying outside school time 0.99 0.89 1881 1.86 1.09 1903 1.87 0.83 1886 2.82 1.49 1897 — General leisure etc. 4.44 2.39 1881 4.71 1.54 1903 4.13 1.65 1887 5.55 1.65 1898 — Caring for others 0.83 1.21 1881 0.21 0.5 1903 0.48 0.88 1886 0.24 0.66 1878
  • 39. Do child-specific endowments explain divergence? Value-added models with common coefficients: Results (1) (2) (3) (4) Dep var: Mathematics score (2009) VARIABLES 8-years old Country dummies India 76.3*** 64.5*** 61.6*** 16.3*** (3.01) (2.92) (2.97) (3.58) Peru 96.7*** 79.1*** 65.2*** 48.2*** (2.75) (2.71) (2.69) (2.85) Vietnam 146*** 127*** 108*** 92.2*** (3.04) (3.06) (2.97) (3.45) Lagged test scores Y Y Y Background vars (Xic) Y Y Time use (TUic,a) Y Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1.
  • 40. Does differential productivity of home inputs explain divergence? Country-specific production function estimates Previous specification had a very strong implicit assumption: the effect of inputs on achievement is the same across countries So I run the same specifications separately for each country sample allows for each input parameter to be different across countries but makes interpretation difficult since four sets of input coefficients Key result: Between 5-8 years, divergence with Vietnam not explained by levels of inputs
  • 41. Does differential productivity of home inputs explain divergence? Country-specific production function estimates Previous specification had a very strong implicit assumption: the effect of inputs on achievement is the same across countries So I run the same specifications separately for each country sample allows for each input parameter to be different across countries but makes interpretation difficult since four sets of input coefficients Key result: Between 5-8 years, divergence with Vietnam not explained by levels of inputs
  • 42. Does differential productivity of home inputs explain divergence? Country-specific production function estimates Previous specification had a very strong implicit assumption: the effect of inputs on achievement is the same across countries So I run the same specifications separately for each country sample allows for each input parameter to be different across countries but makes interpretation difficult since four sets of input coefficients Key result: Between 5-8 years, divergence with Vietnam not explained by levels of inputs
  • 43. Does differential productivity of home inputs explain divergence? Country-specific production function estimates Previous specification had a very strong implicit assumption: the effect of inputs on achievement is the same across countries So I run the same specifications separately for each country sample allows for each input parameter to be different across countries but makes interpretation difficult since four sets of input coefficients Key result: Between 5-8 years, divergence with Vietnam not explained by levels of inputs
  • 44. Does differential productivity of home inputs explain divergence? Country-specific production function estimates Previous specification had a very strong implicit assumption: the effect of inputs on achievement is the same across countries So I run the same specifications separately for each country sample allows for each input parameter to be different across countries but makes interpretation difficult since four sets of input coefficients Key result: Between 5-8 years, divergence with Vietnam not explained by levels of inputs
  • 45. Predicted mean scores under counterfactual scenarios 8-year olds Coefficients (βc) Without time use With time use Ethiopia India Peru Vietnam Ethiopia India Peru Vietnam Ethiopia 420.79 485.28 495.47 523.15 420.75 390.94 486.66 488.38 (9.87) (10.64) (5.49) (13.48) (10.85) (16.72) (9.62) (19.19) Inputs India 450.36 497.32 503.74 539.9 487.38 497.32 516.86 563.24 (Xic; TUica) (11.54) (9.59) (4.97) (11.02) (10.39) (9.87) (7.99) (14.79) Yic,a−1 Peru 470.66 514.64 517.73 559.32 479.48 468.87 517.74 557.66 (11.35) (10.7) (4.65) (10.53) (10.93) (10.96) (5.65) (11.68) Vietnam 478.69 518.05 522.35 567.03 492.1 476.78 520.84 568.22 (11.08) (9.76) (4.51) (9.16) (12.06) (13.14) (7.09) (11.43) Cells contain linear predictions of test scores using combinations of country-specific production function parameters (βc ) with country-specific input levels (Xic and TUic). Standard errors of predictions in parentheses.
  • 46. Estimating the quality of schooling Specifications above include no schooling measures But we know exposure of schooling differs, esp. in Ethiopia Suspect that quality of schooling differs too What I do: include highest grade completed in the specifications and re-estimate Identification reliant on relevant unobserved heterogeneity being absorbed by controls and lag Will show RD-type IV estimates These are the most ‘complete’ VA specifications in the paper
  • 47. Estimating the quality of schooling Specifications above include no schooling measures But we know exposure of schooling differs, esp. in Ethiopia Suspect that quality of schooling differs too What I do: include highest grade completed in the specifications and re-estimate Identification reliant on relevant unobserved heterogeneity being absorbed by controls and lag Will show RD-type IV estimates These are the most ‘complete’ VA specifications in the paper
  • 48. Estimating the quality of schooling Specifications above include no schooling measures But we know exposure of schooling differs, esp. in Ethiopia Suspect that quality of schooling differs too What I do: include highest grade completed in the specifications and re-estimate Identification reliant on relevant unobserved heterogeneity being absorbed by controls and lag Will show RD-type IV estimates These are the most ‘complete’ VA specifications in the paper
  • 49. VAMs with grade effectiveness 8-year olds (1) (2) (3) (4) (5) (6) (7) (8) VARIABLES Dep var: Mathematics score (2009) Without time use With time use Ethiopia India Peru Vietnam Ethiopia India Peru Vietnam Highest grade completed 40.9*** 27.4*** 33.6*** 60.9*** 28.4*** 25.4*** 32.6*** 55.2*** (4.67) (2.03) (3.60) (14.6) (4.48) (1.62) (3.55) (10.9) Male 3.26 12.7*** 8.73*** 1.65 4.44 11.6*** 8.92*** 1.62 (5.61) (3.05) (2.22) (2.39) (4.82) (3.13) (2.47) (2.66) Caregiver’s education level 3.76*** 2.40*** 2.23*** 3.16*** 2.74*** 1.86*** 2.10*** 2.18*** (0.66) (0.70) (0.49) (0.80) (0.52) (0.49) (0.48) (0.72) Age in months 1.26** 0.51 -0.067 0.18 1.30** 0.60 0.0079 0.69 (0.53) (0.45) (0.30) (1.10) (0.56) (0.41) (0.30) (0.87) Height-for-age (2009) 9.31*** 5.38** 5.22** 7.14*** 5.30** 4.79** 4.82** 4.81*** (2.64) (2.21) (1.92) (1.78) (2.33) (1.85) (1.73) (1.56) Wealth index (2006) 151*** 53.6** 17.6* 78.3*** 105*** 31.0* 18.1* 59.0*** (25.9) (23.8) (8.80) (20.9) (18.8) (17.8) (8.91) (19.0) Lagged CDA scores (2006) 0.067*** 0.13*** 0.100*** 0.065* 0.045* 0.12*** 0.100*** 0.049 (0.023) (0.027) (0.021) (0.032) (0.022) (0.027) (0.020) (0.030) Constant 196*** 306*** 401*** 354*** 129* 97.6* 313*** 333*** (49.2) (45.5) (29.5) (74.1) (72.0) (53.8) (38.8) (65.5) Observations 1,835 1,892 1,888 1,907 1,834 1,892 1,881 1,858 R-squared 0.340 0.276 0.343 0.437 0.410 0.365 0.370 0.458 Robust standard errors in parentheses. Standard errors are clustered at site level. *** p<0.01, ** p<0.05, * p<0.1
  • 50. Can you trust VA estimates Comparing with IV results What if you don’t believe that grades completed are conditionally exogenous? Identification reliant on relevant unobserved heterogeneity being absorbed by controls and lag Way out: try looking for an IV which affects the highest grade completed at a particular age but does not directly determine learning, conditional on controls Solution: Plausibly exogenous variation coming from enrolment thresholds Creates discontinuity in the number of grades completed at particular calendar months Conditional on age and previous learning, should be excludable
  • 51. Can you trust VA estimates Comparing with IV results What if you don’t believe that grades completed are conditionally exogenous? Identification reliant on relevant unobserved heterogeneity being absorbed by controls and lag Way out: try looking for an IV which affects the highest grade completed at a particular age but does not directly determine learning, conditional on controls Solution: Plausibly exogenous variation coming from enrolment thresholds Creates discontinuity in the number of grades completed at particular calendar months Conditional on age and previous learning, should be excludable
  • 52. Can you trust VA estimates Comparing with IV results What if you don’t believe that grades completed are conditionally exogenous? Identification reliant on relevant unobserved heterogeneity being absorbed by controls and lag Way out: try looking for an IV which affects the highest grade completed at a particular age but does not directly determine learning, conditional on controls Solution: Plausibly exogenous variation coming from enrolment thresholds Creates discontinuity in the number of grades completed at particular calendar months Conditional on age and previous learning, should be excludable
  • 53. Can you trust VA estimates Comparing with IV results What if you don’t believe that grades completed are conditionally exogenous? Identification reliant on relevant unobserved heterogeneity being absorbed by controls and lag Way out: try looking for an IV which affects the highest grade completed at a particular age but does not directly determine learning, conditional on controls Solution: Plausibly exogenous variation coming from enrolment thresholds Creates discontinuity in the number of grades completed at particular calendar months Conditional on age and previous learning, should be excludable
  • 54. Enrolment threshold based discontinuities in grade completion 0.511.520.511.52 Jan 01Feb 01M ar01Apr01M ay 01Jun 01Jul01Aug 01Sept01O ct01N ov 01D ec 01Jan 02Feb 02M ar02Apr02M ay 02Jun 02Jul02Aug 02Sep 02O ct02O ct02 Jan 01Feb 01M ar01Apr01M ay 01Jun 01Jul01Aug 01Sept01O ct01N ov 01D ec 01Jan 02Feb 02M ar02Apr02M ay 02Jun 02Jul02Aug 02Sep 02O ct02O ct02 Jan 01Feb 01M ar01Apr01M ay 01Jun 01Jul01Aug 01Sept01O ct01N ov 01D ec 01Jan 02Feb 02M ar02Apr02M ay 02Jun 02Jul02Aug 02Sep 02O ct02O ct02 Jan 01Feb 01M ar01Apr01M ay 01Jun 01Jul01Aug 01Sept01O ct01N ov 01D ec 01Jan 02Feb 02M ar02Apr02M ay 02Jun 02Jul02Aug 02Sep 02O ct02O ct02 Ethiopia India Peru Vietnam Averagegradeattained By month of birth Mean grade completed by 2009
  • 55. IV specifications First stage: gradesi,2009 = µ + γ1.Thresholdi + γ2.Xi + γ3.sitei + ϵica (5) Second stage: Yic,a = αc + β1.Yic,a−1 + β2.Xic + β3.gradeica + γ.sitei + ϵica +β4.TUic,a Same as old VAM but for inclusion of site fixed effects OK here because not comparing constant terms
  • 56. IV specifications First stage: gradesi,2009 = µ + γ1.Thresholdi + γ2.Xi + γ3.sitei + ϵica (5) Second stage: Yic,a = αc + β1.Yic,a−1 + β2.Xic + β3.gradeica + γ.sitei + ϵica +β4.TUic,a Same as old VAM but for inclusion of site fixed effects OK here because not comparing constant terms
  • 57. Discontinuity based results on grade effectiveness Peru and Vietnam (1) (2) (3) (4) VARIABLES Dep var: Math scores (2009) Peru Vietnam Highest grade completed 20.1*** 20.9*** 47.3*** 46.3*** (7.61) (7.96) (7.49) (7.16) Male 9.43*** 9.96*** 1.34 1.56 (2.39) (2.63) (2.36) (2.46) Caregiver’s education level 2.31*** 2.14*** 3.05*** 2.41*** (0.40) (0.37) (0.61) (0.55) Age in months 0.94 0.87 0.41 0.64 (0.66) (0.71) (0.57) (0.53) Height-for-age (2009) 6.15*** 5.59*** 6.00*** 4.18*** (2.20) (2.00) (1.96) (1.44) Wealth index (2006) 29.7*** 29.0*** 40.2** 28.6** (7.67) (7.84) (16.2) (13.4) Lagged CDA scores (2006) 0.13*** 0.12*** 0.11*** 0.088*** (0.020) (0.020) (0.031) (0.027) Constant 290*** 227*** 375*** 316*** (58.2) (69.2) (55.5) (60.2) Observations 1,888 1,881 1,907 1,858 R-squared 0.366 0.393 0.481 0.504 Kleibergen-Paap F-statistic 108 110 113 152 Robust standard errors in parentheses. Standard errors are clustered at site level. *** p<0.01, ** p<0.05, * p<0.1 Test scores are IRT scores normalized to have a mean of 500 and SD of 100 in the pooled four-country sample at each age. Estimation includes a vector of site fixed effects and other covariates, coefficients for which are not reported.
  • 58. Robustness checks Flexible lags: Possibility (even suggestion) of non-linearity in the effect of lag on current achievement estimate everything with third-order polynomial of lag / bins of achievement Measurement error in the lag instrument lag with vocabulary test in the 8-year old cohort assumes independent measurement error across tests Overall: Persistence parameter might be off but basic story stays.
  • 59. Robustness checks Flexible lags: Possibility (even suggestion) of non-linearity in the effect of lag on current achievement estimate everything with third-order polynomial of lag / bins of achievement Measurement error in the lag instrument lag with vocabulary test in the 8-year old cohort assumes independent measurement error across tests Overall: Persistence parameter might be off but basic story stays.
  • 60. Robustness checks Flexible lags: Possibility (even suggestion) of non-linearity in the effect of lag on current achievement estimate everything with third-order polynomial of lag / bins of achievement Measurement error in the lag instrument lag with vocabulary test in the 8-year old cohort assumes independent measurement error across tests Overall: Persistence parameter might be off but basic story stays.
  • 61. Robustness checks Flexible lags: Possibility (even suggestion) of non-linearity in the effect of lag on current achievement estimate everything with third-order polynomial of lag / bins of achievement Measurement error in the lag instrument lag with vocabulary test in the 8-year old cohort assumes independent measurement error across tests Overall: Persistence parameter might be off but basic story stays.
  • 62. Robustness checks Flexible lags: Possibility (even suggestion) of non-linearity in the effect of lag on current achievement estimate everything with third-order polynomial of lag / bins of achievement Measurement error in the lag instrument lag with vocabulary test in the 8-year old cohort assumes independent measurement error across tests Overall: Persistence parameter might be off but basic story stays.
  • 63. Robustness checks Flexible lags: Possibility (even suggestion) of non-linearity in the effect of lag on current achievement estimate everything with third-order polynomial of lag / bins of achievement Measurement error in the lag instrument lag with vocabulary test in the 8-year old cohort assumes independent measurement error across tests Overall: Persistence parameter might be off but basic story stays.
  • 64. Pulling it all together Levels of learning are low except for Vietnam Differences start early by 5 and grow further later Between 5-8, divergence with Vietnam reflects differential effectiveness of a school year School productivity differences are huge! These productivity differences are causally identified
  • 65. Pulling it all together Levels of learning are low except for Vietnam Differences start early by 5 and grow further later Between 5-8, divergence with Vietnam reflects differential effectiveness of a school year School productivity differences are huge! These productivity differences are causally identified
  • 66. Pulling it all together Levels of learning are low except for Vietnam Differences start early by 5 and grow further later Between 5-8, divergence with Vietnam reflects differential effectiveness of a school year School productivity differences are huge! These productivity differences are causally identified
  • 67. Pulling it all together Levels of learning are low except for Vietnam Differences start early by 5 and grow further later Between 5-8, divergence with Vietnam reflects differential effectiveness of a school year School productivity differences are huge! These productivity differences are causally identified
  • 68. Pulling it all together Levels of learning are low except for Vietnam Differences start early by 5 and grow further later Between 5-8, divergence with Vietnam reflects differential effectiveness of a school year School productivity differences are huge! These productivity differences are causally identified
  • 69. What these results imply Early divergence provides suggestive support for preschool interventions Evidence (except on nutrition) usually based on OECD or LAC But major divergence after 5 is due to differences in school productivity at primary school level It isn’t all over by 5. School productivity is a variable policy can affect! Differences in school productivity across countries raise an important question: why is productivity so much higher in some countries? This is not the focus of most of the work in education in dev econ (but still important)
  • 70. What these results imply Early divergence provides suggestive support for preschool interventions Evidence (except on nutrition) usually based on OECD or LAC But major divergence after 5 is due to differences in school productivity at primary school level It isn’t all over by 5. School productivity is a variable policy can affect! Differences in school productivity across countries raise an important question: why is productivity so much higher in some countries? This is not the focus of most of the work in education in dev econ (but still important)
  • 71. What these results imply Early divergence provides suggestive support for preschool interventions Evidence (except on nutrition) usually based on OECD or LAC But major divergence after 5 is due to differences in school productivity at primary school level It isn’t all over by 5. School productivity is a variable policy can affect! Differences in school productivity across countries raise an important question: why is productivity so much higher in some countries? This is not the focus of most of the work in education in dev econ (but still important)
  • 72. What these results imply Early divergence provides suggestive support for preschool interventions Evidence (except on nutrition) usually based on OECD or LAC But major divergence after 5 is due to differences in school productivity at primary school level It isn’t all over by 5. School productivity is a variable policy can affect! Differences in school productivity across countries raise an important question: why is productivity so much higher in some countries? This is not the focus of most of the work in education in dev econ (but still important)
  • 73. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 74. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 75. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 76. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 77. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 78. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 79. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 80. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 81. How should we comparably measure school productivity A pitch for linked micro panels Linked micro panel data has important advantages for this question: PISA, TIMSS go to very selective non-OECD countries and the divergence has already happened by 15 years of age and selection on enrollment and attendance Returns to schooling for migrants in the US differential selection of migrants makes individual country estimates unreliable Schoellman (2012) rankings put India way ahead of Vietnam, Ethiopia and Peru the exact opposite of what I find! (or levels in PISA!) A lot could be done if educational interventions linked tests...
  • 83. Appendix: Item Response Theory How I link scores Decades long history in education and psychometrics – GRE, GMAT, SAT, NAEP, TIMSS The basic idea:The focus of IRT is at the item level. Models the probability that an individual with given ability will get an item right The overall ability estimate (test score) generated by analyzing an individual’s response to different items each defined by their own characteristics Many advantages (see e.g. Das and Zajonc, 2010): Most importantly (for me) the ability to link But also much better diagnostics for cross-cultural comparisons Less arbitrary than summing up correct responses Caveat: Linking requires common items across samples can’t directly compare across age groups
  • 84. Appendix: Item Response Theory How I link scores Decades long history in education and psychometrics – GRE, GMAT, SAT, NAEP, TIMSS The basic idea:The focus of IRT is at the item level. Models the probability that an individual with given ability will get an item right The overall ability estimate (test score) generated by analyzing an individual’s response to different items each defined by their own characteristics Many advantages (see e.g. Das and Zajonc, 2010): Most importantly (for me) the ability to link But also much better diagnostics for cross-cultural comparisons Less arbitrary than summing up correct responses Caveat: Linking requires common items across samples can’t directly compare across age groups
  • 85. Appendix: Item Response Theory How I link scores Decades long history in education and psychometrics – GRE, GMAT, SAT, NAEP, TIMSS The basic idea:The focus of IRT is at the item level. Models the probability that an individual with given ability will get an item right The overall ability estimate (test score) generated by analyzing an individual’s response to different items each defined by their own characteristics Many advantages (see e.g. Das and Zajonc, 2010): Most importantly (for me) the ability to link But also much better diagnostics for cross-cultural comparisons Less arbitrary than summing up correct responses Caveat: Linking requires common items across samples can’t directly compare across age groups
  • 86. Appendix: Item Response Theory How I link scores Decades long history in education and psychometrics – GRE, GMAT, SAT, NAEP, TIMSS The basic idea:The focus of IRT is at the item level. Models the probability that an individual with given ability will get an item right The overall ability estimate (test score) generated by analyzing an individual’s response to different items each defined by their own characteristics Many advantages (see e.g. Das and Zajonc, 2010): Most importantly (for me) the ability to link But also much better diagnostics for cross-cultural comparisons Less arbitrary than summing up correct responses Caveat: Linking requires common items across samples can’t directly compare across age groups
  • 87. Appendix: Item Response Theory Item Characteristic Curve
  • 88. Appendix: Item Response Theory 3 Parameter Logistic (3PL) Model Item Response Function: Pg (θi ) = cg + 1 − cg 1 + exp(−1.7.ag .(θi − bg)) (6) cg is the pseudo-guessing parameter - with multiple choice questions, even the lowest ability can get some answers right. Set to zero for non-MCQ to get 2PL model bg is the difficulty parameter - the level at which the probability of getting item right is 0.5 in 2 PL ag is the discrimination parameter - slope of the ICC at b – how quickly the likelihood of success changes with respect to ability.
  • 89. Appendix: Item Response Theory 3 Parameter Logistic (3PL) Model Item Response Function: Pg (θi ) = cg + 1 − cg 1 + exp(−1.7.ag .(θi − bg)) (6) cg is the pseudo-guessing parameter - with multiple choice questions, even the lowest ability can get some answers right. Set to zero for non-MCQ to get 2PL model bg is the difficulty parameter - the level at which the probability of getting item right is 0.5 in 2 PL ag is the discrimination parameter - slope of the ICC at b – how quickly the likelihood of success changes with respect to ability.
  • 90. Appendix: Item Response Theory 3 Parameter Logistic (3PL) Model Item Response Function: Pg (θi ) = cg + 1 − cg 1 + exp(−1.7.ag .(θi − bg)) (6) cg is the pseudo-guessing parameter - with multiple choice questions, even the lowest ability can get some answers right. Set to zero for non-MCQ to get 2PL model bg is the difficulty parameter - the level at which the probability of getting item right is 0.5 in 2 PL ag is the discrimination parameter - slope of the ICC at b – how quickly the likelihood of success changes with respect to ability.
  • 91. Appendix: Item Response Theory Core Assumptions 1. Unidimensionality - A single latent individual-specific trait determines performance on the test 2. No Differential Item Functioning: Implicit in ICC, item characteristics are person-invariant 2.1 particularly important in cross-cultural settings 3. (Conditional) local independence: 3.1 Item responses are independent across individuals (no cheating!) 3.2 Conditional on ability, item responses are locally independent across questions for the same individual Under these assumptions, can recover estimates of ability and item characteristics given matrix of responses by individuals
  • 92. Appendix: Item Response Theory Core Assumptions 1. Unidimensionality - A single latent individual-specific trait determines performance on the test 2. No Differential Item Functioning: Implicit in ICC, item characteristics are person-invariant 2.1 particularly important in cross-cultural settings 3. (Conditional) local independence: 3.1 Item responses are independent across individuals (no cheating!) 3.2 Conditional on ability, item responses are locally independent across questions for the same individual Under these assumptions, can recover estimates of ability and item characteristics given matrix of responses by individuals
  • 93. Appendix: Item Response Theory Core Assumptions 1. Unidimensionality - A single latent individual-specific trait determines performance on the test 2. No Differential Item Functioning: Implicit in ICC, item characteristics are person-invariant 2.1 particularly important in cross-cultural settings 3. (Conditional) local independence: 3.1 Item responses are independent across individuals (no cheating!) 3.2 Conditional on ability, item responses are locally independent across questions for the same individual Under these assumptions, can recover estimates of ability and item characteristics given matrix of responses by individuals
  • 94. Appendix: Item Response Theory Core Assumptions 1. Unidimensionality - A single latent individual-specific trait determines performance on the test 2. No Differential Item Functioning: Implicit in ICC, item characteristics are person-invariant 2.1 particularly important in cross-cultural settings 3. (Conditional) local independence: 3.1 Item responses are independent across individuals (no cheating!) 3.2 Conditional on ability, item responses are locally independent across questions for the same individual Under these assumptions, can recover estimates of ability and item characteristics given matrix of responses by individuals
  • 95. Appendix: Item Response Theory Core Assumptions 1. Unidimensionality - A single latent individual-specific trait determines performance on the test 2. No Differential Item Functioning: Implicit in ICC, item characteristics are person-invariant 2.1 particularly important in cross-cultural settings 3. (Conditional) local independence: 3.1 Item responses are independent across individuals (no cheating!) 3.2 Conditional on ability, item responses are locally independent across questions for the same individual Under these assumptions, can recover estimates of ability and item characteristics given matrix of responses by individuals
  • 96. Appendix: Item Response Theory Core Assumptions 1. Unidimensionality - A single latent individual-specific trait determines performance on the test 2. No Differential Item Functioning: Implicit in ICC, item characteristics are person-invariant 2.1 particularly important in cross-cultural settings 3. (Conditional) local independence: 3.1 Item responses are independent across individuals (no cheating!) 3.2 Conditional on ability, item responses are locally independent across questions for the same individual Under these assumptions, can recover estimates of ability and item characteristics given matrix of responses by individuals
  • 97. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 98. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 99. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 100. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 101. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 102. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 103. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 104. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 105. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 106. Appendix: Item Response Theory How does linking work? IRT only identifies the latent ability up to a linear transformation need to fix the scale somewhere e.g. fix min and max. GRE used to be from 200 to 800 (130-170 now) or fix mean and SD. PISA and TIMSS have mean of 500 and SD of 100 Item characteristics are fixed and can be used to link across samples common items serve as ‘anchors’ which bring two assessments on a common scale only a subset of items need to be common Without sufficient common items: Still can do IRT but scores not on comparable scales important because then you can’t use panel methods such as differencing or fixed effects without very strong assumptions about the two distributions
  • 107. Appendix: Differential Item Functioning When it’s not a problem C:/Users/pemb2850/Dropbox/Learning levels -
  • 108. Appendix: Differential Item Functioning When it IS a problem C:/Users/pemb2850/Dropbox/Learning levels -