SlideShare a Scribd company logo
1 of 34
Download to read offline
Instrumental variable
regression
Devesh Roy
ICAR-IFPRI training
September 21, 2015
Economics 20 - Prof. Anderson 2
Instrumental Variables & 2SLS
y = b0 + b1x1 + b2x2 + . . . bkxk + u
x1 = p0 + p1z + p2x2 + . . . pkxk + v
Economics 20 - Prof. Anderson 3
Why Use Instrumental Variables?
• Instrumental Variables (IV) estimation is used when your model has
endogenous x’s
• That is, whenever Cov(x,u) ≠ 0
• Thus, IV can be used to address the problem of omitted variable bias
• Additionally, IV can be used to solve the classic errors-in-variables
problem
Economics 20 - Prof. Anderson 4
What Is an Instrumental Variable?
• In order for a variable, z, to serve as a valid instrument for x, the
following must be true
• The instrument must be exogenous
• That is, Cov(z,u) = 0
• The instrument must be correlated with the endogenous variable x
• That is, Cov(z,x) ≠ 0
Economics 20 - Prof. Anderson 5
More on Valid Instruments
• We have to use common sense and economic theory to decide if it
makes sense to assume Cov(z,u) = 0
• We can test if Cov(z,x) ≠ 0
• Just testing H0: p1 = 0 in x = p0 + p1z + v
• Sometimes refer to this regression as the first-stage regression
Economics 20 - Prof. Anderson 6
IV Estimation in the Simple Regression Case
• For y = b0 + b1x + u, and given our assumptions
• Cov(z,y) = b1Cov(z,x) + Cov(z,u), so
• b1 = Cov(z,y) / Cov(z,x)
• Then the IV estimator for b1 is
  
  




xxzz
yyzz
ii
ii
1
ˆb
Economics 20 - Prof. Anderson 7
Inference with IV Estimation
• The homoskedasticity assumption in this case is E(u2|z) = s2 = Var(u)
• As in the OLS case, given the asymptotic variance, we can estimate
the standard error
 
  2
,
2
1
2
,
2
2
1
ˆˆ
ˆ
zxx
zxx
RSST
se
n
Var
s
b
s
s
b


Economics 20 - Prof. Anderson 8
IV versus OLS estimation
• Standard error in IV case differs from OLS only in the R2 from
regressing x on z
• Since R2 < 1, IV standard errors are larger
• However, IV is consistent, while OLS is inconsistent, when Cov(x,u) ≠
0
• The stronger the correlation between z and x, the smaller the IV
standard errors
Economics 20 - Prof. Anderson 9
The Effect of Poor Instruments
• What if our assumption that Cov(z,u) = 0 is false?
• The IV estimator will be inconsistent, too
• Can compare asymptotic bias in OLS and IV
• Prefer IV if Corr(z,u)/Corr(z,x) < Corr(x,u)
x
u
x
u
uxCorr
xzCorr
uzCorr
s
s
bb
s
s
bb


),(
~
plim:OLS
),(
),(ˆplim:IV
11
11
Economics 20 - Prof. Anderson 10
IV Estimation in the Multiple Regression Case
• IV estimation can be extended to the multiple regression case
• Call the model we are interested in estimating the structural model
• Our problem is that one or more of the variables are endogenous
• We need an instrument for each endogenous variable
Economics 20 - Prof. Anderson 11
Multiple Regression IV (cont)
• Write the structural model as y1 = b0 + b1y2 + b2z1 + u1, where y2 is
endogenous and z1 is exogenous
• Let z2 be the instrument, so Cov(z2,u1) = 0 and
• y2 = p0 + p1z1 + p2z2 + v2, where p2 ≠ 0
• This reduced form equation regresses the endogenous variable on all
exogenous ones
Economics 20 - Prof. Anderson 12
Two Stage Least Squares (2SLS)
• It’s possible to have multiple instruments
• Consider our original structural model, and let y2 = p0 + p1z1 + p2z2 +
p3z3 + v2
• Here we’re assuming that both z2 and z3 are valid instruments – they
do not appear in the structural model and are uncorrelated with the
structural error term, u1
Economics 20 - Prof. Anderson 13
Best Instrument
• Could use either z2 or z3 as an instrument
• The best instrument is a linear combination of all of the exogenous
variables, y2* = p0 + p1z1 + p2z2 + p3z3
• We can estimate y2* by regressing y2 on z1, z2 and z3 – can call this
the first stage
• If then substitute ŷ2 for y2 in the structural model, get same
coefficient as IV
Economics 20 - Prof. Anderson 14
More on 2SLS
• While the coefficients are the same, the standard errors from doing
2SLS by hand are incorrect, so let Stata do it for you
• Method extends to multiple endogenous variables – need to be sure
that we have at least as many excluded exogenous variables
(instruments) as there are endogenous variables in the structural
equation
Economics 20 - Prof. Anderson 15
Addressing Errors-in-Variables with IV
Estimation
• Remember the classical errors-in-variables problem where we
observe x1 instead of x1*
• Where x1 = x1* + e1, and e1 is uncorrelated with x1* and x2
• If there is a z, such that Corr(z,u) = 0 and Corr(z,x1) ≠ 0, then
• IV will remove the attenuation bias
Economics 20 - Prof. Anderson 16
Testing for Endogeneity
• Since OLS is preferred to IV if we do not have an endogeneity
problem, then we’d like to be able to test for endogeneity
• If we do not have endogeneity, both OLS and IV are consistent
• Idea of Hausman test is to see if the estimates from OLS and IV are
different
Economics 20 - Prof. Anderson 17
Testing for Endogeneity (cont)
• While it’s a good idea to see if IV and OLS have different implications,
it’s easier to use a regression test for endogeneity
• If y2 is endogenous, then v2 (from the reduced form equation) and u1
from the structural model will be correlated
• The test is based on this observation
Economics 20 - Prof. Anderson 18
Testing for Endogeneity (cont)
• Save the residuals from the first stage
• Include the residual in the structural equation (which of course has y2
in it)
• If the coefficient on the residual is statistically different from zero,
reject the null of exogeneity
• If multiple endogenous variables, jointly test the residuals from each
first stage
Economics 20 - Prof. Anderson 19
Testing Overidentifying Restrictions
• If there is just one instrument for our endogenous variable, we can’t
test whether the instrument is uncorrelated with the error
• We say the model is just identified
• If we have multiple instruments, it is possible to test the
overidentifying restrictions – to see if some of the instruments are
correlated with the error
Economics 20 - Prof. Anderson 20
The OverID Test
• Estimate the structural model using IV and obtain the residuals
• Regress the residuals on all the exogenous variables and obtain the
R2 to form nR2
• Under the null that all instruments are uncorrelated with the error,
LM ~ cq
2 where q is the number of extra instruments
Economics 20 - Prof. Anderson 21
Testing for Heteroskedasticity
• When using 2SLS, we need a slight adjustment to the Breusch-Pagan
test
• Get the residuals from the IV estimation
• Regress these residuals squared on all of the exogenous variables in
the model (including the instruments)
• Test for the joint significance
Economics 20 - Prof. Anderson 22
Testing for Serial Correlation
• When using 2SLS, we need a slight adjustment to the test for serial
correlation
• Get the residuals from the IV estimation
• Re-estimate the structural model by 2SLS, including the lagged
residuals, and using the same instruments as originally
• Can do 2SLS on a quasi-differenced model, using quasi-differenced
instruments
Economics 20 - Prof. Anderson 23
Simultaneous Equations
y1 = a1y2 + b1z1 + u1
y2 = a2y1 + b2z2 + u2
Economics 20 - Prof. Anderson 24
Simultaneity
• Simultaneity is a specific type of endogeneity problem in which the
explanatory variable is jointly determined with the dependent
variable
• As with other types of endogeneity, IV estimation can solve the
problem
• Some special issues to consider with simultaneous equations models
(SEM)
Economics 20 - Prof. Anderson 25
Supply and Demand Example
• Start with an equation you’d like to estimate, say a labor supply
function
• hs = a1w + b1z + u1, where
• w is the wage and z is a supply shifter
• Call this a structural equation – it’s derived from economic theory
and has a causal interpretation where w directly affects hs
Economics 20 - Prof. Anderson 26
Example (cont)
• Problem that can’t just regress observed hours on wage, since
observed hours are determined by the equilibrium of supply and
demand
• Consider a second structural equation, in this case the labor demand
function
• hd = a2w + u2
• So hours are determined by a SEM
Economics 20 - Prof. Anderson 27
Example (cont)
• Both h and w are endogenous because they are both determined by
the equilibrium of supply and demand
• z is exogenous, and it’s the availability of this exogenous supply
shifter that allows us to identify the structural demand equation
• With no observed demand shifters, supply is not identified and
cannot be estimated
Economics 20 - Prof. Anderson 28
Identification of Demand Equation
w
h
D
S (z=z1)
S (z=z2)
S (z=z3)
Economics 20 - Prof. Anderson 29
Using IV to Estimate Demand
• So, we can estimate the structural demand equation, using z as an
instrument for w
• First stage equation is w = p0 + p1z + v2
• Second stage equation is h = a2ŵ + u2
• Thus, 2SLS provides a consistent estimator of a2, the slope of the
demand curve
• We cannot estimate a1, the slope of the supply curve
Economics 20 - Prof. Anderson 30
The General SEM
• Suppose you want to estimate the structural equation: y1 = a1y2 +
b1z1 + u1
• where, y2 = a2y1 + b2z2 + u2
• Thus, y2 = a2(a1y2 + b1z1 + u1) + b2z2 + u2
• So, (1 – a2a1)y2 = a2 b1z1 + b2z2 + a2 u1 + u2, which can be rewritten
as
• y2 = p1z1 + p2z2 + v2
Economics 20 - Prof. Anderson 31
The General SEM (continued)
• By substituting this reduced form in for y2, we can
see that since v2 is a linear function of u1, y2 is
correlated with the error term and a1 is biased –
call it simultaneity bias
• The sign of the bias is complicated, but can use
the simple regression as a rule of thumb
• In the simple regression case, the bias is the same
sign as a2/(1 – a2a1)
Economics 20 - Prof. Anderson 32
Identification of General SEM
• Let z1 be all the exogenous variables in the first equation, and z2 be
all the exogenous variables in the second equation
• It’s okay for there to be overlap in z1 and z2
• To identify equation 1, there must be some variables in z2 that are
not in z1
• To identify equation 2, there must be some variables in z1 that are
not in z2
Economics 20 - Prof. Anderson 33
Rank and Order Conditions
• We refer to this as the rank condition
• Note that the exogenous variable excluded from the first equation
must have a non-zero coefficient in the second equation for the rank
condition to hold
• Note that the order condition clearly holds if the rank condition does
– there will be an exogenous variable for the endogenous one
Economics 20 - Prof. Anderson 34
Estimation of the General SEM
• Estimation of SEM is straightforward
• The instruments for 2SLS are the exogenous variables from both
equations
• Can extend the idea to systems with more than 2 equations
• For a given identified equation, the instruments are all of the
exogenous variables in the whole system

More Related Content

What's hot

Trigonometry Lesson: Introduction & Basics
Trigonometry Lesson: Introduction & BasicsTrigonometry Lesson: Introduction & Basics
Trigonometry Lesson: Introduction & BasicsChelseaDarling0
 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSSLNIPE
 
Real estate regression model King County
Real estate regression model   King CountyReal estate regression model   King County
Real estate regression model King CountyThuPhungMBA
 
What is a kendall's tau?
What is a kendall's tau?What is a kendall's tau?
What is a kendall's tau?Ken Plummer
 
Lecture 12 orhogonality - 6.1 6.2 6.3
Lecture 12   orhogonality - 6.1 6.2 6.3Lecture 12   orhogonality - 6.1 6.2 6.3
Lecture 12 orhogonality - 6.1 6.2 6.3njit-ronbrown
 
Fuzzy logic in approximate Reasoning
Fuzzy logic in approximate ReasoningFuzzy logic in approximate Reasoning
Fuzzy logic in approximate ReasoningHoàng Đức
 
Topic 15 correlation spss
Topic 15 correlation spssTopic 15 correlation spss
Topic 15 correlation spssSizwan Ahammed
 
2.7 Ordered pairs
2.7 Ordered pairs2.7 Ordered pairs
2.7 Ordered pairsJan Plaza
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regressionA M
 
2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regressionLong Beach City College
 
Inner product spaces
Inner product spacesInner product spaces
Inner product spacesEasyStudy3
 
Chapter III.pptx
Chapter III.pptxChapter III.pptx
Chapter III.pptxBeamlak5
 
Decision Making Using the Analytic Hierarchy Process (AHP); A Step by Step A...
Decision Making Using the Analytic Hierarchy Process (AHP);  A Step by Step A...Decision Making Using the Analytic Hierarchy Process (AHP);  A Step by Step A...
Decision Making Using the Analytic Hierarchy Process (AHP); A Step by Step A...Hamed Taherdoost
 
multiple linear regression in spss (procedure and output)
multiple linear regression in spss (procedure and output)multiple linear regression in spss (procedure and output)
multiple linear regression in spss (procedure and output)Unexplord Solutions LLP
 

What's hot (20)

Trigonometry Lesson: Introduction & Basics
Trigonometry Lesson: Introduction & BasicsTrigonometry Lesson: Introduction & Basics
Trigonometry Lesson: Introduction & Basics
 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSS
 
Real estate regression model King County
Real estate regression model   King CountyReal estate regression model   King County
Real estate regression model King County
 
Chapter 17 - Multivariable Calculus
Chapter 17 - Multivariable CalculusChapter 17 - Multivariable Calculus
Chapter 17 - Multivariable Calculus
 
What is a kendall's tau?
What is a kendall's tau?What is a kendall's tau?
What is a kendall's tau?
 
Lecture 12 orhogonality - 6.1 6.2 6.3
Lecture 12   orhogonality - 6.1 6.2 6.3Lecture 12   orhogonality - 6.1 6.2 6.3
Lecture 12 orhogonality - 6.1 6.2 6.3
 
Fuzzy logic in approximate Reasoning
Fuzzy logic in approximate ReasoningFuzzy logic in approximate Reasoning
Fuzzy logic in approximate Reasoning
 
Bays theorem of probability
Bays theorem of probabilityBays theorem of probability
Bays theorem of probability
 
Spss tutorial 1
Spss tutorial 1Spss tutorial 1
Spss tutorial 1
 
Instrumental Variables
Instrumental VariablesInstrumental Variables
Instrumental Variables
 
Topic 15 correlation spss
Topic 15 correlation spssTopic 15 correlation spss
Topic 15 correlation spss
 
2.7 Ordered pairs
2.7 Ordered pairs2.7 Ordered pairs
2.7 Ordered pairs
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regression
 
2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression
 
Inner product spaces
Inner product spacesInner product spaces
Inner product spaces
 
Truth table
Truth tableTruth table
Truth table
 
Chapter III.pptx
Chapter III.pptxChapter III.pptx
Chapter III.pptx
 
Decision Making Using the Analytic Hierarchy Process (AHP); A Step by Step A...
Decision Making Using the Analytic Hierarchy Process (AHP);  A Step by Step A...Decision Making Using the Analytic Hierarchy Process (AHP);  A Step by Step A...
Decision Making Using the Analytic Hierarchy Process (AHP); A Step by Step A...
 
Regression
RegressionRegression
Regression
 
multiple linear regression in spss (procedure and output)
multiple linear regression in spss (procedure and output)multiple linear regression in spss (procedure and output)
multiple linear regression in spss (procedure and output)
 

Similar to ICAR-IFPRI- Instrumental Variable Regression- Devesh Roy, IFPRI

notes.pdf
notes.pdfnotes.pdf
notes.pdfDevjan2
 
Lecture0701.pptx by statistics by Mohammed anwar
Lecture0701.pptx by statistics by Mohammed anwarLecture0701.pptx by statistics by Mohammed anwar
Lecture0701.pptx by statistics by Mohammed anwarmahirasadman
 
Supporting Vector Machine
Supporting Vector MachineSupporting Vector Machine
Supporting Vector MachineSumit Singh
 
Chapter8
Chapter8Chapter8
Chapter8Vu Vo
 
Linear Non-Gaussian Structural Equation Models
Linear Non-Gaussian Structural Equation ModelsLinear Non-Gaussian Structural Equation Models
Linear Non-Gaussian Structural Equation ModelsShiga University, RIKEN
 
An Algorithm to Find the Visible Region of a Polygon
An Algorithm to Find the Visible Region of a PolygonAn Algorithm to Find the Visible Region of a Polygon
An Algorithm to Find the Visible Region of a PolygonKasun Ranga Wijeweera
 
2. Linear Algebra for Machine Learning: Basis and Dimension
2. Linear Algebra for Machine Learning: Basis and Dimension2. Linear Algebra for Machine Learning: Basis and Dimension
2. Linear Algebra for Machine Learning: Basis and DimensionCeni Babaoglu, PhD
 
3. Weighted residual methods (1).pptx
3. Weighted residual methods (1).pptx3. Weighted residual methods (1).pptx
3. Weighted residual methods (1).pptxDeepu Sivakumar
 
Chapter4
Chapter4Chapter4
Chapter4Vu Vo
 
Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...
Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...
Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...Kasun Ranga Wijeweera
 
LECT_01.pptx
LECT_01.pptxLECT_01.pptx
LECT_01.pptxMistAe1
 
Finite mathematics
Finite mathematicsFinite mathematics
Finite mathematicsIgor Rivin
 

Similar to ICAR-IFPRI- Instrumental Variable Regression- Devesh Roy, IFPRI (20)

notes.pdf
notes.pdfnotes.pdf
notes.pdf
 
Differential equations
Differential equationsDifferential equations
Differential equations
 
Lecture0701.pptx by statistics by Mohammed anwar
Lecture0701.pptx by statistics by Mohammed anwarLecture0701.pptx by statistics by Mohammed anwar
Lecture0701.pptx by statistics by Mohammed anwar
 
Supporting Vector Machine
Supporting Vector MachineSupporting Vector Machine
Supporting Vector Machine
 
Econometrics- lecture 10 and 11
Econometrics- lecture 10 and 11Econometrics- lecture 10 and 11
Econometrics- lecture 10 and 11
 
Flor incomp26 anim
Flor incomp26 animFlor incomp26 anim
Flor incomp26 anim
 
Chapter8
Chapter8Chapter8
Chapter8
 
Linear Non-Gaussian Structural Equation Models
Linear Non-Gaussian Structural Equation ModelsLinear Non-Gaussian Structural Equation Models
Linear Non-Gaussian Structural Equation Models
 
ch7_slidesB.pdf
ch7_slidesB.pdfch7_slidesB.pdf
ch7_slidesB.pdf
 
An Algorithm to Find the Visible Region of a Polygon
An Algorithm to Find the Visible Region of a PolygonAn Algorithm to Find the Visible Region of a Polygon
An Algorithm to Find the Visible Region of a Polygon
 
ICAR-IFPRI: Multiple regressions prof anderson inference lecture - Devesh Roy
ICAR-IFPRI: Multiple regressions prof anderson inference lecture - Devesh RoyICAR-IFPRI: Multiple regressions prof anderson inference lecture - Devesh Roy
ICAR-IFPRI: Multiple regressions prof anderson inference lecture - Devesh Roy
 
Energy principles Att 6607
Energy principles Att 6607Energy principles Att 6607
Energy principles Att 6607
 
2. Linear Algebra for Machine Learning: Basis and Dimension
2. Linear Algebra for Machine Learning: Basis and Dimension2. Linear Algebra for Machine Learning: Basis and Dimension
2. Linear Algebra for Machine Learning: Basis and Dimension
 
Lecture 01.ppt
Lecture 01.pptLecture 01.ppt
Lecture 01.ppt
 
3. Weighted residual methods (1).pptx
3. Weighted residual methods (1).pptx3. Weighted residual methods (1).pptx
3. Weighted residual methods (1).pptx
 
Logic
LogicLogic
Logic
 
Chapter4
Chapter4Chapter4
Chapter4
 
Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...
Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...
Convex Partitioning of a Polygon into Smaller Number of Pieces with Lowest Me...
 
LECT_01.pptx
LECT_01.pptxLECT_01.pptx
LECT_01.pptx
 
Finite mathematics
Finite mathematicsFinite mathematics
Finite mathematics
 

More from International Food Policy Research Institute- South Asia Office

More from International Food Policy Research Institute- South Asia Office (20)

8. Dr Rao.pdf
8. Dr Rao.pdf8. Dr Rao.pdf
8. Dr Rao.pdf
 
13. Manish Patel.pdf
13. Manish Patel.pdf13. Manish Patel.pdf
13. Manish Patel.pdf
 
15. Smita Sirohi.pdf
15. Smita Sirohi.pdf15. Smita Sirohi.pdf
15. Smita Sirohi.pdf
 
6. CD Mayee.pdf
6. CD Mayee.pdf6. CD Mayee.pdf
6. CD Mayee.pdf
 
10. Keshavulu_1.pdf
10. Keshavulu_1.pdf10. Keshavulu_1.pdf
10. Keshavulu_1.pdf
 
12. Swati Nayak.pdf
12. Swati Nayak.pdf12. Swati Nayak.pdf
12. Swati Nayak.pdf
 
16. Dr Anjani.pdf
16. Dr Anjani.pdf16. Dr Anjani.pdf
16. Dr Anjani.pdf
 
9. Malavika Dadlani_1.pdf
9. Malavika Dadlani_1.pdf9. Malavika Dadlani_1.pdf
9. Malavika Dadlani_1.pdf
 
7. Neeru Bhooshan.pdf
7. Neeru Bhooshan.pdf7. Neeru Bhooshan.pdf
7. Neeru Bhooshan.pdf
 
4. Raj Ganesh.pdf
4. Raj Ganesh.pdf4. Raj Ganesh.pdf
4. Raj Ganesh.pdf
 
11. Surinder K Tikoo_1.pdf
11. Surinder K Tikoo_1.pdf11. Surinder K Tikoo_1.pdf
11. Surinder K Tikoo_1.pdf
 
3. DK Yadava.pdf
3. DK Yadava.pdf3. DK Yadava.pdf
3. DK Yadava.pdf
 
14. Paresh Verma_1.pdf
14. Paresh Verma_1.pdf14. Paresh Verma_1.pdf
14. Paresh Verma_1.pdf
 
5. Ram Kaundinya.pdf
5. Ram Kaundinya.pdf5. Ram Kaundinya.pdf
5. Ram Kaundinya.pdf
 
1. Dr Anjani.pdf
1. Dr Anjani.pdf1. Dr Anjani.pdf
1. Dr Anjani.pdf
 
2. David Spielman.pdf
2. David Spielman.pdf2. David Spielman.pdf
2. David Spielman.pdf
 
GFPR 2021 South Asia Launch Ppt - Dr. Shahidur Rashid
GFPR 2021 South Asia Launch Ppt - Dr. Shahidur RashidGFPR 2021 South Asia Launch Ppt - Dr. Shahidur Rashid
GFPR 2021 South Asia Launch Ppt - Dr. Shahidur Rashid
 
GFPR 2021 South Asia Launch Ppt - Dr. Johan Swinnen
GFPR 2021 South Asia Launch Ppt - Dr. Johan SwinnenGFPR 2021 South Asia Launch Ppt - Dr. Johan Swinnen
GFPR 2021 South Asia Launch Ppt - Dr. Johan Swinnen
 
Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...
Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...
Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...
 
Understanding the landscape of pulse policy in India and implications for trade
Understanding the landscape of pulse policy in India and implications for tradeUnderstanding the landscape of pulse policy in India and implications for trade
Understanding the landscape of pulse policy in India and implications for trade
 

Recently uploaded

Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 

Recently uploaded (20)

Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 

ICAR-IFPRI- Instrumental Variable Regression- Devesh Roy, IFPRI

  • 2. Economics 20 - Prof. Anderson 2 Instrumental Variables & 2SLS y = b0 + b1x1 + b2x2 + . . . bkxk + u x1 = p0 + p1z + p2x2 + . . . pkxk + v
  • 3. Economics 20 - Prof. Anderson 3 Why Use Instrumental Variables? • Instrumental Variables (IV) estimation is used when your model has endogenous x’s • That is, whenever Cov(x,u) ≠ 0 • Thus, IV can be used to address the problem of omitted variable bias • Additionally, IV can be used to solve the classic errors-in-variables problem
  • 4. Economics 20 - Prof. Anderson 4 What Is an Instrumental Variable? • In order for a variable, z, to serve as a valid instrument for x, the following must be true • The instrument must be exogenous • That is, Cov(z,u) = 0 • The instrument must be correlated with the endogenous variable x • That is, Cov(z,x) ≠ 0
  • 5. Economics 20 - Prof. Anderson 5 More on Valid Instruments • We have to use common sense and economic theory to decide if it makes sense to assume Cov(z,u) = 0 • We can test if Cov(z,x) ≠ 0 • Just testing H0: p1 = 0 in x = p0 + p1z + v • Sometimes refer to this regression as the first-stage regression
  • 6. Economics 20 - Prof. Anderson 6 IV Estimation in the Simple Regression Case • For y = b0 + b1x + u, and given our assumptions • Cov(z,y) = b1Cov(z,x) + Cov(z,u), so • b1 = Cov(z,y) / Cov(z,x) • Then the IV estimator for b1 is           xxzz yyzz ii ii 1 ˆb
  • 7. Economics 20 - Prof. Anderson 7 Inference with IV Estimation • The homoskedasticity assumption in this case is E(u2|z) = s2 = Var(u) • As in the OLS case, given the asymptotic variance, we can estimate the standard error     2 , 2 1 2 , 2 2 1 ˆˆ ˆ zxx zxx RSST se n Var s b s s b  
  • 8. Economics 20 - Prof. Anderson 8 IV versus OLS estimation • Standard error in IV case differs from OLS only in the R2 from regressing x on z • Since R2 < 1, IV standard errors are larger • However, IV is consistent, while OLS is inconsistent, when Cov(x,u) ≠ 0 • The stronger the correlation between z and x, the smaller the IV standard errors
  • 9. Economics 20 - Prof. Anderson 9 The Effect of Poor Instruments • What if our assumption that Cov(z,u) = 0 is false? • The IV estimator will be inconsistent, too • Can compare asymptotic bias in OLS and IV • Prefer IV if Corr(z,u)/Corr(z,x) < Corr(x,u) x u x u uxCorr xzCorr uzCorr s s bb s s bb   ),( ~ plim:OLS ),( ),(ˆplim:IV 11 11
  • 10. Economics 20 - Prof. Anderson 10 IV Estimation in the Multiple Regression Case • IV estimation can be extended to the multiple regression case • Call the model we are interested in estimating the structural model • Our problem is that one or more of the variables are endogenous • We need an instrument for each endogenous variable
  • 11. Economics 20 - Prof. Anderson 11 Multiple Regression IV (cont) • Write the structural model as y1 = b0 + b1y2 + b2z1 + u1, where y2 is endogenous and z1 is exogenous • Let z2 be the instrument, so Cov(z2,u1) = 0 and • y2 = p0 + p1z1 + p2z2 + v2, where p2 ≠ 0 • This reduced form equation regresses the endogenous variable on all exogenous ones
  • 12. Economics 20 - Prof. Anderson 12 Two Stage Least Squares (2SLS) • It’s possible to have multiple instruments • Consider our original structural model, and let y2 = p0 + p1z1 + p2z2 + p3z3 + v2 • Here we’re assuming that both z2 and z3 are valid instruments – they do not appear in the structural model and are uncorrelated with the structural error term, u1
  • 13. Economics 20 - Prof. Anderson 13 Best Instrument • Could use either z2 or z3 as an instrument • The best instrument is a linear combination of all of the exogenous variables, y2* = p0 + p1z1 + p2z2 + p3z3 • We can estimate y2* by regressing y2 on z1, z2 and z3 – can call this the first stage • If then substitute ŷ2 for y2 in the structural model, get same coefficient as IV
  • 14. Economics 20 - Prof. Anderson 14 More on 2SLS • While the coefficients are the same, the standard errors from doing 2SLS by hand are incorrect, so let Stata do it for you • Method extends to multiple endogenous variables – need to be sure that we have at least as many excluded exogenous variables (instruments) as there are endogenous variables in the structural equation
  • 15. Economics 20 - Prof. Anderson 15 Addressing Errors-in-Variables with IV Estimation • Remember the classical errors-in-variables problem where we observe x1 instead of x1* • Where x1 = x1* + e1, and e1 is uncorrelated with x1* and x2 • If there is a z, such that Corr(z,u) = 0 and Corr(z,x1) ≠ 0, then • IV will remove the attenuation bias
  • 16. Economics 20 - Prof. Anderson 16 Testing for Endogeneity • Since OLS is preferred to IV if we do not have an endogeneity problem, then we’d like to be able to test for endogeneity • If we do not have endogeneity, both OLS and IV are consistent • Idea of Hausman test is to see if the estimates from OLS and IV are different
  • 17. Economics 20 - Prof. Anderson 17 Testing for Endogeneity (cont) • While it’s a good idea to see if IV and OLS have different implications, it’s easier to use a regression test for endogeneity • If y2 is endogenous, then v2 (from the reduced form equation) and u1 from the structural model will be correlated • The test is based on this observation
  • 18. Economics 20 - Prof. Anderson 18 Testing for Endogeneity (cont) • Save the residuals from the first stage • Include the residual in the structural equation (which of course has y2 in it) • If the coefficient on the residual is statistically different from zero, reject the null of exogeneity • If multiple endogenous variables, jointly test the residuals from each first stage
  • 19. Economics 20 - Prof. Anderson 19 Testing Overidentifying Restrictions • If there is just one instrument for our endogenous variable, we can’t test whether the instrument is uncorrelated with the error • We say the model is just identified • If we have multiple instruments, it is possible to test the overidentifying restrictions – to see if some of the instruments are correlated with the error
  • 20. Economics 20 - Prof. Anderson 20 The OverID Test • Estimate the structural model using IV and obtain the residuals • Regress the residuals on all the exogenous variables and obtain the R2 to form nR2 • Under the null that all instruments are uncorrelated with the error, LM ~ cq 2 where q is the number of extra instruments
  • 21. Economics 20 - Prof. Anderson 21 Testing for Heteroskedasticity • When using 2SLS, we need a slight adjustment to the Breusch-Pagan test • Get the residuals from the IV estimation • Regress these residuals squared on all of the exogenous variables in the model (including the instruments) • Test for the joint significance
  • 22. Economics 20 - Prof. Anderson 22 Testing for Serial Correlation • When using 2SLS, we need a slight adjustment to the test for serial correlation • Get the residuals from the IV estimation • Re-estimate the structural model by 2SLS, including the lagged residuals, and using the same instruments as originally • Can do 2SLS on a quasi-differenced model, using quasi-differenced instruments
  • 23. Economics 20 - Prof. Anderson 23 Simultaneous Equations y1 = a1y2 + b1z1 + u1 y2 = a2y1 + b2z2 + u2
  • 24. Economics 20 - Prof. Anderson 24 Simultaneity • Simultaneity is a specific type of endogeneity problem in which the explanatory variable is jointly determined with the dependent variable • As with other types of endogeneity, IV estimation can solve the problem • Some special issues to consider with simultaneous equations models (SEM)
  • 25. Economics 20 - Prof. Anderson 25 Supply and Demand Example • Start with an equation you’d like to estimate, say a labor supply function • hs = a1w + b1z + u1, where • w is the wage and z is a supply shifter • Call this a structural equation – it’s derived from economic theory and has a causal interpretation where w directly affects hs
  • 26. Economics 20 - Prof. Anderson 26 Example (cont) • Problem that can’t just regress observed hours on wage, since observed hours are determined by the equilibrium of supply and demand • Consider a second structural equation, in this case the labor demand function • hd = a2w + u2 • So hours are determined by a SEM
  • 27. Economics 20 - Prof. Anderson 27 Example (cont) • Both h and w are endogenous because they are both determined by the equilibrium of supply and demand • z is exogenous, and it’s the availability of this exogenous supply shifter that allows us to identify the structural demand equation • With no observed demand shifters, supply is not identified and cannot be estimated
  • 28. Economics 20 - Prof. Anderson 28 Identification of Demand Equation w h D S (z=z1) S (z=z2) S (z=z3)
  • 29. Economics 20 - Prof. Anderson 29 Using IV to Estimate Demand • So, we can estimate the structural demand equation, using z as an instrument for w • First stage equation is w = p0 + p1z + v2 • Second stage equation is h = a2ŵ + u2 • Thus, 2SLS provides a consistent estimator of a2, the slope of the demand curve • We cannot estimate a1, the slope of the supply curve
  • 30. Economics 20 - Prof. Anderson 30 The General SEM • Suppose you want to estimate the structural equation: y1 = a1y2 + b1z1 + u1 • where, y2 = a2y1 + b2z2 + u2 • Thus, y2 = a2(a1y2 + b1z1 + u1) + b2z2 + u2 • So, (1 – a2a1)y2 = a2 b1z1 + b2z2 + a2 u1 + u2, which can be rewritten as • y2 = p1z1 + p2z2 + v2
  • 31. Economics 20 - Prof. Anderson 31 The General SEM (continued) • By substituting this reduced form in for y2, we can see that since v2 is a linear function of u1, y2 is correlated with the error term and a1 is biased – call it simultaneity bias • The sign of the bias is complicated, but can use the simple regression as a rule of thumb • In the simple regression case, the bias is the same sign as a2/(1 – a2a1)
  • 32. Economics 20 - Prof. Anderson 32 Identification of General SEM • Let z1 be all the exogenous variables in the first equation, and z2 be all the exogenous variables in the second equation • It’s okay for there to be overlap in z1 and z2 • To identify equation 1, there must be some variables in z2 that are not in z1 • To identify equation 2, there must be some variables in z1 that are not in z2
  • 33. Economics 20 - Prof. Anderson 33 Rank and Order Conditions • We refer to this as the rank condition • Note that the exogenous variable excluded from the first equation must have a non-zero coefficient in the second equation for the rank condition to hold • Note that the order condition clearly holds if the rank condition does – there will be an exogenous variable for the endogenous one
  • 34. Economics 20 - Prof. Anderson 34 Estimation of the General SEM • Estimation of SEM is straightforward • The instruments for 2SLS are the exogenous variables from both equations • Can extend the idea to systems with more than 2 equations • For a given identified equation, the instruments are all of the exogenous variables in the whole system