Panel data regression models analyze data that follows individuals or cross-sectional units over time, allowing for multiple observations of each unit. This creates a two-dimensional panel data set with a cross-sectional dimension (N units) and time series dimension (T time periods). Panel data can be balanced if all units are observed over the same time periods, or unbalanced if time periods vary across units. Panel data offers efficiency gains over cross-sectional or time series data alone by increasing observations and reducing collinearity. Regression models for panel data include fixed effects, random effects, and models allowing intercept and/or slope to vary across units or time.
Premium Call Girls Bangalore Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...
Panel data
1. Panel Data Regression Models
Dr. T.Sampathkumar
Assistant Professor in Economics
Govt.Arts College (Autonomous)
Coimbatore.641 018.
tsampath_136@yahoo.com
2. Time series data
(data collected on one individual/unit over several time periods)
or
Cross-sectional data
(data collected on several individuals/units at one point in time)
3. Time Series Data
Sl.No Year Y X1 X2
1 1991 2985 125 35
2 1992 4517 214 50
3 1993 4925 284 54
. . . . .
. . . . .
19 2009 3105 568 235
20 2010 4258 715 368
6. A longitudinal, or panel, data set is one that follows a given sample
of individuals over time, and thus provides multiple observations on
each individual in the sample.
Panel data are repeated cross-sections over time with space and time
dimensions.
Panel Data
7. A Micro-panel data (Short panel) set is a panel for which the time dimension T
is largely less important than the individual dimension N
T << N
A Macro-panel data (Long Panel) set is a panel for which the time dimension T
is similar to the individual dimension
T ≈ N
A panel is said to be balanced if we have the same time periods,
t = 1, ...... ,T,
for each cross section observation.
For an unbalanced panel, the time dimension, denoted Ti , is specific to each
individual.
10. Terminology and notations:
Individual or cross section unit : country, region, state, firm, consumer,
individual or countries etc.
Double index : i (for cross-section unit) and t (for time)
yit for i = 1, ..,N and t = 1, ..,T
Yit , Xit
11. Potential gains
• Panel data usually give a large number of data points (N * T),
increasing the degrees of freedom and reducing the collinearity
among explanatory variables
• Improves the efficiency of econometric estimates
• Especially suitable to study dynamics of change
• Panel data involve two dimensions: a cross-sectional dimension N,
• and a time-series dimension T.
• Minimize bias due to aggregation
13. Ÿ (Y- Ῡ) = β1 Ẍ1it (Xit - Xit)+β2Ẍ2it (Xit - Xit)
Now, for each “i” average this equation over time i.e.,
ΣY/N (Ῡ)
and
ΣX/N (X)
Demeaning Method (within estimation)
Take the deviation of Y (or) X from its mean value. i.e.,
Y- Ῡ (= Ÿ) and X - X (=Ẍ)
Ÿ = β1 Ẍ1it+β2Ẍ2it + üit
Yit = β1Xit + β2Xit +ai + uit
15. To find out the individual or time effect, there are certain possibilities
1. Intercept and slope are constant and error term captures space and time effect
2. Intercept may change across samples (constant slope)
3. Intercept may change across time period (constant slope)
4. Both intercept and slope change
16. Yit = β0+β1Xit + β2Xit + uit
Both intercept and slope are constant
17. Yit = α1+α2d2i+α3d3i+α4d4i+β1Xit + β2Xit + uit
d’s are dummy variable. If there are four cross sections,
d=1 for cross section 1 and 0 if not
Intercept varies across cross sections, but slope is constant
Least Square Dummy Variable Method (LSDM)
18. Yit = λo+λ1t1t+……….+ λ9t9t +β1Xit + β2Xit + uit
t’s are time dummy variable. If there are 10 time periods,
t=1 for time period 1 and 0 if not
Intercept varies across time periods (time dummy), but slope is
constant
19. Both intercept and slope change across individuals
Yit = α1+α2d2i+α3d3i+α4d4i+β1Xit + β2Xit +
ϒ1(d2x1) + ϒ2 (d2x2)+ϒ3 (d3x1)+ϒ4 (d3x2)+ϒ5 (d4x1)+ϒ6 (d4x2)+ui
There are four cross sections and two explanatory variables
20. Random Effects Model (Error Component Model, (ECM))
Yit = β0+β1Xit + β2Xit + 𝜔it
𝜔it = εi +uit
εi = individual specific error term
uit = combined time and cross section error term
𝜀i = ~ N (0, 𝜎𝜖
2
)
uit = ~ N (0, 𝜎 𝑢
2)
E (𝜀i uit ) = 0
E (𝜀i 𝜀𝑗 ) = 0
E (uit uis ) = 0
21. whether or not the individuals can be viewed as a random sample from
a large population
E (εi Xi ) = 0
If yes: random effects, if no: fixed effects
The relation between T and N
for large T and small N not a big difference
for small T and large N random effects estimators are more effcient
than fixed effects
Choice between Random and Fixed Effects Model
22. Hausman Test
To decide between fixed or random effects you can run a Hausman
test where the null hypothesis is that the preferred model is random
effects vs. the alternative the fixed effects
It basically tests whether the unique errors (ui) are correlated with the
regressors, the null hypothesis is they are not.
Decision: if the test value (chi-2) is less than 0.05 %, use fixed effect model
or random effects is more efficient.