We can define heteroscedasticity as the condition in which the variance of the error term (the residual) in a regression model varies across observations. In a scatter diagram, homoscedastic data points are evenly dispersed about the regression line, while heteroscedastic data points are not.
Two conditions:
1. Known variance
2. Unknown variance
It is essential for all regression models that the relationship between the independent and dependent variables is represented correctly; the functional form tries to do exactly this. A functional form gives an equation for the dependent and independent variables so that hypothesis tests can be carried out properly. More information on functional forms of regression analysis: http://www.transtutors.com/homework-help/economics/functional-forms-of-regression-models.aspx
The linked video deals with various functional forms in regression, along with the derivation and interpretation of the slope and elasticity values of each model. The frequently used log-lin, lin-log and log-log models are also elaborated; the PowerPoint used in the video is linked separately as a pinned comment.
Heteroscedasticity refers to the violation of the homoscedasticity assumption of the linear regression model used in econometrics. In simple words, it is the situation in which the variance of the residual terms changes with the fitted value of the variable. More information on heteroscedasticity: http://www.transtutors.com/homework-help/economics/heteroscedasticity.aspx
The presentation aims to explain the meaning of ECONOMETRICS and why this subject is studied as a separate discipline.
The reference is based on the book "BASIC ECONOMETRICS" by Damodar N. Gujarati.
For further explanation, see the YouTube link:
https://youtu.be/S3SUDiVpUGU
Brief notes on heteroscedasticity, very helpful for beginners to econometrics. I taught this course to BS Economics students; these notes include all the necessary proofs.
This presentation explains almost all the concepts that need to be understood before running an OLS regression. The concepts of unconditional and conditional means are discussed in detail, along with the differences between the PRF and SRF.
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets (Derek Kane)
This lecture provides an overview of some modern regression techniques, including the bias-variance tradeoff for regression errors and shrinkage estimators. This leads into an overview of ridge regression, LASSO, and elastic nets, followed by calibration/diagnostics and a practical example highlighting the techniques.
This 10-hour class is intended to give students the basis to solve statistical problems empirically. Talk 1 serves as an introduction to the statistical software R and presents how to calculate basic measures such as the mean, variance, correlation and Gini index. Talk 2 shows how the central limit theorem and the law of large numbers work empirically. Talk 3 presents point estimates, confidence intervals and hypothesis tests for the most important parameters. Talk 4 introduces the linear regression model, and Talk 5 the bootstrap; Talk 5 also presents a simple example of a Markov chain.
All the talks are supported by scripts in the R language.
FSE 200 (Adkins)
Simple Linear Regression
Correlation only measures the strength and direction of the linear relationship between two quantitative variables. If the relationship is linear, then we would like to try to model that relationship with the equation of a line. We will use a regression line to describe the relationship between an explanatory variable and a response variable.
A regression line is a straight line that describes how a response variable y changes as an explanatory variable x changes. We often use a regression line to predict the value of y for a given value of x.
Ex. It has been suggested that there is a relationship between sleep deprivation of employees and the ability to complete simple tasks. To evaluate this hypothesis, 12 people were asked to solve simple tasks after having been without sleep for 15, 18, 21, and 24 hours. The sample data are shown below.
Subject | Hours without sleep, x | Tasks completed, y
1       | 15                     | 13
2       | 15                     | 9
3       | 15                     | 15
4       | 18                     | 8
5       | 18                     | 12
6       | 18                     | 10
7       | 21                     | 5
8       | 21                     | 8
9       | 21                     | 7
10      | 24                     | 3
11      | 24                     | 5
12      | 24                     | 4
Draw a scatterplot and describe the relationship. Lay a straight-edge on top of the plot and move it around until you find what you think might be a “line of best fit.” Then try to predict the number of tasks completed for someone having been without sleep 16 hours.
Was your line the same as that of the classmate sitting next to you? Probably not. We need a method to find the "best" regression line to use for prediction: the method of least squares. No line will pass exactly through all the points in the scatterplot. When we use the line to predict y for a given x value, and there is a data point with that same x value, we can compute the error (residual): residual = observed y − predicted ŷ.
Our goal is going to be to make the vertical distances from the line as small as possible. The most commonly used method for doing this is the least-squares method.
The least-squares regression line of y on x is the line that makes the sum of the squares of the vertical distances of the data points from the line as small as possible.
Equation of the Least-Squares Regression Line
· Least-Squares Regression Line: ŷ = b₀ + b₁x
· Slope of the Regression Line: b₁ = r(s_y / s_x)
· Intercept of the Regression Line: b₀ = ȳ − b₁x̄
Generally, regression is performed using statistical software. Clearly, given the appropriate information, the above formulas are simple to use.
Once we have the regression line, how do we interpret it, and what can we do with it?
The slope of a regression line is the rate of change: the amount of change in ŷ when x increases by 1.
The intercept of the regression line is the value of ŷ when x = 0. It is statistically meaningful only when x can take on values close to zero.
To make a prediction, just substitute an x-value into the equation and find ŷ.
To plot the line on a scatterplot, find two points on the regression line, one near each end of the range of x in the data. Plot the points and connect them with a line.
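As a quick sketch in plain Python (no libraries), the least-squares formulas above applied to the sleep-deprivation data give the slope, the intercept, and the prediction for 16 hours without sleep:

```python
# Least-squares fit of the sleep-deprivation example data.
x = [15, 15, 15, 18, 18, 18, 21, 21, 21, 24, 24, 24]  # hours without sleep
y = [13, 9, 15, 8, 12, 10, 5, 8, 7, 3, 5, 4]          # tasks completed

n = len(x)
x_bar = sum(x) / n          # 19.5
y_bar = sum(y) / n          # 8.25

# b1 = sum((xi - x_bar)(yi - y_bar)) / sum((xi - x_bar)^2), b0 = y_bar - b1 * x_bar
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
sxx = sum((xi - x_bar) ** 2 for xi in x)
b1 = sxy / sxx              # slope: about -0.944 tasks per extra hour awake
b0 = y_bar - b1 * x_bar     # intercept: about 26.67

# Prediction for someone without sleep for 16 hours (about 11.6 tasks).
y_hat_16 = b0 + b1 * 16
print(b1, b0, y_hat_16)
```

Note the negative slope: each additional hour without sleep predicts roughly one fewer completed task, consistent with the scatterplot's downward trend.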
This article provides a brief discussion on several statistical parameters that are most commonly used in any measurement and analysis process. There are a plethora of such parameters but the most important and widely used are briefed in here.
A logistic model (or logit model) is used to model the probability of events falling in one of two classes, such as alive/dead or healthy/sick. This can be extended to several classes of events, such as determining whether an image contains a cat, dog, lion, etc.; each class is assigned a probability between 0 and 1, with the probabilities summing to one.
A data analysis project presentation on predicting product ad-campaign performance, showing how data-driven insights can optimize marketing strategies and enhance campaign effectiveness. More details: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Techniques to optimize the PageRank algorithm usually fall into two categories: reducing the work per iteration, and reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices that have already converged can save iteration time. Skipping in-identical vertices (those with the same in-links) reduces duplicate computation and thus iteration time. Road networks often have chains that can be short-circuited before PageRank computation, since the final ranks of chain nodes are easy to calculate; this reduces both the iteration time and the number of iterations. If a graph has no dangling nodes, the PageRank of each strongly connected component can be computed in topological order, which can reduce iteration time and the number of iterations, and also enables multi-iteration concurrency. The combination of all of the above methods is the STICD algorithm [sticd]. For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
2. Introduction
We can define heteroscedasticity as the condition in which the variance of the error term (the residual) in a regression model varies across observations. As the diagram on this slide shows, under homoscedasticity the data points are equally scattered, while under heteroscedasticity they are not.
3. Possible reasons for heteroscedasticity
1. It often occurs in data sets with a large range between the largest and smallest observed values, i.e. when there are outliers.
2. When the model is not correctly specified.
3. When observations with different scales of measurement are mixed.
4. When an incorrect transformation of the data is used to perform the regression.
5. Skewness in the distribution of a regressor, among other possible sources.
4. Effects of Heteroscedasticity
• OLS (Ordinary Least Squares) estimators are no longer the Best Linear Unbiased Estimator (BLUE): their variance is not the lowest among all unbiased estimators.
• The estimators are no longer best/efficient.
• Hypothesis tests (such as the t-test and F-test) are no longer valid, because the covariance matrix of the estimated regression coefficients is inconsistent.
6. Weighted Least Squares (WLS) Estimator
• The Weighted Least Squares estimator is the OLS estimator applied to a transformed model, obtained by multiplying each term on both sides of the regression equation by a "weight", denoted wᵢ. For instance, consider the following general linear regression model with heteroscedasticity:
• Yᵢ = β₀ + β₁Xᵢ₁ + uᵢ ; i = 1, 2, …, n
• Var(uᵢ) = σ²Zᵢ² , where Zᵢ is some function of Xᵢ
• To obtain the WLS estimator, the transformed model is:
• wᵢYᵢ = wᵢβ₀ + β₁(wᵢXᵢ₁) + wᵢuᵢ ; i = 1, 2, …, n
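This weighting scheme can be sketched numerically. The snippet below (not from the slides) assumes for illustration that Zᵢ = Xᵢ, so wᵢ = 1/Xᵢ, and fits the WLS estimator by running ordinary least squares on the weighted variables; the true coefficients 2 and 3 are made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate Y_i = b0 + b1*X_i + u_i with Var(u_i) = sigma^2 * X_i^2 (so Z_i = X_i).
n = 500
X = rng.uniform(1.0, 10.0, n)
u = rng.normal(0.0, 1.0, n) * X          # heteroscedastic errors, sd proportional to X
Y = 2.0 + 3.0 * X + u

# WLS = OLS on the model multiplied through by w_i = 1/Z_i = 1/X_i.
w = 1.0 / X
A = np.column_stack([w, w * X])          # transformed regressors: w_i*1 and w_i*X_i
b = np.linalg.lstsq(A, w * Y, rcond=None)[0]
print(b)                                 # estimates of (b0, b1), close to (2, 3)
```

Note that no special WLS routine is needed: multiplying every term by wᵢ turns the problem into an ordinary least-squares fit on the transformed data, exactly as the slide describes.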
7. Question: For the model Yᵢ = βXᵢ + uᵢ with Var(uᵢ) = σ²Zᵢ², prove that the WLS estimator of β has lower variance than its OLS estimator, where the weight is wᵢ = 1/Zᵢ.
ANSWER: For the model Yᵢ = βXᵢ + uᵢ with Var(uᵢ) = σ²Zᵢ²:
The OLS estimator of β is β̂ = ΣXᵢYᵢ / ΣXᵢ², with
Var(β̂) = ΣXᵢ² Var(uᵢ) / (ΣXᵢ²)² = σ² ΣXᵢ²Zᵢ² / (ΣXᵢ²)²
If we divide the entire equation by Zᵢ:
Yᵢ/Zᵢ = β(Xᵢ/Zᵢ) + uᵢ/Zᵢ , or yᵢ = βxᵢ + vᵢ
Here Var(vᵢ) = Var(uᵢ)/Zᵢ² = σ²Zᵢ²/Zᵢ² = σ² (constant: homoscedasticity).
The WLS estimator of β is β* = Σxᵢyᵢ / Σxᵢ² = Σxᵢ(βxᵢ + vᵢ) / Σxᵢ² = β + Σxᵢvᵢ / Σxᵢ²
Var(β*) = E(β* − β)² = Σxᵢ² E(vᵢ²) / (Σxᵢ²)² = σ² Σxᵢ² / (Σxᵢ²)² = σ² / Σxᵢ²   [using E(uᵢuⱼ) = 0 for i ≠ j, since the errors are independent]
Finally, by the Cauchy-Schwarz inequality, (ΣXᵢ²)² = (Σ(Xᵢ/Zᵢ)(XᵢZᵢ))² ≤ Σ(Xᵢ²/Zᵢ²) · ΣXᵢ²Zᵢ², so
Var(β*) = σ² / Σ(Xᵢ²/Zᵢ²) ≤ σ² ΣXᵢ²Zᵢ² / (ΣXᵢ²)² = Var(β̂)
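The variance ranking in this derivation can be checked by Monte Carlo simulation. The sketch below assumes, for illustration, Zᵢ = Xᵢ and the no-intercept model Yᵢ = βXᵢ + uᵢ from the question; β = 3, σ = 1 and the X grid are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(1)
beta, sigma = 3.0, 1.0
X = np.linspace(1.0, 5.0, 50)
Z = X                                            # assume Z_i = X_i for the illustration

ols, wls = [], []
for _ in range(2000):
    u = rng.normal(0.0, sigma, X.size) * Z       # Var(u_i) = sigma^2 * Z_i^2
    Y = beta * X + u
    ols.append(np.sum(X * Y) / np.sum(X**2))     # OLS estimator (regression through origin)
    x, y = X / Z, Y / Z                          # weighted (transformed) variables
    wls.append(np.sum(x * y) / np.sum(x**2))     # WLS estimator

# The sampling variance of the WLS estimates is smaller, as the proof predicts.
print(np.var(ols), np.var(wls))
```

With Zᵢ = Xᵢ, the theoretical Var(β*) is σ²/Σ(Xᵢ/Zᵢ)² = σ²/n = 0.02 here, which the simulated variance should approximate.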
13.
• The WLS method makes an implicit assumption that the true error variance (σᵢ²) is known. In reality, however, it is difficult to know the true error variance, so we need other methods to obtain a consistent estimate of the variance of the error term.
• In this method, we make some assumptions about the true error variance (σᵢ²) and transform the original regression model; after transformation, the new model satisfies the homoscedasticity assumption. Say the original regression model is:
Yᵢ = β₁ + β₂Xᵢ + uᵢ and Var(uᵢ) = σᵢ² ; i = 1, 2, …, n
14. When the error variance is proportional to Xᵢ
Run the original OLS regression and obtain the residuals. Plot the squares of these residuals (σ̂ᵢ²) against the explanatory variable X. If we get a pattern similar to figure 1, we say that the error variance is proportional to (linearly related to) Xᵢ, and σ² is the factor of proportionality, which is a constant. Symbolically, E(uᵢ²) = σ²Xᵢ ; i = 1, 2, …, n.
Now we transform the original regression model by dividing the regression equation by √Xᵢ:
Yᵢ/√Xᵢ = β₁/√Xᵢ + β₂Xᵢ/√Xᵢ + uᵢ/√Xᵢ
       = β₁/√Xᵢ + β₂√Xᵢ + vᵢ ; i = 1, 2, …, n
Here vᵢ = uᵢ/√Xᵢ and Xᵢ > 0. This transformed regression equation is called the "square root transformation", and the error term vᵢ is homoscedastic.
Proof: E(vᵢ²) = E(uᵢ/√Xᵢ)² = E(uᵢ²)/Xᵢ = σ²
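A small simulation can illustrate the square root transformation. The data-generating process below (E(uᵢ²) = σ²Xᵢ with σ² = 4) is made up for the example; dividing the errors by √Xᵢ should make their spread roughly constant across low and high values of X.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 1000
X = rng.uniform(1.0, 9.0, n)
sigma2 = 4.0
u = rng.normal(0.0, np.sqrt(sigma2 * X))   # E(u_i^2) = sigma^2 * X_i: variance grows with X
Y = 1.0 + 0.5 * X + u

v = u / np.sqrt(X)                         # transformed error after dividing by sqrt(X_i)

# Compare error spread on the low-X and high-X halves of the sample.
lo, hi = X < 5.0, X >= 5.0
print(u[lo].var(), u[hi].var())            # clearly different: heteroscedastic
print(v[lo].var(), v[hi].var())            # both close to sigma^2 = 4: homoscedastic
```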
15. When the error variance is proportional to Xᵢ²
Run the original OLS regression and obtain the residuals. Plot the squares of these residuals (σ̂ᵢ²) against the explanatory variable X. If we get a pattern similar to figure 2, we say that the error variance is proportional to Xᵢ² (non-linearly related to Xᵢ), and σ² is the factor of proportionality, which is a constant. Symbolically, E(uᵢ²) = σ²Xᵢ² ; i = 1, 2, …, n.
Now we transform the original regression model by dividing the regression equation by Xᵢ:
Yᵢ/Xᵢ = β₁/Xᵢ + β₂Xᵢ/Xᵢ + uᵢ/Xᵢ
      = β₁/Xᵢ + β₂ + vᵢ ; i = 1, 2, …, n
Here vᵢ = uᵢ/Xᵢ and Xᵢ > 0. This transformed regression equation is called the "square transformation", and the error term vᵢ is homoscedastic.
Proof: E(vᵢ²) = E(uᵢ/Xᵢ)² = E(uᵢ²)/Xᵢ² = σ²
16. When the error variance is proportional to the square of the mean value of Y
According to this assumption, the error variance is proportional to the square of the mean value of Y, and σ² is a constant. Symbolically, E(uᵢ²) = σ²[E(Yᵢ)]² ; i = 1, 2, …, n.
Now we transform the original regression model by dividing it by E(Yᵢ), where E(Yᵢ) = β₁ + β₂Xᵢ:
Yᵢ/E(Yᵢ) = β₁/E(Yᵢ) + β₂Xᵢ/E(Yᵢ) + uᵢ/E(Yᵢ)
         = β₁/E(Yᵢ) + β₂Xᵢ/E(Yᵢ) + vᵢ ; i = 1, 2, …, n
where vᵢ = uᵢ/E(Yᵢ). We can show that the error term vᵢ is homoscedastic.
Proof: E(vᵢ²) = E(uᵢ/E(Yᵢ))² = E(uᵢ²)/[E(Yᵢ)]² = σ²
17.
E(Yᵢ) depends on β₁ and β₂, which are unknown. We know
Ŷᵢ = β̂₁ + β̂₂Xᵢ
which is an estimator of E(Yᵢ).
First, we run the usual OLS regression, disregarding the heteroscedasticity problem, and obtain Ŷᵢ; then, using the estimated Ŷᵢ, we transform our model:
Yᵢ/Ŷᵢ = β₁(1/Ŷᵢ) + β₂(Xᵢ/Ŷᵢ) + uᵢ/Ŷᵢ ; i = 1, 2, …, n
The transformation will perform satisfactorily in practice if the sample size is reasonably large.
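This two-step procedure can be sketched as follows. The model Yᵢ = 5 + 2Xᵢ with errors proportional to E(Yᵢ) is a made-up illustration; step 1 is the ordinary OLS fit that ignores heteroscedasticity, and step 2 reruns OLS on the model divided through by the fitted values Ŷᵢ.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 400
X = rng.uniform(1.0, 10.0, n)
mean_Y = 5.0 + 2.0 * X
Y = mean_Y + rng.normal(0.0, 0.3, n) * mean_Y    # E(u_i^2) = sigma^2 * [E(Y_i)]^2

# Step 1: usual OLS, disregarding heteroscedasticity, to get fitted values Y_hat.
A = np.column_stack([np.ones(n), X])
b_ols = np.linalg.lstsq(A, Y, rcond=None)[0]
Y_hat = A @ b_ols

# Step 2: OLS on the model divided through by Y_hat.
A2 = np.column_stack([1.0 / Y_hat, X / Y_hat])
b_fgls = np.linalg.lstsq(A2, Y / Y_hat, rcond=None)[0]
print(b_fgls)                                    # close to the true (5, 2)
```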
18. Log Transformation
A log transformation of the original regression model can help reduce the problem of heteroscedasticity. Symbolically,
log(Yᵢ) = β₁ + β₂ log(Xᵢ) + uᵢ ; i = 1, 2, …, n
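A quick sketch of why this helps: if the error is multiplicative in levels (so the spread of Y grows with its mean), the log-log model has a homoscedastic additive error. The parameter values below are made up for the example.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 1000
X = rng.uniform(1.0, 100.0, n)

# Multiplicative error in levels: Y = exp(b1) * X^b2 * exp(u).
# In levels the spread of Y grows with X; after taking logs, u is homoscedastic.
u = rng.normal(0.0, 0.2, n)
Y = np.exp(1.0) * X**0.8 * np.exp(u)

# OLS on log(Y) = b1 + b2*log(X) + u.
A = np.column_stack([np.ones(n), np.log(X)])
b = np.linalg.lstsq(A, np.log(Y), rcond=None)[0]
print(b)                                   # close to the true (1.0, 0.8)
```

In this log-log form the slope β₂ is also directly interpretable as an elasticity, which is a second common reason for using the transformation.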
25. Some "problems" associated with this transformation method
• In multiple regression models, we may not be able to decide which of the X variables should be chosen for transforming the data.
• The log transformation is not applicable if some of the Y and X values are zero or negative.
• It may happen that ratios of variables are correlated even though the original variables are uncorrelated or random. For instance, in the model Yᵢ = β₁ + β₂Xᵢ + uᵢ, Y and X may not be correlated, but in the transformed model Yᵢ/Xᵢ = β₁/Xᵢ + β₂ + uᵢ/Xᵢ, the ratios Yᵢ/Xᵢ and 1/Xᵢ are often found to be correlated. Hence there is a problem of spurious correlation.
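The spurious-correlation point is easy to demonstrate numerically: below, X and Y are independent by construction, yet the ratios Y/X and 1/X are strongly correlated because they share the common factor 1/X.

```python
import numpy as np

rng = np.random.default_rng(5)
n = 5000
X = rng.uniform(0.5, 5.0, n)
Y = rng.uniform(0.5, 5.0, n)                 # X and Y drawn independently

r_raw = np.corrcoef(Y, X)[0, 1]              # near 0: no real relationship
r_ratio = np.corrcoef(Y / X, 1.0 / X)[0, 1]  # clearly positive: spurious correlation
print(round(r_raw, 3), round(r_ratio, 3))
```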
26. Summary
All of the remedial measures discussed above are just ways to speculate about the nature of the population error variance σᵢ². Which method to use depends on the nature of the problem and the severity of the heteroscedasticity.