Econometrics is the application of statistical methods to economic data in order to test economic theories empirically. A central concern is finding estimators with desirable statistical properties such as unbiasedness, efficiency, and consistency. These estimators are used to assess economic theories, forecast macroeconomic indices, predict revenue, and estimate the impact of economic changes. Common econometric tools include linear regression models, which rest on assumptions about the error term (residuals), and autoregressive models, which forecast variables from their own lagged values.
This document provides an introduction to financial econometrics. It defines econometrics as the application of statistical techniques to economic and financial problems. The key aspects of econometrics discussed include establishing mathematical models of economic theories, collecting and testing data, and using models for forecasting, prediction, and policy purposes. The document also distinguishes between econometrics and financial econometrics, noting that the latter focuses more on financial data and variables like stock and index prices and returns. It outlines some common financial data characteristics and approaches to modeling financial data.
This document provides an introduction to biostatistics. It outlines several key objectives of a biostatistics course including understanding descriptive statistics, statistical inference, common tests and their assumptions. It defines important statistical concepts like population, sample, parameters, statistics, variables, and types of statistical analysis. Descriptive statistics are used to summarize data, while inferential statistics allow generalizing from samples to populations. Examples of potential statistical abuses are also provided.
Regression analysis is a statistical technique used to model relationships between variables. It allows one to predict the average value of a dependent variable based on the value of one or more independent variables. The key ideas are that the dependent variable is influenced by the independent variables in a linear or curvilinear fashion, and regression provides an equation to estimate the dependent variable given values of the independent variables. Common applications of linear regression include forecasting, determining relationships between variables, and estimating how changes in one variable impact another.
1. The document discusses econometrics and the linear regression model. It outlines the methodology of econometric research which includes stating a theory or hypothesis, specifying a mathematical model, specifying an econometric model, obtaining data, estimating parameters, hypothesis testing, forecasting, and using the model for policy purposes.
2. It provides an example of specifying Keynes' consumption function as the mathematical model C = β1 + β2X, where C is consumption and X is income. For the econometric model, an error term is added to allow for inexact relationships.
3. Assumptions of the classical linear regression model are discussed, including that the error term is uncorrelated with X, has a mean of zero, and has a constant variance (homoscedasticity).
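The consumption-function example in point 2 can be estimated by ordinary least squares in a few lines. This is a minimal sketch; the income and consumption figures below are invented for illustration.

```python
# Illustrative OLS fit of the Keynesian consumption function C = b1 + b2*X + u.
# The income (X) and consumption (C) figures are hypothetical sample data.
import numpy as np

X = np.array([80, 100, 120, 140, 160, 180, 200, 220, 240, 260], dtype=float)  # income
C = np.array([70, 65, 90, 95, 110, 115, 120, 140, 155, 150], dtype=float)     # consumption

# OLS slope and intercept via the closed-form formulas
b2 = np.cov(X, C, bias=True)[0, 1] / np.var(X)   # slope: marginal propensity to consume
b1 = C.mean() - b2 * X.mean()                     # intercept

residuals = C - (b1 + b2 * X)
print(f"C_hat = {b1:.2f} + {b2:.3f} * X")
print(f"mean residual: {residuals.mean():.2e}")   # ~0 by construction of OLS
```

The slope estimate falls between 0 and 1, as Keynes' theory of the marginal propensity to consume predicts.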
This document outlines the syllabus for a course titled "Predictive Analytics" taught by K. Mohanasundaram. The syllabus covers topics such as introduction to business analytics, mathematical modelling, data prediction techniques, regression analysis methods like simple linear regression, logistic regression, and forecasting techniques. It recommends textbooks and references for the course and provides an introduction to concepts like uncertainty modelling using probability distributions and random variables.
The document discusses key concepts in quantitative research methods and data analytics covered in a university course. It outlines the course content, which includes topics like data visualization, the normal distribution, and hypothesis testing. It then details the course assessments, which include a mid-term assignment and a final coursework report, worth 30% and 70% respectively. The final report involves selecting a topic, collecting and analyzing data using R Studio, and reporting the results in a 2000-word paper with sections on introduction, data, results, and conclusion.
This document provides an overview of econometric modeling techniques. It discusses objectives of econometric modeling including empirical verification of economic theories and policy analysis. It also describes types of econometric models such as single-equation regression models, simultaneous-equation models, and time series models. Model building criteria and assumptions of single-equation regression models are outlined along with methods for dealing with violations of assumptions like multicollinearity and autocorrelation.
This document discusses various statistical concepts used in research. It defines the coefficient of variation as a measure of relative variability that describes the amount of variability relative to the mean. It also discusses arithmetic average, different methods to measure skewness in a data distribution such as Pearson's coefficient and Bowley's coefficient, and the regression equation that represents the relationship between an independent and dependent variable in simple regression analysis. The document provides examples and limitations of these statistical concepts.
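The coefficient of variation and Pearson's skewness coefficient mentioned in the summary above can be computed directly. A minimal sketch, using an invented sample with one high outlier to produce visible right skew:

```python
# Coefficient of variation and Pearson's (second) skewness coefficient
# on a small hypothetical sample.
import statistics as st

data = [12, 15, 11, 14, 13, 40]  # invented sample with one high outlier

mean = st.mean(data)
median = st.median(data)
sd = st.pstdev(data)  # population standard deviation

cv = sd / mean                           # relative variability w.r.t. the mean
pearson_skew = 3 * (mean - median) / sd  # positive when the tail is to the right

print(f"CV = {cv:.2%}, Pearson skewness = {pearson_skew:.2f}")
```

A positive skewness value confirms the distribution is pulled to the right by the outlier, while the CV expresses dispersion as a fraction of the mean.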
The document outlines the objectives and units of a quantitative analysis course. The objectives are to acquaint students with statistical tools and techniques used for business decision making. The units cover topics like frequency distributions, measures of central tendency, correlation analysis, regression analysis, time series analysis, probability distributions, hypothesis testing, and analysis of variance.
Statistical analysis is an important tool for researchers to analyze collected data. There are two major areas of statistics: descriptive statistics which develops indices to describe data, and inferential statistics which tests hypotheses and generalizes findings. Descriptive statistics measures central tendency (mean, median, mode), dispersion (range, standard deviation), and skewness. Relationship between variables is measured using correlation and regression analysis. Statistical tools help summarize large datasets, identify patterns, and make reliable inferences.
Forecasting is important for businesses to plan activities and meet goals. There are qualitative and quantitative forecasting methods. Qualitative methods include expert opinions, while quantitative methods use past data patterns in time series models. Common time series models are moving averages, which smooth fluctuations, and exponential smoothing, which weights recent data higher. Forecasts are compared to actuals to measure error using metrics like mean absolute deviation and standard error of estimate. Accurate forecasting allows businesses to better allocate resources and serve customers.
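The smoothing methods and error metric named above can be sketched in a few lines. The demand figures here are invented; real applications would tune the window length and smoothing constant.

```python
# Simple moving average, simple exponential smoothing, and mean absolute
# deviation (MAD) on a hypothetical demand series.
demand = [102, 110, 108, 115, 120, 118, 125, 130]

# 3-period moving average: the forecast for period t is the mean of the
# previous three actuals.
ma_forecasts = [sum(demand[t - 3:t]) / 3 for t in range(3, len(demand))]

# Simple exponential smoothing: F_{t+1} = alpha*A_t + (1 - alpha)*F_t,
# so recent actuals get geometrically higher weight.
alpha = 0.3
ses = [demand[0]]          # initialize the first forecast with the first actual
for actual in demand[:-1]:
    ses.append(alpha * actual + (1 - alpha) * ses[-1])

# MAD of the moving-average forecasts against the matching actuals
mad = sum(abs(a - f) for a, f in zip(demand[3:], ma_forecasts)) / len(ma_forecasts)
print(f"MA forecasts: {[round(f, 1) for f in ma_forecasts]}")
print(f"MAD = {mad:.2f}")
```

A lower MAD on held-out periods is the usual basis for choosing between such smoothers.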
This document provides an overview and agenda for "The ONS Guide to Social and Economic Research". It discusses conducting ethical research, different data collection and analysis methods, and presenting data. The guide was created by the ONS to help students with independent research projects for the Welsh Baccalaureate Award. It covers topics like qualitative and quantitative research, sampling techniques, correlation analysis, and presenting data in an annotated and colorful way to aid understanding.
1. Demand forecasting is used to estimate future demand for products over specific time periods and is important for planning operations.
2. Demand can be categorized by the type of goods (consumer vs capital) and time period (short, medium, long term). Quantitative forecasting techniques include trend projection methods like time series analysis and regression.
3. Techniques like ARIMA combine moving averages and autoregressive methods to model trends and differences in time series data. Regression analysis uses statistical methods to model relationships between demand and influencing factors.
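The ARIMA building blocks named in point 3 can be illustrated on a toy series: difference a trending series to remove the trend (the "I" step), then fit an autoregressive model on the differences. The series values below are invented, and a real analysis would use a dedicated library rather than this hand-rolled AR(1) fit.

```python
# Differencing plus an AR(1) fit by least squares, as a toy ARIMA illustration.
import numpy as np

y = np.array([10, 12, 15, 19, 24, 30, 37, 45, 54, 64], dtype=float)  # trending series

d = np.diff(y)  # first differences remove the trend: 2, 3, 4, ...

# AR(1) on the differences: d_t = c + phi * d_{t-1} + e_t
X = np.column_stack([np.ones(len(d) - 1), d[:-1]])
c, phi = np.linalg.lstsq(X, d[1:], rcond=None)[0]

next_diff = c + phi * d[-1]   # forecast the next difference
forecast = y[-1] + next_diff  # undo the differencing to forecast the level
print(f"phi = {phi:.2f}, next value forecast = {forecast:.1f}")
```

Because the toy differences grow by exactly one each period, the fitted AR(1) recovers that pattern and extends it one step ahead.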
Regression analysis is a statistical technique used to determine the relationship between variables. It allows one to quantify the strength and character of the association between a dependent variable and one or more independent variables. Regression models are used across various disciplines like finance, economics, and investing to help explain phenomena and predict outcomes.
This document provides an overview of key concepts in quantitative data analysis, including:
1. It describes four scales of measurement (nominal, ordinal, interval, ratio) and warns against using statistics inappropriate for the scale of data.
2. It distinguishes between parametric and non-parametric statistics, descriptive and inferential statistics, and the types of variables and analyses.
3. It explains important statistical concepts like hypotheses, one-tailed and two-tailed tests, distributions, significance, and avoiding type I and II errors in hypothesis testing.
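The two-tailed testing logic in point 3 can be sketched with the standard library alone. The sample values and null mean are invented, and for small samples a proper t-test (e.g. from scipy.stats) would be preferred over this large-sample z approximation.

```python
# Two-tailed z-test of H0: mu = 5.0 on a hypothetical sample.
import math
import statistics as st

sample = [5.1, 4.9, 5.3, 5.2, 4.8, 5.4, 5.0, 5.3, 5.1, 5.2,
          5.0, 4.9, 5.2, 5.1, 5.3, 5.0, 5.2, 4.8, 5.1, 5.2]
mu0 = 5.0  # null-hypothesis mean

n = len(sample)
z = (st.mean(sample) - mu0) / (st.stdev(sample) / math.sqrt(n))

# Two-tailed p-value from the standard normal CDF (via the error function)
p_value = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

# 5% significance level: the accepted probability of a type I error
reject = p_value < 0.05
print(f"z = {z:.2f}, p = {p_value:.4f}, reject H0: {reject}")
```

Doubling the one-tail probability is what makes the test two-tailed: deviations in either direction count against the null.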
The Strange World of Bibliometric Numbers: Implications for Professional Prac... (Sam Gray)
The document discusses issues with bibliometric indicators and provides recommendations for their appropriate use and interpretation. It notes that bibliometrics deals with rare events that can skew data in unexpected ways. Average values are problematic and stability intervals should be provided due to data instability. The h-index measures consistency, not absolute impact. Context is important for interpreting indicators, which are simplified and discard detail. Data transformation can make averages more meaningful by accounting for skewed distributions.
This document provides an overview of statistics and biostatistics. It defines statistics as the collection, analysis, and interpretation of quantitative data. Biostatistics refers to applying statistical methods to biological and medical problems. Descriptive statistics are used to summarize and organize data, while inferential statistics allow generalization from samples to populations. Common statistical measures include the mean, median, and mode for central tendency, and range, standard deviation, and variance for variability. Correlation analysis examines relationships between two variables. The document discusses various data types and measurement scales used in statistics. Overall, it serves as a basic introduction to key statistical concepts for research.
The document discusses various concepts related to time series analysis and correlation. It defines time series as a sequence of data points measured over successive time periods. Time series analysis is used to extract meaningful patterns from temporal data and forecast future values. Correlation analysis examines the relationship between two quantitative variables, and can be positive, negative, linear or non-linear. Regression analysis is used to estimate the value of a dependent variable based on the value of an independent variable. Key components of time series include trends, cyclical variations, seasonal variations, and irregular variations.
This document provides an overview of demand forecasting and inventory prediction techniques. It discusses the importance of accurate forecasting to ensure sufficient inventory levels. Key elements for successful forecasting include historical data on inventory levels, orders, trends, seasonality, and expected demand. Common forecasting models are explained, including simple exponential smoothing, Holt's linear trend method, and Holt-Winters seasonal method. The document also covers concepts like stationarity, differencing time series data to make it stationary, and using autoregressive integrated moving average (ARIMA) and seasonal ARIMA (SARIMA) models to forecast time series with trends or seasonal patterns. Homework is assigned to further experiment with transforming time series to achieve stationarity.
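Holt's linear trend method mentioned above extends simple exponential smoothing with a second smoothed component for the trend. A minimal sketch; the demand series, initialization, and smoothing parameters are invented for illustration.

```python
# Holt's linear trend method on a hypothetical, steadily growing demand series.
y = [100, 104, 109, 113, 118, 122, 127, 131]
alpha, beta = 0.5, 0.3  # level and trend smoothing parameters (illustrative)

level, trend = y[0], y[1] - y[0]  # a common simple initialization
for obs in y[1:]:
    prev_level = level
    level = alpha * obs + (1 - alpha) * (level + trend)   # smooth the level
    trend = beta * (level - prev_level) + (1 - beta) * trend  # smooth the trend

h = 2
forecast = level + h * trend  # h-step-ahead forecast extends the fitted trend
print(f"level = {level:.1f}, trend = {trend:.2f}, 2-step forecast = {forecast:.1f}")
```

Unlike simple exponential smoothing, which flattens out, the forecast here keeps climbing with the estimated trend.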
Economists develop theories to explain important economic issues by setting out definitions, assumptions, and testable predictions. They collect data to test theories and provide advice. Theories are represented through models and diagrams, which simplify complex problems. While individual behavior is unpredictable, aggregate outcomes can be predicted statistically. Economists may disagree due to using different benchmarks, timeframes, values, or because multiple perspectives have merit. They present data visually through indexes and graphs to identify relationships between variables.
The document provides an overview of regression analysis techniques, including linear regression and logistic regression. It explains that regression analysis is used to understand relationships between variables and can be used for prediction. Linear regression finds relationships when the dependent variable is continuous, while logistic regression is used when the dependent variable is binary. The document also discusses selecting the appropriate regression model and highlights important considerations for linear and logistic regression.
- Biostatistics refers to applying statistical methods to biological and medical problems. It is also called biometrics, which means biological measurement or measurement of life.
- There are two main types of statistics: descriptive statistics which organizes and summarizes data, and inferential statistics which allows conclusions to be made from the sample data.
- Data can be qualitative, like gender or eye color, or quantitative, with numerical values like age, height, and weight. Quantitative data can further be classified as interval or ratio scale, and as discrete or continuous.
- Common measures of central tendency include the mean, median and mode. Measures of variability include range, standard deviation, variance and coefficient of variation.
- Correlation describes the direction and strength of the relationship between two variables.
This work explains basic statistics for data analysis, including types of data, measures of central tendency (mean, median, etc.), measures of dispersion (variance, standard deviation), quartiles, percentiles, and outliers. In this task, statistics were used to analyze voucher redemptions and service-level agreements, and to compare pay against living costs.
This document provides an overview of forecasting methods. It discusses:
- The definition and importance of forecasting for business decisions.
- Time horizons for short, medium, and long-range forecasts.
- Factors that influence forecasts like product life cycles.
- Qualitative and quantitative forecasting approaches. Quantitative methods include time series analysis, exponential smoothing, and regression analysis.
- Key considerations for selecting and evaluating forecasting methods like accuracy metrics and correlation.
Prof. Chitwan Lalji teaches economics at the Indian Institute of Management Kozhikode. The document discusses key concepts in econometrics including:
1) Econometrics uses statistical methods to analyze economic data and test economic theories using real-world data.
2) The main steps in econometrics analysis are developing an economic theory, specifying an econometric model, and conducting hypothesis tests.
3) Econometric models relate an outcome variable like demand or wages to explanatory variables based on economic theory and include an error term for unobserved factors. These models are used to test hypotheses about economic relationships.
A presentation for Multiple linear regression.ppt
Multiple linear regression (MLR) is a statistical method used to predict the value of a dependent variable based on the values of two or more independent variables. MLR produces an equation that estimates the best weighted combination of independent variables to predict the dependent variable. MLR can assess the contribution and relative importance of each predictor variable while controlling for the effects of the other predictors. MLR requires that assumptions of independence, normality, homoscedasticity, and linearity are met.
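The MLR description above can be illustrated with a short least-squares fit. This is a sketch on invented, noiseless data (two hypothetical predictors, ad spend and price, predicting sales), so the fitted coefficients recover the generating values exactly; real data would add noise and diagnostics for the stated assumptions.

```python
# Multiple linear regression via least squares: sales ~ ad_spend + price.
import numpy as np

ad_spend = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
price    = np.array([9.0, 8.0, 9.0, 7.0, 8.0, 6.0])
sales    = np.array([36.0, 42.0, 44.0, 52.0, 54.0, 62.0])  # = 50 + 4*ad - 2*price

# Design matrix with an intercept column
X = np.column_stack([np.ones_like(ad_spend), ad_spend, price])
coef, *_ = np.linalg.lstsq(X, sales, rcond=None)

# Each slope is the effect of its predictor holding the other constant
pred = X @ coef
r2 = 1 - ((sales - pred) ** 2).sum() / ((sales - sales.mean()) ** 2).sum()
print("coefficients:", np.round(coef, 2), "R^2:", round(r2, 3))
```

The slope on price is negative while the slope on ad spend is positive, exactly the kind of "contribution while controlling for the other predictors" reading the summary describes.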
Statistical concepts and their applications in various fields:
- Statistics involves collecting and analyzing numerical data to draw valid conclusions. It requires careful research planning and design.
- Descriptive statistics summarize data through measures of central tendency (mean, median, mode) and variability (range, standard deviation).
- Inferential statistics test hypotheses and make estimates about populations based on samples.
- Biostatistics is applied in community medicine, public health, cancer research, pharmacology, and demography to study disease trends, treatment effectiveness, and population attributes. It is also used in advanced biomedical technologies and ecology.
Time Series Analysis and Forecasting.ppt
This document discusses time series analysis and forecasting. It introduces time series data and examples. The main methods for forecasting time series are regression analysis and time series analysis (TSA), which examines past behavior to predict future behavior without causal variables. TSA involves analyzing trends, cycles, seasonality, and random variations. Forecasting accuracy is measured using techniques like mean absolute deviation and mean square error. Extrapolation models like moving averages, weighted moving averages, and exponential smoothing are discussed for forecasting, as well as approaches for stationary, additive seasonal, multiplicative seasonal, and trend data.
This document discusses code examples for creating a basic "Hello World" API endpoint using different web application frameworks like ASP.NET Core, Express, and WebApplication. It shows how to set up a GET route that returns the string "Hello World" using controllers, middleware, and other framework-specific features in each case. It also includes code for basic RESTful API endpoints for a ticket ordering system, including getting tickets, checkout, and validation.
"Are you developing or declining? Don't become an IT-dinosaur" (Sigma Software)
Krzysztof Rakowski and Paweł Rekowski, Tech Buzz Project Management meetup, Warsaw, 2022
Time Series Analysis and Forecasting.pptssuser220491
This document discusses time series analysis and forecasting. It introduces time series data and examples. The main methods for forecasting time series are regression analysis and time series analysis (TSA), which examines past behavior to predict future behavior without causal variables. TSA involves analyzing trends, cycles, seasonality, and random variations. Forecasting accuracy is measured using techniques like mean absolute deviation and mean square error. Extrapolation models like moving averages, weighted moving averages, and exponential smoothing are discussed for forecasting, as well as approaches for stationary, additive seasonal, multiplicative seasonal, and trend data.
This document discusses code examples for creating a basic "Hello World" API endpoint using different web application frameworks like ASP.NET Core, Express, and WebApplication. It shows how to setup a GET route that returns the string "Hello World" using controllers, middleware, and other framework-specific features in each case. It also includes code for basic RESTful API endpoints for a ticket ordering system including getting tickets, checkout, and validation.
"Are you developing or declining? Don't become an IT-dinosaur"Sigma Software
Tech Buzz, Project Management meetup, Warsaw, 2022
Krzysztof Rakowski and Paweł Rekowski, "Are you developing or declining? Don't become an IT-dinosaur"
Michael Smolin, "Decrypting customer's cultural code"Sigma Software
The document discusses establishing synergy with clients. It provides tips for project managers, including getting to know the client's culture and needs, balancing the needs of the client and development team, maintaining open and honest communication, and avoiding being self-centered. The overall goal is to build trust and loyalty to have a strong, proactive team through effective communication and understanding between the project manager, development team, and client.
The document outlines 10 principles for product management from the perspectives of successful founders and executives. It begins with an introduction to the author and definitions of the product manager role. It then discusses techniques for product management and lists 10 principles: 1) Know your customer 2) Be mission-driven 3) Be data-informed 4) Content over process 5) Problems before solutions 6) Deliver outcomes, not outputs 7) Say yes 8) Be positive and kind 9) Learn and share 10) Raise the bar. The document provides examples of barriers to applying each principle and recommends several books on product management.
Eleonora Budanova “BA+PM+DEV team: how to build the synergy”Sigma Software
This document discusses how a business analyst helped improve synergy between the BA, PM, and development team on a project. The business analyst first assessed the project complexity, history of changes, and their own strengths. They discovered issues like an unrefined product backlog, estimates not being met, and changes during sprints. To address this, the analyst held retrospectives and 1:1 meetings. They created requirements templates with metadata, purpose, acceptance criteria. This provided clear requirements and decreased stress and scope changes. As a result, sprints delivered fully, the team was happier, and the client said the templates met their needs and communication was consistent.
Stoyan Atanasov “How crucial is the BA role in an IT Project"Sigma Software
A Business Analyst plays a crucial role in IT projects by facilitating collaboration between stakeholders, defining business needs and processes, and delivering value to clients. As the key liaison between business and IT teams, the BA ensures efficient communication and justifies solution options. BAs at SoftServe have successfully delivered projects like a Franchise Management System by taking ownership of product backlogs and roadmaps, improving processes, and optimizing manual tasks. They also created a Revenue Management Platform that combined multiple applications into a single core platform, reducing development efforts significantly.
The document describes a hack sprint by the Sigma Software Team to develop features for a Volvo excavator monitoring application. The main features described include a modern and user-friendly design, operator authorization and access to recent diagnostic information. Additional suggested features include viewing diagnostic histories, integrating user and dealer profiles with parts databases and remote warehouses, and enabling remote technical support. The document concludes by thanking the audience and inviting any questions.
Business digitalization trends and challengesSigma Software
The document discusses trends and challenges in business digitalization. It provides an overview of digital transformation from the perspectives of systems architecture, data, and infrastructure. Key points include:
- Digital transformation requires approaches like lean thinking, agile development, and data-driven decision making.
- Major challenges are managing growing data volumes, expectations for data quality, and regulatory compliance.
- Trends involve data governance, analytics, machine learning, APIs, microservices, and hybrid cloud infrastructure solutions.
- True digital transformation is about streamlining client collaboration and shortening time to market while managing complexity and costs.
This document discusses the maturity levels of distributed project teams. It identifies several key areas that determine a team's level including communication, socialization, processes, and soft skills. For each area, it provides examples of basic, advanced, and proficient behaviors such as the formats for communication, how information is managed, how goals are set and monitored, and how decisions are made. Useful resources are also referenced at the end to help improve virtual team management.
In 18 years of learning processes facilitation, Sigma Software has delivered more than 40 solutions to fortune 500 companies, product houses, and startups. We have our own training platform Sigma Software University running 60+ training courses and consulting on e-learning. Learn how we can make training solutions and content creation more meaningful and powerful: https://bit.ly/3f4phoY
False news - false truth: tips & tricks how to avoid themSigma Software
Since the beginning of the COVID-19, the spread of information about the pandemic has been much faster than the virus itself. Facebook(link is external) labeled nearly 50 million pieces of news about COVID-19 as misinformation in April, while Twitter(link is external) marked more than 1.5 million users for spreading false information and displaying manipulative behaviors. Find out how dangerous false news is and what steps you can take to avoid them.
Анна Бойко, "Хороший контракт vs очікування клієнтів. Що вбереже вас, якщо вд...Sigma Software
This document discusses engagement models and managing customer expectations in contracts and projects. It provides examples of how fixed price, time and materials (T&M), and on-demand team (ODT) models can lead to misaligned expectations if not set up properly. It also discusses how the COVID-19 pandemic impacted many businesses and projects in 2020, with some clients pushing for contract renegotiations, discounts, or trying to terminate projects due to financial difficulties caused by the pandemic. The document emphasizes the importance of clear communication around budgets, responsibilities, and engagement terms to avoid disputes down the road.
Дмитрий Лапшин, "The importance of TEX and Internal Quality. How explain and ...Sigma Software
This document discusses quality and defect removal in software development. It makes three key points:
1) Quality comes at a cost of both time and money, and developers must balance quality with schedules and budgets. Not all defects have equal consequences.
2) Defect prevention upfront is critical and more efficient than detection and removal later. The majority of defects can be removed through requirements reviews, architectural reviews, design reviews, and code reviews.
3) Tools and processes like continuous integration, code reviews, testing, and defect tracking help contain defects and improve efficiency in defect removal. The goal is achieving high defect removal efficiency across requirements, architecture, design and code.
SOCRadar's Aviation Industry Q1 Incident Report is out now!
The aviation industry has always been a prime target for cybercriminals due to its critical infrastructure and high stakes. In the first quarter of 2024, the sector faced an alarming surge in cybersecurity threats, revealing its vulnerabilities and the relentless sophistication of cyber attackers.
SOCRadar’s Aviation Industry, Quarterly Incident Report, provides an in-depth analysis of these threats, detected and examined through our extensive monitoring of hacker forums, Telegram channels, and dark web platforms.
Artificia Intellicence and XPath Extension FunctionsOctavian Nadolu
The purpose of this presentation is to provide an overview of how you can use AI from XSLT, XQuery, Schematron, or XML Refactoring operations, the potential benefits of using AI, and some of the challenges we face.
DDS Security Version 1.2 was adopted in 2024. This revision strengthens support for long runnings systems adding new cryptographic algorithms, certificate revocation, and hardness against DoS attacks.
Microservice Teams - How the cloud changes the way we workSven Peters
A lot of technical challenges and complexity come with building a cloud-native and distributed architecture. The way we develop backend software has fundamentally changed in the last ten years. Managing a microservices architecture demands a lot of us to ensure observability and operational resiliency. But did you also change the way you run your development teams?
Sven will talk about Atlassian’s journey from a monolith to a multi-tenanted architecture and how it affected the way the engineering teams work. You will learn how we shifted to service ownership, moved to more autonomous teams (and its challenges), and established platform and enablement teams.
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeAftab Hussain
Understanding variable roles in code has been found to be helpful by students
in learning programming -- could variable roles help deep neural models in
performing coding tasks? We do an exploratory study.
- These are slides of the talk given at InteNSE'23: The 1st International Workshop on Interpretability and Robustness in Neural Software Engineering, co-located with the 45th International Conference on Software Engineering, ICSE 2023, Melbourne Australia
Software Engineering, Software Consulting, Tech Lead, Spring Boot, Spring Cloud, Spring Core, Spring JDBC, Spring Transaction, Spring MVC, OpenShift Cloud Platform, Kafka, REST, SOAP, LLD & HLD.
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j
Dr. Jesús Barrasa, Head of Solutions Architecture for EMEA, Neo4j
Découvrez les dernières innovations de Neo4j, et notamment les dernières intégrations cloud et les améliorations produits qui font de Neo4j un choix essentiel pour les développeurs qui créent des applications avec des données interconnectées et de l’IA générative.
What is Augmented Reality Image Trackingpavan998932
Augmented Reality (AR) Image Tracking is a technology that enables AR applications to recognize and track images in the real world, overlaying digital content onto them. This enhances the user's interaction with their environment by providing additional information and interactive elements directly tied to physical images.
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsPeter Muessig
The UI5 tooling is the development and build tooling of UI5. It is built in a modular and extensible way so that it can be easily extended by your needs. This session will showcase various tooling extensions which can boost your development experience by far so that you can really work offline, transpile your code in your project to use even newer versions of EcmaScript (than 2022 which is supported right now by the UI5 tooling), consume any npm package of your choice in your project, using different kind of proxies, and even stitching UI5 projects during development together to mimic your target environment.
Hand Rolled Applicative User ValidationCode KataPhilip Schwarz
Could you use a simple piece of Scala validation code (granted, a very simplistic one too!) that you can rewrite, now and again, to refresh your basic understanding of Applicative operators <*>, <*, *>?
The goal is not to write perfect code showcasing validation, but rather, to provide a small, rough-and ready exercise to reinforce your muscle-memory.
Despite its grandiose-sounding title, this deck consists of just three slides showing the Scala 3 code to be rewritten whenever the details of the operators begin to fade away.
The code is my rough and ready translation of a Haskell user-validation program found in a book called Finding Success (and Failure) in Haskell - Fall in love with applicative functors.
E-commerce Application Development Company.pdfHornet Dynamics
Your business can reach new heights with our assistance as we design solutions that are specifically appropriate for your goals and vision. Our eCommerce application solutions can digitally coordinate all retail operations processes to meet the demands of the marketplace while maintaining business continuity.
Atelier - Innover avec l’IA Générative et les graphes de connaissancesNeo4j
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Allez au-delà du battage médiatique autour de l’IA et découvrez des techniques pratiques pour utiliser l’IA de manière responsable à travers les données de votre organisation. Explorez comment utiliser les graphes de connaissances pour augmenter la précision, la transparence et la capacité d’explication dans les systèmes d’IA générative. Vous partirez avec une expérience pratique combinant les relations entre les données et les LLM pour apporter du contexte spécifique à votre domaine et améliorer votre raisonnement.
Amenez votre ordinateur portable et nous vous guiderons sur la mise en place de votre propre pile d’IA générative, en vous fournissant des exemples pratiques et codés pour démarrer en quelques minutes.
Zoom is a comprehensive platform designed to connect individuals and teams efficiently. With its user-friendly interface and powerful features, Zoom has become a go-to solution for virtual communication and collaboration. It offers a range of tools, including virtual meetings, team chat, VoIP phone systems, online whiteboards, and AI companions, to streamline workflows and enhance productivity.
Most important New features of Oracle 23c for DBAs and Developers. You can get more idea from my youtube channel video from https://youtu.be/XvL5WtaC20A
WhatsApp offers simple, reliable, and private messaging and calling services for free worldwide. With end-to-end encryption, your personal messages and calls are secure, ensuring only you and the recipient can access them. Enjoy voice and video calls to stay connected with loved ones or colleagues. Express yourself using stickers, GIFs, or by sharing moments on Status. WhatsApp Business enables global customer outreach, facilitating sales growth and relationship building through showcasing products and services. Stay connected effortlessly with group chats for planning outings with friends or staying updated on family conversations.
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...Crescat
Crescat is industry-trusted event management software, built by event professionals for event professionals. Founded in 2017, we have three key products tailored for the live event industry.
Crescat Event for concert promoters and event agencies. Crescat Venue for music venues, conference centers, wedding venues, concert halls and more. And Crescat Festival for festivals, conferences and complex events.
With a wide range of popular features such as event scheduling, shift management, volunteer and crew coordination, artist booking and much more, Crescat is designed for customisation and ease-of-use.
Over 125,000 events have been planned in Crescat and with hundreds of customers of all shapes and sizes, from boutique event agencies through to international concert promoters, Crescat is rigged for success. What's more, we highly value feedback from our users and we are constantly improving our software with updates, new features and improvements.
If you plan events, run a venue or produce festivals and you're looking for ways to make your life easier, then we have a solution for you. Try our software for free or schedule a no-obligation demo with one of our product specialists today at crescat.io
3. What is Econometrics?
• Econometrics is the application of statistical methods to economic data; it is the branch of economics that aims to give empirical content to economic relations
• Basic tools:
• linear regression models
• statistical theory
8. The main goals
• To find estimators that have desirable statistical properties:
• unbiasedness
• efficiency
• consistency
• Applied for:
• assessing economic theories
• forecasting macroeconomic indexes
• predicting revenue
• estimating the impact of economic changes (e.g., a government policy)
Interpretability and statistical robustness
24. Basic approach
Linear regression
• Assumptions on the residuals:
1. Mean equals 0
2. Variance is constant (homoscedasticity)
3. Independent residuals (covariance equals 0)
4. Independence of residuals and regressors
5. Residuals are normally distributed
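The first assumption can be checked directly on a fitted model. A minimal NumPy-only sketch, with purely illustrative data (the variable names and parameters below are assumptions, not from the slides):

```python
import numpy as np

# Illustrative data: y depends linearly on x plus i.i.d. normal noise
rng = np.random.default_rng(0)
x = rng.uniform(0.0, 10.0, size=200)
y = 2.0 + 0.5 * x + rng.normal(0.0, 1.0, size=200)

# OLS fit via least squares on the design matrix [1, x]
X = np.column_stack([np.ones_like(x), x])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
residuals = y - X @ beta

# Assumption 1: with an intercept, OLS residuals have mean zero by construction
print(abs(residuals.mean()) < 1e-10)
```

The zero-mean property holds mechanically whenever the model includes an intercept; the remaining assumptions (constant variance, independence, normality) require dedicated tests.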
27. Gauss-Markov theorem
If assumptions 1-4 are satisfied for a simple linear regression, then the variance of the OLS estimates is the smallest among all linear unbiased estimates (OLS is BLUE); normality of the residuals is not required for this result
28. Hypothesis testing
• If the assumption of residual normality is satisfied, hypotheses are tested against critical values of the Fisher (F) distribution
• If it is not satisfied, asymptotic chi-square tests are used
31. Hypotheses
• The adequacy of the model (H0: R^2 = 0)
• The significance of the correlation between the variables (H0: rxy = 0)
• The significance of the regression coefficients (H0: B = 0)
• Multicollinearity (VIF, Farrar-Glauber test)
• Check of the functional form (Ramsey RESET test)
38. Heteroskedasticity
• Violates assumption 2 (constant residual variance)
• Identification: Goldfeld-Quandt test, White test, Breusch-Pagan test, Glejser test
• Remedy: weighted least squares
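The weighted least squares remedy can be sketched in a few lines of NumPy. The data-generating process and the weights below are illustrative assumptions (in practice the error variances must themselves be estimated):

```python
import numpy as np

# Illustrative heteroskedastic data: noise standard deviation grows with x
rng = np.random.default_rng(1)
x = rng.uniform(1.0, 10.0, size=300)
y = 1.0 + 2.0 * x + rng.normal(0.0, 0.5 * x)

X = np.column_stack([np.ones_like(x), x])

# WLS: weight each observation by the inverse of its (here, known) error variance,
# equivalent to scaling rows of X and y by sqrt(w) and running ordinary OLS
w = 1.0 / (0.5 * x) ** 2
sw = np.sqrt(w)
beta_wls, *_ = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)
```

Down-weighting the noisy observations restores the efficiency that plain OLS loses under heteroskedasticity; the coefficient estimates stay unbiased either way.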
51. Time Series Patterns
Trend
• A trend exists when there is a long-term increase or decrease in the data. It does not have to be linear. Sometimes we will refer to a trend "changing direction" when it might go from an increasing trend to a decreasing trend.
Seasonal
• A seasonal pattern exists when a series is influenced by seasonal factors (e.g., the quarter of the year, the month, or day of the week). Seasonality is always of a fixed and known period.
Cyclic
• A cyclic pattern exists when data exhibit rises and falls that are not of fixed period. The duration of these fluctuations is usually at least 2 years.
57. Time series decomposition
• additive model: yt = St + Tt + Et
• multiplicative model: yt = St × Tt × Et
St – seasonal component, Tt – trend-cycle component, Et – remainder
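A rough additive decomposition along these lines can be sketched in NumPy. Classical decomposition of monthly data normally uses a centered 2×12 moving average; the plain moving average and the synthetic series here are simplifying assumptions for illustration:

```python
import numpy as np

# Synthetic monthly series: linear trend + sinusoidal seasonality + noise
rng = np.random.default_rng(2)
period = 12
t = np.arange(8 * period)
y = 0.3 * t + 5.0 * np.sin(2 * np.pi * t / period) + rng.normal(0.0, 0.5, t.size)

# Trend-cycle T_t: moving average over one full period (simplified, not centered 2x12)
trend = np.convolve(y, np.ones(period) / period, mode="same")

# Seasonal S_t: mean detrended value at each position within the period,
# normalized so the seasonal effects sum to zero over one period
detrended = y - trend
seasonal = np.array([detrended[i::period].mean() for i in range(period)])
seasonal -= seasonal.mean()

# Remainder E_t = y_t - T_t - S_t
remainder = y - trend - np.tile(seasonal, t.size // period)
```

For a multiplicative model the same steps apply with division in place of subtraction, or one can log-transform the series and decompose additively.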
66. Stationarity and differencing
• Stationary data:
• mean is a constant
• variance is a constant
• covariance is not a function of time
• Tests:
• Augmented Dickey-Fuller (null hypothesis: a unit root is present)
• KPSS (null hypothesis: the series is trend stationary)
• Making data stationary:
• log-transformation
• differencing
• log-transformation followed by differencing
The problem is that the absence of a unit root is not a proof of stationarity
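Differencing can be illustrated with a simulated random walk (the series below is an assumption for demonstration, not data from the slides):

```python
import numpy as np

# A random walk is non-stationary (its variance grows with t);
# its first difference is the stationary white-noise series of shocks
rng = np.random.default_rng(3)
steps = rng.normal(0.0, 1.0, size=1000)
walk = steps.cumsum()        # random walk: cumulative sum of shocks
diffed = np.diff(walk)       # first difference recovers the stationary shocks
```

By construction, differencing the cumulative sum returns exactly the original shocks, which is why one round of differencing removes a unit root.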
79. Autoregressive models
• Forecast the variable of interest using a linear combination of past values of the variable:
yt = c + φ1yt−1 + φ2yt−2 + … + φpyt−p + et
• For an AR(1) model:
• When φ1 = 0, yt is equivalent to white noise
• When φ1 = 1 and c = 0, yt is equivalent to a random walk
• When φ1 = 1 and c ≠ 0, yt is equivalent to a random walk with drift
• When φ1 < 0, yt tends to oscillate between positive and negative values
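The AR(1) special cases above can be checked by simulation. The function name and parameter values are illustrative assumptions:

```python
import numpy as np

def simulate_ar1(c, phi, n, seed=0):
    """Simulate y_t = c + phi * y_{t-1} + e_t with standard normal shocks."""
    rng = np.random.default_rng(seed)
    e = rng.normal(0.0, 1.0, size=n)
    y = np.zeros(n)
    for t in range(1, n):
        y[t] = c + phi * y[t - 1] + e[t]
    return y

noise = simulate_ar1(c=0.0, phi=0.0, n=500)    # φ1 = 0: white noise
walk = simulate_ar1(c=0.0, phi=1.0, n=500)     # φ1 = 1, c = 0: random walk
drift = simulate_ar1(c=0.5, phi=1.0, n=500)    # φ1 = 1, c ≠ 0: random walk with drift
flip = simulate_ar1(c=0.0, phi=-0.8, n=500)    # φ1 < 0: oscillating sign
```

The lag-1 autocorrelation of the simulated series mirrors φ1: near zero for white noise and strongly negative for the oscillating case.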
85. Moving average models
• Rather than using past values of the forecast variable in a regression, a moving average model uses past forecast errors in a regression-like model:
yt = c + et + θ1et−1 + θ2et−2 + … + θqet−q
87. ARIMA
• Combining differencing with autoregression and a moving average model, we obtain
a non-seasonal ARIMA model (y′t is the differenced series):
y′t = c + φ1y′t−1 + … + φpy′t−p + θ1et−1 + … + θqet−q + et
• Information Criteria
• Akaike’s Information Criterion (AIC)
AIC = −2 log(L) + 2(p + q + k + 1)
• Bayesian Information Criterion (BIC)
BIC = AIC + (log(T) − 2)(p + q + k + 1)
• corrected Akaike’s Information Criterion (AICc)
AICc = AIC + 2(p + q + k + 1)(p + q + k + 2) / (T − p − q − k − 2)
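The three criteria differ only in the penalty applied to the same log-likelihood. A direct transcription of the formulas (here k = 1 when the constant c is included, else 0; the example log-likelihood value is made up):

```python
import math

def information_criteria(log_lik, p, q, k, T):
    """AIC, BIC and AICc for an ARIMA(p, d, q) fit with T observations."""
    m = p + q + k + 1                       # number of estimated parameters
    aic = -2 * log_lik + 2 * m
    bic = aic + (math.log(T) - 2) * m       # note: log(T), not log(L)
    aicc = aic + 2 * m * (m + 1) / (T - m - 1)
    return aic, bic, aicc

aic, bic, aicc = information_criteria(log_lik=-120.0, p=2, q=1, k=1, T=100)
# aic = 250.0; bic and aicc add their respective penalties on top
```

Since T − p − q − k − 2 = T − m − 1, the AICc correction simplifies to 2m(m + 1)/(T − m − 1); the model with the lowest criterion value is preferred.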
94. Other econometric models
• SARIMA – ARIMA with a seasonal component
• ARFIMA – ARIMA allowing non-integer values of the differencing
parameter
• VAR – vector autoregression (predicts multiple target
variables and learns the dynamic relationships between them)
• ARCH (GARCH) – assumes heteroskedasticity and models the mean
and the variance separately
• Hierarchical time series (forecast at both high and low levels of aggregation)
Forecasting: Principles and Practice (Rob J Hyndman)
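The "mean and variance modeled separately" idea behind ARCH can be sketched in a few lines. A toy ARCH(1) simulation, with a0 and a1 picked arbitrarily (a1 < 1 for stationarity):

```python
import numpy as np

rng = np.random.default_rng(2)

# ARCH(1): conditional variance sigma_t^2 = a0 + a1 * e_{t-1}^2
a0, a1 = 0.2, 0.5
T = 500
e = np.zeros(T)
sigma2 = np.full(T, a0 / (1 - a1))  # start at the unconditional variance
for t in range(1, T):
    sigma2[t] = a0 + a1 * e[t - 1] ** 2
    e[t] = rng.normal() * np.sqrt(sigma2[t])
```

A large error raises the next period's variance, producing the volatility clustering typical of financial returns.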
106. Why are econometric models bad?
• mostly linear and can’t catch non-linear dependencies
• take a long time to optimize and train
• accuracy is not very good
• the same features could be generated manually and used
with more complex models
112. Neural Networks
• RNNs are a hyped topic
• Sequence-to-Sequence modeling
• Experiment with the number of lag features and with adding “external” data
• Work with multiple time series with a long history
• Usually work worse than linear or boosting models
[Diagram: two LSTM blocks in a sequence-to-sequence setup (inputs .. 3 2 1, outputs 1 2 3 ..), followed by a Dense layer that also takes external features]
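"Experimenting with the number of lag features" starts with turning the series into a supervised matrix. A minimal sketch (the helper name `lag_matrix` is mine, not from the slides):

```python
import numpy as np

def lag_matrix(y, n_lags):
    """Row t holds [y_{t-1}, ..., y_{t-n_lags}] as features and y_t as target."""
    y = np.asarray(y, dtype=float)
    X = np.column_stack(
        [y[n_lags - lag: len(y) - lag] for lag in range(1, n_lags + 1)]
    )
    target = y[n_lags:]
    return X, target

X, t = lag_matrix([1, 2, 3, 4, 5], n_lags=2)
# X = [[2, 1], [3, 2], [4, 3]], t = [3, 4, 5]
```

The same matrix feeds a linear model, boosting, or (reshaped to 3-D) an LSTM; external features are simply appended as extra columns.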
117. Stacking
[Diagram: four time-ordered folds, each split into a Train and a Test part: Fold 1, Fold 2, Fold 3, Fold 4]
Step 1
• train linear regression and optimize parameters on Fold 1
Step 2
• predict train and test on Fold 2. Use the predictions as a new feature and apply boosting
Step 3
• train linear regression and optimize parameters on Fold 2
Step 4
• predict train and test on Fold 3. Use the predictions as a new feature and apply boosting
…
Last step
• validate the final results on Fold 2, Fold 3 and Fold 4
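The first two steps of this fold-wise stacking can be sketched in pure NumPy. To stay self-contained, ordinary least squares stands in for both the first-level linear regression and the second-level boosting model; the data and fold sizes are invented:

```python
import numpy as np

rng = np.random.default_rng(3)

def fit(X, y):  # OLS with an intercept column
    return np.linalg.lstsq(np.column_stack([np.ones(len(X)), X]), y, rcond=None)[0]

def predict(w, X):
    return np.column_stack([np.ones(len(X)), X]) @ w

X = rng.normal(size=(80, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(scale=0.1, size=80)
folds = [(X[i:i + 20], y[i:i + 20]) for i in range(0, 80, 20)]

# Step 1: train the first-level model on Fold 1
w1 = fit(*folds[0])
# Step 2: its predictions on Fold 2 become a new feature for the second level
X2, y2 = folds[1]
w2 = fit(np.column_stack([X2, predict(w1, X2)]), y2)
# Last step: validate the stacked model on a later fold (here Fold 3)
X3, y3 = folds[2]
pred = predict(w2, np.column_stack([X3, predict(w1, X3)]))
rmse = np.sqrt(np.mean((pred - y3) ** 2))
```

The key property preserved here is temporal order: each level only ever predicts folds that come after the data it was trained on, so no future information leaks into the features.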
124. Interpretation
• Linear regression – the easiest way to interpret features
• MARS (Earth) – a flexible regression method that automatically searches for interactions and non-linear
relationships
• parameters: number of interactions, regularization, smoothing, etc.
• ELI5 – supports the LightGBM and XGBoost sklearn APIs
• shows feature weights and explains individual predictions
• as it is a linear approximation, <BIAS> is usually large (misleading)
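The "feature weights" idea can also be computed model-agnostically via permutation importance: shuffle one column and measure how much the error grows. A sketch on invented data (a linear model stands in for whatever model is being explained):

```python
import numpy as np

rng = np.random.default_rng(4)

# Toy data: y depends strongly on column 0, weakly on column 1, not on column 2.
X = rng.normal(size=(300, 3))
y = 3.0 * X[:, 0] + 0.3 * X[:, 1] + rng.normal(scale=0.1, size=300)

w = np.linalg.lstsq(X, y, rcond=None)[0]  # the model to be explained

def permutation_importance(X, y, w, col, rng):
    """MSE increase when one feature is shuffled."""
    base = np.mean((X @ w - y) ** 2)
    Xp = X.copy()
    Xp[:, col] = rng.permutation(Xp[:, col])
    return np.mean((Xp @ w - y) ** 2) - base

importances = [permutation_importance(X, y, w, c, rng) for c in range(3)]
```

Column 0 should dominate and column 2 should be near zero, matching the data-generating weights.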
131. Summary
• Choose the metric and validation approach based on the business task
• Do basic EDA and start with a log-transformation
• Create a simple baseline
• Generate default features
• Try linear and boosting models (and RNNs)
• Add more features
• Don’t forget about ensembles and stacking :)
• Check feature weights and prediction explanations
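Two of the checklist items, the log-transformation and the simple baseline, fit in a few lines. A sketch with invented numbers (log1p/expm1 are used so zeros survive the round trip):

```python
import numpy as np

sales = np.array([0.0, 10.0, 120.0, 1500.0])

# Log-transform before modeling, invert after predicting.
log_sales = np.log1p(sales)      # model/forecast in log space
recovered = np.expm1(log_sales)  # back-transform predictions

# A naive baseline: repeat the last observed value over the horizon.
baseline_forecast = np.repeat(sales[-1], 3)
```

Any model that cannot beat such a baseline on the chosen metric is not worth deploying, which is why the baseline comes before feature engineering in the list above.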