SlideShare a Scribd company logo
1 of 7
Download to read offline
Arthur Charpentier, Econometrics & Machine Learning - 2017
Econometrics & Machine Learning
A. Charpentier, E. Flachaire & A. Ly
https://arxiv.org/abs/1708.06992
@freakonometrics freakonometrics freakonometrics.hypotheses.org 1
Arthur Charpentier, Econometrics & Machine Learning - 2017
Probabilistic Foundations of Econometrics
see Haavelmo (1944)
see Duo (1997)
@freakonometrics freakonometrics freakonometrics.hypotheses.org 2
Arthur Charpentier, Econometrics & Machine Learning - 2017
Probabilistic Foundations of Econometrics
There is a probabilistic space (Ω, F, P) such that data {yi, xi} can be seen as
realizations of random variables {Yi, Xi}.
Consider a conditional distribution for Y |X, e.g.
(Y |X = x)
L
∼ N(µ(x), σ2
) where µ(x) = β0 + xT
β, and β ∈ Rp
.
for the linear model, with some extension when it is in the exponential family
(GLM).
Then use maximum likelihood for inference, to derive confidence interval
(quantification of uncertainty), etc.
Importance of unbiased estimators (Cramer Rao lower bound for the variance).
@freakonometrics freakonometrics freakonometrics.hypotheses.org 3
Arthur Charpentier, Econometrics & Machine Learning - 2017
Loss Functions
Gneiting in a statistical context: a statistics T is said to be ellicitable if there is
: R × R → R+ such that
T(Y ) = argmin
x∈R R
l(x, y)dF(y) = argmin
x∈R
E l(x, Y ) where Y
L
∼ F
(e.g. T(x) = x and = 2, or T(x) = median(x) and = 1).
In machine-learning, we want to solve
m (x) = argmin
m∈M
n
i=1
(yi, m(x))
using optimization techniques, in a not-too-complex space M.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 4
Arthur Charpentier, Econometrics & Machine Learning - 2017
Overfit ?
Underfit: true yi = β0 + xT
1 β1 + xT
2 β2 + εi vs. fitted model yi = b0 + xT
1 b1 + ηi.
b1 = (XT
1 X1)−1
XT
1 y = β1 + (X1X1)−1
XT
1 X2β2
β12
+ (XT
1 X1)−1
XT
1 ε
νi
i.e. E[b1] = β1 + β12 = β1, unless X1 ⊥ X2 (see Frish-Waugh Theorem).
Overfit: true yi = β0 + xT
1 β1 + εi vs. fitted model yi = b0 + xT
1 b1 + xT
2 b2 + ηi.
In that case E[b1] = β1 but no-longer efficient.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 5
Arthur Charpentier, Econometrics & Machine Learning - 2017
Occam’s Razor and Parsimony
Importance of penalty in order to avoid a too complex model (overfit).
Consider some linear predictor, y = Hy where H = X(XT
X)−1
XT
y, or more
generally y = Sy for some smoothing matrix S.
In econometrics, consider some ex-post penalty for model selection,
AIC = −2 log L + 2p = deviance + 2p
where p is the dimension (i.e. trace(S) in a more general setting).
In machine-learning, penalty is added in the objective function, see Ridge or
LASSO regression
(β0,λ, βλ) = argmin
(β0,β)
n
i=1
(yi, β0 + xiTβ) + λ β
Goodhart’s law, “when a measure becomes a target, it ceases to be a good
measure”
@freakonometrics freakonometrics freakonometrics.hypotheses.org 6
Arthur Charpentier, Econometrics & Machine Learning - 2017
Boosting, or learning from previous errors
Construct a sequence of models
m(k)
(x) = m(k−1)
(x) + α · f (x)
where
f = argmin
f∈W
n
i=1
(yi − m(k−1)
(xi), f(xi))
for some set of weak learner W.
Problem: where to stop to avoid overfit...
@freakonometrics freakonometrics freakonometrics.hypotheses.org 7
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
qq
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
qq
q
qq
q
q
q
q
q
q
q
q
0 2 4 6 8 10
−2−101

More Related Content

What's hot

What's hot (20)

Sildes buenos aires
Sildes buenos airesSildes buenos aires
Sildes buenos aires
 
Slides econometrics-2018-graduate-3
Slides econometrics-2018-graduate-3Slides econometrics-2018-graduate-3
Slides econometrics-2018-graduate-3
 
Predictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big dataPredictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big data
 
Side 2019 #6
Side 2019 #6Side 2019 #6
Side 2019 #6
 
Slides econometrics-2018-graduate-4
Slides econometrics-2018-graduate-4Slides econometrics-2018-graduate-4
Slides econometrics-2018-graduate-4
 
Side 2019 #9
Side 2019 #9Side 2019 #9
Side 2019 #9
 
Hands-On Algorithms for Predictive Modeling
Hands-On Algorithms for Predictive ModelingHands-On Algorithms for Predictive Modeling
Hands-On Algorithms for Predictive Modeling
 
Sufficient statistics
Sufficient statisticsSufficient statistics
Sufficient statistics
 
Slides econometrics-2018-graduate-2
Slides econometrics-2018-graduate-2Slides econometrics-2018-graduate-2
Slides econometrics-2018-graduate-2
 
Side 2019 #5
Side 2019 #5Side 2019 #5
Side 2019 #5
 
Slides econometrics-2018-graduate-1
Slides econometrics-2018-graduate-1Slides econometrics-2018-graduate-1
Slides econometrics-2018-graduate-1
 
Truth, deduction, computation lecture i (last one)
Truth, deduction, computation   lecture i (last one)Truth, deduction, computation   lecture i (last one)
Truth, deduction, computation lecture i (last one)
 
Varese italie #2
Varese italie #2Varese italie #2
Varese italie #2
 
Econ. Seminar Uqam
Econ. Seminar UqamEcon. Seminar Uqam
Econ. Seminar Uqam
 
Slides ub-1
Slides ub-1Slides ub-1
Slides ub-1
 
Predicates and quantifiers presentation topics
Predicates  and quantifiers  presentation topicsPredicates  and quantifiers  presentation topics
Predicates and quantifiers presentation topics
 
transformations and nonparametric inference
transformations and nonparametric inferencetransformations and nonparametric inference
transformations and nonparametric inference
 
Rencontres Mutualistes
Rencontres MutualistesRencontres Mutualistes
Rencontres Mutualistes
 
Slides ub-3
Slides ub-3Slides ub-3
Slides ub-3
 
Slides ub-2
Slides ub-2Slides ub-2
Slides ub-2
 

Similar to Slides econ-lm

Slides econometrics-2017-graduate-2
Slides econometrics-2017-graduate-2Slides econometrics-2017-graduate-2
Slides econometrics-2017-graduate-2Arthur Charpentier
 
Approximation Methods Of Solutions For Equilibrium Problem In Hilbert Spaces
Approximation Methods Of Solutions For Equilibrium Problem In Hilbert SpacesApproximation Methods Of Solutions For Equilibrium Problem In Hilbert Spaces
Approximation Methods Of Solutions For Equilibrium Problem In Hilbert SpacesLisa Garcia
 
Existence Theory for Second Order Nonlinear Functional Random Differential Eq...
Existence Theory for Second Order Nonlinear Functional Random Differential Eq...Existence Theory for Second Order Nonlinear Functional Random Differential Eq...
Existence Theory for Second Order Nonlinear Functional Random Differential Eq...IOSR Journals
 
Slides barcelona Machine Learning
Slides barcelona Machine LearningSlides barcelona Machine Learning
Slides barcelona Machine LearningArthur Charpentier
 
Research internship on optimal stochastic theory with financial application u...
Research internship on optimal stochastic theory with financial application u...Research internship on optimal stochastic theory with financial application u...
Research internship on optimal stochastic theory with financial application u...Asma Ben Slimene
 
Presentation on stochastic control problem with financial applications (Merto...
Presentation on stochastic control problem with financial applications (Merto...Presentation on stochastic control problem with financial applications (Merto...
Presentation on stochastic control problem with financial applications (Merto...Asma Ben Slimene
 
Influence of the sampling on Functional Data Analysis
Influence of the sampling on Functional Data AnalysisInfluence of the sampling on Functional Data Analysis
Influence of the sampling on Functional Data Analysistuxette
 
Statistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: ModelsStatistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: ModelsChristian Robert
 
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRAa - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRAijfls
 
α -ANTI FUZZY NEW IDEAL OF PUALGEBRA
α -ANTI FUZZY NEW IDEAL OF PUALGEBRAα -ANTI FUZZY NEW IDEAL OF PUALGEBRA
α -ANTI FUZZY NEW IDEAL OF PUALGEBRAWireilla
 
Graduate Econometrics Course, part 4, 2017
Graduate Econometrics Course, part 4, 2017Graduate Econometrics Course, part 4, 2017
Graduate Econometrics Course, part 4, 2017Arthur Charpentier
 

Similar to Slides econ-lm (20)

Varese italie seminar
Varese italie seminarVarese italie seminar
Varese italie seminar
 
Slides ACTINFO 2016
Slides ACTINFO 2016Slides ACTINFO 2016
Slides ACTINFO 2016
 
Slides Bank England
Slides Bank EnglandSlides Bank England
Slides Bank England
 
Slides econometrics-2017-graduate-2
Slides econometrics-2017-graduate-2Slides econometrics-2017-graduate-2
Slides econometrics-2017-graduate-2
 
Proba stats-r1-2017
Proba stats-r1-2017Proba stats-r1-2017
Proba stats-r1-2017
 
Classification
ClassificationClassification
Classification
 
Approximation Methods Of Solutions For Equilibrium Problem In Hilbert Spaces
Approximation Methods Of Solutions For Equilibrium Problem In Hilbert SpacesApproximation Methods Of Solutions For Equilibrium Problem In Hilbert Spaces
Approximation Methods Of Solutions For Equilibrium Problem In Hilbert Spaces
 
Existence Theory for Second Order Nonlinear Functional Random Differential Eq...
Existence Theory for Second Order Nonlinear Functional Random Differential Eq...Existence Theory for Second Order Nonlinear Functional Random Differential Eq...
Existence Theory for Second Order Nonlinear Functional Random Differential Eq...
 
Slides ensae-2016-8
Slides ensae-2016-8Slides ensae-2016-8
Slides ensae-2016-8
 
Slides barcelona Machine Learning
Slides barcelona Machine LearningSlides barcelona Machine Learning
Slides barcelona Machine Learning
 
Research internship on optimal stochastic theory with financial application u...
Research internship on optimal stochastic theory with financial application u...Research internship on optimal stochastic theory with financial application u...
Research internship on optimal stochastic theory with financial application u...
 
Presentation on stochastic control problem with financial applications (Merto...
Presentation on stochastic control problem with financial applications (Merto...Presentation on stochastic control problem with financial applications (Merto...
Presentation on stochastic control problem with financial applications (Merto...
 
Slides ensae 8
Slides ensae 8Slides ensae 8
Slides ensae 8
 
Slides erasmus
Slides erasmusSlides erasmus
Slides erasmus
 
Influence of the sampling on Functional Data Analysis
Influence of the sampling on Functional Data AnalysisInfluence of the sampling on Functional Data Analysis
Influence of the sampling on Functional Data Analysis
 
Statistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: ModelsStatistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: Models
 
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRAa - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
 
α -ANTI FUZZY NEW IDEAL OF PUALGEBRA
α -ANTI FUZZY NEW IDEAL OF PUALGEBRAα -ANTI FUZZY NEW IDEAL OF PUALGEBRA
α -ANTI FUZZY NEW IDEAL OF PUALGEBRA
 
Econometrics 2017-graduate-3
Econometrics 2017-graduate-3Econometrics 2017-graduate-3
Econometrics 2017-graduate-3
 
Graduate Econometrics Course, part 4, 2017
Graduate Econometrics Course, part 4, 2017Graduate Econometrics Course, part 4, 2017
Graduate Econometrics Course, part 4, 2017
 

More from Arthur Charpentier (20)

Family History and Life Insurance
Family History and Life InsuranceFamily History and Life Insurance
Family History and Life Insurance
 
ACT6100 introduction
ACT6100 introductionACT6100 introduction
ACT6100 introduction
 
Family History and Life Insurance (UConn actuarial seminar)
Family History and Life Insurance (UConn actuarial seminar)Family History and Life Insurance (UConn actuarial seminar)
Family History and Life Insurance (UConn actuarial seminar)
 
Control epidemics
Control epidemics Control epidemics
Control epidemics
 
STT5100 Automne 2020, introduction
STT5100 Automne 2020, introductionSTT5100 Automne 2020, introduction
STT5100 Automne 2020, introduction
 
Family History and Life Insurance
Family History and Life InsuranceFamily History and Life Insurance
Family History and Life Insurance
 
Machine Learning in Actuarial Science & Insurance
Machine Learning in Actuarial Science & InsuranceMachine Learning in Actuarial Science & Insurance
Machine Learning in Actuarial Science & Insurance
 
Optimal Control and COVID-19
Optimal Control and COVID-19Optimal Control and COVID-19
Optimal Control and COVID-19
 
Slides OICA 2020
Slides OICA 2020Slides OICA 2020
Slides OICA 2020
 
Lausanne 2019 #3
Lausanne 2019 #3Lausanne 2019 #3
Lausanne 2019 #3
 
Lausanne 2019 #4
Lausanne 2019 #4Lausanne 2019 #4
Lausanne 2019 #4
 
Lausanne 2019 #2
Lausanne 2019 #2Lausanne 2019 #2
Lausanne 2019 #2
 
Lausanne 2019 #1
Lausanne 2019 #1Lausanne 2019 #1
Lausanne 2019 #1
 
Side 2019 #11
Side 2019 #11Side 2019 #11
Side 2019 #11
 
Side 2019 #12
Side 2019 #12Side 2019 #12
Side 2019 #12
 
Side 2019 #8
Side 2019 #8Side 2019 #8
Side 2019 #8
 
Side 2019 #7
Side 2019 #7Side 2019 #7
Side 2019 #7
 
Side 2019 #3
Side 2019 #3Side 2019 #3
Side 2019 #3
 
Pareto Models, Slides EQUINEQ
Pareto Models, Slides EQUINEQPareto Models, Slides EQUINEQ
Pareto Models, Slides EQUINEQ
 
Mutualisation et Segmentation
Mutualisation et SegmentationMutualisation et Segmentation
Mutualisation et Segmentation
 

Recently uploaded

Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 

Recently uploaded (20)

Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 

Slides econ-lm

  • 1. Arthur Charpentier, Econometrics & Machine Learning - 2017 Econometrics & Machine Learning A. Charpentier, E. Flachaire & A. Ly https://arxiv.org/abs/1708.06992 @freakonometrics freakonometrics freakonometrics.hypotheses.org 1
  • 2. Arthur Charpentier, Econometrics & Machine Learning - 2017 Probabilistic Foundations of Econometrics see Haavelmo (1944) see Duo (1997) @freakonometrics freakonometrics freakonometrics.hypotheses.org 2
  • 3. Arthur Charpentier, Econometrics & Machine Learning - 2017 Probabilistic Foundations of Econometrics There is a probabilistic space (Ω, F, P) such that data {yi, xi} can be seen as realizations of random variables {Yi, Xi}. Consider a conditional distribution for Y |X, e.g. (Y |X = x) L ∼ N(µ(x), σ2 ) where µ(x) = β0 + xT β, and β ∈ Rp . for the linear model, with some extension when it is in the exponential family (GLM). Then use maximum likelihood for inference, to derive confidence interval (quantification of uncertainty), etc. Importance of unbiased estimators (Cramer Rao lower bound for the variance). @freakonometrics freakonometrics freakonometrics.hypotheses.org 3
  • 4. Arthur Charpentier, Econometrics & Machine Learning - 2017 Loss Functions Gneiting in a statistical context: a statistics T is said to be ellicitable if there is : R × R → R+ such that T(Y ) = argmin x∈R R l(x, y)dF(y) = argmin x∈R E l(x, Y ) where Y L ∼ F (e.g. T(x) = x and = 2, or T(x) = median(x) and = 1). In machine-learning, we want to solve m (x) = argmin m∈M n i=1 (yi, m(x)) using optimization techniques, in a not-too-complex space M. @freakonometrics freakonometrics freakonometrics.hypotheses.org 4
  • 5. Arthur Charpentier, Econometrics & Machine Learning - 2017 Overfit ? Underfit: true yi = β0 + xT 1 β1 + xT 2 β2 + εi vs. fitted model yi = b0 + xT 1 b1 + ηi. b1 = (XT 1 X1)−1 XT 1 y = β1 + (X1X1)−1 XT 1 X2β2 β12 + (XT 1 X1)−1 XT 1 ε νi i.e. E[b1] = β1 + β12 = β1, unless X1 ⊥ X2 (see Frish-Waugh Theorem). Overfit: true yi = β0 + xT 1 β1 + εi vs. fitted model yi = b0 + xT 1 b1 + xT 2 b2 + ηi. In that case E[b1] = β1 but no-longer efficient. @freakonometrics freakonometrics freakonometrics.hypotheses.org 5
  • 6. Arthur Charpentier, Econometrics & Machine Learning - 2017 Occam’s Razor and Parsimony Importance of penalty in order to avoid a too complex model (overfit). Consider some linear predictor, y = Hy where H = X(XT X)−1 XT y, or more generally y = Sy for some smoothing matrix S. In econometrics, consider some ex-post penalty for model selection, AIC = −2 log L + 2p = deviance + 2p where p is the dimension (i.e. trace(S) in a more general setting). In machine-learning, penalty is added in the objective function, see Ridge or LASSO regression (β0,λ, βλ) = argmin (β0,β) n i=1 (yi, β0 + xiTβ) + λ β Goodhart’s law, “when a measure becomes a target, it ceases to be a good measure” @freakonometrics freakonometrics freakonometrics.hypotheses.org 6
  • 7. Arthur Charpentier, Econometrics & Machine Learning - 2017 Boosting, or learning from previous errors Construct a sequence of models m(k) (x) = m(k−1) (x) + α · f (x) where f = argmin f∈W n i=1 (yi − m(k−1) (xi), f(xi)) for some set of weak learner W. Problem: where to stop to avoid overfit... @freakonometrics freakonometrics freakonometrics.hypotheses.org 7 q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q qq q q q qq qq qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q qq q qq q q q q q q q q 0 2 4 6 8 10 −2−101