SlideShare a Scribd company logo
Adapting Neural Networks for
the Estimation of Treatment
Effects
Authors: Claudia Shi, David M. Blei, and
Victor Veitch
Presented by Viswanath Gangavaram
Estimation of Average Treatment Effect
Where Q_hat is an estimate of the conditional outcome Q(t,x) = E[ Y | x, t]
● Conditional Outcome Model(COM)
○ S-Learner( Single Model Approach)
■ Q(t,x) = E[ Y | x, t]
○ Advantages:
■ Given it’s a single model, it’s efficient with data when compare to two model
approaches
○ Disadvantages:
■ Estimated ATE is biased towards zero when actual ATE is small
Various Estimation Techniques
● Grouped Conditional Outcome Model(GCOM)
○ T-Learner( TWo Model Approach)
■ T(X) = E[Y | T=1, X ]
■ C(X) = E[Y | T=0, X ]
○ Advantages:
■ Can model small ATEs
■ Not so good data efficient estimator
○ Disadvantages:
■ Not good in handling local sparsity
Various Estimation Techniques
● X-Learner
○ T-Learner( Two Model & Two Stage Approach)
■ T(X) = E[Y | T=1, X ]
■ C(X) = E[Y | T=0, X ]
■ DT(X) = E[Y - C(X) | X] on train data
■ DC(X) = E[T(X) - Y | X] on test data
■ Causal effect = DC(X) * g(X) + DT(X) * (1-g(X))
● Note: g(X) is P(T=1|X)
○ Advantages:
■ Like T-learner can model small ATE & no so data efficient estimators
○ Disadvantages:
■ Can handle local sparsity
Various Estimation Techniques
Various Estimation Techniques
● TARNET
○ Jointly learn T(X) and C(X) with intermediate shared representation
○ Somewhat data efficient estimator
● Generally, estimation proceeds in two stages. First, we fit models for the
expected outcome and the probability of treatment for each unit. Second,
we add this fitted models in a downstream estimator of causal effect.
● Neural Networks are great choice for the first stage models
● This paper tackles the case of adapting design & training of neural networks
which are used in first stage for the task of estimating causal effect.
Adapting Neural Networks for the estimation of Treatment effects
● Proposed a dragonnet architecture which takes advantage of propensity
score theorem. In dragonnet we learn Q_hat(1,.), Q_hat(0,.) & g(x) jointly with
shared representations both for outcome models & propensity score model.
● Proposed a regularization procedure, targeted regularization, that induces a
bias towards models that have non-parametrically optimal asymptotic
properties
Adapting Neural Networks for the estimation of Treatment effects
Intuition behind dragonnet architecture
● PST in words: Propensity as a balancing score property
○ It suffices to adjust for only the information in X that is relevant for predicting the outcome.
○ The parts that relevant for predicting the outcome but not for predicting the treatment are
irrelevant for estimating the causal effect.
○ Authors posits, conditioning on those irrelevant parts are going to hurt finite-sample
performance
● Two-stage model (Transfer learning)
○ Learn propensity score model
○ Remove the output layer & freeze weights
○ Learn conditional outcome model
○ Estimate ATE
Dragonnet
● An end-to-end procedure for predicting propensity
score and conditional outcomes from covariates and
treatment
● Z(X) is a share representation layer
● 2 layer neural networks for outcome models, whereas
for propensity score a simple linear map(followed by
sigmoid). This simple map forces the representation
layer to tightly couple to estimated propensity scores
● Trade off prediction quality to achieve good
representation of the propensity score
● This trade-off improves ATE estimation even when we
use a downstream estimator that does not use the
estimated propensity scores.
Dragonnet
Targeted Regularization: Non-Parametric Estimation theory at work
Targeted Regularization: Non-Parametric Estimation theory at work
Targeted Regularization: Non-Parametric Estimation theory at work
Targeted Minimum Loss Estimation(TMLE)
Targeted Regularization: Non-Parametric Estimation theory at work
Results & Comments
Let’s go to the main paper

More Related Content

What's hot

2. System Simulation modeling unit i
2. System Simulation modeling unit i2. System Simulation modeling unit i
2. System Simulation modeling unit i
Amita Gautam
 
Types of Mathematical Model.
Types of Mathematical Model.Types of Mathematical Model.
Types of Mathematical Model.
Megha Sharma
 
A tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbiesA tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbies
Vimal Gupta
 
Advanced regression and model selection
Advanced regression and model selectionAdvanced regression and model selection
Advanced regression and model selection
Ankit Jain
 
Output analysis of a single model
Output analysis of a single modelOutput analysis of a single model
Output analysis of a single model
De La Salle University-Manila
 
Machine Learning Seminar
Machine Learning SeminarMachine Learning Seminar
Machine Learning Seminar
Edwin Efraín Jiménez Lepe
 
Statistical Pattern recognition(1)
Statistical Pattern recognition(1)Statistical Pattern recognition(1)
Statistical Pattern recognition(1)Syed Atif Naseem
 
Supervised learning
Supervised learningSupervised learning
Supervised learning
Alia Hamwi
 
Numerical diffrentiation and integration
Numerical diffrentiation and integrationNumerical diffrentiation and integration
Numerical diffrentiation and integration
POONAMSINGH373
 
Summary.ppt
Summary.pptSummary.ppt
Summary.pptbutest
 
Advanced analytics proposal review guide
Advanced analytics proposal review guideAdvanced analytics proposal review guide
Advanced analytics proposal review guide
Eddy Ti
 
Steps in Simulation Study
Steps in Simulation StudySteps in Simulation Study
Steps in Simulation Study
Nalin Adhikari
 
A3 using data modelling to consider alternatives updated
A3 using data modelling to consider alternatives updatedA3 using data modelling to consider alternatives updated
A3 using data modelling to consider alternatives updated
Peter R Breach
 
MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...
MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...
MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...
The Statistical and Applied Mathematical Sciences Institute
 
Covidien Dc Work Keys 101 2
Covidien Dc    Work Keys 101 2Covidien Dc    Work Keys 101 2
Covidien Dc Work Keys 101 2
LauraAtACT
 
Coursera 1week
Coursera  1weekCoursera  1week
Coursera 1week
csl9496
 
Optimization
OptimizationOptimization
Optimization
Sümeyra Erol
 
Time Series Analysis/ Forecasting
Time Series Analysis/ Forecasting  Time Series Analysis/ Forecasting
Time Series Analysis/ Forecasting
Samudra Kanankearachchi
 
Unit 1 introduction to simulation
Unit 1 introduction to simulationUnit 1 introduction to simulation
Unit 1 introduction to simulation
DevaKumari Vijay
 

What's hot (20)

2. System Simulation modeling unit i
2. System Simulation modeling unit i2. System Simulation modeling unit i
2. System Simulation modeling unit i
 
Types of Mathematical Model.
Types of Mathematical Model.Types of Mathematical Model.
Types of Mathematical Model.
 
A tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbiesA tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbies
 
Advanced regression and model selection
Advanced regression and model selectionAdvanced regression and model selection
Advanced regression and model selection
 
Output analysis of a single model
Output analysis of a single modelOutput analysis of a single model
Output analysis of a single model
 
Machine Learning Seminar
Machine Learning SeminarMachine Learning Seminar
Machine Learning Seminar
 
Statistical Pattern recognition(1)
Statistical Pattern recognition(1)Statistical Pattern recognition(1)
Statistical Pattern recognition(1)
 
Supervised learning
Supervised learningSupervised learning
Supervised learning
 
Numerical diffrentiation and integration
Numerical diffrentiation and integrationNumerical diffrentiation and integration
Numerical diffrentiation and integration
 
Summary.ppt
Summary.pptSummary.ppt
Summary.ppt
 
Advanced analytics proposal review guide
Advanced analytics proposal review guideAdvanced analytics proposal review guide
Advanced analytics proposal review guide
 
Steps in Simulation Study
Steps in Simulation StudySteps in Simulation Study
Steps in Simulation Study
 
A3 using data modelling to consider alternatives updated
A3 using data modelling to consider alternatives updatedA3 using data modelling to consider alternatives updated
A3 using data modelling to consider alternatives updated
 
MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...
MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...
MUMS: Transition & SPUQ Workshop - Stochastic Simulators: Issues, Methods, Un...
 
Input modeling
Input modelingInput modeling
Input modeling
 
Covidien Dc Work Keys 101 2
Covidien Dc    Work Keys 101 2Covidien Dc    Work Keys 101 2
Covidien Dc Work Keys 101 2
 
Coursera 1week
Coursera  1weekCoursera  1week
Coursera 1week
 
Optimization
OptimizationOptimization
Optimization
 
Time Series Analysis/ Forecasting
Time Series Analysis/ Forecasting  Time Series Analysis/ Forecasting
Time Series Analysis/ Forecasting
 
Unit 1 introduction to simulation
Unit 1 introduction to simulationUnit 1 introduction to simulation
Unit 1 introduction to simulation
 

Similar to Adapting neural networks for the estimation of treatment effects

Aaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
AminaRepo
 
ngboost.pptx
ngboost.pptxngboost.pptx
ngboost.pptx
MohamedAliHabib3
 
Deep Learning Module 2A Training MLP.pptx
Deep Learning Module 2A Training MLP.pptxDeep Learning Module 2A Training MLP.pptx
Deep Learning Module 2A Training MLP.pptx
vipul6601
 
Machine learning using matlab.pdf
Machine learning using matlab.pdfMachine learning using matlab.pdf
Machine learning using matlab.pdf
ppvijith
 
Principal component analysis and lda
Principal component analysis and ldaPrincipal component analysis and lda
Principal component analysis and lda
Suresh Pokharel
 
Uncertainty Awareness in Integrating Machine Learning and Game Theory
Uncertainty Awareness in Integrating Machine Learning and Game TheoryUncertainty Awareness in Integrating Machine Learning and Game Theory
Uncertainty Awareness in Integrating Machine Learning and Game Theory
Rikiya Takahashi
 
The Sample Average Approximation Method for Stochastic Programs with Integer ...
The Sample Average Approximation Method for Stochastic Programs with Integer ...The Sample Average Approximation Method for Stochastic Programs with Integer ...
The Sample Average Approximation Method for Stochastic Programs with Integer ...
SSA KPI
 
Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models
ananth
 
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLABAn Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
mathsjournal
 
Machine Learning.pdf
Machine Learning.pdfMachine Learning.pdf
Machine Learning.pdf
BeyaNasr1
 
How to correctly estimate the effect of online advertisement(About Double Mac...
How to correctly estimate the effect of online advertisement(About Double Mac...How to correctly estimate the effect of online advertisement(About Double Mac...
How to correctly estimate the effect of online advertisement(About Double Mac...
Yusuke Kaneko
 
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
Ziyuan Zhao
 
The Green Lab - [05 A] Experiment design (basics)
The Green Lab - [05 A] Experiment design (basics)The Green Lab - [05 A] Experiment design (basics)
The Green Lab - [05 A] Experiment design (basics)
Ivano Malavolta
 
Machine learning
Machine learningMachine learning
Machine learning
Mike Martinez
 
DC02. Interpretation of predictions
DC02. Interpretation of predictionsDC02. Interpretation of predictions
DC02. Interpretation of predictions
Anton Kulesh
 
Py data19 final
Py data19   finalPy data19   final
Py data19 final
Maria Navarro Jiménez
 
Ai saturdays presentation
Ai saturdays presentationAi saturdays presentation
Ai saturdays presentation
Gurram Poorna Prudhvi
 
Artificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 NegnevitskyArtificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 Negnevitskylopanath
 
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
mathsjournal
 
A brief introduction to Searn Algorithm
A brief introduction to Searn AlgorithmA brief introduction to Searn Algorithm
A brief introduction to Searn Algorithm
Supun Abeysinghe
 

Similar to Adapting neural networks for the estimation of treatment effects (20)

Aaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reductionAaa ped-17-Unsupervised Learning: Dimensionality reduction
Aaa ped-17-Unsupervised Learning: Dimensionality reduction
 
ngboost.pptx
ngboost.pptxngboost.pptx
ngboost.pptx
 
Deep Learning Module 2A Training MLP.pptx
Deep Learning Module 2A Training MLP.pptxDeep Learning Module 2A Training MLP.pptx
Deep Learning Module 2A Training MLP.pptx
 
Machine learning using matlab.pdf
Machine learning using matlab.pdfMachine learning using matlab.pdf
Machine learning using matlab.pdf
 
Principal component analysis and lda
Principal component analysis and ldaPrincipal component analysis and lda
Principal component analysis and lda
 
Uncertainty Awareness in Integrating Machine Learning and Game Theory
Uncertainty Awareness in Integrating Machine Learning and Game TheoryUncertainty Awareness in Integrating Machine Learning and Game Theory
Uncertainty Awareness in Integrating Machine Learning and Game Theory
 
The Sample Average Approximation Method for Stochastic Programs with Integer ...
The Sample Average Approximation Method for Stochastic Programs with Integer ...The Sample Average Approximation Method for Stochastic Programs with Integer ...
The Sample Average Approximation Method for Stochastic Programs with Integer ...
 
Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models
 
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLABAn Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
 
Machine Learning.pdf
Machine Learning.pdfMachine Learning.pdf
Machine Learning.pdf
 
How to correctly estimate the effect of online advertisement(About Double Mac...
How to correctly estimate the effect of online advertisement(About Double Mac...How to correctly estimate the effect of online advertisement(About Double Mac...
How to correctly estimate the effect of online advertisement(About Double Mac...
 
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
 
The Green Lab - [05 A] Experiment design (basics)
The Green Lab - [05 A] Experiment design (basics)The Green Lab - [05 A] Experiment design (basics)
The Green Lab - [05 A] Experiment design (basics)
 
Machine learning
Machine learningMachine learning
Machine learning
 
DC02. Interpretation of predictions
DC02. Interpretation of predictionsDC02. Interpretation of predictions
DC02. Interpretation of predictions
 
Py data19 final
Py data19   finalPy data19   final
Py data19 final
 
Ai saturdays presentation
Ai saturdays presentationAi saturdays presentation
Ai saturdays presentation
 
Artificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 NegnevitskyArtificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 Negnevitsky
 
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
An Application of Assignment Problem in Laptop Selection Problem Using MATLAB
 
A brief introduction to Searn Algorithm
A brief introduction to Searn AlgorithmA brief introduction to Searn Algorithm
A brief introduction to Searn Algorithm
 

Recently uploaded

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 

Adapting neural networks for the estimation of treatment effects

  • 1. Adapting Neural Networks for the Estimation of Treatment Effects Authors: Claudia Shi, David M. Blei, and Victor Veitch Presented by Viswanath Gangavaram
  • 2. Estimation of Average Treatment Effect Where Q_hat is an estimate of the conditional outcome Q(t,x) = E[ Y | x, t]
  • 3. ● Conditional Outcome Model(COM) ○ S-Learner( Single Model Approach) ■ Q(t,x) = E[ Y | x, t] ○ Advantages: ■ Given it’s a single model, it’s efficient with data when compare to two model approaches ○ Disadvantages: ■ Estimated ATE is biased towards zero when actual ATE is small Various Estimation Techniques
  • 4. ● Grouped Conditional Outcome Model(GCOM) ○ T-Learner( TWo Model Approach) ■ T(X) = E[Y | T=1, X ] ■ C(X) = E[Y | T=0, X ] ○ Advantages: ■ Can model small ATEs ■ Not so good data efficient estimator ○ Disadvantages: ■ Not good in handling local sparsity Various Estimation Techniques
  • 5. ● X-Learner ○ T-Learner( Two Model & Two Stage Approach) ■ T(X) = E[Y | T=1, X ] ■ C(X) = E[Y | T=0, X ] ■ DT(X) = E[Y - C(X) | X] on train data ■ DC(X) = E[T(X) - Y | X] on test data ■ Causal effect = DC(X) * g(X) + DT(X) * (1-g(X)) ● Note: g(X) is P(T=1|X) ○ Advantages: ■ Like T-learner can model small ATE & no so data efficient estimators ○ Disadvantages: ■ Can handle local sparsity Various Estimation Techniques
  • 6. Various Estimation Techniques ● TARNET ○ Jointly learn T(X) and C(X) with intermediate shared representation ○ Somewhat data efficient estimator
  • 7. ● Generally, estimation proceeds in two stages. First, we fit models for the expected outcome and the probability of treatment for each unit. Second, we add this fitted models in a downstream estimator of causal effect. ● Neural Networks are great choice for the first stage models ● This paper tackles the case of adapting design & training of neural networks which are used in first stage for the task of estimating causal effect. Adapting Neural Networks for the estimation of Treatment effects
  • 8. ● Proposed a dragonnet architecture which takes advantage of propensity score theorem. In dragonnet we learn Q_hat(1,.), Q_hat(0,.) & g(x) jointly with shared representations both for outcome models & propensity score model. ● Proposed a regularization procedure, targeted regularization, that induces a bias towards models that have non-parametrically optimal asymptotic properties Adapting Neural Networks for the estimation of Treatment effects
  • 9.
  • 10. Intuition behind dragonnet architecture ● PST in words: Propensity as a balancing score property ○ It suffices to adjust for only the information in X that is relevant for predicting the outcome. ○ The parts that relevant for predicting the outcome but not for predicting the treatment are irrelevant for estimating the causal effect. ○ Authors posits, conditioning on those irrelevant parts are going to hurt finite-sample performance ● Two-stage model (Transfer learning) ○ Learn propensity score model ○ Remove the output layer & freeze weights ○ Learn conditional outcome model ○ Estimate ATE
  • 11. Dragonnet ● An end-to-end procedure for predicting propensity score and conditional outcomes from covariates and treatment ● Z(X) is a share representation layer ● 2 layer neural networks for outcome models, whereas for propensity score a simple linear map(followed by sigmoid). This simple map forces the representation layer to tightly couple to estimated propensity scores ● Trade off prediction quality to achieve good representation of the propensity score ● This trade-off improves ATE estimation even when we use a downstream estimator that does not use the estimated propensity scores.
  • 13. Targeted Regularization: Non-Parametric Estimation theory at work
  • 14. Targeted Regularization: Non-Parametric Estimation theory at work
  • 15. Targeted Regularization: Non-Parametric Estimation theory at work Targeted Minimum Loss Estimation(TMLE)
  • 16. Targeted Regularization: Non-Parametric Estimation theory at work
  • 17. Results & Comments Let’s go to the main paper