SlideShare a Scribd company logo
1 of 35
Causal Inference in
Epidemiology
Part 2
PH 250B Fall 2016
Jade Benjamin-‐Chung,PhD MPH
Colford-‐HubbardResearch Group
Adapted from Professor Jen Ahern’s 250B slides
Outline
1. What does causal inference entail?
2. Using directed acyclic graphs
a. DAG basics
b. Identifying confounding
c. Understanding selection bias
3. Causal perspective on effect modification
a. Brief recap of effect modification (EM)
b. Linking EM in our studies to reality
c. Types of interaction
d. Causal interaction / EM
1. Sufficient cause model (“causalpies”)
2. Potential outcomes model (“causal types”)
e. Choosing which measure of interaction to estimate and report
4. Integrating causal concepts into your research
Reality of study design
• We often don’t have ideal data on our population
of interest
• The data we collect are incomplete
• Statistics can help us understand correlations or
associations between exposures and outcomes
• Typically what we really want to know is if the
exposure causes the outcome
What does causal inference entail?
• Careful definition of our estimation goals
• A set of assumptions that allow us to link our observed
data to ideal data that would be used to reach our
goals
• Causal inference techniques help us
– Express assumptions about our data in a transparent,
mathematical form
– Provide us with mathematical steps to translate
assumptions into quantities that can be estimated with
observed data
Pearl, Glymour & Jewell, 2016
5
1. Define research hypothesis
– Your hypothesis can include possible effect modification
– Determine to what extent you aim to make causal inferences
using your data
2. Determine study design (trial, cohort, etc.)
3. Draw a DAG
a. Identify potential confounders
b. Choose which variables to measure
4. Analyze your data
a.
b.
c.
Control for confounders identified in step 3
Assess effect modification on the additive or multiplicative
scale
Make statistical inferences
5. Make scientific inferences about your hypothesis
Causal inference in your research
Outline
1. What does causal inference entail?
2. Using directed acyclic graphs
a. DAG basics
b. Identifying confounding
c. Understanding selection bias
3. Causal perspective on effect modification
a. Brief recap of effect modification (EM)
b. Linking EM in our studies to reality
c. Types of interaction
d. Causal interaction / EM
1. Sufficient cause model (“causalpies”)
2. Potential outcomes model (“causal types”)
e. Choosing which measure of interaction to estimate and report
4. Integrating causal concepts into your research
Causal diagrams as mathematical language
“Graphical methods now
provide a powerful
symbolic machinery for
deriving the consequences
of causal assumptions
when such assumptions
are combined with
statistical data.”
Pearl J, 2009, Causality
8
Directed Acyclic Graphs (DAGs)
• Visually depict assumptions about causal relationships
between exposures, outcomes, and other variables
– Depicts the “data generating process”
• DAGs depict our knowledge (or beliefs) about the “data
generating process”
• DAGs are informed by subject matter knowledge, prior
research, and a priori hypotheses
• Learning curve on terminology and approach – practice helps!
Can be very useful tool once you are comfortable with it
How can we use DAGs?
Generally
• Document assumptions about cause-‐effect
relationships
• Explore implications of those assumptions
• Assess how to make causal inferences from both
one’s data and one’s assumptions
Today
• To understand selection bias
• To identify confounding
Pearl J, 2009, Causality
• Direct causal relationships between variables
are represented by arrows
– Directed
– All causal relationships have a direction
– A given variable cannot be simultaneously a cause
and an effect
SES Prenatal Care
1
0
DAG construction
Malnutrition (M)
Infection (I) I (t=0) I (t=1)
M (t=0) M (t=1)
• There are no feedback loops
– Acyclic
– Causes always precede their effects
– To avoid feedback loops, extend graph over time
1
1
DAG construction
Vitamins Birth Defects
Prenatal Care
Difficulty conceivingSES
Maternal genetics
• Parent & Child:
– Directly connected by an arrow
– Prenatal care is a “parent” of birthdefects
– Birth defects is a “child” of pre-‐natalcare
1
2
DAG terminology
Vitamins Birth Defects
Prenatal Care
Difficulty conceivingSES
Maternal genetics
• Ancestor & Descendant:
– Connected by a directed path of a series of arrows
– SES is an “ancestor” of Birth Defects
– Birth Defects is a “descendant” of SES
1
3
DAG terminology
Smoking
Smoking
CancerTar Mutations
Cancer
• Absence of a directed path from X to Y
implies X has no effect on Y
– Directed paths not in the graph as important as those in
the graph
• Note: Not all intermediate steps between two
variables need to be represented
– Depends on level of detail of the model
1
4
DAG assumptions
• DAGs assume that all common causes of exposure
and disease are included
– Common causes that are not observed should still be
included
– These are often denoted with a “U” to indicate they were
unmeasured
U (religious
beliefs, culture,
lifestyle, etc.)
Alcohol Use
Smoking
Heart Disease
DAG assumptions
1
5
Example
Speed
Bicycle Fall
1
6
Example
Speed
Bicycle
Characteristics
Road/Lane/Path
Surface
Bicycle Traffic
Road Grade
Car Traffic
Bicycle Fall
1
7
SpeedCar Traffic
Example
Rider Skill/Experience
Bicycle
Characteristics
Road/Lane/Path
Surface
Bicycle Traffic
Road Grade
Bicycle Fall
1
8
Speed
Bicycle
Characteristics
Road/Lane/Path
Surface
Bicycle Traffic
Road Grade
Car Traffic
Populace Bicycle
Awareness
Bicycle Lane/
Path
Example
Rider Skill/Experience
Bicycle Fall
1
9
Speed
Rider Skill/Experience
Bicycle
Characteristics
Bicycle Traffic
Road/Lane/Path
Surface
Road Grade
Car Traffic
Populace Bicycle
Awareness
Bicycle Lane/
Path
Example
Bicycle Fall
20
What are some
assumptions are we
making?
Bicycle lane/path
only has an effect on
bicycle falls through
its effect on bicycle
traffic
Road surface does
not affect bicycle
traffic
All common causes of
speed and bicycle fall
are included (even
those unmeasured)
Statistical underpinnings of DAGs
• Multiple possible causal models for this DAG:
Y Z
X = School funding
Y = SAT Scores
Z = College Acceptance
X = UX
Y = (x/3) + UY
Z = (y/16) + UZ
X = Number of hours worked per week
Y = Number of training hours per week
Z = Race completion time
X = UX
Y = 84 – x + UY
Z = (100/y) + UZ
UX UY
X
UZ
Pearl, Glymour & Jewell, 2016
Statistical underpinnings of DAGs
• Both models share the same statistical relationships:
• Z and Y are dependent
• Y and X are dependent
• Z and X are likely dependent
• Z and X are independent
depending on the values of Y
Y Z
UX UY
X
UZ
X = School funding
Y = SAT Scores
Z = College Acceptance
X = UX
Y = (x/3) + UY
Z = (y/16) + UZ
Pearl, Glymour & Jewell, 2016
Statistical underpinnings of DAGs
• Both models share the same statistical relationships
For specific values of these variables (lower case x, y, z):
• Z and Y are dependent
• Y and X are dependent
• Z and X are likely dependent
• Z and X are independent
depending on the values of Y
Y Z
UX UY
X
UZ
Pearl, Glymour & Jewell, 2016
Conditioning on a variable in a DAG
• “Conditioning” on a variable means filtering the data into
groups based on the value of a variable
• A box is often used around a variable denote that it is being
conditioned on (e.g., in this DAG we condition on Y)
• This is equivalent to stratifying the data or controlling for a
variable in a statistical model
X Y Z
UX UY UZ
DAG configurations
X
X
Y Z
Y
Z
X Z
Y
Chain
Fork
Collider
* Has special considerations
and challenges
Pearl, Glymour & Jewell, 2016
CancerDiet
BMI
Colliders
In this example,
BMI is a collider
2
6
CancerDiet
BMI
Colliders
Conditioning on
BMI induces an
association
between diet and
cancer
CancerDiet
BMI
Among those who
have had a BMI
decrease there
will be larger
numbers of
dieters and larger
numbers of
people with
cancer than in the
total population
2
7
Colliders
YX
Why does conditioning on a collider induce an
association between its parents?
Example: Z=X+Y
Pearl, Glymour & Jewell, 2016
2
8
Do not condition on Z:
• X=3 ! know nothing about Y
Z
Colliders
YX
Z
Why does conditioning on a collider induce an
association between its parents?
Example: Z=X+Y
Pearl, Glymour & Jewell, 2016
2
9
Do not condition on Z:
• X=3 ! know nothing about Y
Condition on Z:
• Z=10, X=3 ! know Y=7
Thus, X and Y are dependent
given that (“conditional on”)
Z = 10
3
0
Strengths of DAGs
• Can determine which variables depend on each
other in our causal model without knowing the
specific functions (e.g., Z=X+Y in the previous slide)
connecting them (Pearl, Glymour & Jewell, 2016)
• Allow us to link our causal model to our statistical
relationships in our data
• DAGS can incorporate measurement error as well
(Hernan & Cole, 2009)
3
1
Limitations of DAGs
• Cannot display effect modification easily (example of road
surface)
• Arrows in graphs do not provide specific definitions of effects
(contrast with counterfactuals)
• Can become extremely complicated when representing real
data structures
• Are not designed to capture effects of infectious disease
interventions that may impact not only intervention
recipients but also non-‐recipients(e.g., herd effects of
vaccines)
DAG limitations
Example of extremely complicated
3
2
3
3
DAGs
• DAG itself is not used to analyze data from the study
you’ve conducted
– Informs how study is designed/data are collected
– Informs how data are analyzed
– Helps identify which research questions are answerable in
a given data set
• Utility of DAGs dependent on accuracy/correctness
of associations we represent in the diagram
3
4
Non-‐parametricstructural equation
models
• Non-‐parametricstructural equation models
(NPSEMs) provide a link between DAGs and
counterfactuals and are a way to analyze data
• They encode relationships between variables that
can include many possible equations and functional
forms
• Non-‐parametricestimation used to avoid
assumptions of typical SEMs (e.g., linearity)
• Learn more about this topic in PH252D
Example of NPSEM
Y Z
UX UY
X
UZ
Previous example:
X = School funding
Y = SAT Scores
Z = College Acceptance
X = UX
Y = (x/3) + UY
Z = (y/16) + UZ
Pearl, Glymour & Jewell, 2016
NPSEM:
X = School funding
Y = SAT Scores
Z = College Acceptance
X = fX(UX)
Y = fY(X, UY)
Z = fZ(Y, UZ)

More Related Content

What's hot

Bias in epidemiology uploaded
Bias in epidemiology uploadedBias in epidemiology uploaded
Bias in epidemiology uploadedKumar Mrigesh
 
Epidemiological study designs
Epidemiological study designsEpidemiological study designs
Epidemiological study designsKayode Afolabi
 
Nested case control study
Nested case control studyNested case control study
Nested case control studyPrayas Gautam
 
Cross sectional research desighn
Cross  sectional research desighnCross  sectional research desighn
Cross sectional research desighnAhmad Mashhood
 
4.4. effect modification
4.4. effect modification4.4. effect modification
4.4. effect modificationA M
 
2. ph250b.14 measures of association 1
2.  ph250b.14  measures of association 12.  ph250b.14  measures of association 1
2. ph250b.14 measures of association 1A M
 
Intro biostat1&2
Intro biostat1&2Intro biostat1&2
Intro biostat1&2Lucidante1
 
Fundamentals and Study Design of Epidemiology
Fundamentals and Study Design of EpidemiologyFundamentals and Study Design of Epidemiology
Fundamentals and Study Design of EpidemiologyFarazaJaved
 
Association and Causation
Association and CausationAssociation and Causation
Association and CausationRizwan S A
 
Measures of association 2013
Measures of association 2013Measures of association 2013
Measures of association 2013dinahoefer11
 
Chapter seven. epidemiologic study designs
Chapter seven. epidemiologic study designsChapter seven. epidemiologic study designs
Chapter seven. epidemiologic study designsOmar Osman Eid
 

What's hot (20)

Association & cousation
Association & cousationAssociation & cousation
Association & cousation
 
Bias in epidemiology uploaded
Bias in epidemiology uploadedBias in epidemiology uploaded
Bias in epidemiology uploaded
 
Epidemiological study designs
Epidemiological study designsEpidemiological study designs
Epidemiological study designs
 
Nested case control study
Nested case control studyNested case control study
Nested case control study
 
Association and causation
Association and causationAssociation and causation
Association and causation
 
Cross sectional research desighn
Cross  sectional research desighnCross  sectional research desighn
Cross sectional research desighn
 
4.4. effect modification
4.4. effect modification4.4. effect modification
4.4. effect modification
 
2. ph250b.14 measures of association 1
2.  ph250b.14  measures of association 12.  ph250b.14  measures of association 1
2. ph250b.14 measures of association 1
 
Bias and confounding
Bias and confoundingBias and confounding
Bias and confounding
 
Intro biostat1&2
Intro biostat1&2Intro biostat1&2
Intro biostat1&2
 
Association and causation
Association and causationAssociation and causation
Association and causation
 
Epidemiology Study Design
Epidemiology Study DesignEpidemiology Study Design
Epidemiology Study Design
 
Association and Causation
Association and CausationAssociation and Causation
Association and Causation
 
Bias and confounder
Bias and confounderBias and confounder
Bias and confounder
 
5. cohort studies
5. cohort  studies5. cohort  studies
5. cohort studies
 
Fundamentals and Study Design of Epidemiology
Fundamentals and Study Design of EpidemiologyFundamentals and Study Design of Epidemiology
Fundamentals and Study Design of Epidemiology
 
Association and Causation
Association and CausationAssociation and Causation
Association and Causation
 
Biostatistics
BiostatisticsBiostatistics
Biostatistics
 
Measures of association 2013
Measures of association 2013Measures of association 2013
Measures of association 2013
 
Chapter seven. epidemiologic study designs
Chapter seven. epidemiologic study designsChapter seven. epidemiologic study designs
Chapter seven. epidemiologic study designs
 

Viewers also liked

5.3.1 causal em
5.3.1 causal em5.3.1 causal em
5.3.1 causal emA M
 
5.3.4 reporting em
5.3.4 reporting em5.3.4 reporting em
5.3.4 reporting emA M
 
5.3.2 sufficient cause em
5.3.2 sufficient cause em5.3.2 sufficient cause em
5.3.2 sufficient cause emA M
 
Epidemiologic measures and policy formulation lessons from potential outcomes...
Epidemiologic measures and policy formulation lessons from potential outcomes...Epidemiologic measures and policy formulation lessons from potential outcomes...
Epidemiologic measures and policy formulation lessons from potential outcomes...Bsie
 
5.2.3 dags for selection bias
5.2.3 dags for selection bias5.2.3 dags for selection bias
5.2.3 dags for selection biasA M
 
5.3.3 potential outcomes em
5.3.3 potential outcomes em5.3.3 potential outcomes em
5.3.3 potential outcomes emA M
 
5.1.2 counterfactual framework
5.1.2 counterfactual framework5.1.2 counterfactual framework
5.1.2 counterfactual frameworkA M
 
5.1.3 hills criteria
5.1.3 hills criteria5.1.3 hills criteria
5.1.3 hills criteriaA M
 
5.1.1 sufficient component cause model
5.1.1 sufficient component cause model5.1.1 sufficient component cause model
5.1.1 sufficient component cause modelA M
 
5.2.2 dags for confounding
5.2.2 dags for confounding5.2.2 dags for confounding
5.2.2 dags for confoundingA M
 
5.3.5 causal inference in research
5.3.5 causal inference in research5.3.5 causal inference in research
5.3.5 causal inference in researchA M
 

Viewers also liked (11)

5.3.1 causal em
5.3.1 causal em5.3.1 causal em
5.3.1 causal em
 
5.3.4 reporting em
5.3.4 reporting em5.3.4 reporting em
5.3.4 reporting em
 
5.3.2 sufficient cause em
5.3.2 sufficient cause em5.3.2 sufficient cause em
5.3.2 sufficient cause em
 
Epidemiologic measures and policy formulation lessons from potential outcomes...
Epidemiologic measures and policy formulation lessons from potential outcomes...Epidemiologic measures and policy formulation lessons from potential outcomes...
Epidemiologic measures and policy formulation lessons from potential outcomes...
 
5.2.3 dags for selection bias
5.2.3 dags for selection bias5.2.3 dags for selection bias
5.2.3 dags for selection bias
 
5.3.3 potential outcomes em
5.3.3 potential outcomes em5.3.3 potential outcomes em
5.3.3 potential outcomes em
 
5.1.2 counterfactual framework
5.1.2 counterfactual framework5.1.2 counterfactual framework
5.1.2 counterfactual framework
 
5.1.3 hills criteria
5.1.3 hills criteria5.1.3 hills criteria
5.1.3 hills criteria
 
5.1.1 sufficient component cause model
5.1.1 sufficient component cause model5.1.1 sufficient component cause model
5.1.1 sufficient component cause model
 
5.2.2 dags for confounding
5.2.2 dags for confounding5.2.2 dags for confounding
5.2.2 dags for confounding
 
5.3.5 causal inference in research
5.3.5 causal inference in research5.3.5 causal inference in research
5.3.5 causal inference in research
 

Similar to 5.2.1 dags

Introduction to Directed Acyclic Graphs.pptx
Introduction to Directed Acyclic Graphs.pptxIntroduction to Directed Acyclic Graphs.pptx
Introduction to Directed Acyclic Graphs.pptxDivyanshGupta922023
 
Confounding and Directed Acyclic Graphs
Confounding and Directed Acyclic GraphsConfounding and Directed Acyclic Graphs
Confounding and Directed Acyclic GraphsDarren L Dahly PhD
 
Debunk bullshit in statistics QN
Debunk bullshit in statistics QNDebunk bullshit in statistics QN
Debunk bullshit in statistics QNQuan Nguyen
 
Endogeneity and Entrepreneurship Research
Endogeneity and Entrepreneurship ResearchEndogeneity and Entrepreneurship Research
Endogeneity and Entrepreneurship ResearchBrian Anderson
 
Scatterplots and Cautions of Correlation
Scatterplots and Cautions of CorrelationScatterplots and Cautions of Correlation
Scatterplots and Cautions of CorrelationOleg Janke
 
Intergenerational mobility, intergenerational effects, the role of family bac...
Intergenerational mobility, intergenerational effects, the role of family bac...Intergenerational mobility, intergenerational effects, the role of family bac...
Intergenerational mobility, intergenerational effects, the role of family bac...Stockholm Institute of Transition Economics
 
Perspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsPerspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsGianluca Bontempi
 
Generalized SEIR Model on Large Networks
Generalized SEIR Model on Large NetworksGeneralized SEIR Model on Large Networks
Generalized SEIR Model on Large NetworksDatabricks
 
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptDescriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptRama Krishna
 
Statistics for IB Biology
Statistics for IB BiologyStatistics for IB Biology
Statistics for IB BiologyEran Earland
 
Representing and generating uncertainty effectively presentatıon
Representing and generating uncertainty effectively presentatıonRepresenting and generating uncertainty effectively presentatıon
Representing and generating uncertainty effectively presentatıonAzdeen Najah
 
Chapter 2 Simple Linear Regression Model.pptx
Chapter 2 Simple Linear Regression Model.pptxChapter 2 Simple Linear Regression Model.pptx
Chapter 2 Simple Linear Regression Model.pptxaschalew shiferaw
 
Regression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsRegression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsMaarten van Smeden
 
Statistical analysis of correlated data using generalized estimating equation...
Statistical analysis of correlated data using generalized estimating equation...Statistical analysis of correlated data using generalized estimating equation...
Statistical analysis of correlated data using generalized estimating equation...Angelina Lessa
 
Correlations and t scores (2)
Correlations and t scores (2)Correlations and t scores (2)
Correlations and t scores (2)Pedro Martinez
 

Similar to 5.2.1 dags (20)

Introduction to Directed Acyclic Graphs.pptx
Introduction to Directed Acyclic Graphs.pptxIntroduction to Directed Acyclic Graphs.pptx
Introduction to Directed Acyclic Graphs.pptx
 
Confounding and Directed Acyclic Graphs
Confounding and Directed Acyclic GraphsConfounding and Directed Acyclic Graphs
Confounding and Directed Acyclic Graphs
 
Debunk bullshit in statistics QN
Debunk bullshit in statistics QNDebunk bullshit in statistics QN
Debunk bullshit in statistics QN
 
Endogeneity and Entrepreneurship Research
Endogeneity and Entrepreneurship ResearchEndogeneity and Entrepreneurship Research
Endogeneity and Entrepreneurship Research
 
Scatterplots and Cautions of Correlation
Scatterplots and Cautions of CorrelationScatterplots and Cautions of Correlation
Scatterplots and Cautions of Correlation
 
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal InferencePrasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
 
Module7_RamdomError.pptx
Module7_RamdomError.pptxModule7_RamdomError.pptx
Module7_RamdomError.pptx
 
Correlation.pptx.pdf
Correlation.pptx.pdfCorrelation.pptx.pdf
Correlation.pptx.pdf
 
Intergenerational mobility, intergenerational effects, the role of family bac...
Intergenerational mobility, intergenerational effects, the role of family bac...Intergenerational mobility, intergenerational effects, the role of family bac...
Intergenerational mobility, intergenerational effects, the role of family bac...
 
Perspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsPerspective of feature selection in bioinformatics
Perspective of feature selection in bioinformatics
 
Generalized SEIR Model on Large Networks
Generalized SEIR Model on Large NetworksGeneralized SEIR Model on Large Networks
Generalized SEIR Model on Large Networks
 
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptDescriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
 
Statistics for IB Biology
Statistics for IB BiologyStatistics for IB Biology
Statistics for IB Biology
 
Representing and generating uncertainty effectively presentatıon
Representing and generating uncertainty effectively presentatıonRepresenting and generating uncertainty effectively presentatıon
Representing and generating uncertainty effectively presentatıon
 
Aussem
AussemAussem
Aussem
 
Chapter 2 Simple Linear Regression Model.pptx
Chapter 2 Simple Linear Regression Model.pptxChapter 2 Simple Linear Regression Model.pptx
Chapter 2 Simple Linear Regression Model.pptx
 
Regression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsRegression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questions
 
Statistical analysis of correlated data using generalized estimating equation...
Statistical analysis of correlated data using generalized estimating equation...Statistical analysis of correlated data using generalized estimating equation...
Statistical analysis of correlated data using generalized estimating equation...
 
Correlations and t scores (2)
Correlations and t scores (2)Correlations and t scores (2)
Correlations and t scores (2)
 
ECONOMETRICS I ASA
ECONOMETRICS I ASAECONOMETRICS I ASA
ECONOMETRICS I ASA
 

More from A M

Transparency7
Transparency7Transparency7
Transparency7A M
 
Transparency6
Transparency6Transparency6
Transparency6A M
 
Transparency5
Transparency5Transparency5
Transparency5A M
 
Transparency4
Transparency4Transparency4
Transparency4A M
 
Transparency3
Transparency3Transparency3
Transparency3A M
 
Transparency2
Transparency2Transparency2
Transparency2A M
 
Transparency1
Transparency1Transparency1
Transparency1A M
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regressionA M
 
4.3.2. controlling confounding stratification
4.3.2. controlling confounding stratification4.3.2. controlling confounding stratification
4.3.2. controlling confounding stratificationA M
 
4.3.1. controlling confounding matching
4.3.1. controlling confounding matching4.3.1. controlling confounding matching
4.3.1. controlling confounding matchingA M
 
4.2.4. confounding counterfactual
4.2.4. confounding counterfactual4.2.4. confounding counterfactual
4.2.4. confounding counterfactualA M
 
4.2.3. confounding collapsability
4.2.3. confounding collapsability4.2.3. confounding collapsability
4.2.3. confounding collapsabilityA M
 
4.2.2. confounding classical approach
4.2.2. confounding classical approach4.2.2. confounding classical approach
4.2.2. confounding classical approachA M
 
4.2.1. confounding mixing of effects
4.2.1. confounding mixing of effects4.2.1. confounding mixing of effects
4.2.1. confounding mixing of effectsA M
 
4.1. introduction
4.1. introduction4.1. introduction
4.1. introductionA M
 
4 Threats to validity from confounding bias and effect modification
4 Threats to validity from confounding bias and effect modification4 Threats to validity from confounding bias and effect modification
4 Threats to validity from confounding bias and effect modificationA M
 
6.5 strengths and challenges
6.5 strengths and challenges6.5 strengths and challenges
6.5 strengths and challengesA M
 
6.7 summaries
6.7 summaries6.7 summaries
6.7 summariesA M
 
6.6 examples
6.6 examples6.6 examples
6.6 examplesA M
 

More from A M (19)

Transparency7
Transparency7Transparency7
Transparency7
 
Transparency6
Transparency6Transparency6
Transparency6
 
Transparency5
Transparency5Transparency5
Transparency5
 
Transparency4
Transparency4Transparency4
Transparency4
 
Transparency3
Transparency3Transparency3
Transparency3
 
Transparency2
Transparency2Transparency2
Transparency2
 
Transparency1
Transparency1Transparency1
Transparency1
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regression
 
4.3.2. controlling confounding stratification
4.3.2. controlling confounding stratification4.3.2. controlling confounding stratification
4.3.2. controlling confounding stratification
 
4.3.1. controlling confounding matching
4.3.1. controlling confounding matching4.3.1. controlling confounding matching
4.3.1. controlling confounding matching
 
4.2.4. confounding counterfactual
4.2.4. confounding counterfactual4.2.4. confounding counterfactual
4.2.4. confounding counterfactual
 
4.2.3. confounding collapsability
4.2.3. confounding collapsability4.2.3. confounding collapsability
4.2.3. confounding collapsability
 
4.2.2. confounding classical approach
4.2.2. confounding classical approach4.2.2. confounding classical approach
4.2.2. confounding classical approach
 
4.2.1. confounding mixing of effects
4.2.1. confounding mixing of effects4.2.1. confounding mixing of effects
4.2.1. confounding mixing of effects
 
4.1. introduction
4.1. introduction4.1. introduction
4.1. introduction
 
4 Threats to validity from confounding bias and effect modification
4 Threats to validity from confounding bias and effect modification4 Threats to validity from confounding bias and effect modification
4 Threats to validity from confounding bias and effect modification
 
6.5 strengths and challenges
6.5 strengths and challenges6.5 strengths and challenges
6.5 strengths and challenges
 
6.7 summaries
6.7 summaries6.7 summaries
6.7 summaries
 
6.6 examples
6.6 examples6.6 examples
6.6 examples
 

Recently uploaded

Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSAnaAcapella
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Philosophy of china and it's charactistics
Philosophy of china and it's charactisticsPhilosophy of china and it's charactistics
Philosophy of china and it's charactisticshameyhk98
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptNishitharanjan Rout
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxPooja Bhuva
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfPondicherry University
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17Celine George
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsNbelano25
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptxJoelynRubio1
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111GangaMaiya1
 

Recently uploaded (20)

Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Philosophy of china and it's charactistics
Philosophy of china and it's charactisticsPhilosophy of china and it's charactistics
Philosophy of china and it's charactistics
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf arts
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
 
Call Girls in Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in  Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in  Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111
 

5.2.1 dags

  • 1. Causal Inference in Epidemiology Part 2 PH 250B Fall 2016 Jade Benjamin-‐Chung,PhD MPH Colford-‐HubbardResearch Group Adapted from Professor Jen Ahern’s 250B slides
  • 2. Outline 1. What does causal inference entail? 2. Using directed acyclic graphs a. DAG basics b. Identifying confounding c. Understanding selection bias 3. Causal perspective on effect modification a. Brief recap of effect modification (EM) b. Linking EM in our studies to reality c. Types of interaction d. Causal interaction / EM 1. Sufficient cause model (“causalpies”) 2. Potential outcomes model (“causal types”) e. Choosing which measure of interaction to estimate and report 4. Integrating causal concepts into your research
  • 3. Reality of study design • We often don’t have ideal data on our population of interest • The data we collect are incomplete • Statistics can help us understand correlations or associations between exposures and outcomes • Typically what we really want to know is if the exposure causes the outcome
  • 4. What does causal inference entail? • Careful definition of our estimation goals • A set of assumptions that allow us to link our observed data to ideal data that would be used to reach our goals • Causal inference techniques help us – Express assumptions about our data in a transparent, mathematical form – Provide us with mathematical steps to translate assumptions into quantities that can be estimated with observed data Pearl, Glymour & Jewell, 2016
  • 5. 5 1. Define research hypothesis – Your hypothesis can include possible effect modification – Determine to what extent you aim to make causal inferences using your data 2. Determine study design (trial, cohort, etc.) 3. Draw a DAG a. Identify potential confounders b. Choose which variables to measure 4. Analyze your data a. b. c. Control for confounders identified in step 3 Assess effect modification on the additive or multiplicative scale Make statistical inferences 5. Make scientific inferences about your hypothesis Causal inference in your research
  • 6. Outline 1. What does causal inference entail? 2. Using directed acyclic graphs a. DAG basics b. Identifying confounding c. Understanding selection bias 3. Causal perspective on effect modification a. Brief recap of effect modification (EM) b. Linking EM in our studies to reality c. Types of interaction d. Causal interaction / EM 1. Sufficient cause model (“causalpies”) 2. Potential outcomes model (“causal types”) e. Choosing which measure of interaction to estimate and report 4. Integrating causal concepts into your research
  • 7. Causal diagrams as mathematical language “Graphical methods now provide a powerful symbolic machinery for deriving the consequences of causal assumptions when such assumptions are combined with statistical data.” Pearl J, 2009, Causality
  • 8. 8 Directed Acyclic Graphs (DAGs) • Visually depict assumptions about causal relationships between exposures, outcomes, and other variables – Depicts the “data generating process” • DAGs depict our knowledge (or beliefs) about the “data generating process” • DAGs are informed by subject matter knowledge, prior research, and a priori hypotheses • Learning curve on terminology and approach – practice helps! Can be very useful tool once you are comfortable with it
  • 9. How can we use DAGs? Generally • Document assumptions about cause-‐effect relationships • Explore implications of those assumptions • Assess how to make causal inferences from both one’s data and one’s assumptions Today • To understand selection bias • To identify confounding Pearl J, 2009, Causality
  • 10. • Direct causal relationships between variables are represented by arrows – Directed – All causal relationships have a direction – A given variable cannot be simultaneously a cause and an effect SES Prenatal Care 1 0 DAG construction
  • 11. Malnutrition (M) Infection (I) I (t=0) I (t=1) M (t=0) M (t=1) • There are no feedback loops – Acyclic – Causes always precede their effects – To avoid feedback loops, extend graph over time 1 1 DAG construction
  • 12. Vitamins Birth Defects Prenatal Care Difficulty conceivingSES Maternal genetics • Parent & Child: – Directly connected by an arrow – Prenatal care is a “parent” of birthdefects – Birth defects is a “child” of pre-‐natalcare 1 2 DAG terminology
  • 13. Vitamins Birth Defects Prenatal Care Difficulty conceivingSES Maternal genetics • Ancestor & Descendant: – Connected by a directed path of a series of arrows – SES is an “ancestor” of Birth Defects – Birth Defects is a “descendant” of SES 1 3 DAG terminology
  • 14. Smoking Smoking CancerTar Mutations Cancer • Absence of a directed path from X to Y implies X has no effect on Y – Directed paths not in the graph as important as those in the graph • Note: Not all intermediate steps between two variables need to be represented – Depends on level of detail of the model 1 4 DAG assumptions
  • 15. • DAGs assume that all common causes of exposure and disease are included – Common causes that are not observed should still be included – These are often denoted with a “U” to indicate they were unmeasured U (religious beliefs, culture, lifestyle, etc.) Alcohol Use Smoking Heart Disease DAG assumptions 1 5
  • 19. Speed Bicycle Characteristics Road/Lane/Path Surface Bicycle Traffic Road Grade Car Traffic Populace Bicycle Awareness Bicycle Lane/ Path Example Rider Skill/Experience Bicycle Fall 1 9
  • 20. Speed Rider Skill/Experience Bicycle Characteristics Bicycle Traffic Road/Lane/Path Surface Road Grade Car Traffic Populace Bicycle Awareness Bicycle Lane/ Path Example Bicycle Fall 20 What are some assumptions are we making? Bicycle lane/path only has an effect on bicycle falls through its effect on bicycle traffic Road surface does not affect bicycle traffic All common causes of speed and bicycle fall are included (even those unmeasured)
  • 21. Statistical underpinnings of DAGs • Multiple possible causal models for this DAG: Y Z X = School funding Y = SAT Scores Z = College Acceptance X = UX Y = (x/3) + UY Z = (y/16) + UZ X = Number of hours worked per week Y = Number of training hours per week Z = Race completion time X = UX Y = 84 – x + UY Z = (100/y) + UZ UX UY X UZ Pearl, Glymour & Jewell, 2016
  • 22. Statistical underpinnings of DAGs • Both models share the same statistical relationships: • Z and Y are dependent • Y and X are dependent • Z and X are likely dependent • Z and X are independent depending on the values of Y Y Z UX UY X UZ X = School funding Y = SAT Scores Z = College Acceptance X = UX Y = (x/3) + UY Z = (y/16) + UZ Pearl, Glymour & Jewell, 2016
  • 23. Statistical underpinnings of DAGs • Both models share the same statistical relationships For specific values of these variables (lower case x, y, z): • Z and Y are dependent • Y and X are dependent • Z and X are likely dependent • Z and X are independent depending on the values of Y Y Z UX UY X UZ Pearl, Glymour & Jewell, 2016
  • 24. Conditioning on a variable in a DAG • “Conditioning” on a variable means filtering the data into groups based on the value of a variable • A box is often used around a variable denote that it is being conditioned on (e.g., in this DAG we condition on Y) • This is equivalent to stratifying the data or controlling for a variable in a statistical model X Y Z UX UY UZ
  • 25. DAG configurations X X Y Z Y Z X Z Y Chain Fork Collider * Has special considerations and challenges Pearl, Glymour & Jewell, 2016
  • 27. CancerDiet BMI Colliders Conditioning on BMI induces an association between diet and cancer CancerDiet BMI Among those who have had a BMI decrease there will be larger numbers of dieters and larger numbers of people with cancer than in the total population 2 7
  • 28. Colliders YX Why does conditioning on a collider induce an association between its parents? Example: Z=X+Y Pearl, Glymour & Jewell, 2016 2 8 Do not condition on Z: • X=3 ! know nothing about Y Z
  • 29. Colliders YX Z Why does conditioning on a collider induce an association between its parents? Example: Z=X+Y Pearl, Glymour & Jewell, 2016 2 9 Do not condition on Z: • X=3 ! know nothing about Y Condition on Z: • Z=10, X=3 ! know Y=7 Thus, X and Y are dependent given that (“conditional on”) Z = 10
  • 30. 3 0 Strengths of DAGs • Can determine which variables depend on each other in our causal model without knowing the specific functions (e.g., Z=X+Y in the previous slide) connecting them (Pearl, Glymour & Jewell, 2016) • Allow us to link our causal model to our statistical relationships in our data • DAGS can incorporate measurement error as well (Hernan & Cole, 2009)
  • 31. 3 1 Limitations of DAGs • Cannot display effect modification easily (example of road surface) • Arrows in graphs do not provide specific definitions of effects (contrast with counterfactuals) • Can become extremely complicated when representing real data structures • Are not designed to capture effects of infectious disease interventions that may impact not only intervention recipients but also non-‐recipients(e.g., herd effects of vaccines)
  • 32. DAG limitations Example of extremely complicated 3 2
  • 33. 3 3 DAGs • DAG itself is not used to analyze data from the study you’ve conducted – Informs how study is designed/data are collected – Informs how data are analyzed – Helps identify which research questions are answerable in a given data set • Utility of DAGs dependent on accuracy/correctness of associations we represent in the diagram
  • 34. 3 4 Non-‐parametricstructural equation models • Non-‐parametricstructural equation models (NPSEMs) provide a link between DAGs and counterfactuals and are a way to analyze data • They encode relationships between variables that can include many possible equations and functional forms • Non-‐parametricestimation used to avoid assumptions of typical SEMs (e.g., linearity) • Learn more about this topic in PH252D
  • 35. Example of NPSEM Y Z UX UY X UZ Previous example: X = School funding Y = SAT Scores Z = College Acceptance X = UX Y = (x/3) + UY Z = (y/16) + UZ Pearl, Glymour & Jewell, 2016 NPSEM: X = School funding Y = SAT Scores Z = College Acceptance X = fX(UX) Y = fY(X, UY) Z = fZ(Y, UZ)