SlideShare a Scribd company logo
1 of 81
Download to read offline
Mendelian Randomisation
James McMurray
PhD Student
Department of Empirical Inference
29/07/2014
What is Mendelian Randomisation?
What is Mendelian Randomisation?
Approach to test for a causal effect from observational data in
the presence of certain confounding factors.
What is Mendelian Randomisation?
Approach to test for a causal effect from observational data in
the presence of certain confounding factors.
Uses the measured variation of genes of known function, to
bound the causal effect of a modifiable exposure
(environment) on a phenotype (disease).
What is Mendelian Randomisation?
Approach to test for a causal effect from observational data in
the presence of certain confounding factors.
Uses the measured variation of genes of known function, to
bound the causal effect of a modifiable exposure
(environment) on a phenotype (disease).
Fundamental idea is that the genotypes are randomly assigned
(due to meiosis).
What is Mendelian Randomisation?
Approach to test for a causal effect from observational data in
the presence of certain confounding factors.
Uses the measured variation of genes of known function, to
bound the causal effect of a modifiable exposure
(environment) on a phenotype (disease).
Fundamental idea is that the genotypes are randomly assigned
(due to meiosis).
This allows them to be used as an instrumental variable.
What is Mendelian Randomisation?: DAG
Motivation: Why Observational Data?
Motivation: Why Observational Data?
Randomised Control Trials (RCTs) are the gold standard for
causal inference.
Motivation: Why Observational Data?
Randomised Control Trials (RCTs) are the gold standard for
causal inference.
However, it is often not ethical or possible to carry out RCTs.
Motivation: Why Observational Data?
Randomised Control Trials (RCTs) are the gold standard for
causal inference.
However, it is often not ethical or possible to carry out RCTs.
E.g. we cannot randomly assign a lifetime of heavy smoking
and no smoking to groups of individuals.
Motivation: Why Observational Data?
Randomised Control Trials (RCTs) are the gold standard for
causal inference.
However, it is often not ethical or possible to carry out RCTs.
E.g. we cannot randomly assign a lifetime of heavy smoking
and no smoking to groups of individuals.
This leads to a need to use observational data.
Motivation: Why Observational Data?
Randomised Control Trials (RCTs) are the gold standard for
causal inference.
However, it is often not ethical or possible to carry out RCTs.
E.g. we cannot randomly assign a lifetime of heavy smoking
and no smoking to groups of individuals.
This leads to a need to use observational data.
But this requires many assumptions.
Instrumental Variables (IV)
Instrumental Variables (IV)
In the previous DAG, G is the instrumental variable
(instrument).
Instrumental Variables (IV)
In the previous DAG, G is the instrumental variable
(instrument).
Because it affects Y only through X (exclusively)
Instrumental Variables (IV)
In the previous DAG, G is the instrumental variable
(instrument).
Because it affects Y only through X (exclusively)
Therefore, under certain assumptions, if G is correlated with Y
then we can infer the edge X → Y
Instrumental Variables (IV)
In the previous DAG, G is the instrumental variable
(instrument).
Because it affects Y only through X (exclusively)
Therefore, under certain assumptions, if G is correlated with Y
then we can infer the edge X → Y
First we will consider an example
Instrumental Variables (IV): Example
Instrumental Variables (IV): Example
If the assumptions are met, then if we observe that an
increase in tax leads to a reduction in lung cancer, then one
could infer that smoking is a “cause” of lung cancer (though
possibly indirectly itself - i.e. it’s not “smoking” per se, but
the tar in the lungs, etc.).
Instrumental Variables (IV): Assumptions: Z → X
Instrumental Variables (IV): Assumptions: Z → X
We must know (a priori) that the causal direction is Z → X
and not X → Z.
Instrumental Variables (IV): Assumptions: Z → X
We must know (a priori) that the causal direction is Z → X
and not X → Z.
This is what makes the causal structure unique and
identifiable.
Instrumental Variables (IV): Assumptions: Z → X
We must know (a priori) that the causal direction is Z → X
and not X → Z.
This is what makes the causal structure unique and
identifiable.
Note this does not mean that the Z we choose has to be the
“true” cause of X. For example, if Z is a SNP, we can choose
a SNP in linkage disequilibrium with Z, so long as it is
independent of all the other variables but still correlated with
X.
Instrumental Variables (IV): Assumptions: Z → X
We must know (a priori) that the causal direction is Z → X
and not X → Z.
This is what makes the causal structure unique and
identifiable.
Note this does not mean that the Z we choose has to be the
“true” cause of X. For example, if Z is a SNP, we can choose
a SNP in linkage disequilibrium with Z, so long as it is
independent of all the other variables but still correlated with
X.
The more correlated Z is with X, the better.
Example: Can use associated SNP as instrument
Instrumental Variables (IV): Assumptions: Z ⊥⊥ U
Instrumental Variables (IV): Assumptions: Z ⊥⊥ U
No factor can affect both the instrument and the effects. For
example, there cannot be a factor that causes both higher
taxes and less cancer (e.g. differences in health awareness in
different countries).
Instrumental Variables (IV): Assumptions: No Z → Y
Instrumental Variables (IV): Assumptions: No Z → Y
Z cannot directly affect Y (or indirectly, except through X).
Instrumental Variables (IV): Assumptions: No Z → Y
Z cannot directly affect Y (or indirectly, except through X).
I.e. there cannot exist other mechanisms by which Z affects Y
(i.e. high tobacco tax increases substance abuse, leading to
higher rates of cancer)
Instrumental Variables (IV): Assumptions: Faithfulness
Instrumental Variables (IV): Assumptions: Faithfulness
Assume that the true underlying DAG manifests itself in the
observed data
Instrumental Variables (IV): Assumptions: Faithfulness
Assume that the true underlying DAG manifests itself in the
observed data
I.e. causal effects do not cancel out
Instrumental Variables (IV): Assumptions: Faithfulness
Assume that the true underlying DAG manifests itself in the
observed data
I.e. causal effects do not cancel out
Reasonable assumption, because the contrary would require
very specific parameters
Instrumental Variables (IV): Assumptions: Faithfulness
Assume that the true underlying DAG manifests itself in the
observed data
I.e. causal effects do not cancel out
Reasonable assumption, because the contrary would require
very specific parameters
But note that if relations are deterministic, implied
Conditional Independencies do not hold and faithfulness is
violated
Instrumental Variables (IV): Assumptions: Faithfulness
Assume that the true underlying DAG manifests itself in the
observed data
I.e. causal effects do not cancel out
Reasonable assumption, because the contrary would require
very specific parameters
But note that if relations are deterministic, implied
Conditional Independencies do not hold and faithfulness is
violated
Note that, in practice, the sample size is important in testing
the independencies in the data.
Instrumental Variables (IV): Assumptions: DAG
What is Mendelian Randomisation?: Original example
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
Do low serum cholesterol levels increase cancer risk?
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
Do low serum cholesterol levels increase cancer risk?
But maybe both cancer risk and cholesterol levels are affected
by diet (confounders)
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
Do low serum cholesterol levels increase cancer risk?
But maybe both cancer risk and cholesterol levels are affected
by diet (confounders)
Or latent tumours cause the lower cholesterol level (reverse
causation)
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
Do low serum cholesterol levels increase cancer risk?
But maybe both cancer risk and cholesterol levels are affected
by diet (confounders)
Or latent tumours cause the lower cholesterol level (reverse
causation)
But patients with Abetalipoproteinemia (inability to absorb
cholesterol) - did not appear predisposed to cancer
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
Do low serum cholesterol levels increase cancer risk?
But maybe both cancer risk and cholesterol levels are affected
by diet (confounders)
Or latent tumours cause the lower cholesterol level (reverse
causation)
But patients with Abetalipoproteinemia (inability to absorb
cholesterol) - did not appear predisposed to cancer
Led Katan to idea of finding a large group genetically
predisposed to lower cholesterol levels
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
Do low serum cholesterol levels increase cancer risk?
But maybe both cancer risk and cholesterol levels are affected
by diet (confounders)
Or latent tumours cause the lower cholesterol level (reverse
causation)
But patients with Abetalipoproteinemia (inability to absorb
cholesterol) - did not appear predisposed to cancer
Led Katan to idea of finding a large group genetically
predisposed to lower cholesterol levels
This is Mendelian Randomisation.
What is Mendelian Randomisation?: Original example
Katan MB (1986): Apolipoprotein E isoforms, serum
cholesterol, and cancer.
Do low serum cholesterol levels increase cancer risk?
But maybe both cancer risk and cholesterol levels are affected
by diet (confounders)
Or latent tumours cause the lower cholesterol level (reverse
causation)
But patients with Abetalipoproteinemia (inability to absorb
cholesterol) - did not appear predisposed to cancer
Led Katan to idea of finding a large group genetically
predisposed to lower cholesterol levels
This is Mendelian Randomisation.
Note this does not require that the genetic variants are direct
determinants of health. But, uses the association to improve
inferences of the effects of modifiable environmental risks on
health.
What is Mendelian Randomisation?: Original example
What is Mendelian Randomisation?: Original example
Apolipoprotein E (ApoE) gene was known to affect serum
cholesterol, with the ApoE2 variant being associated with
lower levels.
What is Mendelian Randomisation?: Original example
Apolipoprotein E (ApoE) gene was known to affect serum
cholesterol, with the ApoE2 variant being associated with
lower levels.
Many individuals carry ApoE2 variant and so have lower
cholesterol levels from birth
What is Mendelian Randomisation?: Original example
Apolipoprotein E (ApoE) gene was known to affect serum
cholesterol, with the ApoE2 variant being associated with
lower levels.
Many individuals carry ApoE2 variant and so have lower
cholesterol levels from birth
Since genes are randomly assigned during meiosis (due to
recombination), ApoE2 carriers should not be different from
ApoE carriers in any other way (diet, etc.), so there is no
confounding via the genome - note these assumptions.
What is Mendelian Randomisation?: Original example
Apolipoprotein E (ApoE) gene was known to affect serum
cholesterol, with the ApoE2 variant being associated with
lower levels.
Many individuals carry ApoE2 variant and so have lower
cholesterol levels from birth
Since genes are randomly assigned during meiosis (due to
recombination), ApoE2 carriers should not be different from
ApoE carriers in any other way (diet, etc.), so there is no
confounding via the genome - note these assumptions.
Therefore if low serum cholesterol is really causal for cancer,
the cancer patients should have more ApoE2 alleles than the
controls - if not then the levels would be similar in both
groups.
What is Mendelian Randomisation?: Original example
What is Mendelian Randomisation?: Original example
Katan only provided the suggestion, but the method has since
been used for many different analyses with some success, such
as the link between blood pressure and stroke risk.
What is Mendelian Randomisation?: Original example
Katan only provided the suggestion, but the method has since
been used for many different analyses with some success, such
as the link between blood pressure and stroke risk.
However, some conclusions have later been disproved by
Randomised Control Trials. To understand why, we must
consider the biological assumptions.
Panmixia
Panmixia
Recall the assumption that the genotype is randomly assigned
- this implies panmixia
Panmixia
Recall the assumption that the genotype is randomly assigned
- this implies panmixia
That is, there is no selective breeding (so random mating)
Panmixia
Recall the assumption that the genotype is randomly assigned
- this implies panmixia
That is, there is no selective breeding (so random mating)
Implies that all recombination is possible
Panmixia
Recall the assumption that the genotype is randomly assigned
- this implies panmixia
That is, there is no selective breeding (so random mating)
Implies that all recombination is possible
In our DAG, this means that G is not influences by Y (or other
variables)
Panmixia
Recall the assumption that the genotype is randomly assigned
- this implies panmixia
That is, there is no selective breeding (so random mating)
Implies that all recombination is possible
In our DAG, this means that G is not influences by Y (or other
variables)
Not entirely accurate, as demonstrated by Population
Stratification
Population Stratification
Population Stratification
Systematic difference in allele frequencies between
subpopulations, due to ancestry
Population Stratification
Systematic difference in allele frequencies between
subpopulations, due to ancestry
For example, physical separation leads to non-random mating
Population Stratification
Systematic difference in allele frequencies between
subpopulations, due to ancestry
For example, physical separation leads to non-random mating
Leads to different genetic drift in different subpopulations (i.e.
changes in allele frequency over time due to random sampling)
Population Stratification
Systematic difference in allele frequencies between
subpopulations, due to ancestry
For example, physical separation leads to non-random mating
Leads to different genetic drift in different subpopulations (i.e.
changes in allele frequency over time due to random sampling)
Means that the genotype is not randomly assigned when
considered across sub-populations
Population Stratification
Systematic difference in allele frequencies between
subpopulations, due to ancestry
For example, physical separation leads to non-random mating
Leads to different genetic drift in different subpopulations (i.e.
changes in allele frequency over time due to random sampling)
Means that the genotype is not randomly assigned when
considered across sub-populations
E.g. Lactose intolerance
Canalization
Canalization
Variation in robustness of phenotypes to genotype and
environments
Canalization
Variation in robustness of phenotypes to genotype and
environments
Waddington Drosophilia experiment:
Exposed Drosophilia pupae to heat shock
Developed Cross-veinless phenotype (no cross-veins in wings)
By selecting for this phenotype, eventually appears without
heat shock
Led to theory of organisms rolling downhill in to “canals” of
the epigenetic landscape with development, becoming more
robust to variation
Think of it like an optimisation problem
Canalization
Variation in robustness of phenotypes to genotype and
environments
Waddington Drosophilia experiment:
Exposed Drosophilia pupae to heat shock
Developed Cross-veinless phenotype (no cross-veins in wings)
By selecting for this phenotype, eventually appears without
heat shock
Led to theory of organisms rolling downhill in to “canals” of
the epigenetic landscape with development, becoming more
robust to variation
Think of it like an optimisation problem
Exact mechanisms unknown
Canalization
Variation in robustness of phenotypes to genotype and
environments
Waddington Drosophilia experiment:
Exposed Drosophilia pupae to heat shock
Developed Cross-veinless phenotype (no cross-veins in wings)
By selecting for this phenotype, eventually appears without
heat shock
Led to theory of organisms rolling downhill in to “canals” of
the epigenetic landscape with development, becoming more
robust to variation
Think of it like an optimisation problem
Exact mechanisms unknown
Acts as confounder between genotype, environment and
phenotype
Pleiotropy
Pleiotropy
One gene can affect many (even seemingly unrelated)
phenotypes
Pleiotropy
One gene can affect many (even seemingly unrelated)
phenotypes
Mendelian Randomisation makes the assumption of no
pleiotropy
Pleiotropy
One gene can affect many (even seemingly unrelated)
phenotypes
Mendelian Randomisation makes the assumption of no
pleiotropy
In this case, this means that we know the genotype is only
influencing the phenotype via the considered exposure
Pleiotropy
One gene can affect many (even seemingly unrelated)
phenotypes
Mendelian Randomisation makes the assumption of no
pleiotropy
In this case, this means that we know the genotype is only
influencing the phenotype via the considered exposure
I.e. ApoE2 only affects serum cholesterol levels, and cannot
affect cancer risk by other, unobserved means.
Pleiotropy
One gene can affect many (even seemingly unrelated)
phenotypes
Mendelian Randomisation makes the assumption of no
pleiotropy
In this case, this means that we know the genotype is only
influencing the phenotype via the considered exposure
I.e. ApoE2 only affects serum cholesterol levels, and cannot
affect cancer risk by other, unobserved means.
This is a big assumption, prior knowledge is necessary.
Pleiotropy
One gene can affect many (even seemingly unrelated)
phenotypes
Mendelian Randomisation makes the assumption of no
pleiotropy
In this case, this means that we know the genotype is only
influencing the phenotype via the considered exposure
I.e. ApoE2 only affects serum cholesterol levels, and cannot
affect cancer risk by other, unobserved means.
This is a big assumption, prior knowledge is necessary.
If possible, using multiple, independent SNPs (instruments)
helps to alleviate this issue (as if they are all consistent then it
is unlikely that they all have other pathways causing the same
change) - but note they must not be in Linkage
Disequilibrium!
The real underlying DAG?
Conclusion
Instrumental variables are a method to infer causal relations
from observational data, given certain assumptions.
Applied in Genetic Epidemiology with Mendelian
Randomisation.
Has had some success, but underlying biology poses problems.
Can we improve robustness with more measurements of
intermediate phenotypes? (gene methylation, RNAseq,
proteomics) - multi-step Mendelian Randomisation
Can we improve identification of appropriate instruments?
(e.g. whole genome sequencing makes it easier to identify
population stratification)
Thanks for your time
Questions?

More Related Content

What's hot

Calibration and Quality controls of automated hematology analyzer
Calibration and Quality controls of automated hematology analyzerCalibration and Quality controls of automated hematology analyzer
Calibration and Quality controls of automated hematology analyzerPranav S
 
Parametric and Non Parametric methods
Parametric and Non Parametric methods Parametric and Non Parametric methods
Parametric and Non Parametric methods Kishor6460
 
Predictive value and likelihood ratio
Predictive value and likelihood ratio Predictive value and likelihood ratio
Predictive value and likelihood ratio Abino David
 
Regression analysis algorithm
Regression analysis algorithm Regression analysis algorithm
Regression analysis algorithm Sammer Qader
 
Student's t test and variations
Student's t test and variationsStudent's t test and variations
Student's t test and variationsRizwan S A
 
Preanalytical quality control practices in clinical laboratory
Preanalytical quality control practices in clinical laboratoryPreanalytical quality control practices in clinical laboratory
Preanalytical quality control practices in clinical laboratoryDr. Rajesh Bendre
 
How to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curveHow to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curveSamir Haffar
 
Experimental Studies
Experimental StudiesExperimental Studies
Experimental StudiesAbhijit Das
 
Preparing and publishing a scientific manuscript
Preparing and publishing a scientific manuscriptPreparing and publishing a scientific manuscript
Preparing and publishing a scientific manuscriptSreenivasa Reddy Thalla
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statisticsAshok Kulkarni
 
Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...
Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...
Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...Daniel Katz
 
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...Stats Statswork
 
Quality Control in Clinical Chemistry
Quality Control in Clinical ChemistryQuality Control in Clinical Chemistry
Quality Control in Clinical ChemistryDiganta Dey
 
Experimental designs
Experimental designsExperimental designs
Experimental designsrx_sonali
 
Radioimmunoassay
RadioimmunoassayRadioimmunoassay
RadioimmunoassayFarha Banu
 

What's hot (20)

Calibration and Quality controls of automated hematology analyzer
Calibration and Quality controls of automated hematology analyzerCalibration and Quality controls of automated hematology analyzer
Calibration and Quality controls of automated hematology analyzer
 
Parametric and Non Parametric methods
Parametric and Non Parametric methods Parametric and Non Parametric methods
Parametric and Non Parametric methods
 
Authorship
AuthorshipAuthorship
Authorship
 
Roc curves
Roc curvesRoc curves
Roc curves
 
Predictive value and likelihood ratio
Predictive value and likelihood ratio Predictive value and likelihood ratio
Predictive value and likelihood ratio
 
Regression analysis algorithm
Regression analysis algorithm Regression analysis algorithm
Regression analysis algorithm
 
Student's t test and variations
Student's t test and variationsStudent's t test and variations
Student's t test and variations
 
Preanalytical quality control practices in clinical laboratory
Preanalytical quality control practices in clinical laboratoryPreanalytical quality control practices in clinical laboratory
Preanalytical quality control practices in clinical laboratory
 
How to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curveHow to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curve
 
Experimental Studies
Experimental StudiesExperimental Studies
Experimental Studies
 
Preparing and publishing a scientific manuscript
Preparing and publishing a scientific manuscriptPreparing and publishing a scientific manuscript
Preparing and publishing a scientific manuscript
 
Randomization
Randomization Randomization
Randomization
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statistics
 
Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...
Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...
Quantitative Methods for Lawyers - Class #9 - Bayes Theorem (Part 2), Skewnes...
 
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
 
Study designs
Study designsStudy designs
Study designs
 
Quality Control in Clinical Chemistry
Quality Control in Clinical ChemistryQuality Control in Clinical Chemistry
Quality Control in Clinical Chemistry
 
Experimental designs
Experimental designsExperimental designs
Experimental designs
 
Plagiarism
PlagiarismPlagiarism
Plagiarism
 
Radioimmunoassay
RadioimmunoassayRadioimmunoassay
Radioimmunoassay
 

Similar to Mendelian Randomisation

Bradford Hill Criteria.ppt
Bradford Hill Criteria.pptBradford Hill Criteria.ppt
Bradford Hill Criteria.pptexternalReviewer
 
1 A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd
1  A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd1  A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd
1 A PRIMER ON CAUSALITY Marc F. Bellemare∗ IntrodVannaJoy20
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxkarlhennesey
 
Probability Distribution
Probability DistributionProbability Distribution
Probability DistributionShahnawazAli81
 
Methodological Mistakes and Econometric Consequences
Methodological Mistakes and Econometric ConsequencesMethodological Mistakes and Econometric Consequences
Methodological Mistakes and Econometric ConsequencesAsad Zaman
 
lecture1.ppt
lecture1.pptlecture1.ppt
lecture1.pptvarunvs7
 
Debunk bullshit in statistics QN
Debunk bullshit in statistics QNDebunk bullshit in statistics QN
Debunk bullshit in statistics QNQuan Nguyen
 
Confounding and Directed Acyclic Graphs
Confounding and Directed Acyclic GraphsConfounding and Directed Acyclic Graphs
Confounding and Directed Acyclic GraphsDarren L Dahly PhD
 
PSY 341 Judgement, Decisions, Reasoning Notes Abyana
PSY 341 Judgement, Decisions, Reasoning Notes AbyanaPSY 341 Judgement, Decisions, Reasoning Notes Abyana
PSY 341 Judgement, Decisions, Reasoning Notes AbyanaNurulAbyana
 
Bayesian networks and the search for causality
Bayesian networks and the search for causalityBayesian networks and the search for causality
Bayesian networks and the search for causalityBayes Nets meetup London
 
Surviving statistics lecture 1
Surviving statistics lecture 1Surviving statistics lecture 1
Surviving statistics lecture 1MikeBlyth
 
Research methods(2)
Research methods(2)Research methods(2)
Research methods(2)geoghanm
 

Similar to Mendelian Randomisation (20)

Bradford Hill Criteria.ppt
Bradford Hill Criteria.pptBradford Hill Criteria.ppt
Bradford Hill Criteria.ppt
 
1 A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd
1  A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd1  A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd
1 A PRIMER ON CAUSALITY Marc F. Bellemare∗ Introd
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
 
Research by MAGIC
Research by MAGICResearch by MAGIC
Research by MAGIC
 
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
 
12 confounding
12 confounding12 confounding
12 confounding
 
Methodological Mistakes and Econometric Consequences
Methodological Mistakes and Econometric ConsequencesMethodological Mistakes and Econometric Consequences
Methodological Mistakes and Econometric Consequences
 
Abc4
Abc4Abc4
Abc4
 
DESCRIPTIVE EPIDEMIOLOGY
DESCRIPTIVE EPIDEMIOLOGYDESCRIPTIVE EPIDEMIOLOGY
DESCRIPTIVE EPIDEMIOLOGY
 
Lesson 24
Lesson 24Lesson 24
Lesson 24
 
lecture1.ppt
lecture1.pptlecture1.ppt
lecture1.ppt
 
Debunk bullshit in statistics QN
Debunk bullshit in statistics QNDebunk bullshit in statistics QN
Debunk bullshit in statistics QN
 
Confounding and Directed Acyclic Graphs
Confounding and Directed Acyclic GraphsConfounding and Directed Acyclic Graphs
Confounding and Directed Acyclic Graphs
 
Study designs
Study designsStudy designs
Study designs
 
Econometrics ch2
Econometrics ch2Econometrics ch2
Econometrics ch2
 
PSY 341 Judgement, Decisions, Reasoning Notes Abyana
PSY 341 Judgement, Decisions, Reasoning Notes AbyanaPSY 341 Judgement, Decisions, Reasoning Notes Abyana
PSY 341 Judgement, Decisions, Reasoning Notes Abyana
 
Bayesian networks and the search for causality
Bayesian networks and the search for causalityBayesian networks and the search for causality
Bayesian networks and the search for causality
 
Surviving statistics lecture 1
Surviving statistics lecture 1Surviving statistics lecture 1
Surviving statistics lecture 1
 
Chapter 1 Ap Psych- Research Methods
Chapter 1 Ap Psych- Research MethodsChapter 1 Ap Psych- Research Methods
Chapter 1 Ap Psych- Research Methods
 
Research methods(2)
Research methods(2)Research methods(2)
Research methods(2)
 

Recently uploaded

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 

Recently uploaded (20)

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 

Mendelian Randomisation

  • 1. Mendelian Randomisation James McMurray PhD Student Department of Empirical Inference 29/07/2014
  • 2. What is Mendelian Randomisation?
  • 3. What is Mendelian Randomisation? Approach to test for a causal effect from observational data in the presence of certain confounding factors.
  • 4. What is Mendelian Randomisation? Approach to test for a causal effect from observational data in the presence of certain confounding factors. Uses the measured variation of genes of known function, to bound the causal effect of a modifiable exposure (environment) on a phenotype (disease).
  • 5. What is Mendelian Randomisation? Approach to test for a causal effect from observational data in the presence of certain confounding factors. Uses the measured variation of genes of known function, to bound the causal effect of a modifiable exposure (environment) on a phenotype (disease). Fundamental idea is that the genotypes are randomly assigned (due to meiosis).
  • 6. What is Mendelian Randomisation? Approach to test for a causal effect from observational data in the presence of certain confounding factors. Uses the measured variation of genes of known function, to bound the causal effect of a modifiable exposure (environment) on a phenotype (disease). Fundamental idea is that the genotypes are randomly assigned (due to meiosis). This allows them to be used as an instrumental variable.
  • 7. What is Mendelian Randomisation?: DAG
  • 9. Motivation: Why Observational Data? Randomised Control Trials (RCTs) are the gold standard for causal inference.
  • 10. Motivation: Why Observational Data? Randomised Control Trials (RCTs) are the gold standard for causal inference. However, it is often not ethical or possible to carry out RCTs.
  • 11. Motivation: Why Observational Data? Randomised Control Trials (RCTs) are the gold standard for causal inference. However, it is often not ethical or possible to carry out RCTs. E.g. we cannot randomly assign a lifetime of heavy smoking and no smoking to groups of individuals.
  • 12. Motivation: Why Observational Data? Randomised Control Trials (RCTs) are the gold standard for causal inference. However, it is often not ethical or possible to carry out RCTs. E.g. we cannot randomly assign a lifetime of heavy smoking and no smoking to groups of individuals. This leads to a need to use observational data.
  • 13. Motivation: Why Observational Data? Randomised Control Trials (RCTs) are the gold standard for causal inference. However, it is often not ethical or possible to carry out RCTs. E.g. we cannot randomly assign a lifetime of heavy smoking and no smoking to groups of individuals. This leads to a need to use observational data. But this requires many assumptions.
  • 15. Instrumental Variables (IV) In the previous DAG, G is the instrumental variable (instrument).
  • 16. Instrumental Variables (IV) In the previous DAG, G is the instrumental variable (instrument). Because it affects Y only through X (exclusively)
  • 17. Instrumental Variables (IV) In the previous DAG, G is the instrumental variable (instrument). Because it affects Y only through X (exclusively) Therefore, under certain assumptions, if G is correlated with Y then we can infer the edge X → Y
  • 18. Instrumental Variables (IV) In the previous DAG, G is the instrumental variable (instrument). Because it affects Y only through X (exclusively) Therefore, under certain assumptions, if G is correlated with Y then we can infer the edge X → Y First we will consider an example
  • 20. Instrumental Variables (IV): Example If the assumptions are met, then if we observe that an increase in tax leads to a reduction in lung cancer, then one could infer that smoking is a “cause” of lung cancer (though possibly indirectly itself - i.e. it’s not “smoking” per se, but the tar in the lungs, etc.).
  • 21. Instrumental Variables (IV): Assumptions: Z → X
  • 22. Instrumental Variables (IV): Assumptions: Z → X We must know (a priori) that the causal direction is Z → X and not X → Z.
  • 23. Instrumental Variables (IV): Assumptions: Z → X We must know (a priori) that the causal direction is Z → X and not X → Z. This is what makes the causal structure unique and identifiable.
  • 24. Instrumental Variables (IV): Assumptions: Z → X We must know (a priori) that the causal direction is Z → X and not X → Z. This is what makes the causal structure unique and identifiable. Note this does not mean that the Z we choose has to be the “true” cause of X. For example, if Z is a SNP, we can choose a SNP in linkage disequilibrium with Z, so long as it is independent of all the other variables but still correlated with X.
  • 25. Instrumental Variables (IV): Assumptions: Z → X We must know (a priori) that the causal direction is Z → X and not X → Z. This is what makes the causal structure unique and identifiable. Note this does not mean that the Z we choose has to be the “true” cause of X. For example, if Z is a SNP, we can choose a SNP in linkage disequilibrium with Z, so long as it is independent of all the other variables but still correlated with X. The more correlated Z is with X, the better.
  • 26. Example: Can use associated SNP as instrument
  • 27. Instrumental Variables (IV): Assumptions: Z ⊥⊥ U
  • 28. Instrumental Variables (IV): Assumptions: Z ⊥⊥ U No factor can affect both the instrument and the effects. For example, there cannot be a factor that causes both higher taxes and less cancer (e.g. differences in health awareness in different countries).
  • 29. Instrumental Variables (IV): Assumptions: No Z → Y
  • 30. Instrumental Variables (IV): Assumptions: No Z → Y Z cannot directly affect Y (or indirectly, except through X).
  • 31. Instrumental Variables (IV): Assumptions: No Z → Y Z cannot directly affect Y (or indirectly, except through X). I.e. there cannot exist other mechanisms by which Z affects Y (i.e. high tobacco tax increases substance abuse, leading to higher rates of cancer)
  • 32. Instrumental Variables (IV): Assumptions: Faithfulness
  • 33. Instrumental Variables (IV): Assumptions: Faithfulness Assume that the true underlying DAG manifests itself in the observed data
  • 34. Instrumental Variables (IV): Assumptions: Faithfulness Assume that the true underlying DAG manifests itself in the observed data I.e. causal effects do not cancel out
  • 35. Instrumental Variables (IV): Assumptions: Faithfulness Assume that the true underlying DAG manifests itself in the observed data I.e. causal effects do not cancel out Reasonable assumption, because the contrary would require very specific parameters
  • 36. Instrumental Variables (IV): Assumptions: Faithfulness Assume that the true underlying DAG manifests itself in the observed data I.e. causal effects do not cancel out Reasonable assumption, because the contrary would require very specific parameters But note that if relations are deterministic, implied Conditional Independencies do not hold and faithfulness is violated
  • 37. Instrumental Variables (IV): Assumptions: Faithfulness Assume that the true underlying DAG manifests itself in the observed data I.e. causal effects do not cancel out Reasonable assumption, because the contrary would require very specific parameters But note that if relations are deterministic, implied Conditional Independencies do not hold and faithfulness is violated Note that, in practice, the sample size is important in testing the independencies in the data.
  • 38. Instrumental Variables (IV): Assumptions: DAG
  • 39. What is Mendelian Randomisation?: Original example
  • 40. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer.
  • 41. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer. Do low serum cholesterol levels increase cancer risk?
  • 42. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer. Do low serum cholesterol levels increase cancer risk? But maybe both cancer risk and cholesterol levels are affected by diet (confounders)
  • 43. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer. Do low serum cholesterol levels increase cancer risk? But maybe both cancer risk and cholesterol levels are affected by diet (confounders) Or latent tumours cause the lower cholesterol level (reverse causation)
  • 44. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer. Do low serum cholesterol levels increase cancer risk? But maybe both cancer risk and cholesterol levels are affected by diet (confounders) Or latent tumours cause the lower cholesterol level (reverse causation) But patients with Abetalipoproteinemia (inability to absorb cholesterol) - did not appear predisposed to cancer
  • 45. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer. Do low serum cholesterol levels increase cancer risk? But maybe both cancer risk and cholesterol levels are affected by diet (confounders) Or latent tumours cause the lower cholesterol level (reverse causation) But patients with Abetalipoproteinemia (inability to absorb cholesterol) - did not appear predisposed to cancer Led Katan to idea of finding a large group genetically predisposed to lower cholesterol levels
  • 46. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer. Do low serum cholesterol levels increase cancer risk? But maybe both cancer risk and cholesterol levels are affected by diet (confounders) Or latent tumours cause the lower cholesterol level (reverse causation) But patients with Abetalipoproteinemia (inability to absorb cholesterol) - did not appear predisposed to cancer Led Katan to idea of finding a large group genetically predisposed to lower cholesterol levels This is Mendelian Randomisation.
  • 47. What is Mendelian Randomisation?: Original example Katan MB (1986): Apolipoprotein E isoforms, serum cholesterol, and cancer. Do low serum cholesterol levels increase cancer risk? But maybe both cancer risk and cholesterol levels are affected by diet (confounders) Or latent tumours cause the lower cholesterol level (reverse causation) But patients with Abetalipoproteinemia (inability to absorb cholesterol) - did not appear predisposed to cancer Led Katan to idea of finding a large group genetically predisposed to lower cholesterol levels This is Mendelian Randomisation. Note this does not require that the genetic variants are direct determinants of health. But, uses the association to improve inferences of the effects of modifiable environmental risks on health.
  • 48. What is Mendelian Randomisation?: Original example
  • 49. What is Mendelian Randomisation?: Original example Apolipoprotein E (ApoE) gene was known to affect serum cholesterol, with the ApoE2 variant being associated with lower levels.
  • 50. What is Mendelian Randomisation?: Original example Apolipoprotein E (ApoE) gene was known to affect serum cholesterol, with the ApoE2 variant being associated with lower levels. Many individuals carry ApoE2 variant and so have lower cholesterol levels from birth
  • 51. What is Mendelian Randomisation?: Original example Apolipoprotein E (ApoE) gene was known to affect serum cholesterol, with the ApoE2 variant being associated with lower levels. Many individuals carry ApoE2 variant and so have lower cholesterol levels from birth Since genes are randomly assigned during meiosis (due to recombination), ApoE2 carriers should not be different from ApoE carriers in any other way (diet, etc.), so there is no confounding via the genome - note these assumptions.
  • 52. What is Mendelian Randomisation?: Original example Apolipoprotein E (ApoE) gene was known to affect serum cholesterol, with the ApoE2 variant being associated with lower levels. Many individuals carry ApoE2 variant and so have lower cholesterol levels from birth Since genes are randomly assigned during meiosis (due to recombination), ApoE2 carriers should not be different from ApoE carriers in any other way (diet, etc.), so there is no confounding via the genome - note these assumptions. Therefore if low serum cholesterol is really causal for cancer, the cancer patients should have more ApoE2 alleles than the controls - if not then the levels would be similar in both groups.
  • 53. What is Mendelian Randomisation?: Original example
  • 54. What is Mendelian Randomisation?: Original example Katan only provided the suggestion, but the method has since been used for many different analyses with some success, such as the link between blood pressure and stroke risk.
  • 55. What is Mendelian Randomisation?: Original example Katan only provided the suggestion, but the method has since been used for many different analyses with some success, such as the link between blood pressure and stroke risk. However, some conclusions have later been disproved by Randomised Control Trials. To understand why, we must consider the biological assumptions.
  • 57. Panmixia Recall the assumption that the genotype is randomly assigned - this implies panmixia
  • 58. Panmixia Recall the assumption that the genotype is randomly assigned - this implies panmixia That is, there is no selective breeding (so random mating)
  • 59. Panmixia Recall the assumption that the genotype is randomly assigned - this implies panmixia That is, there is no selective breeding (so random mating) Implies that all recombination is possible
  • 60. Panmixia Recall the assumption that the genotype is randomly assigned - this implies panmixia That is, there is no selective breeding (so random mating) Implies that all recombination is possible In our DAG, this means that G is not influences by Y (or other variables)
  • 61. Panmixia Recall the assumption that the genotype is randomly assigned - this implies panmixia That is, there is no selective breeding (so random mating) Implies that all recombination is possible In our DAG, this means that G is not influences by Y (or other variables) Not entirely accurate, as demonstrated by Population Stratification
  • 63. Population Stratification Systematic difference in allele frequencies between subpopulations, due to ancestry
  • 64. Population Stratification Systematic difference in allele frequencies between subpopulations, due to ancestry For example, physical separation leads to non-random mating
  • 65. Population Stratification Systematic difference in allele frequencies between subpopulations, due to ancestry For example, physical separation leads to non-random mating Leads to different genetic drift in different subpopulations (i.e. changes in allele frequency over time due to random sampling)
  • 66. Population Stratification Systematic difference in allele frequencies between subpopulations, due to ancestry For example, physical separation leads to non-random mating Leads to different genetic drift in different subpopulations (i.e. changes in allele frequency over time due to random sampling) Means that the genotype is not randomly assigned when considered across sub-populations
  • 67. Population Stratification Systematic difference in allele frequencies between subpopulations, due to ancestry For example, physical separation leads to non-random mating Leads to different genetic drift in different subpopulations (i.e. changes in allele frequency over time due to random sampling) Means that the genotype is not randomly assigned when considered across sub-populations E.g. Lactose intolerance
  • 69. Canalization Variation in robustness of phenotypes to genotype and environments
  • 70. Canalization Variation in robustness of phenotypes to genotype and environments Waddington Drosophilia experiment: Exposed Drosophilia pupae to heat shock Developed Cross-veinless phenotype (no cross-veins in wings) By selecting for this phenotype, eventually appears without heat shock Led to theory of organisms rolling downhill in to “canals” of the epigenetic landscape with development, becoming more robust to variation Think of it like an optimisation problem
  • 71. Canalization Variation in robustness of phenotypes to genotype and environments Waddington Drosophilia experiment: Exposed Drosophilia pupae to heat shock Developed Cross-veinless phenotype (no cross-veins in wings) By selecting for this phenotype, eventually appears without heat shock Led to theory of organisms rolling downhill in to “canals” of the epigenetic landscape with development, becoming more robust to variation Think of it like an optimisation problem Exact mechanisms unknown
  • 72. Canalization Variation in robustness of phenotypes to genotype and environments Waddington Drosophilia experiment: Exposed Drosophilia pupae to heat shock Developed Cross-veinless phenotype (no cross-veins in wings) By selecting for this phenotype, eventually appears without heat shock Led to theory of organisms rolling downhill in to “canals” of the epigenetic landscape with development, becoming more robust to variation Think of it like an optimisation problem Exact mechanisms unknown Acts as confounder between genotype, environment and phenotype
  • 74. Pleiotropy One gene can affect many (even seemingly unrelated) phenotypes
  • 75. Pleiotropy One gene can affect many (even seemingly unrelated) phenotypes Mendelian Randomisation makes the assumption of no pleiotropy
  • 76. Pleiotropy One gene can affect many (even seemingly unrelated) phenotypes Mendelian Randomisation makes the assumption of no pleiotropy In this case, this means that we know the genotype is only influencing the phenotype via the considered exposure
  • 77. Pleiotropy One gene can affect many (even seemingly unrelated) phenotypes Mendelian Randomisation makes the assumption of no pleiotropy In this case, this means that we know the genotype is only influencing the phenotype via the considered exposure I.e. ApoE2 only affects serum cholesterol levels, and cannot affect cancer risk by other, unobserved means.
  • 78. Pleiotropy One gene can affect many (even seemingly unrelated) phenotypes Mendelian Randomisation makes the assumption of no pleiotropy In this case, this means that we know the genotype is only influencing the phenotype via the considered exposure I.e. ApoE2 only affects serum cholesterol levels, and cannot affect cancer risk by other, unobserved means. This is a big assumption, prior knowledge is necessary.
  • 79. Pleiotropy One gene can affect many (even seemingly unrelated) phenotypes Mendelian Randomisation makes the assumption of no pleiotropy In this case, this means that we know the genotype is only influencing the phenotype via the considered exposure I.e. ApoE2 only affects serum cholesterol levels, and cannot affect cancer risk by other, unobserved means. This is a big assumption, prior knowledge is necessary. If possible, using multiple, independent SNPs (instruments) helps to alleviate this issue (as if they are all consistent then it is unlikely that they all have other pathways causing the same change) - but note they must not be in Linkage Disequilibrium!
  • 81. Conclusion Instrumental variables are a method to infer causal relations from observational data, given certain assumptions. Applied in Genetic Epidemiology with Mendelian Randomisation. Has had some success, but underlying biology poses problems. Can we improve robustness with more measurements of intermediate phenotypes? (gene methylation, RNAseq, proteomics) - multi-step Mendelian Randomisation Can we improve identification of appropriate instruments? (e.g. whole genome sequencing makes it easier to identify population stratification) Thanks for your time Questions?