An analytic approach for interpretable predictive models in high dimensional data, in the presence of interactions with exposures

Sahir Rai Bhatnagar, PhD Candidate
Joint with Yi Yang, Mathieu Blanchette, Luigi Bouchard, Celia Greenwood
Biostatistics, McGill University
Preprint available at sahirbhatnagar.com
Simple Rule 11: Simulated Data ≠ Real Data
Motivation
one predictor variable at a time
[Figure: diagram linking Predictor Variable to Phenotype, with Test 1 through Test 5]
a network based view
[Figure: diagram linking Predictor Variable to Phenotype, with a single Test 1]
system level changes due to environment
[Figure: diagram of Predictor Variable, Phenotype and Environment (A, B), with a single Test 1]
Motivating Dataset: Newborn epigenetic adaptations to gestational diabetes exposure (Luigi Bouchard, USherbrooke)
Environment: Gestational Diabetes
Large Data: Child’s epigenome (p ≈ 450k)
Phenotype: Obesity measures
Differential Correlation between environments
[Figure: (a) Gestational diabetes affected pregnancy, (b) Controls]
NIH MRI brain study
Environment: Age
Large Data: Cortical Thickness (p ≈ 80k)
Phenotype: Intelligence
Goals of this study
Objective
(i) Can clustering that incorporates known covariate or exposure information improve prediction models?
(ii) Can the resulting clusters provide an easier route to interpretation?
Methods
ECLUST - our proposed method: 2 steps
Original Data, split by exposure: E = 0 and E = 1
1a) Gene Similarity
1b) Cluster Representation (one n × 1 vector per cluster)
2) Penalized Regression: Y (n × 1) ~ cluster representations + cluster representations × E
"the objective of statistical methods is the reduction of data. A quantity of data . . . is to be replaced by relatively few quantities which shall adequately represent . . . the relevant information contained in the original data."
- Sir R. A. Fisher, 1922
Step 1a: Method to detect gene clusters
(i) Hierarchical clustering (average linkage) with TOM (topological overlap matrix) [1] scoring dissimilarity [2]: |TOM_{E=1} − TOM_{E=0}|
(ii) Number of clusters chosen using the dynamicTreeCut algorithm [3]
[1] Ravasz et al., Science (2002)
[2] Klein Oros et al., Frontiers in Genetics (2016)
[3] Langfelder and Zhang, Bioinformatics (2008)
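The difference-of-TOMs score in Step 1a can be sketched as follows. This is a minimal illustration, not the ECLUST implementation: the adjacency used here (absolute correlation raised to a soft-threshold power of 6) and the helper names `tom`, `tom_difference` and `simulate` are assumptions introduced for the example. The sketch stops at the difference matrix, since the slides do not state how it is handed to the clustering routine.

```python
import numpy as np


def tom(x, power=6):
    """Topological overlap matrix for the columns of an n x p data matrix."""
    # Assumed adjacency: unsigned soft-thresholded correlation, |cor|^power.
    adj = np.abs(np.corrcoef(x, rowvar=False)) ** power
    np.fill_diagonal(adj, 0.0)
    shared = adj @ adj                      # l_ij = sum_u a_iu * a_uj
    k = adj.sum(axis=1)                     # connectivity k_i = sum_u a_iu
    overlap = (shared + adj) / (np.minimum.outer(k, k) + 1.0 - adj)
    np.fill_diagonal(overlap, 1.0)
    return overlap


def tom_difference(x, e):
    """|TOM_{E=1} - TOM_{E=0}|, with one TOM per exposure group."""
    e = np.asarray(e)
    return np.abs(tom(x[e == 1]) - tom(x[e == 0]))


def simulate(n_per_group=100, p=20, block=5, seed=0):
    """Hypothetical toy data: the first `block` columns correlate only when E = 1."""
    rng = np.random.default_rng(seed)
    e = np.repeat([0, 1], n_per_group)
    x = rng.normal(size=(2 * n_per_group, p))
    latent = rng.normal(size=n_per_group)
    x[e == 1, :block] = latent[:, None] + 0.3 * x[e == 1, :block]
    return x, e


x, e = simulate()
diff = tom_difference(x, e)
print(diff[:6, :6].round(2))   # top-left corner covers the exposure-sensitive block
```

In the toy data, entries of the difference matrix are large only among the features whose correlation structure changes with the exposure, which is the signal the clustering step is meant to pick up.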
Step 1b: Cluster Representation
(i) Average [4]
(ii) 1st Principal Component [5]
[4] Hastie et al., Genome Biology (2001), Park et al., Biostatistics (2007)
[5] Kendall, A Course in Multivariate analysis (1957)
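Both cluster summaries in Step 1b reduce each cluster to a single n × 1 vector. A minimal sketch, assuming cluster membership is given as an integer label per feature (the function names `cluster_average` and `cluster_first_pc` are hypothetical):

```python
import numpy as np


def cluster_average(x, labels):
    """One column per cluster: each subject's mean over that cluster's features."""
    labels = np.asarray(labels)
    return np.column_stack(
        [x[:, labels == c].mean(axis=1) for c in np.unique(labels)]
    )


def cluster_first_pc(x, labels):
    """One column per cluster: each subject's score on the cluster's first PC."""
    labels = np.asarray(labels)
    reps = []
    for c in np.unique(labels):
        block = x[:, labels == c]
        centered = block - block.mean(axis=0)
        u, s, _ = np.linalg.svd(centered, full_matrices=False)
        reps.append(u[:, 0] * s[0])         # PC1 scores; the sign is arbitrary
    return np.column_stack(reps)


# Two subjects, three features; features 0 and 1 form cluster 0, feature 2 is cluster 1.
x_small = np.array([[1.0, 3.0, 10.0], [2.0, 4.0, 20.0]])
print(cluster_average(x_small, [0, 0, 1]))
```

For a tightly correlated cluster the two summaries carry nearly the same information; they can differ when features within a cluster have opposite signs or very different variances.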
Step 2: Variable Selection
(i) Linear effects: Lasso, Elastic Net [6]
(ii) Non-linear effects: MARS (multivariate adaptive regression splines) [7]
[6] Tibshirani, JRSSB (1996), Zou and Hastie, JRSSB (2005)
[7] Friedman, Annals of Statistics (1991)
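For the linear case of Step 2, the regression on cluster representatives and their interactions with E can be sketched with a plain lasso. This is an illustration under stated assumptions: a textbook cyclic coordinate descent stands in for whatever solver the authors used, the penalty value is arbitrary, and `interaction_design` and `lasso` are hypothetical helper names. Elastic Net and MARS are not covered here.

```python
import numpy as np


def interaction_design(reps, e):
    """Columns: cluster representatives, E, then representatives x E."""
    e = np.asarray(e, dtype=float)
    return np.column_stack([reps, e, reps * e[:, None]])


def lasso(x, y, lam, n_sweeps=200):
    """Cyclic coordinate descent for (1/2n)||y - b0 - Xb||^2 + lam * ||b||_1."""
    n, p = x.shape
    x_mean, y_mean = x.mean(axis=0), y.mean()
    xc, resid = x - x_mean, y - y_mean      # centring absorbs the intercept
    scale = (xc ** 2).sum(axis=0) / n
    beta = np.zeros(p)
    for _ in range(n_sweeps):
        for j in range(p):
            if scale[j] == 0.0:
                continue                    # constant column carries no signal
            resid = resid + xc[:, j] * beta[j]      # partial residual without feature j
            rho = xc[:, j] @ resid / n
            beta[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / scale[j]
            resid = resid - xc[:, j] * beta[j]
    return y_mean - x_mean @ beta, beta


# Hypothetical toy example: only cluster 0 matters, and only through its interaction with E.
rng = np.random.default_rng(2)
n = 200
reps = rng.normal(size=(n, 3))
e = np.tile([0, 1], n // 2)
y = 2.0 * reps[:, 0] * e + 0.5 * rng.normal(size=n)
intercept, beta = lasso(interaction_design(reps, e), y, lam=0.1)
print(beta.round(2))   # order: rep0, rep1, rep2, E, rep0*E, rep1*E, rep2*E
```

Because the design holds one column per cluster rather than one per original feature, the number of candidate interaction terms stays small even when p is large.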
Simulation Study
Simulated TOM by Exposure Status
[Figure: (a) TOM(X_{E=1}), (b) TOM(X_{E=0})]
Difference of TOMs
[Figure: |TOM(X_{E=1}) − TOM(X_{E=0})|]
TOM based on all subjects
[Figure: TOM(X_all)]
Real Data Analysis
Gestational Diabetes: Prediction Performance
Gestational Diabetes: Interpretation of Clusters with IPA (Ingenuity Pathway Analysis)
• Canonical Pathways: 1,25-dihydroxyvitamin D3 Biosynthesis – vitamin D associated with obesity
• Diseases and Disorders: Hepatic System Disease – metabolism of glucose and lipids
• Physiological System Development and Function:
(i) Behavior and neurodevelopment – associated with obesity
(ii) Embryonic and organ development – gestational diabetes associated with macrosomia
NIHPD: Age
NIHPD: Income
Final Remarks
Discussion and Contributions
• Large system-wide changes are observed in many
environments (DNA methylation, cortical thickness, gene
expression)
• Environment dependent clustering can improve prediction
performance in high dimensional settings (n << p)
• Clusters can be interpreted but require much more expert
knowledge
• Leverages existing computationally fast algorithms and can run
on a laptop computer (p ≈ 10k)
• Software implementation in R: sahirbhatnagar.com
Limitations
• There must be a high-dimensional signature of the exposure
• Covariance estimation
• Currently limited to binary environment
• Interpretation can be difficult
Acknowledgements
• Dr. Celia Greenwood
• Dr. Blanchette and Dr. Yang
• Dr. Luigi Bouchard, André Anne Houde
• Dr. Steele, Dr. Kramer, Dr. Abrahamowicz
• Maxime Turgeon, Kevin McGregor, Lauren Mokry, Dr. Forest
• Greg Voisin, Dr. Forgetta, Dr. Klein
• Mothers and children from the study