SlideShare a Scribd company logo
Analysis of DNA Methylation and Gene
Expression data in Placenta tissue to
predict childhood obesity
An Integrative Approach
Bhatnagar SR1,2
, Houde A4,5
, Voisin G2
,
Bouchard L4,5
, Greenwood CMT1,2,3
1
Department of Epidemiology, Biostatistics and Occupational Health, McGill University
2
Lady Davis Institute, Jewish General Hospital, Montr´eal, QC
3
Departments of Oncology and Human Genetics, McGill University
4
Department of Biochemistry, Universit´e de Sherbrooke, QC
5
ECOGENE-21 and Lipid Clinic, Chicoutimi Hospital, QC
sahirbhatnagar.com/talks
Poster Session B, # 56
Motivating Question # 1
sahirbhatnagar.com Data Integration CHSGM 2015 2 / 25
Motivation
1 in 4 adult Canadians and 1 in 10 children are clinically obese.
Events during pregnancy are suspected to play a role in childhood
obesity → we don’t know about the mechanisms involved
Children born to women who had a gestational diabetes
mellitus-affected pregnancy are more likely to be overweight and obese
Evidence suggests epigenetic factors are important piece of the puzzle
sahirbhatnagar.com Data Integration CHSGM 2015 3 / 25
Motivating Question # 2
sahirbhatnagar.com Data Integration CHSGM 2015 4 / 25
Motivating Question
sample size
genomic data
25 50
Gene
Expression
Motivating Question
sample size
genomic data
25 50
Gene
Expression
DNA
Methylation
DNA
Methylation
Gene
Expression
Motivating Question
sample size
genomic data
25 50
Gene
Expression
DNA
Methylation
DNA
Methylation
Gene
Expression
??
?
sahirbhatnagar.com Data Integration CHSGM 2015 5 / 25
The Data
sahirbhatnagar.com Data Integration CHSGM 2015 6 / 25
The Data
Expression
HT-12 v4
p = 46, 889
Methylation
Illumina 450k
p = 375, 561
Gestational
Diabetes
n = 45
GD = 29
Placenta
n = 45
time
at birth age 5
| |
The Data
Expression
HT-12 v4
p = 46, 889
Methylation
Illumina 450k
p = 375, 561
Gestational
Diabetes
n = 45
GD = 29
Placenta
n = 45
time
at birth age 5
| |
X
The Data
Expression
HT-12 v4
p = 46, 889
Methylation
Illumina 450k
p = 375, 561
Gestational
Diabetes
n = 45
GD = 29
Placenta
n = 45
time
at birth age 5
| |
X
7 Fat
Measures
Child
n = 23
GD = 16
The Data
Expression
HT-12 v4
p = 46, 889
Methylation
Illumina 450k
p = 375, 561
Gestational
Diabetes
n = 45
GD = 29
Placenta
n = 45
time
at birth age 5
| |
X
7 Fat
Measures
Child
n = 23
GD = 16
Y
The Data
Expression
HT-12 v4
p = 46, 889
Methylation
Illumina 450k
p = 375, 561
Gestational
Diabetes
n = 45
GD = 29
Placenta
n = 45
time
at birth age 5
| |
X
7 Fat
Measures
Child
n = 23
GD = 16
Y
?
sahirbhatnagar.com Data Integration CHSGM 2015 7 / 25
Summarizing Expression,
Methylation and Gestational
Diabetes Phenotype in Placenta
Tissue
sahirbhatnagar.com Data Integration CHSGM 2015 8 / 25
Sparse Canonical Correlation Analysis (sCCA)
CCA requires calculation of XT
X
−1
and YT
Y
−1
When p + q >> n, these matrices are singular
sCCA applies an L1 penalty to the canonical vectors to obtain sparse
solutions (Witten et al., 2009; Parkhomenko et al., 2009)
Assumes XT
X = Ip, YT
Y = Iq
maximizeu,v uT
XT
Yv
subject to
u 2
2 ≤ 1, v 2
2 ≤ 1
and
P1(u) ≤ λ1, P2(v) ≤ λ2
sahirbhatnagar.com Data Integration CHSGM 2015 9 / 25
Supervised Sparse CCA
Main idea:
1. The features that are most associated with the outcome Q are
identified to form the reduced matrices X and Y
2. sCCA is performed on X and Y
sahirbhatnagar.com Data Integration CHSGM 2015 10 / 25
Importance of Gestational Diabetes Phenotype
0.88
0.90
0.92
0.94
0.96
0.98
#non−0expressionprobes
# non−0 methylation probes
Correlation
Gestational Diabetes Status Used in Sparse CCA
0.88
0.90
0.92
0.94
0.96
0.98
#non−0expressionprobes
# non−0 methylation probes
Correlation
Gestational Diabetes Status Not Used
sahirbhatnagar.com Data Integration CHSGM 2015 11 / 25
GO Stat Analysis for Enrichment
Enrichment Analysis based on non zero vector of 1st component from
the Supervised sCCA analysis
Genes associated with inflammatory processes
Table : Top list of enriched GO terms
GOBPID FDR OR E.Count Count Size Term
0002376 < 10−14
2.1 131.6 227 2178 immune system process
0006955 < 10−13
2.3 78.7 153 1303 immune response
0002252 < 10−9
2.7 34.1 80 565 immune effector process
0045087 < 10−8
2.3 49.0 99 811 innate immune response
0002682 < 10−8
2.1 66.56 122 1102 regulation of immune system process
0002684 < 10−8
2.4 40.1 84 664 positive regulation of immune system proces
0006952 < 10−8
1.9 84.5 144 1399 defense response
0050776 < 10−8
2.3 44.5 90 738 regulation of immune response
0050778 < 10−7
2.6 28.5 65 473 positive regulation of immune response
0006950 < 10−7
1.6 196.8 271 3258 response to stress
sahirbhatnagar.com Data Integration CHSGM 2015 12 / 25
Summarizing Bodyfat Measures
sahirbhatnagar.com Data Integration CHSGM 2015 13 / 25
Cluster 6 Bodyfat measures in 2 groups
34
14
8
16
7
6
38
30
20
25
13
3
12
11
17
21
39
31
19
37
28
32
18
Zscore BMI
percent fat
subscapularis
bicep
tricep
iliacus
−2 0 2
Value
Color Key
sahirbhatnagar.com Data Integration CHSGM 2015 14 / 25
Circle of Correlations
−1.0 −0.5 0.0 0.5 1.0
−1.0−0.50.00.51.0
Variables factor map (PCA)
Dim 1 (50.68%)
Dim2(15.41%)
Zscore BMI
percent fat
tricep
bicep
subscapularis
iliacus
sahirbhatnagar.com Data Integration CHSGM 2015 15 / 25
Combining Both Data
sahirbhatnagar.com Data Integration CHSGM 2015 16 / 25
Regression via Elastic Net
Expression
HT-12 v4
p = 46, 889
Methylation
Illumina 450k
p = 375, 561
Gestational
Diabetes
n = 45
GD = 29
Placenta
n = 45
time
at birth age 5
| |
X
7 Fat
Measures
Child
n = 23
GD = 16
Y
?
sahirbhatnagar.com Data Integration CHSGM 2015 17 / 25
1st PC as Summary Bodyfat Measure
3
8
32
14294
102
1443
187
12853
124
375563
36
37513
81
338052
197
75115
30
7505
188
67612
196
84503
37
9380
202
75125
1
2
3
4
data used to predict 1st PC of bodyfat measures
LOOCVmeansquarederror
data.type
Canonical Variables
Expr+Methy non 0 CCA factors
Expr non 0 CCA factors
Methy non 0 CCA factors
Expr+Methy Filter
Expr Filter low means
Methy Filter low var
Expr+Methy Filter low+t.test
Expr Filter low+t.test
Methy Filter low+t.test
Expr+Methy Filter t.test
Expr Filter t.test
Methy Filter t.test
sahirbhatnagar.com Data Integration CHSGM 2015 18 / 25
Ward Clustering Groups
1
8
22
14294
1
1443
20
12853
331
375563
1
37513
54
338052
6
75115
1
7505
6
67612
7
84503
1
9380
30
75125
0.0
0.1
0.2
0.3
0.4
0.5
data used to predict Ward clustering groups
LOOCVmisclassificationerror
data.type
Canonical Variables
Expr+Methy non 0 CCA factors
Expr non 0 CCA factors
Methy non 0 CCA factors
Expr+Methy Filter
Expr Filter low means
Methy Filter low var
Expr+Methy Filter low+t.test
Expr Filter low+t.test
Methy Filter low+t.test
Expr+Methy Filter t.test
Expr Filter t.test
Methy Filter t.test
sahirbhatnagar.com Data Integration CHSGM 2015 19 / 25
Neuropeptide Y Receptor (NPY1R)
From OMIM:
One of the most abundant neuropeptides in the mammalian
nervous system
Exhibits a diverse range of important physiologic activities,
including effects on food intake
Have been identified in a variety of tissues, including
placenta (Herzog et al., 1992).
sahirbhatnagar.com Data Integration CHSGM 2015 20 / 25
Motivating Question #2: My Answer
sample size
genomic data
25 50
Gene
Expression
DNA
Methylation
DNA
Methylation
Gene
Expression
sahirbhatnagar.com Data Integration CHSGM 2015 21 / 25
Big Data
sahirbhatnagar.com Data Integration CHSGM 2015 22 / 25
Big Data
Data Integration
sahirbhatnagar.com Data Integration CHSGM 2015 22 / 25
Big Data
Data Integration
Machine Learning
sahirbhatnagar.com Data Integration CHSGM 2015 22 / 25
Smalln Data
sahirbhatnagar.com Data Integration CHSGM 2015 23 / 25
Acknowledgements
Celia Greenwood and
Mathieu Blanchette
Greg Voisin, Andr´ee-Anne
Houde, Luigi Bouchard
All the mothers and children
that took part in this study
You
sahirbhatnagar.com Data Integration CHSGM 2015 24 / 25
References
Principal component analysis plots and beamer template. URL
http://gastonsanchez.com/.
Elena Parkhomenko, David Tritchler, and Joseph Beyene. Sparse canonical
correlation analysis with application to genomic data integration.
Statistical Applications in Genetics and Molecular Biology, 8(1):1–34,
2009.
Daniela M Witten and Robert J Tibshirani. Extensions of sparse canonical
correlation analysis with applications to genomic data. Statistical
applications in genetics and molecular biology, 8(1):1–27, 2009.
Daniela M Witten, Robert Tibshirani, and Trevor Hastie. A penalized
matrix decomposition, with applications to sparse principal components
and canonical correlation analysis. Biostatistics, page kxp008, 2009.
sahirbhatnagar.com Data Integration CHSGM 2015 25 / 25

More Related Content

Similar to Analysis of DNA methylation and Gene expression to predict childhood obesity

Antiretroviral intensification to prevent intrapartum HIV transmission in lat...
Antiretroviral intensification to prevent intrapartum HIV transmission in lat...Antiretroviral intensification to prevent intrapartum HIV transmission in lat...
Antiretroviral intensification to prevent intrapartum HIV transmission in lat...
Илья Антипин
 
48999
4899948999
48999
seishiro00
 
progesterone & Miscarriage.pptx
progesterone & Miscarriage.pptxprogesterone & Miscarriage.pptx
progesterone & Miscarriage.pptx
Hesham Al-Inany
 
maternal and child under nutrition . what works copy.pptx
maternal and child under nutrition . what works copy.pptxmaternal and child under nutrition . what works copy.pptx
maternal and child under nutrition . what works copy.pptx
Mujeeb Ahmed Wadho
 
Diagnosis and Treatment of Metritis
Diagnosis and Treatment of MetritisDiagnosis and Treatment of Metritis
Diagnosis and Treatment of Metritis
DAIReXNET
 
Using Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical DevelopmentsUsing Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical Developments
Tao You
 
Success beyond numbers: The Salud Mesoamerica Initiative’s results-based fin...
Success beyond numbers:  The Salud Mesoamerica Initiative’s results-based fin...Success beyond numbers:  The Salud Mesoamerica Initiative’s results-based fin...
Success beyond numbers: The Salud Mesoamerica Initiative’s results-based fin...
RBFHealth
 
Salud Mesoamérica 2015 Initiative: Select results from the first operation me...
Salud Mesoamérica 2015 Initiative: Select results from the first operation me...Salud Mesoamérica 2015 Initiative: Select results from the first operation me...
Salud Mesoamérica 2015 Initiative: Select results from the first operation me...
Institute for Health Metrics and Evaluation - University of Washington
 
Pharmacogenomic Clinical Studies
Pharmacogenomic Clinical StudiesPharmacogenomic Clinical Studies
Pharmacogenomic Clinical Studies
Ryan Squire
 
Room a a08. mcgee-utility of qsofa lactate sepsis_(en)
Room a a08. mcgee-utility of qsofa lactate sepsis_(en)Room a a08. mcgee-utility of qsofa lactate sepsis_(en)
Room a a08. mcgee-utility of qsofa lactate sepsis_(en)
SoM
 
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
hivlifeinfo
 
Meaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine researchMeaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine research
Nolan Nichols
 
Calprotectin europaediatrics 2009
Calprotectin europaediatrics 2009Calprotectin europaediatrics 2009
Calprotectin europaediatrics 2009
Sofia Liosi
 
Uterine Health in Dairy Cows
Uterine Health in Dairy CowsUterine Health in Dairy Cows
Uterine Health in Dairy Cows
buzzcp
 
Critical appraisal of a journal article
Critical appraisal of a journal articleCritical appraisal of a journal article
Critical appraisal of a journal article
Preethi Selvaraj
 
So much evidence 1
So much evidence 1So much evidence 1
So much evidence 1
Hesham Gaber
 
So much evidence 1
So much evidence 1So much evidence 1
So much evidence 1
Hesham Gaber
 
Is there a place for progesterone in the management of miscarriage?
Is there a place for progesterone in the management of miscarriage?Is there a place for progesterone in the management of miscarriage?
Is there a place for progesterone in the management of miscarriage?
drmattprior
 
2016 shareholder meeting presentation final
2016 shareholder meeting presentation final2016 shareholder meeting presentation final
2016 shareholder meeting presentation final
Galenabio
 
Determining Candidacy and Strategies for ART Modification.2019
Determining Candidacy and Strategies for ART Modification.2019Determining Candidacy and Strategies for ART Modification.2019
Determining Candidacy and Strategies for ART Modification.2019
hivlifeinfo
 

Similar to Analysis of DNA methylation and Gene expression to predict childhood obesity (20)

Antiretroviral intensification to prevent intrapartum HIV transmission in lat...
Antiretroviral intensification to prevent intrapartum HIV transmission in lat...Antiretroviral intensification to prevent intrapartum HIV transmission in lat...
Antiretroviral intensification to prevent intrapartum HIV transmission in lat...
 
48999
4899948999
48999
 
progesterone & Miscarriage.pptx
progesterone & Miscarriage.pptxprogesterone & Miscarriage.pptx
progesterone & Miscarriage.pptx
 
maternal and child under nutrition . what works copy.pptx
maternal and child under nutrition . what works copy.pptxmaternal and child under nutrition . what works copy.pptx
maternal and child under nutrition . what works copy.pptx
 
Diagnosis and Treatment of Metritis
Diagnosis and Treatment of MetritisDiagnosis and Treatment of Metritis
Diagnosis and Treatment of Metritis
 
Using Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical DevelopmentsUsing Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical Developments
 
Success beyond numbers: The Salud Mesoamerica Initiative’s results-based fin...
Success beyond numbers:  The Salud Mesoamerica Initiative’s results-based fin...Success beyond numbers:  The Salud Mesoamerica Initiative’s results-based fin...
Success beyond numbers: The Salud Mesoamerica Initiative’s results-based fin...
 
Salud Mesoamérica 2015 Initiative: Select results from the first operation me...
Salud Mesoamérica 2015 Initiative: Select results from the first operation me...Salud Mesoamérica 2015 Initiative: Select results from the first operation me...
Salud Mesoamérica 2015 Initiative: Select results from the first operation me...
 
Pharmacogenomic Clinical Studies
Pharmacogenomic Clinical StudiesPharmacogenomic Clinical Studies
Pharmacogenomic Clinical Studies
 
Room a a08. mcgee-utility of qsofa lactate sepsis_(en)
Room a a08. mcgee-utility of qsofa lactate sepsis_(en)Room a a08. mcgee-utility of qsofa lactate sepsis_(en)
Room a a08. mcgee-utility of qsofa lactate sepsis_(en)
 
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
 
Meaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine researchMeaningful (meta)data at scale: removing barriers to precision medicine research
Meaningful (meta)data at scale: removing barriers to precision medicine research
 
Calprotectin europaediatrics 2009
Calprotectin europaediatrics 2009Calprotectin europaediatrics 2009
Calprotectin europaediatrics 2009
 
Uterine Health in Dairy Cows
Uterine Health in Dairy CowsUterine Health in Dairy Cows
Uterine Health in Dairy Cows
 
Critical appraisal of a journal article
Critical appraisal of a journal articleCritical appraisal of a journal article
Critical appraisal of a journal article
 
So much evidence 1
So much evidence 1So much evidence 1
So much evidence 1
 
So much evidence 1
So much evidence 1So much evidence 1
So much evidence 1
 
Is there a place for progesterone in the management of miscarriage?
Is there a place for progesterone in the management of miscarriage?Is there a place for progesterone in the management of miscarriage?
Is there a place for progesterone in the management of miscarriage?
 
2016 shareholder meeting presentation final
2016 shareholder meeting presentation final2016 shareholder meeting presentation final
2016 shareholder meeting presentation final
 
Determining Candidacy and Strategies for ART Modification.2019
Determining Candidacy and Strategies for ART Modification.2019Determining Candidacy and Strategies for ART Modification.2019
Determining Candidacy and Strategies for ART Modification.2019
 

More from sahirbhatnagar

Strong Heredity Models in High Dimensional Data
Strong Heredity Models in High Dimensional DataStrong Heredity Models in High Dimensional Data
Strong Heredity Models in High Dimensional Data
sahirbhatnagar
 
Methods for High Dimensional Interactions
Methods for High Dimensional InteractionsMethods for High Dimensional Interactions
Methods for High Dimensional Interactions
sahirbhatnagar
 
An introduction to knitr and R Markdown
An introduction to knitr and R MarkdownAn introduction to knitr and R Markdown
An introduction to knitr and R Markdown
sahirbhatnagar
 
Atelier r-gerad
Atelier r-geradAtelier r-gerad
Atelier r-gerad
sahirbhatnagar
 
Reproducible Research: An Introduction to knitr
Reproducible Research: An Introduction to knitrReproducible Research: An Introduction to knitr
Reproducible Research: An Introduction to knitr
sahirbhatnagar
 
Estimation and Accuracy after Model Selection
Estimation and Accuracy after Model SelectionEstimation and Accuracy after Model Selection
Estimation and Accuracy after Model Selection
sahirbhatnagar
 
Absolute risk estimation in a case cohort study of prostate cancer
Absolute risk estimation in a case cohort study of prostate cancerAbsolute risk estimation in a case cohort study of prostate cancer
Absolute risk estimation in a case cohort study of prostate cancer
sahirbhatnagar
 
Computational methods for case-cohort studies
Computational methods for case-cohort studiesComputational methods for case-cohort studies
Computational methods for case-cohort studies
sahirbhatnagar
 
Factors influencing participation in cancer screening
Factors influencing participation in cancer screeningFactors influencing participation in cancer screening
Factors influencing participation in cancer screening
sahirbhatnagar
 
Introduction to LaTeX
Introduction to LaTeXIntroduction to LaTeX
Introduction to LaTeX
sahirbhatnagar
 
Reproducible Research
Reproducible ResearchReproducible Research
Reproducible Research
sahirbhatnagar
 

More from sahirbhatnagar (11)

Strong Heredity Models in High Dimensional Data
Strong Heredity Models in High Dimensional DataStrong Heredity Models in High Dimensional Data
Strong Heredity Models in High Dimensional Data
 
Methods for High Dimensional Interactions
Methods for High Dimensional InteractionsMethods for High Dimensional Interactions
Methods for High Dimensional Interactions
 
An introduction to knitr and R Markdown
An introduction to knitr and R MarkdownAn introduction to knitr and R Markdown
An introduction to knitr and R Markdown
 
Atelier r-gerad
Atelier r-geradAtelier r-gerad
Atelier r-gerad
 
Reproducible Research: An Introduction to knitr
Reproducible Research: An Introduction to knitrReproducible Research: An Introduction to knitr
Reproducible Research: An Introduction to knitr
 
Estimation and Accuracy after Model Selection
Estimation and Accuracy after Model SelectionEstimation and Accuracy after Model Selection
Estimation and Accuracy after Model Selection
 
Absolute risk estimation in a case cohort study of prostate cancer
Absolute risk estimation in a case cohort study of prostate cancerAbsolute risk estimation in a case cohort study of prostate cancer
Absolute risk estimation in a case cohort study of prostate cancer
 
Computational methods for case-cohort studies
Computational methods for case-cohort studiesComputational methods for case-cohort studies
Computational methods for case-cohort studies
 
Factors influencing participation in cancer screening
Factors influencing participation in cancer screeningFactors influencing participation in cancer screening
Factors influencing participation in cancer screening
 
Introduction to LaTeX
Introduction to LaTeXIntroduction to LaTeX
Introduction to LaTeX
 
Reproducible Research
Reproducible ResearchReproducible Research
Reproducible Research
 

Recently uploaded

Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Sérgio Sacani
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
RDhivya6
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
Scintica Instrumentation
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
Alternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart AgricultureAlternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptxLEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
yourprojectpartner05
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
muralinath2
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
PirithiRaju
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
Frédéric Baudron
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
Carl Bergstrom
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDSJAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
Sérgio Sacani
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Sérgio Sacani
 

Recently uploaded (20)

Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
Alternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart AgricultureAlternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart Agriculture
 
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptxLEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDSJAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
 

Analysis of DNA methylation and Gene expression to predict childhood obesity

  • 1. Analysis of DNA Methylation and Gene Expression data in Placenta tissue to predict childhood obesity An Integrative Approach Bhatnagar SR1,2 , Houde A4,5 , Voisin G2 , Bouchard L4,5 , Greenwood CMT1,2,3 1 Department of Epidemiology, Biostatistics and Occupational Health, McGill University 2 Lady Davis Institute, Jewish General Hospital, Montr´eal, QC 3 Departments of Oncology and Human Genetics, McGill University 4 Department of Biochemistry, Universit´e de Sherbrooke, QC 5 ECOGENE-21 and Lipid Clinic, Chicoutimi Hospital, QC sahirbhatnagar.com/talks Poster Session B, # 56
  • 2. Motivating Question # 1 sahirbhatnagar.com Data Integration CHSGM 2015 2 / 25
  • 3. Motivation 1 in 4 adult Canadians and 1 in 10 children are clinically obese. Events during pregnancy are suspected to play a role in childhood obesity → we don’t know about the mechanisms involved Children born to women who had a gestational diabetes mellitus-affected pregnancy are more likely to be overweight and obese Evidence suggests epigenetic factors are important piece of the puzzle sahirbhatnagar.com Data Integration CHSGM 2015 3 / 25
  • 4. Motivating Question # 2 sahirbhatnagar.com Data Integration CHSGM 2015 4 / 25
  • 5. Motivating Question sample size genomic data 25 50 Gene Expression
  • 6. Motivating Question sample size genomic data 25 50 Gene Expression DNA Methylation DNA Methylation Gene Expression
  • 7. Motivating Question sample size genomic data 25 50 Gene Expression DNA Methylation DNA Methylation Gene Expression ?? ? sahirbhatnagar.com Data Integration CHSGM 2015 5 / 25
  • 8. The Data sahirbhatnagar.com Data Integration CHSGM 2015 6 / 25
  • 9. The Data Expression HT-12 v4 p = 46, 889 Methylation Illumina 450k p = 375, 561 Gestational Diabetes n = 45 GD = 29 Placenta n = 45 time at birth age 5 | |
  • 10. The Data Expression HT-12 v4 p = 46, 889 Methylation Illumina 450k p = 375, 561 Gestational Diabetes n = 45 GD = 29 Placenta n = 45 time at birth age 5 | | X
  • 11. The Data Expression HT-12 v4 p = 46, 889 Methylation Illumina 450k p = 375, 561 Gestational Diabetes n = 45 GD = 29 Placenta n = 45 time at birth age 5 | | X 7 Fat Measures Child n = 23 GD = 16
  • 12. The Data Expression HT-12 v4 p = 46, 889 Methylation Illumina 450k p = 375, 561 Gestational Diabetes n = 45 GD = 29 Placenta n = 45 time at birth age 5 | | X 7 Fat Measures Child n = 23 GD = 16 Y
  • 13. The Data Expression HT-12 v4 p = 46, 889 Methylation Illumina 450k p = 375, 561 Gestational Diabetes n = 45 GD = 29 Placenta n = 45 time at birth age 5 | | X 7 Fat Measures Child n = 23 GD = 16 Y ? sahirbhatnagar.com Data Integration CHSGM 2015 7 / 25
  • 14. Summarizing Expression, Methylation and Gestational Diabetes Phenotype in Placenta Tissue sahirbhatnagar.com Data Integration CHSGM 2015 8 / 25
  • 15. Sparse Canonical Correlation Analysis (sCCA) CCA requires calculation of XT X −1 and YT Y −1 When p + q >> n, these matrices are singular sCCA applies an L1 penalty to the canonical vectors to obtain sparse solutions (Witten et al., 2009; Parkhomenko et al., 2009) Assumes XT X = Ip, YT Y = Iq maximizeu,v uT XT Yv subject to u 2 2 ≤ 1, v 2 2 ≤ 1 and P1(u) ≤ λ1, P2(v) ≤ λ2 sahirbhatnagar.com Data Integration CHSGM 2015 9 / 25
  • 16. Supervised Sparse CCA Main idea: 1. The features that are most associated with the outcome Q are identified to form the reduced matrices X and Y 2. sCCA is performed on X and Y sahirbhatnagar.com Data Integration CHSGM 2015 10 / 25
  • 17. Importance of Gestational Diabetes Phenotype 0.88 0.90 0.92 0.94 0.96 0.98 #non−0expressionprobes # non−0 methylation probes Correlation Gestational Diabetes Status Used in Sparse CCA 0.88 0.90 0.92 0.94 0.96 0.98 #non−0expressionprobes # non−0 methylation probes Correlation Gestational Diabetes Status Not Used sahirbhatnagar.com Data Integration CHSGM 2015 11 / 25
  • 18. GO Stat Analysis for Enrichment Enrichment Analysis based on non zero vector of 1st component from the Supervised sCCA analysis Genes associated with inflammatory processes Table : Top list of enriched GO terms GOBPID FDR OR E.Count Count Size Term 0002376 < 10−14 2.1 131.6 227 2178 immune system process 0006955 < 10−13 2.3 78.7 153 1303 immune response 0002252 < 10−9 2.7 34.1 80 565 immune effector process 0045087 < 10−8 2.3 49.0 99 811 innate immune response 0002682 < 10−8 2.1 66.56 122 1102 regulation of immune system process 0002684 < 10−8 2.4 40.1 84 664 positive regulation of immune system proces 0006952 < 10−8 1.9 84.5 144 1399 defense response 0050776 < 10−8 2.3 44.5 90 738 regulation of immune response 0050778 < 10−7 2.6 28.5 65 473 positive regulation of immune response 0006950 < 10−7 1.6 196.8 271 3258 response to stress sahirbhatnagar.com Data Integration CHSGM 2015 12 / 25
  • 19. Summarizing Bodyfat Measures sahirbhatnagar.com Data Integration CHSGM 2015 13 / 25
  • 20. Cluster 6 Bodyfat measures in 2 groups 34 14 8 16 7 6 38 30 20 25 13 3 12 11 17 21 39 31 19 37 28 32 18 Zscore BMI percent fat subscapularis bicep tricep iliacus −2 0 2 Value Color Key sahirbhatnagar.com Data Integration CHSGM 2015 14 / 25
  • 21. Circle of Correlations −1.0 −0.5 0.0 0.5 1.0 −1.0−0.50.00.51.0 Variables factor map (PCA) Dim 1 (50.68%) Dim2(15.41%) Zscore BMI percent fat tricep bicep subscapularis iliacus sahirbhatnagar.com Data Integration CHSGM 2015 15 / 25
  • 22. Combining Both Data sahirbhatnagar.com Data Integration CHSGM 2015 16 / 25
  • 23. Regression via Elastic Net Expression HT-12 v4 p = 46, 889 Methylation Illumina 450k p = 375, 561 Gestational Diabetes n = 45 GD = 29 Placenta n = 45 time at birth age 5 | | X 7 Fat Measures Child n = 23 GD = 16 Y ? sahirbhatnagar.com Data Integration CHSGM 2015 17 / 25
  • 24. 1st PC as Summary Bodyfat Measure 3 8 32 14294 102 1443 187 12853 124 375563 36 37513 81 338052 197 75115 30 7505 188 67612 196 84503 37 9380 202 75125 1 2 3 4 data used to predict 1st PC of bodyfat measures LOOCVmeansquarederror data.type Canonical Variables Expr+Methy non 0 CCA factors Expr non 0 CCA factors Methy non 0 CCA factors Expr+Methy Filter Expr Filter low means Methy Filter low var Expr+Methy Filter low+t.test Expr Filter low+t.test Methy Filter low+t.test Expr+Methy Filter t.test Expr Filter t.test Methy Filter t.test sahirbhatnagar.com Data Integration CHSGM 2015 18 / 25
  • 25. Ward Clustering Groups 1 8 22 14294 1 1443 20 12853 331 375563 1 37513 54 338052 6 75115 1 7505 6 67612 7 84503 1 9380 30 75125 0.0 0.1 0.2 0.3 0.4 0.5 data used to predict Ward clustering groups LOOCVmisclassificationerror data.type Canonical Variables Expr+Methy non 0 CCA factors Expr non 0 CCA factors Methy non 0 CCA factors Expr+Methy Filter Expr Filter low means Methy Filter low var Expr+Methy Filter low+t.test Expr Filter low+t.test Methy Filter low+t.test Expr+Methy Filter t.test Expr Filter t.test Methy Filter t.test sahirbhatnagar.com Data Integration CHSGM 2015 19 / 25
  • 26. Neuropeptide Y Receptor (NPY1R) From OMIM: One of the most abundant neuropeptides in the mammalian nervous system Exhibits a diverse range of important physiologic activities, including effects on food intake Have been identified in a variety of tissues, including placenta (Herzog et al., 1992). sahirbhatnagar.com Data Integration CHSGM 2015 20 / 25
  • 27. Motivating Question #2: My Answer sample size genomic data 25 50 Gene Expression DNA Methylation DNA Methylation Gene Expression sahirbhatnagar.com Data Integration CHSGM 2015 21 / 25
  • 28. Big Data sahirbhatnagar.com Data Integration CHSGM 2015 22 / 25
  • 29. Big Data Data Integration sahirbhatnagar.com Data Integration CHSGM 2015 22 / 25
  • 30. Big Data Data Integration Machine Learning sahirbhatnagar.com Data Integration CHSGM 2015 22 / 25
  • 31. Smalln Data sahirbhatnagar.com Data Integration CHSGM 2015 23 / 25
  • 32. Acknowledgements Celia Greenwood and Mathieu Blanchette Greg Voisin, Andr´ee-Anne Houde, Luigi Bouchard All the mothers and children that took part in this study You sahirbhatnagar.com Data Integration CHSGM 2015 24 / 25
  • 33. References Principal component analysis plots and beamer template. URL http://gastonsanchez.com/. Elena Parkhomenko, David Tritchler, and Joseph Beyene. Sparse canonical correlation analysis with application to genomic data integration. Statistical Applications in Genetics and Molecular Biology, 8(1):1–34, 2009. Daniela M Witten and Robert J Tibshirani. Extensions of sparse canonical correlation analysis with applications to genomic data. Statistical applications in genetics and molecular biology, 8(1):1–27, 2009. Daniela M Witten, Robert Tibshirani, and Trevor Hastie. A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics, page kxp008, 2009. sahirbhatnagar.com Data Integration CHSGM 2015 25 / 25