SlideShare a Scribd company logo
Approaches for Integration of multiple ā€˜Omicā€™
Data
Dmitry Grapov, PhD
Examples
Nature Reviews Genetics 15, 107ā€“120 (2014) doi:10.1038/nrg3643
FBA = flux-balance analysis
ā€¢ Topological enrichment can give broad overview
of impacted genes, proteins and metabolites
ā€¢ Changes in biochemical domains corroborated
by multi-Omic data sets can be used to identify
robust candidates responsible for phenotypic
variation between comparisons
ā€¢ Gene-gene, protein-protein or gene-protein
interaction networks can be used to
deconvolute ambiguous metabolic pathways
Common Approaches
Nature Reviews Genetics 15, 107ā€“120 (2014) doi:10.1038/nrg3643
Biochemical Domain
Enrichment Analysis
ā€¢ Genes/Proteins ļƒ  DAVID, AmiGo, etc ļƒ GO:terms
ā€¢ Genes/Proteins + Metabolites ļƒ  IMPaLA: Integrated Molecular
Pathway Level Analysis (http://impala.molgen.mpg.de/) ļƒ  pathways
1. Classify all species domains (e.g. biological process, pathway, etc)
2. Calculate probability of observing changes in species by chance
IMPaLA: Gene + Metabolite
pathway enrichment
Challenges:
ā€¢Removal of redundant information
ā€¢Preference of specific vs. generic pathways
ā€¢Visualization of gene + metabolite + pathway relationships
Determining significance of the
enrichment: Hypergeometric Test
How to calculate statistics to determine enrichment?
hit.num = 51 # number of significantly changed pathway
metabolites
set.num = 1455 # number of metabolites in pathway
full = 3358 # all possible metabolites in organism
q.size = 72 # number of significantly changed metabolites
phyper(hit.num-1, set.num, full-set.num, q.size, lower.tail=F)
= 1.717553e-06
GO Enrichment analysis:
Hierarchy of Redundancy (parents)
ā€¢ GO is an ontology wherein enrichment is often
shared by children and parents.
ā€¢ Difficult to co-visualize term hierarchy and gene to
term mapping
Enrichment networks:
Removing the Hierarchy of
Redundancy
Workflow:
1. If two nodes share all genes, drop least
enriched (highest p-value)
2. Filter terms based on enrichment
3. Display term to gene/protein
relationships as edges in a network
4. Map direction of change in
genes/proteins to network node
attributes
Enrichment Network
Mapping of parents through children
GO enrichment network displays:
ā€¢ gene names associated with
each overrepresented term
ā€¢ Fold change in protein
expression between two
groups (can be extended k>2
groups)
ā€¢ Can display enrichment p-
value for each term
ā€¢ Can incorporate metabolites
as children of genes
Empirical Networks
ā€¢ Correlation based networks (CN)
(simple, tendency to hairball)
ā€¢ GGM or partial correlation based
networks (advanced, preference
of direct over indirect
relationships
ā€¢ *Increase in robustness with
sample size
10.1007/978-1-4614-1689-0_17
Topological
Enrichment
Networks
http://pubchem.ncbi.nlm.nih.gov//score_matrix/score_matrix.cgi
http://www.genome.jp/dbget-bin/www_bget?rn:R00975
gene gene
Topological Enrichment Networks:
genes + proteins + metabolites
metabolite
metabolite
protein
gene
MetaMapR
Biological network generator
https://github.com/dgrapov/MetaMapR
dgrapov@ucdavis.edu
metabolomics.ucdavis.edu
This research was supported in part by NIH 1 U24 DK097154

More Related Content

What's hot

Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015
Dmitry Grapov
Ā 
Machine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisMachine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network Analysis
Dmitry Grapov
Ā 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Dmitry Grapov
Ā 
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of ActionA Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of ActionGerald Lushington
Ā 
Systems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSystems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traits
SOYEON KIM
Ā 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
Michel Dumontier
Ā 
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
SOYEON KIM
Ā 
Metabolic network visualization - concepts
Metabolic network visualization - conceptsMetabolic network visualization - concepts
Metabolic network visualization - concepts
Dinesh Barupal
Ā 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resources
Melanie Courtot
Ā 
Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...
SOYEON KIM
Ā 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a survey
SOYEON KIM
Ā 
1 statistical analysis
1  statistical analysis1  statistical analysis
1 statistical analysisDmitry Grapov
Ā 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Michel Dumontier
Ā 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
Michel Dumontier
Ā 
Mega
MegaMega
Mega
Nabajit Roy
Ā 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Michel Dumontier
Ā 
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics SoftwareHarnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Software
jatwood3
Ā 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and Visualization
Dmitry Grapov
Ā 
Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019
Dinesh Barupal
Ā 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
Michel Dumontier
Ā 

What's hot (20)

Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015
Ā 
Machine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisMachine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network Analysis
Ā 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Ā 
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of ActionA Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
Ā 
Systems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSystems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traits
Ā 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
Ā 
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Ā 
Metabolic network visualization - concepts
Metabolic network visualization - conceptsMetabolic network visualization - concepts
Metabolic network visualization - concepts
Ā 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resources
Ā 
Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...
Ā 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a survey
Ā 
1 statistical analysis
1  statistical analysis1  statistical analysis
1 statistical analysis
Ā 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Ā 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
Ā 
Mega
MegaMega
Mega
Ā 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Ā 
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics SoftwareHarnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Harnessing The Proteome With Proteo Iq Quantitative Proteomics Software
Ā 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and Visualization
Ā 
Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019
Ā 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
Ā 

Viewers also liked

Metabolomic Profile of Human Myocardial Ischemia by NMR Spectroscopy of Perip...
Metabolomic Profile of HumanMyocardial Ischemia by NMR Spectroscopy of Perip...Metabolomic Profile of HumanMyocardial Ischemia by NMR Spectroscopy of Perip...
Metabolomic Profile of Human Myocardial Ischemia by NMR Spectroscopy of Perip...Nagi Abdalla
Ā 
Data Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological StudiesData Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological Studies
Dmitry Grapov
Ā 
Automation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report GenerationAutomation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report Generation
Dmitry Grapov
Ā 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
Ā 
Metabolomics
MetabolomicsMetabolomics
Metabolomics
Shreya Ahuja
Ā 
Slide shere
Slide shereSlide shere
Slide sheremariabraga
Ā 

Viewers also liked (8)

Metabolomic Profile of Human Myocardial Ischemia by NMR Spectroscopy of Perip...
Metabolomic Profile of HumanMyocardial Ischemia by NMR Spectroscopy of Perip...Metabolomic Profile of HumanMyocardial Ischemia by NMR Spectroscopy of Perip...
Metabolomic Profile of Human Myocardial Ischemia by NMR Spectroscopy of Perip...
Ā 
Data Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological StudiesData Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological Studies
Ā 
Automation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report GenerationAutomation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report Generation
Ā 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Ā 
Metabolomics
MetabolomicsMetabolomics
Metabolomics
Ā 
Metabolomics
MetabolomicsMetabolomics
Metabolomics
Ā 
Slide shere
Slide shereSlide shere
Slide shere
Ā 
Basic Immunology
Basic ImmunologyBasic Immunology
Basic Immunology
Ā 

Similar to Omic Data Integration Strategies

Network Biology Lent 2010 - lecture 1
Network Biology Lent 2010 - lecture 1Network Biology Lent 2010 - lecture 1
Network Biology Lent 2010 - lecture 1
Florian Markowetz
Ā 
KnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network MinerKnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network Miner
Keywan Hassani-Pak
Ā 
Algal Functional Annotation Tool
Algal Functional Annotation ToolAlgal Functional Annotation Tool
Algal Functional Annotation Tool
Sarah Adams
Ā 
Specificity and Evolvability in Eukaryotic Protein Interaction Networks
Specificity and Evolvability in Eukaryotic Protein Interaction NetworksSpecificity and Evolvability in Eukaryotic Protein Interaction Networks
Specificity and Evolvability in Eukaryotic Protein Interaction Networks
pedrobeltrao
Ā 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
Lars Juhl Jensen
Ā 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
improvemed
Ā 
Protease Phylogeny
 Protease Phylogeny  Protease Phylogeny
Protease Phylogeny
Chris Southan
Ā 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
Ā 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
DevikaPatel12
Ā 
4.4 Biotechnology - Gene Transfer (by Je Hoon)
4.4 Biotechnology - Gene Transfer (by Je Hoon)4.4 Biotechnology - Gene Transfer (by Je Hoon)
4.4 Biotechnology - Gene Transfer (by Je Hoon)Abbotsford Christian School
Ā 
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
Enrico Glaab
Ā 
O.M.GSEA - An in-depth introduction to gene-set enrichment analysis
O.M.GSEA - An in-depth introduction to gene-set enrichment analysisO.M.GSEA - An in-depth introduction to gene-set enrichment analysis
O.M.GSEA - An in-depth introduction to gene-set enrichment analysis
Shana White
Ā 
Molecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contructionMolecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contruction
UdayBhanushali111
Ā 
Presage database
Presage databasePresage database
Presage database
Akshay More
Ā 
2015 mcgill-talk
2015 mcgill-talk2015 mcgill-talk
2015 mcgill-talk
c.titus.brown
Ā 
Waksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWaksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWesley Kwong
Ā 
The Language of the Gene Ontology
The Language of the Gene OntologyThe Language of the Gene Ontology
The Language of the Gene Ontology
robertstevens65
Ā 
David
DavidDavid

Similar to Omic Data Integration Strategies (20)

Network Biology Lent 2010 - lecture 1
Network Biology Lent 2010 - lecture 1Network Biology Lent 2010 - lecture 1
Network Biology Lent 2010 - lecture 1
Ā 
KnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network MinerKnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network Miner
Ā 
Algal Functional Annotation Tool
Algal Functional Annotation ToolAlgal Functional Annotation Tool
Algal Functional Annotation Tool
Ā 
Specificity and Evolvability in Eukaryotic Protein Interaction Networks
Specificity and Evolvability in Eukaryotic Protein Interaction NetworksSpecificity and Evolvability in Eukaryotic Protein Interaction Networks
Specificity and Evolvability in Eukaryotic Protein Interaction Networks
Ā 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
Ā 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
Ā 
Protease Phylogeny
 Protease Phylogeny  Protease Phylogeny
Protease Phylogeny
Ā 
genomic comparison
genomic comparison genomic comparison
genomic comparison
Ā 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Ā 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Ā 
4.4 Biotechnology - Gene Transfer (by Je Hoon)
4.4 Biotechnology - Gene Transfer (by Je Hoon)4.4 Biotechnology - Gene Transfer (by Je Hoon)
4.4 Biotechnology - Gene Transfer (by Je Hoon)
Ā 
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
Ā 
O.M.GSEA - An in-depth introduction to gene-set enrichment analysis
O.M.GSEA - An in-depth introduction to gene-set enrichment analysisO.M.GSEA - An in-depth introduction to gene-set enrichment analysis
O.M.GSEA - An in-depth introduction to gene-set enrichment analysis
Ā 
Molecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contructionMolecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contruction
Ā 
Presage database
Presage databasePresage database
Presage database
Ā 
2015 mcgill-talk
2015 mcgill-talk2015 mcgill-talk
2015 mcgill-talk
Ā 
Waksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWaksman Student Scholars Program Poster
Waksman Student Scholars Program Poster
Ā 
The Language of the Gene Ontology
The Language of the Gene OntologyThe Language of the Gene Ontology
The Language of the Gene Ontology
Ā 
David
DavidDavid
David
Ā 
MORPH-R article
MORPH-R articleMORPH-R article
MORPH-R article
Ā 

More from Dmitry Grapov

R programming for Data Science - A Beginnerā€™s Guide
R programming for Data Science - A Beginnerā€™s GuideR programming for Data Science - A Beginnerā€™s Guide
R programming for Data Science - A Beginnerā€™s Guide
Dmitry Grapov
Ā 
Network mapping 101 course
Network mapping 101 courseNetwork mapping 101 course
Network mapping 101 course
Dmitry Grapov
Ā 
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Dmitry Grapov
Ā 
Dmitry Grapov Resume and CV
Dmitry Grapov Resume and CVDmitry Grapov Resume and CV
Dmitry Grapov Resume and CV
Dmitry Grapov
Ā 
Case Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization StrategiesCase Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization Strategies
Dmitry Grapov
Ā 
Modeling poster
Modeling posterModeling poster
Modeling poster
Dmitry Grapov
Ā 
3 data normalization (2014 lab tutorial)
3  data normalization (2014 lab tutorial)3  data normalization (2014 lab tutorial)
3 data normalization (2014 lab tutorial)
Dmitry Grapov
Ā 
Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014
Dmitry Grapov
Ā 
American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014
Dmitry Grapov
Ā 
High Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationHigh Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and Visualization
Dmitry Grapov
Ā 
6 metabolite enrichment analysis
6  metabolite enrichment analysis6  metabolite enrichment analysis
6 metabolite enrichment analysisDmitry Grapov
Ā 
5 data analysis case study
5  data analysis case study5  data analysis case study
5 data analysis case studyDmitry Grapov
Ā 
4 partial least squares modeling
4  partial least squares modeling4  partial least squares modeling
4 partial least squares modelingDmitry Grapov
Ā 
3 principal components analysis
3  principal components analysis3  principal components analysis
3 principal components analysisDmitry Grapov
Ā 
2 cluster analysis
2  cluster analysis2  cluster analysis
2 cluster analysisDmitry Grapov
Ā 
0 introduction
0  introduction0  introduction
0 introductionDmitry Grapov
Ā 

More from Dmitry Grapov (16)

R programming for Data Science - A Beginnerā€™s Guide
R programming for Data Science - A Beginnerā€™s GuideR programming for Data Science - A Beginnerā€™s Guide
R programming for Data Science - A Beginnerā€™s Guide
Ā 
Network mapping 101 course
Network mapping 101 courseNetwork mapping 101 course
Network mapping 101 course
Ā 
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Ā 
Dmitry Grapov Resume and CV
Dmitry Grapov Resume and CVDmitry Grapov Resume and CV
Dmitry Grapov Resume and CV
Ā 
Case Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization StrategiesCase Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization Strategies
Ā 
Modeling poster
Modeling posterModeling poster
Modeling poster
Ā 
3 data normalization (2014 lab tutorial)
3  data normalization (2014 lab tutorial)3  data normalization (2014 lab tutorial)
3 data normalization (2014 lab tutorial)
Ā 
Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014
Ā 
American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014
Ā 
High Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationHigh Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and Visualization
Ā 
6 metabolite enrichment analysis
6  metabolite enrichment analysis6  metabolite enrichment analysis
6 metabolite enrichment analysis
Ā 
5 data analysis case study
5  data analysis case study5  data analysis case study
5 data analysis case study
Ā 
4 partial least squares modeling
4  partial least squares modeling4  partial least squares modeling
4 partial least squares modeling
Ā 
3 principal components analysis
3  principal components analysis3  principal components analysis
3 principal components analysis
Ā 
2 cluster analysis
2  cluster analysis2  cluster analysis
2 cluster analysis
Ā 
0 introduction
0  introduction0  introduction
0 introduction
Ā 

Recently uploaded

Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
Ā 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
Ā 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
Ā 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
Ā 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
Ā 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
Ā 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
Ā 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
Ā 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
Ā 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
Ā 
Lapbook sobre os Regimes TotalitƔrios.pdf
Lapbook sobre os Regimes TotalitƔrios.pdfLapbook sobre os Regimes TotalitƔrios.pdf
Lapbook sobre os Regimes TotalitƔrios.pdf
Jean Carlos Nunes PaixĆ£o
Ā 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
Ā 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
Ā 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
Ā 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
Ā 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
Ā 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
Ā 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
Ā 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
Ā 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Po-Chuan Chen
Ā 

Recently uploaded (20)

Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
Ā 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ā 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
Ā 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
Ā 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
Ā 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Ā 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
Ā 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Ā 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Ā 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Ā 
Lapbook sobre os Regimes TotalitƔrios.pdf
Lapbook sobre os Regimes TotalitƔrios.pdfLapbook sobre os Regimes TotalitƔrios.pdf
Lapbook sobre os Regimes TotalitƔrios.pdf
Ā 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Ā 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
Ā 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
Ā 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Ā 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
Ā 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
Ā 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Ā 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Ā 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Ā 

Omic Data Integration Strategies

  • 1. Approaches for Integration of multiple ā€˜Omicā€™ Data Dmitry Grapov, PhD
  • 2. Examples Nature Reviews Genetics 15, 107ā€“120 (2014) doi:10.1038/nrg3643 FBA = flux-balance analysis ā€¢ Topological enrichment can give broad overview of impacted genes, proteins and metabolites ā€¢ Changes in biochemical domains corroborated by multi-Omic data sets can be used to identify robust candidates responsible for phenotypic variation between comparisons ā€¢ Gene-gene, protein-protein or gene-protein interaction networks can be used to deconvolute ambiguous metabolic pathways
  • 3. Common Approaches Nature Reviews Genetics 15, 107ā€“120 (2014) doi:10.1038/nrg3643
  • 4. Biochemical Domain Enrichment Analysis ā€¢ Genes/Proteins ļƒ  DAVID, AmiGo, etc ļƒ GO:terms ā€¢ Genes/Proteins + Metabolites ļƒ  IMPaLA: Integrated Molecular Pathway Level Analysis (http://impala.molgen.mpg.de/) ļƒ  pathways 1. Classify all species domains (e.g. biological process, pathway, etc) 2. Calculate probability of observing changes in species by chance
  • 5. IMPaLA: Gene + Metabolite pathway enrichment Challenges: ā€¢Removal of redundant information ā€¢Preference of specific vs. generic pathways ā€¢Visualization of gene + metabolite + pathway relationships
  • 6. Determining significance of the enrichment: Hypergeometric Test How to calculate statistics to determine enrichment? hit.num = 51 # number of significantly changed pathway metabolites set.num = 1455 # number of metabolites in pathway full = 3358 # all possible metabolites in organism q.size = 72 # number of significantly changed metabolites phyper(hit.num-1, set.num, full-set.num, q.size, lower.tail=F) = 1.717553e-06
  • 7. GO Enrichment analysis: Hierarchy of Redundancy (parents) ā€¢ GO is an ontology wherein enrichment is often shared by children and parents. ā€¢ Difficult to co-visualize term hierarchy and gene to term mapping
  • 8. Enrichment networks: Removing the Hierarchy of Redundancy Workflow: 1. If two nodes share all genes, drop least enriched (highest p-value) 2. Filter terms based on enrichment 3. Display term to gene/protein relationships as edges in a network 4. Map direction of change in genes/proteins to network node attributes
  • 9. Enrichment Network Mapping of parents through children GO enrichment network displays: ā€¢ gene names associated with each overrepresented term ā€¢ Fold change in protein expression between two groups (can be extended k>2 groups) ā€¢ Can display enrichment p- value for each term ā€¢ Can incorporate metabolites as children of genes
  • 10. Empirical Networks ā€¢ Correlation based networks (CN) (simple, tendency to hairball) ā€¢ GGM or partial correlation based networks (advanced, preference of direct over indirect relationships ā€¢ *Increase in robustness with sample size 10.1007/978-1-4614-1689-0_17
  • 12. gene gene Topological Enrichment Networks: genes + proteins + metabolites metabolite metabolite protein gene
  • 14. dgrapov@ucdavis.edu metabolomics.ucdavis.edu This research was supported in part by NIH 1 U24 DK097154