SlideShare a Scribd company logo
Metabolic
pathway mapping
Dinesh Barupal
dinkumar@ucdavis.edu
5.1 & 5.4
DATA
ACQUISITION
Separation
Detection
SAMPLING
EXTRACTION
DATA
PROCESSING
File Conversion
Baseline Correction
Peak Detection
Deconvolution
Adduct Annotation
Alignment
Gap Filling
STATISTICS
Normalization
Multivariate Analysis
(Parametric, Nonparametric)
Univariate Analysis
(Unsupervised, Supervised)
BIOLOGICAL
INTERPRETATION
Pathway Mapping
Network Enrichment
STUDY DESIGN
VALIDATION
COMPOUND
IDENTIFICATION
Molecular Formula ID
Structure ID
MS Library Search
Database Search
In silico Fragmentation
WCMC
UC Davis
Question :
Which metabolites from my list can be mapped to
metabolic pathways ?
Approach : Database mapping
Results:
A. Table of pathways with statistics
B. Maps overlaid with detected metabolites
C. Metabolites not mapped to pathways
Data preparation
C00003
C00016
C00019
C00020
C00025
C00047
C00064
C00082
C00086
C00099
C00105
Select the KEGG database identifiers for compounds having pvalue <0.05 in your experiment.
To get the KEGG ids for your compound list, you can use PubChem identifier exchange service.
https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi
Prepare the compound list
Sort the statistical results by p-value
Select the KEGG ids of compounds with p-value <0.05
Use filter in
Excel
Example study : “spring_2018_metabolomics_course_pathway_example.xlsx” in the
PathwayAnalysis_example folder
Four major databases for pathway
mapping –
• KEGG
• Reactome
• HMDB
• Consensus path db
For a comprehensive list of pathway databases – go to http://www.pathguide.org/
KEGG : Kyoto encyclopedia of genes and genomes
http://www.genome.jp/kegg/
GO TO
KEGG is a collection of databases dealing with genomes, biological pathways, diseases, drugs, and chemical substances.
KEGG is utilized for bioinformatics research and education, including data analysis in genomics, metagenomics,
metabolomics and other omics studies, modeling and simulation in systems biology, and translational research in drug
development.
http://www.genome.jp/kegg/
Click on KEGG Mapper (at the bottom of the page)
Click on search pathway
Copy and paste the KEGG
Ids in this box
Click on execute
Change organism if needed
KEGG Results -
Click on the a pathway to get
the map with compound
overlaid on it.
KEGG Global metabolic map
Red dots are the
compounds
present in the
metabolite list.
Green boxes are enzymes mapped to the human genome.
White boxes
are enzymes
not mapped
to the human
genome.
Red circle shows
the presence of
this compound
in the
metabolite list
Reactome pathway mapping
www.reactome.org
“REACTOME is an open-source, open access, manually curated and peer-reviewed pathway database.
Our goal is to provide intuitive bioinformatics tools for the visualization, interpretation and analysis of
pathway knowledge to support basic and clinical research, genome analysis, modeling, systems biology
and education. Founded in 2001, the Reactome project is led by Lincoln Stein of OICR, Peter D’Eustachio
of NYULMC, Henning Hermjakob of EMBL-EBI, and Guanming Wu of OHSU.”
www.reactome.org
Copy and paste KEGG Ids in this box.
# first line should be
“#Small_molecules_KEGG”
Reactome results
Download
Zooming in and double clicking will show
the map of a pathway
Metabolite in the list
MetaboAnalyst
Pathway Mapping
“To provide a user-friendly, web-based analytical pipeline for high-throughput metabolomics
studies. In particular, MetaboAnalyst aims to offer a variety of commonly used procedures for
metabolomic data processing, normalization, multivariate statistical analysis, as well as data
annotation. The current implementation focuses on exploratory statistical analysis, functional
interpretation, and advanced statistics for translational metabolomics studies.”
http://www.metaboanalyst.ca/
Go to www.metaboanalyst.ca
Click here
Paste the KEGG ids and select the type of identifiers
Click : Submit
Click : Submit
Select a pathway library
Click : Submit
Results
Pathway topology : centrality of compounds
Output table
Pathway with FDR < 0.20 are selected
Compound-pathway mapping is not provided
Consensus PathDB
Metabolic Network Mapping
“ConsensusPathDB-human integrates interaction networks in Homo sapiens including binary and complex
protein-protein, genetic, metabolic, signaling, gene regulatory and drug-target interactions, as well as
biochemical pathways. Data originate from currently 32 public resources for interactions (listed below) and
interactions that we have curated from the literature. The interaction data are integrated in a complementary
manner (avoiding redundancies), resulting in a seamless interaction network containing different types of
interactions.”
http://cpdb.molgen.mpg.de/
Click here
http://cpdb.molgen.mpg.de/
Copy and paste KEGG
id list here.
Results
Click on select “all”
Pathway similarity
network
Pathway network
Click here
http://cpdb.molgen.mpg.de/
Copy and paste KEGG
id list here.
Input KEGG ids are
mapped to internal ids and
to various databases.
Click on “all” to select
them.
Then click on the show
interactions button.
CPDB Interaction Results
Click on “all” to select them and then click on map and visualize interactions.
CPDB Network Results
Discussions
Key points :
• Not all detected metabolites have KEGG identifiers.
• Not all compounds with KEGG ids can be mapped to
metabolic pathways.
• Pathway maps may not show all the reaction known for a
metabolite.
• Several compound appeared in many pathways.
• Online tools can map metabolite list to pathway maps and
perform pathway over-representation analysis for
metabolites.
• Online network visualization of a large number of
metabolites is not efficient.
KEGG - 495
MetaCyc - 2453
Reactome - 2000
HMDB – 613
Wikipathways - 789
Note : Pathway definitions are manual and vary across
different databases.
Major pathway databases
How many pathway we know so far ?
http://www.ncbi.nlm.nih.gov/pubmed/22591066
Not all metabolites have pathways
http://onlinelibrary.wiley.com/doi/10.1002/9781444339956.ch12/summary
Caution!
Can we trust a genome-derived pathway map ?
Glucose belongs to ?
KEGG DB
Reactome DB
Pathway boundaries are arbitrary
Chemical-similarity maps of small molecules and metals in human blood
Rappaport, Stephen M., et al. "The blood exposome and its role in discovering causes
of disease." Environmental Health Perspectives (Online)122.8 (2014): 769.
Node size = number of
pathways
Node size =
Epidemiological studies
• Epidemiologists and
systems biologists /
biochemist don’t study
same compounds.
• Epidemiologist seem to
study compounds having
a lesser number of known
pathways.
Do everyone care about pathways ?Do everyone care about pathways ?
Which TCA definitions ?
KEGG
Reactome
SMPDB
MetaCyc
Which TCA definition ?
P <0.10 P <0.05
Which p-value : < 0.05 OR < 0.10
decide the p-value
cutoff to get the list
of input
metabolites.
Metabolite pathway analysis: how about pathway topology ?
Shall they
have a same
score ?
Topology scoring
If affected compounds are in
close biochemical proximity,
the pathway should be given
a weightage score.
Fahrmann et al 2017
Gene Ontology TCA map
-include gene names or symbols in the map
Use pathway maps for focused biochemical visualization
Use pathway maps for focused biochemical visualization
Pathway maps provided by databases are not used
by publications. Authors draw their own maps and
show data on it.
https://www.nature.com/articles/cddis2011123
How to create these customized maps ?
(Re) read biochemistry books and draw them
OR
How to create these customized maps ?
Metabolites Reactions Genes
Pathway layout
Biochemical databases
Metabolomics dataset
Cytsocape + powerpoint
**No automated way exits to make pretty
looking customized pathway maps.
ProteinsKEGG
HMDB
MetaCyc
Wikipathways
Reactome
Use biochemical databases
Next questions :
• How to map all the identified metabolites
into a biochemical network ?
• How to show all the known reactions for a
metabolite ?
• How to map publishable network graphs ?

More Related Content

What's hot

A Comparison of NGS Platforms.
A Comparison of NGS Platforms.A Comparison of NGS Platforms.
A Comparison of NGS Platforms.mkim8
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
prateek kumar
 
Sequence database
Sequence databaseSequence database
Sequence database
Dr.M.Prasad Naidu
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingDayananda Salam
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Somdutt Sharma
 
Softwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisSoftwares For Phylogentic Analysis
Softwares For Phylogentic Analysis
Prasanthperceptron
 
Variant analysis and whole exome sequencing
Variant analysis and whole exome sequencingVariant analysis and whole exome sequencing
Variant analysis and whole exome sequencing
Bioinformatics and Computational Biosciences Branch
 
Data retrieval
Data retrievalData retrieval
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
PRUTHVIRAJ K
 
Phylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony methodPhylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony method
Afnan Zuiter
 
Kegg
KeggKegg
Kegg
msfbi1521
 
NGS File formats
NGS File formatsNGS File formats
NGS File formats
HARSHITHA EBBALI
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processing
mikaelhuss
 
Genome Assembly 2018
Genome Assembly 2018Genome Assembly 2018
Genome Assembly 2018
Aureliano Bombarely
 
Pathway and network analysis
Pathway and network analysisPathway and network analysis
Pathway and network analysis
Manar Al-Eslam Mattar
 
Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02
PILLAI ASWATHY VISWANATH
 
Secondary structure prediction
Secondary structure predictionSecondary structure prediction
Secondary structure prediction
samantlalit
 
Protein databases
Protein databasesProtein databases
Protein databasessarumalay
 
Bioinformatics
BioinformaticsBioinformatics
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
Amritha S R
 

What's hot (20)

A Comparison of NGS Platforms.
A Comparison of NGS Platforms.A Comparison of NGS Platforms.
A Comparison of NGS Platforms.
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Softwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisSoftwares For Phylogentic Analysis
Softwares For Phylogentic Analysis
 
Variant analysis and whole exome sequencing
Variant analysis and whole exome sequencingVariant analysis and whole exome sequencing
Variant analysis and whole exome sequencing
 
Data retrieval
Data retrievalData retrieval
Data retrieval
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Phylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony methodPhylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony method
 
Kegg
KeggKegg
Kegg
 
NGS File formats
NGS File formatsNGS File formats
NGS File formats
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processing
 
Genome Assembly 2018
Genome Assembly 2018Genome Assembly 2018
Genome Assembly 2018
 
Pathway and network analysis
Pathway and network analysisPathway and network analysis
Pathway and network analysis
 
Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02
 
Secondary structure prediction
Secondary structure predictionSecondary structure prediction
Secondary structure prediction
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 

Similar to Mapping metabolites against pathway databases

Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDBMetabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Dinesh Barupal
 
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Analysis with biological pathways:
Analysis with biological pathways: Analysis with biological pathways:
Analysis with biological pathways:
Chris Evelo
 
Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)
Dinesh Barupal
 
Towards semantic systems chemical biology
Towards semantic systems chemical biology Towards semantic systems chemical biology
Towards semantic systems chemical biology
Bin Chen
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...
Lars Juhl Jensen
 
Cadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.PharmCadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.Pharm
Shikha Popali
 
Protein Network Analysis
Protein Network AnalysisProtein Network Analysis
Protein Network Analysis
Creative Proteomics
 
Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019
Dinesh Barupal
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
Jaclyn Williams
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
Elia Brodsky
 
NetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-vizNetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-viz
Alexander Pico
 
Psb tutorial cancer_pathways
Psb tutorial cancer_pathwaysPsb tutorial cancer_pathways
Psb tutorial cancer_pathways
Jeff Kiefer
 
WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...
Chris Evelo
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
Enrico Glaab
 
Metabolic network mapping for metabolomics
Metabolic network mapping for metabolomicsMetabolic network mapping for metabolomics
Metabolic network mapping for metabolomics
Dinesh Barupal
 
Algal Functional Annotation Tool
Algal Functional Annotation ToolAlgal Functional Annotation Tool
Algal Functional Annotation Tool
Sarah Adams
 

Similar to Mapping metabolites against pathway databases (20)

Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDBMetabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
 
Metabolomics seminarslides 013111final 110201
Metabolomics seminarslides 013111final 110201Metabolomics seminarslides 013111final 110201
Metabolomics seminarslides 013111final 110201
 
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
 
Analysis with biological pathways:
Analysis with biological pathways: Analysis with biological pathways:
Analysis with biological pathways:
 
Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)
 
Towards semantic systems chemical biology
Towards semantic systems chemical biology Towards semantic systems chemical biology
Towards semantic systems chemical biology
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...
 
Cadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.PharmCadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.Pharm
 
Metabolomics
MetabolomicsMetabolomics
Metabolomics
 
Protein Network Analysis
Protein Network AnalysisProtein Network Analysis
Protein Network Analysis
 
Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
 
NetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-vizNetBioSIG2012 anyatsalenko-en-viz
NetBioSIG2012 anyatsalenko-en-viz
 
Psb tutorial cancer_pathways
Psb tutorial cancer_pathwaysPsb tutorial cancer_pathways
Psb tutorial cancer_pathways
 
WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
EnrichNet: Graph-based statistic and web-application for gene/protein set enr...
 
Metabolic network mapping for metabolomics
Metabolic network mapping for metabolomicsMetabolic network mapping for metabolomics
Metabolic network mapping for metabolomics
 
Algal Functional Annotation Tool
Algal Functional Annotation ToolAlgal Functional Annotation Tool
Algal Functional Annotation Tool
 

Recently uploaded

The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
AlguinaldoKong
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 

Recently uploaded (20)

The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 

Mapping metabolites against pathway databases

  • 2. DATA ACQUISITION Separation Detection SAMPLING EXTRACTION DATA PROCESSING File Conversion Baseline Correction Peak Detection Deconvolution Adduct Annotation Alignment Gap Filling STATISTICS Normalization Multivariate Analysis (Parametric, Nonparametric) Univariate Analysis (Unsupervised, Supervised) BIOLOGICAL INTERPRETATION Pathway Mapping Network Enrichment STUDY DESIGN VALIDATION COMPOUND IDENTIFICATION Molecular Formula ID Structure ID MS Library Search Database Search In silico Fragmentation WCMC UC Davis
  • 3. Question : Which metabolites from my list can be mapped to metabolic pathways ? Approach : Database mapping Results: A. Table of pathways with statistics B. Maps overlaid with detected metabolites C. Metabolites not mapped to pathways
  • 4. Data preparation C00003 C00016 C00019 C00020 C00025 C00047 C00064 C00082 C00086 C00099 C00105 Select the KEGG database identifiers for compounds having pvalue <0.05 in your experiment. To get the KEGG ids for your compound list, you can use PubChem identifier exchange service. https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi
  • 5. Prepare the compound list Sort the statistical results by p-value Select the KEGG ids of compounds with p-value <0.05 Use filter in Excel Example study : “spring_2018_metabolomics_course_pathway_example.xlsx” in the PathwayAnalysis_example folder
  • 6. Four major databases for pathway mapping – • KEGG • Reactome • HMDB • Consensus path db For a comprehensive list of pathway databases – go to http://www.pathguide.org/
  • 7. KEGG : Kyoto encyclopedia of genes and genomes http://www.genome.jp/kegg/ GO TO KEGG is a collection of databases dealing with genomes, biological pathways, diseases, drugs, and chemical substances. KEGG is utilized for bioinformatics research and education, including data analysis in genomics, metagenomics, metabolomics and other omics studies, modeling and simulation in systems biology, and translational research in drug development.
  • 8. http://www.genome.jp/kegg/ Click on KEGG Mapper (at the bottom of the page)
  • 9. Click on search pathway
  • 10. Copy and paste the KEGG Ids in this box Click on execute Change organism if needed
  • 11. KEGG Results - Click on the a pathway to get the map with compound overlaid on it.
  • 12. KEGG Global metabolic map Red dots are the compounds present in the metabolite list.
  • 13. Green boxes are enzymes mapped to the human genome. White boxes are enzymes not mapped to the human genome. Red circle shows the presence of this compound in the metabolite list
  • 14. Reactome pathway mapping www.reactome.org “REACTOME is an open-source, open access, manually curated and peer-reviewed pathway database. Our goal is to provide intuitive bioinformatics tools for the visualization, interpretation and analysis of pathway knowledge to support basic and clinical research, genome analysis, modeling, systems biology and education. Founded in 2001, the Reactome project is led by Lincoln Stein of OICR, Peter D’Eustachio of NYULMC, Henning Hermjakob of EMBL-EBI, and Guanming Wu of OHSU.”
  • 16. Copy and paste KEGG Ids in this box. # first line should be “#Small_molecules_KEGG”
  • 17.
  • 19. Zooming in and double clicking will show the map of a pathway Metabolite in the list
  • 20. MetaboAnalyst Pathway Mapping “To provide a user-friendly, web-based analytical pipeline for high-throughput metabolomics studies. In particular, MetaboAnalyst aims to offer a variety of commonly used procedures for metabolomic data processing, normalization, multivariate statistical analysis, as well as data annotation. The current implementation focuses on exploratory statistical analysis, functional interpretation, and advanced statistics for translational metabolomics studies.” http://www.metaboanalyst.ca/
  • 22. Paste the KEGG ids and select the type of identifiers Click : Submit Click : Submit Select a pathway library Click : Submit
  • 23. Results Pathway topology : centrality of compounds
  • 24. Output table Pathway with FDR < 0.20 are selected
  • 26. Consensus PathDB Metabolic Network Mapping “ConsensusPathDB-human integrates interaction networks in Homo sapiens including binary and complex protein-protein, genetic, metabolic, signaling, gene regulatory and drug-target interactions, as well as biochemical pathways. Data originate from currently 32 public resources for interactions (listed below) and interactions that we have curated from the literature. The interaction data are integrated in a complementary manner (avoiding redundancies), resulting in a seamless interaction network containing different types of interactions.” http://cpdb.molgen.mpg.de/
  • 28.
  • 29. Results Click on select “all” Pathway similarity network
  • 32. Input KEGG ids are mapped to internal ids and to various databases. Click on “all” to select them. Then click on the show interactions button.
  • 33. CPDB Interaction Results Click on “all” to select them and then click on map and visualize interactions.
  • 36. Key points : • Not all detected metabolites have KEGG identifiers. • Not all compounds with KEGG ids can be mapped to metabolic pathways. • Pathway maps may not show all the reaction known for a metabolite. • Several compound appeared in many pathways. • Online tools can map metabolite list to pathway maps and perform pathway over-representation analysis for metabolites. • Online network visualization of a large number of metabolites is not efficient.
  • 37. KEGG - 495 MetaCyc - 2453 Reactome - 2000 HMDB – 613 Wikipathways - 789 Note : Pathway definitions are manual and vary across different databases. Major pathway databases How many pathway we know so far ?
  • 40. Glucose belongs to ? KEGG DB Reactome DB Pathway boundaries are arbitrary
  • 41. Chemical-similarity maps of small molecules and metals in human blood Rappaport, Stephen M., et al. "The blood exposome and its role in discovering causes of disease." Environmental Health Perspectives (Online)122.8 (2014): 769. Node size = number of pathways Node size = Epidemiological studies • Epidemiologists and systems biologists / biochemist don’t study same compounds. • Epidemiologist seem to study compounds having a lesser number of known pathways. Do everyone care about pathways ?Do everyone care about pathways ?
  • 42. Which TCA definitions ? KEGG Reactome SMPDB MetaCyc Which TCA definition ?
  • 43. P <0.10 P <0.05 Which p-value : < 0.05 OR < 0.10 decide the p-value cutoff to get the list of input metabolites.
  • 44. Metabolite pathway analysis: how about pathway topology ? Shall they have a same score ? Topology scoring If affected compounds are in close biochemical proximity, the pathway should be given a weightage score.
  • 45. Fahrmann et al 2017 Gene Ontology TCA map -include gene names or symbols in the map Use pathway maps for focused biochemical visualization
  • 46. Use pathway maps for focused biochemical visualization Pathway maps provided by databases are not used by publications. Authors draw their own maps and show data on it. https://www.nature.com/articles/cddis2011123
  • 47. How to create these customized maps ? (Re) read biochemistry books and draw them OR
  • 48. How to create these customized maps ? Metabolites Reactions Genes Pathway layout Biochemical databases Metabolomics dataset Cytsocape + powerpoint **No automated way exits to make pretty looking customized pathway maps. ProteinsKEGG HMDB MetaCyc Wikipathways Reactome Use biochemical databases
  • 49. Next questions : • How to map all the identified metabolites into a biochemical network ? • How to show all the known reactions for a metabolite ? • How to map publishable network graphs ?

Editor's Notes

  1. https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-13-99