SlideShare a Scribd company logo
1 of 30
Gene Ontology Network 
Enrichment Analysis 
Dmitry Grapov, PhD
Download all material for the tutorial 
https://sourceforge.net/projects/teachingdemos/files/ 
Choose 2014 UC Davis Proteomics Workshop or use the 
full URL below 
https://sourceforge.net/projects/teachingdemos/files/2014%20UC%
• decrease 
• increase 
Use functional analysis to identify if the changes in variables 
are enriched (increased compared to random chance) for 
some biological pathway, domain or ontological category.
Enrichment or Overrepresentation analysis 
Biochemical Pathway Biochemical Ontology
Major Tasks 
Using the proteins listed in the excel workbook: ‘proteomic data for 
analysis.xlsx’ and worksheet: ‘protein IDs’ 
1. Conduct Gene Ontology (GO) Enrichment Analysis using 
DAVID Bioinformatics Resources 
http://david.abcc.ncifcrf.gov/home.jsp 
2. Investigate enriched terms using 
Quick GO http://www.ebi.ac.uk/QuickGO/ 
3. Summaries and visualize the results using 
REVIGO http://revigo.irb.hr/ 
4. Create and modify GO network using 
Cytoscape http://www.cytoscape.org/
Protein IDs 
Common protein identifier 
UniProt/SwissProt Accession 
(default in scaffold) 
http://www.uniprot.org/ 
Use Biomart to translate to other 
database IDS 
http://www.biomart.org/ 
e.g. gene symbols
David Bioinformatics Resources 
http://david.abcc.ncifcrf.gov/home.jsp
David Bioinformatics Resources 
1. Upload list 
2. Choose ID 
type 
3. Select list 
type 
4. Submit
David Bioinformatics Resources 
organism Make sure all IDs were recognized 
List of 
biochemical 
databases tested 
for enrichment
David Bioinformatics Resources 
List of 
biochemical 
databases tested 
for enrichment 
1. Choose GO
David Bioinformatics Resources 
http://david.abcc.ncifcrf.gov/helps/functional_annotation.html#E3
David Bioinformatics Resources 
List of 
biochemical 
databases tested 
for enrichment 
1. Overview 
BP: Biological 
process 
2. Select
David Bioinformatics Resources 
http://david.abcc.ncifcrf.gov/helps/functional_annotation.html#E3
David Bioinformatics Resources 
1. Overview most enriched term
Quick GO http://www.ebi.ac.uk/QuickGO/ 
1. View children (lower hierarchy subsets) of this term
David Bioinformatics Resources/Quick GO 
1. Can you identify any enriched 
children of this term in our DAVID 
output? 
? 
2. Download 
results
Overview and Format Results in Excel 
1. Save results 2. Open in MS Excel
Overview Results 
Modified Fisher’s Exact Test p-value 
optionally: Check in R 
x<-data.frame(user=c(1,47),genome=c(690,13528)) 
fisher.test(x) # p-value = 5.41e-06 
(13/47) / (690/13528)
Alternative to Fisher Exact Test: 
Hypergeometric Test 
How to calculate statistics to determine enrichment? 
hit.num = 51 # number of significantly changed pathway variables 
set.num = 1455 # number of variables in pathway 
full = 3358 # all possible variables in organism 
q.size = 72 # number of significantly changed variables 
phyper(hit.num-1, set.num, full-set.num, q.size, lower.tail=F) 
enrichment p-value = 1.717553e-06
Visualization Options 
Challenges: 
•Removal of redundant information 
•Visualizing term relationships (term-term, term-protein)
Use REVIGO to filter redundant terms 
http://revigo.irb.hr/ 
prepare input (term, p-value) 
1. Upload to 
REVIGO 
2. Run 
Supek F, Bošnjak M, Škunca N, Šmuc T. "REVIGO summarizes and visualizes long lists of Gene Ontology terms" PLoS ONE 2011. doi:10.1371/journal.pone.0021800
REVIGO: overview scatterplot 
Position defined on similarity (MDS)
REVIGO: overview table 
Cluster leaders prioritized based on enrichment p-value
REVIGO: network 
• Edges: 3% of the 
strongest GO term 
pairwise similarities 
• Node size: generality 
of term 
(small = specific) 
• Node color: p-value 
Download network
Cytoscape 
1. Open Cytoscape 
Import REVIGO network into cytoscape 
2 
3 4
Cytoscape: set layout and defaults 
1. Set layout 3. Set network defaults 
2 
4 5
Cytoscape: map data to network properties 
1. Set Edge width and color 2. Set Node labels, size and color
Cytoscape: overview network components 
Download edge information 
1 
2 
3. View in excel 
Download node information 
1 
2 
3. View in excel
Bonus: Modify Edge and Node Attributes to show 
term to protein connections 
See file ‘test edge.xlsx’ and ‘test node.xslx, for examples of upload 
formats 
See detailed instructions at http://www.slideshare.net/dgrapov/demonstration-of-network-mapping
See more Statistical and Multivariate Analysis Examples at 
http://imdevsoftware.wordpress.com/tutorials/ 
Questions? 
dgrapov@ucdavis.edu 
This research was supported in part by NIH 1 U24 DK097154

More Related Content

What's hot

Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsNikesh Narayanan
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentSanaym
 
Structural Genomics
Structural GenomicsStructural Genomics
Structural GenomicsAqsa Javed
 
Role of ensembl in genome browsing
Role of ensembl in genome browsingRole of ensembl in genome browsing
Role of ensembl in genome browsingJoydeep16
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentAfra Fathima
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and toolsKAUSHAL SAHU
 
Protein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy serverProtein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy serverEkta Gupta
 
RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3BITS
 
Structural genomics
Structural genomicsStructural genomics
Structural genomicsAshfaq Ahmad
 
Blast fasta
Blast fastaBlast fasta
Blast fastayaghava
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAthira RG
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysisDr. Naveen Gaurav srivastava
 
Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016Juan Antonio Vizcaino
 
Functional proteomics, methods and tools
Functional proteomics, methods and toolsFunctional proteomics, methods and tools
Functional proteomics, methods and toolsKAUSHAL SAHU
 
Study of Transcriptome
Study of TranscriptomeStudy of Transcriptome
Study of TranscriptomeBOTANYWith
 

What's hot (20)

Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
David
DavidDavid
David
 
Structural Genomics
Structural GenomicsStructural Genomics
Structural Genomics
 
Role of ensembl in genome browsing
Role of ensembl in genome browsingRole of ensembl in genome browsing
Role of ensembl in genome browsing
 
BLAST
BLASTBLAST
BLAST
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 
Protein structure analysis
Protein structure analysis Protein structure analysis
Protein structure analysis
 
Protein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy serverProtein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy server
 
RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Blast fasta
Blast fastaBlast fasta
Blast fasta
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016Introduction to the Proteomics Bioinformatics Course 2016
Introduction to the Proteomics Bioinformatics Course 2016
 
Functional proteomics, methods and tools
Functional proteomics, methods and toolsFunctional proteomics, methods and tools
Functional proteomics, methods and tools
 
Study of Transcriptome
Study of TranscriptomeStudy of Transcriptome
Study of Transcriptome
 

Similar to Gene Ontology Network Enrichment Analysis

Gene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialGene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialDmitry Grapov
 
Linked Data for Federation of OER Data &amp; Repositories
Linked Data for Federation of OER Data &amp; RepositoriesLinked Data for Federation of OER Data &amp; Repositories
Linked Data for Federation of OER Data &amp; RepositoriesStefan Dietze
 
Predictive modeling DBs
Predictive modeling DBsPredictive modeling DBs
Predictive modeling DBsDataVita
 
How can you access PubChem programmatically?
How can you access PubChem programmatically?How can you access PubChem programmatically?
How can you access PubChem programmatically?Sunghwan Kim
 
Multivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisMultivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisDmitry Grapov
 
Curation-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific ResearcherCuration-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific Researcherbwestra
 
Connecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataConnecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataTomasz Adamusiak
 
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...GigaScience, BGI Hong Kong
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsYaoyu Wang
 
Scott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data PublishingScott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data PublishingGigaScience, BGI Hong Kong
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeDavid Portnoy
 
Accessing and scripting CDK from Bioclipse
Accessing and scripting CDK from BioclipseAccessing and scripting CDK from Bioclipse
Accessing and scripting CDK from BioclipseOla Spjuth
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 

Similar to Gene Ontology Network Enrichment Analysis (20)

Gene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialGene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -Tutorial
 
Linked Data for Federation of OER Data &amp; Repositories
Linked Data for Federation of OER Data &amp; RepositoriesLinked Data for Federation of OER Data &amp; Repositories
Linked Data for Federation of OER Data &amp; Repositories
 
Predictive modeling DBs
Predictive modeling DBsPredictive modeling DBs
Predictive modeling DBs
 
FAIR Cookbook
FAIR Cookbook FAIR Cookbook
FAIR Cookbook
 
How can you access PubChem programmatically?
How can you access PubChem programmatically?How can you access PubChem programmatically?
How can you access PubChem programmatically?
 
Computer Scientists Retrieval - PDF Report
Computer Scientists Retrieval - PDF ReportComputer Scientists Retrieval - PDF Report
Computer Scientists Retrieval - PDF Report
 
Multivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisMultivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysis
 
Curation-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific ResearcherCuration-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific Researcher
 
Connecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataConnecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked Data
 
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
 
iMicrobe_ASLO_2015
iMicrobe_ASLO_2015iMicrobe_ASLO_2015
iMicrobe_ASLO_2015
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
 
2013 eswc-bio2rdf-r2
2013 eswc-bio2rdf-r22013 eswc-bio2rdf-r2
2013 eswc-bio2rdf-r2
 
SADI CSHALS 2013
SADI CSHALS 2013SADI CSHALS 2013
SADI CSHALS 2013
 
Scott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data PublishingScott Edmunds ISMB talk on Big Data Publishing
Scott Edmunds ISMB talk on Big Data Publishing
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human Genome
 
Accessing and scripting CDK from Bioclipse
Accessing and scripting CDK from BioclipseAccessing and scripting CDK from Bioclipse
Accessing and scripting CDK from Bioclipse
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018
 
Pine education-platform
Pine education-platformPine education-platform
Pine education-platform
 

More from UC Davis

Presentation phinney abrf 2019
Presentation phinney abrf 2019Presentation phinney abrf 2019
Presentation phinney abrf 2019UC Davis
 
Prosit google-cloud
Prosit google-cloudProsit google-cloud
Prosit google-cloudUC Davis
 
Phinney 2019 ASMS Proteome software Users group Talk
Phinney 2019 ASMS Proteome software Users group TalkPhinney 2019 ASMS Proteome software Users group Talk
Phinney 2019 ASMS Proteome software Users group TalkUC Davis
 
Genome web july 2019 presentation phinney
Genome web july 2019 presentation phinneyGenome web july 2019 presentation phinney
Genome web july 2019 presentation phinneyUC Davis
 
Some statistical concepts relevant to proteomics data analysis
Some statistical concepts relevant to proteomics data analysisSome statistical concepts relevant to proteomics data analysis
Some statistical concepts relevant to proteomics data analysisUC Davis
 
Multivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic DataMultivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic DataUC Davis
 
Asms qc Will Thompson Duke
Asms qc Will Thompson DukeAsms qc Will Thompson Duke
Asms qc Will Thompson DukeUC Davis
 
Phinney varibility workshop
Phinney varibility workshopPhinney varibility workshop
Phinney varibility workshopUC Davis
 
Colangelo asms workshop_061714
Colangelo asms workshop_061714Colangelo asms workshop_061714
Colangelo asms workshop_061714UC Davis
 
Moeller proteomics course
Moeller proteomics courseMoeller proteomics course
Moeller proteomics courseUC Davis
 

More from UC Davis (10)

Presentation phinney abrf 2019
Presentation phinney abrf 2019Presentation phinney abrf 2019
Presentation phinney abrf 2019
 
Prosit google-cloud
Prosit google-cloudProsit google-cloud
Prosit google-cloud
 
Phinney 2019 ASMS Proteome software Users group Talk
Phinney 2019 ASMS Proteome software Users group TalkPhinney 2019 ASMS Proteome software Users group Talk
Phinney 2019 ASMS Proteome software Users group Talk
 
Genome web july 2019 presentation phinney
Genome web july 2019 presentation phinneyGenome web july 2019 presentation phinney
Genome web july 2019 presentation phinney
 
Some statistical concepts relevant to proteomics data analysis
Some statistical concepts relevant to proteomics data analysisSome statistical concepts relevant to proteomics data analysis
Some statistical concepts relevant to proteomics data analysis
 
Multivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic DataMultivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic Data
 
Asms qc Will Thompson Duke
Asms qc Will Thompson DukeAsms qc Will Thompson Duke
Asms qc Will Thompson Duke
 
Phinney varibility workshop
Phinney varibility workshopPhinney varibility workshop
Phinney varibility workshop
 
Colangelo asms workshop_061714
Colangelo asms workshop_061714Colangelo asms workshop_061714
Colangelo asms workshop_061714
 
Moeller proteomics course
Moeller proteomics courseMoeller proteomics course
Moeller proteomics course
 

Recently uploaded

Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Silpa
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Silpa
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry Areesha Ahmad
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.Silpa
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxANSARKHAN96
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Silpa
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLkantirani197
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxSilpa
 

Recently uploaded (20)

Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 

Gene Ontology Network Enrichment Analysis

  • 1. Gene Ontology Network Enrichment Analysis Dmitry Grapov, PhD
  • 2. Download all material for the tutorial https://sourceforge.net/projects/teachingdemos/files/ Choose 2014 UC Davis Proteomics Workshop or use the full URL below https://sourceforge.net/projects/teachingdemos/files/2014%20UC%
  • 3. • decrease • increase Use functional analysis to identify if the changes in variables are enriched (increased compared to random chance) for some biological pathway, domain or ontological category.
  • 4. Enrichment or Overrepresentation analysis Biochemical Pathway Biochemical Ontology
  • 5. Major Tasks Using the proteins listed in the excel workbook: ‘proteomic data for analysis.xlsx’ and worksheet: ‘protein IDs’ 1. Conduct Gene Ontology (GO) Enrichment Analysis using DAVID Bioinformatics Resources http://david.abcc.ncifcrf.gov/home.jsp 2. Investigate enriched terms using Quick GO http://www.ebi.ac.uk/QuickGO/ 3. Summaries and visualize the results using REVIGO http://revigo.irb.hr/ 4. Create and modify GO network using Cytoscape http://www.cytoscape.org/
  • 6. Protein IDs Common protein identifier UniProt/SwissProt Accession (default in scaffold) http://www.uniprot.org/ Use Biomart to translate to other database IDS http://www.biomart.org/ e.g. gene symbols
  • 7. David Bioinformatics Resources http://david.abcc.ncifcrf.gov/home.jsp
  • 8. David Bioinformatics Resources 1. Upload list 2. Choose ID type 3. Select list type 4. Submit
  • 9. David Bioinformatics Resources organism Make sure all IDs were recognized List of biochemical databases tested for enrichment
  • 10. David Bioinformatics Resources List of biochemical databases tested for enrichment 1. Choose GO
  • 11. David Bioinformatics Resources http://david.abcc.ncifcrf.gov/helps/functional_annotation.html#E3
  • 12. David Bioinformatics Resources List of biochemical databases tested for enrichment 1. Overview BP: Biological process 2. Select
  • 13. David Bioinformatics Resources http://david.abcc.ncifcrf.gov/helps/functional_annotation.html#E3
  • 14. David Bioinformatics Resources 1. Overview most enriched term
  • 15. Quick GO http://www.ebi.ac.uk/QuickGO/ 1. View children (lower hierarchy subsets) of this term
  • 16. David Bioinformatics Resources/Quick GO 1. Can you identify any enriched children of this term in our DAVID output? ? 2. Download results
  • 17. Overview and Format Results in Excel 1. Save results 2. Open in MS Excel
  • 18. Overview Results Modified Fisher’s Exact Test p-value optionally: Check in R x<-data.frame(user=c(1,47),genome=c(690,13528)) fisher.test(x) # p-value = 5.41e-06 (13/47) / (690/13528)
  • 19. Alternative to Fisher Exact Test: Hypergeometric Test How to calculate statistics to determine enrichment? hit.num = 51 # number of significantly changed pathway variables set.num = 1455 # number of variables in pathway full = 3358 # all possible variables in organism q.size = 72 # number of significantly changed variables phyper(hit.num-1, set.num, full-set.num, q.size, lower.tail=F) enrichment p-value = 1.717553e-06
  • 20. Visualization Options Challenges: •Removal of redundant information •Visualizing term relationships (term-term, term-protein)
  • 21. Use REVIGO to filter redundant terms http://revigo.irb.hr/ prepare input (term, p-value) 1. Upload to REVIGO 2. Run Supek F, Bošnjak M, Škunca N, Šmuc T. "REVIGO summarizes and visualizes long lists of Gene Ontology terms" PLoS ONE 2011. doi:10.1371/journal.pone.0021800
  • 22. REVIGO: overview scatterplot Position defined on similarity (MDS)
  • 23. REVIGO: overview table Cluster leaders prioritized based on enrichment p-value
  • 24. REVIGO: network • Edges: 3% of the strongest GO term pairwise similarities • Node size: generality of term (small = specific) • Node color: p-value Download network
  • 25. Cytoscape 1. Open Cytoscape Import REVIGO network into cytoscape 2 3 4
  • 26. Cytoscape: set layout and defaults 1. Set layout 3. Set network defaults 2 4 5
  • 27. Cytoscape: map data to network properties 1. Set Edge width and color 2. Set Node labels, size and color
  • 28. Cytoscape: overview network components Download edge information 1 2 3. View in excel Download node information 1 2 3. View in excel
  • 29. Bonus: Modify Edge and Node Attributes to show term to protein connections See file ‘test edge.xlsx’ and ‘test node.xslx, for examples of upload formats See detailed instructions at http://www.slideshare.net/dgrapov/demonstration-of-network-mapping
  • 30. See more Statistical and Multivariate Analysis Examples at http://imdevsoftware.wordpress.com/tutorials/ Questions? dgrapov@ucdavis.edu This research was supported in part by NIH 1 U24 DK097154