SlideShare a Scribd company logo
Extracting biological meaning from large gene list with DAVID Huang et al., CurrProtoc Bioinformatics (2009) http://david.abcc.ncifcrf.gov/home.jsp Francesco Mattia Mancuso (francesco.mancuso@crg.es) Bioinfarmatics Core Facility Short Tutorial
Introduction ,[object Object]
 Proteomics
 Expression microarray
 Promoter microarray
ChIP-on-CHIPs
 …significant capabilities to study a large variety of biological mechanisms, including associations with diseases large ‘interesting’ gene list (ranging in size from hundreds to thousands of genes) involved in studied biological conditions.  Data Analysis of genes/proteins list ,[object Object]
Challenging task,[object Object]
Released in 2003 (Dennis et al., Genome Biol.; Hosack et al., Genome Biol.)
able to extract biological features/meaning associated with large gene lists
able to handle any type of gene listCommon strategy with other tool: ,[object Object]
gene ontology terms
to statistically highlight the most overrepresented biological annotation
enrichment,[object Object]
Main objectives of GO project Compile and provide GO terms; Use of structured vocabularies in the annotation of gene products; Provide open access to the GO database and Web resource. Independent sets of vocabularies Molecular Function (MF) – elemental activity or task performed, or potentially performed, by individual gene products (e.g. “DNA binding” and “catalytic activity”); Cellular Component (CC) – location of action for a gene product (e.g. “organelle membrane” and “cytoskeleton”); Biological Process (BP) – broad biological objective or goal in which a gene product participates. (e.g. “DNA replication” and “response to stimulus”).
[object Object]
The accession ID belongs with the definition.
if a term changes (e.g., from “chromatin” to “structural component of chromatin”), but not the definition of the term, the accession ID will remain the same.Directed acyclic graphs (DAGs) Semantic relationships between parent and child terms: ,[object Object]
part_of: the child is a component of the parent, such as a subprocess or physical part (e.g. nucleolus is part of nuclear lumen),[object Object]
Enrichment  and p-valuescalculatedwith a hypergeometricdistribution N = all genes (universe) M = all genes belonging to a pathway n = your gene list m = genes of your gene list that belongs to the pathway Other well-known statistical methods: χ2, Fisher’s exact test, Binomial probability
A 'good' gene list Contains many important genes (marker genes) as expected; Reasonable number of genes ranging from hundreds to thousands (e.g., 100–2,000 genes), not extremely low or high; Most of the genes significantly pass the statistical threshold; Portion of up- or down-regulated genes are involved in certain interesting biological processes, rather than being randomly spread throughout all possible biological processes; Consistently contain more enriched biology than that of a random list in the same size range; High reproducibility to generate a similar gene list under the same conditions; Data high quality can be confirmed by other independent experiments.
DAVIDhomepage:   http://david.abcc.ncifcrf.gov/home.jsp
The wide-range collection of heterogeneous functional annotations in the DAVID Knowledgebase
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
GENE LIST MANAGEMENT PANEL: SUBMIT AND MANAGE USER’S GENE LISTS
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
GENE NAME BATCH VIEWER: EXPLORE GENE NAMES BASED ON USER’S GENE IDs
ID CONVERSION TOOL: CONVERT USERS’ GENE IDs TO DIFFERENT TYPES
Exercise 1 Submit data and convert the IDs Cicala, C. et al. HIV envelope induces a cascade of cell signals in non-proliferating target cells that favor virus replication. Proc. Natl. Acad. Sci. USA 99, 9380–9385 (2002). “Freshly isolated peripheral blood mononuclear cells were treated with an HIV envelope protein (gp120) and genome-wide gene expression changes were observed using Affymetrix U95A microarray chips. The aim of the experiment was to investigate cellular responses to viral envelope protein infection, which may help in understanding the mechanisms for HIV replication in resting or sub-optimally activated peripheral blood mononuclear cells.” DOWNLOAD THE DATASET FROM : http://www.nature.com/nprot/journal/v4/n1/suppinfo/nprot.2008.211_S1.html Supplementary Data 2
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
GENE FUNCTIONAL CLASSIFICATION TOOL: CLASSIFY USERS’ GENES INTO CO-FUNCTIONAL GENE GROUPS
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
FUNCTIONAL ANNOTATION TOOL: IDENTIFY ENRICHEDBIOLOGY WITHIN USERS’ GENE LISTS

More Related Content

What's hot

Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
Rida Khalid
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
Esakkiammal S
 
BLAST
BLASTBLAST
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
sandeshGM
 
Blast
BlastBlast
(Expasy)
(Expasy)(Expasy)
(Expasy)
Mazhar Khan
 
Protein Databases
Protein DatabasesProtein Databases
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
Saramita De Chakravarti
 
Drug and Chemical Databases 2018 - Drug Discovery
Drug and Chemical Databases 2018 - Drug DiscoveryDrug and Chemical Databases 2018 - Drug Discovery
Drug and Chemical Databases 2018 - Drug Discovery
Girinath Pillai
 
Database Searching
Database SearchingDatabase Searching
Database Searching
Meghaj Mallick
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
KAUSHAL SAHU
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
W Roseybala Devi
 
Prosite
PrositeProsite
Structural databases
Structural databases Structural databases
Structural databases
Priyadharshana
 
BITS: Basics of sequence analysis
BITS: Basics of sequence analysisBITS: Basics of sequence analysis
BITS: Basics of sequence analysis
BITS
 
Blast
BlastBlast
Proteins databases
Proteins databasesProteins databases
Proteins databases
Hafiz Muhammad Zeeshan Raza
 
Rasmol
RasmolRasmol

What's hot (20)

Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
 
BLAST
BLASTBLAST
BLAST
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Blast
BlastBlast
Blast
 
Est database
Est databaseEst database
Est database
 
(Expasy)
(Expasy)(Expasy)
(Expasy)
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Drug and Chemical Databases 2018 - Drug Discovery
Drug and Chemical Databases 2018 - Drug DiscoveryDrug and Chemical Databases 2018 - Drug Discovery
Drug and Chemical Databases 2018 - Drug Discovery
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
 
Prosite
PrositeProsite
Prosite
 
Structural databases
Structural databases Structural databases
Structural databases
 
BITS: Basics of sequence analysis
BITS: Basics of sequence analysisBITS: Basics of sequence analysis
BITS: Basics of sequence analysis
 
Blast
BlastBlast
Blast
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Rasmol
RasmolRasmol
Rasmol
 

Similar to David

Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
UpendraSharmaUS1
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
Ankit Bhardwaj
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
JesminBinti
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomicsAisha Kalsoom
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformaticaMartín Arrieta
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
International Institute of Tropical Agriculture
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
Amos Watentena
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
Lars Juhl Jensen
 
X-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolX-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage tool
Genomika Diagnósticos
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
csandit
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
Vidya Kalaivani Rajkumar
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision
 
Bioinformatics
BioinformaticsBioinformatics
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
improvemed
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
MSCW Mysore
 
IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED-V2I1P5
IJSRED-V2I1P5
IJSRED
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
KAUSHAL SAHU
 
Utilizing literature for biological discovery
Utilizing literature for biological discoveryUtilizing literature for biological discovery
Utilizing literature for biological discovery
Lars Juhl Jensen
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
adcobb
 

Similar to David (20)

Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
X-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolX-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage tool
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED-V2I1P5
IJSRED-V2I1P5
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
 
Utilizing literature for biological discovery
Utilizing literature for biological discoveryUtilizing literature for biological discovery
Utilizing literature for biological discovery
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 

Recently uploaded

Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Po-Chuan Chen
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 

Recently uploaded (20)

Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 

David

  • 1. Extracting biological meaning from large gene list with DAVID Huang et al., CurrProtoc Bioinformatics (2009) http://david.abcc.ncifcrf.gov/home.jsp Francesco Mattia Mancuso (francesco.mancuso@crg.es) Bioinfarmatics Core Facility Short Tutorial
  • 2.
  • 7.
  • 8.
  • 9. Released in 2003 (Dennis et al., Genome Biol.; Hosack et al., Genome Biol.)
  • 10. able to extract biological features/meaning associated with large gene lists
  • 11.
  • 13. to statistically highlight the most overrepresented biological annotation
  • 14.
  • 15. Main objectives of GO project Compile and provide GO terms; Use of structured vocabularies in the annotation of gene products; Provide open access to the GO database and Web resource. Independent sets of vocabularies Molecular Function (MF) – elemental activity or task performed, or potentially performed, by individual gene products (e.g. “DNA binding” and “catalytic activity”); Cellular Component (CC) – location of action for a gene product (e.g. “organelle membrane” and “cytoskeleton”); Biological Process (BP) – broad biological objective or goal in which a gene product participates. (e.g. “DNA replication” and “response to stimulus”).
  • 16.
  • 17. The accession ID belongs with the definition.
  • 18.
  • 19.
  • 20. Enrichment and p-valuescalculatedwith a hypergeometricdistribution N = all genes (universe) M = all genes belonging to a pathway n = your gene list m = genes of your gene list that belongs to the pathway Other well-known statistical methods: χ2, Fisher’s exact test, Binomial probability
  • 21. A 'good' gene list Contains many important genes (marker genes) as expected; Reasonable number of genes ranging from hundreds to thousands (e.g., 100–2,000 genes), not extremely low or high; Most of the genes significantly pass the statistical threshold; Portion of up- or down-regulated genes are involved in certain interesting biological processes, rather than being randomly spread throughout all possible biological processes; Consistently contain more enriched biology than that of a random list in the same size range; High reproducibility to generate a similar gene list under the same conditions; Data high quality can be confirmed by other independent experiments.
  • 22. DAVIDhomepage: http://david.abcc.ncifcrf.gov/home.jsp
  • 23. The wide-range collection of heterogeneous functional annotations in the DAVID Knowledgebase
  • 24. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 25. GENE LIST MANAGEMENT PANEL: SUBMIT AND MANAGE USER’S GENE LISTS
  • 26. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 27. GENE NAME BATCH VIEWER: EXPLORE GENE NAMES BASED ON USER’S GENE IDs
  • 28. ID CONVERSION TOOL: CONVERT USERS’ GENE IDs TO DIFFERENT TYPES
  • 29.
  • 30. Exercise 1 Submit data and convert the IDs Cicala, C. et al. HIV envelope induces a cascade of cell signals in non-proliferating target cells that favor virus replication. Proc. Natl. Acad. Sci. USA 99, 9380–9385 (2002). “Freshly isolated peripheral blood mononuclear cells were treated with an HIV envelope protein (gp120) and genome-wide gene expression changes were observed using Affymetrix U95A microarray chips. The aim of the experiment was to investigate cellular responses to viral envelope protein infection, which may help in understanding the mechanisms for HIV replication in resting or sub-optimally activated peripheral blood mononuclear cells.” DOWNLOAD THE DATASET FROM : http://www.nature.com/nprot/journal/v4/n1/suppinfo/nprot.2008.211_S1.html Supplementary Data 2
  • 31. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 32. GENE FUNCTIONAL CLASSIFICATION TOOL: CLASSIFY USERS’ GENES INTO CO-FUNCTIONAL GENE GROUPS
  • 33.
  • 34.
  • 35. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 36. FUNCTIONAL ANNOTATION TOOL: IDENTIFY ENRICHEDBIOLOGY WITHIN USERS’ GENE LISTS
  • 37.
  • 38. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 39.
  • 41. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 43. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 44. Functional Annotation Table There is no statistics applied in this report.
  • 45. Attention!!!!! DAVID enrichment analysis is more of an exploratory procedure than a pure statistical solution. “The final interpretation and analytic result decisions (in terms of accepting the results that make sense biologically in the context of the study, or rejecting ones that do not) should be made by the biologists/analysts themselves, rather than by any of the tools.” (Huang et al., 2009)
  • 46.
  • 47.
  • 48. EASE Score Threshold (Maximum Probability): the threshold of EASE Score, a modified Fisher Exact P-value, for gene-enrichment analysis. It ranges from 0 to 1. Fisher Exact P-Value = 0 represents perfect enrichment.
  • 49. The Fold Enrichment is defined as the ratio of the two proportions. For example, if 40/400 (i.e. 10%) of your input genes involved in "kinase activity" and the background information is 300/30000 genes (i.e.  1%) associating with "kinase activity", roughly 10% / 1% = 10 fold enrichment.
  • 50. In DAVID annotation system, Fisher Exact is adopted to measure the gene-enrichment in annotation terms. When members of two independent groups can fall into one of two mutually exclusive categories, Fisher Exact test is used to determine whether the proportions of those falling into each category differs by group.
  • 51. Benjamini-Hochberg, Bonferroni, FDR (False Discovery Rate) are different 'standard' statistics for multiple comparison corrections. They correct P-values to be more conservative in order to lower family-wise false discovery rate.
  • 52. LT (list total): number of genes in your gene list mapped to any term in this ontology ("system”)
  • 53. PH (population hits): number of genes with this GO term on the background list (the whole chip)
  • 54. PT (population total): number of genes on the background list (the whole chip) mapped to any term in this ontology ("system”)

Editor's Notes

  1. GoMiner, GOstat, Onto-express, GoToolBox, FatiGO, GFINDer and GSEA
  2. 3 - (e.g., selecting genes by comparing gene expression between control and experimental cells with t-test statistics: fold changes greater than or equal to 2 and P-values less than or equal to 0.05)6 - e.g., by independent experiments under the same conditions or by leave-one-out statistical test
  3. Functional classification: ability for investigators to explore and view functionally related genes together, as a unit, to concentrate on the larger biological network rather than at the level of an individual gene.Functional Annotation chart: provides typical gene–term enrichment (overrepresented) analysis to identify the most relevant (overrepresented) biological terms associated with a given gene listFunctional Annotation Clustering: uses a similar fuzzy clustering concept as functional classification by measuring relationships among the annotation terms on the basis of the degree of their coassociation with genes within the user’s list to cluster somewhat heterogeneous, yet highly similar annotation into functional annotation groupsFunctional annotation table: is a query engine for the DAVID knowledgebase, without statistical calculations. For a given gene list, the tool can quickly query corresponding annotation for each gene and present them in a table format.