SlideShare a Scribd company logo
1 of 27
An introduction to Web Apollo.
A webinar for the i5K Pilot Species Projects - Hemiptera
Monica Munoz-Torres, PhD
Biocurator & Bioinformatics Analyst | @monimunozto
Genomics Division, Lawrence Berkeley National Laboratory
12+1 May, 2014
UNIVERSITY OF
CALIFORNIA
Outline
1. What is Web Apollo?:
• Definition & working concept.
2. Community based curation from our
experience. Lessons Learned.
3. Manual Annotation at i5K: how do we
get there?
4. Becoming acquainted with Web
Apollo.
An introduction to
Web Apollo.
A webinar for the i5K
Pilot Species Projects -
Hemiptera.
Outline 2
What is Web Apollo?
• Web Apollo is a web-based, collaborative genomic
annotation editing platform.
We need annotation editing tools to modify and refine the
precise location and structure of the genome elements that
predictive algorithms cannot yet resolve automatically.
31. What is Web Apollo?
Find more about Web Apollo at
http://GenomeArchitect.org
and
Genome Biol 14:R93. (2013).
Brief history of Apollo*:
a. Desktop:
one person at a time editing a
specific region, annotations
saved in local files; slowed down
collaboration.
b. Java Web Start:
users saved annotations directly
to a centralized database;
potential issues with stale
annotation data remained.
1. What is Web Apollo? 4
Biologists could finally visualize computational analyses and
experimental evidence from genomic features and build
manually-curated consensus gene structures. Apollo became a
very popular, open source tool (insects, fish, mammals, birds, etc.).
*
Web Apollo
• Browser-based; plugin for JBrowse.
• Allows for intuitive annotation creation and editing,
with gestures and pull-down menus to create
transcripts, add/delete/resize exons, merge/split
exons or transcripts, insert comments
(CV, freeform text), etc.
• Customizable rules and
appearance.
• Edits in one client are
instantly pushed to all other
clients: Collaborative!
1. What is Web Apollo? 5
Working
Concept
In the context of gene manual annotation,
curation tries to find the best examples
and/or eliminate (most) errors.
To conduct manual annotation efforts:
Gather and evaluate all available evidence
using quality-control metrics to
corroborate or modify automated
annotation predictions.
Perform sequence similarity searches
(phylogenetic framework) and use
literature and public databases to:
• Predict functional assignments from
experimental data.
• Distinguish orthologs from paralogs,
and classify gene membership in
families and networks.
2. In our experience. 6
Automated gene models
Evidence:
cDNAs, HMM domain searches,
alignments with assemblies or
genes from other species.
Manual annotation & curation
Dispersed, community-based gene
manual annotation efforts.
Using Web Apollo, we* have trained
geographically dispersed scientific
communities to perform biologically
supported manual annotations, and
monitored their findings: ~80 institutions,
14 countries, hundreds of scientists, and
gate keepers.
– Training workshops and geneborees.
– Tutorials with detailed instructions.
– Personalized user support.
2. In our experience. 7
*Collaboration with Elsik Lab,
Hymenoptera Genome
Database.
What have we learned?
Harvesting expertise from dispersed researchers who
assigned functions to predicted and curated peptides,
we have developed more interactive and responsive
tools, as well as better visualization, editing, and
analysis capabilities.
82. In our experience.
It is helpful to work together.
Scientific community efforts bring together domain-
specific and natural history expertise that would have
otherwise remain disconnected.
92. In our experience.
Improved Automated Annotations*
In many cases, automated annotations have been
improved (e.g: Apis mellifera. Elsik et al. BMC Genomics 2014, 15:86).
Also, learned of the challenges of newer sequencing
technologies, e.g.:
– Frameshifts and indel errors
– Split genes across scaffolds
– Highly repetitive sequences
To face these challenges, we train annotators in
recovering coding sequences in agreement with all
available biological evidence.
102. In our experience.
Understanding the evolution of sociality.
Comparison of the genomes of 7 species of
ants contributed to a better understanding
of the evolution and organization of insect
societies at the molecular level.
Insights drawn mainly from six core aspects of
ant biology:
1. Alternative morphological castes
2. Division of labor
3. Chemical Communication
4. Alternative social organization
5. Social immunity
6. Mutualism
11
… groups of
communities
have taught us a
lot!
Libbrecht et al. 2012. Genome Biology 2013, 14:212
2. In our experience.
A little training goes a long way!
With the right tools, wet lab scientists make exceptional
curators who can easily learn to maximize the
generation of accurate, biologically supported gene
models.
122. In our experience.
Manual annotation at i5K
How do we get there?
3. How do we get there? 13
Assembly
Manual
annotation
Experimental
validation
Automated
Annotation
In a genome sequencing project…
Gene Prediction
Gene Prediction:
Identification of protein-coding genes, tRNAs, rRNAs,
regulatory motifs, repetitive elements (masked), etc.
Ab initio or homology-based. E.g: fgenesh, Augustus,
geneid, SGP2
14
Nucleic Acids 2003 vol. 31 no. 13 3738-3741
3. How do we get there?
Gene Annotation
Gene Annotation:
Integration of data from prediction tools to generate a
consensus set of predictions (gene models).
• Models may be organized by:
- automatic integration of predicted sets; e.g: GLEAN
- packaging necessary tools into pipeline; e.g: MAKER
• Transcriptomes are used to further inform the annotation
process.
153. How do we get there?
The Collaborative Curation Process at
i5K
1) A computationally predicted consensus gene set has
been generated using multiple lines of evidence; e.g.
CLEC_v0.5.3-Models.
2) i5K Projects will integrate consensus computational
predictions with manual annotations to produce an updated
Official Gene Set (OGS):
» If it’s not on either track, it won’t make the OGS!
» If it’s there and it shouldn’t, it will still make the OGS!
163. How do we get there?
Consensus set: reference and start point
• In some cases algorithms and metrics used to generate
consensus sets may actually reduce the accuracy of the gene’s
representation; e.g. use Augustus model instead to create a new
annotation.
• Isoforms: drag original and alternatively spliced form to ‘User-
created Annotations’ area.
• If an annotation needs to be removed from the consensus set,
drag it to the ‘User-created Annotations’ area and label as
‘Delete’ on Information Editor.
• Overlapping interests? Collaborate to reach agreement.
• Follow guidelines for i5K Pilot Species Projects as shown at
http://goo.gl/LRu1VY
173. How do we get there?
Navigation tools:
pan and zoom Search box: go
to a scaffold or
a gene model.
Grey bar of coordinates
indicates location. You can
also select here in order to
zoom to a sub-region.
‘View’: change
color by CDS,
toggle strands,
set highlight.
‘File’:
Upload your own
evidence: GFF3,
BAM, BigWig, VCF*.
Add combination
and sequence
search tracks.
‘Tools’:
Use BLAT to query the
genome with a protein
or DNA sequence.
Available Tracks
Evidence Tracks Area
‘User-created Annotations’ Track
Login
Web Apollo
Graphical User Interface (GUI) for editing annotations
4. Becoming Acquainted with Web Apollo.
Flags non-
canonical splice
sites.
Selection of features and
sub-features
Edge-matching
Evidence Tracks Area
‘User-created Annotations’ Track
 The editing logic (server):
 selects longest ORF as CDS
 flags non-canonical splice sites
Web Apollo
4. Becoming Acquainted with Web Apollo.
DNA Track
‘User-created Annotations’ Track
 Two new kinds of tracks:
 annotation editing
 sequence alteration editing
Web Apollo
4. Becoming Acquainted with Web Apollo.
Web Apollo
 Annotations, annotation edits, and History: stored in a centralized database.
4. Becoming Acquainted with Web Apollo.
Web Apollo
 Annotation Information Editor
4. Becoming Acquainted with Web Apollo.
Web Apollo
 Annotation Information Editor
4. Becoming Acquainted with Web Apollo.
[Some of the] Functionality:
 Protein-coding gene annotation (that you know and love)
 Sequence alterations (less coverage = more fragmentation)
 Visualization of stage and cell-type specific transcription data as
coverage plots, heat maps, and alignments
4. Becoming Acquainted with Web Apollo.
Example: ORCO
Live Demonstration using the Cimex lectularius genome
Footer 25
Arthropodcentric Thanks!
AgriPest Base
FlyBase
Hymenoptera Genome Database
VectorBase
Apis mellifera
Tribolium castaneum
Pogonomyrmex barbatus
Manduca sexta
Bombus terrestris
Helicoverpa armigera
Nasonia vitripennis
Acyrthosiphon pisum
Mayetiola destructor
Atta cephalotes
Linepithema humile
Camponotus floridanus
Solenopsis invicta
Acromyrmex echinatior
Thanks!
• Berkeley Bioinformatics Open-source Projects
(BBOP), Berkeley Lab: Web Apollo and Gene
Ontology teams. Suzanna E. Lewis (PI).
• Elsik Lab. § University of Missouri. Christine G.
Elsik (PI).
• Ian Holmes (PI). * University of California Berkeley.
• Arthropod genomics community, i5K
http://www.arthropodgenomes.org/wiki/i5K Steering
Committee, USDA/NAL, HGSC-BCM, BGI, and
1KITE http://www.1kite.org/.
• Web Apollo is supported by NIH grants 5R01GM080203
from NIGMS, and 5R01HG004483 from NHGRI, and by the
Director, Office of Science, Office of Basic Energy
Sciences, of the U.S. Department of Energy under Contract
No. DE-AC02-05CH11231.
• Insect images used with permission:
http://AlexanderWild.com
• For your attention, thank you!
Thank you. 27
Web Apollo
Ed Lee
Gregg Helt
Colin Diesh §
Deepak Unni §
Rob Buels *
Gene Ontology
Chris Mungall
Seth Carbon
Heiko Dietze
BBOP
Web Apollo: http://GenomeArchitect.org
GO: http://GeneOntology.org
i5K: http://arthropodgenomes.org/wiki/i5K

More Related Content

What's hot

Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Monica Munoz-Torres
 
Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014
Monica Munoz-Torres
 
2013 bms-retreat-talk
2013 bms-retreat-talk2013 bms-retreat-talk
2013 bms-retreat-talk
c.titus.brown
 
JulieKlein_Bosc2012
JulieKlein_Bosc2012JulieKlein_Bosc2012
JulieKlein_Bosc2012
KUPKB_Team
 

What's hot (14)

Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
 
Web Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterWeb Apollo Workshop University of Exeter
Web Apollo Workshop University of Exeter
 
Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014
 
2013 bms-retreat-talk
2013 bms-retreat-talk2013 bms-retreat-talk
2013 bms-retreat-talk
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
 
JulieKlein_Bosc2012
JulieKlein_Bosc2012JulieKlein_Bosc2012
JulieKlein_Bosc2012
 
2014 sage-talk
2014 sage-talk2014 sage-talk
2014 sage-talk
 
Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014
 
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
 
Towards Incidental Collaboratories; Research Data Services
Towards Incidental Collaboratories; Research Data ServicesTowards Incidental Collaboratories; Research Data Services
Towards Incidental Collaboratories; Research Data Services
 
Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3
 
Towards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceTowards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experience
 

Similar to An introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera

Three's a crowd-source: Observations on Collaborative Genome Annotation
Three's a crowd-source: Observations on Collaborative Genome AnnotationThree's a crowd-source: Observations on Collaborative Genome Annotation
Three's a crowd-source: Observations on Collaborative Genome Annotation
Monica Munoz-Torres
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdf
study help
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdf
study help
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
Connected Data World
 

Similar to An introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera (20)

Three's a crowd-source: Observations on Collaborative Genome Annotation
Three's a crowd-source: Observations on Collaborative Genome AnnotationThree's a crowd-source: Observations on Collaborative Genome Annotation
Three's a crowd-source: Observations on Collaborative Genome Annotation
 
Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)
 
Apollo Workshop at KSU 2015
Apollo Workshop at KSU 2015Apollo Workshop at KSU 2015
Apollo Workshop at KSU 2015
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 Introduction
 
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
 
Apollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citriApollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citri
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
 
Variant analysis and whole exome sequencing
Variant analysis and whole exome sequencingVariant analysis and whole exome sequencing
Variant analysis and whole exome sequencing
 
Curation Introduction - Apollo Workshop
Curation Introduction - Apollo WorkshopCuration Introduction - Apollo Workshop
Curation Introduction - Apollo Workshop
 
Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra Orchard
 
Genome Curation using Apollo
Genome Curation using ApolloGenome Curation using Apollo
Genome Curation using Apollo
 
Chibucos annot go_final
Chibucos annot go_finalChibucos annot go_final
Chibucos annot go_final
 
Curate locally, think globally
Curate locally, think globallyCurate locally, think globally
Curate locally, think globally
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdf
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdf
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
EVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - IntroductionEVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - Introduction
 

More from Monica Munoz-Torres

Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Monica Munoz-Torres
 
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Monica Munoz-Torres
 
Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015
Monica Munoz-Torres
 

More from Monica Munoz-Torres (20)

Apollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionalityApollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionality
 
Editing Functionality - Apollo Workshop
Editing Functionality - Apollo WorkshopEditing Functionality - Apollo Workshop
Editing Functionality - Apollo Workshop
 
Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015
 
JBrowse & Apollo Overview - for AGR
JBrowse & Apollo Overview - for AGRJBrowse & Apollo Overview - for AGR
JBrowse & Apollo Overview - for AGR
 
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
 
Gene Ontology Consortium: Website & COmmunity
Gene Ontology Consortium: Website & COmmunityGene Ontology Consortium: Website & COmmunity
Gene Ontology Consortium: Website & COmmunity
 
Essential Requirements for Community Annotation Tools
Essential Requirements for Community Annotation ToolsEssential Requirements for Community Annotation Tools
Essential Requirements for Community Annotation Tools
 
Genome Curation using Apollo - Workshop at UTK
Genome Curation using Apollo - Workshop at UTKGenome Curation using Apollo - Workshop at UTK
Genome Curation using Apollo - Workshop at UTK
 
Introduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisIntroduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinis
 
Introduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research CommunityIntroduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research Community
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07
 
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcionalCONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
 
Apolo Taller en BIOS
Apolo Taller en BIOS Apolo Taller en BIOS
Apolo Taller en BIOS
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research Community
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination Network
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research community
 
PAINT Family PTHR13451-MUS81
PAINT Family PTHR13451-MUS81PAINT Family PTHR13451-MUS81
PAINT Family PTHR13451-MUS81
 
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
 
Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015
 
Apollo: developers call 2015-02-05
Apollo: developers call 2015-02-05Apollo: developers call 2015-02-05
Apollo: developers call 2015-02-05
 

Recently uploaded

development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 

Recently uploaded (20)

Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai YoungDubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 

An introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera

  • 1. An introduction to Web Apollo. A webinar for the i5K Pilot Species Projects - Hemiptera Monica Munoz-Torres, PhD Biocurator & Bioinformatics Analyst | @monimunozto Genomics Division, Lawrence Berkeley National Laboratory 12+1 May, 2014 UNIVERSITY OF CALIFORNIA
  • 2. Outline 1. What is Web Apollo?: • Definition & working concept. 2. Community based curation from our experience. Lessons Learned. 3. Manual Annotation at i5K: how do we get there? 4. Becoming acquainted with Web Apollo. An introduction to Web Apollo. A webinar for the i5K Pilot Species Projects - Hemiptera. Outline 2
  • 3. What is Web Apollo? • Web Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine the precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. 31. What is Web Apollo? Find more about Web Apollo at http://GenomeArchitect.org and Genome Biol 14:R93. (2013).
  • 4. Brief history of Apollo*: a. Desktop: one person at a time editing a specific region, annotations saved in local files; slowed down collaboration. b. Java Web Start: users saved annotations directly to a centralized database; potential issues with stale annotation data remained. 1. What is Web Apollo? 4 Biologists could finally visualize computational analyses and experimental evidence from genomic features and build manually-curated consensus gene structures. Apollo became a very popular, open source tool (insects, fish, mammals, birds, etc.). *
  • 5. Web Apollo • Browser-based; plugin for JBrowse. • Allows for intuitive annotation creation and editing, with gestures and pull-down menus to create transcripts, add/delete/resize exons, merge/split exons or transcripts, insert comments (CV, freeform text), etc. • Customizable rules and appearance. • Edits in one client are instantly pushed to all other clients: Collaborative! 1. What is Web Apollo? 5
  • 6. Working Concept In the context of gene manual annotation, curation tries to find the best examples and/or eliminate (most) errors. To conduct manual annotation efforts: Gather and evaluate all available evidence using quality-control metrics to corroborate or modify automated annotation predictions. Perform sequence similarity searches (phylogenetic framework) and use literature and public databases to: • Predict functional assignments from experimental data. • Distinguish orthologs from paralogs, and classify gene membership in families and networks. 2. In our experience. 6 Automated gene models Evidence: cDNAs, HMM domain searches, alignments with assemblies or genes from other species. Manual annotation & curation
  • 7. Dispersed, community-based gene manual annotation efforts. Using Web Apollo, we* have trained geographically dispersed scientific communities to perform biologically supported manual annotations, and monitored their findings: ~80 institutions, 14 countries, hundreds of scientists, and gate keepers. – Training workshops and geneborees. – Tutorials with detailed instructions. – Personalized user support. 2. In our experience. 7 *Collaboration with Elsik Lab, Hymenoptera Genome Database.
  • 8. What have we learned? Harvesting expertise from dispersed researchers who assigned functions to predicted and curated peptides, we have developed more interactive and responsive tools, as well as better visualization, editing, and analysis capabilities. 82. In our experience.
  • 9. It is helpful to work together. Scientific community efforts bring together domain- specific and natural history expertise that would have otherwise remain disconnected. 92. In our experience.
  • 10. Improved Automated Annotations* In many cases, automated annotations have been improved (e.g: Apis mellifera. Elsik et al. BMC Genomics 2014, 15:86). Also, learned of the challenges of newer sequencing technologies, e.g.: – Frameshifts and indel errors – Split genes across scaffolds – Highly repetitive sequences To face these challenges, we train annotators in recovering coding sequences in agreement with all available biological evidence. 102. In our experience.
  • 11. Understanding the evolution of sociality. Comparison of the genomes of 7 species of ants contributed to a better understanding of the evolution and organization of insect societies at the molecular level. Insights drawn mainly from six core aspects of ant biology: 1. Alternative morphological castes 2. Division of labor 3. Chemical Communication 4. Alternative social organization 5. Social immunity 6. Mutualism 11 … groups of communities have taught us a lot! Libbrecht et al. 2012. Genome Biology 2013, 14:212 2. In our experience.
  • 12. A little training goes a long way! With the right tools, wet lab scientists make exceptional curators who can easily learn to maximize the generation of accurate, biologically supported gene models. 122. In our experience.
  • 13. Manual annotation at i5K How do we get there? 3. How do we get there? 13 Assembly Manual annotation Experimental validation Automated Annotation In a genome sequencing project…
  • 14. Gene Prediction Gene Prediction: Identification of protein-coding genes, tRNAs, rRNAs, regulatory motifs, repetitive elements (masked), etc. Ab initio or homology-based. E.g: fgenesh, Augustus, geneid, SGP2 14 Nucleic Acids 2003 vol. 31 no. 13 3738-3741 3. How do we get there?
  • 15. Gene Annotation Gene Annotation: Integration of data from prediction tools to generate a consensus set of predictions (gene models). • Models may be organized by: - automatic integration of predicted sets; e.g: GLEAN - packaging necessary tools into pipeline; e.g: MAKER • Transcriptomes are used to further inform the annotation process. 153. How do we get there?
  • 16. The Collaborative Curation Process at i5K 1) A computationally predicted consensus gene set has been generated using multiple lines of evidence; e.g. CLEC_v0.5.3-Models. 2) i5K Projects will integrate consensus computational predictions with manual annotations to produce an updated Official Gene Set (OGS): » If it’s not on either track, it won’t make the OGS! » If it’s there and it shouldn’t, it will still make the OGS! 163. How do we get there?
  • 17. Consensus set: reference and start point • In some cases algorithms and metrics used to generate consensus sets may actually reduce the accuracy of the gene’s representation; e.g. use Augustus model instead to create a new annotation. • Isoforms: drag original and alternatively spliced form to ‘User- created Annotations’ area. • If an annotation needs to be removed from the consensus set, drag it to the ‘User-created Annotations’ area and label as ‘Delete’ on Information Editor. • Overlapping interests? Collaborate to reach agreement. • Follow guidelines for i5K Pilot Species Projects as shown at http://goo.gl/LRu1VY 173. How do we get there?
  • 18. Navigation tools: pan and zoom Search box: go to a scaffold or a gene model. Grey bar of coordinates indicates location. You can also select here in order to zoom to a sub-region. ‘View’: change color by CDS, toggle strands, set highlight. ‘File’: Upload your own evidence: GFF3, BAM, BigWig, VCF*. Add combination and sequence search tracks. ‘Tools’: Use BLAT to query the genome with a protein or DNA sequence. Available Tracks Evidence Tracks Area ‘User-created Annotations’ Track Login Web Apollo Graphical User Interface (GUI) for editing annotations 4. Becoming Acquainted with Web Apollo.
  • 19. Flags non- canonical splice sites. Selection of features and sub-features Edge-matching Evidence Tracks Area ‘User-created Annotations’ Track  The editing logic (server):  selects longest ORF as CDS  flags non-canonical splice sites Web Apollo 4. Becoming Acquainted with Web Apollo.
  • 20. DNA Track ‘User-created Annotations’ Track  Two new kinds of tracks:  annotation editing  sequence alteration editing Web Apollo 4. Becoming Acquainted with Web Apollo.
  • 21. Web Apollo  Annotations, annotation edits, and History: stored in a centralized database. 4. Becoming Acquainted with Web Apollo.
  • 22. Web Apollo  Annotation Information Editor 4. Becoming Acquainted with Web Apollo.
  • 23. Web Apollo  Annotation Information Editor 4. Becoming Acquainted with Web Apollo.
  • 24. [Some of the] Functionality:  Protein-coding gene annotation (that you know and love)  Sequence alterations (less coverage = more fragmentation)  Visualization of stage and cell-type specific transcription data as coverage plots, heat maps, and alignments 4. Becoming Acquainted with Web Apollo.
  • 25. Example: ORCO Live Demonstration using the Cimex lectularius genome Footer 25
  • 26. Arthropodcentric Thanks! AgriPest Base FlyBase Hymenoptera Genome Database VectorBase Apis mellifera Tribolium castaneum Pogonomyrmex barbatus Manduca sexta Bombus terrestris Helicoverpa armigera Nasonia vitripennis Acyrthosiphon pisum Mayetiola destructor Atta cephalotes Linepithema humile Camponotus floridanus Solenopsis invicta Acromyrmex echinatior
  • 27. Thanks! • Berkeley Bioinformatics Open-source Projects (BBOP), Berkeley Lab: Web Apollo and Gene Ontology teams. Suzanna E. Lewis (PI). • Elsik Lab. § University of Missouri. Christine G. Elsik (PI). • Ian Holmes (PI). * University of California Berkeley. • Arthropod genomics community, i5K http://www.arthropodgenomes.org/wiki/i5K Steering Committee, USDA/NAL, HGSC-BCM, BGI, and 1KITE http://www.1kite.org/. • Web Apollo is supported by NIH grants 5R01GM080203 from NIGMS, and 5R01HG004483 from NHGRI, and by the Director, Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. • Insect images used with permission: http://AlexanderWild.com • For your attention, thank you! Thank you. 27 Web Apollo Ed Lee Gregg Helt Colin Diesh § Deepak Unni § Rob Buels * Gene Ontology Chris Mungall Seth Carbon Heiko Dietze BBOP Web Apollo: http://GenomeArchitect.org GO: http://GeneOntology.org i5K: http://arthropodgenomes.org/wiki/i5K

Editor's Notes

  1. Thank you!