Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
An introduction on gene annotation & curation for the IAGC and BIPAA research communities.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
A Workshop at the Stowers Institute for Medical Research.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
An introduction to use and functionality for the IAGC and BIPAA research communities.
This presentation contains details about the Apollo genome annotation editor functionality. It also includes a step-by-step example about curating a gene of interest.
This presentation explains the meaning of curation and includes an introduction to the Apollo genome annotation editing tool and its curation environment.
An introduction to Web Apollo for the Biomphalaria glabatra research community.Monica Munoz-Torres
Web Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Web Apollo. It is addressed to the members of the Biomphalaria glabatra research community.
This presentation is a thorough guide to the use of Web Apollo, with details on User Navigation, Functionality, and the thought process behind manual annotation.
During this workshop, participants:
- Learn to identify homologs of known genes of interest in your newly sequenced genome.
- Become familiar with the environment and functionality of the Web Apollo genome annotation editing tool.
- Learn how to corroborate or modify automatically annotated gene models using all available evidence in Web Apollo.
- Understand the process of curation in the context of genome annotation.
Genome resources at EMBL-EBI: Ensembl and Ensembl GenomesEBI
Event: Plant and Animal Genomes Conference
Speaker: Bert Overduin
The Ensembl project (http://www.ensembl.org) seeks to enable genomic science by providing high quality, integrated annotation on chordate and selected eukaryotic genomes. All supported species include comprehensive, evidence-based gene annotations and a selected set of genomes includes additional data focused on variation, comparative, evolutionary, functional and regulatory annotation. As of Ensembl release 65 (December 2011), 56 species are fully supported. Ensembl data are accessible through an interactive web site, flat files, the data mining tool BioMart, direct database querying and a set of Perl APIs. Moreover, Ensembl is not just a data visualisation tool, but a suite of programs for data production (e.g. gene calling and comparative genomics) that can be deployed individually according to the needs of an individual community. Ensembl Genomes (http://www.ensemblgenomes.org) consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the genomes available in Ensembl. It currently contains data for over 300 species. Many of the databases that support Ensembl Genomes have been built by, or in close collaboration with, groups that maintain specialist data resources for individual species, and we are actively seeking to extend the range of these collaborations. Together Ensembl and Ensembl Genomes offer a single unified interface across the taxonomic space. This presentation will consist of a short introduction to Ensembl and Ensembl Genomes followed by a demonstration of the respective websites and the BioMart data retrieval tool. Special attention will be given to recently developed functionality like the Variant Effect Predictor, which predicts the consequences of substitutions, insertions and deletions on transcripts and protein sequences, and the possibility to visualize your own data by attaching BAM and VCF files (for example).
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
An introduction on gene annotation & curation for the IAGC and BIPAA research communities.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
A Workshop at the Stowers Institute for Medical Research.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
An introduction to use and functionality for the IAGC and BIPAA research communities.
This presentation contains details about the Apollo genome annotation editor functionality. It also includes a step-by-step example about curating a gene of interest.
This presentation explains the meaning of curation and includes an introduction to the Apollo genome annotation editing tool and its curation environment.
An introduction to Web Apollo for the Biomphalaria glabatra research community.Monica Munoz-Torres
Web Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Web Apollo. It is addressed to the members of the Biomphalaria glabatra research community.
This presentation is a thorough guide to the use of Web Apollo, with details on User Navigation, Functionality, and the thought process behind manual annotation.
During this workshop, participants:
- Learn to identify homologs of known genes of interest in your newly sequenced genome.
- Become familiar with the environment and functionality of the Web Apollo genome annotation editing tool.
- Learn how to corroborate or modify automatically annotated gene models using all available evidence in Web Apollo.
- Understand the process of curation in the context of genome annotation.
Genome resources at EMBL-EBI: Ensembl and Ensembl GenomesEBI
Event: Plant and Animal Genomes Conference
Speaker: Bert Overduin
The Ensembl project (http://www.ensembl.org) seeks to enable genomic science by providing high quality, integrated annotation on chordate and selected eukaryotic genomes. All supported species include comprehensive, evidence-based gene annotations and a selected set of genomes includes additional data focused on variation, comparative, evolutionary, functional and regulatory annotation. As of Ensembl release 65 (December 2011), 56 species are fully supported. Ensembl data are accessible through an interactive web site, flat files, the data mining tool BioMart, direct database querying and a set of Perl APIs. Moreover, Ensembl is not just a data visualisation tool, but a suite of programs for data production (e.g. gene calling and comparative genomics) that can be deployed individually according to the needs of an individual community. Ensembl Genomes (http://www.ensemblgenomes.org) consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the genomes available in Ensembl. It currently contains data for over 300 species. Many of the databases that support Ensembl Genomes have been built by, or in close collaboration with, groups that maintain specialist data resources for individual species, and we are actively seeking to extend the range of these collaborations. Together Ensembl and Ensembl Genomes offer a single unified interface across the taxonomic space. This presentation will consist of a short introduction to Ensembl and Ensembl Genomes followed by a demonstration of the respective websites and the BioMart data retrieval tool. Special attention will be given to recently developed functionality like the Variant Effect Predictor, which predicts the consequences of substitutions, insertions and deletions on transcripts and protein sequences, and the possibility to visualize your own data by attaching BAM and VCF files (for example).
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
Ontologies for life sciences: examples from the Gene Ontology
The document discusses ontologies for life sciences, using the Gene Ontology (GO) as an example. It provides an overview of GO, describing it as a way to capture biological knowledge for gene products in a written and computable form using a set of concepts and relationships arranged hierarchically. GO allows consistent descriptions of genes/gene products across databases. Model organism databases provide annotations connecting genes to GO terms. The GO is a collaborative effort to address the need for consistent descriptions of genes.
This document provides information about a training workshop on using Ensembl. It includes an agenda for the day-long workshop covering topics like introduction to Ensembl, browsing genes and data, using BioMart, and exploring genetic variation data. The workshop materials are available under a CC BY license and the document encourages attendees to cite any Ensembl papers if using the resource for their own work. Break times and locations are also listed on the agenda.
GRC Workshop held at Churchill College on Sep 21, 2014. Talk by Bronwen Aken discussing the Ensembl approach to annotating the complete human reference assembly.
Course: Bioinformatics for Biomedical Research (2014).
Session: 1.3- Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensembl, Biomart and IGV.
Statistics and Bioinformatisc Unit (UEB) & High Technology Unit (UAT) from Vall d'Hebron Research Institute (www.vhir.org), Barcelona.
This document provides an introduction and overview of a manual annotation workshop using the Web Apollo genome annotation tool. It discusses manual annotation and community-based curation efforts. The workshop aims to teach participants how to identify genes of interest, become familiar with Web Apollo, learn how to corroborate and modify gene models using evidence, and understand the genome annotation process from assembly to manual curation. The document outlines the workshop activities and provides guidance on using Web Apollo, including navigating the interface, editing annotations, and annotating simple cases by adding or modifying exons.
Ensembl is a genome browser that annotates genes and predicts regulatory functions for vertebrate genomes. It uses raw DNA sequence data to create a tracking database and automatically finds genes and other features. Ensembl incorporates data from other sources and provides web-based access to genomic information through views of genes, transcripts, proteins, DNA homology, and more. It aims to make genome annotation freely accessible to support research.
This document discusses biological networks and how to analyze genome-scale data using networks. It defines different types of biological networks including DNA-protein, RNA-RNA, RNA-protein, and protein-protein networks. It also describes popular network visualization and analysis tools like Cytoscape and different databases for retrieving protein-protein and pathway interaction networks. The document emphasizes that networks can help validate findings, explore and discover new insights from large genomic and omics datasets.
The document discusses two programs - BLASTing AmiGOs and "33" - that were designed to automatically generate Gene Ontology (GO) terms from gene/protein sequences. BLASTing AmiGOs takes FASTA sequences as input and outputs the associated GO terms without manual input. "33" queries a GO database using gene products from another group to retrieve GO terms and evidence codes. Manually collecting the same GO term data for 32 genes took 4-5 hours, while the programs could generate the terms automatically. The document compares the manual and automated methods and discusses using computational tools to help biologists more efficiently organize and access expanding genomic data.
This presentation is a thorough guide to the use of Web Apollo, with details on User Navigation, Functionality, and the thought process behind manual annotation.
During this workshop, participants:
- Learn to identify homologs of known genes of interest in a newly sequenced genome.
- Become familiar with the environment and functionality of the Web Apollo genome annotation editing tool.
- Learn how to corroborate or modify automatically annotated gene models using available evidence in Web Apollo.
- Understand the process of curation in the context of genome annotation.
This document outlines a presentation on protein-protein interaction networks, including predicting such networks, available interaction data sources, visualization and analysis tools. Methods for predicting networks include analyzing genomic sequences, 'omics' data, and literature. Popular tools for visualizing and analyzing networks include Cytoscape, VisANT, and tools for detecting network motifs and similarities. The presentation will demonstrate predicting a network from microarray data using ARACNE and visualizing it in Cytoscape.
The Gene Ontology (GO) provides a controlled vocabulary for describing gene and gene product attributes across species. It consists of three ontologies covering biological processes, molecular functions, and cellular components. GO terms are organized into a directed acyclic graph structure and can have relationships like "is_a" and "part_of". Genes are annotated with GO terms to capture functional information, which is shared across species to facilitate research. While useful, the GO has some limitations like unclear reasoning principles and lack of validation procedures.
Ensembl Genomes is a portal that provides integrated access to genome-scale data for non-vertebrate species. It was developed using the Ensembl genome annotation and visualization platform. It consists of sub-portals for bacteria, protists, fungi, plants and invertebrate metazoa that complement the vertebrate genomes available in Ensembl. It provides graphical views of genomes and genes using the Ensembl genome browser software. Data is retrieved through interfaces like BioMart and APIs that access relational databases describing genomic features, sequences and identifiers. Tools like the Assembly Converter and Variant Effect Predictor are also available.
This dissertation developed algorithms and software tools to analyze the biological role of low complexity regions (LCRs) in proteins. It evaluated and improved methods for identifying homologs containing LCRs. It also created LCR-eXXXplorer, a web resource with unique tools for exploring annotated LCRs among millions of proteins. Using these tools, the dissertation predicted pathogenicity of E. coli strains based on genomic composition, showing prediction is possible with limited data like from metagenomic samples. The results open new areas for research on sequence search validation and large-scale experiments.
This document provides an outline for a presentation on biological networks, including introducing biological networks, describing their basic components and types, methods for predicting and building networks, sources of interaction data, tools for network visualization and analysis, and a demonstration of building, visualizing and analyzing biological networks using Cytoscape. The presentation covers topics like nodes and edges in networks, features used to analyze networks, methods for predicting networks from sequences and omics data, integrated databases for interaction data, and popular tools for searching, visualizing and performing network analysis.
Gene Expression - Microarrays discusses analyzing gene expression data from microarray experiments. It describes the basic workflow including experimental design, sample preparation, hybridization, image analysis, preprocessing, normalization, and statistical analysis. Key points are that microarrays allow measuring expression of thousands of genes simultaneously, and proper experimental design and data analysis are important to draw meaningful biological conclusions from microarray data.
This is a brief update about the genome browser JBrowse and the genome annotation editor Apollo, addressed to the members of the Alliance of Genome Resources (AGR).
Learn more about JBrowse at jbrowse.org
Learn more about Apollo at GenomeArchitect.org
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
This document provides an overview of a webinar introducing the Apollo genome annotation tool. The webinar aims to help the koala genome research community better understand genome curation processes involving automated annotation and manual curation using Apollo. It outlines the webinar topics which will explain gene prediction, the Apollo interface for collaborative curation, and demonstrations of identifying gene homologs and modifying automated annotations. The webinar aims to familiarize participants with genome curation concepts and the Apollo tool.
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
Ontologies for life sciences: examples from the Gene Ontology
The document discusses ontologies for life sciences, using the Gene Ontology (GO) as an example. It provides an overview of GO, describing it as a way to capture biological knowledge for gene products in a written and computable form using a set of concepts and relationships arranged hierarchically. GO allows consistent descriptions of genes/gene products across databases. Model organism databases provide annotations connecting genes to GO terms. The GO is a collaborative effort to address the need for consistent descriptions of genes.
This document provides information about a training workshop on using Ensembl. It includes an agenda for the day-long workshop covering topics like introduction to Ensembl, browsing genes and data, using BioMart, and exploring genetic variation data. The workshop materials are available under a CC BY license and the document encourages attendees to cite any Ensembl papers if using the resource for their own work. Break times and locations are also listed on the agenda.
GRC Workshop held at Churchill College on Sep 21, 2014. Talk by Bronwen Aken discussing the Ensembl approach to annotating the complete human reference assembly.
Course: Bioinformatics for Biomedical Research (2014).
Session: 1.3- Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensembl, Biomart and IGV.
Statistics and Bioinformatisc Unit (UEB) & High Technology Unit (UAT) from Vall d'Hebron Research Institute (www.vhir.org), Barcelona.
This document provides an introduction and overview of a manual annotation workshop using the Web Apollo genome annotation tool. It discusses manual annotation and community-based curation efforts. The workshop aims to teach participants how to identify genes of interest, become familiar with Web Apollo, learn how to corroborate and modify gene models using evidence, and understand the genome annotation process from assembly to manual curation. The document outlines the workshop activities and provides guidance on using Web Apollo, including navigating the interface, editing annotations, and annotating simple cases by adding or modifying exons.
Ensembl is a genome browser that annotates genes and predicts regulatory functions for vertebrate genomes. It uses raw DNA sequence data to create a tracking database and automatically finds genes and other features. Ensembl incorporates data from other sources and provides web-based access to genomic information through views of genes, transcripts, proteins, DNA homology, and more. It aims to make genome annotation freely accessible to support research.
This document discusses biological networks and how to analyze genome-scale data using networks. It defines different types of biological networks including DNA-protein, RNA-RNA, RNA-protein, and protein-protein networks. It also describes popular network visualization and analysis tools like Cytoscape and different databases for retrieving protein-protein and pathway interaction networks. The document emphasizes that networks can help validate findings, explore and discover new insights from large genomic and omics datasets.
The document discusses two programs - BLASTing AmiGOs and "33" - that were designed to automatically generate Gene Ontology (GO) terms from gene/protein sequences. BLASTing AmiGOs takes FASTA sequences as input and outputs the associated GO terms without manual input. "33" queries a GO database using gene products from another group to retrieve GO terms and evidence codes. Manually collecting the same GO term data for 32 genes took 4-5 hours, while the programs could generate the terms automatically. The document compares the manual and automated methods and discusses using computational tools to help biologists more efficiently organize and access expanding genomic data.
This presentation is a thorough guide to the use of Web Apollo, with details on User Navigation, Functionality, and the thought process behind manual annotation.
During this workshop, participants:
- Learn to identify homologs of known genes of interest in a newly sequenced genome.
- Become familiar with the environment and functionality of the Web Apollo genome annotation editing tool.
- Learn how to corroborate or modify automatically annotated gene models using available evidence in Web Apollo.
- Understand the process of curation in the context of genome annotation.
This document outlines a presentation on protein-protein interaction networks, including predicting such networks, available interaction data sources, visualization and analysis tools. Methods for predicting networks include analyzing genomic sequences, 'omics' data, and literature. Popular tools for visualizing and analyzing networks include Cytoscape, VisANT, and tools for detecting network motifs and similarities. The presentation will demonstrate predicting a network from microarray data using ARACNE and visualizing it in Cytoscape.
The Gene Ontology (GO) provides a controlled vocabulary for describing gene and gene product attributes across species. It consists of three ontologies covering biological processes, molecular functions, and cellular components. GO terms are organized into a directed acyclic graph structure and can have relationships like "is_a" and "part_of". Genes are annotated with GO terms to capture functional information, which is shared across species to facilitate research. While useful, the GO has some limitations like unclear reasoning principles and lack of validation procedures.
Ensembl Genomes is a portal that provides integrated access to genome-scale data for non-vertebrate species. It was developed using the Ensembl genome annotation and visualization platform. It consists of sub-portals for bacteria, protists, fungi, plants and invertebrate metazoa that complement the vertebrate genomes available in Ensembl. It provides graphical views of genomes and genes using the Ensembl genome browser software. Data is retrieved through interfaces like BioMart and APIs that access relational databases describing genomic features, sequences and identifiers. Tools like the Assembly Converter and Variant Effect Predictor are also available.
This dissertation developed algorithms and software tools to analyze the biological role of low complexity regions (LCRs) in proteins. It evaluated and improved methods for identifying homologs containing LCRs. It also created LCR-eXXXplorer, a web resource with unique tools for exploring annotated LCRs among millions of proteins. Using these tools, the dissertation predicted pathogenicity of E. coli strains based on genomic composition, showing prediction is possible with limited data like from metagenomic samples. The results open new areas for research on sequence search validation and large-scale experiments.
This document provides an outline for a presentation on biological networks, including introducing biological networks, describing their basic components and types, methods for predicting and building networks, sources of interaction data, tools for network visualization and analysis, and a demonstration of building, visualizing and analyzing biological networks using Cytoscape. The presentation covers topics like nodes and edges in networks, features used to analyze networks, methods for predicting networks from sequences and omics data, integrated databases for interaction data, and popular tools for searching, visualizing and performing network analysis.
Gene Expression - Microarrays discusses analyzing gene expression data from microarray experiments. It describes the basic workflow including experimental design, sample preparation, hybridization, image analysis, preprocessing, normalization, and statistical analysis. Key points are that microarrays allow measuring expression of thousands of genes simultaneously, and proper experimental design and data analysis are important to draw meaningful biological conclusions from microarray data.
This is a brief update about the genome browser JBrowse and the genome annotation editor Apollo, addressed to the members of the Alliance of Genome Resources (AGR).
Learn more about JBrowse at jbrowse.org
Learn more about Apollo at GenomeArchitect.org
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
This document provides an overview of a webinar introducing the Apollo genome annotation tool. The webinar aims to help the koala genome research community better understand genome curation processes involving automated annotation and manual curation using Apollo. It outlines the webinar topics which will explain gene prediction, the Apollo interface for collaborative curation, and demonstrations of identifying gene homologs and modifying automated annotations. The webinar aims to familiarize participants with genome curation concepts and the Apollo tool.
Talk at the 8th International Biocuration Conference. Beijing, China. April 23-26, 2015.
Obtaining meaningful results from genome analyses requires high quality annotations of all genomic elements. Today’s sequencing projects face challenges such as lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing curators to improve on existing automated gene models through an intuitive interface. Apollo’s extensible architecture is built on top of JBrowse; its components are a web-based client, an annotation-editing engine, and a server-side data service. It allows users to visualize automated gene models, protein alignments, expression and variant data, and conduct structural and/or functional annotations.
Apollo is actively used within a variety of projects, including the initiative to sequence the genomes of 5,000 Arthropod species (i5K), and will become essential to the thousands of genomes now being sequenced and analyzed. Researchers from nearly 100 institutions worldwide are currently using Apollo on distributed curation efforts for over sixty genome projects across the tree of life; from plants to echinoderms, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog. We are training the next generation of researchers by reaching out to educators to make these tools available as part of curricula, offering workshops and webinars to the scientific community, and through widely applied systems such as iPlant and DNA Subway. We are currently integrating Apollo into an annotation environment combining gene structural and functional annotation, transcriptomic, proteomic, and phenotypic annotation. In this presentation we will describe in detail its utility to users, introduce the architecture to developers interested in expanding on this open-source project, and offer details of our future plans.
Authors:
Monica Munoz-Torres(1), Nathan Dunn(1), Colin Diesh(2), Deepak Unni(2), Seth Carbon(1), Heiko Dietze(1), Christopher Mungall(1), Nicole Washington(1), Ian Holmes(3), Christine Elsik(2), and Suzanna E. Lewis(1)
1Lawrence Berkeley National Laboratory, Genomics Division, Berkeley, CA
2Divisions of Animal and Plant Sciences, University of Missouri, Columbia, MO
3University of California Berkeley, Bioengineering, Berkeley, CA
Continuing with the theme of DNA repair via homologous recombination, I will discuss the following family during the PAINT call:
PTHR13451 CLASS II CROSSOVER JUNCTION ENDONUCLEASE MUS81
This is an introduction to conducting manual annotation efforts using Apollo. This webinar was offered to members of the i5K Research community on 2015-10-07.
This document provides an introduction and overview of manual genome annotation using the Apollo genome annotation tool. It begins with an outline of the webinar topics, which include an introduction to manual annotation and its necessity, an overview of the Apollo tool and its functionality for collaborative curation, and examples and demonstrations. The document then covers key concepts for manual annotation such as the definition of a gene, genome curation steps, transcription and translation including reading frames, splice sites, and phase. The goal of the webinar is to help participants better understand genome curation and manual annotation using Apollo to identify and modify gene models.
Apollo: A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Apollo. It is addressed to the members of the Manakin Genomics research community.
Introduction to Apollo - i5k Research Community – Calanoida (copepod)Monica Munoz-Torres
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5k, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project working on species of the order Calanoida (copepod).
This document provides an overview of a talk on genome curation and manual annotation using the Apollo genome annotation tool. The talk aims to help scientists understand the genome curation process from assembled genome to automated and manual annotation. It will introduce Apollo and teach how to identify homologs of known genes, corroborate and modify automated gene models using evidence in Apollo. The talk will also refresh attendees on key biological concepts like the definition of a gene, central dogma, transcription, and translation to better understand manual annotation.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project working on species of the order Hemiptera.
Investigations on Synthesis, Purification and Characterization of Indium Anti...AM Publications
The Indium antimonide (InSb) is one of the promising optoelectronic materials having potential applications in the development of data storage, frequency, parametric oscillations, detectors and related gadgets. Purity of material plays a vital role in the development of quality devices for space and defence related high-end applications. Directional Solidification System (DSS) plays a major role in reducing the vibrations during the synthesis and crystallization process to yield high and pure InSb compound. This paper discusses homogenization, synthesis, purification and characterization of Indium antimonide. Directional Solidification System (DSS) is employed for the preparation of pure InSb crystalline material. This system is fabricated with a view to establish required and suitable temperature isotherms at top and bottom of the furnace and also to tackle the irregularities of sample during the process of cooling the aspects. The crystallization of homogenized pure sample is kept under vacuum in the furnace. The impurities of the InSb are segregated at the bottom end of the sample. The purity analysis of the InSb sample is studied and presented by employing XRD, TEM, SEM, EDS, ICP-OES, Raman and FTIR techniques.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcionalMonica Munoz-Torres
Este documento describe herramientas para la anotación funcional de genes utilizando la ontología de genes (GO). Explica cómo Apollo y la GO permiten editar y asignar términos GO de forma precisa y profunda para mejorar la anotación estructural y funcional de genomas. También describe dónde se encuentran los archivos de anotación GO y cómo la estructura de la GO como un grafo acíclico dirigido permite análisis complejos de conjuntos de datos.
Diapositivas tuberculosis y salud publica CesarArgus96
El documento describe factores de protección, detección y seguimiento de la tuberculosis. La vacunación BCG y la búsqueda temprana y tratamiento de fuentes de infección ayudan a prevenir la transmisión. Se debe realizar baciloscopia seriada a cualquier persona con tos por más de 15 días para detectar casos. El seguimiento incluye controles médicos y de enfermería periódicos así como controles bacteriológicos. También se describen consideraciones para casos particulares como embarazo, diabetes e infección por VIH.
This document provides a quick reference guide for various commands and functions across different computer reservation systems, including Amadeus, Galileo, Abacus/Sabre, and Worldspan. It outlines commands for security, information/help, encoding/decoding, scrolling, printing, availability, selling/changing/canceling flights, timetables, and adding/modifying passenger name record fields like names, phones, addresses, payments and more. The summary provides high-level overviews of the key areas covered in the document in a concise 3 sentences.
Expert System to Determine the Priority Scale of Case in Laboratory of Forens...AM Publications
1. The document describes an expert system developed to determine the priority scale of cases in the Forensic Laboratory using forward and backward chaining rule-based methods. The system uses criteria like the age of suspects, victim occupations, and crime scene details to assign cases a priority level of 1 to 6.
2. The expert system was tested on 6 cases and achieved 100% accuracy in assigning the correct priority levels when compared to expert assessments. The system provides a simple way to monitor case completion progress and meet deadlines based on assigned priority levels.
Este documento describe varias herramientas y características del sistema operativo Windows, incluyendo el escritorio, la barra de tareas, la carpeta "Mis documentos", la carpeta "Mi PC", la carpeta "Mis sitios de red", la Papelera de reciclaje, el Explorador de Windows, herramientas de sistema como el comprobador de errores y la desfragmentación de discos, y el software de gestión de proyectos Microsoft Project.
Doppler medios diagnostico y cuidados de enfermeria CesarArgus96
El Doppler es una variante de la ecografía que utiliza ultrasonido para visualizar las ondas de velocidad del flujo sanguíneo a través de vasos que de otro modo serían invisibles. Se usa para encontrar coágulos de sangre, obstrucciones vasculares, evaluar la viabilidad de injertos venosos, y examinar problemas de flujo sanguíneo. Existen tres tipos de Doppler - color, pulsado y continuo - que difieren en cómo miden la dirección y profundidad del flujo.
Precise elucidation of the many different biological features encoded in any genome requires careful examination and review by researchers, who gather and evaluate the available evidence to corroborate and modify gene predictions and other biological elements. This curation process allows them to resolve discrepancies and validate automated gene model hypotheses and alignments. This approach is the well-established practice for well-known genomes such as human, mouse, zebrafish, Drosophila, et cetera. Desktop Apollo was originally developed to meet these needs.
The cost of sequencing a genome has been dramatically reduced by several orders of magnitude in the last decade, and the natural consequence is that more and more researchers are sequencing more and more new genomes, both within populations and across species. Because individual researchers can now readily sequence many genomes of interest, the need for a universally accessible genomic curation tool logically follows. Each new exome or genome sequenced requires visualization and curation to obtain biologically accurate genomic features sets, even for limited set of genes, because computational genome analysis remains an imperfect art. Additionally, unlike earlier genome projects, which had the advantage of more highly polished genomes, recent projects usually have lower coverage. Therefore researchers now face additional work correcting for more frequent assembly errors and annotating genes split across multiple contigs.
Genome annotation is an inherently collaborative task; researchers only very rarely work in isolation, turning to colleagues for second opinions and insights from those with with expertise in particular domains and gene families. The new JavaScript based Apollo, allows researchers real-time interactivity, breaking down large amounts of data into manageable portions to mobilize groups of researchers with shared interests. We are also focused on training the next generation of researchers by reaching out to educators to make these tools available as part of curricula via workshops and webinars, and through widely applied systems such as iPlant and DNA Subway. Here we offer details of our progress.
Presentation at Genome Informatics, Session (3) on Databases, Data Mining, Visualization, Ontologies and Curation.
Authors: Monica C Munoz-Torres, Suzanna E. Lewis, Ian Holmes, Colin Diesh, Deepak Unni, Christine Elsik.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Web Apollo Tutorial for the i5K copepod research community.Monica Munoz-Torres
Introduction to Web Apollo for the i5K i5K copepod research community. WebApollo is genome annotation editor; it provides a web-based environment that allows multiple distributed users to review, edit, and share manual annotations. This presentation includes information specific to the projects of the Global Initiative to sequence the genomes of 5,000 species of arthropods, i5K. Let's get started!
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project on Eurytemora affinis
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesMonica Munoz-Torres
Precise elucidation of the many different biological features encoded in a genome requires a careful curation process that involves reviewing all available evidence to allow researchers to resolve discrepancies and validate automated gene models, protein alignments, and other biological elements. Genome annotation is an inherently collaborative task; researchers only rarely work in isolation, turning to colleagues for second opinions and insights from those with expertise in particular domains and gene families.
The i5k initiative seeks to sequence the genomes of 5,000 insect and related arthropod species. The selected species are known to be important to worldwide agriculture, food safety, medicine, and energy production as well as many used as models in biology, those most abundant in world ecosystems, and representatives in every branch of the insect phylogeny in an effort to better understand arthropod evolution and phylogeny. Because computational genome analysis remains an imperfect art, each of these new genomes sequenced will require visualization and curation.
Apollo is an instantaneous, collaborative, genome annotation editor, and the new JavaScript based version allows researchers real-time interactivity, breaking down large amounts of data into manageable portions to mobilize groups of researchers with shared interests. The i5K is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process and Apollo is serving as the platform to empower this community. Here we offer details about this collaboration.
Introduction to Apollo: A webinar for the i5K Research CommunityMonica Munoz-Torres
This document provides an introduction and outline for a webinar on using the Apollo genome annotation editing tool. It was presented by Monica Munoz-Torres of BBOP to the i5K Research Community. The webinar aimed to help participants better understand genome curation in the context of automated and manual annotation. It also aimed to familiarize participants with Apollo's functionality and how to identify homologs of known genes, corroborate gene models using evidence, and modify automated annotations in Apollo. The document includes sections on genome sequencing projects, the objectives and uses of genome annotation, and a biological refresher on concepts relevant to manual annotation like genes, transcription, translation, and genome curation steps.
An introduction to Web Apollo for i5K Pilot Species Projects - HemipteraMonica Munoz-Torres
Introduction to Web Apollo for the i5K Pilot species project. WebApollo is genome annotation editor; it provides a web-based environment that allows multiple distributed users to review, edit, and share manual annotations. This presentation includes information specific to the projects of the Global Initiative to sequence the genomes of 5,000 species of arthropods, i5K. Let's get started!
Introduction to Web Apollo for the i5K Pilot species project. WebApollo is genome annotation editor; it provides a web-based environment that allows multiple distributed users to review, edit, and share manual annotations. Let's get started!
Web Apollo: Lessons learned from community-based biocuration efforts.Monica Munoz-Torres
This presentation tries to highlight the importance and relevance of community-based curation of biological data. It describes the results of harvesting expertise from dispersed researchers assigning functions to predicted and curated peptides, as well as collaborative efforts for standardization of genes and gene product attributes across species and databases.
This document provides an overview of bioinformatics and genomics. It begins with an acknowledgement and abstract section. The introduction defines bioinformatics and its role in analyzing genetic sequences and biological data through computational methods. Major research areas of bioinformatics discussed include sequence analysis, genome annotation, evolutionary biology, measuring biodiversity, gene expression analysis, protein analysis, cancer mutation analysis, and protein structure prediction. Comparative genomics and modeling biological systems are also summarized. The document concludes with a definition of genomics as the study of genomes through sequencing efforts and mapping genetic interactions.
This document provides an overview of a webinar introducing the Web Apollo genome annotation tool. The webinar aims to help researchers in the Ceratitis capitata research community learn to identify homologous genes, become familiar with the Web Apollo interface and annotation process, and access resources for the Ceratitis capitata genome. The webinar covers what Web Apollo is, the manual annotation process, and a demonstration of Web Apollo's functionality.
This document discusses challenges and opportunities in applying mRNA sequencing (mRNAseq) to non-model organisms. It describes using digital normalization to cope with large amounts of lamprey mRNAseq data that would otherwise be too computationally intensive to assemble. Digital normalization was applied successfully to Molgula ascidian mRNAseq data, enabling transcriptome analysis. The lamprey transcriptome was assembled from over 5 billion reads from 50 tissues, producing over 600,000 transcripts. Next steps include addressing contamination issues and using the more complete transcriptome to enable various evolutionary and biological studies of lamprey. The document advocates making protocols and data openly available to help characterize genes in non-model organisms.
Three's a crowd-source: Observations on Collaborative Genome AnnotationMonica Munoz-Torres
It is impossible for a single individual to fully curate a genome with precise biological fidelity. Beyond the problem of scale, curators need second opinions and insights from colleagues with domain and gene family expertise, but the communications constraints imposed in earlier applications made this inherently collaborative task difficult. Apollo, a client-side, JavaScript application allowing extensive changes to be rapidly made without server round-trips, placed us in a position to assess the difference this real-time interactivity would make to researchers’ productivity and the quality of downstream scientific analysis. To evaluate this, we trained and supported geographically dispersed scientific communities (hundreds of scientists and agreed-upon gatekeepers, in ~100 institutions around the world) to perform biologically supported manual annotations, and monitored their findings. We observed that: 1) Previously disconnected researchers were more productive when obtaining immediate feedback in dialogs with collaborators. 2) Unlike earlier genome projects, which had the advantage of more highly polished genomes, recent projects usually have lower coverage. Therefore curators now face additional work correcting for more frequent assembly errors and annotating genes that are split across multiple contigs. 3) Automated annotations were improved as exemplified by discoveries made based on revised annotations, for example ~2800 manually annotated genes from three species of ants granted further insight into the evolution of sociality in this group, and ~3600 manual annotations contributed to a better understanding of immune function, reproduction, lactation and metabolism in cattle. 4) There is a notable trend shifting from whole-genome annotation to annotation of specific gene families or other gene groups linked by ecological and evolutionary significance. 5) The distributed nature of these efforts still demand strong, goal-oriented (i.e. publication of findings) leadership and coordination, as these are crucial to the success of each project. Here we detail these and other observations on collaborative genome annotation efforts.
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Nathan Dunn
This document describes improvements to Apollo, an open-source genome annotation editor, that increase the efficiency of genome annotation refinement. Key improvements include automated processing of genomic evidence, ability to associate and export Gene Ontology annotations, variant effect prediction, and user interface enhancements. Apollo can now be launched via Docker or preconfigured Amazon cloud instances, simplifying installation. It provides web services for integration with other web tools.
This document discusses challenges and opportunities in applying mRNA sequencing (mRNAseq) to non-model organisms. It describes using digital normalization as a way to cope with having a massive amount of lamprey mRNAseq data but an incomplete genomic reference. Digital normalization enables assembly and analysis that would otherwise not be possible. The document also discusses applying digital normalization to ascidian mRNAseq data, where it results in substantial time savings and comparable transcriptomes to those assembled without normalization. Finally, it discusses next steps for lamprey transcriptome analysis including enabling various evolutionary and biological studies.
This document discusses challenges in analyzing transcriptome data from non-model organisms. It begins by outlining the problems with lamprey, a distant vertebrate, including its large and complex genome with significant genomic variation. It then introduces the concept of digital normalization as a computational method for coping with massive transcriptome datasets by normalizing coverage and removing redundant reads. The document applies this method to analyze lamprey and ascidian transcriptomes. It finds that digital normalization enables assembly and analysis that would otherwise be impossible due to limitations of computational resources. The document advocates for open sharing of genomic and transcriptomic data to help characterize understudied lineages.
I am Rebecca K. I am a Microbiology Assignment Expert at nursingassignmenthelp.com. I hold a Masters’ in Microbiology, from Bournemouth University, UK. I have been helping students with their assignments for the past 11 years. I solve assignments related to Microbiology.
Visit nursingassignmenthelp.com or email info@nursingassignmenthelp.com. You can also call on +1 678 648 4277 for any assistance with Microbiology Assignments.
The document discusses using comparative gene neighborhood analysis and visualization to help understand bacterial gene function from large genome sequence datasets. It describes how genes involved in similar biological processes are often located near each other in bacterial genomes. By comparing gene neighborhoods across different genomes, functions can be predicted for unknown genes. However, this requires analyzing many gene neighborhoods to identify statistically significant patterns. The author's thesis examines designing a new visualization called BactoGeNIE that can scale to "big data" sizes and large displays to enable experts to explore and analyze comparative gene neighborhood data in an interactive way.
Apollo annotation guidelines for i5k projects Diaphorina citriMonica Munoz-Torres
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...Monica Munoz-Torres
This document describes Apollo, an open-source web-based tool for collaborative genome annotation. It allows users to curate gene models for multiple organisms on one server, generates analysis-ready data and reports. Key features include user annotations, evidence tracks, an annotator panel, search/edit/export functions, switching between organisms, and visualizing/editing annotation details. Apollo integrates with JBrowse and can export annotations to GFF3, FASTA, and Chado databases.
This document summarizes updates from the Gene Ontology Consortium website and community. It discusses the GO Help rotation team for 2016 and their productivity in resolving issues since September 2015. It also highlights updates to the GO website homepage and contributions page, including removing outdated elements and adding suggested teaching materials. The document concludes by recognizing the Berkeley Bioinformatics Open Source Projects for their support of the GO Consortium through various NIH and DOE grants.
Scientific research is inherently a collaborative task; in our case it is a dialog among different researchers to reach a shared understanding of the underlying biology. To facilitate this dialog we have developed two web-based annotation tools: Apollo (http://genomearchitect.org/), a genomic feature editor, designed to support structural annotation of gene models, and Noctua (http://noctua.berkeleybop.org/), a biological-process model builder designed for describing the functional roles of gene products. Here we wish to outline an inventory of essential requirements that, in our experience, enable an annotation tool to meet the needs of both professional biocurators as well as other members of the research community. Here are the general requirements, beyond specific functional requirements, that any annotation tool must satisfy.
Data Visualization And Annotation Workshop at Biocuration 2015Monica Munoz-Torres
8th International Biocuration Conference. Beijing, China. April 23-26, 2015.
Workshop 2: Data Visualization and Annotation.
Chairs: Rama Balakrishnan, Stanford University, USA and Monica Munoz-Torres, Lawrence Berkeley National Laboratory, USA
Explaining the most intricate biological processes often requires a degree of detail beyond the scope of equations and algorithms; in fact, most biological knowledge is represented visually as illustrations, graphs, and diagrams. Genomics data in particular require specialized forms of visualization to improve our understanding and increase our chances of extracting meaningful conclusions from our analyses. Furthermore, the heterogeneity and abundance of genomic data include widely varied sources, techniques for their obtention, and intrinsic experimental error. And even data obtained under similar conditions from two or more individuals are loaded with biological variation. So what is the best way to interpret the stories the data are telling us? Given the questions we wish to answer and the data we are generating, which tools would be most useful and effective? In this workshop we will explore the tools available for human interpretation of genomic data, specifically in the context of annotation.
Presentations and perspectives, panelists/presenters:
- Lorna Richardson, IGMM, University of Edinburgh, United Kingdom
- Justyna Szostak, PMI Research & Development, Switzerland
The workshop included a brief introduction to a landscape of tools available - as updated as the constantly changing field allows-, brief presentations chosen from abstract submissions and invited speakers, as well as ample discussion to capture the contributions and questions from attendants. In the end, we hope participants walked away with a toolset in hand that may benefit the progress of their own research.
The document provides an update on the progress of Apollo, an open-source genome annotation tool. Key points include: Milestone 2.0 will feature a simpler, more extensible architecture; new layout features like multiple organism selection and reference sequence search; pending issues for Milestone 1.0.4 and timeline for testing/release; and a call for more developers to join the Apollo community.
hematic appreciation test is a psychological assessment tool used to measure an individual's appreciation and understanding of specific themes or topics. This test helps to evaluate an individual's ability to connect different ideas and concepts within a given theme, as well as their overall comprehension and interpretation skills. The results of the test can provide valuable insights into an individual's cognitive abilities, creativity, and critical thinking skills
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...Advanced-Concepts-Team
Presentation in the Science Coffee of the Advanced Concepts Team of the European Space Agency on the 07.06.2024.
Speaker: Diego Blas (IFAE/ICREA)
Title: Gravitational wave detection with orbital motion of Moon and artificial
Abstract:
In this talk I will describe some recent ideas to find gravitational waves from supermassive black holes or of primordial origin by studying their secular effect on the orbital motion of the Moon or satellites that are laser ranged.
The binding of cosmological structures by massless topological defectsSérgio Sacani
Assuming spherical symmetry and weak field, it is shown that if one solves the Poisson equation or the Einstein field
equations sourced by a topological defect, i.e. a singularity of a very specific form, the result is a localized gravitational
field capable of driving flat rotation (i.e. Keplerian circular orbits at a constant speed for all radii) of test masses on a thin
spherical shell without any underlying mass. Moreover, a large-scale structure which exploits this solution by assembling
concentrically a number of such topological defects can establish a flat stellar or galactic rotation curve, and can also deflect
light in the same manner as an equipotential (isothermal) sphere. Thus, the need for dark matter or modified gravity theory is
mitigated, at least in part.
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...Sérgio Sacani
Context. With a mass exceeding several 104 M⊙ and a rich and dense population of massive stars, supermassive young star clusters
represent the most massive star-forming environment that is dominated by the feedback from massive stars and gravitational interactions
among stars.
Aims. In this paper we present the Extended Westerlund 1 and 2 Open Clusters Survey (EWOCS) project, which aims to investigate
the influence of the starburst environment on the formation of stars and planets, and on the evolution of both low and high mass stars.
The primary targets of this project are Westerlund 1 and 2, the closest supermassive star clusters to the Sun.
Methods. The project is based primarily on recent observations conducted with the Chandra and JWST observatories. Specifically,
the Chandra survey of Westerlund 1 consists of 36 new ACIS-I observations, nearly co-pointed, for a total exposure time of 1 Msec.
Additionally, we included 8 archival Chandra/ACIS-S observations. This paper presents the resulting catalog of X-ray sources within
and around Westerlund 1. Sources were detected by combining various existing methods, and photon extraction and source validation
were carried out using the ACIS-Extract software.
Results. The EWOCS X-ray catalog comprises 5963 validated sources out of the 9420 initially provided to ACIS-Extract, reaching a
photon flux threshold of approximately 2 × 10−8 photons cm−2
s
−1
. The X-ray sources exhibit a highly concentrated spatial distribution,
with 1075 sources located within the central 1 arcmin. We have successfully detected X-ray emissions from 126 out of the 166 known
massive stars of the cluster, and we have collected over 71 000 photons from the magnetar CXO J164710.20-455217.
When I was asked to give a companion lecture in support of ‘The Philosophy of Science’ (https://shorturl.at/4pUXz) I decided not to walk through the detail of the many methodologies in order of use. Instead, I chose to employ a long standing, and ongoing, scientific development as an exemplar. And so, I chose the ever evolving story of Thermodynamics as a scientific investigation at its best.
Conducted over a period of >200 years, Thermodynamics R&D, and application, benefitted from the highest levels of professionalism, collaboration, and technical thoroughness. New layers of application, methodology, and practice were made possible by the progressive advance of technology. In turn, this has seen measurement and modelling accuracy continually improved at a micro and macro level.
Perhaps most importantly, Thermodynamics rapidly became a primary tool in the advance of applied science/engineering/technology, spanning micro-tech, to aerospace and cosmology. I can think of no better a story to illustrate the breadth of scientific methodologies and applications at their best.
The technology uses reclaimed CO₂ as the dyeing medium in a closed loop process. When pressurized, CO₂ becomes supercritical (SC-CO₂). In this state CO₂ has a very high solvent power, allowing the dye to dissolve easily.
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfSelcen Ozturkcan
Ozturkcan, S., Berndt, A., & Angelakis, A. (2024). Mending clothing to support sustainable fashion. Presented at the 31st Annual Conference by the Consortium for International Marketing Research (CIMaR), 10-13 Jun 2024, University of Gävle, Sweden.
Immersive Learning That Works: Research Grounding and Paths ForwardLeonel Morgado
We will metaverse into the essence of immersive learning, into its three dimensions and conceptual models. This approach encompasses elements from teaching methodologies to social involvement, through organizational concerns and technologies. Challenging the perception of learning as knowledge transfer, we introduce a 'Uses, Practices & Strategies' model operationalized by the 'Immersive Learning Brain' and ‘Immersion Cube’ frameworks. This approach offers a comprehensive guide through the intricacies of immersive educational experiences and spotlighting research frontiers, along the immersion dimensions of system, narrative, and agency. Our discourse extends to stakeholders beyond the academic sphere, addressing the interests of technologists, instructional designers, and policymakers. We span various contexts, from formal education to organizational transformation to the new horizon of an AI-pervasive society. This keynote aims to unite the iLRN community in a collaborative journey towards a future where immersive learning research and practice coalesce, paving the way for innovative educational research and practice landscapes.
Current Ms word generated power point presentation covers major details about the micronuclei test. It's significance and assays to conduct it. It is used to detect the micronuclei formation inside the cells of nearly every multicellular organism. It's formation takes place during chromosomal sepration at metaphase.
ESR spectroscopy in liquid food and beverages.pptxPRIYANKA PATEL
With increasing population, people need to rely on packaged food stuffs. Packaging of food materials requires the preservation of food. There are various methods for the treatment of food to preserve them and irradiation treatment of food is one of them. It is the most common and the most harmless method for the food preservation as it does not alter the necessary micronutrients of food materials. Although irradiated food doesn’t cause any harm to the human health but still the quality assessment of food is required to provide consumers with necessary information about the food. ESR spectroscopy is the most sophisticated way to investigate the quality of the food and the free radicals induced during the processing of the food. ESR spin trapping technique is useful for the detection of highly unstable radicals in the food. The antioxidant capability of liquid food and beverages in mainly performed by spin trapping technique.
1. An Introduction to Web Apollo
Manual Annotation Workshop at Kansas State University
Monica Munoz-Torres, PhD | @monimunozto
Berkeley Bioinformatics Open-Source Projects (BBOP)
Genomics Division, Lawrence Berkeley National Laboratory
IX Arthropod Genomics Symposium. Manhattan, KS. 17 June, 2015
2. 2COURSE MATERIAL
Recommended
Browsers:
Google
Chrome,
Firefox.
Exercises
file
available
at
Basecamp
Workshop
slides
and
answers
to
exercises
will
be
available
on
Basecamp
next
week.
TODAY
3. OUTLINE
Web
Apollo
CollaboraBve
CuraBon
and
InteracBve
Analysis
of
Genomes
3OUTLINE
• GENOME
CURATION
steps
involved
• COMMUNITY
BASED
CURATION
our
experience
• APOLLO
empowering
collaboraBve
curaBon
• APOLLO
on
THE
WEB
becoming
acquainted
• PRACTICE
demonstraBon
and
exercises
4. 4
DURING THIS WORKSHOP
you will
v Understand
the
process
of
genome
curaBon
in
the
context
of
annotaBon:
assembled
genome
à
automated
annotaBon
à
manual
annotaBon
v Become
familiar
with
the
environment
and
funcBonality
of
the
Web
Apollo
genome
annotaBon
ediBng
tool.
v Learn
to
idenBfy
homologs
of
known
genes
of
interest
in
a
newly
sequenced
genome
of
interest.
v Learn
how
to
corroborate
and
modify
automaBcally
generated
gene
models
using
available
biological
evidence
(in
Apollo).
Introduction
5. 5
I INVITE YOU TO:
v Observe
details
in
figures
v Listen
to
explanaBons
v Ask
quesBons
at
any
Bme
v Use
TwiNer
&
share
your
thoughts:
I
am
@monimunozto
A
few
tags
&
users:
#WebApollo
#annotaBon
#biocuraBon
#GMOD
#genome
@JBrowseGossip
v Take
brakes:
LBL’s
ergo
team
suggests
I
should
not
work
at
the
computer
for
>45
minutes
without
a
break;
neither
should
you!
We
will
be
here
for
~2.5
hours:
please
get
up
and
stretch
your
neck,
arms,
and
legs
as
o^en
as
you
need.
Introduction
6. I kindly ask that you refrain from:
v Reading
all
the
text
I
wrote.
Think
of
the
text
on
these
slides
as
your
“class
notes”.
You
will
use
them
during
exercises.
v Checking
email.
You
have
my
undivided
aNenBon,
I’d
like
to
receive
yours
in
exchange.
Warning:
If
you
get
*caught*,
you
will
read
it
out
loudly
for
everyone
to
hear,
we
may
contribute
to
the
response.
Introduction
8. REMEMBER, REMEMBER…
from intro webinar last week
Web
Apollo
IntroducDon
Biological
concepts
to
beNer
understand
manual
annotaBon
8OUTLINE
• CENTRAL
DOGMA
in
molecular
biology
• WHAT
IS
A
GENE?
let’s
think
computaBonally
• TRANSCRIPTION
mRNA
in
detail
• TRANSLATION
and
many
definiBons
• GENOME
CURATION
steps
involved
• WHAT
TO
LOOK
FOR
training
the
annotators
9. CURATING GENOMES
steps involved
1 GeneraDon
of
Gene
Models
calling
ORFs,
one
or
more
rounds
of
gene
predicBon,
etc.
2 AnnotaDon
of
gene
models
Describing
funcBon,
expression
paNerns,
metabolic
network
memberships.
3
Manual
annotaDon
CURATING GENOMES 9
10. 10Manual Curation
GENE PREDICTION
v The
idenBficaBon
of
structural
features
of
the
genome.
• Primarily
protein-‐coding
genes.
• Also
transfer
RNAs
(tRNA),
ribosomal
RNAs
(rRNA),
regulatory
moBfs,
long
and
small
non-‐coding
RNAs
(ncRNA),
repeBBve
elements
(masked),
etc.
11. 11Manual Curation
GENE PREDICTION
v Methods
for
discovery:
1)
Ab
ini&o:
based
on
DNA
composiBon,
deals
strictly
with
genomic
sequences
and
makes
use
of
staBsBcal
approaches
to
search
for
coding
regions
and
typical
gene
signals.
• E.g.
Augustus,
GENSCAN,
geneid,
fgenesh,
etc.
12. 12
Nucleic Acids 2003 vol. 31 no. 13 3738-3741
Manual Curation
GENE PREDICTION
v Methods
for
discovery:
2)
Homology-‐based:
evidence-‐based;
finds
genes
using
either
similarity
searches
in
the
main
databases
or
experimental
data
including
RNAseq,
expressed
sequence
tags
(ESTs),
full-‐length
complementary
DNAs
(cDNAs),
etc.
• E.g:
SGP2,
fgenesh++
13. 13
In
some
cases
algorithms
and
metrics
used
to
generate
consensus
sets
may
actually
reduce
the
accuracy
of
the
gene’s
representaBon;
in
such
cases
it
is
usually
beNer
to
use
an
ab
ini&o
model
to
create
a
new
annotaBon.
GENE ANNOTATION
IntegraBon
of
data
from
predicBon
tools
to
generate
a
reliable
set
of
structural
annotaDons:
involves
ab
ini&o
predicBons,
assessment
of
biological
evidence
to
drive
the
gene
predicBon
process,
and
the
synthesis
of
these
results
to
produce
a
set
of
consensus
gene
models.
v Models
may
be
organized
using:
v automaBc
integraBon
of
predicted
sets;
e.g:
GLEAN
v packaged
tools
from
pipeline;
e.g:
MAKER
Manual Curation
14. NOT PERFECT
automated annotation remains an imperfect art
Unlike
the
more
highly
polished
genomes
of
earlier
projects,
today’s
genomes
have:
1. lower
coverage.
2. more
frequent
assembly
errors
and
annotaBon
of
genes
across
mulBple
scaffolds.
CURATING GENOMES 14
Image: www.BroadInstitute.org
15. MANUAL ANNOTATION
working concept
v Precise
elucidaBon
of
biological
features
encoded
in
the
genome
requires
careful
examinaBon
and
review.
Schiex
et
al.
Nucleic
Acids
2003
(31)
13:
3738-‐3741
Automated Predictions
Experimental Evidence
Manual Curation 15
cDNAs,
HMM
domain
searches,
RNAseq,
genes
from
other
species.
16. MANUAL ANNOTATION
is necessary
v Evaluate
all
available
evidence
and
corroborate
or
modify
genome
element
predicBons.
v Determine
funcBonal
roles
through
comparaBve
analysis
using
literature,
databases,
and
experimental
data.
v Resolve
discrepancies
and
validate
automated
gene
model
hypotheses.
v Desktop
version
of
Apollo
was
designed
to
fit
the
manual
annotaBon
needs
of
genome
projects
such
as
fruit
fly,
mouse,
zebrafish,
human,
etc.
Manual Curation 16
Automated Predictions
Curated Gene Models
Official Gene Set
“Incorrect
and
incomplete
genome
annota&ons
will
poison
every
experiment
that
uses
them”.
-‐
M.
Yandell
17. BUT, MANUAL CURATION
did not always scale well
A
small
group
of
highly
trained
experts;
e.g.
GO
1
Museum
Model
A
few
very
good
biologists
and
a
few
very
good
bioinformaBcians
camp
together,
during
intense
but
short
periods
of
Bme.
Old-‐Dme
Jamborees
2
Researchers
work
by
themselves,
then
may
or
may
not
publicize
results;
…
may
be
a
dead-‐end
with
very
few
people
ever
aware
of
these
results.
CoQage
Model
3
Elsik
et
al.
2006.
Genome
Res.
16(11):1329-‐33.
Manual Curation 17
Too
many
sequences
and
not
enough
hands
to
approach
curaBon.
18. POWER TO THE CURATORS
augment existing tools
Fill
in
the
gap
for
all
the
things
that
won’t
be
easy
to
cover
with
these
approaches;
this
will
allow
researchers
to
beNer
contribute
their
efforts.
Give
more
people
the
power
to
curate!
Big
data
are
not
a
subsBtute
for,
but
a
supplement
to
tradiBonal
data
collecBon
and
analysis.
The
Parable
of
Google
Flu.
Lazer
et
al.
2014.
Science
343
(6176):
1203-‐1205.
v Enable
more
curators
to
work
v Enable
beNer
scienBfic
publishing
v Credit
curators
for
their
work
Manual Curation 18
19. IMPROVING TOOLS FOR MANUAL ANNOTATION
our plan
“More
and
more
sequences”:
more
genomes,
within
populaBons
and
across
species,
are
now
being
sequenced.
This
begs
the
need
for
a
universally
accessible
genome
curaBon
tool:
Manual Curation 19
To
produce
accurate
sets
of
genomic
features.
To
address
the
need
to
correct
for
more
frequent
assembly
and
automated
predicBon
errors
due
to
new
sequencing
technologies.
20. GENOME ANNOTATION
an inherently collaborative task
Researchers
o^en
turn
to
colleagues
for
second
opinions
and
insight
from
those
with
experBse
in
parBcular
areas
(e.g.,
domains,
families).
To
facilitate
and
encourage
this,
we
conBnue
to
improve
Apollo.
APOLLO 20
Apollo
is
a
web-‐based,
collaboraBve
genomic
annotaBon
ediBng
plavorm.
We
need
annota&on
edi&ng
tools
to
modify
and
refine
the
precise
loca&on
and
structure
of
the
genome
elements
that
predic&ve
algorithms
cannot
yet
resolve
automa&cally.
hNp://GenomeArchitect.org
21. APOLLO
genome annotation editing tool
21
v Web
based,
integrated
with
JBrowse.
v Supports
real
Bme
collaboraBon!
v AutomaBc
generaBon
of
ready-‐made
computable
data.
v Supports
annotaBon
of
genes,
pseudogenes,
tRNAs,
snRNAs,
snoRNAs,
ncRNAs,
miRNAs,
TEs,
and
repeats.
v IntuiBve
annotaBon,
gestures,
and
pull-‐down
menus
to
create
and
edit
transcripts
and
exons
structures,
insert
comments
(CV,
freeform
text),
GO
terms,
etc.
APOLLO
22. NEW APOLLO ARCHITECTURE
simpler, more flexible
APOLLO 22
Web-‐based
client
+
annotaBon-‐ediBng
engine
+
server-‐side
data
service
REST / JSON
Websockets
Annotation Engine (Server)
Shiro
LDAP
OAuth
JBrowse Data
Organism 2
Annotations
Security
Preferences
Organisms
Tracks
BAM
BED
VCF
GFF3
BigWig
Annotators
Google Web Toolkit (GWT) /
Bootstrap
JBrowse DOJO / jQuery JBrowse Data
Organism 1
Load genomic
evidence for
selected organism
Single Data Store
PostgreSQL, MySQL,
MongoDB, ElasticSearch
Apollo v2.0
23. We
conBnuously
train
and
support
hundreds
of
geographically
dispersed
scienBsts
from
many
research
communiBes
to
conduct
manual
annotaBons,
recovering
coding
sequences
in
agreement
with
all
available
biological
evidence
using
Web
Apollo.
v Gate
keeping
and
monitoring.
v Tutorials,
training
workshops,
and
“geneborees”.
v Personalized
user
support.
23
DISPERSED COMMUNITIES
collaborative manual annotation efforts
APOLLO
24. 24
CURATION
how it works
IdenBfies
elements
that
best
represent
the
underlying
biology
(including
missing
genes)
and
eliminates
elements
that
reflect
systemic
errors
of
automated
analyses.
Assigns
funcBon
through
comparaBve
analysis
of
similar
genome
elements
from
closely
related
species
using
literature,
databases,
and
researchers’
lab
data.
1
2
Examples
Comparing
7
ant
genomes
contributed
to
beNer
understanding
evoluBon
and
organizaBon
of
insect
socieBes
at
the
molecular
level;
e.g.
division
of
labor,
mutualism,
chemical
communicaBon,
etc.
Libbrecht
et
al.
2012.
Genome
Biology
2013,
14:212
Queen
Bee
Worker
Bee
Castes
Larva
Dnmt
RNAi
Royal
jelly
Kucharski
et
al.
2008.
Science
(319)
5871:
1827-‐1830
Insect
Methylome
Anchoring
molecular
markers
to
reference
genome
pointed
to
chromosomal
rearrangements
&
detecBng
signals
of
adapBve
radiaBon
in
Heliconius
buNerflies.
Joron
et
al.
2011.
Nature,
477:203-‐206
APOLLO
25. CURRENT COLLABORATIONS
training and contributions
Partnerships
WEB APOLLO 25
UNIVERSITY
of MISSOURI
National
Agricultural
Library
Nature
Reviews
Gene&cs
2009
(10),
346-‐347
Norwegian
Spruce
hNp://congenie.org/
Phlebotomus
papatasi
Tallapoosa
darter
hNp://darter2.westga.edu/
Wasmania
auropunctata
Homo
sapiens
hg19
Pinus
taeda
hIp://dendrome.ucdavis.edu/treegenes/browsers/
26. LESSONS LEARNED
What
we
have
learned:
• CollaboraBve
work
disBlls
invaluable
knowledge
• We
must
enforce
strict
rules
and
formats
• We
must
evolve
with
the
data
• A
liNle
training
goes
a
long
way
• NGS
poses
addiBonal
challenges
PREVIOUSLY WE LEARNED 26
27. THE COLLABORATIVE CURATION PROCESS AT I5K
1) In
some
cases
a
computaBonally
predicted
consensus
gene
set
is
generated
using
mulBple
lines
of
evidence.
In
other
cases,
more
than
one
gene
set
are
made
available
for
analysis:
e.g.
Primary
Gene
Sets:
HAZT_v0.5.3-‐Models,
Augustus
gene
set.
2) i5K
Projects
will
integrate
consensus
computaBonal
predicBons
with
manual
annotaBons
to
produce
an
updated
Official
Gene
Set
(OGS):
» If
it’s
not
on
either
track,
it
won’t
make
the
OGS!
» If
it’s
there
and
it
shouldn’t,
it
will
sBll
make
the
OGS!
27Collaborative Curation at i5K
28. CONSENSUS SET: REFERENCE AND START POINT
• Isoforms:
drag
original
and
alternaBvely
spliced
form
to
‘User-‐
created
Annota&ons’
area.
• If
an
annotaBon
needs
to
be
removed
from
the
consensus
set,
drag
it
to
the
‘User-‐created
Annota&ons’
area
and
label
as
‘Delete’
on
InformaBon
Editor.
• Overlapping
interests?
Collaborate
to
reach
agreement.
• Follow
guidelines
for
i5K
Pilot
Species
Projects
as
shown
at
hNp://goo.gl/LRu1VY
28Collaborative Curation at i5K
31. NavigaBon
tools:
pan
and
zoom
Search
box:
go
to
a
scaffold
or
a
gene
model.
Grey
bar
of
coordinates
indicates
locaBon.
You
can
also
select
here
in
order
to
zoom
to
a
sub-‐region.
‘View’:
change
color
by
CDS,
toggle
strands,
set
highlight.
‘File’:
Upload
your
own
evidence:
GFF3,
BAM,
BigWig,
VCF*.
Add
combinaBon
and
sequence
search
tracks.
‘Tools’:
Use
BLAT
to
query
the
genome
with
a
protein
or
DNA
sequence.
Available Tracks
Evidence
Tracks
Area
‘User-‐created
AnnotaBons’
Track
Login
31
WEB APOLLO
graphical user interface (GUI) for editing annotations
Becoming Acquainted with Web Apollo.
32. In
addiBon
to
protein-‐coding
gene
annotaBon
that
you
know
and
love.
• Non-‐coding
genes:
ncRNAs,
miRNAs,
repeat
regions,
and
TEs
• Sequence
alteraBons
(less
coverage
=
more
fragmentaBon)
• VisualizaBon
of
stage
and
cell-‐type
specific
transcripBon
data
as
coverage
plots,
heat
maps,
and
alignments
32
32
WEB APOLLO
additional functionality
Becoming Acquainted with Web Apollo.
33. 1. Select
a
chromosomal
region
of
interest,
e.g.
scaffold.
2. Select
appropriate
evidence
tracks.
3. Determine
whether
a
feature
in
an
exisBng
evidence
track
will
provide
a
reasonable
gene
model
to
start
working.
-‐ If
yes:
select
and
drag
the
feature
to
the
‘User-‐created
AnnotaBons’
area,
creaDng
an
iniDal
gene
model.
If
necessary
use
ediBng
funcBons
to
adjust
the
gene
model.
-‐ If
not:
let’s
talk.
4. Check
your
edited
gene
model
for
integrity
and
accuracy
by
comparing
it
with
available
homologs.
Becoming Acquainted with Web Apollo
33 |
Always
remember:
when
annotaBng
gene
models
using
Web
Apollo,
you
are
looking
at
a
‘frozen’
version
of
the
genome
assembly
and
you
will
not
be
able
to
modify
the
assembly
itself.
33
GENERAL PROCESS OF CURATION
steps to remember
34. Choose
(click
or
drag)
appropriate
evidence
tracks
from
the
list
on
the
le^.
Click
on
an
exon
to
select
it.
Double
click
on
an
exon
or
single
click
on
an
intron
to
select
the
enBre
gene.
Select
&
drag
any
elements
from
an
evidence
track
into
the
curaBon
area:
these
are
editable
and
considered
the
curated
version
of
the
gene.
Other
opBons
for
elements
in
evidence
tracks
available
from
right-‐click
menu.
If
you
select
an
exon
or
a
gene,
then
every
track
is
automaBcally
searched
for
exons
with
exactly
the
same
co-‐ordinates
as
what
you
selected.
Matching
edges
are
highlighted
red.
Hovering
over
an
annotaBon
in
progress
brings
up
an
informaBon
pop-‐up.
34 | 34
Becoming Acquainted with Web Apollo.
USER NAVIGATION
35. Right-‐click
menu:
• With
the
excepBon
of
deleBng
a
model,
all
edits
can
be
reversed
with
‘Undo’
opBon.
‘Redo’
also
available.
All
changes
are
immediately
saved
and
available
to
all
users
in
real
Bme.
• ‘Get
sequence’
retrieves
pepBde,
cDNA,
CDS,
and
genomic
sequences.
• You
can
select
an
exon
and
select
‘Delete’.
You
can
create
an
intron,
flip
the
direcBon,
change
the
start
or
split
the
gene.
35 | 35
USER NAVIGATION
Becoming Acquainted with Web Apollo.
36. Right-‐click
menu:
• If
you
select
two
gene
models,
you
can
join
them
using
‘Merge’,
and
you
may
also
‘Split’
a
model.
• You
can
select
‘Duplicate’,
for
example
to
annotate
isoforms.
• Set
translaBon
start,
annotate
selenocysteine-‐containing
proteins,
match
edges
of
annotaBon
to
those
of
evidence
tracks.
36 | 36
USER NAVIGATION
Becoming Acquainted with Web Apollo.
37. 37
AnnotaBons,
annotaBon
edits,
and
History:
stored
in
a
centralized
database.
37
USER NAVIGATION
Becoming Acquainted with Web Apollo.
38. 38
The
AnnotaBon
InformaBon
Editor
DBXRefs
are
database
crossed
references:
if
you
have
reason
to
believe
that
this
gene
is
linked
to
a
gene
in
a
public
database
(including
your
own),
then
add
it
here.
38
USER NAVIGATION
Becoming Acquainted with Web Apollo.
39. 39
The
AnnotaBon
InformaBon
Editor
• Add
PubMed
IDs
• Include
GO
terms
as
appropriate
from
any
of
the
three
ontologies
• Write
comments
staBng
how
you
have
validated
each
model.
39
USER NAVIGATION
Becoming Acquainted with Web Apollo.
40. 40 |
• ‘Zoom
to
base
level’
opBon
reveals
the
DNA
Track.
• Change
color
of
exons
by
CDS
from
the
‘View’
menu.
• The
reference
DNA
sequence
is
visible
in
both
direcBons
as
are
the
protein
translaBons
in
all
six
frames.
You
can
toggle
either
direcBon
to
display
only
3
frames.
Zoom
in/out
with
keyboard:
shi^
+
arrow
keys
up/down
40
USER NAVIGATION
Becoming Acquainted with Web Apollo.
41. Web Apollo User Guide
(Fragment)
http://genomearchitect.org/web_apollo_user_guide
42. In
a
“simple
case”
the
predicted
gene
model
is
correct
or
nearly
correct,
and
this
model
is
supported
by
evidence
that
completely
or
mostly
agrees
with
the
predicBon.
Evidence
that
extends
beyond
the
predicted
model
is
assumed
to
be
non-‐coding
sequence.
The
following
secBons
describe
simple
modificaBons.
42 | 42
ANNOTATING SIMPLE CASES
Becoming Acquainted with Web Apollo.
43. Select
and
drag
the
putaBve
new
exon
from
a
track,
and
add
it
directly
to
an
annotated
transcript
in
the
‘User-‐created
AnnotaBons’
area.
• Click
the
exon,
hold
your
finger
on
the
mouse
buNon,
and
drag
the
cursor
unBl
it
touches
the
receiving
transcript.
A
dark
green
highlight
indicates
it
is
okay
to
release
the
mouse
buNon.
• When
released,
the
addiBonal
exon
becomes
aNached
to
the
receiving
transcript.
43 |
• A
confirmaBon
box
will
warn
you
if
the
receiving
transcript
is
not
on
the
same
strand
as
the
feature
where
the
new
exon
originated.
43
ADDING EXONS
Becoming Acquainted with Web Apollo.
44. Each
Bme
you
add
an
exon
region,
whether
by
extension
or
adding
an
exon,
Web
Apollo
recalculates
the
longest
ORF,
idenBfying
‘Start’
and
‘Stop’
signals
and
allowing
you
to
determine
whether
a
‘Stop’
codon
has
been
incorporated
a^er
each
ediBng
step.
44 |
Web
Apollo
demands
that
an
exon
already
exists
as
an
evidence
in
one
of
the
tracks.
You
could
provide
a
text
file
in
GFF
format
and
select
File
à
Open.
GFF
is
a
simple
text
file
delimited
by
TABs,
one
line
for
each
genomic
‘feature’:
column
1
is
the
name
of
the
scaffold;
then
some
text
(irrelevant),
then
‘exon’,
then
start,
stop,
strand
as
+
or
-‐,
a
dot,
another
dot,
and
Name=some
name
Example:
scaffold_88
Qratore
exon
21
2111
+
.
.
Name=bob
scaffold_88
Qratore
exon
2201
5111
+
.
.
Name=rad
44
ADDING EXONS
Becoming Acquainted with Web Apollo.
45. Gene
predicBons
may
or
may
not
include
UTRs.
If
transcript
alignment
data
are
available
and
extend
beyond
your
original
annotaBon,
you
may
extend
or
add
UTRs.
1. PosiBon
the
cursor
at
the
beginning
of
the
exon
that
needs
to
be
extended
and
‘Zoom
to
base
level’.
2. Place
the
cursor
over
the
edge
of
the
exon
unBl
it
becomes
a
black
arrow
then
click
and
drag
the
edge
of
the
exon
to
the
new
coordinate
posiBon
that
includes
the
UTR.
45 |
View
zoomed
to
base
level.
The
DNA
track
and
annotaBon
track
are
visible.
The
DNA
track
includes
the
sense
strand
(top)
and
anB-‐sense
strand
(boNom).
The
six
reading
frames
flank
the
DNA
track,
with
the
three
forward
frames
above
and
the
three
reverse
frames
below.
The
User-‐
created
AnnotaBon
track
shows
the
terminal
end
of
an
annotaBon.
The
green
rectangle
highlights
the
locaBon
of
the
nucleoBde
residues
in
the
‘Stop’
signal.
To
add
a
new
spliced
UTR
to
an
exisBng
annotaBon
follow
the
procedure
for
adding
an
exon.
45
ADDING UTRs
Becoming Acquainted with Web Apollo.
46. 1. Zoom
in
sufficiently
to
clearly
resolve
each
exon
as
a
disBnct
rectangle.
2. Two
exons
from
different
tracks
sharing
the
same
start
and/or
end
coordinates
will
display
a
red
bar
to
indicate
the
matching
edges.
3. SelecBng
the
whole
annotaBon
or
one
exon
at
a
Bme,
use
this
‘edge-‐
matching’
funcBon
and
scroll
along
the
length
of
the
annotaBon,
verifying
exon
boundaries
against
available
data.
Use
square
[
]
brackets
to
scroll
from
exon
to
exon.
4. Note
if
there
are
cDNA
/
RNAseq
reads
that
lack
one
or
more
of
the
annotated
exons
or
include
addiBonal
exons.
46 | 46
EXON STRUCTURE INTEGRITY
Becoming Acquainted with Web Apollo.
47. To
modify
an
exon
boundary
and
match
data
in
the
evidence
tracks:
select
both
the
offending
exon
and
the
feature
with
the
expected
boundary,
then
right
click
on
the
annotaBon
to
select
‘Set
3’
end’
or
‘Set
5’
end’
as
appropriate.
47 |
In
some
cases
all
the
data
may
disagree
with
the
annotaBon,
in
other
cases
some
data
support
the
annotaBon
and
some
of
the
data
support
one
or
more
alternaBve
transcripts.
Try
to
annotate
as
many
alternaBve
transcripts
as
are
well
supported
by
the
data.
47
EXON STRUCTURE INTEGRITY
Becoming Acquainted with Web Apollo.
48. Flags
non-‐canonical
splice
sites.
SelecBon
of
features
and
sub-‐
features
Edge-‐matching
Evidence
Tracks
Area
‘User-‐created
AnnotaBons’
Track
The
ediBng
logic
in
the
server:
§ selects
longest
ORF
as
CDS
§ flags
non-‐canonical
splice
sites
48
EDITING LOGIC
Becoming Acquainted with Web Apollo.
49. Zoom
to
base
level
to
review
non-‐
canonical
splice
site
warnings.
These
do
not
necessarily
need
to
be
corrected,
but
should
be
flagged
with
the
appropriate
comment.
49 |
Exon/intron
juncBon
possible
error
Original
model
Curated
model
Non-‐canonical
splices
are
indicated
by
an
orange
circle
with
a
white
exclamaBon
point
inside,
placed
over
the
edge
of
the
offending
exon.
Most
insects,
have
a
valid
non-‐canonical
site
GC-‐AG.
Other
non-‐canonical
splice
sites
are
unverified.
Web
Apollo
flags
GC
splice
donors
as
non-‐canonical.
Canonical
splice
sites:
3’-‐…exon]GA
/
TG[exon…-‐5’
5’-‐…exon]GT
/
AG[exon…-‐3’
reverse
strand,
not
reverse-‐complemented:
forward
strand
49
SPLICE SITES
Becoming Acquainted with Web Apollo.
50. Some
gene
predicBon
algorithms
do
not
recognize
GC
splice
sites,
thus
the
intron/exon
juncBon
may
be
incorrect.
For
example,
one
such
gene
predicBon
algorithm
may
ignore
a
true
GC
donor
and
select
another
non-‐canonical
splice
site
that
is
less
frequently
observed
in
nature.
Therefore,
if
upon
inspecBon
you
find
a
non-‐
canonical
splice
site
that
is
rarely
observed
in
nature,
you
may
wish
to
search
the
region
for
a
more
frequent
in-‐frame
non-‐canonical
splice
site,
such
as
a
GC
donor.
If
there
is
an
in-‐frame
site
close
that
is
more
likely
to
be
the
correct
splice
donor,
you
may
make
this
adjustment
while
zoomed
at
base
level.
50 |
Exon/intron junction possible error
Original model
Curated model
Use
RNA-‐Seq
data
to
make
a
decision.
Canonical
splice
sites:
3’-‐…exon]GA
/
TG[exon…-‐5’
5’-‐…exon]GT
/
AG[exon…-‐3’
reverse
strand,
not
reverse-‐complemented:
forward
strand
50
SPLICE SITES
keep this in mind
Becoming Acquainted with Web Apollo.
51. Web
Apollo
calculates
the
longest
possible
open
reading
frame
(ORF)
that
includes
canonical
‘Start’
and
‘Stop’
signals
within
the
predicted
exons.
If
it
appears
to
have
calculated
an
incorrect
‘Start’
signal,
you
may
modify
it
selecBng
an
in-‐frame
‘Start’
codon
further
up
or
downstream,
depending
on
evidence
(protein
database,
addiBonal
evidence
tracks).
An
upstream
‘Start’
codon
may
be
present
outside
the
predicted
gene
model,
within
a
region
supported
by
another
evidence
track.
51 | 51
‘START’ AND ‘STOP’ SITES
Becoming Acquainted with Web Apollo.
52. Note
that
the
‘Start’
codon
may
also
be
located
in
a
non-‐predicted
exon
further
upstream.
If
you
cannot
idenBfy
that
exon,
add
the
appropriate
note
in
the
transcript’s
‘Comments’
secBon.
In
very
rare
cases,
the
actual
‘Start’
codon
may
be
non-‐canonical
(non-‐ATG).
In
some
cases,
a
‘Stop’
codon
may
not
be
automaBcally
idenBfied.
Check
to
see
if
there
are
data
supporBng
a
3’
extension
of
the
terminal
exon
or
addiBonal
3’
exons
with
valid
splice
sites.
52 | 52
‘START’ AND ‘STOP’ SITES
keep this in mind
Becoming Acquainted with Web Apollo.
53.
54. Evidence
may
support
joining
two
or
more
different
gene
models.
Warning:
protein
alignments
may
have
incorrect
splice
sites
and
lack
non-‐conserved
regions!
1. Drag
and
drop
each
gene
model
to
‘User-‐created
AnnotaBons’
area.
Shi^
click
to
select
an
intron
from
each
gene
model
and
right
click
to
select
the
‘Merge’
opBon
from
the
menu.
2. Drag
supporBng
evidence
tracks
over
the
candidate
models
to
corroborate
overlap,
or
review
edge
matching
and
coverage
across
models.
3. Check
the
resulBng
translaBon
by
querying
a
protein
database
e.g.
UniProt.
Record
the
IDs
of
both
starBng
gene
models
in
‘DBXref’
and
add
comments
to
record
that
this
annotaBon
is
the
result
of
a
merge.
54 | 54
Red
lines
around
exons:
‘edge-‐matching’
allows
annotators
to
confirm
whether
the
evidence
is
in
agreement
without
examining
each
exon
at
the
base
level.
COMPLEX CASES
merge two gene predictions on the same scaffold
Becoming Acquainted with Web Apollo.
55. One
or
more
splits
may
be
recommended
when
different
segments
of
the
predicted
protein
align
to
two
or
more
different
families
of
protein
homologs,
and
the
predicted
protein
does
not
align
to
any
known
protein
over
its
enBre
length.
Transcript
data
may
support
a
split
(if
so,
verify
that
it
is
not
a
case
of
alternaBve
transcripts).
55 | 55
COMPLEX CASES
split a gene prediction
Becoming Acquainted with Web Apollo.
56. DNA
Track
‘User-‐created
AnnotaDons’
Track
56
COMPLEX CASES
frameshifts, single-base errors, and selenocysteines
Becoming Acquainted with Web Apollo.
57. 1. Web
Apollo
allows
annotators
to
make
single
base
modificaBons
or
frameshi^s
that
are
reflected
in
the
sequence
and
structure
of
any
transcripts
overlapping
the
modificaBon.
Note
that
these
manipulaBons
do
NOT
change
the
underlying
genomic
sequence.
2. If
you
determine
that
you
need
to
make
one
of
these
changes,
zoom
in
to
the
nucleoBde
level
and
right
click
over
a
single
nucleoBde
on
the
genomic
sequence
to
access
a
menu
that
provides
opBons
for
creaBng
inserBons,
deleBons
or
subsBtuBons.
3. The
‘Create
Genomic
InserBon’
feature
will
require
you
to
enter
the
necessary
string
of
nucleoBde
residues
that
will
be
inserted
to
the
right
of
the
cursor’s
current
locaBon.
The
‘Create
Genomic
DeleBon’
opBon
will
require
you
to
enter
the
length
of
the
deleBon,
starBng
with
the
nucleoBde
where
the
cursor
is
posiBoned.
The
‘Create
Genomic
SubsBtuBon’
feature
asks
for
the
string
of
nucleoBde
residues
that
will
replace
the
ones
on
the
DNA
track.
4. Once
you
have
entered
the
modificaBons,
Web
Apollo
will
recalculate
the
corrected
transcript
and
protein
sequences,
which
will
appear
when
you
use
the
right-‐click
menu
‘Get
Sequence’
opBon.
Since
the
underlying
genomic
sequence
is
reflected
in
all
annotaBons
that
include
the
modified
region
you
should
alert
the
curators
of
your
organisms
database
using
the
‘Comments’
secBon
to
report
the
CDS
edits.
5. In
special
cases
such
as
selenocysteine
containing
proteins
(read-‐throughs),
right-‐click
over
the
offending/premature
‘Stop’
signal
and
choose
the
‘Set
readthrough
stop
codon’
opBon
from
the
menu.
57 | 57
COMPLEX CASES
frameshifts, single-base errors, and selenocysteines
Becoming Acquainted with Web Apollo.
58. Follow
our
checklist
unBl
you
are
happy
with
the
annotaBon!
Then:
– Comment
to
validate
your
annotaBon,
even
if
you
made
no
changes
to
an
exisBng
model.
Your
comments
mean
you
looked
at
the
curated
model
and
are
happy
with
it;
think
of
it
as
a
vote
of
confidence.
– Or
add
a
comment
to
inform
the
community
of
unresolved
issues
you
think
this
model
may
have.
58 | 58
Always
Remember:
Web
Apollo
curaBon
is
a
community
effort
so
please
use
comments
to
communicate
the
reasons
for
your
annotaBon
(your
comments
will
be
visible
to
everyone).
COMPLETING THE ANNOTATION
Becoming Acquainted with Web Apollo.
59. To
find
the
gene
region
you
wish
to
annotate,
you
may
use:
a) a
protein
sequence
of
a
homolog
from
another
species
b) a
sequence
from
a
similar
gene
in
species
of
interest
(e.g.
another
gene
family
member)
c) on
your
own,
you
aligned
your
gene
models
or
transcriptomic
data
to
the
genome.
d) you
used
high
quality
proteins
and/or
gene
family
alignments
(mulB
or
single
species)
and
are
able
to
idenBfy
conserved
domains.
OpDon
1
–
You
have
a
sequence
but
don’t
know
where
it
is
in
this
genome:
• Use
BLAT
in
the
Apollo
window,
or
BLAST
at
NAL’s
i5k
BLAST
server,
available
at:
hNp://i5k.nal.usda.gov/blastn
• You
may
also
use
other
tools
for
annotaBon
and
contribute
your
data
from
those
efforts.
OpDon
2
–
The
genome
has
already
been
annotated
with
your
sequences
and
you
have
a
gene
idenBfier
that
has
been
indexed
in
Apollo.
• That
is,
you
know
where
to
look,
so
type
the
ID
in
the
Search
box
of
Apollo.
• Apollo
autocompletes
using
a
case-‐insensiBve
search
anchored
on
the
le^-‐hand
side
of
the
word.
For
example
“HaGR”
will
show
all
“hagr”
objects
(up
to
30).
• Choose
one
of
the
genes
and
click
“Go”.
• You
can
do
that
with
Domains,
Alignments
or
Gene
names
provided
to
you
(if
they
have
been
indexed).
OpDon
3
–
Find
genes
based
on
funcBonal
ontology
terms
or
network
membership
idenBfiers.
Becoming Acquainted with Web Apollo.
HOW TO BEGIN
60. 1. Select
the
chromosomal
region
of
interest,
e.g.
scaffold.
2. Select
appropriate
evidence
tracks.
3. Determine
whether
a
feature
in
an
exisBng
evidence
track
will
provide
a
reasonable
gene
model
to
start
working.
-‐ If
yes:
select
and
drag
the
feature
to
the
‘User-‐created
AnnotaBons’
area,
creaDng
an
iniDal
gene
model.
If
necessary
use
ediBng
funcBons
to
adjust
the
gene
model.
-‐ Nothing
available
to
you?
Let’s
have
a
talk.
4. Check
your
edited
gene
model
for
integrity
and
accuracy
by
comparing
it
with
available
homologs.
60 |
Always
remember:
when
annotaBng
gene
models
using
Apollo,
you
are
looking
at
a
‘frozen’
version
of
the
genome
assembly
and
you
will
not
be
able
to
modify
the
assembly
itself.
60
Becoming Acquainted with Web Apollo.
GENERAL PROCESS OF CURATION
61. 61CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
pay attention to these details
v AnnotaDng
a
simple
case:
WHEN
“The
official
predicBon
is
correct,
or
nearly
correct,
assuming
that
no
aligned
data
extends
beyond
the
gene
model
and
if
so,
it
is
not
likely
to
be
coding
sequence,
and/or
the
gene
predicBon
matches
what
you
know
about
the
gene”:
a. Can
you
add
UTRs?
b. Check
exon
structures.
c. Check
splice
sites:
…]5’-‐GT/AG-‐3’[…
d. Check
‘start’
and
‘stop’
sites.
e. Check
the
predicted
protein
product(s).
f. If
the
protein
product
sBll
does
not
look
correct,
go
on
to
“AnnotaBng
more
complex
cases”.
62. 62CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
continued
v AddiDonal
funcDonality.
You
may
also
need
to
learn
how
to:
a. Get
genomic
sequence
b. Merge
exons
c. Add/Delete
an
exon
d. Create
an
exon
de
novo
(within
an
intron
or
outside
exisBng
annotaBons).
e. Right/apple-‐click
on
a
feature
to
get
feature
ID
and
addiBonal
informaBon
f. Looking
up
homolog
descripBons
going
to
the
accession
web
page
at
UniProt/Swissprot
63. 63CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
continued
v AnnotaDng
more
complex
cases:
a. Incomplete
annotaBon:
protein
integrity
checks,
indicate
gaps,
missing
5’
sequences
or
missing
3’
sequences.
b. Merge
of
2
gene
predicBons
on
same
scaffold
c. Merge
of
2
gene
predicBons
on
different
scaffolds
(uh-‐oh!).
d. Split
of
a
gene
predicBon
e. Frameshi^s,
Selenocysteine,
single-‐base
errors,
and
other
inconvenient
phenomena
64. 64CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
continued
v Adding
important
project
informaDon
in
the
form
of
Canned
and/or
Customized
Comments:
a. NCBI
ID,
RefSeq
ID,
gene
symbol(s),
common
name(s),
synonyms,
top
BLAST
hits
(GenBank
IDs),
orthologs
with
species
names,
and
anything
else
you
can
think
of,
because
you
are
the
expert.
b. Type
of
annotaBon
(e.g.:
whether
or
not
the
gene
model
was
changed)
c. Data
source
(for
example
if
the
Fgeneshpp
predicted
gene
was
the
starBng
point
for
your
annotaBon)
d. The
kinds
of
changes
you
made
to
the
gene
model,
e.g.:
split,
merge
e. FuncBonal
descripBon
f. Whether
you
would
like
for
your
MOD
curator
to
check
the
annotaBon
g. Whether
part
of
your
gene
is
on
a
different
scaffold.
65. 1. Can
you
add
UTRs
(e.g.:
via
RNA-‐Seq)?
2. Check
exon
structures
3. Check
splice
sites:
most
splice
sites
display
these
residues
…]5’-‐GT/AG-‐3’[…
4. Check
‘Start’
and
‘Stop’
sites
5. Check
the
predicted
protein
product(s)
– Align
it
against
relevant
genes/gene
family.
– blastp
against
NCBI’s
RefSeq
or
nr
6. If
the
protein
product
sBll
does
not
look
correct
then
check:
– Are
there
gaps
in
the
genome?
– Merge
of
2
gene
predicBons
on
the
same
scaffold
– Merge
of
2
gene
predicBons
from
different
scaffolds
– Split
a
gene
predicBon
– Frameshi^s
– error
in
the
genome
assembly?
– Selenocysteine,
single-‐base
errors,
and
other
inconvenient
phenomena
65 | 65
7. Finalize
annotaBon
by
adding:
– Important
project
informaBon
in
the
form
of
canned
and/or
customized
comments
– IDs
from
GenBank
(via
DBXRef),
gene
symbol(s),
common
name(s),
synonyms,
top
BLAST
hits
(with
GenBank
IDs),
orthologs
with
species
names,
and
everything
else
you
can
think
of,
because
you
are
the
expert.
– Whether
your
model
replaces
one
or
more
models
from
the
official
gene
set
(so
it
can
be
deleted).
– The
kinds
of
changes
you
made
to
the
gene
model
of
interest,
if
any.
E.g.:
splits,
merges,
whether
the
5’
or
3’
ends
had
to
be
modified
to
include
‘Start’
or
‘Stop’
codons,
addiBonal
exons
had
to
be
added,
or
non-‐canonical
splice
sites
were
accepted.
– Any
funcBonal
assignments
that
you
think
are
of
interest
to
the
community
(e.g.
via
BLAST,
RNA-‐Seq
data,
literature
searches,
etc.)
THE CHECK LIST
for accuracy and integrity
Becoming Acquainted with Web Apollo.
67. Apollo Example
-‐
Introductory
demonstraBon
using
the
Hyalella
azteca
genome
(amphipod
crustacean).
Example 67
A
public
Apollo
Demo
using
the
Honey
Bee
genome
is
available
at
hNp://genomearchitect.org/WebApolloDemo
68. What do we know about this genome?
• Currently
publicly
available
data
at
NCBI:
• >37,000
nucleoBde
seqsà
scaffolds,
mitochondrial
genes
• 300
amino
acid
seqsà
mitochondrion
• 53
ESTs
• 0
conserved
domains
idenBfied
• 0
“gene”
entries
submiNed
• Data
at
i5K
Workspace@NAL
-‐
10,832
scaffolds,
23,288
transcripts,
12,906
proteins
Example 68
70. PubMed Search: what’s new?
Example 70
“Ten
populaBons
(3
laboratory
cultures,
7
California
water
bodies)
differed
by
at
least
550-‐fold
in
sensiBvity
to
pyrethroids.”
“By
sequencing
the
primary
pyrethroid
target
site,
the
voltage-‐gated
sodium
channel
(vgsc),
we
show
that
point
mutaBons
and
their
spread
in
natural
populaBons
were
responsible
for
differences
in
pyrethroid
sensiBvity.”
“The
finding
that
a
non-‐target
aquaBc
species
has
acquired
resistance
to
pesBcides
used
only
on
terrestrial
pests
is
troubling
evidence
of
the
impact
of
chronic
pesBcide
transport
from
land-‐based
applicaBons
into
aquaBc
systems.”
71. How many sequences for our gene of
interest?
Example 71
• Para,
(voltage-‐gated
sodium
channel
alpha
subunit;
Nasonia
vitripennis).
• NaCP60E
(Sodium
channel
protein
60
E;
D.
melanogaster).
• MF:
voltage-‐gated
caBon
channel
acBvity
(IDA,
GO:0022843).
• BP:
olfactory
behavior
(IMP,
GO:0042048),
sodium
ion
transmembrane
transport
(ISS,GO:0035725).
• CC:
voltage-‐gated
sodium
channel
complex
(IEA,
GO:0001518).
And
what
do
we
know
about
them?
72. BLAST at i5K
https://i5k.nal.usda.gov/blast
Example 72
>vgsc-‐Segment3-‐DomainII
RVFKLAKSWPTLNLLISIMGKTVGALGNLTFVLCIIIFIFAVMGMQLFGKNYTEKVTKFKWSQDG
QMPRWNFVDFFHSFMIVFRVLCGEWIESMWDCMYVGDFSCVPFFLATVVIGNLVVSFMHR
73. BLAST at i5K
https://i5k.nal.usda.gov/blast
Example 73
>vgsc-‐Segment3-‐DomainII
RVFKLAKSWPTLNLLISIMGKTVGALGNLTFVLCIIIFIFAVMGMQLFGKNYTEKVTKFKWSQDG
QMPRWNFVDFFHSFMIVFRVLCGEWIESMWDCMYVGDFSCVPFFLATVVIGNLVVSFMHR
74. BLAST at i5K
https://i5k.nal.usda.gov/blast
Example 74
75. BLAST at i5K:
high-scoring segment pairs (hsp) in “BLAST+ Results” track
Example 75
77. Creating a new gene model: drag and drop
Example 77
• Web Apollo automatically calculates the longest open reading
frame (ORF). In this case, the ORF includes the high-scoring
segment pairs (hsp).
82. Editing: merge
Example 82
Merge
by
dropping
an
exon
or
gene
model
onto
another.
Merge
by
selecBng
two
exons
(holding
down
“Shi^”)
and
using
the
right
click
menu.
83. Editing: correct boundaries, delete exons
Example 83
Modify
exon
/
intron
boundary
by
dragging
the
end
of
the
exon
to
the
nearest
canonical
splice
site.
Delete
first
exon
from
M006233
84. Editing: set translation start, modify boundary
Example 84
Set
translaBon
start
Modify
intron
/
exon
boundary
(here
and
at
coord.
78,999)
85. Finished model
Example 85
Corroborate
integrity
and
accuracy
of
the
model:
-‐
Start
and
Stop
-‐
Exon
structure
and
splice
sites
…]5’-‐GT/AG-‐3’[…
-‐
Check
the
predicted
protein
product
on
NCBI
nr
86. Information Editor
• DBXRefs:
e.g.
NP_001128389.1,
N.
vitripennis,
RefSeq
• PubMed
idenBfier:
PMID:
24065824
• Gene
Ontology
IDs:
GO:0022843,
GO:
0042048,
GO:0035725,
GO:0001518.
• Comments.
• Name,
Symbol.
• Approve
/
Delete
radio
buNon.
Example 86
Comments
(if
applicable)
89. Exercises
Live
DemonstraBon
using
the
Apis
mellifera
genome.
89
1.
Evidence
in
support
of
protein
coding
gene
models.
1.1
Consensus
Gene
Sets:
Official
Gene
Set
v3.2
Official
Gene
Set
v1.0
1.2
Consensus
Gene
Sets
comparison:
OGSv3.2
genes
that
merge
OGSv1.0
and
RefSeq
genes
OGSv3.2
genes
that
split
OGSv1.0
and
RefSeq
genes
1.3
Protein
Coding
Gene
PredicDons
Supported
by
Biological
Evidence:
NCBI
Gnomon
Fgenesh++
with
RNASeq
training
data
Fgenesh++
without
RNASeq
training
data
NCBI
RefSeq
Protein
Coding
Genes
and
Low
Quality
Protein
Coding
Genes
1.4
Ab
ini&o
protein
coding
gene
predicDons:
Augustus
Set
12,
Augustus
Set
9,
Fgenesh,
GeneID,
N-‐SCAN,
SGP2
1.5
Transcript
Sequence
Alignment:
NCBI
ESTs,
Apis
cerana
RNA-‐Seq,
Forager
Bee
Brain
Illumina
ConBgs,
Nurse
Bee
Brain
Illumina
ConBgs,
Forager
RNA-‐Seq
reads,
Nurse
RNA-‐Seq
reads,
Abdomen
454
ConBgs,
Brain
and
Ovary
454
ConBgs,
Embryo
454
ConBgs,
Larvae
454
ConBgs,
Mixed
Antennae
454
ConBgs,
Ovary
454
ConBgs
Testes
454
ConBgs,
Forager
RNA-‐Seq
HeatMap,
Forager
RNA-‐Seq
XY
Plot,
Nurse
RNA-‐Seq
HeatMap,
Nurse
RNA-‐Seq
XY
Plot
Becoming Acquainted with Web Apollo.
90. Exercises (continued)
Live
DemonstraBon
using
the
Apis
mellifera
genome.
90
1.
Evidence
in
support
of
protein
coding
gene
models
(ConDnued).
1.6
Protein
homolog
alignment:
Acep_OGSv1.2
Aech_OGSv3.8
Cflo_OGSv3.3
Dmel_r5.42
Hsal_OGSv3.3
Lhum_OGSv1.2
Nvit_OGSv1.2
Nvit_OGSv2.0
Pbar_OGSv1.2
Sinv_OGSv2.2.3
Znev_OGSv2.1
Metazoa_Swissprot
2.
Evidence
in
support
of
non
protein
coding
gene
models
2.1
Non-‐protein
coding
gene
predicDons:
NCBI
RefSeq
Noncoding
RNA
NCBI
RefSeq
miRNA
2.2
Pseudogene
predicDons:
NCBI
RefSeq
Pseudogene
Becoming Acquainted with Web Apollo.
91. Web Apollo Workshop Instances
Demo
1:
hNp://genomes.missouri.edu:8080/Amel_4.5_demo_1
Demo
2:
hNp://genomes.missouri.edu:8080/Amel_4.5_demo_2
Workshop
DocumentaBon
can
be
found
at:
Basecamp
Web
Apollo
instance
for
Diaphorina
citri
hNps://apollo.nal.usda.gov/diacit/selectTrack.jsp
Register
for
i5K
Workspace@NAL
at:
hNps://i5k.nal.usda.gov/web-‐apollo-‐registraBon
92. FUTURE PLANS
interactive analysis and curation of variants
v InteracBve
exploraBon
of
VCF
files
(e.g.
from
GATK,
VAAST)
in
addiBon
to
BAM
and
GVF.
MulBple
tracks
in
one:
visualizaBon
of
geneBc
alteraBons
and
populaBon
frequency
of
variants.
WEB APOLLO 92
1
1
2
v Clinical
applicaBons:
analysis
of
Copy
Number
VariaBons
for
regulatory
effects;
overlaying
display
of
the
regulatory
domains.
Philips-‐Creminis
and
Corces.
2013.
Cell
50
(4):461-‐474
2
TADs:
topologically
associaBng
domains
93. FUTURE PLANS
educational tools
We
are
working
with
educators
to
make
Web
Apollo
part
of
their
curricula.
WEB APOLLO 93
Lecture
Series.
In
the
classroom.
At
the
lab.
Classroom
exercises:
from
genome
sequence
to
hypothesis.
CuraBon
group
dedicated
to
producing
educaBon
materials
for
non-‐model
organism
communiBes.
Our
team
provides
online
documentaBon,
hands-‐on
training,
and
rapid
response
to
users.
94. JOIN US
Footer 94
http://GenomeArchitect.org/
Please bring your suggestions,
requests, and contributions to:
Nathan Dunn
Apollo Technical Lead
Eric Yao
JBrowse, UC Berkeley
Deepak Unni
Colin Diesh
Apollo Developers
Elsik Lab, University of Missouri
Suzi Lewis
Principal Investigator
Berkeley
BOP
95. • Berkeley
BioinformaDcs
Open-‐source
Projects
(BBOP),
Berkeley
Lab:
Web
Apollo
and
Gene
Ontology
teams.
Suzanna
E.
Lewis
(PI).
• §
Chris&ne
G.
Elsik
(PI).
University
of
Missouri.
• *
Ian
Holmes
(PI).
University
of
California
Berkeley.
• Arthropod
genomics
community:
i5K
Steering
CommiNee
(esp.
Sue
Brown
(Kansas
State)),
Alexie
Papanicolaou
(UWS),
BGI,
Oliver
Niehuis
(1KITE
hNp://www.1kite.org/),
and
the
Honey
Bee
Genome
Sequencing
ConsorBum.
• Apollo
is
supported
by
NIH
grants
5R01GM080203
from
NIGMS,
and
5R01HG004483
from
NHGRI;
by
Contract
No.
60-‐8260-‐4-‐005
from
the
NaBonal
Agricultural
Library
(NAL)
at
the
United
States
Department
of
Agriculture
(USDA);
and
by
the
Director,
Office
of
Science,
Office
of
Basic
Energy
Sciences,
of
the
U.S.
Department
of
Energy
under
Contract
No.
DE-‐AC02-‐05CH11231.
• Insect
images
used
with
permission:
hNp://AlexanderWild.com
• For
your
aQenDon,
thank
you!
Thank you. 95
Web
Apollo
Nathan
Dunn
Colin
Diesh
§
Deepak
Unni
§
Gene
Ontology
Chris
Mungall
Seth
Carbon
Heiko
Dietze
BBOP
Web
Apollo:
hNp://GenomeArchitect.org
i5K:
hNp://arthropodgenomes.org/wiki/i5K
GO:
hNp://GeneOntology.org
Thanks!
NAL
at
USDA
Monica
Poelchau
Christopher
Childers
Gary
Moore
HGSC
at
BCM
fringy
Richards
Dan
Hughes
Kim
Worley
JBrowse
Eric
Yao
*