Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Apollo. It is addressed to the members of the American Chestnut & Chinese Chestnut Genomics research community.
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
Web Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Web Apollo. It is addressed to the members of the Phascolarctos cinereus research community.
Apollo: A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Apollo. It is addressed to the members of the Manakin Genomics research community.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project on Eurytemora affinis
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project working on species of the order Hemiptera.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
This is an introduction to conducting manual annotation efforts using Apollo. This webinar was offered to members of the i5K Research community on 2015-10-07.
Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Apollo. It is addressed to the members of the American Chestnut & Chinese Chestnut Genomics research community.
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
Web Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Web Apollo. It is addressed to the members of the Phascolarctos cinereus research community.
Apollo: A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
Apollo is a web-based, collaborative genomic annotation editing platform. We need annotation editing tools to modify and refine precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically.
This presentation is an introduction to how the manual annotation process takes place using Apollo. It is addressed to the members of the Manakin Genomics research community.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project on Eurytemora affinis
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project working on species of the order Hemiptera.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
This is an introduction to conducting manual annotation efforts using Apollo. This webinar was offered to members of the i5K Research community on 2015-10-07.
Introduction to Apollo: A webinar for the i5K Research CommunityMonica Munoz-Torres
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project Species.
BITS - Comparative genomics: gene family analysisBITS
This is the second presentation of the BITS training on 'Comparative genomics'.
It reviews the different methods of investigating sequence homology on the gene family level.
Thanks to Klaas Vandepoele of the PSB department.
This session will follow up from transcript quantification of RNAseq data and discusses statistical means of identifying differentially regulated transcripts, and isoforms and contrasts these against microarray analysis approaches.
In shotgun sequencing the genome is broken randomly into short fragments (1 to 2 kbp long) suitable for sequencing. The fragments are ligated into a suitable vector and then partially sequenced. Around 400–500 bp of sequence can be generated from each fragment in a single sequencing run. In some cases, both ends of a fragment are sequenced. Computerized searching for overlaps between individual sequences then assembles the complete sequence.
This presentation is a thorough guide to the use of Web Apollo, with details on User Navigation, Functionality, and the thought process behind manual annotation.
During this workshop, participants:
- Learn to identify homologs of known genes of interest in your newly sequenced genome.
- Become familiar with the environment and functionality of the Web Apollo genome annotation editing tool.
- Learn how to corroborate or modify automatically annotated gene models using all available evidence in Web Apollo.
- Understand the process of curation in the context of genome annotation.
Introduction to Apollo: A webinar for the i5K Research CommunityMonica Munoz-Torres
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project Species.
BITS - Comparative genomics: gene family analysisBITS
This is the second presentation of the BITS training on 'Comparative genomics'.
It reviews the different methods of investigating sequence homology on the gene family level.
Thanks to Klaas Vandepoele of the PSB department.
This session will follow up from transcript quantification of RNAseq data and discusses statistical means of identifying differentially regulated transcripts, and isoforms and contrasts these against microarray analysis approaches.
In shotgun sequencing the genome is broken randomly into short fragments (1 to 2 kbp long) suitable for sequencing. The fragments are ligated into a suitable vector and then partially sequenced. Around 400–500 bp of sequence can be generated from each fragment in a single sequencing run. In some cases, both ends of a fragment are sequenced. Computerized searching for overlaps between individual sequences then assembles the complete sequence.
This presentation is a thorough guide to the use of Web Apollo, with details on User Navigation, Functionality, and the thought process behind manual annotation.
During this workshop, participants:
- Learn to identify homologs of known genes of interest in your newly sequenced genome.
- Become familiar with the environment and functionality of the Web Apollo genome annotation editing tool.
- Learn how to corroborate or modify automatically annotated gene models using all available evidence in Web Apollo.
- Understand the process of curation in the context of genome annotation.
Talk at the 8th International Biocuration Conference. Beijing, China. April 23-26, 2015.
Obtaining meaningful results from genome analyses requires high quality annotations of all genomic elements. Today’s sequencing projects face challenges such as lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing curators to improve on existing automated gene models through an intuitive interface. Apollo’s extensible architecture is built on top of JBrowse; its components are a web-based client, an annotation-editing engine, and a server-side data service. It allows users to visualize automated gene models, protein alignments, expression and variant data, and conduct structural and/or functional annotations.
Apollo is actively used within a variety of projects, including the initiative to sequence the genomes of 5,000 Arthropod species (i5K), and will become essential to the thousands of genomes now being sequenced and analyzed. Researchers from nearly 100 institutions worldwide are currently using Apollo on distributed curation efforts for over sixty genome projects across the tree of life; from plants to echinoderms, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog. We are training the next generation of researchers by reaching out to educators to make these tools available as part of curricula, offering workshops and webinars to the scientific community, and through widely applied systems such as iPlant and DNA Subway. We are currently integrating Apollo into an annotation environment combining gene structural and functional annotation, transcriptomic, proteomic, and phenotypic annotation. In this presentation we will describe in detail its utility to users, introduce the architecture to developers interested in expanding on this open-source project, and offer details of our future plans.
Authors:
Monica Munoz-Torres(1), Nathan Dunn(1), Colin Diesh(2), Deepak Unni(2), Seth Carbon(1), Heiko Dietze(1), Christopher Mungall(1), Nicole Washington(1), Ian Holmes(3), Christine Elsik(2), and Suzanna E. Lewis(1)
1Lawrence Berkeley National Laboratory, Genomics Division, Berkeley, CA
2Divisions of Animal and Plant Sciences, University of Missouri, Columbia, MO
3University of California Berkeley, Bioengineering, Berkeley, CA
Continuing with the theme of DNA repair via homologous recombination, I will discuss the following family during the PAINT call:
PTHR13451 CLASS II CROSSOVER JUNCTION ENDONUCLEASE MUS81
Introduction to Apollo - i5k Research Community – Calanoida (copepod)Monica Munoz-Torres
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5k, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project working on species of the order Calanoida (copepod).
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
An introduction on gene annotation & curation for the IAGC and BIPAA research communities.
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcionalMonica Munoz-Torres
Esta presentación contiene información impartida durante el curso de Ontología de Genes en BIOS. Los temas de la charla incluyen una descripción de la estructura de la ontología, cómo se construyen los términos y porqué es necesario usar ontologías. También discutimos los análisis estadísticos de enriquecimiento y representación de términos. Los ejercicios son parte del entrenamiento del grupo de GOA en EMBL-EBI.
La presentación fue dada en Español.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
An introduction to use and functionality for the IAGC and BIPAA research communities.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
A Workshop at the Stowers Institute for Medical Research.
Two approaches (clone by clone & whole genome shotgun).
Types of DNA sequencing ( 1st, next and 3rd).
Crop genomes sequenced . (Example :Arabidopsis,Rice, Pigeon pea)
Genomic library and shotgun sequencing. It includes the topics about genomic library,construction method, its uses and applications, shotgun sequencing, difference between random and whole genome sequencing, its advantages and disadvantages etc.
The NCBI Boot Camp for Beginners was designed to offer an overview of the NCBI suite of resources. In the first half of the presentation, highlighted databases were covered in four main categories: literature, sequences, genes & genomes and expression & structure. The second half of the class used the apolipoprotein A as a query that was explored through many of the NCBI databases, from identifying the reference sequences to a structural analysis of the Cys130Arg variant.
1.introduction to genetic engineering and restriction enzymesGetachew Birhanu
An introduction to Genetic engineering
A short background and history of Genetic Engineering
Classification of DNA manipulating Enzymes, nomenclature
Restriction recognition sequences, the anatomy of a gene and the flow of genetic information
More emphasis is given for the essential DNA Manipulating Enzymes
Finally Restriction mapping (analysis)
Cambridge Pre-U Biology - 1.6 Genes and Protein Synthesis PART 1 Samplemrexham
This is a widescreen fully animated and editable PowerPoint presentation that covers the first half of section 1.6 of the Cambridge Pre-U Biology course.
It is 64 slides long and covers the following topics:
What is a gene?
How does the genetic code work?
Protein synthesis
The lac operon
Variation
Proteomics and genomics
The full PowerPoint can be downloaded from mrexham.com
This presentation contains details about the Apollo genome annotation editor functionality. It also includes a step-by-step example about curating a gene of interest.
This presentation explains the meaning of curation and includes an introduction to the Apollo genome annotation editing tool and its curation environment.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
Apollo annotation guidelines for i5k projects Diaphorina citriMonica Munoz-Torres
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
This is a brief update about the genome browser JBrowse and the genome annotation editor Apollo, addressed to the members of the Alliance of Genome Resources (AGR).
Learn more about JBrowse at jbrowse.org
Learn more about Apollo at GenomeArchitect.org
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...Monica Munoz-Torres
Manual curation is crucial to improving the quality of the annotations for a genome sequencing project. During this portion of the genome sequencing workflow, curators use a variety of experimental evidence to improve on automated predictions to more accurately represent the underlying biology.
Apollo is a web-based genome annotation editor that allows curators to manually revise and edit genomic elements. It provides a reporting structure for annotated genomic elements and an ‘Annotator Panel’ that allows users to quickly browse the genome and all available annotations. Users can manually edit the structure of a genomic element as well as add metadata, including references to other databases, adding functional assignments to genes and gene products with specific lookup support for Gene Ontology (GO) terms, as well as including references to published literature in support of these annotations.
Apollo is currently used in more than one hundred genome annotation projects around the world, ranging from the annotation of a single species to lineage-specific efforts supporting annotation for dozens of organisms at a time. Apollo enables collaborative, real-time curation (akin to Google Docs); researchers may restrict access to certain annotations depending on the role of users and groups within the community, as well as share tracks of evidence data with the public. Users are able to export their manual annotations via FASTA and GFF3 files, the Chado database schema, and web services. The news hot of the presses is that Apollo is now available for integration with Galaxy via Docker! This allows users to run analyses on their genome of interest, including a step of manual curation, all from the comfort of their installation of the versatile Galaxy platform.
Scientific research is inherently a collaborative task; in our case it is a dialog among different researchers to reach a shared understanding of the underlying biology. To facilitate this dialog we have developed two web-based annotation tools: Apollo (http://genomearchitect.org/), a genomic feature editor, designed to support structural annotation of gene models, and Noctua (http://noctua.berkeleybop.org/), a biological-process model builder designed for describing the functional roles of gene products. Here we wish to outline an inventory of essential requirements that, in our experience, enable an annotation tool to meet the needs of both professional biocurators as well as other members of the research community. Here are the general requirements, beyond specific functional requirements, that any annotation tool must satisfy.
Data Visualization And Annotation Workshop at Biocuration 2015Monica Munoz-Torres
8th International Biocuration Conference. Beijing, China. April 23-26, 2015.
Workshop 2: Data Visualization and Annotation.
Chairs: Rama Balakrishnan, Stanford University, USA and Monica Munoz-Torres, Lawrence Berkeley National Laboratory, USA
Explaining the most intricate biological processes often requires a degree of detail beyond the scope of equations and algorithms; in fact, most biological knowledge is represented visually as illustrations, graphs, and diagrams. Genomics data in particular require specialized forms of visualization to improve our understanding and increase our chances of extracting meaningful conclusions from our analyses. Furthermore, the heterogeneity and abundance of genomic data include widely varied sources, techniques for their obtention, and intrinsic experimental error. And even data obtained under similar conditions from two or more individuals are loaded with biological variation. So what is the best way to interpret the stories the data are telling us? Given the questions we wish to answer and the data we are generating, which tools would be most useful and effective? In this workshop we will explore the tools available for human interpretation of genomic data, specifically in the context of annotation.
Presentations and perspectives, panelists/presenters:
- Lorna Richardson, IGMM, University of Edinburgh, United Kingdom
- Justyna Szostak, PMI Research & Development, Switzerland
The workshop included a brief introduction to a landscape of tools available - as updated as the constantly changing field allows-, brief presentations chosen from abstract submissions and invited speakers, as well as ample discussion to capture the contributions and questions from attendants. In the end, we hope participants walked away with a toolset in hand that may benefit the progress of their own research.
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesMonica Munoz-Torres
Precise elucidation of the many different biological features encoded in a genome requires a careful curation process that involves reviewing all available evidence to allow researchers to resolve discrepancies and validate automated gene models, protein alignments, and other biological elements. Genome annotation is an inherently collaborative task; researchers only rarely work in isolation, turning to colleagues for second opinions and insights from those with expertise in particular domains and gene families.
The i5k initiative seeks to sequence the genomes of 5,000 insect and related arthropod species. The selected species are known to be important to worldwide agriculture, food safety, medicine, and energy production as well as many used as models in biology, those most abundant in world ecosystems, and representatives in every branch of the insect phylogeny in an effort to better understand arthropod evolution and phylogeny. Because computational genome analysis remains an imperfect art, each of these new genomes sequenced will require visualization and curation.
Apollo is an instantaneous, collaborative, genome annotation editor, and the new JavaScript based version allows researchers real-time interactivity, breaking down large amounts of data into manageable portions to mobilize groups of researchers with shared interests. The i5K is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process and Apollo is serving as the platform to empower this community. Here we offer details about this collaboration.
This presentation is a thorough guide to the use of Web Apollo, with details on User Navigation, Functionality, and the thought process behind manual annotation.
During this workshop, participants:
- Learn to identify homologs of known genes of interest in a newly sequenced genome.
- Become familiar with the environment and functionality of the Web Apollo genome annotation editing tool.
- Learn how to corroborate or modify automatically annotated gene models using available evidence in Web Apollo.
- Understand the process of curation in the context of genome annotation.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
Cancer cell metabolism: special Reference to Lactate PathwayAADYARAJPANDEY1
Normal Cell Metabolism:
Cellular respiration describes the series of steps that cells use to break down sugar and other chemicals to get the energy we need to function.
Energy is stored in the bonds of glucose and when glucose is broken down, much of that energy is released.
Cell utilize energy in the form of ATP.
The first step of respiration is called glycolysis. In a series of steps, glycolysis breaks glucose into two smaller molecules - a chemical called pyruvate. A small amount of ATP is formed during this process.
Most healthy cells continue the breakdown in a second process, called the Kreb's cycle. The Kreb's cycle allows cells to “burn” the pyruvates made in glycolysis to get more ATP.
The last step in the breakdown of glucose is called oxidative phosphorylation (Ox-Phos).
It takes place in specialized cell structures called mitochondria. This process produces a large amount of ATP. Importantly, cells need oxygen to complete oxidative phosphorylation.
If a cell completes only glycolysis, only 2 molecules of ATP are made per glucose. However, if the cell completes the entire respiration process (glycolysis - Kreb's - oxidative phosphorylation), about 36 molecules of ATP are created, giving it much more energy to use.
IN CANCER CELL:
Unlike healthy cells that "burn" the entire molecule of sugar to capture a large amount of energy as ATP, cancer cells are wasteful.
Cancer cells only partially break down sugar molecules. They overuse the first step of respiration, glycolysis. They frequently do not complete the second step, oxidative phosphorylation.
This results in only 2 molecules of ATP per each glucose molecule instead of the 36 or so ATPs healthy cells gain. As a result, cancer cells need to use a lot more sugar molecules to get enough energy to survive.
Unlike healthy cells that "burn" the entire molecule of sugar to capture a large amount of energy as ATP, cancer cells are wasteful.
Cancer cells only partially break down sugar molecules. They overuse the first step of respiration, glycolysis. They frequently do not complete the second step, oxidative phosphorylation.
This results in only 2 molecules of ATP per each glucose molecule instead of the 36 or so ATPs healthy cells gain. As a result, cancer cells need to use a lot more sugar molecules to get enough energy to survive.
introduction to WARBERG PHENOMENA:
WARBURG EFFECT Usually, cancer cells are highly glycolytic (glucose addiction) and take up more glucose than do normal cells from outside.
Otto Heinrich Warburg (; 8 October 1883 – 1 August 1970) In 1931 was awarded the Nobel Prize in Physiology for his "discovery of the nature and mode of action of the respiratory enzyme.
WARNBURG EFFECT : cancer cells under aerobic (well-oxygenated) conditions to metabolize glucose to lactate (aerobic glycolysis) is known as the Warburg effect. Warburg made the observation that tumor slices consume glucose and secrete lactate at a higher rate than normal tissues.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
Introduction:
RNA interference (RNAi) or Post-Transcriptional Gene Silencing (PTGS) is an important biological process for modulating eukaryotic gene expression.
It is highly conserved process of posttranscriptional gene silencing by which double stranded RNA (dsRNA) causes sequence-specific degradation of mRNA sequences.
dsRNA-induced gene silencing (RNAi) is reported in a wide range of eukaryotes ranging from worms, insects, mammals and plants.
This process mediates resistance to both endogenous parasitic and exogenous pathogenic nucleic acids, and regulates the expression of protein-coding genes.
What are small ncRNAs?
micro RNA (miRNA)
short interfering RNA (siRNA)
Properties of small non-coding RNA:
Involved in silencing mRNA transcripts.
Called “small” because they are usually only about 21-24 nucleotides long.
Synthesized by first cutting up longer precursor sequences (like the 61nt one that Lee discovered).
Silence an mRNA by base pairing with some sequence on the mRNA.
Discovery of siRNA?
The first small RNA:
In 1993 Rosalind Lee (Victor Ambros lab) was studying a non- coding gene in C. elegans, lin-4, that was involved in silencing of another gene, lin-14, at the appropriate time in the
development of the worm C. elegans.
Two small transcripts of lin-4 (22nt and 61nt) were found to be complementary to a sequence in the 3' UTR of lin-14.
Because lin-4 encoded no protein, she deduced that it must be these transcripts that are causing the silencing by RNA-RNA interactions.
Types of RNAi ( non coding RNA)
MiRNA
Length (23-25 nt)
Trans acting
Binds with target MRNA in mismatch
Translation inhibition
Si RNA
Length 21 nt.
Cis acting
Bind with target Mrna in perfect complementary sequence
Piwi-RNA
Length ; 25 to 36 nt.
Expressed in Germ Cells
Regulates trnasposomes activity
MECHANISM OF RNAI:
First the double-stranded RNA teams up with a protein complex named Dicer, which cuts the long RNA into short pieces.
Then another protein complex called RISC (RNA-induced silencing complex) discards one of the two RNA strands.
The RISC-docked, single-stranded RNA then pairs with the homologous mRNA and destroys it.
THE RISC COMPLEX:
RISC is large(>500kD) RNA multi- protein Binding complex which triggers MRNA degradation in response to MRNA
Unwinding of double stranded Si RNA by ATP independent Helicase
Active component of RISC is Ago proteins( ENDONUCLEASE) which cleave target MRNA.
DICER: endonuclease (RNase Family III)
Argonaute: Central Component of the RNA-Induced Silencing Complex (RISC)
One strand of the dsRNA produced by Dicer is retained in the RISC complex in association with Argonaute
ARGONAUTE PROTEIN :
1.PAZ(PIWI/Argonaute/ Zwille)- Recognition of target MRNA
2.PIWI (p-element induced wimpy Testis)- breaks Phosphodiester bond of mRNA.)RNAse H activity.
MiRNA:
The Double-stranded RNAs are naturally produced in eukaryotic cells during development, and they have a key role in regulating gene expression .
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
1. Editando anotaciones con Apollo
Un taller para la comunidad científica reunida en BIOS
Monica Munoz-Torres, PhD | @monimunozto
Berkeley Bioinformatics Open-Source Projects (BBOP)
Lawrence Berkeley National Laboratory |
University of California Berkeley | U.S. Department of Energy
BIOS, Manizales, Colombia | 21 Septiembre, 2015
2. APOLLO DEVELOPMENT
APOLLO DEVELOPERS 2
h" p://G e nom e Ar c hite c t. or g /
Nathan Dunn
Eric Yao
JBrowse, UC Berkeley
Christine Elsik’s Lab,
University of Missouri
Suzi Lewis
Principal Investigator
BBOP
Moni Munoz-Torres
Stephen Ficklin
GenSAS,
Washington State University
Colin DieshDeepak Unni
3. OUTLINE
Web
Apollo
Collabora(ve
Cura(on
and
Interac(ve
Analysis
of
Genomes
3OUTLINE
• Hoy
descubriremos
cómo
sortear
obstáculos
para
extraer
la
información
más
valiosa
en
un
proyectos
de
secuenciación
&
anotación
de
genomas.
4. 4
BY THE END OF THIS TALK
you will
v BeAer
understand
genome
cura(on
in
the
context
of
annota(on:
assembled
genome
à
automated
annota=on
à
manual
annota=on
v Become
familiar
with
the
environment
and
func(onality
of
the
Apollo
genome
annota(on
edi(ng
tool.
v Learn
to
iden(fy
homologs
of
known
genes
of
interest
in
a
newly
sequenced
genome.
v Learn
about
corrobora(ng
and
modifying
automa(cally
annotated
gene
models
using
available
evidence
in
Apollo.
Introduction
8. CURATING GENOMES
steps involved
1 Genera=on
of
Gene
Models
calling
ORFs,
one
or
more
rounds
of
gene
predic(on,
etc.
2 Annota=on
of
gene
models
Describing
func(on,
expression
paAerns,
metabolic
network
memberships.
3 Manual
annota=on
CURATING GENOMES 8
9. ANOTACION DE GENOMAS
requiere precisión y profundidad
Anotando Genomas 9
La
colección
de
genes
de
cada
organismo
informa
una
variedad
de
análisis:
• Número
de
genes,
%
GC,
composición
de
TEs,
áreas
repe((vas
• Asignar
función
• Evolución
molecular,
conservación
de
secuencias
• Familias
de
genes
• Caminos
metabólicos
• ¿Qué
hace
único
a
cada
organismo?
¿Qué
hace
“abeja”
a
una
abeja?
Marbach et al. 2011. Nature Methods | Shutterstock.com | Alexander Wild
11. REVIEW ON YOUR OWN
for manual annotation
To
remember…
Biological
concepts
to
beAer
understand
manual
annota(on
11FOOD FOR THOUGHT
• GLOSSARY
from
con1g
to
splice
site
• CENTRAL
DOGMA
in
molecular
biology
• WHAT
IS
A
GENE?
defining
your
goal
• TRANSCRIPTION
mRNA
in
detail
• TRANSLATION
and
other
defini(ons
• GENOME
CURATION
steps
involved
12. 12BIO-REFRESHER
What is a gene?
v The
defini(on
of
a
gene
paints
a
very
complex
picture
of
molecular
ac(vity
and
it
is
a
con(nuously
evolving
concept.
• From
the
Sequence
Ontology
(SO):
“A
gene
is
a
locatable
region
of
genomic
sequence,
corresponding
to
a
unit
of
inheritance,
which
is
associated
with
regulatory
regions,
transcribed
regions
and/or
other
func(onal
sequence
regions”.
“Evolving
Concept”
at
hAp://goo.gl/LpsajQ
13. 13BIO-REFRESHER
What is a gene?
v In
our
life(me,
the
Encyclopedia
of
DNA
Elements
(ENCODE)
project
updated
this
concept
yet
again.
Long
transcripts
&
dispersed
regula1on!
“A
gene
is
a
DNA
segment
that
contributes
phenotype/func(on.
In
the
absence
of
demonstrated
func(on,
a
gene
may
be
characterized
by
sequence,
transcrip(on
or
homology.”
https://www.encodeproject.org/
14. 14BIO-REFRESHER
What is a gene?
let’s think computationally!
v Think
of
the
genome
as
an
operating system for
a
living
being
• Considering
that
the
nucleo(des
of
the
genome
are
put
together
into
a
code
that
is
executed
through
the
process
of
transcription
and
translation…
• …
think
of
genes
as
subroutines
that
are
repe((vely
called
in
the
process
of
transcription
Gerstein et al., 2007. Genome Res.
15. 15BIO-REFRESHER
What is a gene?
considerations
v Also
consider
:
• A
gene
is
a
genomic
sequence
(DNA
or
RNA)
directly
encoding
func(onal
product
molecules,
either
RNA
or
protein.
• If
several
func(onal
products
share
overlapping
regions,
we
take
the
union
of
all
overlapping
genomics
sequences
coding
for
them.
• This
union
must
be
coherent
–
i.e.,
processed
separately
for
final
protein
and
RNA
products
–
but
does
not
require
that
all
products
necessarily
share
a
common
subsequence.
Gerstein et al., 2007. Genome Res.
16. 16BIO-REFRESHER
“El
gen
es
la
unión
de
secuencias
genómicas
que
codifican
una
colección
coherente
de
productos
funcionales
que
pueden
o
no
superponerse.”
Gerstein et al., 2007. Genome Res
El
Gen:
un
blanco
en
movimiento.
¿QUÉ ES UN GEN?
17. 17BIO-REFRESHER
TRANSLATION
reading frame
v Reading
frame
is
a
manner
of
dividing
the
sequence
of
nucleo(des
in
mRNA
(or
DNA)
into
a
set
of
consecu(ve,
non-‐overlapping
triplets
(codons).
v Three
frames
can
be
read
in
the
5’
à
3’
direc(on.
Given
that
DNA
has
two
an(-‐parallel
strands,
an
addi(onal
three
frames
are
possible
to
be
read
on
the
an(-‐sense
strand.
Six
total
possible
reading
frames
exist.
v In
eukaryotes,
only
one
reading
frame
per
sec(on
of
DNA
is
biologically
relevant
at
a
(me:
it
has
the
poten(al
to
be
transcribed
into
RNA
and
translated
into
protein.
This
is
called
the
OPEN
READING
FRAME
(ORF)
• ORF
=
Start
signal
+
coding
sequence
(divisible
by
3)
+
Stop
signal
v The
sec(ons
of
the
mature
mRNA
transcribed
with
the
coding
sequence
but
not
translated
are
called
UnTranslated
Regions
(UTR);
one
at
each
end.
20. 20BIO-REFRESHER
TRANSLATION
reading frame: splice sites
v The
spliceosome
catalyzes
the
removal
of
introns
and
the
liga(on
of
flanking
exons.
• introns:
spaces
inside
the
gene,
not
part
of
the
coding
sequence
• exons:
expression
units
(of
the
coding
sequence)
v Splicing
“signals”
(from
the
point
of
view
of
an
intron):
• There
is
a
5’
end
splice
“signal”
(site):
usually
GT
(less
common:
GC)
• And
a
3’
end
splice
site:
usually
AG
• …]5’-‐GT/AG-‐3’[…
v It
is
possible
to
produce
more
than
one
protein
(polypep(de)
sequence
from
the
same
genic
region,
by
alterna(vely
bringing
exons
together=
alterna=ve
splicing.
For
example,
the
gene
Dscam
(Drosophila)
has
38,000
alterna(vely
spliced
mRNAs
=
isoforms
21. 21
"Gene structure" by Daycd- Wikimedia Commons
BIO-REFRESHER
TRANSLATION
now in your mind
• Although
of
brief
existence,
understanding
mRNAs
is
crucial,
as
they
will
become
the
center
of
your
work.
22. 22
Text for figures goes here
BIO-REFRESHER
TRANSLATION
reading frame: phase
v Introns
can
interrupt
the
reading
frame
of
a
gene
by
inser(ng
a
sequence
between
two
consecu(ve
codons
v Between
the
first
and
second
nucleo(de
of
a
codon
v Or
between
the
second
and
third
nucleo(de
of
a
codon
"Exon and Intron classes”. Licensed under Fair use via Wikipedia
24. 24BIO-REFRESHER
HICCUPS
in transcription and translation
v The
presence
of
premature
Stop
codons
in
the
message
is
possible.
A
process
called
non-‐sense
mediated
decay
checks
for
them
and
corrects
them
to
avoid:
incomplete
splicing,
DNA
muta(ons,
transcrip(on
errors,
and
leaky
scanning
of
ribosome
–
causing
changes
in
the
reading
frame
(frame
shiYs).
v Inser(ons
and
dele(ons
(indels)
can
cause
frame
shios,
when
indel
is
not
divisible
by
three
(3).
As
a
result,
the
pep(de
can
be
abnormally
long,
or
abnormally
short
–
depending
when
the
first
in-‐frame
Stop
signal
is
located.
26. 26Gene Prediction
GENE PREDICTION
v The
iden(fica(on
of
structural
features
of
the
genome:
• Primarily
focused
on
protein-‐coding
genes.
• Predicts
also
transfer
RNAs
(tRNA),
ribosomal
RNAs
(rRNA),
regulatory
mo(fs,
long
and
small
non-‐coding
RNAs
(ncRNA),
repe((ve
elements
(masked),
etc.
• Two
methods
for
iden(fica(on.
• Some
are
self-‐trained
and
some
must
be
trained.
27. 27Gene Prediction
GENE PREDICTION
methods for discovery
1)
Ab
ini,o:
-‐
based
on
DNA
composi(on,
-‐
deals
strictly
with
genomic
sequences
-‐
makes
use
of
sta(s(cal
approaches
to
search
for
coding
regions
and
typical
gene
signals.
• E.g.
Augustus,
GENSCAN,
geneid,
fgenesh,
etc.
3’
Nat Rev Genet. 2015 Jun;16(6):321-32. doi: 10.1038/nrg3920
28. 28
Nucleic Acids 2003 vol. 31 no. 13 3738-3741
Gene Prediction
GENE PREDICTION
methods for discovery (ctd)
2)
Homology-‐based:
-‐
evidence-‐based,
-‐
finds
genes
using
either
similarity
searches
in
the
main
databases
or
experimental
data
including
RNAseq,
expressed
sequence
tags
(ESTs),
full-‐length
complementary
DNAs
(cDNAs),
etc.
• E.g:
fgenesh++,
Just
Annotate
My
genome
(JAMg),
SGP2
29. 29
GENE ANNOTATION
Integra(on
of
data
from
computa(onal
&
experimental
evidence
with
data
from
predic(on
tools,
to
generate
a
reliable
set
of
structural
annota=ons.
Involves:
1)
ab
ini1o
predic(ons
2)
assessment
of
biological
evidence
to
drive
the
gene
predic(on
process
3)
synthesis
of
these
results
to
produce
a
set
of
consensus
gene
models
Gene Annotation
30. 30
In
some
cases
algorithms
and
metrics
used
to
generate
consensus
sets
may
actually
reduce
the
accuracy
of
the
gene’s
representa(on.
GENE ANNOTATION
Gene
models
may
be
organized
into
“sets”
using:
v automa(c
integra(on
of
predicted
sets
(combiners);
e.g:
GLEAN,
EvidenceModeler
or
v tools
packaged
into
pipelines;
e.g:
MAKER,
PASA,
Gnomon,
Ensembl,
etc.
Gene Annotation
31. ANOTACION
un arte imperfecto
No one is perfect, least of all automated annotation. 31
Nuevas
tecnologías
traen
nuevos
retos:
• Errores
en
el
ensamblaje
pueden
causar
fragmentación
en
las
anotaciones
• Cobertura
limitada
dificulta
la
iden(ficación
con
certeza
Image: www.BroadInstitute.org
32. ANOTACION MANUAL
mejorando predicciones
Schiex
et
al.
Nucleic
Acids
2003
(31)
13:
3738-‐3741
Predicciones Automatizadas
Evidencia Experimental
Manual Annotation – to the rescue. 32
cDNAs,
búsquedas
con
HMM,
RNAseq,
genes
de
otras
especies.
Entonces,
es
necesario
refinar
las
predicciones
de
elementos
biológicos
codificados
en
el
genoma,
lo
que
requiere
una
cuidadosa
revisión.
33. 33
BIOCURACION
ajustes estructurales y funcionales
Iden(ficar
los
elementos
del
genoma
que
mejor
representan
la
biología
subyacente
y
eliminar
los
elementos
que
reflejan
errores
sistémicos
de
los
análisis
automa(zados.
Asignar
funciones
a
través
de
análisis
compara(vos
entre
elementos
genómicos
similares
de
organismos
cercanamente
relacionados
usando
literatura,
bases
de
datos,
y
datos
experimentales.
BIOCURACION
hAp://GeneOntology.org
1
2
34. MANUAL ANNOTATION 34
PERO, EN CURACION
no siempre era posible ampliar estos esfuerzos
Researchers
on
their
own;
may
or
may
not
publicize
results;
may
be
a
dead-‐end
with
very
few
people
ever
aware
of
these
results.
Elsik
et
al.
2006.
Genome
Res.
16(11):1329-‐33.
Too
many
sequences
and
not
enough
hands.
A
small
group
of
highly
trained
experts
(e.g.
GO).
1
Museum
A
few
very
good
biologists,
a
few
very
good
bioinforma(cians
camping
together
for
intense
but
short
periods
of
(me.
Jamboree
2
Co"age
3
35. ANOTACION
un ejercicio en colaboración
COLABORANDO 35
Los
inves1gadores
usualmente
buscamos
las
opiniones
y
percepciones
de
colegas
con
experiencia
en
áreas
específicas
del
conocimiento.
Por
ejemplo,
dominios
conservados
o
familias
de
genes.
36. Apollo
una herramienta para editar anotaciones
36
v En
la
web,
integrado
con
JBrowse.
v ¡Permite
la
colaboración
en
(empo
real!
v Automá(camente
genera
datos
en
formatos
comunes
para
análisis.
v Anotación
manual
de
genes,
pseudogenes,
tRNAs,
snRNAs,
snoRNAs,
ncRNAs,
miRNAs,
TEs,
y
fragmentos
repe((vos.
v Funciones
intui(vas
y
menús
desplegables
crean
y
editan
estructuras
de
transcritos
y
exones,
insertan
comentarios
(CV,
texto
libre),
y
términos
de
GO,
etc.
INTRODUCING APOLLO
hAp://GenomeArchitect.org/
37. ARQUITECTURA
simple, flexible
ARCHITECTURE 37
Cliente
de
web
+
Motor
de
edición
de
anotaciones
+
Servicio
de
datos
en
el
servidor
REST / JSON
Websockets
Motor de Anotación (Servidor)
Shiro
LDAP
OAuth
Annotations
Security
Preferences
Organisms
Tracks
BAM
BED
VCF
GFF3
BigWig
Curadores
Google Web Toolkit (GWT) /
Bootstrap
JBrowse DOJO / jQuery Datos a JBrowse
Organismo 1
Carga de datos con
evidencia genómica
para cada organismo
Servicio único de almacenamiento
PostgreSQL, MySQL, MongoDB,
ElasticSearch
Apollo v2.0
Datos a JBrowse
Organismo 2
38. CLIENTE DE WEB
panel del curador
ARCHITECTURE 38
Motor de Anotación (Servidor)
Curadores
Google Web Toolkit (GWT) / Bootstrap
JBrowse
DOJO /
jQuery
Apollo v2.0
BAM
BED
VCF
GFF3
BigWig
REST / JSON
Websockets
Usa GWT/Bootstrap en el
frente para proveerle
un comportamiento
versátil a la aplicación.
Panel Del Curador
¡NUEVO!
¡NUEVO!
39. MOTOR DE ANOTACION
lógica de edición
ARCHITECTURE 39
Motor de Anotación (Servidor)
Shiro
LDAP
OAuth
Datos a JBrowse
Organismo 2
Datos a JBrowse
Organismo 1
Servicio único de almacenamiento
Apollo v2.0
Controladores Grails (J2EE servlet)
llevan las solicitudes al directorio
de datos apropiado para cada
organismo en JBrowse
Carga de datos con evidencia
genómica para cada organismo
¡NUEVO!
Cliente de web
REST / JSON
Websockets
40. SERVICIO DE DATOS EN EL SERVIDOR
servicio único de almacenamiento
ARCHITECTURE 40
Anotaciones
Seguridad
Preferencias
Organismos
Pistas de datos
Servicio único de almacenamiento
PostgreSQL, MySQL, MongoDB,
ElasticSearch
Motor de Anotación (Servidor)
Un solo servicio de almacenamiento,
consultable, para guardar las
anotaciones. ¡NUEVO!
Apollo v2.0
41. ¡COLABOREMOS!
Apollo tiene código abierto y es expandible
HIGHLIGHTED IMPROVEMENTS 41
The Genome Sequence Annotation Server (GenSAS)
Annotate
Los
usuarios
pueden
adicionar
programas
para
permi=r
sus
propios
procesos
de
trabajo.
Ejemplos:
• GenSAS:
plataforma
para
anotación
estructural
del
genoma.
• i5K:
-‐
Espacio
en
NAL
para
compar(r
ensamblajes
y
conjuntos
de
genes,
y
para
anotación
manual.
-‐
Proyecto
Piloto
>40
genomas:
47
charlas,
9
posters
en
Simposio
de
Genómica
de
Artrópodos.
Annotate
National
Agricultural
Library
42. We
train
and
support
hundreds
of
geographically
dispersed
scien(sts
from
diverse
research
communi(es
to
conduct
manual
annota(ons,
to
recover
coding
sequences
in
agreement
with
all
available
biological
evidence
using
Apollo.
v Gate
keeping
and
monitoring.
v Tutorials,
training
workshops,
and
“geneborees”.
42
DISPERSED COMMUNITIES
collaborative manual annotation efforts
APOLLO
43. LESSONS LEARNED
What
we
have
learned:
• Collabora(ve
work
dis(lls
invaluable
knowledge
• We
must
enforce
strict
rules
and
formats
• We
must
evolve
with
the
data
• A
liAle
training
goes
a
long
way
• NGS
poses
addi(onal
challenges
LESSONS LEARNED 43
45. Becoming Acquainted with Web Apollo
45 | 45
GENERAL PROCESS OF CURATION
main steps to remember
1. Select
or
find
a
region
of
interest,
e.g.
scaffold.
2. Select
appropriate
evidence
tracks
to
review
the
gene
model.
3. Determine
whether
a
feature
in
an
exis(ng
evidence
track
will
provide
a
reasonable
gene
model
to
start
working.
4. If
necessary,
adjust
the
gene
model.
5. Check
your
edited
gene
model
for
integrity
and
accuracy
by
comparing
it
with
available
homologs.
6. Comment
and
finish.
46. 46CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
annotators: that’s you!
v Annota=ng
a
simple
case:
WHEN
“The
official
predic(on
is
correct,
or
nearly
correct,
assuming
that
no
aligned
data
extends
beyond
the
gene
model
and
if
so,
it
is
not
likely
to
be
coding
sequence,
and/or
the
gene
predic(on
matches
what
you
know
about
the
gene”:
a. Can
you
add
UTRs?
b. Check
exon
structures.
c. Check
splice
sites:
…]5’-‐GT/AG-‐3’[…
d. Check
‘start’
and
‘stop’
sites.
e. Check
the
predicted
protein
product(s).
f. If
the
protein
product
s(ll
does
not
look
correct,
go
on
to
“Annota(ng
more
complex
cases”.
47. 47CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
continued
v Addi=onal
func=onality.
You
may
also
need
to
learn
how
to:
a. Get
genomic
sequence
b. Merge
exons
c. Add/Delete
an
exon
d. Create
an
exon
de
novo
(within
an
intron
or
outside
exis(ng
annota(ons).
e. Right/apple-‐click
on
a
feature
to
get
feature
ID
and
addi(onal
informa(on
f. Looking
up
homolog
descrip(ons
going
to
the
accession
web
page
at
UniProt/Swissprot
48. 48CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
continued
v Annota=ng
more
complex
cases:
a. Incomplete
annota(on:
protein
integrity
checks,
indicate
gaps,
missing
5’
sequences
or
missing
3’
sequences.
b. Merge
of
2
gene
predic(ons
on
same
scaffold
c. Merge
of
2
gene
predic(ons
on
different
scaffolds
(uh-‐oh!).
d. Split
of
a
gene
predic(on
e. Frameshios,
Selenocysteine,
single-‐base
errors,
and
other
inconvenient
phenomena
49. 49CURATING GENOMES
WHAT ANNOTATORS SHOULD LOOK FOR
continued
v Adding
important
project
informa=on
in
the
form
of
Canned
and/or
Customized
Comments:
a. NCBI
ID,
RefSeq
ID,
gene
symbol(s),
common
name(s),
synonyms,
top
BLAST
hits
(GenBank
IDs),
orthologs
with
species
names,
and
anything
else
you
can
think
of,
because
you
are
the
expert.
b. Type
of
annota(on
(e.g.:
whether
or
not
the
gene
model
was
changed)
c. Data
source
(for
example
if
the
Fgeneshpp
predicted
gene
was
the
star(ng
point
for
your
annota(on)
d. The
kinds
of
changes
you
made
to
the
gene
model,
e.g.:
split,
merge
e. Func(onal
descrip(on
f. Whether
you
would
like
for
your
MOD
curator
to
check
the
annota(on
g. Whether
part
of
your
gene
is
on
a
different
scaffold.
50. 50
TRAINING CURATORS
a little training goes a long way!
Provided
with
adequate
tools,
wet
lab
scien(sts
make
excep(onal
curators
who
can
easily
learn
to
maximize
the
genera(on
of
accurate,
biologically
supported
gene
models.
APOLLO
52. 52
Apollo
ámbito de edición para anotaciones
BECOMING ACQUAINTED WITH APOLLO
Color
por
marco
de
lectura,
alternar
cadena,
cambiar
esquema
de
color,
resaltador
Cargar
evidencia
experimental
(GFF3,
BAM,
BigWig),
pistas
de
datos
de
combinación
y
búsqueda.
Interrogar
el
genoma
usando
BLAT.
Navegación
y
zoom.
Buscar
un
gen
o
un
grupo
Obtener
coordenadas,
y
hacer
zoom
con
“selección
elás(ca”
Login
Anotaciones
creadas
por
el
usuario
Panel
del
curador
Pistas
de
datos
de
evidencia
Datos
transcriptómicos
de
estadío
y
(po
celular
específicos.
56. 56
Apollo
funcionalidad
BECOMING ACQUAINTED WITH APOLLO
• Agregar:
– IDs
de
bases
de
datos
públicas
(e.g.
GenBank,
usando
DBXRef);
símbolo(s)
de
cada
gen,
nombre(s)
común(es),
sinónimos,
el
mejor
resultado
de
BLAST,
ortólogos
con
el
nombre
de
la
especie.
– Asignaciones
de
función
apropiadas
(e.g.
via
datos
de
RNA-‐Seq,
búsquedas
de
literatura,
búsquedas
con
HMMs,
etc.)
– Comentarios
acerca
de
las
modificaciones
que
se
realizaron,
o
si
ninguna
fue
necesaria.
– Y
otras
notas
que
se
le
ocurran
al
biocurador.
• Corregir
si(os
de
‘Inicio’
y
‘Parada’
• Arreglar
si(os
de
ayuste
no
canónicos
• Anotar
UTRs
(e.g.:
usando
RNA-‐Seq)
• Obtener
&
corregir
predicciones
de
productos
de
proteínas
-‐
Alinearlos
con
genes
o
familias
de
genes
relevantes.
-‐
Usar
blastp
en
RefSeq
o
nr
de
NCBI
• Revisar
la
falta
de
datos
en
el
ensamble
• Unir
2
predicciones
de
genes
en
el
mismo
grupo
• Dividir
una
predicción
de
gen
• Corregir
desplazamientos
de
la
pauta
de
lectura,
y
otros
errores
en
el
ensamblaje
• Anotar
selenocisteínas,
errores
de
una
sola
base,
etc.
57. REMOVABLE SIDE DOCK
with customizable tabs
HIGHLIGHTED IMPROVEMENTS 57
Annotations Organism Users Groups AdminTracks
Reference
Sequence
60. 60 | 60
Becoming Acquainted with Web Apollo.
USER NAVIGATION
Annotator
panel.
• Choose appropriate evidence tracks from list on annotator panel.
• Select & drag elements from evidence track into the ‘User-created Annotations’ area.
• Hovering over annotation in progress brings up an information pop-up.
61. 61 | 61
USER NAVIGATION
Becoming Acquainted with Web Apollo.
• Annotation right-click menu
62. 62
Annota(ons,
annota(on
edits,
and
History:
stored
in
a
centralized
database.
62
USER NAVIGATION
Becoming Acquainted with Web Apollo.
63. 63
The
Annota(on
Informa=on
Editor
DBXRefs
are
database
crossed
references:
if
you
have
reason
to
believe
that
this
gene
is
linked
to
a
gene
in
a
public
database
(including
your
own),
then
add
it
here.
63
USER NAVIGATION
Becoming Acquainted with Web Apollo.
64. 64
The
Annota(on
Informa=on
Editor
• Add
PubMed
IDs
• Include
GO
terms
as
appropriate
from
any
of
the
three
ontologies
• Write
comments
sta(ng
how
you
have
validated
each
model.
64
USER NAVIGATION
Becoming Acquainted with Web Apollo.
65. 65 | 65
USER NAVIGATION
Becoming Acquainted with Web Apollo.
• ‘Zoom
to
base
level’
op(on
reveals
the
DNA
Track.
66. 66 | 66
USER NAVIGATION
Becoming Acquainted with Web Apollo.
• Color
exons
by
CDS
from
the
‘View’
menu.
67. 67 |
Zoom
in/out
with
keyboard:
shio
+
arrow
keys
up/down
67
USER NAVIGATION
Becoming Acquainted with Web Apollo.
• Toggle
reference
DNA
sequence
and
transla=on
frames
in
forward
strand.
Toggle
models
in
either
direc(on.
70. “Simple
case”:
-‐
the
predicted
gene
model
is
correct
or
nearly
correct,
and
-‐
this
model
is
supported
by
evidence
that
completely
or
mostly
agrees
with
the
predic(on.
-‐
evidence
that
extends
beyond
the
predicted
model
is
assumed
to
be
non-‐coding
sequence.
The
following
are
simple
modifica(ons.
70 | 70
ANNOTATING SIMPLE CASES
Becoming Acquainted with Web Apollo. SIMPLE CASES
71. 71 |
• A
confirma(on
box
will
warn
you
if
the
receiving
transcript
is
not
on
the
same
strand
as
the
feature
where
the
new
exon
originated.
• Check
‘Start’
and
‘Stop’
signals
aoer
each
edit.
71
ADDING EXONS
Becoming Acquainted with Web Apollo. SIMPLE CASES
72. If
transcript
alignment
data
are
available
and
extend
beyond
your
original
annota(on,
you
may
extend
or
add
UTRs.
1. Right
click
at
the
exon
edge
and
‘Zoom
to
base
level’.
2. Place
the
cursor
over
the
edge
of
the
exon
un1l
it
becomes
a
black
arrow
then
click
and
drag
the
edge
of
the
exon
to
the
new
coordinate
posi(on
that
includes
the
UTR.
72 |
To
add
a
new
spliced
UTR
to
an
exis(ng
annota(on
follow
the
procedure
for
adding
an
exon.
72
ADDING UTRs
Becoming Acquainted with Web Apollo. SIMPLE CASES
73. 1. Zoom
in
to
clearly
resolve
each
exon
as
a
dis(nct
rectangle.
2. Two
exons
from
different
tracks
sharing
the
same
start
and/or
end
coordinates
will
display
a
red
bar
to
indicate
matching
edges.
3. Selec(ng
the
whole
annota(on
or
one
exon
at
a
(me,
use
this
‘edge-‐
matching’
func(on
and
scroll
along
the
length
of
the
annota(on,
verifying
exon
boundaries
against
available
data.
Use
square
[
]
brackets
to
scroll
from
exon
to
exon.
4. Check
if
cDNA
/
RNAseq
reads
lack
one
or
more
of
the
annotated
exons
or
include
addi(onal
exons.
73 | 73
CHECK EXON INTEGRITY
Becoming Acquainted with Web Apollo. SIMPLE CASES
74. To
modify
an
exon
boundary
and
match
data
in
the
evidence
tracks:
select
both
the
offending
exon
and
the
feature
with
the
expected
boundary,
then
right
click
on
the
annota(on
to
select
‘Set
3’
end’
or
‘Set
5’
end’
as
appropriate.
74 |
In
some
cases
all
the
data
may
disagree
with
the
annota(on,
in
other
cases
some
data
support
the
annota(on
and
some
of
the
data
support
one
or
more
alterna(ve
transcripts.
Try
to
annotate
as
many
alterna(ve
transcripts
as
are
well
supported
by
the
data.
74
EXON STRUCTURE INTEGRITY
Becoming Acquainted with Web Apollo. SIMPLE CASES
75. Flags
non-‐canonical
splice
sites.
Selec(on
of
features
and
sub-‐
features
Edge-‐matching
Evidence
Tracks
Area
‘User-‐created
Annota(ons’
Track
Apollo’s
edi(ng
logic
(brain):
§ selects
longest
ORF
as
CDS
§ flags
non-‐canonical
splice
sites
75
ORFs AND SPLICE SITES
Becoming Acquainted with Web Apollo. SIMPLE CASES
76. 76 |
Exon/intron
junc(on
possible
error
Original
model
Curated
model
Non-‐canonical
splices
are
indicated
by
an
orange
circle
with
a
white
exclama(on
point
inside,
placed
over
the
edge
of
the
offending
exon.
Canonical
splice
sites:
3’-‐…exon]GA
/
TG[exon…-‐5’
5’-‐…exon]GT
/
AG[exon…-‐3’
reverse
strand,
not
reverse-‐complemented:
forward
strand
76
SPLICE SITES
Becoming Acquainted with Web Apollo. SIMPLE CASES
Zoom
to
review
non-‐canonical
splice
site
warnings.
Although
these
may
not
always
have
to
be
corrected
(e.g
GC
donor),
they
should
be
flagged
with
the
appropriate
comment.
77. Web
Apollo
calculates
the
longest
possible
open
reading
frame
(ORF)
that
includes
canonical
‘Start’
and
‘Stop’
signals
within
the
predicted
exons.
If
‘Start’
appears
to
be
incorrect,
modify
it
by
selec(ng
an
in-‐frame
‘Start’
codon
further
up
or
downstream,
depending
on
evidence
(protein
database,
addi(onal
evidence
tracks).
It
may
be
present
outside
the
predicted
gene
model,
within
a
region
supported
by
another
evidence
track.
In
very
rare
cases,
the
actual
‘Start’
codon
may
be
non-‐canonical
(non-‐ATG).
77 | 77
‘START’ AND ‘STOP’ SITES
Becoming Acquainted with Web Apollo. SIMPLE CASES
79. Evidence
may
support
joining
two
or
more
different
gene
models.
Warning:
protein
alignments
may
have
incorrect
splice
sites
and
lack
non-‐conserved
regions!
1. In
‘User-‐created
Annota=ons’
area
shio-‐click
to
select
an
intron
from
each
gene
model
and
right
click
to
select
the
‘Merge’
op(on
from
the
menu.
2. Drag
suppor(ng
evidence
tracks
over
the
candidate
models
to
corroborate
overlap,
or
review
edge
matching
and
coverage
across
models.
3. Check
the
resul(ng
transla(on
by
querying
a
protein
database
e.g.
UniProt.
Add
comments
to
record
that
this
annota(on
is
the
result
of
a
merge.
79 | 79
Red
lines
around
exons:
‘edge-‐matching’
allows
annotators
to
confirm
whether
the
evidence
is
in
agreement
without
examining
each
exon
at
the
base
level.
COMPLEX CASES
merge two gene predictions on the same scaffold
Becoming Acquainted with Web Apollo. COMPLEX CASES
80. One
or
more
splits
may
be
recommended
when:
-‐
different
segments
of
the
predicted
protein
align
to
two
or
more
different
gene
families
-‐
predicted
protein
doesn’t
align
to
known
proteins
over
its
en(re
length
Transcript
data
may
support
a
split,
but
first
verify
whether
they
are
alterna(ve
transcripts.
80 | 80
COMPLEX CASES
split a gene prediction
Becoming Acquainted with Web Apollo. COMPLEX CASES
81. DNA
Track
‘User-‐created
Annota=ons’
Track
81
COMPLEX CASES
correcting frameshifts and single-base errors
Becoming Acquainted with Web Apollo. COMPLEX CASES
Always
remember:
when
annota(ng
gene
models
using
Apollo,
you
are
looking
at
a
‘frozen’
version
of
the
genome
assembly
and
you
will
not
be
able
to
modify
the
assembly
itself.
84. 1. Apollo
allows
annotators
to
make
single
base
modifica(ons
or
frameshios
that
are
reflected
in
the
sequence
and
structure
of
any
transcripts
overlapping
the
modifica(on.
These
manipula(ons
do
NOT
change
the
underlying
genomic
sequence.
2. If
you
determine
that
you
need
to
make
one
of
these
changes,
zoom
in
to
the
nucleo(de
level
and
right
click
over
a
single
nucleo(de
on
the
genomic
sequence
to
access
a
menu
that
provides
op(ons
for
crea(ng
inser(ons,
dele(ons
or
subs(tu(ons.
3. The
‘Create
Genomic
Inser=on’
feature
will
require
you
to
enter
the
necessary
string
of
nucleo(de
residues
that
will
be
inserted
to
the
right
of
the
cursor’s
current
loca(on.
The
‘Create
Genomic
Dele=on’
op(on
will
require
you
to
enter
the
length
of
the
dele(on,
star(ng
with
the
nucleo(de
where
the
cursor
is
posi(oned.
The
‘Create
Genomic
Subs=tu=on’
feature
asks
for
the
string
of
nucleo(de
residues
that
will
replace
the
ones
on
the
DNA
track.
4. Once
you
have
entered
the
modifica(ons,
Apollo
will
recalculate
the
corrected
transcript
and
protein
sequences,
which
will
appear
when
you
use
the
right-‐click
menu
‘Get
Sequence’
op(on.
Since
the
underlying
genomic
sequence
is
reflected
in
all
annota(ons
that
include
the
modified
region
you
should
alert
the
curators
of
your
organisms
database
using
the
‘Comments’
sec(on
to
report
the
CDS
edits.
5. In
special
cases
such
as
selenocysteine
containing
proteins
(read-‐throughs),
right-‐click
over
the
offending/premature
‘Stop’
signal
and
choose
the
‘Set
readthrough
stop
codon’
op(on
from
the
menu.
84 | 84
Becoming Acquainted with Web Apollo. COMPLEX CASES
COMPLEX CASES
correcting frameshifts, single-base errors, and selenocysteines
85. Follow
the
checklist
un(l
you
are
happy
with
the
annota(on!
And
remember
to…
– comment
to
validate
your
annota(on,
even
if
you
made
no
changes
to
an
exis(ng
model.
Think
of
comments
as
your
vote
of
confidence.
– or
add
a
comment
to
inform
the
community
of
unresolved
issues
you
think
this
model
may
have.
85 | 85
Always
Remember:
Web
Apollo
cura(on
is
a
community
effort
so
please
use
comments
to
communicate
the
reasons
for
your
annota(on
(your
comments
will
be
visible
to
everyone).
COMPLETING THE ANNOTATION
Becoming Acquainted with Web Apollo.
87. 1. Can
you
add
UTRs
(e.g.:
via
RNA-‐Seq)?
2. Check
exon
structures
3. Check
splice
sites:
most
splice
sites
display
these
residues
…]5’-‐GT/AG-‐3’[…
4. Check
‘Start’
and
‘Stop’
sites
5. Check
the
predicted
protein
product(s)
– Align
it
against
relevant
genes/gene
family.
– blastp
against
NCBI’s
RefSeq
or
nr
6. If
the
protein
product
s(ll
does
not
look
correct
then
check:
– Are
there
gaps
in
the
genome?
– Merge
of
2
gene
predic(ons
on
the
same
scaffold
– Merge
of
2
gene
predic(ons
from
different
scaffolds
– Split
a
gene
predic(on
– FrameshiYs
– error
in
the
genome
assembly?
– Selenocysteines,
single-‐base
errors,
etc
87 | 87
7. Finalize
annota(on
by
adding:
– Important
project
informa(on
in
the
form
of
comments
– IDs
from
public
databases
e.g.
GenBank
(via
DBXRef),
gene
symbol(s),
common
name(s),
synonyms,
top
BLAST
hits,
orthologs
with
species
names,
and
everything
else
you
can
think
of,
because
you
are
the
expert.
– Whether
your
model
replaces
one
or
more
models
from
the
official
gene
set
(so
it
can
be
deleted).
– The
kinds
of
changes
you
made
to
the
gene
model
of
interest,
if
any.
– Any
appropriate
func(onal
assignments
of
interest
to
the
community
(e.g.
via
BLAST,
RNA-‐Seq
data,
literature
searches,
etc.)
THE CHECKLIST
for accuracy and integrity
MANUAL ANNOTATION CHECKLIST
89. Example
Example 89
A
public
Apollo
Demo
using
the
Honey
Bee
genome
is
available
at
hAp://genomearchitect.org/WebApolloDemo
-‐
Demonstra(on
using
the
Hyalella
azteca
genome
(amphipod
crustacean).
90. What do we know about this genome?
• Currently
publicly
available
data
at
NCBI:
• >37,000
nucleo(de
seqsà
scaffolds,
mitochondrial
genes
• 300
amino
acid
seqsà
mitochondrion
• 53
ESTs
• 0
conserved
domains
iden(fied
• 0
“gene”
entries
submiAed
• Data
at
i5K
Workspace@NAL
(annota(on
hosted
at
USDA)
-‐
10,832
scaffolds:
23,288
transcripts:
12,906
proteins
Example 90
92. PubMed Search: what’s new?
Example 92
“Ten
popula(ons
(3
cultures,
7
from
California
water
bodies)
differed
by
at
least
550-‐fold
in
sensi=vity
to
pyrethroids.”
“By
sequencing
the
primary
pyrethroid
target
site,
the
voltage-‐gated
sodium
channel
(vgsc),
we
show
that
point
muta(ons
and
their
spread
in
natural
popula(ons
were
responsible
for
differences
in
pyrethroid
sensi(vity.”
“The
finding
that
a
non-‐target
aqua(c
species
has
acquired
resistance
to
pes(cides
used
only
on
terrestrial
pests
is
troubling
evidence
of
the
impact
of
chronic
pes=cide
transport
from
land-‐based
applica(ons
into
aqua(c
systems.”
93. How many sequences for our gene of
interest?
Example 93
• Para,
(voltage-‐gated
sodium
channel
alpha
subunit;
Nasonia
vitripennis).
• NaCP60E
(Sodium
channel
protein
60
E;
D.
melanogaster).
– MF:
voltage-‐gated
ca(on
channel
ac(vity
(IDA,
GO:0022843).
– BP:
olfactory
behavior
(IMP,
GO:
0042048),
sodium
ion
transmembrane
transport
(ISS,GO:0035725).
– CC:
voltage-‐gated
sodium
channel
complex
(IEA,
GO:0001518).
And
what
do
we
know
about
them?
94. Retrieving sequences for
sequence similarity searches.
Example 94
>vgsc-‐Segment3-‐DomainII
RVFKLAKSWPTLNLLISIMGKTVGALGNLTFVLCIIIFIFAVMGMQLFGKNYTEKVTKFKWSQDG
QMPRWNFVDFFHSFMIVFRVLCGEWIESMWDCMYVGDFSCVPFFLATVVIGNLVVSFMHR
98. Creating a new gene model: drag and drop
Example 98
• Apollo automatically calculates ORF.
In this case, ORF includes the high-scoring segment pairs (hsp).
104. Editing: merge the three models
Example 104
Merge
by
dropping
an
exon
or
gene
model
onto
another.
Merge
by
selec(ng
two
exons
(holding
down
“Shio”)
and
using
the
right
click
menu.
or…
105. Editing: correct boundaries, delete exons
Example 105
Modify
exon
/
intron
boundary:
-‐ Drag
the
end
of
the
exon
to
the
nearest
canonical
splice
site.
-‐ Use
right-‐click
menu.
Delete
first
exon
from
HaztTmpM006233
108. Finished model
Example 108
Corroborate
integrity
and
accuracy
of
the
model:
-‐
Start
and
Stop
-‐
Exon
structure
and
splice
sites
…]5’-‐GT/AG-‐3’[…
-‐
Check
the
predicted
protein
product
vs.
NCBI
nr
109. Information Editor
• DBXRefs:
e.g.
NP_001128389.1,
N.
vitripennis,
RefSeq
• PubMed
iden(fier:
PMID:
24065824
• Gene
Ontology
IDs:
GO:0022843,
GO:
0042048,
GO:0035725,
GO:0001518.
• Comments.
• Name,
Symbol.
• Approve
/
Delete
radio
buAon.
Example 109
Comments
(if
applicable)
112. CONTENIDO
Web
Apollo
Collabora(ve
Cura(on
and
Interac(ve
Analysis
of
Genomes
112OUTLINE
• BIO-‐REFRESHER
conceptos
que
neceistamos
• ANOTACION
predicciones
automá(cas
• ANOTACION
MANUAL
necesaria,
en
colaboración
• APOLLO
avanzando
la
curación
en
colaboración
• EJEMPLO
demonstraciones
• EJERCICIOS
115. Exercises
Live
Demonstra(on
using
the
Apis
mellifera
genome.
115
1.
Evidence
in
support
of
protein
coding
gene
models.
1.1
Consensus
Gene
Sets:
Official
Gene
Set
v3.2
Official
Gene
Set
v1.0
1.2
Consensus
Gene
Sets
comparison:
OGSv3.2
genes
that
merge
OGSv1.0
and
RefSeq
genes
OGSv3.2
genes
that
split
OGSv1.0
and
RefSeq
genes
1.3
Protein
Coding
Gene
Predic=ons
Supported
by
Biological
Evidence:
NCBI
Gnomon
Fgenesh++
with
RNASeq
training
data
Fgenesh++
without
RNASeq
training
data
NCBI
RefSeq
Protein
Coding
Genes
and
Low
Quality
Protein
Coding
Genes
1.4
Ab
ini,o
protein
coding
gene
predic=ons:
Augustus
Set
12,
Augustus
Set
9,
Fgenesh,
GeneID,
N-‐SCAN,
SGP2
1.5
Transcript
Sequence
Alignment:
NCBI
ESTs,
Apis
cerana
RNA-‐Seq,
Forager
Bee
Brain
Illumina
Con(gs,
Nurse
Bee
Brain
Illumina
Con(gs,
Forager
RNA-‐Seq
reads,
Nurse
RNA-‐Seq
reads,
Abdomen
454
Con(gs,
Brain
and
Ovary
454
Con(gs,
Embryo
454
Con(gs,
Larvae
454
Con(gs,
Mixed
Antennae
454
Con(gs,
Ovary
454
Con(gs
Testes
454
Con(gs,
Forager
RNA-‐Seq
HeatMap,
Forager
RNA-‐Seq
XY
Plot,
Nurse
RNA-‐Seq
HeatMap,
Nurse
RNA-‐Seq
XY
Plot
Becoming Acquainted with Web Apollo.
116. Exercises
Live
Demonstra(on
using
the
Apis
mellifera
genome.
116
1.
Evidence
in
support
of
protein
coding
gene
models
(Con=nued).
1.6
Protein
homolog
alignment:
Acep_OGSv1.2
Aech_OGSv3.8
Cflo_OGSv3.3
Dmel_r5.42
Hsal_OGSv3.3
Lhum_OGSv1.2
Nvit_OGSv1.2
Nvit_OGSv2.0
Pbar_OGSv1.2
Sinv_OGSv2.2.3
Znev_OGSv2.1
Metazoa_Swissprot
2.
Evidence
in
support
of
non
protein
coding
gene
models
2.1
Non-‐protein
coding
gene
predic=ons:
NCBI
RefSeq
Noncoding
RNA
NCBI
RefSeq
miRNA
2.2
Pseudogene
predic=ons:
NCBI
RefSeq
Pseudogene
Becoming Acquainted with Web Apollo.
118. Thank you. 118
• Berkeley
Bioinforma=cs
Open-‐source
Projects
(BBOP),
Berkeley
Lab:
Apollo
and
Gene
Ontology
teams.
Suzanna
E.
Lewis
(PI).
• §
Chris1ne
G.
Elsik
(PI).
University
of
Missouri.
• *
Ian
Holmes
(PI).
University
of
California
Berkeley.
• Arthropod
genomics
community:
i5K
Steering
CommiAee
(esp.
Sue
Brown
(Kansas
State)),
Alexie
Papanicolaou
(UWS),
and
the
Honey
Bee
Genome
Sequencing
Consor(um.
• Stephen
Ficklin
GenSAS
Washington
State
University
• Apollo
is
supported
by
NIH
grants
5R01GM080203
from
NIGMS,
and
5R01HG004483
from
NHGRI.
Both
projects
are
also
supported
by
the
Director,
Office
of
Science,
Office
of
Basic
Energy
Sciences,
of
the
U.S.
Department
of
Energy
under
Contract
No.
DE-‐AC02-‐05CH11231
•
• For
your
a"en=on,
thank
you!
Apollo
Nathan
Dunn
Colin
Diesh
§
Deepak
Unni
§
Gene
Ontology
Chris
Mungall
Seth
Carbon
Heiko
Dietze
BBOP
Apollo:
hAp://GenomeArchitect.org
GO:
hAp://GeneOntology.org
i5K:
hAp://arthropodgenomes.org/wiki/i5K
¡Gracias!
NAL
at
USDA
Monica
Poelchau
Christopher
Childers
Gary
Moore
HGSC
at
BCM
fringy
Richards
Kim
Worley
JBrowse
Eric
Yao
*