Presentation by Valerie Schneider discussing Genome Reference Consortium (GRC) plans for the mouse and zebrafish reference genome assemblies, presented at the 2016 meeting of the The Allied Genetic Conference (TAGC). Includes description of resources at the National Center for Biotechnology Information (NCBI) for working with reference genome assemblies.
GRC Workshop at Churchill College on Sep 21, 2014. This is Aaron Quinlan's talk on issues with representing variants in the full assembly, with suggestions for VCF modifications for handling variant calls on the alts.
GRC Workshop at Churchill College on Sep 21, 2014. This is Aaron Quinlan's talk on issues with representing variants in the full assembly, with suggestions for VCF modifications for handling variant calls on the alts.
Presentation by Benedict Paten at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on updates to the human reference assembly, GRCh38.
Presentation at IMGC 2019 workshop describing the latest improvements to the mouse reference genome assembly and analyses performed in preparation for the next release of the mouse genome assembly (GRCm39).
Presentation by Valerie Schneider at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on updates to the human reference assembly, GRCh38.
Presentation by Tina Graves-Lindsay at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on production of reference grade assemblies for various human populations.
Presentation at 2019 ASHG GRC/GIAB workshop describing history of the human reference genome, current curation efforts and future plans, and the relationship of all 3 to efforts to produce a human pan-genome.
Presentation by Benedict Paten at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on updates to the human reference assembly, GRCh38.
Presentation at IMGC 2019 workshop describing the latest improvements to the mouse reference genome assembly and analyses performed in preparation for the next release of the mouse genome assembly (GRCm39).
Presentation by Valerie Schneider at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on updates to the human reference assembly, GRCh38.
Presentation by Tina Graves-Lindsay at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on production of reference grade assemblies for various human populations.
Presentation at 2019 ASHG GRC/GIAB workshop describing history of the human reference genome, current curation efforts and future plans, and the relationship of all 3 to efforts to produce a human pan-genome.
Microarray data and pathway analysis: example from the benchMaté Ongenaert
Microarray data and pathway analysis: example from the bench
by drs. Jolien Vermeire - HIVlab, Department of Clinical Chemistry, Microbiology and Immunology – UGent
The increased availability and lower cost of gene expression microarrays has stimulated the use of transcriptome studies in a high variety of fields. Generating expression data at whole-genome level can indeed be a powerful method to characterize cellular pathways involved in a certain biological process. However, the challenge of extracting relevant biological information from such large datasets still prevents researchers from exploiting this tool. In this presentation I will share my personal experience, as a 'researcher non-bioinformatician', with performing microarray data and pathway analyses. I will give a general overview of the different steps that where followed in order to transform raw gene expression data, obtained in context of HIV research, into useful biological information and highlight different methods and software tools that helped me in this process.
RNA-seq for DE analysis: the biology behind observed changes - part 6BITS
Part 6 of the training sesson 'RNA-seq for differential expression analysis' considers gene set analysis for inferring biology from RNA-seq data. See http://www.bits.vib.be
The Functional and Pathway Analysis talk given in March 2010 at the CRUK CRI. Cambridge UK.
It was designed to introduce wet-lab researchers to using web-based tools for doing functional analysis of gene lists, such as from microarray experiments.
RNA-seq for DE analysis: detecting differential expression - part 5BITS
Part 5 of the training sesson 'RNA-seq for differential expression analysis' considers the algorithm used for detecting differential expression between conditions. See http://www.bits.vib.be
Platform presentation at ASHG 2019 describing recent updates to the human reference genome assembly (GRCh38) and future plans with relevance to pan-genomic representations.
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Golden Helix Inc
With a focus on scalable architecture and optimized native code that fully utilizes the CPU and RAM available, we can scale genomic analysis into sizes conventionally considered Big Data on a single host. In this webcast, we demonstrate recent innovations and features in Golden Helix solutions that enable the analysis of big data on your own terms.
Building bioinformatics resources for the global communityExternalEvents
http://www.fao.org/about/meetings/wgs-on-food-safety-management/en/
Building bioinformatics resources for the global community. Presentation from the Technical Meeting on the impact of Whole Genome Sequencing (WGS) on food safety management and GMI-9, 23-25 May 2016, Rome, Italy.
Speaker: Benedict C. S. Cross, PhD, Team leader (Discovery Screening), Horizon Discovery
CRISPR–Cas9 mediated genome editing provides a highly efficient way to probe gene function. Using this technology, thousands of genes can be knocked out and their function assessed in a single experiment. We have conducted over 150 of these complex and powerful screens and will use our experience to guide you through the process of screen design, performance and analysis.
We'll be discussing:
• How to use CRISPR screening for target ID and validation, understanding drug MOA and patient stratification
• The screen design, quality control and how to evaluate success of your screening program
• Horizon’s latest developments to the platform
• Horizon’s novel approaches to target validation screening
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton SeedSpark Summit
In 2001, it cost ~$100M to sequence a single human genome. In 2014, due to dramatic improvements in sequencing technology far outpacing Moore’s law, we entered the era of the $1,000 genome. At the same time, the power of genetics to impact medicine has become evident: for example, drugs with supporting genetic evidence have twice the clinical trial success rate. These factors have led to an explosion in the volume of genetic data, in the face of which existing analysis tools are breaking down.
Therefore, we began the open-source Hail project (https://hail.is) to be a scalable platform built on Apache Spark to enable the worldwide genetics community to build, share, and apply new tools. Hail is focused on variant-level (post-read) data; querying genetic data, annotations and sample data; and performing rare and common variant association analyses. Hail has already been used to analyze datasets with hundreds of thousands of exomes and tens of thousands of whole genomes.
We will give an overview of the goals of the Hail project and its architecture. The challenge of efficiently manipulating genetic data in Spark has led to several innovations that may have wider applicability, including an RDD-like abstraction for representing multidimensional data and an OrderedRDD abstraction for ordered data, (for example, data indexed by position in the genome). Finally, we will discuss Hail performance and future directions.
Presentation by Fritz Sedlazeck at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on characterizing human structural variation.
Presentation at 2019 ASHG GRC/GIAB workshop describing goals and progress of the telomere-to-telomere consortium to generate a genome assembly that provides representation of all sequences, including repetitive regions.
Presentation at 2019 ASHG GRC/GIAB workshop describing features and recent updates to the vg toolkit, including examples of comparisons to other methods used for alignment and variant detection.
Presentation at 2019 ASHG GRC/GIAB workshop describing recent updates to the MANE project, which aims to provide matched annotation from RefSeq and GENCODE.
Presentation at PanGenomics in the Cloud Hackathon, run by NCBI at UCSC (https://ncbiinsights.ncbi.nlm.nih.gov/2019/02/06/pangenomics-cloud-hackathon-march-2019/). Presents points to consider about the adoption of a pangenome reference, emphasizing aspects for long-term data management and wide-spread adoption.
Presentation by Justin Zook at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on benchmarks for indels and structural variants.
Presentation by Karen Miga at GRC/GIAB ASHG 2017 workshop "Getting the most from the reference assembly and reference materials" on centromere assemblies.
263778731218 Abortion Clinic /Pills In Harare ,sisternakatoto
263778731218 Abortion Clinic /Pills In Harare ,ABORTION WOMEN’S CLINIC +27730423979 IN women clinic we believe that every woman should be able to make choices in her pregnancy. Our job is to provide compassionate care, safety,affordable and confidential services. That’s why we have won the trust from all generations of women all over the world. we use non surgical method(Abortion pills) to terminate…Dr.LISA +27730423979women Clinic is committed to providing the highest quality of obstetrical and gynecological care to women of all ages. Our dedicated staff aim to treat each patient and her health concerns with compassion and respect.Our dedicated group ABORTION WOMEN’S CLINIC +27730423979 IN women clinic we believe that every woman should be able to make choices in her pregnancy. Our job is to provide compassionate care, safety,affordable and confidential services. That’s why we have won the trust from all generations of women all over the world. we use non surgical method(Abortion pills) to terminate…Dr.LISA +27730423979women Clinic is committed to providing the highest quality of obstetrical and gynecological care to women of all ages. Our dedicated staff aim to treat each patient and her health concerns with compassion and respect.Our dedicated group of receptionists, nurses, and physicians have worked together as a teamof receptionists, nurses, and physicians have worked together as a team wwww.lisywomensclinic.co.za/
Prix Galien International 2024 Forum ProgramLevi Shapiro
June 20, 2024, Prix Galien International and Jerusalem Ethics Forum in ROME. Detailed agenda including panels:
- ADVANCES IN CARDIOLOGY: A NEW PARADIGM IS COMING
- WOMEN’S HEALTH: FERTILITY PRESERVATION
- WHAT’S NEW IN THE TREATMENT OF INFECTIOUS,
ONCOLOGICAL AND INFLAMMATORY SKIN DISEASES?
- ARTIFICIAL INTELLIGENCE AND ETHICS
- GENE THERAPY
- BEYOND BORDERS: GLOBAL INITIATIVES FOR DEMOCRATIZING LIFE SCIENCE TECHNOLOGIES AND PROMOTING ACCESS TO HEALTHCARE
- ETHICAL CHALLENGES IN LIFE SCIENCES
- Prix Galien International Awards Ceremony
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...i3 Health
i3 Health is pleased to make the speaker slides from this activity available for use as a non-accredited self-study or teaching resource.
This slide deck presented by Dr. Kami Maddocks, Professor-Clinical in the Division of Hematology and
Associate Division Director for Ambulatory Operations
The Ohio State University Comprehensive Cancer Center, will provide insight into new directions in targeted therapeutic approaches for older adults with mantle cell lymphoma.
STATEMENT OF NEED
Mantle cell lymphoma (MCL) is a rare, aggressive B-cell non-Hodgkin lymphoma (NHL) accounting for 5% to 7% of all lymphomas. Its prognosis ranges from indolent disease that does not require treatment for years to very aggressive disease, which is associated with poor survival (Silkenstedt et al, 2021). Typically, MCL is diagnosed at advanced stage and in older patients who cannot tolerate intensive therapy (NCCN, 2022). Although recent advances have slightly increased remission rates, recurrence and relapse remain very common, leading to a median overall survival between 3 and 6 years (LLS, 2021). Though there are several effective options, progress is still needed towards establishing an accepted frontline approach for MCL (Castellino et al, 2022). Treatment selection and management of MCL are complicated by the heterogeneity of prognosis, advanced age and comorbidities of patients, and lack of an established standard approach for treatment, making it vital that clinicians be familiar with the latest research and advances in this area. In this activity chaired by Michael Wang, MD, Professor in the Department of Lymphoma & Myeloma at MD Anderson Cancer Center, expert faculty will discuss prognostic factors informing treatment, the promising results of recent trials in new therapeutic approaches, and the implications of treatment resistance in therapeutic selection for MCL.
Target Audience
Hematology/oncology fellows, attending faculty, and other health care professionals involved in the treatment of patients with mantle cell lymphoma (MCL).
Learning Objectives
1.) Identify clinical and biological prognostic factors that can guide treatment decision making for older adults with MCL
2.) Evaluate emerging data on targeted therapeutic approaches for treatment-naive and relapsed/refractory MCL and their applicability to older adults
3.) Assess mechanisms of resistance to targeted therapies for MCL and their implications for treatment selection
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?bkling
Are you curious about what’s new in cervical cancer research or unsure what the findings mean? Join Dr. Emily Ko, a gynecologic oncologist at Penn Medicine, to learn about the latest updates from the Society of Gynecologic Oncology (SGO) 2024 Annual Meeting on Women’s Cancer. Dr. Ko will discuss what the research presented at the conference means for you and answer your questions about the new developments.
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...VarunMahajani
Disruption of blood supply to lung alveoli due to blockage of one or more pulmonary blood vessels is called as Pulmonary thromboembolism. In this presentation we will discuss its causes, types and its management in depth.
Knee anatomy and clinical tests 2024.pdfvimalpl1234
This includes all relevant anatomy and clinical tests compiled from standard textbooks, Campbell,netter etc..It is comprehensive and best suited for orthopaedicians and orthopaedic residents.
These lecture slides, by Dr Sidra Arshad, offer a quick overview of physiological basis of a normal electrocardiogram.
Learning objectives:
1. Define an electrocardiogram (ECG) and electrocardiography
2. Describe how dipoles generated by the heart produce the waveforms of the ECG
3. Describe the components of a normal electrocardiogram of a typical bipolar leads (limb II)
4. Differentiate between intervals and segments
5. Enlist some common indications for obtaining an ECG
Study Resources:
1. Chapter 11, Guyton and Hall Textbook of Medical Physiology, 14th edition
2. Chapter 9, Human Physiology - From Cells to Systems, Lauralee Sherwood, 9th edition
3. Chapter 29, Ganong’s Review of Medical Physiology, 26th edition
4. Electrocardiogram, StatPearls - https://www.ncbi.nlm.nih.gov/books/NBK549803/
5. ECG in Medical Practice by ABM Abdullah, 4th edition
6. ECG Basics, http://www.nataliescasebook.com/tag/e-c-g-basics
The prostate is an exocrine gland of the male mammalian reproductive system
It is a walnut-sized gland that forms part of the male reproductive system and is located in front of the rectum and just below the urinary bladder
Function is to store and secrete a clear, slightly alkaline fluid that constitutes 10-30% of the volume of the seminal fluid that along with the spermatozoa, constitutes semen
A healthy human prostate measures (4cm-vertical, by 3cm-horizontal, 2cm ant-post ).
It surrounds the urethra just below the urinary bladder. It has anterior, median, posterior and two lateral lobes
It’s work is regulated by androgens which are responsible for male sex characteristics
Generalised disease of the prostate due to hormonal derangement which leads to non malignant enlargement of the gland (increase in the number of epithelial cells and stromal tissue)to cause compression of the urethra leading to symptoms (LUTS
Explore natural remedies for syphilis treatment in Singapore. Discover alternative therapies, herbal remedies, and lifestyle changes that may complement conventional treatments. Learn about holistic approaches to managing syphilis symptoms and supporting overall health.
TEST BANK for Operations Management, 14th Edition by William J. Stevenson, Ve...kevinkariuki227
TEST BANK for Operations Management, 14th Edition by William J. Stevenson, Verified Chapters 1 - 19, Complete Newest Version.pdf
TEST BANK for Operations Management, 14th Edition by William J. Stevenson, Verified Chapters 1 - 19, Complete Newest Version.pdf
These simplified slides by Dr. Sidra Arshad present an overview of the non-respiratory functions of the respiratory tract.
Learning objectives:
1. Enlist the non-respiratory functions of the respiratory tract
2. Briefly explain how these functions are carried out
3. Discuss the significance of dead space
4. Differentiate between minute ventilation and alveolar ventilation
5. Describe the cough and sneeze reflexes
Study Resources:
1. Chapter 39, Guyton and Hall Textbook of Medical Physiology, 14th edition
2. Chapter 34, Ganong’s Review of Medical Physiology, 26th edition
3. Chapter 17, Human Physiology by Lauralee Sherwood, 9th edition
4. Non-respiratory functions of the lungs https://academic.oup.com/bjaed/article/13/3/98/278874
NVBDCP.pptx Nation vector borne disease control programSapna Thakur
NVBDCP was launched in 2003-2004 . Vector-Borne Disease: Disease that results from an infection transmitted to humans and other animals by blood-feeding arthropods, such as mosquitoes, ticks, and fleas. Examples of vector-borne diseases include Dengue fever, West Nile Virus, Lyme disease, and malaria.
micro teaching on communication m.sc nursing.pdfAnurag Sharma
Microteaching is a unique model of practice teaching. It is a viable instrument for the. desired change in the teaching behavior or the behavior potential which, in specified types of real. classroom situations, tends to facilitate the achievement of specified types of objectives.
6. Assembly (e.g. GRCm38)
Primary
Assembly
Unit
(C57BL/6J)
Non-nuclear
assembly unit
(e.g. MT)
129S6/
SvEvTac
129S1/
SvImJ
129X1/
SvJ
NOD/
ShiLtJ
NOD/
MrkTac
PAR
Genomic
Region
(MHC)
Genomic
Region
(DiGeorge)
Genomic
Region
(Ren2)
GRCm38 Alternate
Loci Strains
A/J
AKR/J
BALB/c
CAST/Ei
129S6/SvEvTac
129P2/OlaHsd
129S2/SvPas
129S1/SvImJ
129X1/SvJ
129S7/SvEvBrd-Hprt-
b-m2
NOD/MrkTac
NOD/ShiLtJ
RIII
Assembly Model
7. Assembly Updates
Assembly (e.g. GRCm38.p5)
Primary
Assembly
Unit
(C57BL/6J)
Non-nuclear
assembly unit
(e.g. MT)
129S6/
SvEvTac
129S1/
SvImJ
129X1/
SvJ
NOD/
ShiLtJ
NOD/
MrkTac
PAR
Genomic
Region
(MHC)
Genomic
Region
(DiGeorge)
Genomic
Region
(Ren2)
Patches
Genomic
Region
(Sftpb)
Genomic
Region
(Nlrp4g)
Genomic
Region
(Meg3)
Patches
FIX NOVEL
SCAFFOLD STATUS AT NEXT
MAJOR ASSEMBLY RELEASE
ALT
LOCI
--
(integrated)
24. Acknowledgements
GRC SAB
• Rick Myers
• Granger Sutton
• Evan Eichler
• Jim Kent
• Roderic Guigo
• Carol Bult
• Derek Stemple
• Jan Korbel
• Liz Worthey
• Matthew Hurles
• Richard Gibbs
GRC
• Tina Graves-Lindsay
• Kerstin Howe
• Richard Durbin
• Paul Flicek
• Laura Clarke
• Monte Westerfield
• Deanna Church
• Curators!
• Developers!
GRC Mouse/Zfish Collaborators
• NCBI RefSeq/Gene
• HAVANA annotators
• Peter Lansdorp
• Mark Hills
• Derek Stemple
• David Page
• WTSI NOD Idd team
NCBI Support
• Genome Browser team
• Assembly DB
• Gpipe annotation team
• Clone DB
• Remapping Service
https://genomereference.org
For more info:
poster M5055A
25. Utilizing NCBI Databases for Model Organism Research
News: www.ncbi.nlm.nih.gov/news/
Contact us: info@ncbi.nlm.nih.gov
Time Topic
Poster
Number
8:00 – 8:25
The 3 W’s of Sequence Data Submission: What, Where, and When
Ilene Mizrachi —
8:25 – 8:45
Reference genome assemblies: resources and updates from the GRC
Valerie Schneider
M5055/A
8:45 – 9:10
How to annotate for 300 species: the awesome power of NCBI’s
eukaryotic genome annotation pipeline
Terence Murphy
D1524/B
9:10 – 9:35
An introduction to NCBI’s RefSeq and Gene resources
Tripti Gupta
M5104/B
9:35 – 9:55
Optimizing use of NCBI databases to analyze your favorite gene
Nuala O’Leary
Z6088/A
Editor's Notes
I’d like to thank you all for getting up so bright and early this morning. My name is Valerie Schneider and I’m the team lead for the Genome Reference Consortium at NCBI.
Today I’ll be introducing you to the Genome Reference Consortium, telling you about our ongoing curation efforts and assembly update plans for the mouse and zebrafish reference genome assemblies and showing you some GRC and NCBI resources you can use in your research.
The GRC was established after the conclusion of the HGP to manage improvements to the human genome, and subsequently became responsible for the management of the mouse and zebrafish assemblies. It was initially comprised of the first 4 institutions shown here, who together perform the wet lab and bioinformatics work. These year we were pleased to have ZFIN join the GRC, who, as I’ll describe later, will be contributing to the zebrafish assembly curation effort.
The GRC has noticed that it’s not uncommon for many researchers to take the following view when it comes to genome assemblies. But we’d submit that the reality looks more like this. A publication does not equal a perfect genome assembly. Assemblies are kind of like cell phones. There’s no denying that the first cell phones or the initial genome representation for an organism have a transformative effect. But think about how much more we can do with today’s smart phones. And that’s the role of the GRC: it’s not just about fixing problems in reference assemblies, but updating them as we gain new knowledge so that we can continue to use them to advance our understanding of biology.
With that goal in mind , I’d like to talk today about updates and timelines for the mouse and zebrafish genome assemblies. To make sure we’re all on the same page with the terminology I’ll be using, I’ll start by briefly explain the assembly model used by the GRC, and which is depicted on the next slide.
The first thing to know is that the assembly is comprised of assembly units.
Primary assembly unit is the collection of chromosomes and unlocalized and unplaced scaffolds. For mouse, this is the C57BL/6J strain and for Zfish, it’s TU.
Non-nuclear genomes, like the MT, are assigned to their own assembly unit.
Regions (yellow) are defined for those areas of the genome for which alternate strain or haplotype representation is desired.
Those alternate sequence representations go into alternate loci assembly units.
In mouse, the alt loci units are strain-specific and at right is the list of the strains represented as alt loci in GRCm38.
There aren’t alt loci in GRCz10, but they may be added for the next assembly release. In contrast to mouse, the alts included with the zfish assembly will represent various haplotypes within the TU strain, rather than different strains.
Alt loci sequences are therefore separate from the chromosome sequences, but their relationship to the chromosomes is known and they are part of the reference assembly.
While folks in the mouse and zebrafish community have historically been relatively tolerant of assembly updates that involve coordinate changes, our work on the human assembly has shown us that as the amount of data mapped on a given assembly increases, tolerance declines. The “patches” feature of the assembly model allows the GRC to make updates available in a timely fashion to researchers needing corrected or new sequence without disrupting the chromosome coordinates. Regions are defined for the genomic locations to be updated, and the sequences representing those updates are put into the “Patches” assembly unit. Like the alts, the patches aren’t part of the chromosomes, but they and their alignments to the chromosomes are part of the reference assembly.
There are two types of patches:
(1) FIX patches correct problems in the assembly: deprecated in next major assembly release.
(2) NOVEL patches add new alternate sequence representations to the assembly: become alternate loci in the next major assembly release.
This slide shows an example of how we patched the reference representation of Jakmip3 (janus kinase and microtubule interacting protein 3) on GRCm38 chr 7. This image shows the alignment of GRCm38 to C57BL/6JN, a closely related strain whose assembly is part of the Mouse Genomes Project. The B6NJ assembly spans the gap and includes several exons missing from the reference. Since we can’t put the B6NJ sequence into the B6 chromosomes, we used it to query public databases for additional B6 assembly sequences that could fill the gap. As shown below, in this image of the recently released fix patch, the new sequences span and if you zoom in, you can see that they provide the representation for those missing exons.
For the mouse assembly GRCm38, we are working to resolve problems and we review data for patch releases 1-2x/year. The ideogram on the left shows the locations of all alternate loci and patch scaffolds in GRCm38, and on the right I’ve listed the genes whose representation was corrected in our most recent patch release, GRCm38.p5, just last month. For those of you wondering when the next major release will occur, the answer is that it is not currently scheduled. B/c coordinate changing updates can be disruptive, want to be sure they are worth it. In fact, as part of our decision-making process for release, we’d like to hear from you about whether the current assembly is meeting your needs or not and what sort of improvements you want to see in the next release. You can contact us through our website.
There have been several great talks at this meeting about the Sanger Mouse Genomes project, and you may be wondering about its relationship to the GRC and the reference assembly. It is not a part of the GRC, but due to Sanger’s involvement in both projects, we are tied into it. We are already using MGP data in our curation efforts and comparative analyses should help us identify regions needing further curation. I direct you to this poster by Will Chow that talks about tools the GRC is using to make these comparisons.
Unlike mouse, the next zebrafish genome assembly update is planned, and it’s coming soon. From now through the end of the year, the GRC is focused on addressing the issues listed here. Note that for the first time, the zfish assembly is planned to contain alternate loci. For more information on these efforts, as well as new strain sequencing and assembly efforts that will be happening at Sanger, make sure you check out Kerstin Howe’s poster.
With the release of GRCz11, there will also be some changes in GRC’s management of the reference assembly. ZFIN will be assuming the primary curatorial role from Sanger, and curation will move to a passive mode. That means that we’ll be responding to user reports, but that we’ll no longer be reviewing the assembly for proactive updates. We’ll be releasing improvements we make as patches, but there are no plans for a GRCz12 at this time. And while the GRC does not do assembly annotation, I also want to point out that the manual annotation effort that’s been ongoing at Sanger will be winding down. Moving forward, those data will be merged into the Ensembl automated gene builds, and RefSeq will also continue to provide annotation for the assembly and patches as well. You’ll hear more about the NCBI pipeline from Terence in the next talk.
Now that you know about the changes that are coming, I’d like to take the rest of this talk to tell you about resources from the GRC and NCBI for accessing assembly data, keeping abreast of the curation effort and assessing genome quality in your region of interest.
Let’s start at the GRC homepage. From there, you can get to organism-specific overview pages, through the toolbar at the top of the page. The overview pages contain links for downloading the current assembly and at the bottom of the page is a table with information about assembly regions that have alternate loci and patches. You can download the table and the panel on the left lets you search and filter the table by various criteria. Clicking on any region name in the table will take you to a separate page that provides you with more details about the region.
You can find information about the assembly issues on which the GRC is working via the organism-specific “Issues Under Review” tabs. On top, an ideogram shows the genomic locations of issues, which are listed in the table below. On the left of the page, you can search or filter for issues.
Clicking on a particular ideogram updates the page and table to a more specific view, where the annotations can be categorized by issue type or status. Hovering over the icons opens a pop-up with more detail. Within the table you will also find links to pages describing each of the individual issues.
An Issue detail page has 3 parts, highlighted here:
At top, brief description of the issue, plus an ideogram showing its genomic location
In middle, lists of patch and alternate loci sequences associated with the issue (if they exist)
At bottom, graphical view of the issue region
Tracks in the display have been chosen for their utility in assembly assessment, and I’ll describe in more detail shortly.
If there’s a patch or alt loci scaffold associated with the issue, you can toggle the graphic to see it from perspective of the chromosome (gap) or the patch/alternate loci scaffold (green closure), along with the sequence alignment.
If you’re looking for assembly stats or want quick access to the chromosome sequences, the data tab takes you to a page that includes lengths, gap counts, N50s and global stats. Using the drop down-down menu at the top, you can also find this data for previous assembly versions. Although I’m using mouse an example here, I want to emphasize that all these resources are also available for zebrafish.
Maybe most importantly, if you spot a potential problem with the genome, you can report this to us! In fact, we want you to. We prioritize work on user-reported issues. But if we don’t know about it, we can’t fix it!
But the GRC website isn’t the only place you can access GRC assemblies. I’d like to shift gear a little now and talk about accessing the data at NCBI. If you’re starting at the NCBI homepage, select the “Assembly” database from the drop-down menu and search. You can search by assembly name, accession or by organism.
For each assembly in the database, you’ll find a summary website that includes metadata, links to download the assembly from GenBank and RefSeq FTP, and the assembly statistics. And last but not least, if you’re looking at the latest assembly version annotated by RefSeq, you’ll find a link to view the assembly in the NCBI genome browser known as the Genome Data Viewer (GDV).
The first time you visit GDV, it will display a default set of tracks. The display is managed through the “Tracks” menu button. From there, you can access a feature, known as “Track Sets” which allows one-click configuration of the display. The “Assembly Support” track set includes the tracks most valuable for assessing assembly quality. This is essentially the same set of tracks you’ll find on the GRC pages or in the GRC track hubs at the Ensembl and UCSC browsers. You can also do a custom configuration of GDV with the “Configure Tracks” option.
Within the Assembly Support track set, the Assembly Components track shows the underlying sequences and gaps in the assembly, while the “Issues” track shows you where the GRC is curating the assembly. There’s also a track showing component sequencing problems. The “Clone Placement” track can be used to identify mis-assemblies or find clones of interest. On the lower left side of the browser is a section called “Region details”. If the chromosome region you’re looking at has alts/patches associated with it, you can click here update the display to show those sequences instead. It also includes a link to the relevant issues at the GRC website. The “Your Data” section, lets you upload your own data into the browser for viewing alongside the NCBI-provided tracks. You can use this combination of features to assess whether the genome is okay in your region of interest.
If you work on zfish and you’re already thinking ahead to GRCz11, there are resources to help you remap data.
The NCBI Remapping tool uses assembly-assembly alignments to project the features from one assembly to the other. You select the assemblies you want to map between, your remapping options and your input and output file formats. For those who need more than the web interface can offer, there is also a perl API available.
With that, I’d like to wrap things up and acknowledge the many contributors to this work. I hope I’ve left you with some idea of what’s happening with your reference assemblies, and how to find tools to help you assess them. We look forward to hearing from you!
The region page provides:
Location of region, including ideogram
Lists of GRC issues and patch and alternate loci sequences associated with the region
Graphical view of the region (in this view, the blue bars are the assembly components)
Can toggle to see graphic from perspective of the chromosome or the patch/alternate loci scaffold
Tracks in the display include the alignment of the alt/patch to the chromosome, so you can see how they differ, plus other tracks useful for assembly assessment, which I’ll describe in more detail shortly.