SlideShare a Scribd company logo
1 of 1
Download to read offline
gEVAL – A Genome Evaluation Browser for Improving Genome Assemblies! 
William Chow, Kim Brugger, Britt Kilian, James Torrance, Eduard Zuiderwijk, and Kerstin Howe! 
Wellcome Trust Sanger Institute, Cambridge, UK.! 
http://geval.sanger.ac.uk! 
geval-help@sanger.ac.uk! 
gEVAL Punchlists and Issue Navigation! 
Automated lists created to facilitate identification of and navigation to issues or regions of 
interest. In browser menus also help to jump between issues.! 
! 
Optical Maps! 
Optical map data are ordered restriction maps from single stained molecules of DNA that can be 
aligned against assemblies. gEVAL hosts some of this data for human and mouse and aids in 
identifying genomic regions that requires attention, such as rearrangements or mis-representation 
of sequence and haplotypes. ! 
! 
Introduction! 
The web-accessible gEVAL browser (http://geval.sanger.ac.uk) allows the evaluation of 
genome assemblies through its tools and pre-computed analyses. The strength of this browser 
is the ability to navigate an up to date assembly and identify problematic regions and assisting in 
strategizing potential solutions for these issues. This facilitates the improvement of overall 
assemblies to a “gold” standard for release as reference genomes.! 
Mapped multiple times 
Wrong direction (<<, <>, >>) 
Wrong distance from partner 
Integration of GRC Review/Status Update System! 
A! s part of the GRC curation process, 
regions of interest that are to be 
evaluated are tagged and tracked via 
the GRC review ticketing system. ! 
! 
Both resolved and unresolved tickets are 
visible for viewing as a track on the 
browser or as a dedicated punchlist. ! 
! 
A summary of the features in the region 
associated with the ticket is also available 
(right insert).! 
Visual Representation of Current Assembly State! 
Our build cycle is frequent, and thus can represent a current snapshot of the 
assembly. As we are part of the GRC, we also have first access to major GRC 
assembly releases.! 
Component 
in 
sequencing 
Pipeline. 
Phase 
1 
unfinished 
component. 
Phase 
2/3 
finished 
component. 
Comparative Genomics! 
gEVAL includes comparative analyses of different assembly builds for each species. This 
helps in identifying missing sequences, reference assembly errors and haplotypic variation.! 
! 
A gap separates two clone 
components in a zebrafish 
bulid. Investigating the 
alignments against two whole 
genome shotgun (wgs) 
assemblies reveal size of gap 
and missing sequence.! 
A region of the wgs is used to 
cover the gap in a later build. 
(bonus: a clone is also in 
pipeline, grey box above). ! 
The clone component AL596089 
contains a deletion and is 
highlighted by the 3 cell line optical 
map analysis (right). This would not 
have been captured because the 
clone overlaps do not extend far 
enough to show this. An issue that 
is tagged and reported in GRC 
ticket: HG-1482.! 
! 
Optical Map data provided by the D. 
Schwartz Lab (UW Madison).! 
Popup menus on tracks to quickly 
help navigate between previous/ 
next overlap between components 
along a chr (below).! 
An example overview of punchlists available. Punchlists can be 
tailored for different projects, on request (above).! 
Components potentially placed on the wrong chr using marker 
evidence listed per chr (below).! 
Current Species Available! 
Identify Problematic/Incomplete Transcript Mappings! 
GREEN – 98% cutoff coverage ! 
ORANGE – Incomplete or problematic transcript! 
! 
• This example shows how a region of 2 
clones (dark/light blue boxes on contig 
track) have incorrect orientation.! 
• The overlapping gene ryr1b, therefore 
looked to be split on opposite strands! 
• The incorrect orientation of 2 gap 
spanning fosmids confirmed the 
assertion that CU138549 was in the 
wrong orientation.! 
! 
The up to date path returns the correct 
gene structure and clone end mapping.! 
before! 
after! 
Examine Large Region of Interest! 
View region windows of up to 2Mb, allowing for greater vantage of possible 
problematic areas. The Region overview page provides a less detailed snapshot 
of larger windows up to the entire chromosome or top level component.! 
! 
! 
Region overview can show, for example, the state of the assembly and how much are unfinished, 
finished or sequence that is in production. Above is a snapshot of a region just under 10Mb and 
the clones in the path. Status of clones can be quickly scanned and regions prioritized. ! 
Clone End Library Mappings! 
! 
Mapped 1 time 
Spanning partner in the vicinity 
Clone end mappings in gEVAL are unique 
due to how they are displayed, facilitating 
the ease of identifying concurrent clones 
or inconsistencies relating to a potential 
problem with the assembly. Clones can be 
picked to close gap regions or to span 
regions of interest for further interrogation.! 
! 
before! 
after! 
The above example illustrates using end placements to pick clones to cover gaps. In the before 
image, there is a gap with a BAC clone spanning the gapped region according to their end 
placements (orange). In the subsequent assembly (after image above) with the clone 
sequenced, the unfinished clone places well in the region, as illustrated by the green clone 
overlaps.! 
Human! 
GRCh38, GRCh37pX (latest patch), 
NCBI36, CHM1_1.1, NA12878, HuREF, 
YH1/2.0. ! 
Zebrafish! 
Zv9, WGS28, WGS29, WGS31, ! 
z.2013.12.06, z.2014.03.14.! 
! 
Mouse! 
GRCm38, GRCm38pX (latest patch), 
GRCm37B/C, NCBIm37, wgs_c57bl6j, 
wgs_celera, MGSCv3, m.2013.03.15.! 
Helminth! 
Echinococcus multilocularis! 
Schistosoma mansoni ! 
Stronglyoides ratti! 
Genome Reference Consortium! 
The Genome Reference Consortium (GRC) is a partnership between the Sanger Institute, 
NCBI, EBI and the Genome Institute at Wash U tasked with improving and providing accurate 
reference genomes. This includes releasing the reference assemblies of human, mouse and 
zebrafish. ! 
Pig! 
Sscrofa10.2! 
The red arrows highlights the 
incorrect orientation of these 
ryr1b gene split fosmid ends. ! 
on opposite 
strands! 
Clone end placements reveal sequence that can be placed in the gap 
region. Assembly reveals newly sequenced clone in path.!

More Related Content

What's hot

HCS for brain disorders / HCS Pharma at B4B mars 2018
HCS for brain disorders / HCS Pharma at B4B mars 2018HCS for brain disorders / HCS Pharma at B4B mars 2018
HCS for brain disorders / HCS Pharma at B4B mars 2018HCS Pharma
 
Introduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR GenomicsIntroduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR GenomicsAndrea Telatin
 
The ensembl database
The ensembl databaseThe ensembl database
The ensembl databaseAshfaq Ahmad
 
02.databases slides
02.databases slides02.databases slides
02.databases slidesItsme148
 
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...VHIR Vall d’Hebron Institut de Recerca
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysisDr. Naveen Gaurav srivastava
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2Razzaqe
 

What's hot (7)

HCS for brain disorders / HCS Pharma at B4B mars 2018
HCS for brain disorders / HCS Pharma at B4B mars 2018HCS for brain disorders / HCS Pharma at B4B mars 2018
HCS for brain disorders / HCS Pharma at B4B mars 2018
 
Introduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR GenomicsIntroduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR Genomics
 
The ensembl database
The ensembl databaseThe ensembl database
The ensembl database
 
02.databases slides
02.databases slides02.databases slides
02.databases slides
 
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 

Similar to gEVAL - A Genome Evaluation Browser for Improving Genome Assemblies (SFAF 2014 Poster)

GenoCensus™: Analysis and Visualisation of Genomic Copy Number Variation
GenoCensus™: Analysis and Visualisation of Genomic Copy Number VariationGenoCensus™: Analysis and Visualisation of Genomic Copy Number Variation
GenoCensus™: Analysis and Visualisation of Genomic Copy Number VariationPierre Lindenbaum
 
Apollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionalityApollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionalityMonica Munoz-Torres
 
SyMAP Master's Thesis Presentation
SyMAP Master's Thesis PresentationSyMAP Master's Thesis Presentation
SyMAP Master's Thesis Presentationaustinps
 
Genome Assembly copy
Genome Assembly   copyGenome Assembly   copy
Genome Assembly copyPradeep Kumar
 
Human Genome 2009
Human Genome 2009Human Genome 2009
Human Genome 2009lyonja
 
2013 pag-equine-workshop
2013 pag-equine-workshop2013 pag-equine-workshop
2013 pag-equine-workshopc.titus.brown
 
Visualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsVisualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsAnn Loraine
 
CNV and aneuploidy detection by Ion semiconductor sequencing
CNV and aneuploidy detection by Ion semiconductor sequencingCNV and aneuploidy detection by Ion semiconductor sequencing
CNV and aneuploidy detection by Ion semiconductor sequencingThermo Fisher Scientific
 
Genome rearrangement
Genome rearrangementGenome rearrangement
Genome rearrangementPinky Vincent
 
Exploiting tertiary structure through local folds for crystallographic phasing
Exploiting tertiary structure through local folds for crystallographic phasingExploiting tertiary structure through local folds for crystallographic phasing
Exploiting tertiary structure through local folds for crystallographic phasingxrbiotech
 
Climbing Mt. Metagenome
Climbing Mt. MetagenomeClimbing Mt. Metagenome
Climbing Mt. Metagenomec.titus.brown
 
F Giordano ScanPAV Analysis Pipeline
F Giordano ScanPAV Analysis PipelineF Giordano ScanPAV Analysis Pipeline
F Giordano ScanPAV Analysis PipelineFrancesca Giordano
 
RNA-Seq_Presentation
RNA-Seq_PresentationRNA-Seq_Presentation
RNA-Seq_PresentationToyin23
 

Similar to gEVAL - A Genome Evaluation Browser for Improving Genome Assemblies (SFAF 2014 Poster) (20)

Shotgun and clone contig method
Shotgun and clone contig methodShotgun and clone contig method
Shotgun and clone contig method
 
GenoCensus™: Analysis and Visualisation of Genomic Copy Number Variation
GenoCensus™: Analysis and Visualisation of Genomic Copy Number VariationGenoCensus™: Analysis and Visualisation of Genomic Copy Number Variation
GenoCensus™: Analysis and Visualisation of Genomic Copy Number Variation
 
Bioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmmBioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmm
 
Apollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionalityApollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionality
 
SyMAP Master's Thesis Presentation
SyMAP Master's Thesis PresentationSyMAP Master's Thesis Presentation
SyMAP Master's Thesis Presentation
 
Genome Assembly copy
Genome Assembly   copyGenome Assembly   copy
Genome Assembly copy
 
Human Genome 2009
Human Genome 2009Human Genome 2009
Human Genome 2009
 
2013 pag-equine-workshop
2013 pag-equine-workshop2013 pag-equine-workshop
2013 pag-equine-workshop
 
Visualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsVisualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotations
 
CNV and aneuploidy detection by Ion semiconductor sequencing
CNV and aneuploidy detection by Ion semiconductor sequencingCNV and aneuploidy detection by Ion semiconductor sequencing
CNV and aneuploidy detection by Ion semiconductor sequencing
 
Alignment Approaches II: Long Reads
Alignment Approaches II: Long ReadsAlignment Approaches II: Long Reads
Alignment Approaches II: Long Reads
 
3302 3305
3302 33053302 3305
3302 3305
 
Genome rearrangement
Genome rearrangementGenome rearrangement
Genome rearrangement
 
Exploiting tertiary structure through local folds for crystallographic phasing
Exploiting tertiary structure through local folds for crystallographic phasingExploiting tertiary structure through local folds for crystallographic phasing
Exploiting tertiary structure through local folds for crystallographic phasing
 
Climbing Mt. Metagenome
Climbing Mt. MetagenomeClimbing Mt. Metagenome
Climbing Mt. Metagenome
 
Genome comparision
Genome comparisionGenome comparision
Genome comparision
 
F Giordano ScanPAV Analysis Pipeline
F Giordano ScanPAV Analysis PipelineF Giordano ScanPAV Analysis Pipeline
F Giordano ScanPAV Analysis Pipeline
 
Gene prediction strategies
Gene prediction strategies Gene prediction strategies
Gene prediction strategies
 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
 
RNA-Seq_Presentation
RNA-Seq_PresentationRNA-Seq_Presentation
RNA-Seq_Presentation
 

Recently uploaded

DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDivyaK787011
 
Immunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptImmunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptAmirRaziq1
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxRitchAndruAgustin
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPRPirithiRaju
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxGiDMOh
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfSubhamKumar3239
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlshansessene
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxMedical College
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11GelineAvendao
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learningvschiavoni
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionJadeNovelo1
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfGABYFIORELAMALPARTID1
 
How we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxHow we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxJosielynTars
 

Recently uploaded (20)

DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
 
Immunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptImmunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.ppt
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptx
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdf
 
PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girls
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptx
 
Interferons.pptx.
Interferons.pptx.Interferons.pptx.
Interferons.pptx.
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and Function
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
 
How we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxHow we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptx
 

gEVAL - A Genome Evaluation Browser for Improving Genome Assemblies (SFAF 2014 Poster)

  • 1. gEVAL – A Genome Evaluation Browser for Improving Genome Assemblies! William Chow, Kim Brugger, Britt Kilian, James Torrance, Eduard Zuiderwijk, and Kerstin Howe! Wellcome Trust Sanger Institute, Cambridge, UK.! http://geval.sanger.ac.uk! geval-help@sanger.ac.uk! gEVAL Punchlists and Issue Navigation! Automated lists created to facilitate identification of and navigation to issues or regions of interest. In browser menus also help to jump between issues.! ! Optical Maps! Optical map data are ordered restriction maps from single stained molecules of DNA that can be aligned against assemblies. gEVAL hosts some of this data for human and mouse and aids in identifying genomic regions that requires attention, such as rearrangements or mis-representation of sequence and haplotypes. ! ! Introduction! The web-accessible gEVAL browser (http://geval.sanger.ac.uk) allows the evaluation of genome assemblies through its tools and pre-computed analyses. The strength of this browser is the ability to navigate an up to date assembly and identify problematic regions and assisting in strategizing potential solutions for these issues. This facilitates the improvement of overall assemblies to a “gold” standard for release as reference genomes.! Mapped multiple times Wrong direction (<<, <>, >>) Wrong distance from partner Integration of GRC Review/Status Update System! A! s part of the GRC curation process, regions of interest that are to be evaluated are tagged and tracked via the GRC review ticketing system. ! ! Both resolved and unresolved tickets are visible for viewing as a track on the browser or as a dedicated punchlist. ! ! A summary of the features in the region associated with the ticket is also available (right insert).! Visual Representation of Current Assembly State! Our build cycle is frequent, and thus can represent a current snapshot of the assembly. As we are part of the GRC, we also have first access to major GRC assembly releases.! Component in sequencing Pipeline. Phase 1 unfinished component. Phase 2/3 finished component. Comparative Genomics! gEVAL includes comparative analyses of different assembly builds for each species. This helps in identifying missing sequences, reference assembly errors and haplotypic variation.! ! A gap separates two clone components in a zebrafish bulid. Investigating the alignments against two whole genome shotgun (wgs) assemblies reveal size of gap and missing sequence.! A region of the wgs is used to cover the gap in a later build. (bonus: a clone is also in pipeline, grey box above). ! The clone component AL596089 contains a deletion and is highlighted by the 3 cell line optical map analysis (right). This would not have been captured because the clone overlaps do not extend far enough to show this. An issue that is tagged and reported in GRC ticket: HG-1482.! ! Optical Map data provided by the D. Schwartz Lab (UW Madison).! Popup menus on tracks to quickly help navigate between previous/ next overlap between components along a chr (below).! An example overview of punchlists available. Punchlists can be tailored for different projects, on request (above).! Components potentially placed on the wrong chr using marker evidence listed per chr (below).! Current Species Available! Identify Problematic/Incomplete Transcript Mappings! GREEN – 98% cutoff coverage ! ORANGE – Incomplete or problematic transcript! ! • This example shows how a region of 2 clones (dark/light blue boxes on contig track) have incorrect orientation.! • The overlapping gene ryr1b, therefore looked to be split on opposite strands! • The incorrect orientation of 2 gap spanning fosmids confirmed the assertion that CU138549 was in the wrong orientation.! ! The up to date path returns the correct gene structure and clone end mapping.! before! after! Examine Large Region of Interest! View region windows of up to 2Mb, allowing for greater vantage of possible problematic areas. The Region overview page provides a less detailed snapshot of larger windows up to the entire chromosome or top level component.! ! ! Region overview can show, for example, the state of the assembly and how much are unfinished, finished or sequence that is in production. Above is a snapshot of a region just under 10Mb and the clones in the path. Status of clones can be quickly scanned and regions prioritized. ! Clone End Library Mappings! ! Mapped 1 time Spanning partner in the vicinity Clone end mappings in gEVAL are unique due to how they are displayed, facilitating the ease of identifying concurrent clones or inconsistencies relating to a potential problem with the assembly. Clones can be picked to close gap regions or to span regions of interest for further interrogation.! ! before! after! The above example illustrates using end placements to pick clones to cover gaps. In the before image, there is a gap with a BAC clone spanning the gapped region according to their end placements (orange). In the subsequent assembly (after image above) with the clone sequenced, the unfinished clone places well in the region, as illustrated by the green clone overlaps.! Human! GRCh38, GRCh37pX (latest patch), NCBI36, CHM1_1.1, NA12878, HuREF, YH1/2.0. ! Zebrafish! Zv9, WGS28, WGS29, WGS31, ! z.2013.12.06, z.2014.03.14.! ! Mouse! GRCm38, GRCm38pX (latest patch), GRCm37B/C, NCBIm37, wgs_c57bl6j, wgs_celera, MGSCv3, m.2013.03.15.! Helminth! Echinococcus multilocularis! Schistosoma mansoni ! Stronglyoides ratti! Genome Reference Consortium! The Genome Reference Consortium (GRC) is a partnership between the Sanger Institute, NCBI, EBI and the Genome Institute at Wash U tasked with improving and providing accurate reference genomes. This includes releasing the reference assemblies of human, mouse and zebrafish. ! Pig! Sscrofa10.2! The red arrows highlights the incorrect orientation of these ryr1b gene split fosmid ends. ! on opposite strands! Clone end placements reveal sequence that can be placed in the gap region. Assembly reveals newly sequenced clone in path.!