SlideShare a Scribd company logo
10X Genomics
Novel variants and variant validation
September 2016
2
Partitioning to Linked Reads
1.0ng input
3
Linked read data
Confidential — Do not distribute
4
Unlinked, unphased short read SNP
5
Linked reads, phased SNP
6
Standard Short Read Alignment
Close Paralogs
Short Reads
Short Read Aligners Cannot Place Reads Correctly
7
Long Ranger – LariatTM Aligner
1. Confident mapping
provides anchors
2. Barcodes recruit short
reads into paralogous
loci
Close Paralogs
LariatTM Aligner Correctly Places Short Reads Even in
Paralogous Loci
Linked-Reads
8
Improved alignment leads to improved
variant calling
•SMN1 and SMN2: part of an inverted tandem duplication on chr5
–Differ by 8 nucleotides (3 exonic)
• SMN1: causative of spinal muscular atrophy
• SMN2: low function copy, not disease-causing
Haplotype 2 Reads
Haplotype 1 Reads
Standard
Genome
Chromium
Genome
SMN2
NA12878 WGS 128Gb
9
Inference
chr1
chr3
chr5
chr11
chr13
source
sink
• For every active alignment in the sink whose read has an alignment in the sink,
switch the alignment in the sink to active and score probabilistically. If the source
has few or no active alignments, the score goes up.
10
Inference
chr1
chr3
chr5
chr11
chr13
• This source is also now inactive.
source
sink
11
Inference
chr1
chr3
chr5
chr11
chr13
• Fast forward and we have the following active molecules left.
12
•Called by 10X data not in GIAB 3.2.2 (whole genome, not
restricted to confident regions)
•Validated with PacBio requiring > 2 alt alleles supported and
>15% allele fraction
•Of regions with PacBio coverage >=12, validation rates are 94%
for 10X and 89% for truseq.
Novel variants
10X Truseq Diff 10x
validated
Truseq
validated
Diff
SNPs 335k 292k 43k 289k 237k 52k
Deletions 76k 56k 20k 73k 54k 19k
Insertions 59k 43k 16k 58k 42k 16k
Total 470k 391k 79k 420k 333k 87k
13
• PacBio validation – align pac bio reads to reference then align them to the
reference with the alt allele in place of the reference allele. Only count as
support if one scores higher than the other.
Novel variant validation method
• Can we validate this validation
method
• Sensitivity of validation in confident
region
• Negative predictive value of
“random” mutations
• For SNPs, random is straight
forward (could include TI/TV
bias)
• For indels
• Pick length from geometric
distribution
• For deletions, the alt allele is
trivial
• For insertions, the alt allele
used is the bases in the
reference at that locus
repeated.
14
•Entire 10X team especially Patrick Marks and Deanna Church
•GIAB workshop organizers
1. Zheng, Grace XY, et al. "Haplotyping germline and cancer genomes with high-
throughput linked-read sequencing." Nature biotechnology (2016).
2. Samonte, Rhea Vallente, and Evan E. Eichler. "Segmental duplications and the
evolution of the primate genome." Nature Reviews Genetics 3.1 (2002): 65-72.
3. Bishara A et al. (2015) Read clouds uncover variation in complex regions of the
human genome. Genome Res, 25:1570-1580.
4. Li, Heng, and Richard Durbin. "Fast and accurate short read alignment with Burrows–
Wheeler transform." Bioinformatics 25.14 (2009): 1754-1760.
Acknowledgements and references
15
Addendum
16
SNP validation validation 
Confidential — Do not distribute
Used for
validation
17
Deletion validation validation
Confidential — Do not distribute
Used for
validation
18
Insertion validation validation
Confidential — Do not distribute
Used for
validation

More Related Content

What's hot

The key considerations of crispr genome editing
The key considerations of crispr genome editingThe key considerations of crispr genome editing
The key considerations of crispr genome editing
Chris Thorne
 
Genotyping by sequencing
Genotyping by sequencingGenotyping by sequencing
Genotyping by sequencing
Bhavya Sree
 
NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1
QIAGEN
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposalGenomeInABottle
 
Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...
Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...
Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...
QIAGEN
 
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Candy Smellie
 
Emergingroleo fmi rnainmedicalsciences
Emergingroleo fmi rnainmedicalsciencesEmergingroleo fmi rnainmedicalsciences
Emergingroleo fmi rnainmedicalsciences
karenbbs
 
GIAB GRC Workshop slides
GIAB GRC Workshop slidesGIAB GRC Workshop slides
GIAB GRC Workshop slides
GenomeInABottle
 
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceAug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceGenomeInABottle
 
Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...
QBiC_Tue
 
Jan2016 pac bio giab
Jan2016 pac bio giabJan2016 pac bio giab
Jan2016 pac bio giab
GenomeInABottle
 
Assembly and finishing
Assembly and finishingAssembly and finishing
Assembly and finishingNikolay Vyahhi
 
Clinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidanceClinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidance
Nikesh Shah
 
Genome editing & targeting tools
Genome editing & targeting toolsGenome editing & targeting tools
Genome editing & targeting tools
S Rasouli
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
GenomeInABottle
 
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific
 
Knowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsKnowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and Variants
Golden Helix Inc
 
Rna seq
Rna seq Rna seq
Rna seq
Amitha Dasari
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
QIAGEN
 

What's hot (20)

The key considerations of crispr genome editing
The key considerations of crispr genome editingThe key considerations of crispr genome editing
The key considerations of crispr genome editing
 
Genotyping by sequencing
Genotyping by sequencingGenotyping by sequencing
Genotyping by sequencing
 
NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal
 
Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...
Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...
Advanced NGS Data Analysis & Interpretation- BGW + IVA: NGS Tech Overview Web...
 
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
Resolving Ambiguity in Target ID Screens - CRISPR-Cas9 Based Essentiality Pro...
 
Emergingroleo fmi rnainmedicalsciences
Emergingroleo fmi rnainmedicalsciencesEmergingroleo fmi rnainmedicalsciences
Emergingroleo fmi rnainmedicalsciences
 
GIAB GRC Workshop slides
GIAB GRC Workshop slidesGIAB GRC Workshop slides
GIAB GRC Workshop slides
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceAug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
 
Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...
 
Jan2016 pac bio giab
Jan2016 pac bio giabJan2016 pac bio giab
Jan2016 pac bio giab
 
Assembly and finishing
Assembly and finishingAssembly and finishing
Assembly and finishing
 
Clinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidanceClinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidance
 
Genome editing & targeting tools
Genome editing & targeting toolsGenome editing & targeting tools
Genome editing & targeting tools
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
 
Knowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsKnowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and Variants
 
Rna seq
Rna seq Rna seq
Rna seq
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
 

Similar to Sept2016 smallvar 10_x

Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Reid Robison
 
Lecture 4.ppt
Lecture 4.pptLecture 4.ppt
Lecture 4.ppt
khadijarafique14
 
Catalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seqCatalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seq
Manjappa Ganiger
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingDayananda Salam
 
Sts
StsSts
Nextgenerationsequencing 120202015950-phpapp02
Nextgenerationsequencing 120202015950-phpapp02Nextgenerationsequencing 120202015950-phpapp02
Nextgenerationsequencing 120202015950-phpapp02t7260678
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
VHIR Vall d’Hebron Institut de Recerca
 
SNPs analysis methods
SNPs analysis methodsSNPs analysis methods
SNPs analysis methods
had89
 
Molecular marker by anil bl gather
Molecular marker by anil bl gatherMolecular marker by anil bl gather
Molecular marker by anil bl gather
ANIL BL GATHER
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
140127 GIAB update and NIST high-confidence calls
140127 GIAB update and NIST high-confidence calls140127 GIAB update and NIST high-confidence calls
140127 GIAB update and NIST high-confidence callsGenomeInABottle
 
Present status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptxPresent status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptx
PrabhatSingh628463
 
Arjun's Poster ACTUAL FINAL POSTER
Arjun's Poster ACTUAL FINAL POSTERArjun's Poster ACTUAL FINAL POSTER
Arjun's Poster ACTUAL FINAL POSTERArjun Mahadevan
 
2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies
FOODCROPS
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
GenomeInABottle
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GenomeInABottle
 
Mapping and quantifying transcripts.pdf
Mapping and quantifying transcripts.pdfMapping and quantifying transcripts.pdf
Mapping and quantifying transcripts.pdf
Kristu Jayanti College
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
Yaoyu Wang
 
Whole Transcriptome Amplfication from Single Cell
Whole Transcriptome Amplfication from Single CellWhole Transcriptome Amplfication from Single Cell
Whole Transcriptome Amplfication from Single Cell
QIAGEN
 
Human genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traitsHuman genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traits
groovescience
 

Similar to Sept2016 smallvar 10_x (20)

Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
 
Lecture 4.ppt
Lecture 4.pptLecture 4.ppt
Lecture 4.ppt
 
Catalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seqCatalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seq
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Sts
StsSts
Sts
 
Nextgenerationsequencing 120202015950-phpapp02
Nextgenerationsequencing 120202015950-phpapp02Nextgenerationsequencing 120202015950-phpapp02
Nextgenerationsequencing 120202015950-phpapp02
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
SNPs analysis methods
SNPs analysis methodsSNPs analysis methods
SNPs analysis methods
 
Molecular marker by anil bl gather
Molecular marker by anil bl gatherMolecular marker by anil bl gather
Molecular marker by anil bl gather
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
140127 GIAB update and NIST high-confidence calls
140127 GIAB update and NIST high-confidence calls140127 GIAB update and NIST high-confidence calls
140127 GIAB update and NIST high-confidence calls
 
Present status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptxPresent status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptx
 
Arjun's Poster ACTUAL FINAL POSTER
Arjun's Poster ACTUAL FINAL POSTERArjun's Poster ACTUAL FINAL POSTER
Arjun's Poster ACTUAL FINAL POSTER
 
2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Mapping and quantifying transcripts.pdf
Mapping and quantifying transcripts.pdfMapping and quantifying transcripts.pdf
Mapping and quantifying transcripts.pdf
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 
Whole Transcriptome Amplfication from Single Cell
Whole Transcriptome Amplfication from Single CellWhole Transcriptome Amplfication from Single Cell
Whole Transcriptome Amplfication from Single Cell
 
Human genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traitsHuman genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traits
 

More from GenomeInABottle

2023 GIAB AMP Update
2023 GIAB AMP Update2023 GIAB AMP Update
2023 GIAB AMP Update
GenomeInABottle
 
GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023
GenomeInABottle
 
Stratomod ASHG 2023
Stratomod ASHG 2023Stratomod ASHG 2023
Stratomod ASHG 2023
GenomeInABottle
 
GIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdfGIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdf
GenomeInABottle
 
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
GenomeInABottle
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
GenomeInABottle
 
Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
GenomeInABottle
 
GIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussionGIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussion
GenomeInABottle
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020
GenomeInABottle
 
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GHGa4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
GenomeInABottle
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
GenomeInABottle
 
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATKGIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GenomeInABottle
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant poster
GenomeInABottle
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GenomeInABottle
 
Jason Chin MHC diploid assembly
Jason Chin MHC diploid assemblyJason Chin MHC diploid assembly
Jason Chin MHC diploid assembly
GenomeInABottle
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015
GenomeInABottle
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
GenomeInABottle
 
GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417
GenomeInABottle
 
New methods diploid assembly with graphs
New methods   diploid assembly with graphsNew methods   diploid assembly with graphs
New methods diploid assembly with graphs
GenomeInABottle
 
How giab fits in the rest of the world seqc2 tumor normal
How giab fits in the rest of the world   seqc2 tumor normalHow giab fits in the rest of the world   seqc2 tumor normal
How giab fits in the rest of the world seqc2 tumor normal
GenomeInABottle
 

More from GenomeInABottle (20)

2023 GIAB AMP Update
2023 GIAB AMP Update2023 GIAB AMP Update
2023 GIAB AMP Update
 
GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023
 
Stratomod ASHG 2023
Stratomod ASHG 2023Stratomod ASHG 2023
Stratomod ASHG 2023
 
GIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdfGIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdf
 
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
 
Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
 
GIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussionGIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussion
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020
 
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GHGa4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
 
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATKGIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant poster
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
 
Jason Chin MHC diploid assembly
Jason Chin MHC diploid assemblyJason Chin MHC diploid assembly
Jason Chin MHC diploid assembly
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417
 
New methods diploid assembly with graphs
New methods   diploid assembly with graphsNew methods   diploid assembly with graphs
New methods diploid assembly with graphs
 
How giab fits in the rest of the world seqc2 tumor normal
How giab fits in the rest of the world   seqc2 tumor normalHow giab fits in the rest of the world   seqc2 tumor normal
How giab fits in the rest of the world seqc2 tumor normal
 

Recently uploaded

Vision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of opticsVision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of optics
Sai Sailesh Kumar Goothy
 
Pharma Pcd Franchise in Jharkhand - Yodley Lifesciences
Pharma Pcd Franchise in Jharkhand - Yodley LifesciencesPharma Pcd Franchise in Jharkhand - Yodley Lifesciences
Pharma Pcd Franchise in Jharkhand - Yodley Lifesciences
Yodley Lifesciences
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
NephroTube - Dr.Gawad
 
KDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologistsKDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologists
د.محمود نجيب
 
Dehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in Dehradun
Dehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in DehradunDehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in Dehradun
Dehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in Dehradun
chandankumarsmartiso
 
Ocular injury ppt Upendra pal optometrist upums saifai etawah
Ocular injury  ppt  Upendra pal  optometrist upums saifai etawahOcular injury  ppt  Upendra pal  optometrist upums saifai etawah
Ocular injury ppt Upendra pal optometrist upums saifai etawah
pal078100
 
Best Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and IndigestionBest Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and Indigestion
SwastikAyurveda
 
Top-Vitamin-Supplement-Brands-in-India.pptx
Top-Vitamin-Supplement-Brands-in-India.pptxTop-Vitamin-Supplement-Brands-in-India.pptx
Top-Vitamin-Supplement-Brands-in-India.pptx
SwisschemDerma
 
Physiology of Chemical Sensation of smell.pdf
Physiology of Chemical Sensation of smell.pdfPhysiology of Chemical Sensation of smell.pdf
Physiology of Chemical Sensation of smell.pdf
MedicoseAcademics
 
Role of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of HyperthyroidismRole of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of Hyperthyroidism
Dr. Jyothirmai Paindla
 
Gram Stain introduction, principle, Procedure
Gram Stain introduction, principle, ProcedureGram Stain introduction, principle, Procedure
Gram Stain introduction, principle, Procedure
Suraj Goswami
 
SURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptx
SURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptxSURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptx
SURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptx
Bright Chipili
 
Effective-Soaps-for-Fungal-Skin-Infections.pptx
Effective-Soaps-for-Fungal-Skin-Infections.pptxEffective-Soaps-for-Fungal-Skin-Infections.pptx
Effective-Soaps-for-Fungal-Skin-Infections.pptx
SwisschemDerma
 
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptxPharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Dr. Rabia Inam Gandapore
 
Pictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdfPictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdf
Dr. Rabia Inam Gandapore
 
Ophthalmology Clinical Tests for OSCE exam
Ophthalmology Clinical Tests for OSCE examOphthalmology Clinical Tests for OSCE exam
Ophthalmology Clinical Tests for OSCE exam
KafrELShiekh University
 
CDSCO and Phamacovigilance {Regulatory body in India}
CDSCO and Phamacovigilance {Regulatory body in India}CDSCO and Phamacovigilance {Regulatory body in India}
CDSCO and Phamacovigilance {Regulatory body in India}
NEHA GUPTA
 
Cardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdfCardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdf
shivalingatalekar1
 
263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,
sisternakatoto
 
A Classical Text Review on Basavarajeeyam
A Classical Text Review on BasavarajeeyamA Classical Text Review on Basavarajeeyam
A Classical Text Review on Basavarajeeyam
Dr. Jyothirmai Paindla
 

Recently uploaded (20)

Vision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of opticsVision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of optics
 
Pharma Pcd Franchise in Jharkhand - Yodley Lifesciences
Pharma Pcd Franchise in Jharkhand - Yodley LifesciencesPharma Pcd Franchise in Jharkhand - Yodley Lifesciences
Pharma Pcd Franchise in Jharkhand - Yodley Lifesciences
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
 
KDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologistsKDIGO 2024 guidelines for diabetologists
KDIGO 2024 guidelines for diabetologists
 
Dehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in Dehradun
Dehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in DehradunDehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in Dehradun
Dehradun #ℂall #gIRLS Oyo Hotel 9719300533 #ℂall #gIRL in Dehradun
 
Ocular injury ppt Upendra pal optometrist upums saifai etawah
Ocular injury  ppt  Upendra pal  optometrist upums saifai etawahOcular injury  ppt  Upendra pal  optometrist upums saifai etawah
Ocular injury ppt Upendra pal optometrist upums saifai etawah
 
Best Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and IndigestionBest Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and Indigestion
 
Top-Vitamin-Supplement-Brands-in-India.pptx
Top-Vitamin-Supplement-Brands-in-India.pptxTop-Vitamin-Supplement-Brands-in-India.pptx
Top-Vitamin-Supplement-Brands-in-India.pptx
 
Physiology of Chemical Sensation of smell.pdf
Physiology of Chemical Sensation of smell.pdfPhysiology of Chemical Sensation of smell.pdf
Physiology of Chemical Sensation of smell.pdf
 
Role of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of HyperthyroidismRole of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of Hyperthyroidism
 
Gram Stain introduction, principle, Procedure
Gram Stain introduction, principle, ProcedureGram Stain introduction, principle, Procedure
Gram Stain introduction, principle, Procedure
 
SURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptx
SURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptxSURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptx
SURGICAL ANATOMY OF THE RETROPERITONEUM, ADRENALS, KIDNEYS AND URETERS.pptx
 
Effective-Soaps-for-Fungal-Skin-Infections.pptx
Effective-Soaps-for-Fungal-Skin-Infections.pptxEffective-Soaps-for-Fungal-Skin-Infections.pptx
Effective-Soaps-for-Fungal-Skin-Infections.pptx
 
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptxPharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
 
Pictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdfPictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdf
 
Ophthalmology Clinical Tests for OSCE exam
Ophthalmology Clinical Tests for OSCE examOphthalmology Clinical Tests for OSCE exam
Ophthalmology Clinical Tests for OSCE exam
 
CDSCO and Phamacovigilance {Regulatory body in India}
CDSCO and Phamacovigilance {Regulatory body in India}CDSCO and Phamacovigilance {Regulatory body in India}
CDSCO and Phamacovigilance {Regulatory body in India}
 
Cardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdfCardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdf
 
263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,
 
A Classical Text Review on Basavarajeeyam
A Classical Text Review on BasavarajeeyamA Classical Text Review on Basavarajeeyam
A Classical Text Review on Basavarajeeyam
 

Sept2016 smallvar 10_x

  • 1. 10X Genomics Novel variants and variant validation September 2016
  • 2. 2 Partitioning to Linked Reads 1.0ng input
  • 3. 3 Linked read data Confidential — Do not distribute
  • 6. 6 Standard Short Read Alignment Close Paralogs Short Reads Short Read Aligners Cannot Place Reads Correctly
  • 7. 7 Long Ranger – LariatTM Aligner 1. Confident mapping provides anchors 2. Barcodes recruit short reads into paralogous loci Close Paralogs LariatTM Aligner Correctly Places Short Reads Even in Paralogous Loci Linked-Reads
  • 8. 8 Improved alignment leads to improved variant calling •SMN1 and SMN2: part of an inverted tandem duplication on chr5 –Differ by 8 nucleotides (3 exonic) • SMN1: causative of spinal muscular atrophy • SMN2: low function copy, not disease-causing Haplotype 2 Reads Haplotype 1 Reads Standard Genome Chromium Genome SMN2 NA12878 WGS 128Gb
  • 9. 9 Inference chr1 chr3 chr5 chr11 chr13 source sink • For every active alignment in the sink whose read has an alignment in the sink, switch the alignment in the sink to active and score probabilistically. If the source has few or no active alignments, the score goes up.
  • 10. 10 Inference chr1 chr3 chr5 chr11 chr13 • This source is also now inactive. source sink
  • 11. 11 Inference chr1 chr3 chr5 chr11 chr13 • Fast forward and we have the following active molecules left.
  • 12. 12 •Called by 10X data not in GIAB 3.2.2 (whole genome, not restricted to confident regions) •Validated with PacBio requiring > 2 alt alleles supported and >15% allele fraction •Of regions with PacBio coverage >=12, validation rates are 94% for 10X and 89% for truseq. Novel variants 10X Truseq Diff 10x validated Truseq validated Diff SNPs 335k 292k 43k 289k 237k 52k Deletions 76k 56k 20k 73k 54k 19k Insertions 59k 43k 16k 58k 42k 16k Total 470k 391k 79k 420k 333k 87k
  • 13. 13 • PacBio validation – align pac bio reads to reference then align them to the reference with the alt allele in place of the reference allele. Only count as support if one scores higher than the other. Novel variant validation method • Can we validate this validation method • Sensitivity of validation in confident region • Negative predictive value of “random” mutations • For SNPs, random is straight forward (could include TI/TV bias) • For indels • Pick length from geometric distribution • For deletions, the alt allele is trivial • For insertions, the alt allele used is the bases in the reference at that locus repeated.
  • 14. 14 •Entire 10X team especially Patrick Marks and Deanna Church •GIAB workshop organizers 1. Zheng, Grace XY, et al. "Haplotyping germline and cancer genomes with high- throughput linked-read sequencing." Nature biotechnology (2016). 2. Samonte, Rhea Vallente, and Evan E. Eichler. "Segmental duplications and the evolution of the primate genome." Nature Reviews Genetics 3.1 (2002): 65-72. 3. Bishara A et al. (2015) Read clouds uncover variation in complex regions of the human genome. Genome Res, 25:1570-1580. 4. Li, Heng, and Richard Durbin. "Fast and accurate short read alignment with Burrows– Wheeler transform." Bioinformatics 25.14 (2009): 1754-1760. Acknowledgements and references
  • 16. 16 SNP validation validation  Confidential — Do not distribute Used for validation
  • 17. 17 Deletion validation validation Confidential — Do not distribute Used for validation
  • 18. 18 Insertion validation validation Confidential — Do not distribute Used for validation