SlideShare a Scribd company logo
1 of 15
Methods for improved SV
detection and interpretation
Fritz J Sedlazeck
Schatz group @ CSHL
Breast Cancer
75x PacBio coverage (50x >10kbp)
Nattestad et al, In preparation
Breast Cancer Pipeline
PacBio Sequencing
Alignment with
BWA-MEM
Copy number
analysis
SV-calling with
Lumpy
Graphical genome
threading analysis
Assembly with
Falcon on
DNAnexus
Alignment with
MUMmer
(nucmer)
Call variants
between
consecutive
alignments
Call variants within
alignments
IsoSeq
transcriptome
analysis
...
Detailed analysis of
Her2
amplifications
Validation using
PCR.
Illumina
sequencing
What we learned
Illumina:
– Nested SV are hard to detect
– Problems with repetitive regions
PacBio:
– Tools are not yet as accurate
– Erroneous alignments hide SV
Assembly vs. mapping
– Detection of more SV over assembly
– Assembly misses low frequent SVs
– Comparing the assembly to the genome is still challenging
What we can offer
• Improve alignments:
– NextGenMap-LR*
• Improve SV detection:
– Survivor (illumina)
– Sniffles (PacBio)*
– Assembly guided detection (PacBio)
• Interpretation of the SV:
– SplitThreader*
NextGenMap-LR
• Novel algorithms to make mapper aware
of SV region
– Builds on very fast and robust NextGenMap
algorithms (Sedlazeck et al. Bioinformatics. 2013)
• Major benefits from pairwise alignment with
improved scoring functions
– Accounts for long (SV) and smaller (error) gaps
– SVs are better represented, cleaner breakpoints
NextGenMap-LR
BWA-MEM NextGenMap-LR
Very stable deletion calls
NextGenMap-LR
BWA-MEM NextGenMap-LR
Very stable insertion call
Sniffles
• Predicts: deletion, duplication, insertion,
inversion, translocations and nested SV
• Designed to overcome the problems of
current SV callers.
• Scans for erroneous regions in the alignment
and predict if they overlap.
Sniffles
Sniffles (Inversion)
SplitThreader
• Graph based tool that uses SV calls.
• Identifies the most likely paths that forms the
chromosomes.
50 Mb
Chr 8
PacBio
PacBio
Her2
Chr 17
RARA
PKIA
GSDMB
TATDN1
Cancer lesion reconstruction from genomic threads
How we could contribute
1. Calling SV using PacBio
2. Calling SV using Illumina
3. Interpretation of Structural Variants
Thanks to:
Schatz group + McCombie group + Hicks
group + OICR + PacBio + DNAnexus

More Related Content

More from GenomeInABottle

More from GenomeInABottle (20)

Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
 
GIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussionGIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussion
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
 
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GHGa4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
 
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATKGIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant poster
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
 
Jason Chin MHC diploid assembly
Jason Chin MHC diploid assemblyJason Chin MHC diploid assembly
Jason Chin MHC diploid assembly
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 
GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417
 
New methods diploid assembly with graphs
New methods   diploid assembly with graphsNew methods   diploid assembly with graphs
New methods diploid assembly with graphs
 
How giab fits in the rest of the world seqc2 tumor normal
How giab fits in the rest of the world   seqc2 tumor normalHow giab fits in the rest of the world   seqc2 tumor normal
How giab fits in the rest of the world seqc2 tumor normal
 
New data from giab genomes pacbio ccs
New data from giab genomes   pacbio ccsNew data from giab genomes   pacbio ccs
New data from giab genomes pacbio ccs
 
New data from giab genomes strand-seq
New data from giab genomes   strand-seqNew data from giab genomes   strand-seq
New data from giab genomes strand-seq
 
New data from giab genomes promethion
New data from giab genomes   promethionNew data from giab genomes   promethion
New data from giab genomes promethion
 

Recently uploaded

Physiologic Anatomy of Heart_AntiCopy.pdf
Physiologic Anatomy of Heart_AntiCopy.pdfPhysiologic Anatomy of Heart_AntiCopy.pdf
Physiologic Anatomy of Heart_AntiCopy.pdf
MedicoseAcademics
 
VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...
VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...
VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...
JRRolfNeuqelet
 
Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan 081901222272 Obat Penggugur Kandu...
Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan  081901222272 Obat Penggugur Kandu...Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan  081901222272 Obat Penggugur Kandu...
Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan 081901222272 Obat Penggugur Kandu...
Halo Docter
 
Difference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac MusclesDifference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac Muscles
MedicoseAcademics
 
Connective Tissue II - Dr Muhammad Ali Rabbani - Medicose Academics
Connective Tissue II - Dr Muhammad Ali Rabbani - Medicose AcademicsConnective Tissue II - Dr Muhammad Ali Rabbani - Medicose Academics
Connective Tissue II - Dr Muhammad Ali Rabbani - Medicose Academics
MedicoseAcademics
 
Failure to thrive in neonates and infants + pediatric case.pptx
Failure to thrive in neonates and infants  + pediatric case.pptxFailure to thrive in neonates and infants  + pediatric case.pptx
Failure to thrive in neonates and infants + pediatric case.pptx
claviclebrown44
 

Recently uploaded (20)

Physiologic Anatomy of Heart_AntiCopy.pdf
Physiologic Anatomy of Heart_AntiCopy.pdfPhysiologic Anatomy of Heart_AntiCopy.pdf
Physiologic Anatomy of Heart_AntiCopy.pdf
 
Treatment Choices for Slip Disc at Gokuldas Hospital
Treatment Choices for Slip Disc at Gokuldas HospitalTreatment Choices for Slip Disc at Gokuldas Hospital
Treatment Choices for Slip Disc at Gokuldas Hospital
 
High Purity 99% PMK Ethyl Glycidate Powder CAS 28578-16-7
High Purity 99% PMK Ethyl Glycidate Powder CAS 28578-16-7High Purity 99% PMK Ethyl Glycidate Powder CAS 28578-16-7
High Purity 99% PMK Ethyl Glycidate Powder CAS 28578-16-7
 
parliaments-for-health-security_RecordOfAchievement.pdf
parliaments-for-health-security_RecordOfAchievement.pdfparliaments-for-health-security_RecordOfAchievement.pdf
parliaments-for-health-security_RecordOfAchievement.pdf
 
BURNS (CLASSIFICATION & MANAGEMENTS).pdf
BURNS (CLASSIFICATION & MANAGEMENTS).pdfBURNS (CLASSIFICATION & MANAGEMENTS).pdf
BURNS (CLASSIFICATION & MANAGEMENTS).pdf
 
TEST BANK For Nursing Leadership, Management, and Professional Practice for t...
TEST BANK For Nursing Leadership, Management, and Professional Practice for t...TEST BANK For Nursing Leadership, Management, and Professional Practice for t...
TEST BANK For Nursing Leadership, Management, and Professional Practice for t...
 
Benefits of Chanting Hanuman Chalisa .pdf
Benefits of Chanting Hanuman Chalisa .pdfBenefits of Chanting Hanuman Chalisa .pdf
Benefits of Chanting Hanuman Chalisa .pdf
 
VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...
VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...
VIII.1 Nursing Interventions to Promote Healthy Psychological responses, SELF...
 
Anti viral drug pharmacology classification
Anti viral drug pharmacology classificationAnti viral drug pharmacology classification
Anti viral drug pharmacology classification
 
Signs It’s Time for Physiotherapy Sessions Prioritizing Wellness
Signs It’s Time for Physiotherapy Sessions Prioritizing WellnessSigns It’s Time for Physiotherapy Sessions Prioritizing Wellness
Signs It’s Time for Physiotherapy Sessions Prioritizing Wellness
 
Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan 081901222272 Obat Penggugur Kandu...
Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan  081901222272 Obat Penggugur Kandu...Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan  081901222272 Obat Penggugur Kandu...
Obat Aborsi Ampuh Usia 1,2,3,4,5,6,7 Bulan 081901222272 Obat Penggugur Kandu...
 
Difference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac MusclesDifference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac Muscles
 
Connective Tissue II - Dr Muhammad Ali Rabbani - Medicose Academics
Connective Tissue II - Dr Muhammad Ali Rabbani - Medicose AcademicsConnective Tissue II - Dr Muhammad Ali Rabbani - Medicose Academics
Connective Tissue II - Dr Muhammad Ali Rabbani - Medicose Academics
 
Top 10 Most Beautiful Russian Pornstars List 2024
Top 10 Most Beautiful Russian Pornstars List 2024Top 10 Most Beautiful Russian Pornstars List 2024
Top 10 Most Beautiful Russian Pornstars List 2024
 
JOURNAL CLUB PRESENTATION TEMPLATE DOCUMENT
JOURNAL CLUB PRESENTATION TEMPLATE DOCUMENTJOURNAL CLUB PRESENTATION TEMPLATE DOCUMENT
JOURNAL CLUB PRESENTATION TEMPLATE DOCUMENT
 
CONGENITAL HYPERTROPHIC PYLORIC STENOSIS by Dr M.KARTHIK EMMANUEL
CONGENITAL HYPERTROPHIC PYLORIC STENOSIS  by Dr M.KARTHIK EMMANUELCONGENITAL HYPERTROPHIC PYLORIC STENOSIS  by Dr M.KARTHIK EMMANUEL
CONGENITAL HYPERTROPHIC PYLORIC STENOSIS by Dr M.KARTHIK EMMANUEL
 
Face and Muscles of facial expression.pptx
Face and Muscles of facial expression.pptxFace and Muscles of facial expression.pptx
Face and Muscles of facial expression.pptx
 
Unveiling Alcohol Withdrawal Syndrome: exploring it's hidden depths
Unveiling Alcohol Withdrawal Syndrome: exploring it's hidden depthsUnveiling Alcohol Withdrawal Syndrome: exploring it's hidden depths
Unveiling Alcohol Withdrawal Syndrome: exploring it's hidden depths
 
Failure to thrive in neonates and infants + pediatric case.pptx
Failure to thrive in neonates and infants  + pediatric case.pptxFailure to thrive in neonates and infants  + pediatric case.pptx
Failure to thrive in neonates and infants + pediatric case.pptx
 
SEMESTER-V CHILD HEALTH NURSING-UNIT-1-INTRODUCTION.pdf
SEMESTER-V CHILD HEALTH NURSING-UNIT-1-INTRODUCTION.pdfSEMESTER-V CHILD HEALTH NURSING-UNIT-1-INTRODUCTION.pdf
SEMESTER-V CHILD HEALTH NURSING-UNIT-1-INTRODUCTION.pdf
 

Aug2015 analysis team 07 fritz and schatz pac_bio sv

  • 1. Methods for improved SV detection and interpretation Fritz J Sedlazeck Schatz group @ CSHL
  • 2. Breast Cancer 75x PacBio coverage (50x >10kbp) Nattestad et al, In preparation
  • 3. Breast Cancer Pipeline PacBio Sequencing Alignment with BWA-MEM Copy number analysis SV-calling with Lumpy Graphical genome threading analysis Assembly with Falcon on DNAnexus Alignment with MUMmer (nucmer) Call variants between consecutive alignments Call variants within alignments IsoSeq transcriptome analysis ... Detailed analysis of Her2 amplifications Validation using PCR. Illumina sequencing
  • 4. What we learned Illumina: – Nested SV are hard to detect – Problems with repetitive regions PacBio: – Tools are not yet as accurate – Erroneous alignments hide SV Assembly vs. mapping – Detection of more SV over assembly – Assembly misses low frequent SVs – Comparing the assembly to the genome is still challenging
  • 5. What we can offer • Improve alignments: – NextGenMap-LR* • Improve SV detection: – Survivor (illumina) – Sniffles (PacBio)* – Assembly guided detection (PacBio) • Interpretation of the SV: – SplitThreader*
  • 6. NextGenMap-LR • Novel algorithms to make mapper aware of SV region – Builds on very fast and robust NextGenMap algorithms (Sedlazeck et al. Bioinformatics. 2013) • Major benefits from pairwise alignment with improved scoring functions – Accounts for long (SV) and smaller (error) gaps – SVs are better represented, cleaner breakpoints
  • 9. Sniffles • Predicts: deletion, duplication, insertion, inversion, translocations and nested SV • Designed to overcome the problems of current SV callers. • Scans for erroneous regions in the alignment and predict if they overlap.
  • 12. SplitThreader • Graph based tool that uses SV calls. • Identifies the most likely paths that forms the chromosomes.
  • 13. 50 Mb Chr 8 PacBio PacBio Her2 Chr 17 RARA PKIA GSDMB TATDN1
  • 14. Cancer lesion reconstruction from genomic threads
  • 15. How we could contribute 1. Calling SV using PacBio 2. Calling SV using Illumina 3. Interpretation of Structural Variants Thanks to: Schatz group + McCombie group + Hicks group + OICR + PacBio + DNAnexus

Editor's Notes

  1. Cancer in the tube  vs. Genome in the bottle
  2. * Things I will talk about today.
  3. NOTE: All results use only PacBio read alignments with the highest mapping quality of 60
  4. 8 121550937 121550938 17 37277042 37277043 1609 0 - - INTERCHROM 423 458 396 791 23 25 421 8 125549098 125549099 17 38067674 38067675 1611 3.65832e-206 - + INTERCHROM 251 195 245 448 478 241 241 8 79475432 79475433 17 38474392 38474393 1612 8.48245e-07 - + INTERCHROM 125 109 117 244 242 127 120 8 80086762 80086763 17 39542958 39542959 1613 2.11836e-48 + + INTERCHROM 113 425 328 352 169 129 124 8 117989173 117989174 17 40572276 40572277 1615 inf - + INTERCHROM 77 157 148 228 79 14 21