SlideShare a Scribd company logo
1 of 24
1
Overview
Real Time Single Molecule
Sequencing
• Mechanism/Motivation
• Computational Challenges
• Example: Sequence Alignment,
probabilistic model
Sequencing on a Chip
• Mechanism/Motivation
• Computation Challenges
• Example: Sequence Alignment,
probabilistic model
2
Sequencing DNA
3
P
G
C
A
T
P
P
A
P
G
T
Polymerase
Patient’s DNA sequence
4
P
G
C
A
T
P
P
A
P
G
Polymerase
Patient’s DNA sequence
Real Time Single Molecule Sequencing
Quantum
dot
FRET
FRET: fluorescence resonance
energy transfer
-> Energy is transferred from
quantum dot to dye, resulting
in a color shift
Emitted light
Glass plate
T
Raw time series of one molecule (live)
5
Time (seconds)
T A A G G
Motivation
• 5000 bp single reads or more
• Multiple polymerase on one strand of DNA
-> ultra long reads of up to 250’000 bp
• (It’s very cool :-)
6
Computational Challenges in Sequencing
BaseCalling
• Determine the sequence of G, A, T, C
-> Signal detection (noise)
-> Quality of base call
Mapping
• Locate billion(s) of sequences on genome
-> Efficient mapping strategies
-> Dealing with errors in reads
SNP/CNV
Detection
• Does the patient have cancer?
-> (Multiple) sequence alignment
-> correct probabilistic model is critical
7
T A C G T A C G T C T G A G C A
“Reads”
“Sequences”
Assembled
genome
Genotype,
CNV, SNP
Maximum likelihood alignment:
Dynamic programming
Confidential and Proprietary—DO NOT
DUPLICATE
8
G A A G T A
G
A
G
A
A
Reference sequence
R
e
a
d missed base call
mismatch
missed base call
NSB
Correct model
and transition
probabilities
are crucial
9
Simulating Reads: Polymerase Insertion Events
Polymerase: event detection
• For a detection limit of
10ms, we will miss ~ 7.5%
of all insertion events
• To see more events we
could:
– Slow down polymerase
insertion rate
– Lower insertion detection
limit (increase frame rate
and lower noise)
Does not include blinking
10
Detection Limit % Events Missed
5ms 3.9%
10ms 7.5%
20ms 18%
30ms ~30%
11
Quantum Dot Blinking: Power law
Sub-Sampled
timeseries
“Real”BlinkingSignal
 Light dt·dpix “on”
“off” Bell Curve
Power Law
Normal Distribution Power Law
Finite mean/variance Infinite mean/variance possible
Height/Weight distribution of people Financials: Stock market, Foreign exchange rates
IQ of people Hedge fund risks
Roulette, Blackjack etc File sizes, download times, city sizes, book sales
StarLight parametersModel Parameters
Challenges of SMS
• Enzyme kinetics (too fast to see,
branching ratio)
• Quantum dot blinking: random
switching between ON and OFF states
common to nanoscale emitters ->
power law
• Photobleaching (destruction of dot)
• Single molecule -> stochastic behavior
13
Accuracy
Sequencing on a Semiconductor Chip:
The Chip is the Machine™
Ion Torrent
Sensor Plate
Silicon Substrate
Drain SourceBulk
∆ pH
∆ V
Sensing Layer
H+
DNA
on bead
H+
Sensor Plate
Silicon Substrate
Drain SourceBulk
∆ pH
∆ V
Sensing Layer
H+
15
• Natural chemistry: natural polymerase and nucleotides
• Fast: one PGM run 1.5-7.5 hours, entire workflow 8-23 hours
• Cost (50k for a PGM, 300.- for a sample with chip)
• Simplicity: direct measurement, no cameras involved -> high accuracy
• Read length: approx. 400 bp
P
G
C
A
T
P
P
A
P
G
T
Patient’s DNA sequence
Method/Motivation
DNA
Raw signal of one base call
Incorporations add hydrogen (dV):
• one nucleotide
• two nucelotides
• three nucelotides
• four nucleotides
16
Flowgram of one read
17
T A C G T A C G T C T G A G C A
Flow 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
Nr bases inserted
1
2
T
2T
C CA AGG
Dynamic programming
Confidential and Proprietary—DO NOT
DUPLICATE
18
Flow 1
A
Flow 2
G
Flow 3
T
Flow 4
C
Flow 5
A
Flow 6
T
Base 1
A
Base 2
T
Base 3
T
Base 4
A
carry forward
Incomplete
extension
Read
R
e
f
e
r
e
n
c
e
Correct model
and transition
probabilities
are crucial
Phasing Parameter Typical Values Description
Carry Forward 0.2% - 1.0%
% of polymerases that will
incorporate when they shouldn’t
Incomplete Extension 0.2% - 1.0%
% of polymerases that will
not incorporate when they should
Droop 0.0% - 0.3%
% of polymerases that will
stop working
Phasing Model
We have one bead per well and there a many copies of DNA on a bead.
During a run the DNA copies get out of synch with each other
19
20
Temporal model:
When will the polymerase incorporate?
Histogram of flows during which
6th base (T) is incorporated
T A C G T A C G T C T G A G C A T C G A
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Flows (“time”)
A C G T C T G A G C A T C G A T C G A T G T A C
102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125
Histogram of flows during which
50th base (A) is incorporated
In phase
In phase
Goal of base calling (in principle)
21 Confidential and Proprietary—DO NOT DUPLICATE
T C A G T T G A C TFind
the base sequence
for which
the predicted flogram
is most similar to
the observed flogram
T A C G T A C G T C T G A G C A
T A C G T A C G T C T G A G C A
Phasing model
Least squares
Regions, Exomes, Genomes and Beyond
Small Genome
Transcriptome
100M
1G
10G
SequenceOutputperRun
100G
Exome
Small to Large Gene Panels
10M
Ion 316™
Ion 314
Ion 318™
PII
Human
Genome
PI
PIII
from 1.2 Million Sensors……… to 1.2 Billion
22
About me…
Detection of propylthiouracil
(coffee, cabbage, grapefruit, green tea)
Muscle performance (ACTN3)
23
FAQ on SMS
24
• Quantum dot: nanocrystal made of semiconductor
materials small enough to exhibit quantum mechanical
properties. Exitons (electron/hole pair) are confined in all
spacial dimensions. The frequency of emitted light
increases as the size of the dot decreases
• Blinking: caused by intercepted electrons, or by emitting
electron. On/off times follow power law.
• FRET: energy transfer without emission of photon.
Efficiency is inverse of 6th power of distance (10-100 Å).
Spectra must overlap

More Related Content

What's hot

Reducing off-target events in CRISPR genome editing applications with a novel...
Reducing off-target events in CRISPR genome editing applications with a novel...Reducing off-target events in CRISPR genome editing applications with a novel...
Reducing off-target events in CRISPR genome editing applications with a novel...Integrated DNA Technologies
 
Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...
Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...
Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...Integrated DNA Technologies
 
Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...
Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...
Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...Integrated DNA Technologies
 
rhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotyping
rhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotypingrhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotyping
rhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotypingIntegrated DNA Technologies
 
RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...
RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...
RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...Integrated DNA Technologies
 
INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...
INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...
INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...Johann Marquez-Barja
 
Cpf1-based genome editing using ribonucleoprotein complexes
Cpf1-based genome editing using ribonucleoprotein complexesCpf1-based genome editing using ribonucleoprotein complexes
Cpf1-based genome editing using ribonucleoprotein complexesIntegrated DNA Technologies
 
qPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific ApplicationsqPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific ApplicationsIntegrated DNA Technologies
 
Anne_Vaittinen_advanced_seminar_presentation
Anne_Vaittinen_advanced_seminar_presentationAnne_Vaittinen_advanced_seminar_presentation
Anne_Vaittinen_advanced_seminar_presentationAnne Vaittinen
 
Uncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMD
Uncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMDUncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMD
Uncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMDCan Ozdoruk
 
BB_NCBI_PAG_2019_Workshop
BB_NCBI_PAG_2019_WorkshopBB_NCBI_PAG_2019_Workshop
BB_NCBI_PAG_2019_WorkshopBen Busby
 
Polymerase chain reaction (PCR)
Polymerase chain reaction (PCR)Polymerase chain reaction (PCR)
Polymerase chain reaction (PCR)Mohammed Esawie
 
Daly altshuler.labmeeting
Daly altshuler.labmeetingDaly altshuler.labmeeting
Daly altshuler.labmeetingManuel Rivas
 
resampling techniques in machine learning
resampling techniques in machine learningresampling techniques in machine learning
resampling techniques in machine learningHamid Shekarforoush
 

What's hot (20)

Reducing off-target events in CRISPR genome editing applications with a novel...
Reducing off-target events in CRISPR genome editing applications with a novel...Reducing off-target events in CRISPR genome editing applications with a novel...
Reducing off-target events in CRISPR genome editing applications with a novel...
 
Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...
Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...
Increase efficiency of genome editing with the Alt-R™ CRISPR-Cas9 System: Des...
 
20140711 3 t_clark_ercc2.0_workshop
20140711 3 t_clark_ercc2.0_workshop20140711 3 t_clark_ercc2.0_workshop
20140711 3 t_clark_ercc2.0_workshop
 
Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...
Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...
Ribonucleoprotein delivery of CRISPR-Cas9 reagents for increased gene editing...
 
rhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotyping
rhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotypingrhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotyping
rhAmp™ SNP Genotyping: A novel approach for improving PCR-based SNP genotyping
 
RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...
RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...
RNase H2 PCR—A New Technology to Reduce Primer Dimers and Increase Genotyping...
 
INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...
INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...
INFOCOM CNERT 2018 - The CityLab Testbed - Large-scale Multi-technology Wirel...
 
Cpf1-based genome editing using ribonucleoprotein complexes
Cpf1-based genome editing using ribonucleoprotein complexesCpf1-based genome editing using ribonucleoprotein complexes
Cpf1-based genome editing using ribonucleoprotein complexes
 
PrimeTime® qPCR products for gene expression
PrimeTime® qPCR products for gene expressionPrimeTime® qPCR products for gene expression
PrimeTime® qPCR products for gene expression
 
20140711 7 j_myerson_ercc2.0_workshop
20140711 7 j_myerson_ercc2.0_workshop20140711 7 j_myerson_ercc2.0_workshop
20140711 7 j_myerson_ercc2.0_workshop
 
qPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific ApplicationsqPCR Design Strategies for Specific Applications
qPCR Design Strategies for Specific Applications
 
Anne_Vaittinen_advanced_seminar_presentation
Anne_Vaittinen_advanced_seminar_presentationAnne_Vaittinen_advanced_seminar_presentation
Anne_Vaittinen_advanced_seminar_presentation
 
Uncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMD
Uncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMDUncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMD
Uncovering the Elusive HIV Capsid with Kepler GPUs Running NAMD and VMD
 
iGEM Progress Using High Quality Gene Fragments
iGEM Progress Using High Quality Gene FragmentsiGEM Progress Using High Quality Gene Fragments
iGEM Progress Using High Quality Gene Fragments
 
Use of NCBI Databases in qPCR Assay Design
Use of NCBI Databases in qPCR Assay DesignUse of NCBI Databases in qPCR Assay Design
Use of NCBI Databases in qPCR Assay Design
 
Q pcr
Q pcrQ pcr
Q pcr
 
BB_NCBI_PAG_2019_Workshop
BB_NCBI_PAG_2019_WorkshopBB_NCBI_PAG_2019_Workshop
BB_NCBI_PAG_2019_Workshop
 
Polymerase chain reaction (PCR)
Polymerase chain reaction (PCR)Polymerase chain reaction (PCR)
Polymerase chain reaction (PCR)
 
Daly altshuler.labmeeting
Daly altshuler.labmeetingDaly altshuler.labmeeting
Daly altshuler.labmeeting
 
resampling techniques in machine learning
resampling techniques in machine learningresampling techniques in machine learning
resampling techniques in machine learning
 

Viewers also liked

Hw09 Protein Alignment
Hw09   Protein AlignmentHw09   Protein Alignment
Hw09 Protein AlignmentCloudera, Inc.
 
Parallel DNA Sequence Alignment
Parallel DNA Sequence AlignmentParallel DNA Sequence Alignment
Parallel DNA Sequence AlignmentGiuliana Carullo
 
TCS: A new multiple sequence alignment reliability measure to estimate align...
 TCS: A new multiple sequence alignment reliability measure to estimate align... TCS: A new multiple sequence alignment reliability measure to estimate align...
TCS: A new multiple sequence alignment reliability measure to estimate align...JIA-MING CHANG
 
Sequencedatabases
SequencedatabasesSequencedatabases
SequencedatabasesAbhik Seal
 
BIOL335: Sequence alignment
BIOL335: Sequence alignmentBIOL335: Sequence alignment
BIOL335: Sequence alignmentPaul Gardner
 
Knowledge based expert systems in Bioinformatics
Knowledge based expert systems in BioinformaticsKnowledge based expert systems in Bioinformatics
Knowledge based expert systems in BioinformaticsRadwenAniba
 
The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment Parinda Rajapaksha
 
T coffee algorithm dissection
T coffee algorithm dissectionT coffee algorithm dissection
T coffee algorithm dissectionGui Chen
 
The Needleman Wunsch algorithm
The Needleman Wunsch algorithmThe Needleman Wunsch algorithm
The Needleman Wunsch algorithmavrilcoghlan
 
Multiple alignment
Multiple alignmentMultiple alignment
Multiple alignmentavrilcoghlan
 
ECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialThomas Keane
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignmentavrilcoghlan
 

Viewers also liked (17)

Hw09 Protein Alignment
Hw09   Protein AlignmentHw09   Protein Alignment
Hw09 Protein Alignment
 
Parallel DNA Sequence Alignment
Parallel DNA Sequence AlignmentParallel DNA Sequence Alignment
Parallel DNA Sequence Alignment
 
TCS: A new multiple sequence alignment reliability measure to estimate align...
 TCS: A new multiple sequence alignment reliability measure to estimate align... TCS: A new multiple sequence alignment reliability measure to estimate align...
TCS: A new multiple sequence alignment reliability measure to estimate align...
 
Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
 
BIOL335: Sequence alignment
BIOL335: Sequence alignmentBIOL335: Sequence alignment
BIOL335: Sequence alignment
 
Knowledge based expert systems in Bioinformatics
Knowledge based expert systems in BioinformaticsKnowledge based expert systems in Bioinformatics
Knowledge based expert systems in Bioinformatics
 
Coinami
CoinamiCoinami
Coinami
 
The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment
 
T coffee algorithm dissection
T coffee algorithm dissectionT coffee algorithm dissection
T coffee algorithm dissection
 
Msa
MsaMsa
Msa
 
The Needleman Wunsch algorithm
The Needleman Wunsch algorithmThe Needleman Wunsch algorithm
The Needleman Wunsch algorithm
 
Ch07 linearspacealignment
Ch07 linearspacealignmentCh07 linearspacealignment
Ch07 linearspacealignment
 
Multiple alignment
Multiple alignmentMultiple alignment
Multiple alignment
 
ECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing Tutorial
 
Ch06 alignment
Ch06 alignmentCh06 alignment
Ch06 alignment
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 

Similar to ETH_SymposiumCR

01-Sequencing_Technologies (1).ppt for education
01-Sequencing_Technologies (1).ppt for education01-Sequencing_Technologies (1).ppt for education
01-Sequencing_Technologies (1).ppt for educationaryajayakottarathil
 
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...Synack
 
Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...
Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...
Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...Synack
 
Random access scan
Random access scan Random access scan
Random access scan Harish Peta
 
Digial instrumentation fnal
Digial instrumentation fnalDigial instrumentation fnal
Digial instrumentation fnalBishal Rimal
 
1999 si pi_dws_training_course
1999 si pi_dws_training_course1999 si pi_dws_training_course
1999 si pi_dws_training_coursePiero Belforte
 
Scaling Genomic Analyses
Scaling Genomic AnalysesScaling Genomic Analyses
Scaling Genomic Analysesfnothaft
 
Bioinformatics workshop Sept 2014
Bioinformatics workshop Sept 2014Bioinformatics workshop Sept 2014
Bioinformatics workshop Sept 2014LutzFr
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Li Shen
 
Behavioral modeling of Clock/Data Recovery
Behavioral modeling of Clock/Data RecoveryBehavioral modeling of Clock/Data Recovery
Behavioral modeling of Clock/Data RecoveryArrow Devices
 
Real-time PCR.ppt
Real-time PCR.pptReal-time PCR.ppt
Real-time PCR.pptSappahAhmed
 
Chipqpcrpresentation
ChipqpcrpresentationChipqpcrpresentation
ChipqpcrpresentationElsa von Licy
 
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...Steve Cuddy
 
Spectral-, source-, connectivity- and network analysis of EEG and MEG data
Spectral-, source-, connectivity- and network analysis of EEG and MEG dataSpectral-, source-, connectivity- and network analysis of EEG and MEG data
Spectral-, source-, connectivity- and network analysis of EEG and MEG dataRobert Oostenveld
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods Mrinal Vashisth
 
MagneTag Presentation Winter Quarter V3.0
MagneTag Presentation Winter Quarter V3.0MagneTag Presentation Winter Quarter V3.0
MagneTag Presentation Winter Quarter V3.0John-Paul Petersen
 
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum DataAutomated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Dataaimsnist
 

Similar to ETH_SymposiumCR (20)

01-Sequencing_Technologies (1).ppt for education
01-Sequencing_Technologies (1).ppt for education01-Sequencing_Technologies (1).ppt for education
01-Sequencing_Technologies (1).ppt for education
 
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
 
Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...
Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...
Black Hat '15: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simpl...
 
Random access scan
Random access scan Random access scan
Random access scan
 
Digial instrumentation fnal
Digial instrumentation fnalDigial instrumentation fnal
Digial instrumentation fnal
 
1999 si pi_dws_training_course
1999 si pi_dws_training_course1999 si pi_dws_training_course
1999 si pi_dws_training_course
 
Primer designing for pcr and qpcr and their applications
Primer designing for pcr and qpcr and their applicationsPrimer designing for pcr and qpcr and their applications
Primer designing for pcr and qpcr and their applications
 
Scaling Genomic Analyses
Scaling Genomic AnalysesScaling Genomic Analyses
Scaling Genomic Analyses
 
Bioinformatics workshop Sept 2014
Bioinformatics workshop Sept 2014Bioinformatics workshop Sept 2014
Bioinformatics workshop Sept 2014
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2
 
Behavioral modeling of Clock/Data Recovery
Behavioral modeling of Clock/Data RecoveryBehavioral modeling of Clock/Data Recovery
Behavioral modeling of Clock/Data Recovery
 
Real-time PCR.ppt
Real-time PCR.pptReal-time PCR.ppt
Real-time PCR.ppt
 
Chipqpcrpresentation
ChipqpcrpresentationChipqpcrpresentation
Chipqpcrpresentation
 
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
 
Pcr
PcrPcr
Pcr
 
Spectral-, source-, connectivity- and network analysis of EEG and MEG data
Spectral-, source-, connectivity- and network analysis of EEG and MEG dataSpectral-, source-, connectivity- and network analysis of EEG and MEG data
Spectral-, source-, connectivity- and network analysis of EEG and MEG data
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
Guillemaud - SLAC T506 V2
Guillemaud - SLAC T506 V2Guillemaud - SLAC T506 V2
Guillemaud - SLAC T506 V2
 
MagneTag Presentation Winter Quarter V3.0
MagneTag Presentation Winter Quarter V3.0MagneTag Presentation Winter Quarter V3.0
MagneTag Presentation Winter Quarter V3.0
 
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum DataAutomated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
 

ETH_SymposiumCR

  • 1. 1
  • 2. Overview Real Time Single Molecule Sequencing • Mechanism/Motivation • Computational Challenges • Example: Sequence Alignment, probabilistic model Sequencing on a Chip • Mechanism/Motivation • Computation Challenges • Example: Sequence Alignment, probabilistic model 2
  • 4. 4 P G C A T P P A P G Polymerase Patient’s DNA sequence Real Time Single Molecule Sequencing Quantum dot FRET FRET: fluorescence resonance energy transfer -> Energy is transferred from quantum dot to dye, resulting in a color shift Emitted light Glass plate T
  • 5. Raw time series of one molecule (live) 5 Time (seconds) T A A G G
  • 6. Motivation • 5000 bp single reads or more • Multiple polymerase on one strand of DNA -> ultra long reads of up to 250’000 bp • (It’s very cool :-) 6
  • 7. Computational Challenges in Sequencing BaseCalling • Determine the sequence of G, A, T, C -> Signal detection (noise) -> Quality of base call Mapping • Locate billion(s) of sequences on genome -> Efficient mapping strategies -> Dealing with errors in reads SNP/CNV Detection • Does the patient have cancer? -> (Multiple) sequence alignment -> correct probabilistic model is critical 7 T A C G T A C G T C T G A G C A “Reads” “Sequences” Assembled genome Genotype, CNV, SNP
  • 8. Maximum likelihood alignment: Dynamic programming Confidential and Proprietary—DO NOT DUPLICATE 8 G A A G T A G A G A A Reference sequence R e a d missed base call mismatch missed base call NSB Correct model and transition probabilities are crucial
  • 10. Polymerase: event detection • For a detection limit of 10ms, we will miss ~ 7.5% of all insertion events • To see more events we could: – Slow down polymerase insertion rate – Lower insertion detection limit (increase frame rate and lower noise) Does not include blinking 10 Detection Limit % Events Missed 5ms 3.9% 10ms 7.5% 20ms 18% 30ms ~30%
  • 11. 11 Quantum Dot Blinking: Power law Sub-Sampled timeseries “Real”BlinkingSignal  Light dt·dpix “on” “off” Bell Curve Power Law Normal Distribution Power Law Finite mean/variance Infinite mean/variance possible Height/Weight distribution of people Financials: Stock market, Foreign exchange rates IQ of people Hedge fund risks Roulette, Blackjack etc File sizes, download times, city sizes, book sales
  • 13. Challenges of SMS • Enzyme kinetics (too fast to see, branching ratio) • Quantum dot blinking: random switching between ON and OFF states common to nanoscale emitters -> power law • Photobleaching (destruction of dot) • Single molecule -> stochastic behavior 13 Accuracy
  • 14. Sequencing on a Semiconductor Chip: The Chip is the Machine™ Ion Torrent Sensor Plate Silicon Substrate Drain SourceBulk ∆ pH ∆ V Sensing Layer H+ DNA on bead
  • 15. H+ Sensor Plate Silicon Substrate Drain SourceBulk ∆ pH ∆ V Sensing Layer H+ 15 • Natural chemistry: natural polymerase and nucleotides • Fast: one PGM run 1.5-7.5 hours, entire workflow 8-23 hours • Cost (50k for a PGM, 300.- for a sample with chip) • Simplicity: direct measurement, no cameras involved -> high accuracy • Read length: approx. 400 bp P G C A T P P A P G T Patient’s DNA sequence Method/Motivation DNA
  • 16. Raw signal of one base call Incorporations add hydrogen (dV): • one nucleotide • two nucelotides • three nucelotides • four nucleotides 16
  • 17. Flowgram of one read 17 T A C G T A C G T C T G A G C A Flow 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Nr bases inserted 1 2 T 2T C CA AGG
  • 18. Dynamic programming Confidential and Proprietary—DO NOT DUPLICATE 18 Flow 1 A Flow 2 G Flow 3 T Flow 4 C Flow 5 A Flow 6 T Base 1 A Base 2 T Base 3 T Base 4 A carry forward Incomplete extension Read R e f e r e n c e Correct model and transition probabilities are crucial
  • 19. Phasing Parameter Typical Values Description Carry Forward 0.2% - 1.0% % of polymerases that will incorporate when they shouldn’t Incomplete Extension 0.2% - 1.0% % of polymerases that will not incorporate when they should Droop 0.0% - 0.3% % of polymerases that will stop working Phasing Model We have one bead per well and there a many copies of DNA on a bead. During a run the DNA copies get out of synch with each other 19
  • 20. 20 Temporal model: When will the polymerase incorporate? Histogram of flows during which 6th base (T) is incorporated T A C G T A C G T C T G A G C A T C G A 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Flows (“time”) A C G T C T G A G C A T C G A T C G A T G T A C 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 Histogram of flows during which 50th base (A) is incorporated In phase In phase
  • 21. Goal of base calling (in principle) 21 Confidential and Proprietary—DO NOT DUPLICATE T C A G T T G A C TFind the base sequence for which the predicted flogram is most similar to the observed flogram T A C G T A C G T C T G A G C A T A C G T A C G T C T G A G C A Phasing model Least squares
  • 22. Regions, Exomes, Genomes and Beyond Small Genome Transcriptome 100M 1G 10G SequenceOutputperRun 100G Exome Small to Large Gene Panels 10M Ion 316™ Ion 314 Ion 318™ PII Human Genome PI PIII from 1.2 Million Sensors……… to 1.2 Billion 22
  • 23. About me… Detection of propylthiouracil (coffee, cabbage, grapefruit, green tea) Muscle performance (ACTN3) 23
  • 24. FAQ on SMS 24 • Quantum dot: nanocrystal made of semiconductor materials small enough to exhibit quantum mechanical properties. Exitons (electron/hole pair) are confined in all spacial dimensions. The frequency of emitted light increases as the size of the dot decreases • Blinking: caused by intercepted electrons, or by emitting electron. On/off times follow power law. • FRET: energy transfer without emission of photon. Efficiency is inverse of 6th power of distance (10-100 Å). Spectra must overlap