SlideShare a Scribd company logo
Class 2:
EVE 161:

Microbial Phylogenomics
Class #2:
Evolution of DNA Sequencing
UC Davis, Winter 2018
Instructor: Jonathan Eisen
Teaching Assistant: Cassie Ettinger
!1
Where we are going and where we have been
• Previous Class:
!1. Introduction, Reading Papers
• Current Class:
!2. Evolution of DNA sequencing
• Next Class:
!3. The Tree of Life
!2
Where we are going and where we have been
• Office Hours
• Cassie Ettinger - Thursday 10:30-11:30
Storer 5331
• Jonathan Eisen - Monday 11-12 

GBSF 5311
!3
Future Classes
• Come to class ready to discuss papers
• Will start a discussion forum to discuss in advance
• Come prepared with 2 questions about the papers to ask
• 1 question about something you did not understand
• 1 question for the other students to test understanding of
the paper
Class 2 Outline
• Two papers
• Evolution of Sequencing
!5
Costs / Mb
Pre Class
• 1. Briefly explain how to use chain-terminating inhibitors to
infer a DNA sequence. (Hint: Look at the figures in Sanger
et al)
• 2. How has sequencing technology changed over time?
Use examples of first-generation, second-generation and
third-generation DNA sequencing methods in your
answer.
Questions about Sanger paper?
Sanger et al. 1977
Sanger et al. 1977 Background
Sanger et al. 1977 Background
Sanger et al. Methods - Reagents
Sanger et al. Methods - Sequencing
Sanger et al. Results
• Key Points from Discussion
! Many advantages over plus / minus method
! Can read up to 300 bp
! Some artifacts but some solutions to these
! Best to read with both strands
! Hard to get ddNTPs
Sanger et al. Results
• A new method for determining nucleotide sequences in
DNA is described. It is similar to the “plus and minus”
method [Sanger, F. & Coulson, A. R. (1975) J. Mol. Biol.
94, 441-448] but makes use of the 2′,3′-dideoxy and
arabinonucleoside analogues of the normal
deoxynucleoside triphosphates, which act as specific
chain-terminating inhibitors of DNA polymerase. The
technique has been applied to the DNA of bacteriophage
ϕX174 and is more rapid and more accurate than either
the plus or the minus method.
Sanger et al. Abstract
Evolution of Sequencing
Gen 0:
Proto
Sequencing
Gen 0: Proto-Sequencing
Evolution of Sequencing
Gen 0:
Proto
Sequencing
Sanger
Maxam-Gilbert
Gen 1:
Manual
Sequencing
Maxam-Gilbert Sequencing
From http://www.pnas.org/content/74/2/560.full.pdf
Sanger Sequencing
http://www.ncbi.nlm.nih.gov/
pmc/articles/PMC431765/
Sanger Sequencing
• http://www.annualreviews.org/na101/home/literatum/publisher/ar/journals/content/anchem/2013/anchem.2013.6.issue-1/
annurev-anchem-062012-092628/20130605/images/medium/ac60287.f1.gif
Sanger Sequencing
• http://www.annualreviews.org/na101/home/literatum/publisher/ar/journals/content/anchem/2013/anchem.2013.6.issue-1/annurev-
anchem-062012-092628/20130605/images/medium/ac60287.f1.gif
Nobel Prize 1980: Berg, Gilbert, Sanger
http://www.nobelprize.org/nobel_prizes/chemistry/laureates/1980/
Evolution of Sequencing
Gen 0:
Proto
Sequencing
Sanger
Maxam-Gilbert
Gen 1:
Manual
Sequencing
Gen 2:
Automated
Sanger
Automation of Sanger Part I
Sanger method with labeled dNTPs
The Sanger mehtods is based on the idea that inhibitors can
terminate elongation of DNA at specific points
Many Systems for Sanger Automation
Thanks to Robin Coope and Dale Yazuki for comments on 2014 talk
ABI 3700 ABI 3730 ABI 3730xl
Megabase 1000 Megabase 4000 LiCor
Automation of Sanger Part II
Some Automated Sanger Highlights
• 1991: ESTs by Venter
• 1995: H. influenzae shotgun genome
• 1996: Yeast, archaeal genomes
• 1998: 1st animal genome - C. elegans
• 1999: Drosophila shotgun genome
• 2000: Arabidopsis genome
• 2000: Human genome
• 2004: Shotgun metagenomics
Thanks to Keith Bradnam for C. elegans suggestion
Evolution of Sequencing
Gen 0:
Proto
Sequencing
Sanger
Maxam-Gilbert
ABI-Solid
Solexa-
Illumina
454-
Roche
Ion
Torrent
Gen 1:
Manual
Sequencing
Gen 2:
Automated
Sanger
Gen 3:
Clusters
Not
Clones
Generation III: Clusters not Clones
Generation III = “NextGen”
NextGen Sequencing OutlineNext-generation sequencing platforms
Isolation and purification of
target DNA
Sample preparation
Library validation
Cluster generation
on solid-phase
Emulsion PCR
Sequencing by synthesis
with  3’-blocked reversible
terminators
Pyrosequencing Sequencing by ligation
Single colour imaging
Sequencing by synthesis
with  3’-unblocked reversible
terminators
AmplificationSequencingImaging
Four colour imaging
Data analysis
Roche 454Illumina GAII ABi SOLiD Helicos HeliScope
From Slideshare presentation of Cosentino Cristian
http://www.slideshare.net/cosentia/high-throughput-equencing
454 Pyrosequencing
Sanger
method
-
ABi SOLiD
HeliScope
Nanopore
Roche 454
umina GAII
From Slideshare presentation of Cosentino
Cristian
http://www.slideshare.net/cosentia/high-
throughput-equencing
NextGen #1: 454
NextGen #2: ABI SolidSequecning by ligation
Sanger
method
Roche 454
-
HeliScope
Nanopore
ABi SOLiD
Illumina GAII
From Slideshare
presentation of Cosentino
Cristian
http://www.slideshare.net/
cosentia/high-throughput-
equencing
NextGen #3: Solexa
Sequecning by synthesis with reversible terminator
anger
ethod
he 454
SOLiD
iScope
nopore
-
From Slideshare presentation of Cosentino Cristian
http://www.slideshare.net/cosentia/high-throughput-equencing
NextGen #2: Illumina
Sequecning by synthesis with reversible terminator
anger
ethod
he 454
SOLiD
iScope
nopore
-
From Slideshare presentation of Cosentino Cristian
http://www.slideshare.net/cosentia/high-throughput-equencing
Illumina Outline
Clusters
amplification
Clusterstation
Wash cluster
station
Clustergeneration
Linearization,
Blocking and
primer
Hybridization
Read 1
Prepare read 2
Read 2
GAIIx&PE
SBSsequencing
Pipeline base call
Data analysis
Sample
preparation and
library validation
Analysis
Sequencing workflow
Sample
preparation
Clusters
amplification
Sequencing by
synthesis
Analysis
pipeline
Introduction
Illumina GAII
High
throughput
From Slideshare
presentation of Cosentino
Cristian
http://www.slideshare.net/
cosentia/high-throughput-
equencing
Illumina Step 1: Prep & Attach DNA
termined by user-defined instrument settings,
which permits discrete read lengths of 25–35
the Illumina data from each run, removing
poor-quality sequences.
Adapter
DNA fragment
Dense lawn
of primers
Adapter
Attached
DNA
Adapters
Prepare genomic DNA sample
Randomly fragment genomic DNA
and ligate adapters to both ends of
the fragments.
Attach DNA to surface
Bind single-stranded fragments
randomly to the inside surface
of the flow cell channels.
Nucleotides
a
From Mardis 2008. Annual Rev. Genetics 9: 387.
Step 1: Sample Preparation The DNA sample of interest is sheared to appropriate size (average ~800bp) using a compressed air device known as a nebulizer. The
ends of the DNA are polished, and two unique adapters are ligated to the fragments. Ligated fragments of the size range of 150-200bp are isolated via gel
extraction and amplified using limited cycles of PCR
Illumina Step 2: Clusters by Bridge PCR
Attached
Prepare genomic DNA sample
Randomly fragment genomic DNA
and ligate adapters to both ends of
the fragments.
Attach DNA to surface
Bind single-stranded fragments
randomly to the inside surface
of the flow cell channels.
Bridge amplification
Add unlabeled nucleotides
and enzyme to initiate solid-
phase bridge amplification.
Denature the double
stranded molecules
Nucleotides
Figure 2
The Illumina sequencing-by-synthesis approach. Cluster strands created by bridge amplification are primed and all four fluorescently
labeled, 3′-OH blocked nucleotides are added to the flow cell with DNA polymerase. The cluster strands are extended by one
nucleotide. Following the incorporation step, the unused nucleotides and DNA polymerase molecules are washed away, a scan buffer is
added to the flow cell, and the optics system scans each lane of the flow cell by imaging units called tiles. Once imaging is completed,
chemicals that effect cleavage of the fluorescent labels and the 3′-OH blocking groups are added to the flow cell, which prepares the
cluster strands for another round of fluorescent nucleotide incorporation.
392 Mardis
• From : http://seqanswers.com/forums/showthread.php?t=21. Steps 2-6: Cluster Generation by Bridge Amplification. In contrast to the 454 and ABI methods which
use a bead-based emulsion PCR to generate "polonies", Illumina utilizes a unique "bridged" amplification reaction that occurs on the surface of the flow cell.
The flow cell surface is coated with single stranded oligonucleotides that correspond to the sequences of the adapters ligated during the sample preparation
stage. Single-stranded, adapter-ligated fragments are bound to the surface of the flow cell exposed to reagents for polyermase-based extension. Priming
occurs as the free/distal end of a ligated fragment "bridges" to a complementary oligo on the surface. Repeated denaturation and extension results in
localized amplification of single molecules in millions of unique locations across the flow cell surface. This process occurs in what is referred to as Illumina's
"cluster station", an automated flow cell processor.
From Mardis 2008. Annual Rev. Genetics 9: 387.
Clusters
Illumina Step 3: Sequencing
25–35 bp, and each sequencing run yields be-
tween 2–4 Gb of DNA sequence data. Once
generation read type has a unique error model
different from that already established for
b
Laser
First chemistry cycle:
determine first base
To initiate the first
sequencing cycle, add
all four labeled reversible
terminators, primers, and
DNA polymerase enzyme
to the flow cell.
Image of first chemistry cycle
After laser excitation, capture the image
of emitted fluorescence from each
cluster on the flow cell. Record the
identity of the first base for each cluster.
Sequence read over multiple chemistry cycles
Repeat cycles of sequencing to determine the sequence
Before initiating the
next chemistry cycle
The blocked 3' terminus
and the fluorophore
from each incorporated
base are removed.
GCTGA...
From Mardis 2008. Annual Rev. Genetics 9: 387.
From : http://seqanswers.com/forums/showthread.php?t=21. Steps 7-12: Sequencing by Synthesis. A flow cell containing millions of unique clusters is now
loaded into the 1G sequencer for automated cycles of extension and imaging. The first cycle of sequencing consists first of the incorporation of a single
fluorescent nucleotide, followed by high resolution imaging of the entire flow cell. These images represent the data collected for the first base. Any signal
above background identifies the physical location of a cluster (or polony), and the fluorescent emission identifies which of the four bases was incorporated at
that position. This cycle is repeated, one base at a time, generating a series of images each representing a single base extension at a specific cluster. Base calls
are derived with an algorithm that identifies the emission color over time. At this time reports of useful Illumina reads range from 26-50 bases.
SBS technology
Sample
eparation
Clusters
mplification
uencing by
ynthesis
Analysis
pipeline
troduction
umina GAII
High
roughput
From Slideshare presentation of Cosentino Cristian
http://www.slideshare.net/cosentia/high-throughput-equencing
Illumina Step 3: Sequencing
Illumina Cycling
Laser
terminators, primers, and
DNA polymerase enzyme
to the flow cell.
Image of first chemistry cycle
After laser excitation, capture the image
of emitted fluorescence from each
cluster on the flow cell. Record the
identity of the first base for each cluster.
Sequence read over multiple chemistry cycles
Repeat cycles of sequencing to determine the sequence
of bases in a given fragment a single base at a time.
Before initiating the
next chemistry cycle
The blocked 3' terminus
and the fluorophore
from each incorporated
base are removed.
GCTGA...
Figure 2
(Continued )
www.annualreviews.org • Next-Generation DNA Sequencing Methods 393
From Mardis 2008. Annual Rev. Genetics 9: 387.
Illumina Evolution
http://www.slideshare.net/AGRF_Ltd/ngs-technologies-platforms-and-applications
Generation 3.5: Faster w/ Clusters
• Even faster
Generation 3.5: Faster w/ Clusters - Ion Torrent
Ion Torrent pH Based Sequencing
Mardis ER. Next-generation sequencing platforms. Annu Rev Anal Chem 2013;6:287-303.
Oxford
NanoPore
Pacbio
Helicos
Gen 4: Single Molecule
Gen 0:
Proto
Sequencing
Sanger
Maxam-Gilbert
ABI-Solid
Solexa-
Illumina
454-
Roche
Ion
Torrent
Gen 1:
Manual
Sequencing
Gen 2:
Automated
Sanger
Gen 3:
Clusters
Not
Clones
Gen 4:
Single
Molecule
Single Molecule I: Helicos
3rd
Generation Sequencing
y
w for
cular
ent_sequencing
Single Molecule II: Pacific Biosciences
Mardis ER. Next-generation sequencing platforms. Annu Rev Anal Chem 2013;6:287-303.
Single Molecule II: Pacific Biosciences
Analytical Chemistry REVIEW
Φ29 polymerase. Each amplified product of a circularized
fragment is called a DNA nanoball (DNB). DNBs are selectively
attached to a hexamethyldisilizane (HMDS) coated silicon chip
that is photolithographically patterned with aminosilane active
sites. Figure 3A illustrates the DNB array design.
The use of the DNBs coupled with the highly patterned array
offers several advantages. The production of DNBs increases
signal intensity by simply increasing the number of hybridization
sites available for probing. Also, the size of the DNB is on the
same length scale as the active site or “sticky” spot patterned on
Each hybridization and ligation cycle is followed by fluorescent
imaging of the DNB spotted chip and subsequently regeneration
of the DNBs with a formamide solution. This cycle is repeated
until the entire combinatorial library of probes and anchors is
examined. This formula of the use of unchained reads and
regeneration of the sequencing fragment reduces reagent con-
sumption and eliminates potential accumulation errors that can
arise in other sequencing technologies that require close to
completion of each sequencing reaction.19,52,53
Complete Genomics showcased their DNB array and cPAL
Figure 2. Schematic of PacBio’s real-time single molecule sequencing. (A) The side view of a single ZMW nanostructure containing a single DNA
polymerase (Φ29) bound to the bottom glass surface. The ZMW and the confocal imaging system allow fluorescence detection only at the bottom
surface of each ZMW. (B) Representation of fluorescently labeled nucleotide substrate incorporation on to a sequencing template. The corresponding
temporal fluorescence detection with respect to each of the five incorporation steps is shown below. Reprinted with permission from ref 39. Copyright
2009 American Association for the Advancement of Science.
Figure 2. Schematic of PacBio’s real-time single molecule sequencing. (A) The side view of a single ZMW nanostructure containing a single
DNA polymerase (Φ29) bound to the bottom glass surface. The ZMW and the confocal imaging system allow fluorescence detection only at
the bottom surface of each ZMW. (B) Representation of fluorescently labeled nucleotide substrate incorporation on to a sequencing
template. The corresponding temporal fluorescence detection with respect to each of the five incorporation steps is shown below.
From Niedringhaus et al. Analytical Chemistry 83: 4327. 2011.
Single Molecule II: Pacific Biosciences
Detecting Modified Bases
Page 2 www.pacb.com/basemod
processes.
The potential benefits of detecting base modification,
using SMRT sequencing, include:
Single-base resolution detection of a wide
two successive base incorporations, are altered by
the presence of a modified base in the DNA
template
3
. This is observable as an increased space
between fluorescence pulses, which is called the
interpulse duration (IPD), as shown in Figure 2.
Figure 2. Principle of detecting modified DNA bases during SMRT sequencing. The presence of the modified base in the DNA
template (top), shown here for 6-mA, results in a delayed incorporation of the corresponding T nucleotide, i.e. longer
interpulse duration (IPD), compared to a control DNA template lacking the modification (bottom).
3
http://www.pacificbiosciences.com/pdf/microbial_primer.pdf
Single Molecule III: Oxford Nanopores
This diagram shows a protein nanopore set in an electrically resistant membrane bilayer. An ionic current is passed through the nanopore
by setting a voltage across this membrane. If an analyte passes through the pore or near its aperture, this event creates a characteristic
disruption in current. By measuring that current it is possible to identify the molecule in question. For example, this system can be used to
distinguish the four standard DNA bases and G, A, T and C, and also modified bases. It can be used to identify target proteins, small
molecules, or to gain rich molecular information for example to distinguish the enantiomers of ibuprofen or molecular binding dynamics.
From Oxford Nanopores Web Site
• Figure6. BiologicalnanoporeschemeemployedbyOxfordNanopore.
(A)SchematicofRHLproteinnanoporemutantdepictingthepositionsofthe cyclodextrin (at residue 135) and glutamines (at residue 139).
(B) A detailed view of the β barrel of the mutant nanopore shows the locations of the arginines (at residue 113) and the cysteines. (C)
Exonuclease sequencing: A processive enzyme is attached to the top of the nanopore to cleave single nucleotides from the target DNA
strand and pass them through the nanopore. (D) A residual current-vs-time signal trace from an RHL protein nanopore that shows a
clear discrimination between single bases (dGMP, dTMP, dAMP, and dCMP). (E) Strand sequencing: ssDNA is threaded through a
protein nanopore and individual bases are identified, as the strand remains intact. Panels A, B, and D reprinted with permission from ref
91. Copyright 2009 Nature Publishing Group. Panels C and E reprinted with permission from Oxford Nanopore Technologies (Zoe
McDougall).
nalytical Chemistry REVIEW
etected across a metal oxide-silicon layered structure. The translocated through a solid-state nanopore. In addition, sever
igure 6. Biological nanopore scheme employed by Oxford Nanopore. (A) Schematic of RHL protein nanopore mutant depicting the positions of t
yclodextrin (at residue 135) and glutamines (at residue 139). (B) A detailed view of the β barrel of the mutant nanopore shows the locations
he arginines (at residue 113) and the cysteines. (C) Exonuclease sequencing: A processive enzyme is attached to the top of the nanopore to cleave sing
ucleotides from the target DNA strand and pass them through the nanopore. (D) A residual current-vs-time signal trace from an RHL protein nanopo
hat shows a clear discrimination between single bases (dGMP, dTMP, dAMP, and dCMP). (E) Strand sequencing: ssDNA is threaded through
rotein nanopore and individual bases are identified, as the strand remains intact. Panels A, B, and D reprinted with permission from ref 91. Copyrig
009 Nature Publishing Group. Panels C and E reprinted with permission from Oxford Nanopore Technologies (Zoe McDougall).
Figure6. Biological nanopores cheme employed by Oxford Nanopore.(A) Schematic of RHL protein nano pore mutant depicting the positions of the cyclodextrin (at residue
135) and glutamines (at residue 139). (B) A detailed view of the β barrel of the mutant nanopore shows the locations of the arginines (at residue 113) and the cysteines. (C)
Exonuclease sequencing: A processive enzyme is attached to the top of the nanopore to cleave single nucleotides from the target DNA strand and pass them through the
nanopore. (D) A residual current-vs-time signal trace from an RHL protein nanopore that shows a clear discrimination between single bases (dGMP, dTMP, dAMP, and dCMP).
(E) Strand sequencing: ssDNA is threaded through a protein nanopore and individual bases are identified, as the strand remains intact. Panels A, B, and D reprinted with
permission from ref 91. Copyright 2009 Nature Publishing Group. Panels C and E reprinted with permission from Oxford Nanopore Technologies (Zoe McDougall).
From Niedringhaus et al. Analytical Chemistry 83: 4327. 2011.
Single Molecule III: Oxford Nanopores
Nanopores
Analytical Chemistry REVIEW
would result, in theory, in detectably altered current flow through
the pore. Theoretically, nanopores could also be designed to
measure tunneling current across the pore as bases, each with a
distinct tunneling potential, could be read. The nanopore ap-
lipid bilayer, using ionic current blockage method. The autho
predicted that single nucleotides could be discriminated as lon
as: (1) each nucleotide produces a unique signal signature; (2
the nanopore possesses proper aperture geometry to accommo
Figure 5. Nanopore DNA sequencing using electronic measurements and optical readout as detection methods. (A) In electronic nanopore scheme
signal is obtained through ionic current,73
tunneling current,78
and voltage difference79
measurements. Each method must produce a characteristic sign
to differentiate the four DNA bases. Reprinted with permission from ref 83. Copyright 2008 Annual Reviews. (B) In the optical readout nanopore desig
each nucleotide is converted to a preset oligonucleotide sequence and hybridized with labeled markers that are detected during translocation of the DN
fragment through the nanopore. Reprinted from ref 82. Copyright 2010 American Chemical Society.
Nanopore DNA sequencing using electronic measurements and optical readout as detection methods.(A)In electronic nanopore schemes, signal is obtained through ionic
current,73 tunneling current, and voltage difference measurements. Each method must produce a characteristic signal to differentiate the four DNA bases. (B) In the optical
readout nanopore design, each nucleotide is converted to a preset oligonucleotide sequence and hybridized with labeled markers that are detected during translocation of the
DNA fragment through the nanopore.
Single Molecule III: Oxford Nanopores
From Niedringhaus et al. Analytical Chemistry 83: 4327. 2011.
• http://www.nature.com/polopoly_fs/7.15596.1392415346!/
image/image001.jpg_gen/derivatives/landscape_630/
image001.jp
“It’s kind of a cute device,” says Jaffe of the MinION, which is roughly the size and shape of a packet of chewing gum. “It
has pretty lights and a fan that hums pleasantly, and plugs into a USB drive.” But his technical review is mixed. From http://
www.nature.com/news/data-from-pocket-sized-genome-sequencer-unveiled-1.14724
Single Molecule III: Oxford Nanopores
Real Time Sequencing?
Slide Provided by Nick Loman
Bells and Whistles
• Multiplexing
• Paired end
• Mate pair
• Moleculo
Multiplexing
From http://www.illumina.com/technology/multiplexing_sequencing_assay.ilmn
Multiplexing
http://res.illumina.com/documents/products/datasheets/datasheet_sequencing_multiplex.pdf
Small Amounts of DNA
http://www.epibio.com/docs/default-source/protocols/nextera-dna-sample-prep-kit-(illumina--compatible).pdf?sfvrsn=4
Capture Methods
High throughput sample preparation
Sample
preparation
Clusters
amplification
Sequencing by
synthesis
Analysis
pipeline
Introduction
Illumina GAII
High
throughput
Nature Methods, 2010, 7: 111-118
RainDance
Microdroplet PCR
Roche Nimblegen
Salid-phase capture with custom-
designed oligonucleotide microarray
Reported 84% of
capture efficiency
Reported 65-90% of capture efficiency
From Slideshare presentation of Cosentino Cristian
http://www.slideshare.net/cosentia/high-throughput-equencing
High throughput sample preparation
Sample
preparation
Clusters
amplification
Sequencing by
synthesis
Analysis
pipeline
Introduction
Illumina GAII
High
throughput
Agilent SureSelect
Solution-phase capture with
streptavidin-coated magnetic beads
Reported 60-80% of capture efficiency
From Slideshare presentation of
Cosentino Cristian
http://www.slideshare.net/cosentia/high-
throughput-equencing
Capture Methods
Illumina Paired EndsPaired-end technology
Paired-end sequencing works into GA and uses chemicals from the PE
module to perform cluster amplification of the reverse strandSample
preparation
Clusters
amplification
Sequencing by
synthesis
Analysis
pipeline
Introduction
Illumina GAII
High
throughput
From Slideshare presentation
of Cosentino Cristian
http://www.slideshare.net/
cosentia/high-throughput-
equencing
Moleculo
Large fragments
DNA
Isolate and amplify
CACC GGAA TCTC ACGT AAGG GATC AAAA
Sublibrary w/ unique barcodes
Sequence w/ Illumina
Assemble seqs w/ same codes
HiC
Suggested by Carlos Bustamante and Keith Bradnam
HiC Crosslinking & Sequencing
Beitel CW, Froenicke L, Lang JM, Korf IF, Michelmore RW,
Eisen JA, Darling AE. (2014) Strain- and plasmid-level
deconvolution of a synthetic metagenome by sequencing
proximity ligation products. PeerJ 2:e415 http://dx.doi.org/
10.7717/peerj.415
Table 1 Species alignment fractions. The number of reads aligning to each replicon present in the
synthetic microbial community are shown before and after filtering, along with the percent of total
constituted by each species. The GC content (“GC”) and restriction site counts (“#R.S.”) of each replicon,
species, and strain are shown. Bur1: B. thailandensis chromosome 1. Bur2: B. thailandensis chromosome
2. Lac0: L. brevis chromosome, Lac1: L. brevis plasmid 1, Lac2: L. brevis plasmid 2, Ped: P. pentosaceus,
K12: E. coli K12 DH10B, BL21: E. coli BL21. An expanded version of this table can be found in Table S2.
Sequence Alignment % of Total Filtered % of aligned Length GC #R.S.
Lac0 10,603,204 26.17% 10,269,562 96.85% 2,291,220 0.462 629
Lac1 145,718 0.36% 145,478 99.84% 13,413 0.386 3
Lac2 691,723 1.71% 665,825 96.26% 35,595 0.385 16
Lac 11,440,645 28.23% 11,080,865 96.86% 2,340,228 0.46 648
Ped 2,084,595 5.14% 2,022,870 97.04% 1,832,387 0.373 863
BL21 12,882,177 31.79% 2,676,458 20.78% 4,558,953 0.508 508
K12 9,693,726 23.92% 1,218,281 12.57% 4,686,137 0.507 568
E. coli 22,575,903 55.71% 3,894,739 17.25% 9,245,090 0.51 1076
Bur1 1,886,054 4.65% 1,797,745 95.32% 2,914,771 0.68 144
Bur2 2,536,569 6.26% 2,464,534 97.16% 3,809,201 0.672 225
Bur 4,422,623 10.91% 4,262,279 96.37% 6,723,972 0.68 369
Figure 1 Hi-C insert distribution. The distribution of genomic distances between Hi-C read pairs is
shown for read pairs mapping to each chromosome. For each read pair the minimum path length on
the circular chromosome was calculated and read pairs separated by less than 1000 bp were discarded.
The 2.5 Mb range was divided into 100 bins of equal size and the number of read pairs in each bin
was recorded for each chromosome. Bin values for each chromosome were normalized to sum to 1 and
plotted.
E. coli K12 genome were distributed in a similar manner as previously reported (Fig. 1;
(Lieberman-Aiden et al., 2009)). We observed a minor depletion of alignments spanning
the linearization point of the E. coli K12 assembly (e.g., near coordinates 0 and 4686137)
due to edge eVects induced by BWA treating the sequence as a linear chromosome rather
than circular.
10.7717/peerj.415 9/19
Figure 2 Metagenomic Hi-C associations. The log-scaled, normalized number of Hi-C read pairs
associating each genomic replicon in the synthetic community is shown as a heat map (see color scale,
blue to yellow: low to high normalized, log scaled association rates). Bur1: B. thailandensis chromosome
1. Bur2: B. thailandensis chromosome 2. Lac0: L. brevis chromosome, Lac1: L. brevis plasmid 1, Lac2:
L. brevis plasmid 2, Ped: P. pentosaceus, K12: E. coli K12 DH10B, BL21: E. coli BL21.
reference assemblies of the members of our synthetic microbial community with the same
alignment parameters as were used in the top ranked clustering (described above). We first
Figure 3 Contigs associated by Hi-C reads. A graph is drawn with nodes depicting contigs and edges
depicting associations between contigs as indicated by aligned Hi-C read pairs, with the count thereof
depicted by the weight of edges. Nodes are colored to reflect the species to which they belong (see legend)
with node size reflecting contig size. Contigs below 5 kb and edges with weights less than 5 were excluded.
Contig associations were normalized for variation in contig size.
typically represent the reads and variant sites as a variant graph wherein variant sites are
represented as nodes, and sequence reads define edges between variant sites observed in
the same read (or read pair). We reasoned that variant graphs constructed from Hi-C
data would have much greater connectivity (where connectivity is defined as the mean
path length between randomly sampled variant positions) than graphs constructed from
mate-pair sequencing data, simply because Hi-C inserts span megabase distances. Such
Figure 4 Hi-C contact maps for replicons of Lactobacillus brevis. Contact maps show the number of
Hi-C read pairs associating each region of the L. brevis genome. The L. brevis chromosome (Lac0, (A),
Chris Beitel
@datscimed
Aaron Darling
@koadman
Dovetail “Chicago” Libraries
http://arxiv.org/abs/1502.05331
Suggested by
Single Cell Sequencing
Suggested by Keith Robison

More Related Content

What's hot

Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Jonathan Eisen
 
Pathogen Genome Data
Pathogen Genome DataPathogen Genome Data
Pathogen Genome Data
Leighton Pritchard
 
Intro to metagenomic binning
Intro to metagenomic binningIntro to metagenomic binning
Intro to metagenomic binning
A. Murat Eren
 
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
nist-spin
 
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
Mads Albertsen
 
Genomics Technologies
Genomics TechnologiesGenomics Technologies
Genomics Technologies
Sean Davis
 
Advancing the Metagenomics Revolution
Advancing the Metagenomics RevolutionAdvancing the Metagenomics Revolution
Advancing the Metagenomics Revolution
Larry Smarr
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Jajati Keshari Nayak
 
[2013.09.27] extracting genomes from metagenomes
[2013.09.27] extracting genomes from metagenomes[2013.09.27] extracting genomes from metagenomes
[2013.09.27] extracting genomes from metagenomesMads Albertsen
 
Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...
Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...
Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...
guest5368597
 
Bhojeshwari sahu
Bhojeshwari sahuBhojeshwari sahu
Bhojeshwari sahu
Dr. S.S. Shaw
 
Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
IndrajaDoradla
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
GigaScience, BGI Hong Kong
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
Ankit Bhardwaj
 
Ngs presentation
Ngs presentationNgs presentation
Ngs presentation
Chakradhar Reddy
 
Metagenomics sequencing
Metagenomics sequencingMetagenomics sequencing
Metagenomics sequencing
cdgenomics525
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)
Leighton Pritchard
 
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesReverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Leighton Pritchard
 
Next Generation Sequencing Informatics - Challenges and Opportunities
Next Generation Sequencing Informatics - Challenges and OpportunitiesNext Generation Sequencing Informatics - Challenges and Opportunities
Next Generation Sequencing Informatics - Challenges and Opportunities
Chung-Tsai Su
 

What's hot (20)

Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
 
Pathogen Genome Data
Pathogen Genome DataPathogen Genome Data
Pathogen Genome Data
 
Intro to metagenomic binning
Intro to metagenomic binningIntro to metagenomic binning
Intro to metagenomic binning
 
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
 
Pattemore 2015
Pattemore 2015Pattemore 2015
Pattemore 2015
 
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
 
Genomics Technologies
Genomics TechnologiesGenomics Technologies
Genomics Technologies
 
Advancing the Metagenomics Revolution
Advancing the Metagenomics RevolutionAdvancing the Metagenomics Revolution
Advancing the Metagenomics Revolution
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
[2013.09.27] extracting genomes from metagenomes
[2013.09.27] extracting genomes from metagenomes[2013.09.27] extracting genomes from metagenomes
[2013.09.27] extracting genomes from metagenomes
 
Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...
Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...
Beyond Metagenomics- Integration Of Complementary Approaches For The Study Of...
 
Bhojeshwari sahu
Bhojeshwari sahuBhojeshwari sahu
Bhojeshwari sahu
 
Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
Ngs presentation
Ngs presentationNgs presentation
Ngs presentation
 
Metagenomics sequencing
Metagenomics sequencingMetagenomics sequencing
Metagenomics sequencing
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)
 
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesReverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
 
Next Generation Sequencing Informatics - Challenges and Opportunities
Next Generation Sequencing Informatics - Challenges and OpportunitiesNext Generation Sequencing Informatics - Challenges and Opportunities
Next Generation Sequencing Informatics - Challenges and Opportunities
 

Similar to EVE161: Microbial Phylogenomics - Class 2 - Evolution of DNA Sequencing

Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
LINUS CORNERY
 
Sequencing genes and genomes
Sequencing genes and genomesSequencing genes and genomes
Sequencing genes and genomes
sepidehsaroghi
 
Illumina infinium sequencing
Illumina infinium sequencingIllumina infinium sequencing
Illumina infinium sequencing
Ayush Jain
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
priyanka raviraj
 
Hamas 1
Hamas 1Hamas 1
03_Microbio590B_sequencing_2022.pdf
03_Microbio590B_sequencing_2022.pdf03_Microbio590B_sequencing_2022.pdf
03_Microbio590B_sequencing_2022.pdf
Kristen DeAngelis
 
Sequence based Markers
Sequence based MarkersSequence based Markers
Sequence based Markers
sukruthaa
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
Shaheen Alam
 
NGS.pptx
NGS.pptxNGS.pptx
NGS.pptx
Bl Saini
 
Dna sequencing and its types
Dna sequencing and its typesDna sequencing and its types
Dna sequencing and its types
Yuvaraj neelakandan
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
Musaddaq Hafeez
 
Generations of sequencing technologies.
Generations of sequencing technologies. Generations of sequencing technologies.
Generations of sequencing technologies.
ShadenAlharbi
 
2017 summer high school textbook (molecular biology and neuroscience)
2017 summer high school textbook (molecular biology and neuroscience)2017 summer high school textbook (molecular biology and neuroscience)
2017 summer high school textbook (molecular biology and neuroscience)
Masuma Sani
 
Manal- sequencing presentation-biotechpresentation-.pptx
Manal- sequencing presentation-biotechpresentation-.pptxManal- sequencing presentation-biotechpresentation-.pptx
Manal- sequencing presentation-biotechpresentation-.pptx
abun6
 
Modeling DNA Amplification by Polymerase Chain Reaction (PCR)
Modeling DNA Amplification by Polymerase Chain Reaction (PCR)Modeling DNA Amplification by Polymerase Chain Reaction (PCR)
Modeling DNA Amplification by Polymerase Chain Reaction (PCR)
Danielle Snowflack
 
Dna sequening
Dna sequeningDna sequening
Dna sequening
Muthu Kumar M
 
Conventional and next generation sequencing ppt
Conventional and next generation sequencing pptConventional and next generation sequencing ppt
Conventional and next generation sequencing ppt
Ashwini R
 
Next generation-sequencing.ppt-converted
Next generation-sequencing.ppt-convertedNext generation-sequencing.ppt-converted
Next generation-sequencing.ppt-converted
Shweta Tiwari
 
Useful.ppt
Useful.pptUseful.ppt
Useful.ppt
aaaa bbb
 
DNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGSDNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGS
4RTPCRAnand
 

Similar to EVE161: Microbial Phylogenomics - Class 2 - Evolution of DNA Sequencing (20)

Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Sequencing genes and genomes
Sequencing genes and genomesSequencing genes and genomes
Sequencing genes and genomes
 
Illumina infinium sequencing
Illumina infinium sequencingIllumina infinium sequencing
Illumina infinium sequencing
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
 
Hamas 1
Hamas 1Hamas 1
Hamas 1
 
03_Microbio590B_sequencing_2022.pdf
03_Microbio590B_sequencing_2022.pdf03_Microbio590B_sequencing_2022.pdf
03_Microbio590B_sequencing_2022.pdf
 
Sequence based Markers
Sequence based MarkersSequence based Markers
Sequence based Markers
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
NGS.pptx
NGS.pptxNGS.pptx
NGS.pptx
 
Dna sequencing and its types
Dna sequencing and its typesDna sequencing and its types
Dna sequencing and its types
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
 
Generations of sequencing technologies.
Generations of sequencing technologies. Generations of sequencing technologies.
Generations of sequencing technologies.
 
2017 summer high school textbook (molecular biology and neuroscience)
2017 summer high school textbook (molecular biology and neuroscience)2017 summer high school textbook (molecular biology and neuroscience)
2017 summer high school textbook (molecular biology and neuroscience)
 
Manal- sequencing presentation-biotechpresentation-.pptx
Manal- sequencing presentation-biotechpresentation-.pptxManal- sequencing presentation-biotechpresentation-.pptx
Manal- sequencing presentation-biotechpresentation-.pptx
 
Modeling DNA Amplification by Polymerase Chain Reaction (PCR)
Modeling DNA Amplification by Polymerase Chain Reaction (PCR)Modeling DNA Amplification by Polymerase Chain Reaction (PCR)
Modeling DNA Amplification by Polymerase Chain Reaction (PCR)
 
Dna sequening
Dna sequeningDna sequening
Dna sequening
 
Conventional and next generation sequencing ppt
Conventional and next generation sequencing pptConventional and next generation sequencing ppt
Conventional and next generation sequencing ppt
 
Next generation-sequencing.ppt-converted
Next generation-sequencing.ppt-convertedNext generation-sequencing.ppt-converted
Next generation-sequencing.ppt-converted
 
Useful.ppt
Useful.pptUseful.ppt
Useful.ppt
 
DNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGSDNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGS
 

More from Jonathan Eisen

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdf
Jonathan Eisen
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of Microbes
Jonathan Eisen
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meeting
Jonathan Eisen
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current Actions
Jonathan Eisen
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Jonathan Eisen
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2
Jonathan Eisen
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4
Jonathan Eisen
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1
Jonathan Eisen
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines
Jonathan Eisen
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 Introduction
Jonathan Eisen
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2
Jonathan Eisen
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 Vaccines
Jonathan Eisen
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA Detection
Jonathan Eisen
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 Introduction
Jonathan Eisen
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID Testing
Jonathan Eisen
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID Vaccines
Jonathan Eisen
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID Transmission
Jonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
Jonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
Jonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
Jonathan Eisen
 

More from Jonathan Eisen (20)

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdf
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of Microbes
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meeting
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current Actions
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 Introduction
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 Vaccines
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA Detection
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 Introduction
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID Testing
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID Vaccines
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID Transmission
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
 

Recently uploaded

Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 
Viksit bharat till 2047 India@2047.pptx
Viksit bharat till 2047  India@2047.pptxViksit bharat till 2047  India@2047.pptx
Viksit bharat till 2047 India@2047.pptx
rakeshsharma20142015
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
muralinath2
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
AlguinaldoKong
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
azzyixes
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
anitaento25
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 

Recently uploaded (20)

Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 
Viksit bharat till 2047 India@2047.pptx
Viksit bharat till 2047  India@2047.pptxViksit bharat till 2047  India@2047.pptx
Viksit bharat till 2047 India@2047.pptx
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 

EVE161: Microbial Phylogenomics - Class 2 - Evolution of DNA Sequencing

  • 1. Class 2: EVE 161:
 Microbial Phylogenomics Class #2: Evolution of DNA Sequencing UC Davis, Winter 2018 Instructor: Jonathan Eisen Teaching Assistant: Cassie Ettinger !1
  • 2. Where we are going and where we have been • Previous Class: !1. Introduction, Reading Papers • Current Class: !2. Evolution of DNA sequencing • Next Class: !3. The Tree of Life !2
  • 3. Where we are going and where we have been • Office Hours • Cassie Ettinger - Thursday 10:30-11:30 Storer 5331 • Jonathan Eisen - Monday 11-12 
 GBSF 5311 !3
  • 4. Future Classes • Come to class ready to discuss papers • Will start a discussion forum to discuss in advance • Come prepared with 2 questions about the papers to ask • 1 question about something you did not understand • 1 question for the other students to test understanding of the paper
  • 5. Class 2 Outline • Two papers • Evolution of Sequencing !5
  • 7. Pre Class • 1. Briefly explain how to use chain-terminating inhibitors to infer a DNA sequence. (Hint: Look at the figures in Sanger et al) • 2. How has sequencing technology changed over time? Use examples of first-generation, second-generation and third-generation DNA sequencing methods in your answer.
  • 10. Sanger et al. 1977 Background
  • 11. Sanger et al. 1977 Background
  • 12. Sanger et al. Methods - Reagents
  • 13. Sanger et al. Methods - Sequencing
  • 14. Sanger et al. Results
  • 15.
  • 16.
  • 17.
  • 18. • Key Points from Discussion ! Many advantages over plus / minus method ! Can read up to 300 bp ! Some artifacts but some solutions to these ! Best to read with both strands ! Hard to get ddNTPs Sanger et al. Results
  • 19. • A new method for determining nucleotide sequences in DNA is described. It is similar to the “plus and minus” method [Sanger, F. & Coulson, A. R. (1975) J. Mol. Biol. 94, 441-448] but makes use of the 2′,3′-dideoxy and arabinonucleoside analogues of the normal deoxynucleoside triphosphates, which act as specific chain-terminating inhibitors of DNA polymerase. The technique has been applied to the DNA of bacteriophage ϕX174 and is more rapid and more accurate than either the plus or the minus method. Sanger et al. Abstract
  • 20.
  • 21. Evolution of Sequencing Gen 0: Proto Sequencing
  • 23. Evolution of Sequencing Gen 0: Proto Sequencing Sanger Maxam-Gilbert Gen 1: Manual Sequencing
  • 28. Nobel Prize 1980: Berg, Gilbert, Sanger http://www.nobelprize.org/nobel_prizes/chemistry/laureates/1980/
  • 29. Evolution of Sequencing Gen 0: Proto Sequencing Sanger Maxam-Gilbert Gen 1: Manual Sequencing Gen 2: Automated Sanger
  • 30. Automation of Sanger Part I Sanger method with labeled dNTPs The Sanger mehtods is based on the idea that inhibitors can terminate elongation of DNA at specific points
  • 31. Many Systems for Sanger Automation Thanks to Robin Coope and Dale Yazuki for comments on 2014 talk ABI 3700 ABI 3730 ABI 3730xl Megabase 1000 Megabase 4000 LiCor
  • 33. Some Automated Sanger Highlights • 1991: ESTs by Venter • 1995: H. influenzae shotgun genome • 1996: Yeast, archaeal genomes • 1998: 1st animal genome - C. elegans • 1999: Drosophila shotgun genome • 2000: Arabidopsis genome • 2000: Human genome • 2004: Shotgun metagenomics Thanks to Keith Bradnam for C. elegans suggestion
  • 34. Evolution of Sequencing Gen 0: Proto Sequencing Sanger Maxam-Gilbert ABI-Solid Solexa- Illumina 454- Roche Ion Torrent Gen 1: Manual Sequencing Gen 2: Automated Sanger Gen 3: Clusters Not Clones
  • 36. Generation III = “NextGen”
  • 37. NextGen Sequencing OutlineNext-generation sequencing platforms Isolation and purification of target DNA Sample preparation Library validation Cluster generation on solid-phase Emulsion PCR Sequencing by synthesis with  3’-blocked reversible terminators Pyrosequencing Sequencing by ligation Single colour imaging Sequencing by synthesis with  3’-unblocked reversible terminators AmplificationSequencingImaging Four colour imaging Data analysis Roche 454Illumina GAII ABi SOLiD Helicos HeliScope From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/cosentia/high-throughput-equencing
  • 38. 454 Pyrosequencing Sanger method - ABi SOLiD HeliScope Nanopore Roche 454 umina GAII From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/cosentia/high- throughput-equencing NextGen #1: 454
  • 39. NextGen #2: ABI SolidSequecning by ligation Sanger method Roche 454 - HeliScope Nanopore ABi SOLiD Illumina GAII From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/ cosentia/high-throughput- equencing
  • 40. NextGen #3: Solexa Sequecning by synthesis with reversible terminator anger ethod he 454 SOLiD iScope nopore - From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/cosentia/high-throughput-equencing
  • 41. NextGen #2: Illumina Sequecning by synthesis with reversible terminator anger ethod he 454 SOLiD iScope nopore - From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/cosentia/high-throughput-equencing
  • 42. Illumina Outline Clusters amplification Clusterstation Wash cluster station Clustergeneration Linearization, Blocking and primer Hybridization Read 1 Prepare read 2 Read 2 GAIIx&PE SBSsequencing Pipeline base call Data analysis Sample preparation and library validation Analysis Sequencing workflow Sample preparation Clusters amplification Sequencing by synthesis Analysis pipeline Introduction Illumina GAII High throughput From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/ cosentia/high-throughput- equencing
  • 43. Illumina Step 1: Prep & Attach DNA termined by user-defined instrument settings, which permits discrete read lengths of 25–35 the Illumina data from each run, removing poor-quality sequences. Adapter DNA fragment Dense lawn of primers Adapter Attached DNA Adapters Prepare genomic DNA sample Randomly fragment genomic DNA and ligate adapters to both ends of the fragments. Attach DNA to surface Bind single-stranded fragments randomly to the inside surface of the flow cell channels. Nucleotides a From Mardis 2008. Annual Rev. Genetics 9: 387. Step 1: Sample Preparation The DNA sample of interest is sheared to appropriate size (average ~800bp) using a compressed air device known as a nebulizer. The ends of the DNA are polished, and two unique adapters are ligated to the fragments. Ligated fragments of the size range of 150-200bp are isolated via gel extraction and amplified using limited cycles of PCR
  • 44. Illumina Step 2: Clusters by Bridge PCR Attached Prepare genomic DNA sample Randomly fragment genomic DNA and ligate adapters to both ends of the fragments. Attach DNA to surface Bind single-stranded fragments randomly to the inside surface of the flow cell channels. Bridge amplification Add unlabeled nucleotides and enzyme to initiate solid- phase bridge amplification. Denature the double stranded molecules Nucleotides Figure 2 The Illumina sequencing-by-synthesis approach. Cluster strands created by bridge amplification are primed and all four fluorescently labeled, 3′-OH blocked nucleotides are added to the flow cell with DNA polymerase. The cluster strands are extended by one nucleotide. Following the incorporation step, the unused nucleotides and DNA polymerase molecules are washed away, a scan buffer is added to the flow cell, and the optics system scans each lane of the flow cell by imaging units called tiles. Once imaging is completed, chemicals that effect cleavage of the fluorescent labels and the 3′-OH blocking groups are added to the flow cell, which prepares the cluster strands for another round of fluorescent nucleotide incorporation. 392 Mardis • From : http://seqanswers.com/forums/showthread.php?t=21. Steps 2-6: Cluster Generation by Bridge Amplification. In contrast to the 454 and ABI methods which use a bead-based emulsion PCR to generate "polonies", Illumina utilizes a unique "bridged" amplification reaction that occurs on the surface of the flow cell. The flow cell surface is coated with single stranded oligonucleotides that correspond to the sequences of the adapters ligated during the sample preparation stage. Single-stranded, adapter-ligated fragments are bound to the surface of the flow cell exposed to reagents for polyermase-based extension. Priming occurs as the free/distal end of a ligated fragment "bridges" to a complementary oligo on the surface. Repeated denaturation and extension results in localized amplification of single molecules in millions of unique locations across the flow cell surface. This process occurs in what is referred to as Illumina's "cluster station", an automated flow cell processor. From Mardis 2008. Annual Rev. Genetics 9: 387.
  • 46. Illumina Step 3: Sequencing 25–35 bp, and each sequencing run yields be- tween 2–4 Gb of DNA sequence data. Once generation read type has a unique error model different from that already established for b Laser First chemistry cycle: determine first base To initiate the first sequencing cycle, add all four labeled reversible terminators, primers, and DNA polymerase enzyme to the flow cell. Image of first chemistry cycle After laser excitation, capture the image of emitted fluorescence from each cluster on the flow cell. Record the identity of the first base for each cluster. Sequence read over multiple chemistry cycles Repeat cycles of sequencing to determine the sequence Before initiating the next chemistry cycle The blocked 3' terminus and the fluorophore from each incorporated base are removed. GCTGA... From Mardis 2008. Annual Rev. Genetics 9: 387. From : http://seqanswers.com/forums/showthread.php?t=21. Steps 7-12: Sequencing by Synthesis. A flow cell containing millions of unique clusters is now loaded into the 1G sequencer for automated cycles of extension and imaging. The first cycle of sequencing consists first of the incorporation of a single fluorescent nucleotide, followed by high resolution imaging of the entire flow cell. These images represent the data collected for the first base. Any signal above background identifies the physical location of a cluster (or polony), and the fluorescent emission identifies which of the four bases was incorporated at that position. This cycle is repeated, one base at a time, generating a series of images each representing a single base extension at a specific cluster. Base calls are derived with an algorithm that identifies the emission color over time. At this time reports of useful Illumina reads range from 26-50 bases.
  • 47. SBS technology Sample eparation Clusters mplification uencing by ynthesis Analysis pipeline troduction umina GAII High roughput From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/cosentia/high-throughput-equencing Illumina Step 3: Sequencing
  • 48. Illumina Cycling Laser terminators, primers, and DNA polymerase enzyme to the flow cell. Image of first chemistry cycle After laser excitation, capture the image of emitted fluorescence from each cluster on the flow cell. Record the identity of the first base for each cluster. Sequence read over multiple chemistry cycles Repeat cycles of sequencing to determine the sequence of bases in a given fragment a single base at a time. Before initiating the next chemistry cycle The blocked 3' terminus and the fluorophore from each incorporated base are removed. GCTGA... Figure 2 (Continued ) www.annualreviews.org • Next-Generation DNA Sequencing Methods 393 From Mardis 2008. Annual Rev. Genetics 9: 387.
  • 50.
  • 51. Generation 3.5: Faster w/ Clusters • Even faster
  • 52. Generation 3.5: Faster w/ Clusters - Ion Torrent
  • 53. Ion Torrent pH Based Sequencing Mardis ER. Next-generation sequencing platforms. Annu Rev Anal Chem 2013;6:287-303.
  • 54. Oxford NanoPore Pacbio Helicos Gen 4: Single Molecule Gen 0: Proto Sequencing Sanger Maxam-Gilbert ABI-Solid Solexa- Illumina 454- Roche Ion Torrent Gen 1: Manual Sequencing Gen 2: Automated Sanger Gen 3: Clusters Not Clones Gen 4: Single Molecule
  • 55. Single Molecule I: Helicos 3rd Generation Sequencing y w for cular ent_sequencing
  • 56. Single Molecule II: Pacific Biosciences
  • 57. Mardis ER. Next-generation sequencing platforms. Annu Rev Anal Chem 2013;6:287-303. Single Molecule II: Pacific Biosciences
  • 58. Analytical Chemistry REVIEW Φ29 polymerase. Each amplified product of a circularized fragment is called a DNA nanoball (DNB). DNBs are selectively attached to a hexamethyldisilizane (HMDS) coated silicon chip that is photolithographically patterned with aminosilane active sites. Figure 3A illustrates the DNB array design. The use of the DNBs coupled with the highly patterned array offers several advantages. The production of DNBs increases signal intensity by simply increasing the number of hybridization sites available for probing. Also, the size of the DNB is on the same length scale as the active site or “sticky” spot patterned on Each hybridization and ligation cycle is followed by fluorescent imaging of the DNB spotted chip and subsequently regeneration of the DNBs with a formamide solution. This cycle is repeated until the entire combinatorial library of probes and anchors is examined. This formula of the use of unchained reads and regeneration of the sequencing fragment reduces reagent con- sumption and eliminates potential accumulation errors that can arise in other sequencing technologies that require close to completion of each sequencing reaction.19,52,53 Complete Genomics showcased their DNB array and cPAL Figure 2. Schematic of PacBio’s real-time single molecule sequencing. (A) The side view of a single ZMW nanostructure containing a single DNA polymerase (Φ29) bound to the bottom glass surface. The ZMW and the confocal imaging system allow fluorescence detection only at the bottom surface of each ZMW. (B) Representation of fluorescently labeled nucleotide substrate incorporation on to a sequencing template. The corresponding temporal fluorescence detection with respect to each of the five incorporation steps is shown below. Reprinted with permission from ref 39. Copyright 2009 American Association for the Advancement of Science. Figure 2. Schematic of PacBio’s real-time single molecule sequencing. (A) The side view of a single ZMW nanostructure containing a single DNA polymerase (Φ29) bound to the bottom glass surface. The ZMW and the confocal imaging system allow fluorescence detection only at the bottom surface of each ZMW. (B) Representation of fluorescently labeled nucleotide substrate incorporation on to a sequencing template. The corresponding temporal fluorescence detection with respect to each of the five incorporation steps is shown below. From Niedringhaus et al. Analytical Chemistry 83: 4327. 2011. Single Molecule II: Pacific Biosciences
  • 59. Detecting Modified Bases Page 2 www.pacb.com/basemod processes. The potential benefits of detecting base modification, using SMRT sequencing, include: Single-base resolution detection of a wide two successive base incorporations, are altered by the presence of a modified base in the DNA template 3 . This is observable as an increased space between fluorescence pulses, which is called the interpulse duration (IPD), as shown in Figure 2. Figure 2. Principle of detecting modified DNA bases during SMRT sequencing. The presence of the modified base in the DNA template (top), shown here for 6-mA, results in a delayed incorporation of the corresponding T nucleotide, i.e. longer interpulse duration (IPD), compared to a control DNA template lacking the modification (bottom). 3 http://www.pacificbiosciences.com/pdf/microbial_primer.pdf
  • 60. Single Molecule III: Oxford Nanopores This diagram shows a protein nanopore set in an electrically resistant membrane bilayer. An ionic current is passed through the nanopore by setting a voltage across this membrane. If an analyte passes through the pore or near its aperture, this event creates a characteristic disruption in current. By measuring that current it is possible to identify the molecule in question. For example, this system can be used to distinguish the four standard DNA bases and G, A, T and C, and also modified bases. It can be used to identify target proteins, small molecules, or to gain rich molecular information for example to distinguish the enantiomers of ibuprofen or molecular binding dynamics. From Oxford Nanopores Web Site
  • 61. • Figure6. BiologicalnanoporeschemeemployedbyOxfordNanopore. (A)SchematicofRHLproteinnanoporemutantdepictingthepositionsofthe cyclodextrin (at residue 135) and glutamines (at residue 139). (B) A detailed view of the β barrel of the mutant nanopore shows the locations of the arginines (at residue 113) and the cysteines. (C) Exonuclease sequencing: A processive enzyme is attached to the top of the nanopore to cleave single nucleotides from the target DNA strand and pass them through the nanopore. (D) A residual current-vs-time signal trace from an RHL protein nanopore that shows a clear discrimination between single bases (dGMP, dTMP, dAMP, and dCMP). (E) Strand sequencing: ssDNA is threaded through a protein nanopore and individual bases are identified, as the strand remains intact. Panels A, B, and D reprinted with permission from ref 91. Copyright 2009 Nature Publishing Group. Panels C and E reprinted with permission from Oxford Nanopore Technologies (Zoe McDougall). nalytical Chemistry REVIEW etected across a metal oxide-silicon layered structure. The translocated through a solid-state nanopore. In addition, sever igure 6. Biological nanopore scheme employed by Oxford Nanopore. (A) Schematic of RHL protein nanopore mutant depicting the positions of t yclodextrin (at residue 135) and glutamines (at residue 139). (B) A detailed view of the β barrel of the mutant nanopore shows the locations he arginines (at residue 113) and the cysteines. (C) Exonuclease sequencing: A processive enzyme is attached to the top of the nanopore to cleave sing ucleotides from the target DNA strand and pass them through the nanopore. (D) A residual current-vs-time signal trace from an RHL protein nanopo hat shows a clear discrimination between single bases (dGMP, dTMP, dAMP, and dCMP). (E) Strand sequencing: ssDNA is threaded through rotein nanopore and individual bases are identified, as the strand remains intact. Panels A, B, and D reprinted with permission from ref 91. Copyrig 009 Nature Publishing Group. Panels C and E reprinted with permission from Oxford Nanopore Technologies (Zoe McDougall). Figure6. Biological nanopores cheme employed by Oxford Nanopore.(A) Schematic of RHL protein nano pore mutant depicting the positions of the cyclodextrin (at residue 135) and glutamines (at residue 139). (B) A detailed view of the β barrel of the mutant nanopore shows the locations of the arginines (at residue 113) and the cysteines. (C) Exonuclease sequencing: A processive enzyme is attached to the top of the nanopore to cleave single nucleotides from the target DNA strand and pass them through the nanopore. (D) A residual current-vs-time signal trace from an RHL protein nanopore that shows a clear discrimination between single bases (dGMP, dTMP, dAMP, and dCMP). (E) Strand sequencing: ssDNA is threaded through a protein nanopore and individual bases are identified, as the strand remains intact. Panels A, B, and D reprinted with permission from ref 91. Copyright 2009 Nature Publishing Group. Panels C and E reprinted with permission from Oxford Nanopore Technologies (Zoe McDougall). From Niedringhaus et al. Analytical Chemistry 83: 4327. 2011. Single Molecule III: Oxford Nanopores
  • 62. Nanopores Analytical Chemistry REVIEW would result, in theory, in detectably altered current flow through the pore. Theoretically, nanopores could also be designed to measure tunneling current across the pore as bases, each with a distinct tunneling potential, could be read. The nanopore ap- lipid bilayer, using ionic current blockage method. The autho predicted that single nucleotides could be discriminated as lon as: (1) each nucleotide produces a unique signal signature; (2 the nanopore possesses proper aperture geometry to accommo Figure 5. Nanopore DNA sequencing using electronic measurements and optical readout as detection methods. (A) In electronic nanopore scheme signal is obtained through ionic current,73 tunneling current,78 and voltage difference79 measurements. Each method must produce a characteristic sign to differentiate the four DNA bases. Reprinted with permission from ref 83. Copyright 2008 Annual Reviews. (B) In the optical readout nanopore desig each nucleotide is converted to a preset oligonucleotide sequence and hybridized with labeled markers that are detected during translocation of the DN fragment through the nanopore. Reprinted from ref 82. Copyright 2010 American Chemical Society. Nanopore DNA sequencing using electronic measurements and optical readout as detection methods.(A)In electronic nanopore schemes, signal is obtained through ionic current,73 tunneling current, and voltage difference measurements. Each method must produce a characteristic signal to differentiate the four DNA bases. (B) In the optical readout nanopore design, each nucleotide is converted to a preset oligonucleotide sequence and hybridized with labeled markers that are detected during translocation of the DNA fragment through the nanopore. Single Molecule III: Oxford Nanopores From Niedringhaus et al. Analytical Chemistry 83: 4327. 2011.
  • 63. • http://www.nature.com/polopoly_fs/7.15596.1392415346!/ image/image001.jpg_gen/derivatives/landscape_630/ image001.jp “It’s kind of a cute device,” says Jaffe of the MinION, which is roughly the size and shape of a packet of chewing gum. “It has pretty lights and a fan that hums pleasantly, and plugs into a USB drive.” But his technical review is mixed. From http:// www.nature.com/news/data-from-pocket-sized-genome-sequencer-unveiled-1.14724 Single Molecule III: Oxford Nanopores
  • 64. Real Time Sequencing? Slide Provided by Nick Loman
  • 65. Bells and Whistles • Multiplexing • Paired end • Mate pair • Moleculo
  • 68. Small Amounts of DNA http://www.epibio.com/docs/default-source/protocols/nextera-dna-sample-prep-kit-(illumina--compatible).pdf?sfvrsn=4
  • 69. Capture Methods High throughput sample preparation Sample preparation Clusters amplification Sequencing by synthesis Analysis pipeline Introduction Illumina GAII High throughput Nature Methods, 2010, 7: 111-118 RainDance Microdroplet PCR Roche Nimblegen Salid-phase capture with custom- designed oligonucleotide microarray Reported 84% of capture efficiency Reported 65-90% of capture efficiency From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/cosentia/high-throughput-equencing
  • 70. High throughput sample preparation Sample preparation Clusters amplification Sequencing by synthesis Analysis pipeline Introduction Illumina GAII High throughput Agilent SureSelect Solution-phase capture with streptavidin-coated magnetic beads Reported 60-80% of capture efficiency From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/cosentia/high- throughput-equencing Capture Methods
  • 71. Illumina Paired EndsPaired-end technology Paired-end sequencing works into GA and uses chemicals from the PE module to perform cluster amplification of the reverse strandSample preparation Clusters amplification Sequencing by synthesis Analysis pipeline Introduction Illumina GAII High throughput From Slideshare presentation of Cosentino Cristian http://www.slideshare.net/ cosentia/high-throughput- equencing
  • 72. Moleculo Large fragments DNA Isolate and amplify CACC GGAA TCTC ACGT AAGG GATC AAAA Sublibrary w/ unique barcodes Sequence w/ Illumina Assemble seqs w/ same codes
  • 73. HiC Suggested by Carlos Bustamante and Keith Bradnam
  • 74. HiC Crosslinking & Sequencing Beitel CW, Froenicke L, Lang JM, Korf IF, Michelmore RW, Eisen JA, Darling AE. (2014) Strain- and plasmid-level deconvolution of a synthetic metagenome by sequencing proximity ligation products. PeerJ 2:e415 http://dx.doi.org/ 10.7717/peerj.415 Table 1 Species alignment fractions. The number of reads aligning to each replicon present in the synthetic microbial community are shown before and after filtering, along with the percent of total constituted by each species. The GC content (“GC”) and restriction site counts (“#R.S.”) of each replicon, species, and strain are shown. Bur1: B. thailandensis chromosome 1. Bur2: B. thailandensis chromosome 2. Lac0: L. brevis chromosome, Lac1: L. brevis plasmid 1, Lac2: L. brevis plasmid 2, Ped: P. pentosaceus, K12: E. coli K12 DH10B, BL21: E. coli BL21. An expanded version of this table can be found in Table S2. Sequence Alignment % of Total Filtered % of aligned Length GC #R.S. Lac0 10,603,204 26.17% 10,269,562 96.85% 2,291,220 0.462 629 Lac1 145,718 0.36% 145,478 99.84% 13,413 0.386 3 Lac2 691,723 1.71% 665,825 96.26% 35,595 0.385 16 Lac 11,440,645 28.23% 11,080,865 96.86% 2,340,228 0.46 648 Ped 2,084,595 5.14% 2,022,870 97.04% 1,832,387 0.373 863 BL21 12,882,177 31.79% 2,676,458 20.78% 4,558,953 0.508 508 K12 9,693,726 23.92% 1,218,281 12.57% 4,686,137 0.507 568 E. coli 22,575,903 55.71% 3,894,739 17.25% 9,245,090 0.51 1076 Bur1 1,886,054 4.65% 1,797,745 95.32% 2,914,771 0.68 144 Bur2 2,536,569 6.26% 2,464,534 97.16% 3,809,201 0.672 225 Bur 4,422,623 10.91% 4,262,279 96.37% 6,723,972 0.68 369 Figure 1 Hi-C insert distribution. The distribution of genomic distances between Hi-C read pairs is shown for read pairs mapping to each chromosome. For each read pair the minimum path length on the circular chromosome was calculated and read pairs separated by less than 1000 bp were discarded. The 2.5 Mb range was divided into 100 bins of equal size and the number of read pairs in each bin was recorded for each chromosome. Bin values for each chromosome were normalized to sum to 1 and plotted. E. coli K12 genome were distributed in a similar manner as previously reported (Fig. 1; (Lieberman-Aiden et al., 2009)). We observed a minor depletion of alignments spanning the linearization point of the E. coli K12 assembly (e.g., near coordinates 0 and 4686137) due to edge eVects induced by BWA treating the sequence as a linear chromosome rather than circular. 10.7717/peerj.415 9/19 Figure 2 Metagenomic Hi-C associations. The log-scaled, normalized number of Hi-C read pairs associating each genomic replicon in the synthetic community is shown as a heat map (see color scale, blue to yellow: low to high normalized, log scaled association rates). Bur1: B. thailandensis chromosome 1. Bur2: B. thailandensis chromosome 2. Lac0: L. brevis chromosome, Lac1: L. brevis plasmid 1, Lac2: L. brevis plasmid 2, Ped: P. pentosaceus, K12: E. coli K12 DH10B, BL21: E. coli BL21. reference assemblies of the members of our synthetic microbial community with the same alignment parameters as were used in the top ranked clustering (described above). We first Figure 3 Contigs associated by Hi-C reads. A graph is drawn with nodes depicting contigs and edges depicting associations between contigs as indicated by aligned Hi-C read pairs, with the count thereof depicted by the weight of edges. Nodes are colored to reflect the species to which they belong (see legend) with node size reflecting contig size. Contigs below 5 kb and edges with weights less than 5 were excluded. Contig associations were normalized for variation in contig size. typically represent the reads and variant sites as a variant graph wherein variant sites are represented as nodes, and sequence reads define edges between variant sites observed in the same read (or read pair). We reasoned that variant graphs constructed from Hi-C data would have much greater connectivity (where connectivity is defined as the mean path length between randomly sampled variant positions) than graphs constructed from mate-pair sequencing data, simply because Hi-C inserts span megabase distances. Such Figure 4 Hi-C contact maps for replicons of Lactobacillus brevis. Contact maps show the number of Hi-C read pairs associating each region of the L. brevis genome. The L. brevis chromosome (Lac0, (A), Chris Beitel @datscimed Aaron Darling @koadman