1. Sanger sequencing
The chain termination method
By-
Dr. Dinesh C. Sharma
Head, Zoology
K.M. Govt. Girls P. G. College
Badalpur, G.B. Nagar
dr_dineshsharma@hotmail.com
2. DNA sequencing is the process of determining
the sequence of nucleotides (A, T, G, and C) in
DNA. It includes any method or technology that
is used to determine the order of the four bases:
adenine, thymine, guanine and cytosine.
Sequencing an entire genome of an organism
remains a complex task. It requires breaking
the DNA of the genome into many smaller
pieces, sequencing the pieces, and assembling
the sequences into a single long "consensus."
However, new methods developed over the past two
decades have made this easier: genome sequencing is
now much faster and less expensive than it was
during the Human Genome Project.
3. The chain-termination method was developed by Frederick Sanger and
coworkers in 1977. This method used fewer toxic chemicals and
lower amounts of radioactivity than the Maxam and Gilbert
method. Because of its comparative ease, the Sanger method was
soon automated and was the method used in the first generation
of DNA sequencers.
The Sanger method, in mass production form, is the technology
which produced the first human genome in 2001. In the Human
Genome Project, Sanger sequencing was used to determine the
sequences of many relatively small fragments of human DNA.
(These fragments weren't necessarily 900 bp or less, but
researchers were able to "walk" along each fragment using
multiple rounds of Sanger sequencing.) The fragments were
aligned based on overlapping portions to assemble the sequences
of larger regions of DNA and, eventually, entire chromosomes.
Although genomes are now typically sequenced using other
methods that are faster and less expensive, Sanger sequencing is
still in wide use for the sequencing of individual pieces of DNA,
such as fragments used in DNA cloning or generated
through polymerase chain reaction (PCR).
It was first commercialized by Applied Biosystems in 1986.
4. Requirements for Sanger sequencing
Sanger sequencing makes many copies of a target DNA
region. Its raw materials are similar to those required for
DNA replication in an organism, or for polymerase chain
reaction (PCR), which copies DNA in vitro.
They include:
• A DNA polymerase enzyme
• A primer, which acts as a "starter" for the DNA polymerase
• The four DNA nucleotides (dATP, dTTP, dCTP, dGTP)
• The template DNA to be sequenced
A Sanger sequencing reaction also contains a unique
ingredient:
• Dideoxy (dd), or chain-terminating,
versions of all four nucleotides (ddATP, ddTTP, ddCTP,
ddGTP), each labeled with a different color of
dye
5. Dideoxy nucleotides lack a hydroxyl
group on the 3’ carbon of the sugar
ring. In a regular nucleotide, the 3’
hydroxyl group acts as a "hook,"
allowing a new nucleotide to be added
to an existing chain.
Once a dideoxy nucleotide has been
added to the chain, there is no
hydroxyl available and no further
nucleotides can be added.
The chain ends with the dideoxy
nucleotide, which is marked with a
particular color of dye depending on
the base (A, T, C or G) that it carries.
[Figure labels: normal deoxynucleotide triphosphates (dNTPs); modified dideoxynucleotide triphosphates (ddNTPs)]
9. 2-Denaturation of DNA by heating at 95 °C
To produce a complementary strand and the template strand for DNA sequencing
[Figure: double-stranded DNA (one strand 3′ to 5′, the other 5′ to 3′) separated into two single strands by heat]
10. 3-A primer is then annealed to the 3′ end of the template strand
4-The primed DNA is then dispersed equally among four vessels
[Figure: template strand (3′ T A C G C A T A 5′) with the primer (T A T) base-paired to it]
11. 5-DNA polymerase is added to all 4 reaction vessels
6-All four dNTPs (dATP, dTTP, dGTP, dCTP) are added to each vessel
[Figure: four vessels, each containing DNA polymerase (DNA P) and A, G, C, T]
12. 7-Modified ddNTPs are added to the reaction vessels: ddATP, ddTTP, ddGTP and ddCTP, one per vessel
[Figure: four vessels labelled A, T, G, C; template strand (3′ T A T G C A T A 5′) with primer (T A T)]
13. 8-DNA polymerase extends the primer along the template
strand by adding dNTPs normally until a ddNTP is base-paired. Once a
ddNTP is incorporated, chain termination occurs.
[Figure: DNA polymerase adding A, T, C, G and ddATP to the primer (T A T) on the template strand (3′ T A T G C A T A 5′)]
14. 9-Once the ddNTP is base-paired, the chain is
terminated because the ddNTP lacks the -OH group at the 3′ carbon
10-As a result of chain termination, DNA fragments of
different lengths are formed in all four vessels
[Figure: terminated chain on the template strand (3′ T A T G C A T A 5′); four vessels containing ddATP, ddTTP, ddGTP and ddCTP]
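Steps 8 to 10 can be illustrated with a minimal Python sketch. It assumes an idealized reaction in which every possible termination position is represented by at least one molecule. The example strand `ATGCAT` and the function name `termination_fragments` are hypothetical and chosen only for illustration.

```python
def termination_fragments(new_strand):
    """Map each ddNTP vessel to the fragment lengths it would contain.

    Idealized assumption: in the vessel holding ddNTP X, a fragment ends at
    every position where the newly synthesized strand carries base X.
    """
    vessels = {"A": [], "T": [], "G": [], "C": []}
    for length, base in enumerate(new_strand, start=1):
        vessels[base].append(length)
    return vessels


# Hypothetical newly synthesized strand, written 5' to 3' (primer not counted).
fragments = termination_fragments("ATGCAT")
for base, lengths in fragments.items():
    print(f"dd{base}TP vessel: fragment lengths {lengths}")
```

Each fragment length appears in exactly one vessel, which is what later allows the four gel lanes to be combined into a single sequence.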
15. Polyacrylamide gel electrophoresis is used to sequence DNA
[Figure: gel with four lanes labelled A, T, G, C]
16. • DNA migrates from the negative pole towards the positive pole, due to the
negative charge imparted by the phosphate backbone
• Smaller (lighter) DNA fragments migrate more rapidly than larger
DNA fragments
• As a result, different bands are observed on the plate
[Figure: gel with lanes A, C, G, T; bands labelled "Small & lighter" and "Large & heavy"]
17. The sequence is read from the bottom of the plate
[Figure: gel with lanes A, C, G, T; bands from top to bottom: T C A T G G T A T T C A C G G A T A G T C G A]
18. 5′ A G C T G A T A G G C A C T T A T G G T A C T 3′
3′ T C G A C T A T C C G T G A A T A C C A T G A 5′
[Figure: the same gel bands (top to bottom: T C A T G G T A T T C A C G G A T A G T C G A), read from the bottom to give the 5′ to 3′ strand]
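The gel reading on slides 17 and 18 can be checked with a short Python sketch. The band order is taken from the slides. The variable names are assumptions for illustration, and the sketch simply reverses the band order (bottom of the gel first) and then complements each base.

```python
# Bands as they appear on the gel from top to bottom (taken from the slides).
bands_top_to_bottom = "TCATGGTATTCACGGATAGTCGA"

# The shortest fragments run furthest, so reading from the bottom of the gel
# gives the newly synthesized strand in the 5' to 3' direction.
read_5_to_3 = bands_top_to_bottom[::-1]

# Complement each base to obtain the paired strand (written 3' to 5').
COMPLEMENT = str.maketrans("ACGT", "TGCA")
paired_3_to_5 = read_5_to_3.translate(COMPLEMENT)

print("5'", read_5_to_3, "3'")
print("3'", paired_3_to_5, "5'")
```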
19. Method of Sanger sequencing
• The DNA sample is divided into four separate
sequencing reactions, each containing all four of the
standard deoxynucleotides (dATP, dGTP, dCTP and dTTP) and
the DNA polymerase.
• To each reaction is added only one of the four
dideoxynucleotides (ddATP, ddTTP,
ddGTP or ddCTP), while the other added
nucleotides are ordinary ones (dNTPs).
• The ddNTP concentration should be
approximately 100-fold higher than that of the
corresponding dNTP (e.g. 0.5mM ddTTP :
0.005mM dTTP) to allow enough fragments to
be produced while still copying the
complete sequence (but the concentration of
ddNTP also depends on the desired length of
sequence).
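The trade-off in the last bullet (enough terminated fragments, yet chains long enough to cover the sequence) can be explored with a small simulation. This is a sketch under stated assumptions. `p_terminate` is a hypothetical per-position probability that the polymerase incorporates the ddNTP instead of the dNTP. It stands in for the combined effect of the ddNTP:dNTP ratio and the polymerase's preference and is not derived from the concentrations quoted above. The example strand is also hypothetical.

```python
import random
from statistics import mean


def simulate_reaction(new_strand, dd_base, p_terminate, n_molecules, rng):
    """Return the product length of each molecule in one reaction vessel."""
    lengths = []
    for _ in range(n_molecules):
        length = len(new_strand)  # full length if no ddNTP is incorporated
        for position, base in enumerate(new_strand, start=1):
            # Only positions matching the vessel's ddNTP can terminate.
            if base == dd_base and rng.random() < p_terminate:
                length = position
                break
        lengths.append(length)
    return lengths


rng = random.Random(0)  # fixed seed so the run is repeatable
strand = "ATGCATTAGCATGCAATGCA"  # hypothetical new strand, 5' to 3'
for p in (0.05, 0.5):
    lengths = simulate_reaction(strand, "A", p, 1000, rng)
    print(f"p_terminate={p}: mean product length {mean(lengths):.1f}")
```

A high termination probability concentrates the products near the primer, while a low one leaves most chains unterminated, which is why the ratio has to be tuned to the desired read length.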
20. • Four separate reactions are needed in this process
to test all four ddNTPs. Following rounds of
template DNA extension from the bound primer,
the resulting DNA fragments are heat denatured
and separated by size using gel electrophoresis.
• The separation is frequently performed using a denaturing
polyacrylamide-urea gel, with each of the four
reactions run in one of four individual lanes (lanes
A, T, G, C). In the original publication of 1977, the formation of
base-paired loops of ssDNA was a cause of serious
difficulty in resolving bands at some locations.
• The DNA bands may then be visualized
by autoradiography or UV light, and the DNA
sequence can be directly read off the X-ray film or
gel image.
21. • DNA fragments are labelled
with a radioactive or
fluorescent tag in one of three ways:
on the primer, in the new DNA
strand (using a labelled dNTP),
or at the chain end (using a labelled ddNTP).
• Chain-termination methods
have greatly simplified DNA
sequencing. For example,
chain-termination-based
kits are commercially
available that contain the
reagents needed for
sequencing, pre-aliquoted
and ready to use.
22. Dye-terminator sequencing
Dye-terminator sequencing utilizes labelling of the chain terminator ddNTPs,
which permits sequencing in a single
reaction, rather than four reactions
as in the labelled-primer method. In dye-terminator
sequencing, each of the four
dideoxynucleotide chain terminators is labelled
with a fluorescent dye, each of which emits light at
a different wavelength.
Owing to its greater expediency and speed, dye-terminator
sequencing is now the mainstay in
automated sequencing.
Its limitations include dye effects due to differences in the incorporation of the
dye-labelled chain terminators into the DNA fragment, resulting in unequal peak
heights and shapes in the electronic DNA sequence trace chromatogram after
capillary electrophoresis. This problem has been addressed with the use of
modified DNA polymerase enzyme systems and dyes that minimize incorporation
variability, as well as methods for eliminating "dye blobs". The dye-terminator
sequencing method, along with automated high-throughput DNA sequence
analyzers, was used for the vast majority of sequencing projects.
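Because each terminator carries its own dye, the base at each peak of the trace can in principle be called from the strongest of the four fluorescence channels. The following Python sketch illustrates that idea only. The intensity values and the function name `call_bases` are hypothetical, and real base-calling software also has to handle the unequal peak heights described above.

```python
def call_bases(peaks):
    """Call one base per peak by picking the channel with the highest intensity.

    `peaks` is a list of dicts mapping a base (A, C, G, T) to the fluorescence
    intensity recorded in that dye's channel at one peak position.
    """
    return "".join(max(peak, key=peak.get) for peak in peaks)


# Hypothetical four-channel intensities for three consecutive peaks.
example_peaks = [
    {"A": 910, "C": 35, "G": 42, "T": 28},
    {"A": 51, "C": 47, "G": 780, "T": 33},
    {"A": 40, "C": 655, "G": 38, "T": 60},
]
print(call_bases(example_peaks))
```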
23. Automated DNA-sequencing instruments (DNA
sequencers) can sequence up to 384 DNA samples
in a single batch. Batch runs may occur up to 24
times a day. DNA sequencers separate strands by
size (or length) using capillary electrophoresis,
detect and record dye fluorescence, and
output data as fluorescent peak trace
chromatograms. Sequencing reactions
(thermocycling and labelling), cleanup and re-
suspension of samples in a buffer solution are
performed separately, before loading samples onto
the sequencer. A number of commercial and non-
commercial software packages can trim low-
quality DNA traces automatically. These programs
score the quality of each peak and remove low-
quality base peaks (which are generally located at
the ends of the sequence). The accuracy of such
algorithms is inferior to visual examination by a
human operator, but is adequate for automated
processing of large sequence data sets.
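Automatic trimming can be sketched as follows, assuming each base carries a Phred-style quality score Q related to the estimated error probability P by Q = -10 log10(P). The threshold of 20, the example read and the function names are assumptions for illustration. This simple end-trimming rule is not the algorithm of any particular software package.

```python
def error_probability(q):
    """Convert a Phred-style quality score into an estimated error probability."""
    return 10 ** (-q / 10)


def trim_low_quality(bases, quals, min_q=20):
    """Remove bases from both ends until a base with quality >= min_q is found."""
    start, end = 0, len(bases)
    while start < end and quals[start] < min_q:
        start += 1
    while end > start and quals[end - 1] < min_q:
        end -= 1
    return bases[start:end], quals[start:end]


# Hypothetical read whose ends are of poor quality.
read = "ACGTAC"
scores = [5, 10, 30, 40, 35, 8]
trimmed_read, trimmed_scores = trim_low_quality(read, scores)
print(trimmed_read, trimmed_scores)
print(f"Q20 corresponds to an error probability of {error_probability(20):.2f}")
```

Only the ends are trimmed here, matching the observation that low-quality peaks are generally located at the ends of the sequence.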
25. Challenges
• Poor quality in the first 15-40 bases of the sequence due
to primer binding and deteriorating quality of sequencing
traces after 700-900 bases. Base calling software such as
Phred typically provides an estimate of quality to aid in
trimming of low-quality regions of sequences.
• In cases where DNA fragments are cloned before
sequencing, the resulting sequence may contain parts of
the cloning vector. In contrast, PCR-based cloning and
next-generation sequencing technologies based on
pyrosequencing often avoid using cloning vectors.
• One-step Sanger sequencing (combined amplification and
sequencing) methods such as Ampliseq and SeqSharp
have been developed that allow rapid sequencing of
target genes without cloning or prior amplification.
• Current methods can directly sequence only relatively
short (300-1000 nucleotides long) DNA fragments in a
single reaction.
• The main obstacle to sequencing DNA fragments above
this size limit is insufficient power of separation for
resolving large DNA fragments that differ in length by only
one nucleotide.
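One way to see why the last two points limit read length is to look at the relative size difference between neighbouring fragments. The sketch below assumes only simple proportional reasoning (a one-nucleotide difference on a fragment of n nucleotides is a fraction 1/n of its length). It does not model the actual physics of gel or capillary separation, and the fragment lengths are illustrative values.

```python
def relative_length_difference(n):
    """Fractional size difference between fragments of n and n + 1 nucleotides."""
    return 1 / n


for n in (50, 300, 1000):
    print(f"{n} nt vs {n + 1} nt: {relative_length_difference(n):.2%} difference")
```

The difference to be resolved shrinks from a few percent for short fragments to about a tenth of a percent near 1000 nucleotides.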
26. Microfluidic Sanger sequencing
Microfluidic Sanger sequencing is a lab-on-a-chip
application for DNA sequencing, in which the
Sanger sequencing steps (thermal cycling, sample
purification, and capillary electrophoresis) are
integrated on a wafer-scale chip using nanoliter-
scale sample volumes.
This technology generates long and accurate
sequence reads, while obviating many of the
significant shortcomings of the conventional
Sanger method (e.g. high consumption of
expensive reagents, reliance on expensive
equipment, personnel-intensive manipulations,
etc.) by integrating and automating the Sanger
sequencing steps.
27. In its modern inception, high-throughput genome
sequencing involves
• fragmenting the genome into small single-stranded
pieces,
• followed by amplification of the fragments by Polymerase
Chain Reaction (PCR).
Adopting the Sanger method, each DNA fragment is
irreversibly terminated with the incorporation of a
fluorescently labeled dideoxy chain-terminating nucleotide,
thereby producing a DNA “ladder” of fragments that each
differ in length by one base and bear a base-specific
fluorescent label at the terminal base.
Amplified base ladders are then separated by Capillary
Array Electrophoresis (CAE) with automated, in situ “finish-
line” detection of the fluorescently labeled ssDNA
fragments, which provides an ordered sequence of the
fragments. These sequence reads are then computer
assembled into overlapping or contiguous sequences
(termed "contigs") which resemble the full genomic
sequence once fully assembled.
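The final assembly step can be illustrated with a toy Python sketch that repeatedly merges the two reads with the longest suffix-to-prefix overlap. This greedy approach, the function names and the minimum overlap of 3 are assumptions for illustration. It is not the method used by any specific assembler, and real software must also cope with sequencing errors, repeats and both strand orientations. The three reads are hypothetical pieces cut from the 23-base sequence shown on slide 18.

```python
def overlap(a, b, min_len):
    """Length of the longest suffix of `a` that equals a prefix of `b`."""
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(reads, min_len=3):
    """Merge reads pairwise by best overlap until no overlaps remain."""
    reads = list(reads)
    while len(reads) > 1:
        best_k, best_i, best_j = 0, None, None
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            break  # remaining reads do not overlap; keep them as separate contigs
        merged = reads[best_i] + reads[best_j][best_k:]
        reads = [r for idx, r in enumerate(reads) if idx not in (best_i, best_j)]
        reads.append(merged)
    return reads


# Hypothetical overlapping reads, given in arbitrary order.
example_reads = ["CTTATGGTACT", "AGCTGATAGG", "ATAGGCACTTA"]
contigs = greedy_assemble(example_reads)
print(contigs)
```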
28. Applications of microfluidic sequencing technologies
• Single nucleotide polymorphism (SNP) detection,
• Single-strand conformation polymorphism (SSCP)
• Heteroduplex analysis, and
• Short tandem repeat (STR) analysis.
Resolving DNA fragments according to differences in size
and/or conformation is the most critical step in studying
these features of the genome.
A single-nucleotide polymorphism (SNP) is a substitution
of a single nucleotide that occurs at a specific position in
the genome, where each variation is present at a level of
more than 1% in the population.
Heteroduplex analysis (HDA) is a method in biochemistry
that has been used since 1992 to detect point mutations in DNA.
Heteroduplexes are dsDNA molecules that have one or
more mismatched pairs; homoduplexes, on the other hand,
are dsDNA molecules that are perfectly paired.
30. Next-generation sequencing
The most recent set of DNA sequencing technologies are
collectively referred to as next-generation sequencing.
There are a variety of next-generation sequencing techniques that
use different technologies. However, most share a common set of
features that distinguish them from Sanger sequencing:
• Highly parallel: many sequencing reactions take place at the
same time
• Micro scale: reactions are tiny and many can be done at once on
a chip
• Fast: because reactions are done in parallel, results are ready
much faster
• Low-cost: sequencing a genome is cheaper than with Sanger
sequencing
• Shorter length: reads typically range from 50-700 nucleotides in
length
Conceptually, next-generation sequencing is kind of like running a
very large number of tiny Sanger sequencing reactions in parallel.
Thanks to this parallelization and small scale, large quantities of DNA can be
sequenced much more quickly and cheaply with next-generation
methods than with Sanger sequencing.
For example, in 2001, the cost of sequencing a human genome was
almost $100 million. In 2015, it was just $1245.