SlideShare a Scribd company logo
1 of 45
Sequencing a genome
Definition 
• Determining the identity and order of 
nucleotides in the genetic material – usually 
DNA, sometimes RNA, of an organism
Basic problem 
• Genomes are large (typically millions or 
billions of base pairs) 
• Current technology can only reliably ‘read’ 
a short stretch – typically hundreds of base 
pairs
Elements of a solution 
• Automation – over the past decade, the 
amount of hand-labor in the ‘reads’ has 
been steadily and dramatically reduced 
• Assembly of the reads into sequences is an 
algorithmic and computational problem
A human drama 
• There are competing methods of assembly 
• The competing – public and private – 
sequencing teams used competing assembly 
methods
Assembly: 
• Putting sequenced fragments of DNA 
into their correct chromosomal 
positions
BAC 
• Bacterial artificial chromosome: 
bacterial DNA spliced with a medium-sized 
fragment of a genome (100 to 
300 kb) to be amplified in bacteria and 
sequenced.
Contig 
• Contiguous sequence of DNA created 
by assembling overlapping sequenced 
fragments of a chromosome (whether 
natural or artificial, as in BACs)
Cosmid 
• DNA from a bacterial virus spliced 
with a small fragment of a genome (45 
kb or less) to be amplified and 
sequenced
Directed sequencing 
• Successively sequencing DNA from 
adjacent stretches of chromosome
Draft sequence 
• Sequence with lower accuracy than a 
finished sequence; some segments are 
missing or in the wrong order or 
orientation
EST 
• Expressed sequence tag: a unique 
stretch of DNA within a coding region 
of a gene; useful for identifying full-length 
genes and as a landmark for 
mapping
Exon 
• Region of a gene’s DNA that encodes 
a portion of its protein; exons are 
interspersed with noncoding introns
Genome 
• The entire chromosomal genetic 
material of an organism
Intron 
• Region of a gene’s DNA that is not 
translated into a protein
Kilobase (kb) 
• Unit of DNA equal to 1000 bases
Locus 
• Chromosomal location of a gene or 
other piece of DNA
Megabase (mb) 
• Unit of DNA equal to 1 million bases
PCR 
• Polymerase chain reaction: a technique 
for amplifying a piece of DNA quickly 
and cheaply
Physical map 
• A map of the locations of identifiable 
markers spaced along the 
chromosomes; a physical map may 
also be a set of overlapping clones
Plasmid 
• Loop of bacterial DNA that replicates 
independently of the chromosomes; 
artificial plasmids can be inserted into 
bacteria to amplify DNA for 
sequencing
Regulatory region 
• A segment of DNA that controls 
whether a gene will be expressed and 
to what degree
Repetitive DNA 
• Sequences of varying lenths that occur 
in multiple copies in the genome; it 
represents much of the genome
Restriction enzyme 
• An enzyme that cuts DNA at specific 
sequences of base pairs
RFLP 
• Restriction fragment length 
polymorphism: genetic variation in the 
length of DNA fragments produced by 
restriction enzymes; useful as markers 
on maps
Scaffold 
• A series of contigs that are in the right 
order but are not necessarily connected 
in one continuous stretch of sequence
Shotgun sequencing 
• Breaking DNA into many small pieces, 
sequencing the pieces, and assembling 
the fragments
STS 
• Sequence tagged site: a unique stretch 
of DNA whose location is known; 
serves as a landmark for mapping and 
assembly
YAC 
• Yeast artificial chromosome: yeast 
DNA spliced with a large fragment of 
a genome (up to 1 mb) to be amplified 
in yeast cells and sequenced
Readings 
• Myers, “Whole Genome DNA Sequencing,” http://www.cs. 
arizona.edu/people/gene/PAPERS/whole.IEEE.pdf 
• Venter, et al, “The Sequence of the Human Genome,” 
Science, 16 Feb 2001, Vol. 291 No 5507, 1304 (parts 1 & 2) 
• Waterston, Lander, Sulston, “On the sequencing of the 
human genome,” PNAS, March 19, 2002, Vol 99, no 6, 
3712-3716 
• Myers, et.al., “On the sequencing and assembly of the 
human genome,” 
www.pnas.org/cgi/doi/10.1073/pnas.092136699
Hierarchical sequencing 
• Create a high-level physical map, using 
ESTs and STSs 
• Shred genome into overlapping clones 
• Multiply clones in BACs 
• ‘shotgun’ each clone 
• Read each ‘shotgunned’ fragment 
• Assemble the fragments
Physical map
Whole genome sequencing (WGS) 
• Make multiple copies of the target 
• Randomly ‘shotgun’ each target, discarding 
very big and very small pieces 
• Read each fragment 
• Reassemble the ‘reads’
Hierarchical v. whole-genome
The fragment assembly problem 
• Aim: infer the target from the reads 
• Difficulties – 
– Incomplete coverage. Leaves contigs separated 
by gaps of unknown size. 
– Sequencing errors. Rate increases with length 
of read. Less than some e. 
– Unknown orientation. Don’t know whether to 
use read or its Watson-Crick complement.
Scaling and computational 
complexity 
• Increasing size of target G. 
– 1990 – 40kb (one cosmid) 
– 1995 – 1.8 mb (H. Influenza) 
– 2001 – 3,200 mb (H. sapiens)
The repeat problem 
• Repeats 
– Bigger G means more repeats 
– Complex organisms have more repetitive 
elements 
– Small repeats may appear multiple times in a 
read 
– Long repeats may be bigger than reads (no 
unique region)
Gaps 
• Read length LR hasn’t changed much 
• w = LR /G gets steadily smaller 
• Gaps ~ Re- wR (Waterman & Lander)
How deep must coverage be?
Double-barreled shotgun 
sequencing 
• Choose longer fragments (say, 2 x LR) 
• Read both ends 
• Such fragments probably span gaps 
• This gives an approximate size of the gap 
• This links contigs into scaffolds
Genomic results
HGSC v Celera results
To do or not to do? 
• “The idea is gathering momentum. I shiver 
at the thought.” – David Baltimore, 1986 
• “If there is anything worth doing twice, it’s 
the human genome.” – David Haussler, 
2000
Public or private? 
• “This information is so important that it 
cannot be proprietary.” – C Thomas 
Caskey, 1987 
• “If a company behaves in what scientists 
believe is a socially responsible manner, 
they can’t make a profit.” – Robert Cook- 
Deegan, 1987
HW for Feb 17 
• Comment on these assertions (500-1000 
words): 
– WLS – “Our analysis indicates that the Celera 
paper provides neither a meaningful test of the 
WGS approach nor an independent sequence of 
the human genome.” 
– Venter – “This conclusion is based on incorrect 
assumptions and flawed reasoning.”

More Related Content

What's hot

Phylogenetic tree construction
Phylogenetic tree constructionPhylogenetic tree construction
Phylogenetic tree constructionUddalok Jana
 
NEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCINGNEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCINGAayushi Pal
 
Next Generation Sequencing of DNA
Next Generation Sequencing of DNANext Generation Sequencing of DNA
Next Generation Sequencing of DNAmaryamshah13
 
Whole genome sequence
Whole genome sequenceWhole genome sequence
Whole genome sequencesababibi
 
THIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptxTHIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptxRITHIKA R S
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingPALANIANANTH.S
 
2 whole genome sequencing and analysis
2 whole genome sequencing and analysis2 whole genome sequencing and analysis
2 whole genome sequencing and analysissaberhussain9
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingShaheen Alam
 
Next Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewNext Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewDominic Suciu
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingSwathi Prabakar
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingTapish Goel
 
genome sequencing, types by kk sahu sir
genome sequencing, types by kk sahu sirgenome sequencing, types by kk sahu sir
genome sequencing, types by kk sahu sirKAUSHAL SAHU
 
Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)LOGESWARAN KA
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingDayananda Salam
 

What's hot (20)

Phylogenetic tree construction
Phylogenetic tree constructionPhylogenetic tree construction
Phylogenetic tree construction
 
Sanger sequencing method of DNA
Sanger sequencing method of DNA Sanger sequencing method of DNA
Sanger sequencing method of DNA
 
NEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCINGNEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCING
 
Next Generation Sequencing of DNA
Next Generation Sequencing of DNANext Generation Sequencing of DNA
Next Generation Sequencing of DNA
 
Whole genome sequence
Whole genome sequenceWhole genome sequence
Whole genome sequence
 
THIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptxTHIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptx
 
Sanger sequencing
Sanger sequencingSanger sequencing
Sanger sequencing
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
2 whole genome sequencing and analysis
2 whole genome sequencing and analysis2 whole genome sequencing and analysis
2 whole genome sequencing and analysis
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Next Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewNext Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology Overview
 
dna microarray
dna microarraydna microarray
dna microarray
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
 
genome sequencing, types by kk sahu sir
genome sequencing, types by kk sahu sirgenome sequencing, types by kk sahu sir
genome sequencing, types by kk sahu sir
 
Prosite
PrositeProsite
Prosite
 
Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 

Viewers also liked

Genome sequencing
Genome sequencingGenome sequencing
Genome sequencingShital Pal
 
DNA SEQUENCING METHOD
DNA SEQUENCING METHODDNA SEQUENCING METHOD
DNA SEQUENCING METHODMusa Khan
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisdrelamuruganvet
 
The human genome project
The human genome projectThe human genome project
The human genome project14pascba
 
Dna sequencing powerpoint
Dna sequencing powerpointDna sequencing powerpoint
Dna sequencing powerpoint14cummke
 
Whole genome duplication and diversification of plant genomes
Whole genome duplication and diversification of plant genomesWhole genome duplication and diversification of plant genomes
Whole genome duplication and diversification of plant genomesSimonRB
 
YSP Week 3 HGP
YSP Week 3 HGPYSP Week 3 HGP
YSP Week 3 HGPLisa Feng
 
Evolutionary genomics
Evolutionary genomicsEvolutionary genomics
Evolutionary genomicsboussau
 
Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...
Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...
Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...José André
 
Biology 16 1 genes and variation[1]
Biology 16 1 genes and variation[1]Biology 16 1 genes and variation[1]
Biology 16 1 genes and variation[1]Tamara
 
Genomic selection with weighted GBLUP and APY single step
Genomic selection with weighted GBLUP and APY single stepGenomic selection with weighted GBLUP and APY single step
Genomic selection with weighted GBLUP and APY single stepILRI
 
Genome Sequencing Project
Genome Sequencing ProjectGenome Sequencing Project
Genome Sequencing Projectguestd53a1
 
Breeding and genomics in ILRI biosciences research
Breeding and genomics in ILRI biosciences researchBreeding and genomics in ILRI biosciences research
Breeding and genomics in ILRI biosciences researchILRI
 
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticezahid6
 
Genomic Selection in Dairy Cattle
Genomic Selection in Dairy CattleGenomic Selection in Dairy Cattle
Genomic Selection in Dairy CattleJohn B. Cole, Ph.D.
 

Viewers also liked (20)

Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
 
Whole Genome Analysis
Whole Genome AnalysisWhole Genome Analysis
Whole Genome Analysis
 
DNA SEQUENCING METHOD
DNA SEQUENCING METHODDNA SEQUENCING METHOD
DNA SEQUENCING METHOD
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysis
 
Genome analysis
Genome analysisGenome analysis
Genome analysis
 
Genome Mapping
Genome MappingGenome Mapping
Genome Mapping
 
The human genome project
The human genome projectThe human genome project
The human genome project
 
Dna sequencing powerpoint
Dna sequencing powerpointDna sequencing powerpoint
Dna sequencing powerpoint
 
Whole genome duplication and diversification of plant genomes
Whole genome duplication and diversification of plant genomesWhole genome duplication and diversification of plant genomes
Whole genome duplication and diversification of plant genomes
 
YSP Week 3 HGP
YSP Week 3 HGPYSP Week 3 HGP
YSP Week 3 HGP
 
Evolutionary genomics
Evolutionary genomicsEvolutionary genomics
Evolutionary genomics
 
Mapping by pcr
Mapping by pcrMapping by pcr
Mapping by pcr
 
Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...
Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...
Monteiro 2015 Conservação ex situ de espécies ameaçadas da flora brasileira: ...
 
Biology 16 1 genes and variation[1]
Biology 16 1 genes and variation[1]Biology 16 1 genes and variation[1]
Biology 16 1 genes and variation[1]
 
Genomic selection with weighted GBLUP and APY single step
Genomic selection with weighted GBLUP and APY single stepGenomic selection with weighted GBLUP and APY single step
Genomic selection with weighted GBLUP and APY single step
 
Genome Sequencing Project
Genome Sequencing ProjectGenome Sequencing Project
Genome Sequencing Project
 
Breeding and genomics in ILRI biosciences research
Breeding and genomics in ILRI biosciences researchBreeding and genomics in ILRI biosciences research
Breeding and genomics in ILRI biosciences research
 
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informatice
 
Genomic Selection in Dairy Cattle
Genomic Selection in Dairy CattleGenomic Selection in Dairy Cattle
Genomic Selection in Dairy Cattle
 

Similar to sequencing of genome

Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...Promila Sheoran
 
Structural genomics
Structural genomicsStructural genomics
Structural genomicsAshfaq Ahmad
 
Recombinant dna technology
Recombinant dna technologyRecombinant dna technology
Recombinant dna technologyRahul k
 
Gene mapping methods
Gene mapping methodsGene mapping methods
Gene mapping methodsMEENAKSHI DAS
 
High Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can KnowHigh Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can KnowBrian Krueger
 
introduction to Genomics
introduction to Genomics introduction to Genomics
introduction to Genomics IqraSami3
 
Recombinant DNA Technology
Recombinant DNA TechnologyRecombinant DNA Technology
Recombinant DNA TechnologyPrasenjit Mitra
 
Recombinant dna technology
Recombinant dna technologyRecombinant dna technology
Recombinant dna technologynasira jaffry
 
Genomics-Mapping and sequencing.pdf
Genomics-Mapping and sequencing.pdfGenomics-Mapping and sequencing.pdf
Genomics-Mapping and sequencing.pdfshinycthomas
 
Recombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptxRecombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptxMargieBAlmoza
 
Recombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptxRecombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptxmargiebartolome
 
Recombinent DNA
Recombinent DNA Recombinent DNA
Recombinent DNA KanwalNisa1
 
20170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_10120170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_101Ino de Bruijn
 

Similar to sequencing of genome (20)

Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Sts
StsSts
Sts
 
Recombinant dna technology
Recombinant dna technologyRecombinant dna technology
Recombinant dna technology
 
Gene mapping methods
Gene mapping methodsGene mapping methods
Gene mapping methods
 
Dna mapping
Dna mappingDna mapping
Dna mapping
 
High Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can KnowHigh Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can Know
 
introduction to Genomics
introduction to Genomics introduction to Genomics
introduction to Genomics
 
Dna fingerprinting
Dna fingerprintingDna fingerprinting
Dna fingerprinting
 
Genome editing
Genome editingGenome editing
Genome editing
 
Recombinant DNA Technology
Recombinant DNA TechnologyRecombinant DNA Technology
Recombinant DNA Technology
 
Recombinant dna technology
Recombinant dna technologyRecombinant dna technology
Recombinant dna technology
 
Genomics-Mapping and sequencing.pdf
Genomics-Mapping and sequencing.pdfGenomics-Mapping and sequencing.pdf
Genomics-Mapping and sequencing.pdf
 
Recombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptxRecombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptx
 
Recombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptxRecombinant DNA-PPT.pptx
Recombinant DNA-PPT.pptx
 
Recombinent DNA
Recombinent DNA Recombinent DNA
Recombinent DNA
 
Genetic engineering
Genetic engineeringGenetic engineering
Genetic engineering
 
20170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_10120170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_101
 
Vntr marker
Vntr markerVntr marker
Vntr marker
 
biotechnology
biotechnologybiotechnology
biotechnology
 

More from Naveen Gupta

Dynamic memory allocation
Dynamic memory allocationDynamic memory allocation
Dynamic memory allocationNaveen Gupta
 
Lecture 1 family planning
Lecture 1 family planningLecture 1 family planning
Lecture 1 family planningNaveen Gupta
 
Human reproduction
Human reproductionHuman reproduction
Human reproductionNaveen Gupta
 
Genetic engineering
Genetic engineeringGenetic engineering
Genetic engineeringNaveen Gupta
 
Cell reproduction mitosis&meosis
Cell reproduction mitosis&meosisCell reproduction mitosis&meosis
Cell reproduction mitosis&meosisNaveen Gupta
 
Asexual and sexual_reproduction
Asexual and sexual_reproductionAsexual and sexual_reproduction
Asexual and sexual_reproductionNaveen Gupta
 
Biotechnology and genetic engineering
Biotechnology and genetic engineeringBiotechnology and genetic engineering
Biotechnology and genetic engineeringNaveen Gupta
 

More from Naveen Gupta (13)

Dynamic memory allocation
Dynamic memory allocationDynamic memory allocation
Dynamic memory allocation
 
Reproduction
ReproductionReproduction
Reproduction
 
stem cells
stem cellsstem cells
stem cells
 
Mitosis2
Mitosis2Mitosis2
Mitosis2
 
Mestrual cycle
Mestrual cycleMestrual cycle
Mestrual cycle
 
Menstrual cycle 2
Menstrual cycle 2Menstrual cycle 2
Menstrual cycle 2
 
Lecture 1 family planning
Lecture 1 family planningLecture 1 family planning
Lecture 1 family planning
 
Human reproduction
Human reproductionHuman reproduction
Human reproduction
 
Genetic engineering
Genetic engineeringGenetic engineering
Genetic engineering
 
Cell reproduction mitosis&meosis
Cell reproduction mitosis&meosisCell reproduction mitosis&meosis
Cell reproduction mitosis&meosis
 
Cell respiration
Cell respirationCell respiration
Cell respiration
 
Asexual and sexual_reproduction
Asexual and sexual_reproductionAsexual and sexual_reproduction
Asexual and sexual_reproduction
 
Biotechnology and genetic engineering
Biotechnology and genetic engineeringBiotechnology and genetic engineering
Biotechnology and genetic engineering
 

Recently uploaded

Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 

Recently uploaded (20)

Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 

sequencing of genome

  • 2. Definition • Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism
  • 3. Basic problem • Genomes are large (typically millions or billions of base pairs) • Current technology can only reliably ‘read’ a short stretch – typically hundreds of base pairs
  • 4. Elements of a solution • Automation – over the past decade, the amount of hand-labor in the ‘reads’ has been steadily and dramatically reduced • Assembly of the reads into sequences is an algorithmic and computational problem
  • 5. A human drama • There are competing methods of assembly • The competing – public and private – sequencing teams used competing assembly methods
  • 6. Assembly: • Putting sequenced fragments of DNA into their correct chromosomal positions
  • 7. BAC • Bacterial artificial chromosome: bacterial DNA spliced with a medium-sized fragment of a genome (100 to 300 kb) to be amplified in bacteria and sequenced.
  • 8. Contig • Contiguous sequence of DNA created by assembling overlapping sequenced fragments of a chromosome (whether natural or artificial, as in BACs)
  • 9. Cosmid • DNA from a bacterial virus spliced with a small fragment of a genome (45 kb or less) to be amplified and sequenced
  • 10. Directed sequencing • Successively sequencing DNA from adjacent stretches of chromosome
  • 11. Draft sequence • Sequence with lower accuracy than a finished sequence; some segments are missing or in the wrong order or orientation
  • 12. EST • Expressed sequence tag: a unique stretch of DNA within a coding region of a gene; useful for identifying full-length genes and as a landmark for mapping
  • 13. Exon • Region of a gene’s DNA that encodes a portion of its protein; exons are interspersed with noncoding introns
  • 14. Genome • The entire chromosomal genetic material of an organism
  • 15. Intron • Region of a gene’s DNA that is not translated into a protein
  • 16. Kilobase (kb) • Unit of DNA equal to 1000 bases
  • 17. Locus • Chromosomal location of a gene or other piece of DNA
  • 18. Megabase (mb) • Unit of DNA equal to 1 million bases
  • 19. PCR • Polymerase chain reaction: a technique for amplifying a piece of DNA quickly and cheaply
  • 20. Physical map • A map of the locations of identifiable markers spaced along the chromosomes; a physical map may also be a set of overlapping clones
  • 21. Plasmid • Loop of bacterial DNA that replicates independently of the chromosomes; artificial plasmids can be inserted into bacteria to amplify DNA for sequencing
  • 22. Regulatory region • A segment of DNA that controls whether a gene will be expressed and to what degree
  • 23. Repetitive DNA • Sequences of varying lenths that occur in multiple copies in the genome; it represents much of the genome
  • 24. Restriction enzyme • An enzyme that cuts DNA at specific sequences of base pairs
  • 25. RFLP • Restriction fragment length polymorphism: genetic variation in the length of DNA fragments produced by restriction enzymes; useful as markers on maps
  • 26. Scaffold • A series of contigs that are in the right order but are not necessarily connected in one continuous stretch of sequence
  • 27. Shotgun sequencing • Breaking DNA into many small pieces, sequencing the pieces, and assembling the fragments
  • 28. STS • Sequence tagged site: a unique stretch of DNA whose location is known; serves as a landmark for mapping and assembly
  • 29. YAC • Yeast artificial chromosome: yeast DNA spliced with a large fragment of a genome (up to 1 mb) to be amplified in yeast cells and sequenced
  • 30. Readings • Myers, “Whole Genome DNA Sequencing,” http://www.cs. arizona.edu/people/gene/PAPERS/whole.IEEE.pdf • Venter, et al, “The Sequence of the Human Genome,” Science, 16 Feb 2001, Vol. 291 No 5507, 1304 (parts 1 & 2) • Waterston, Lander, Sulston, “On the sequencing of the human genome,” PNAS, March 19, 2002, Vol 99, no 6, 3712-3716 • Myers, et.al., “On the sequencing and assembly of the human genome,” www.pnas.org/cgi/doi/10.1073/pnas.092136699
  • 31. Hierarchical sequencing • Create a high-level physical map, using ESTs and STSs • Shred genome into overlapping clones • Multiply clones in BACs • ‘shotgun’ each clone • Read each ‘shotgunned’ fragment • Assemble the fragments
  • 33. Whole genome sequencing (WGS) • Make multiple copies of the target • Randomly ‘shotgun’ each target, discarding very big and very small pieces • Read each fragment • Reassemble the ‘reads’
  • 35. The fragment assembly problem • Aim: infer the target from the reads • Difficulties – – Incomplete coverage. Leaves contigs separated by gaps of unknown size. – Sequencing errors. Rate increases with length of read. Less than some e. – Unknown orientation. Don’t know whether to use read or its Watson-Crick complement.
  • 36. Scaling and computational complexity • Increasing size of target G. – 1990 – 40kb (one cosmid) – 1995 – 1.8 mb (H. Influenza) – 2001 – 3,200 mb (H. sapiens)
  • 37. The repeat problem • Repeats – Bigger G means more repeats – Complex organisms have more repetitive elements – Small repeats may appear multiple times in a read – Long repeats may be bigger than reads (no unique region)
  • 38. Gaps • Read length LR hasn’t changed much • w = LR /G gets steadily smaller • Gaps ~ Re- wR (Waterman & Lander)
  • 39. How deep must coverage be?
  • 40. Double-barreled shotgun sequencing • Choose longer fragments (say, 2 x LR) • Read both ends • Such fragments probably span gaps • This gives an approximate size of the gap • This links contigs into scaffolds
  • 42. HGSC v Celera results
  • 43. To do or not to do? • “The idea is gathering momentum. I shiver at the thought.” – David Baltimore, 1986 • “If there is anything worth doing twice, it’s the human genome.” – David Haussler, 2000
  • 44. Public or private? • “This information is so important that it cannot be proprietary.” – C Thomas Caskey, 1987 • “If a company behaves in what scientists believe is a socially responsible manner, they can’t make a profit.” – Robert Cook- Deegan, 1987
  • 45. HW for Feb 17 • Comment on these assertions (500-1000 words): – WLS – “Our analysis indicates that the Celera paper provides neither a meaningful test of the WGS approach nor an independent sequence of the human genome.” – Venter – “This conclusion is based on incorrect assumptions and flawed reasoning.”