Human Genom Project, NCBI, Blast

1956 The first physical map of the human genome is determined. Using light microscopy of stained tissue, JH Tjio and A Levan (Hereditas 42,
1-6) reveal that our cells normally contain 46 chromosomes and that there are 24 different types of human chromosome.
1977 Fred Sanger and colleagues publish the dideoxy DNA sequencing method (Proc. Nat! Acod. Sci. USA 74,5463-5467). With some further
re finement s (fluorescence labeling and automation) it will be the method used to sequence the human genome and many other
genomes.
1981 Sanger and colleagues publish the complete sequence of human mitochondrial DNA (Anderson S et al. Nature 290, 457-465).
1988 The US National Institutes of Health (NIH) sets up a dedicated Office of Human Genome Research (later renamed the National Center
for
Human Genome Research).
1990 The Human Genome Project (HGP) is launched officially after implementation of a $3 billion 15-year project in the USA.
1992 The first comprehensive human genetic linkage map, based on microsatellite markers (Weissenbach J et at Nature 359, 794-801).
1995 The first detailed physical map of the human genome is published, based on sequence-tagged sites (Hudson TJ et al. Science 270,
1999 The first essentially complete DNA sequence for a human chromosome is reported, for chromosome 22 (Dunham I et al. Nature 402,
489-495).
2001 Draft sequences of the human nuclear genome, comprising roughly 90% of the total euchromatic component, are published by the
International Human Genome Sequencing Consortium (IHGSC) (Nature 409,860-921) and by Celera (Science 291, 1304-1351).
2001/2 Publication of a draft sequence ofthe mouse nuclear genome (Waterson B et al. Nature 420, 520-562). Human-mouse comparisons
help
human gene identification/characterization.
2003/4 The essentially completed sequence (about 99%) of the euchromatic componel1t of the human genome is reported, comprising 2.85
Gb of
DNA, or roughly gook> of the total of 3.2 Gb (analyses published in 2004 by the IHGSC, Nature 431, 93 1-945).
2004 Over 21 ,000 human genes are validated by full-length cDNA clones (Imanishi T et al. PLoS BioI. 2, e1 62).
2005-7 The International HapMap Consortium reports increasingly detailed single nucleotide polymorphism (SNP) maps for the human
genome
in 2005 (Nature 437, 1299-1320) and 2007 (Nature 449,851-861). The latter map has more than 3.1 million SNPs, roughly one SNP per
kilobase.
2007-8 The age of personal genome sequencing begins with delivery of euchromatic gel10me sequences for James Watson and Craig

• An important rationale of the HGP-and a major motivation for
privately funded genome sequencing. ( Started 1990 )
• The resulting gene map was published in (1998) and seemed to
identify the positions of 30,000 human genes.
• ((it was clear that human genes were not uniformly distributed
along or between chromosomes. Some chromosomes were rich
in genes and others were gene-poor))
Essentially devoid of genes and extraordinarily rich in repetitive DNA. which
would make mapping extremely difficult.

To collect and managing HGP 7 University and Institute were joint to this
process
Sanger Institute for chromosome 1,
Washington University for
chromosome 2

Celera company in 1999 announced that it intended to produce a draft human
genome sequence in 2 years and that it would do this by whole genome shotgun
sequencing.
International Human Genome Sequencing Consortium
IHGSC continued alone
By 2003, the lHGSC had produced an essentially complete sequence of the
euchromatic component of the human genome.
By 2004,. data became reedited and 2.85 Gb of sequence reported and The sequence
was interrupted by 341 gaps that remained. The gaps are of two types

• The final sequence was a composite one, representing various donor cells of
different genotypes. 2004
• 21,000 of human genes were confirmed but there are still considerable
uncertainties about exactly how many genes we have.
• The International 1000 Genomes Project that was launched in early 2008. The
aim is to obtain a detailed catalog of human genetic variation by sequencing the
genomes of at least 1000 individuals representing a variety of different ethnic
groups.

Genome projects have also been conducted for a variety of model organisms
 Escherichia coli
 Saccharomyces Cerevisiae
 Caenorhabditis Elegance
 Drosophila melanogaster

Mouse Genome Sequencing Consortium derived the genome
sequence of the widely used C57BLl6J mouse strain. The latest C57BLl6J
sequence update represents more than 90% of the total mouse genome

The first cellular genome was sequenced in 1995 (from the bacterium
Haemophilus influenza)
By late 2009, close to 1100 complete genome sequences had been
published, and an additional 4500 genome projects were ongoing
Prokaryotes Eukaryotes
varied motivations prompted genome
sequencing. They include general research
models, models of disease and
development,
models for evolutionary and comparative
genomic studies, farm animals and
crops, and pathogenic protozoa and
nematodes.
to understand general and
evolutionary aspects of
prokaryotes. The principal
motivation for genome sequencing
of many bacteria has been to
understand their involvement in
Pathogenesis or applications in
Biotechnology.

اطلاعات موجود در پایگاه های اطلاعاتی
پایگاه
اطلاعات ژنوم
(Genome databases)
اطلاعات مولکولی
(molecular databases)
اطلاعات منابعی
(literature databases)

اطلاعات منابعی
اطلاعات مقالات چاپ شده :MEDLINE
دریافت خلاصه مقالات :PubMed
مقالات رایگان بیولوژی و پزشکی :PubMed Central
اطلاعات های ژن های انسانی و ناهنجاری ژنتیکی :OMIM
(online Mendelian inheritance in man)
اطلاعات ژن ها و ناهنجاری ژنتیکی در حیوانات :OMIA
(online Mendelian inheritance in animals)
مجموعه ایی از کتابهای بیولوژی و پزشکی :Books
مجلات ISSN دسترسی به عنوان،مخفف و :Journals
اطلاعات واژگان و معادل تخصصی واژه های علمی :MeSH

(MOLECULAR DATABASES) اطلاعات مولکولی
توالی های نوکلئوتیدی 
توالی های پروتئینی 
اطلاعات ساختاری 
اطلاعات رده بندی 
اطلاعات ژن ها 
اطلاعات تظاهر ژن ها 

آورده شود GenBank در مواردی که یک مولکول با چند توالی برای یک ارگانیسم در :RefSeq
معرفی نماید ( رکورد RefSeq تلاش می کند تا بهترین توالی را انتخاب کند و به عنوان رکورد NCBI
تا حد امکان به دور از جهش، اشتباهات تعیین توال، تغییرات ناشی از کلونینگ می باشد) RefSeq
کوتاهی هستند که معمولا بین 300 تا 500 باز دارند و از تعیین ترادف یک یا DNA توالی های :dbEST
هر دو انتهای بیان شونده ژن بوجود می آیند.
ساخته می شود سپس کلون می شود CDNA ابتدا mRNA از روی
5 بدست می آید ’EST 3 یا ’EST توالی یابی شود cDNA بر حسب اینکه کدام انتهای
ژنومی است در حالیکه GSS در این است که منشاء EST با GSS است تفاوت EST شبیه :GSS
است mRNA مولکول EST منشاء
بدست می آیند. BAC کوتاه و تصادفی و معمولا از انتهای کلون های کاسمید و GSS توالی های

(GENOME DATABASES) پایگاه اطلاعات ژنوم
اطلاعات این پایگاه بر اساس شش گروه موجودات طراحی شده:
1. آرکئاها
2. باکتری ها
3. یوکاریوت ها
4. ویروس ها
5. ویروئید ها
6. پلاسمید ها

BASIC LOCAL ALIGNMENT SEARCH
TOOLS
(BLAST)
برنامه ایی است برای پیدا کردن همردیفی موضعی بدون فاصله و بالاترین میزان تشابه
سرعت بالا ودقت زیاد درشناسایی تشابه بین توالی های پرتئین و اسیدنوکلئیک
برای شناسایی تشابه بین توالی ها
توالی هایی که از نظر یا بطور دقیق باهم شباهت دارند )همسانی توالی ها(. : Identity
توالی هایی که در برخی نقاط با هم همسانی داشته باشند)شباهت توالی ها(. : Similarity

مقایسه یک توالی پروتئینی با یک توالی پروتئینی :BLASTP
در پایگاه اطلاعات
DNA بایک توالی DNA مقایسه یک توالی :BLASTN
موجود در پایگاه اطلاعات
مورد تفاضا بایک توالی DNA مقایسه یک توالی :BLASTX
پروتئینی موجود در پایگاه اطلاعات )*(
مقایسه یک توالی پروتئینی مورد تقاضا بایک :TBLASTN
توالی پروتئینی ترجمه شده موجود در پایگاه اطلاعات )**(
مقایسه یک توالی پروتئینی مورد تقاضا بایک :TBLASTX
توالی پروتئینی ترجمه شده موجود در پایگاه اطلاعات )***(

در شش چارچوب خواندنی ترجمه DNA توالی BLASTX * در
)سه چارچوب برای رشته بالا و سه چارچوب برای رشته پایین(
مقایسه میشود. Pro سپس با یک توالی
توالی ها در پایگاه اطلاعات در شش چارچوب TBLASTN ** در
قرائت ترجمه و مقایسه ها صورت میگیرد.
هم توالی مورد تقاضا و هم توالی های موجود TBLASTX *** در
در در پایگاه اطلاعات در شش چارچوب ترجمه میشوند و بعداز آن
مقایسه ها در سطح اسیدآمینه صورت میگیرد.

BLAST Output: Alignments
Identical match
positive score
(conservative)
Negative or zero
gap

REFERENCES
• http://www.ncbi.nlm.nih.gov/guide/sequence-analysis/#howtos_
• http://blast.ncbi.nlm.nih.gov/Blast.cgi
• http://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastDocs
• http://www.bio.davidson.edu/genomics/2008/Simpson/Tutorial.html

Human Genom Project, NCBI, Blast

Human Genom Project, NCBI, Blast

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (7)

Similar to Human Genom Project, NCBI, Blast

Similar to Human Genom Project, NCBI, Blast (10)

Human Genom Project, NCBI, Blast