SlideShare a Scribd company logo
Biological Databases
• Based on their contents, biological databases can be roughly divided into
three categories: primary databases, secondary databases, and specialized
databases.
• Primary databases contain original biological data.
• They are archives of raw sequence or structural data submitted by the
scientific community.
• GenBank EMBL and DDBJ are examples of primary databases.
• Most of the data in the databases are contributed directly by authors with a
minimal level of annotation.
• These three public databases closely collaborate and exchange new data
daily
• Although the three databases all contain the same sets of raw data, each of
the individual databases has a slightly different kind of format to represent
the data.
• All are freely available on the Internet.
• Secondary databases contain computationally processed or manually curated
information, based on original information from primary databases.
• Translated protein sequence databases containing functional annotation belong to
this category. Examples are SWISS-Prot and Protein Information Resources (PIR)
• Contain computationally processed sequence information derived from the primary
databases.
• The amount of computational processing work varies greatly among the secondary
databases; some are simple archives of translated sequence data from identified
open reading frames in DNA, whereas others provide additional annotation and
information related to higher levels of information regarding structure and
functions.
• SWISS-PROT, provides detailed sequence annotation that includes structure,
function, and protein family assignment.
• The annotation of each entry is carefully curated by human experts and thus is of
good quality.
• Much of this information is obtained from scientific literature and entered by
database curators.
• Specialized databases normally serve a specific research
community or focus on a particular organism.
• For example, Flybase, HIV sequence database, and Ribosomal
Database Project are databases that specialize in a particular
organism or a particular type of data.
• Many genome databases that are taxonomic specific also fall
within this category.
Pitfalls in BD
• There are many errors in sequence databases. Most errors in
nucleotide sequences are caused by sequencing errors.
• Some of these errors cause frameshifts that make whole gene
identification difficult or protein translation impossible.
• Sometimes, gene sequences are contaminated with sequences
from cloning vectors.
• Annotations of genes can also occasionally be false or incomplete.
• All these types of errors can be passed on to other databases,
causing propagation of errors.
• There are also high levels of redundancy in the primary sequence
databases.
• Redundancy (duplication) is another major problem affecting
primary databases.
• Often, the same gene sequence is found under different names
resulting in multiple entries and confusion about the data.
Biological Databases
• Open Access
Databases
• NCBI –
www.ncbi.nlm.nih.gov
• EBI – www.ebi.ac.uk
• DDBJ –
www.ddbj.nig.ac.jp
• ExPASy –
www.expasy.ch
Database Searching
• Sequence Based Searching
– Query – Sequence;
– Target DB – Seq/Str
• Text Based Searching
– Query – Text;
– Target DB – Text/Seq/Str
Nucleotide Sequence Retrieval Sequence Based
Nucleotide Sequence Retrieval
Text Based
Protein Sequence Retrieval
Sequence Based
Protein Sequence Retrieval
Text Based
Biological databases
Biological databases
Biological databases
Biological databases
Biological databases
Biological databases
Biological databases

More Related Content

What's hot

Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
sagrika chugh
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
Hafiz Muhammad Zeeshan Raza
 
UniProt
UniProtUniProt
UniProt
AmnaA7
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
KAUSHAL SAHU
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
VinaKhan1
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
Asad Afridi
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformaticsnadeem akhter
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
Hafiz Muhammad Zeeshan Raza
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
SATHIYA NARAYANAN
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Subhranil Bhattacharjee
 
Biological databases
Biological databasesBiological databases
Biological databases
Tamanna Syeda
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
The Oxford College Engineering
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
Yogesh Joshi
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformaticsAbhishek Vatsa
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
Fasta
FastaFasta
EMBL
EMBLEMBL
Protein database
Protein databaseProtein database
Protein database
Rajpal Choudhary
 

What's hot (20)

NCBI
NCBINCBI
NCBI
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
UniProt
UniProtUniProt
UniProt
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Fasta
FastaFasta
Fasta
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
EMBL
EMBLEMBL
EMBL
 
Protein database
Protein databaseProtein database
Protein database
 

Similar to Biological databases

biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
science lover
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
ShailendraSinghKhich
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptx
Vandana Yadav03
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics
AakifahAmreen
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
PrashantSharma807
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
SanthiyaAK
 
Biological data base
Biological data baseBiological data base
Biological data base
kishoreGupta17
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
Hafiz Muhammad Zeeshan Raza
 
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdfBIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
PravanjanDash
 
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxCOMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
PravanjanDash
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
Shikha Thakur
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
PUNJAB AGRICULTURAL UNIVERSITY, LUDHIANA, 141004, PUNJAB (INDIA)
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
Esakkiammal S
 
Databases_L2.pptx
Databases_L2.pptxDatabases_L2.pptx
Databases_L2.pptx
kigaruantony
 
Composite and Specialized databases
Composite and Specialized databasesComposite and Specialized databases
Sequence submission tools ............pptx
Sequence submission tools ............pptxSequence submission tools ............pptx
Sequence submission tools ............pptx
Cherry
 
Bioinformatics__Lecture_1.ppt
Bioinformatics__Lecture_1.pptBioinformatics__Lecture_1.ppt
Bioinformatics__Lecture_1.ppt
sirwansleman
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
BioinformaticsCentre
 
Data Base in Bioinformatics.ppt
Data Base in Bioinformatics.pptData Base in Bioinformatics.ppt
Data Base in Bioinformatics.ppt
Bangaluru
 

Similar to Biological databases (20)

biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptx
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 
Biological data base
Biological data baseBiological data base
Biological data base
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdfBIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
 
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxCOMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
 
Databases_L2.pptx
Databases_L2.pptxDatabases_L2.pptx
Databases_L2.pptx
 
Composite and Specialized databases
Composite and Specialized databasesComposite and Specialized databases
Composite and Specialized databases
 
Sequence submission tools ............pptx
Sequence submission tools ............pptxSequence submission tools ............pptx
Sequence submission tools ............pptx
 
Bioinformatics__Lecture_1.ppt
Bioinformatics__Lecture_1.pptBioinformatics__Lecture_1.ppt
Bioinformatics__Lecture_1.ppt
 
Structural database and their classification by abdul qahar
Structural database and their classification by abdul qaharStructural database and their classification by abdul qahar
Structural database and their classification by abdul qahar
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
 
Data Base in Bioinformatics.ppt
Data Base in Bioinformatics.pptData Base in Bioinformatics.ppt
Data Base in Bioinformatics.ppt
 

More from Afra Fathima

Time management
Time managementTime management
Time management
Afra Fathima
 
Radio active labeling
Radio active labelingRadio active labeling
Radio active labeling
Afra Fathima
 
Micriobiology & Microbes
Micriobiology & MicrobesMicriobiology & Microbes
Micriobiology & Microbes
Afra Fathima
 
Identification of proteins by 2D gel
Identification of proteins by 2D gelIdentification of proteins by 2D gel
Identification of proteins by 2D gel
Afra Fathima
 
SDS PAGE
SDS PAGESDS PAGE
SDS PAGE
Afra Fathima
 
HIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHYHIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHY
Afra Fathima
 
Dialysis
DialysisDialysis
Dialysis
Afra Fathima
 
Lowry's method
Lowry's  methodLowry's  method
Lowry's method
Afra Fathima
 
Bradford's method
Bradford's methodBradford's method
Bradford's method
Afra Fathima
 
Ammonium sulphate precipitation
Ammonium sulphate precipitationAmmonium sulphate precipitation
Ammonium sulphate precipitation
Afra Fathima
 
Collection of blood, serum & plasma
Collection of blood, serum & plasmaCollection of blood, serum & plasma
Collection of blood, serum & plasma
Afra Fathima
 
Sub cellular fractionation
Sub cellular fractionationSub cellular fractionation
Sub cellular fractionation
Afra Fathima
 
Cells of the immune system
Cells of the immune systemCells of the immune system
Cells of the immune system
Afra Fathima
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
Afra Fathima
 
Dinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) methodDinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) method
Afra Fathima
 
Cells of the immune system
Cells of the immune system Cells of the immune system
Cells of the immune system
Afra Fathima
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
Afra Fathima
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
Afra Fathima
 
Cells of the immune system
Cells  of the immune systemCells  of the immune system
Cells of the immune system
Afra Fathima
 
Sulphadiazine
SulphadiazineSulphadiazine
Sulphadiazine
Afra Fathima
 

More from Afra Fathima (20)

Time management
Time managementTime management
Time management
 
Radio active labeling
Radio active labelingRadio active labeling
Radio active labeling
 
Micriobiology & Microbes
Micriobiology & MicrobesMicriobiology & Microbes
Micriobiology & Microbes
 
Identification of proteins by 2D gel
Identification of proteins by 2D gelIdentification of proteins by 2D gel
Identification of proteins by 2D gel
 
SDS PAGE
SDS PAGESDS PAGE
SDS PAGE
 
HIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHYHIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHY
 
Dialysis
DialysisDialysis
Dialysis
 
Lowry's method
Lowry's  methodLowry's  method
Lowry's method
 
Bradford's method
Bradford's methodBradford's method
Bradford's method
 
Ammonium sulphate precipitation
Ammonium sulphate precipitationAmmonium sulphate precipitation
Ammonium sulphate precipitation
 
Collection of blood, serum & plasma
Collection of blood, serum & plasmaCollection of blood, serum & plasma
Collection of blood, serum & plasma
 
Sub cellular fractionation
Sub cellular fractionationSub cellular fractionation
Sub cellular fractionation
 
Cells of the immune system
Cells of the immune systemCells of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
 
Dinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) methodDinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) method
 
Cells of the immune system
Cells of the immune system Cells of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune systemCells  of the immune system
Cells of the immune system
 
Sulphadiazine
SulphadiazineSulphadiazine
Sulphadiazine
 

Recently uploaded

Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
Steve Thomason
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
rosedainty
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
PedroFerreira53928
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
Nguyen Thanh Tu Collection
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
PedroFerreira53928
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 

Recently uploaded (20)

Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 

Biological databases

  • 1. Biological Databases • Based on their contents, biological databases can be roughly divided into three categories: primary databases, secondary databases, and specialized databases. • Primary databases contain original biological data. • They are archives of raw sequence or structural data submitted by the scientific community. • GenBank EMBL and DDBJ are examples of primary databases. • Most of the data in the databases are contributed directly by authors with a minimal level of annotation. • These three public databases closely collaborate and exchange new data daily • Although the three databases all contain the same sets of raw data, each of the individual databases has a slightly different kind of format to represent the data. • All are freely available on the Internet.
  • 2. • Secondary databases contain computationally processed or manually curated information, based on original information from primary databases. • Translated protein sequence databases containing functional annotation belong to this category. Examples are SWISS-Prot and Protein Information Resources (PIR) • Contain computationally processed sequence information derived from the primary databases. • The amount of computational processing work varies greatly among the secondary databases; some are simple archives of translated sequence data from identified open reading frames in DNA, whereas others provide additional annotation and information related to higher levels of information regarding structure and functions. • SWISS-PROT, provides detailed sequence annotation that includes structure, function, and protein family assignment. • The annotation of each entry is carefully curated by human experts and thus is of good quality. • Much of this information is obtained from scientific literature and entered by database curators.
  • 3. • Specialized databases normally serve a specific research community or focus on a particular organism. • For example, Flybase, HIV sequence database, and Ribosomal Database Project are databases that specialize in a particular organism or a particular type of data. • Many genome databases that are taxonomic specific also fall within this category.
  • 4. Pitfalls in BD • There are many errors in sequence databases. Most errors in nucleotide sequences are caused by sequencing errors. • Some of these errors cause frameshifts that make whole gene identification difficult or protein translation impossible. • Sometimes, gene sequences are contaminated with sequences from cloning vectors. • Annotations of genes can also occasionally be false or incomplete. • All these types of errors can be passed on to other databases, causing propagation of errors. • There are also high levels of redundancy in the primary sequence databases. • Redundancy (duplication) is another major problem affecting primary databases. • Often, the same gene sequence is found under different names resulting in multiple entries and confusion about the data.
  • 5. Biological Databases • Open Access Databases • NCBI – www.ncbi.nlm.nih.gov • EBI – www.ebi.ac.uk • DDBJ – www.ddbj.nig.ac.jp • ExPASy – www.expasy.ch
  • 6. Database Searching • Sequence Based Searching – Query – Sequence; – Target DB – Seq/Str • Text Based Searching – Query – Text; – Target DB – Text/Seq/Str
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 28.