SlideShare a Scribd company logo
1 of 6
Protein sequence databases
Introduction:
The Protein database is a collection of sequences from several sources, including translations from
annotated coding regions in GenBank, RefSeqand TPA, as well as records from SwissProt, PIR,
PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and
function.
SWISS-PROT
– Manually curated
– high-quality annotations, less data
GenPept/TREMBL
– Translated coding sequences from GenBank/EMBL
– Few annotations, more up to date
PIR
– Phylogenetic-based annotations
All 3 now combining efforts to form UniProt (http://www.uniprot.org)
PDB (Protein Databank)
ď‚· Stores 3-dimensional atomic coordinates for biological molecules including protein and
nucleic acids
ď‚· Data obtained by X-ray crystallography, NMR, or computer modelling
http://www.rcsb.org/pdb/
MMDB (Molecular Modelling database)
Over 28,000 3D macromolecular structures, including proteins and
polynucleotides(http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Structure)
SCOP (Structural Classification of Proteins)
Classification of proteins according to structural and evolutionary relationships
SWISS-PROT
Introduction:
SWISS-PROT is an annotated protein sequence database, which was created at the
Department of Medical Biochemistry of the University of Geneva and has been a collaborative
effort of the Department and the European Molecular Biology Laboratory (EMBL), since 1987.
SWISS-PROT is now an equal partnership between the EMBL and the Swiss Institute of
Bioinformatics (SIB). The EMBL activities are carried out by its Hinxton Outstation, the European
Bioinformatics Institute (EBI). The SWISS-PROT protein sequence database consists of sequence
entries. Sequence entries are composed of different line types, each with their own format.
The SWISS-PROT database distinguishes itself from other protein sequence databases by three
distinct criteria:
(i) annotations
(ii) (ii) minimal redundancy and
(iii) (iii) integration with other databases.
Annotations
CORE DATA
• The sequence data
• The citation information (bibliographical references)
• The taxonomic data (description of the biological source of the protein)
Annotation- Additional Data
• Descriptions include:
• Function(s) of the protein
• Posttranslational modification(s) such as carbohydrates, phosphorylation, acetylation and
GPI-anchor
• Domains and sites, for example, calcium-binding regions, ATP-binding sites, zinc fingers,
homeoboxes, and SH2 and SH3 domains
• Secondary structure, e.g. alpha helix, beta sheet
• Quaternary structure, i.g. homodimer, heterotrimer, etc.
• Similarities to other proteins
• Disease(s) associated with any number of deficiencies in the protein
• Sequence conflicts, variants, etc.
Minimal Redundancy
• Much of data comes from more than one literature report
• Data condensed and merged to appear more concise and coherent
• Conflicts in data are listed for each entry
Integration with other databases
• 50+ databases for cross-reference
• Nucleic acid sequences, protein tertiary structure, protein 3-D models, etc.
• Allows Swiss-PROT to play a major role as the focal point for biomolecular
interconnectivity
Documentation
• All files documented and indexed
• Documentation kept up-to-date
Applications for the Knowledgebase
• Provides highly organized data and information on a wide variety of proteins
• Can be used as a starting point for protein research
• Allows searches to be conducted starting with various search strings
• Biochemical encyclopedia
SWISS-PROT Flat File format
ID - Identification.
AC - Accession number(s).
DT - Date.
DE - Description.
GN - Gene name(s).
OS - Organism species.
OG - Organelle.
OC - Organism classification.
RN - Reference number.
RP - Reference position.
RC - Reference comments.
RX - Reference cross-references.
RA - Reference authors.
RL - Reference location.
CC - Comments or notes.
DR - Database cross-references.
KW - Keywords.
FT - Feature table data.
SQ - Sequence header.
- (blanks) sequence data.
// - Termination line.

More Related Content

What's hot (20)

Scop database
Scop databaseScop database
Scop database
 
Biological database
Biological databaseBiological database
Biological database
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
BLAST
BLASTBLAST
BLAST
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 
EMBL
EMBLEMBL
EMBL
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Fasta
FastaFasta
Fasta
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Rasmol
RasmolRasmol
Rasmol
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Protein databases
Protein databasesProtein databases
Protein databases
 

Viewers also liked (17)

Community ecology
Community ecologyCommunity ecology
Community ecology
 
Negotiation skill
Negotiation skillNegotiation skill
Negotiation skill
 
Food chain
Food chainFood chain
Food chain
 
Table manners
Table mannersTable manners
Table manners
 
Locus link
Locus linkLocus link
Locus link
 
Group discussion
Group discussionGroup discussion
Group discussion
 
FERMENTATIONS , PHOTOSYNTHESIS & NITROGEN FIXATION
FERMENTATIONS , PHOTOSYNTHESIS & NITROGEN FIXATION FERMENTATIONS , PHOTOSYNTHESIS & NITROGEN FIXATION
FERMENTATIONS , PHOTOSYNTHESIS & NITROGEN FIXATION
 
Bioinformatics assignment
Bioinformatics assignmentBioinformatics assignment
Bioinformatics assignment
 
Identification of poisonous snakes
Identification of poisonous snakesIdentification of poisonous snakes
Identification of poisonous snakes
 
Working with charts in word 2003
Working with charts in word 2003Working with charts in word 2003
Working with charts in word 2003
 
Data retrieval tools
Data retrieval toolsData retrieval tools
Data retrieval tools
 
Social graces
Social gracesSocial graces
Social graces
 
Installing and uninstalling computer software
Installing and uninstalling computer softwareInstalling and uninstalling computer software
Installing and uninstalling computer software
 
Gen bank (genetic sequence databank)
Gen bank (genetic sequence databank)Gen bank (genetic sequence databank)
Gen bank (genetic sequence databank)
 
Flagella- Size, Shape, Arrangement
Flagella- Size, Shape, Arrangement Flagella- Size, Shape, Arrangement
Flagella- Size, Shape, Arrangement
 
Preparation of solutions
Preparation of solutionsPreparation of solutions
Preparation of solutions
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 

Similar to Protein sequence databases

Swiss prot protein database
Swiss prot protein databaseSwiss prot protein database
Swiss prot protein databaseAshfaq Ahmad
 
Databases
DatabasesDatabases
Databasesafzamalik
 
swiss-prot<bioinformatics>
swiss-prot<bioinformatics>swiss-prot<bioinformatics>
swiss-prot<bioinformatics>Pardeep kaushal
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databasesMeetika Gupta
 
Biological databases
Biological databasesBiological databases
Biological databasesTamanna Syeda
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiiiMuhammad Younis
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEPrashantSharma807
 
Protein databases
Protein databasesProtein databases
Protein databasesbansalaman80
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein databasechinmayeec
 
Protein data bank
Protein data bankProtein data bank
Protein data bankAlichy Sowmya
 

Similar to Protein sequence databases (20)

Swiss prot protein database
Swiss prot protein databaseSwiss prot protein database
Swiss prot protein database
 
Proteomic databases
Proteomic databasesProteomic databases
Proteomic databases
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Databases
DatabasesDatabases
Databases
 
swiss-prot<bioinformatics>
swiss-prot<bioinformatics>swiss-prot<bioinformatics>
swiss-prot<bioinformatics>
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Biological databases
Biological databases Biological databases
Biological databases
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiii
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein database
 
Structural database and their classification by abdul qahar
Structural database and their classification by abdul qaharStructural database and their classification by abdul qahar
Structural database and their classification by abdul qahar
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 

More from Vidya Kalaivani Rajkumar

Recombinant vaccines-Peptide Vaccines
Recombinant vaccines-Peptide Vaccines Recombinant vaccines-Peptide Vaccines
Recombinant vaccines-Peptide Vaccines Vidya Kalaivani Rajkumar
 
Transgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress toleranceTransgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress toleranceVidya Kalaivani Rajkumar
 
In vivo synthesis of tissues and organs
In vivo synthesis of tissues and organsIn vivo synthesis of tissues and organs
In vivo synthesis of tissues and organsVidya Kalaivani Rajkumar
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databasesVidya Kalaivani Rajkumar
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLVidya Kalaivani Rajkumar
 

More from Vidya Kalaivani Rajkumar (20)

Recombinant vaccines-Peptide Vaccines
Recombinant vaccines-Peptide Vaccines Recombinant vaccines-Peptide Vaccines
Recombinant vaccines-Peptide Vaccines
 
Transgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress toleranceTransgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress tolerance
 
Bioreactors in tissue engineering
Bioreactors in tissue engineeringBioreactors in tissue engineering
Bioreactors in tissue engineering
 
Tissue assembly in microgravity
Tissue assembly in microgravityTissue assembly in microgravity
Tissue assembly in microgravity
 
In vivo synthesis of tissues and organs
In vivo synthesis of tissues and organsIn vivo synthesis of tissues and organs
In vivo synthesis of tissues and organs
 
Bioartificial pancreas
Bioartificial pancreasBioartificial pancreas
Bioartificial pancreas
 
Biomaterials for tissue engineering
Biomaterials for tissue engineeringBiomaterials for tissue engineering
Biomaterials for tissue engineering
 
Haematopoietic system
Haematopoietic systemHaematopoietic system
Haematopoietic system
 
Fasta
FastaFasta
Fasta
 
Water vascular system of star fish
Water vascular system of star fishWater vascular system of star fish
Water vascular system of star fish
 
Cephalopodes are advance molluscs
Cephalopodes are advance molluscsCephalopodes are advance molluscs
Cephalopodes are advance molluscs
 
Beat air pollution
Beat air pollution Beat air pollution
Beat air pollution
 
Birth control methods
Birth control methodsBirth control methods
Birth control methods
 
Future of human evolution
Future of human evolutionFuture of human evolution
Future of human evolution
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Assignment on developmental zoology
Assignment on developmental zoologyAssignment on developmental zoology
Assignment on developmental zoology
 
Development of chick
Development of chickDevelopment of chick
Development of chick
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOL
 
Swiss pdb viewer
Swiss pdb viewerSwiss pdb viewer
Swiss pdb viewer
 

Recently uploaded

Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -INandakishor Bhaurao Deshmukh
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsHajira Mahmood
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxSulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxnoordubaliya2003
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 

Recently uploaded (20)

Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutions
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxSulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 

Protein sequence databases

  • 1. Protein sequence databases Introduction: The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeqand TPA, as well as records from SwissProt, PIR, PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and function. SWISS-PROT – Manually curated – high-quality annotations, less data GenPept/TREMBL – Translated coding sequences from GenBank/EMBL – Few annotations, more up to date PIR – Phylogenetic-based annotations All 3 now combining efforts to form UniProt (http://www.uniprot.org) PDB (Protein Databank) ď‚· Stores 3-dimensional atomic coordinates for biological molecules including protein and nucleic acids ď‚· Data obtained by X-ray crystallography, NMR, or computer modelling http://www.rcsb.org/pdb/ MMDB (Molecular Modelling database) Over 28,000 3D macromolecular structures, including proteins and polynucleotides(http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Structure) SCOP (Structural Classification of Proteins) Classification of proteins according to structural and evolutionary relationships SWISS-PROT Introduction: SWISS-PROT is an annotated protein sequence database, which was created at the Department of Medical Biochemistry of the University of Geneva and has been a collaborative effort of the Department and the European Molecular Biology Laboratory (EMBL), since 1987. SWISS-PROT is now an equal partnership between the EMBL and the Swiss Institute of
  • 2. Bioinformatics (SIB). The EMBL activities are carried out by its Hinxton Outstation, the European Bioinformatics Institute (EBI). The SWISS-PROT protein sequence database consists of sequence entries. Sequence entries are composed of different line types, each with their own format. The SWISS-PROT database distinguishes itself from other protein sequence databases by three distinct criteria: (i) annotations (ii) (ii) minimal redundancy and (iii) (iii) integration with other databases. Annotations CORE DATA • The sequence data • The citation information (bibliographical references) • The taxonomic data (description of the biological source of the protein) Annotation- Additional Data • Descriptions include: • Function(s) of the protein • Posttranslational modification(s) such as carbohydrates, phosphorylation, acetylation and GPI-anchor • Domains and sites, for example, calcium-binding regions, ATP-binding sites, zinc fingers, homeoboxes, and SH2 and SH3 domains • Secondary structure, e.g. alpha helix, beta sheet • Quaternary structure, i.g. homodimer, heterotrimer, etc. • Similarities to other proteins • Disease(s) associated with any number of deficiencies in the protein • Sequence conflicts, variants, etc. Minimal Redundancy • Much of data comes from more than one literature report • Data condensed and merged to appear more concise and coherent • Conflicts in data are listed for each entry Integration with other databases • 50+ databases for cross-reference
  • 3. • Nucleic acid sequences, protein tertiary structure, protein 3-D models, etc. • Allows Swiss-PROT to play a major role as the focal point for biomolecular interconnectivity Documentation • All files documented and indexed • Documentation kept up-to-date Applications for the Knowledgebase • Provides highly organized data and information on a wide variety of proteins • Can be used as a starting point for protein research • Allows searches to be conducted starting with various search strings • Biochemical encyclopedia
  • 5. ID - Identification. AC - Accession number(s). DT - Date. DE - Description. GN - Gene name(s). OS - Organism species. OG - Organelle. OC - Organism classification. RN - Reference number. RP - Reference position. RC - Reference comments. RX - Reference cross-references. RA - Reference authors. RL - Reference location. CC - Comments or notes.
  • 6. DR - Database cross-references. KW - Keywords. FT - Feature table data. SQ - Sequence header. - (blanks) sequence data. // - Termination line.