Swiss-Prot
SAGRIKA CHUGH
(M.Tech Bioinformatics)
23-Jan-17 1
INTRODUCTION
• The Universal Protein Resource Knowledgebase (UniProtKB) is the central hub for the
collection of functional information on proteins.
It consists of two sections:
Swiss-Prot
• Reviewed
• Manually annotated
• Records with information extracted from the
literature and curator-evaluated
computational analysis.
TrEMBL (Translated EMBL Nucleotide
Sequence Database)
• Unreviewed
• Computationally annotated
• Records that await full manual
annotation.
Source: http://www.uniprot.org/
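The reviewed/unreviewed split is visible in UniProtKB FASTA headers, where the database prefix is `sp` for Swiss-Prot and `tr` for TrEMBL. The following is a minimal sketch, assuming headers of the form `>db|Accession|EntryName ...`; the function name and the sample accessions are hypothetical and used only for illustration.

```python
def uniprot_section(fasta_header: str) -> str:
    """Return 'Swiss-Prot' or 'TrEMBL' for a UniProtKB FASTA header line."""
    # Assumed header layout: '>db|Accession|EntryName description',
    # where db is 'sp' (reviewed) or 'tr' (unreviewed).
    if not fasta_header.startswith(">"):
        raise ValueError("not a FASTA header line")
    db = fasta_header[1:].split("|", 1)[0]
    sections = {"sp": "Swiss-Prot", "tr": "TrEMBL"}
    if db not in sections:
        raise ValueError(f"unrecognised database prefix: {db!r}")
    return sections[db]


# Illustrative headers; the accessions and names are placeholders, not real entries.
print(uniprot_section(">sp|P00000|EXMP_HUMAN Example protein OS=Homo sapiens"))
print(uniprot_section(">tr|A0A000XXX0|A0A000XXX0_MOUSE Example protein OS=Mus musculus"))
```

The same check can be used to filter a downloaded FASTA file down to reviewed entries only.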
• Swiss-Prot was created at the Department of Medical Biochemistry of the University of
Geneva and has been maintained in collaboration with the European Molecular Biology
Laboratory (EMBL) since 1987.
• Swiss-Prot strives to provide a high level of annotation, a minimal level of redundancy
and integration with other databases.
• It is now an equal partnership between the EMBL and the Swiss Institute of
Bioinformatics (SIB).
• TrEMBL is a computer-annotated supplement to Swiss-Prot.
• The entry format is similar to that of the EMBL Nucleotide Sequence Database
maintained at the European Bioinformatics Institute.
Features of Swiss-Prot
• Annotation
• Minimal Redundancy
• Integration with other databases
• Documentation
Annotation
Data in an entry is divided into core data and annotation.
Core data
• Sequence data
• Citation information (bibliographical references)
• Taxonomic data (description of the biological source of the protein)
Annotation
• Post-translational modification(s), for example phosphorylation, acetylation, etc.
• Domains and sites, for example calcium-binding regions, zinc fingers.
• Secondary structure, for example alpha helix, beta sheet, etc.
• Quaternary structure, for example homodimer, heterotrimer, etc.
• Disease(s) associated with deficiencies in the protein
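In the flat-file format, annotation is mainly found in the comment lines (CC), the feature table (FT) and the keyword lines (KW). The sketch below groups the lines of one entry by their two-letter line code so these three annotation blocks can be inspected. It assumes each line starts with a two-character code followed by three spaces; the sample entry is a made-up placeholder, not a real record, and `group_by_line_code` is a hypothetical helper name.

```python
from collections import defaultdict

# Placeholder entry written for this example; not a real Swiss-Prot record.
SAMPLE_ENTRY = """\
ID   EXMP_MOUSE              Reviewed;         120 AA.
AC   P00000;
DE   RecName: Full=Example protein;
CC   -!- FUNCTION: Placeholder function text.
KW   Phosphoprotein; Zinc-finger.
FT   MOD_RES         15
FT                   /note="Phosphoserine"
//
"""


def group_by_line_code(entry_text: str) -> dict:
    """Group the data part of each line under its two-letter line code."""
    groups = defaultdict(list)
    for line in entry_text.splitlines():
        if line.startswith("//"):  # '//' terminates an entry
            break
        code = line[:2]
        if not code.strip():  # skip blank lines and sequence data lines
            continue
        groups[code].append(line[5:].strip())
    return dict(groups)


groups = group_by_line_code(SAMPLE_ENTRY)
for code in ("CC", "FT", "KW"):
    print(code, groups.get(code, []))
```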
Minimal redundancy
• Much of the data for a given protein comes from more than one literature report
• Data from separate reports is condensed and merged so that each entry is concise and coherent
• Conflicts between reports are listed in each entry
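The merging idea can be sketched as follows. This is only an illustration under stated assumptions, not the curators' actual procedure: each literature report is modelled as a plain dictionary, and the field names and values are hypothetical. Fields on which all reports agree go into the merged entry, and fields with more than one distinct value are listed as conflicts.

```python
def merge_reports(reports):
    """Merge several reports on one protein into a single entry plus a conflict list."""
    seen = {}
    for report in reports:
        for field, value in report.items():
            values = seen.setdefault(field, [])
            if value not in values:  # keep each distinct value once, in first-seen order
                values.append(value)
    entry = {f: v[0] for f, v in seen.items() if len(v) == 1}
    conflicts = {f: v for f, v in seen.items() if len(v) > 1}
    return entry, conflicts


# Hypothetical reports describing the same protein.
reports = [
    {"organism": "Mus musculus", "length": 120, "residue_15": "S"},
    {"organism": "Mus musculus", "length": 120, "residue_15": "T"},
]
entry, conflicts = merge_reports(reports)
print(entry)
print(conflicts)
```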
Integration with other databases
• Swiss-Prot provides cross-references to external data collections
• Integration between the three types of sequence-related databases (nucleic acid
sequences, protein sequences and protein tertiary structures)
• Swiss-Prot sample entry: swiss prot entry.txt
• Original entry: Aar2 - Protein AAR2 homolog - Mus musculus (Mouse) - Aar2 gene & protein.html
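In the flat-file format, cross-references are carried on DR lines of the form `DR   DATABASE; IDENTIFIER; further fields.` A minimal sketch of extracting them is given below. The sample lines use placeholder identifiers rather than real accessions, and `parse_cross_references` is a hypothetical helper name.

```python
def parse_cross_references(entry_text: str):
    """Return (database, identifier, extra_fields) tuples for each DR line."""
    refs = []
    for line in entry_text.splitlines():
        if not line.startswith("DR   "):
            continue
        # Drop the line code and the trailing full stop, then split on ';'.
        fields = [f.strip() for f in line[5:].rstrip().rstrip(".").split(";")]
        if len(fields) < 2:
            continue  # malformed line: no identifier present
        refs.append((fields[0], fields[1], fields[2:]))
    return refs


# Placeholder DR lines: one nucleotide-sequence and one 3D-structure cross-reference.
SAMPLE_DR_LINES = """\
DR   EMBL; X00000; CAA00000.1; -; mRNA.
DR   PDB; 0XXX; X-ray; 2.00 A; A=1-120.
"""

for database, identifier, extra in parse_cross_references(SAMPLE_DR_LINES):
    print(database, identifier, extra)
```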
Documentation
• All files documented and indexed.
• Documentation kept up-to-date.
Swiss-Prot Statistics
Number of entries (Jan 18, 2017 release)
New entries                          245
Updated entries                   64,182
Unchanged entries                489,047
Total                            553,474
Entries with updated sequences        40
With a fragmented AA sequence      9,143
With known alternative products   24,759
Source: http://www.uniprot.org/statistics/Swiss-Prot
TrEMBL: A computer-annotated supplement to Swiss-PROT
• TrEMBL (translation of the EMBL nucleotide sequence database) was introduced in 1996.
Why TrEMBL?
• Increased data flow from genome projects to the sequence databases.
• To maintain the high annotation quality of Swiss-PROT.
• To make sequences available as quickly as possible.
• TrEMBL consists of computer-annotated entries derived from the translation of all
coding sequences (CDS) in the nucleotide sequence databases, except for CDS
already included in Swiss-PROT.
• It also contains protein sequences extracted from the literature and protein sequences
submitted directly by the user community.
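The translation step that gives TrEMBL its name can be illustrated with a short sketch. It assumes the standard genetic code, a CDS given in frame on the coding strand, and translation that stops at the first stop codon; the real pipeline handles many more cases (alternative genetic codes, partial CDS, and so on). `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")  # accept RNA input as well
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate_cds("ATGGCCAAATAA"))  # prints MAK
```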
TrEMBL
SP-TrEMBL (SWISS-PROT TrEMBL)
• Contains sequences that will eventually be incorporated into SWISS-PROT.
REM-TrEMBL (Remaining TrEMBL)
• Contains sequences that will not be incorporated into SWISS-PROT, for example
synthetic sequences, patent application sequences, fragments of fewer than 8 amino
acids, and coding sequences where there is strong experimental evidence that the
sequence does not code for a real protein.
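The split between the two TrEMBL sections amounts to a simple rule, sketched below. The `Candidate` record and its flags are hypothetical, introduced only to express the criteria listed above. As a simplification, the length test is applied to every sequence rather than only to sequences flagged as fragments.

```python
from dataclasses import dataclass

MIN_LENGTH = 8  # sequences shorter than 8 amino acids are excluded


@dataclass
class Candidate:
    """Hypothetical record for a translated sequence awaiting classification."""
    sequence: str
    synthetic: bool = False
    from_patent: bool = False
    not_real_protein: bool = False  # strong experimental evidence against a real protein


def trembl_section(candidate: Candidate) -> str:
    """Return 'REM-TrEMBL' if any exclusion criterion applies, else 'SP-TrEMBL'."""
    excluded = (
        candidate.synthetic
        or candidate.from_patent
        or candidate.not_real_protein
        or len(candidate.sequence) < MIN_LENGTH
    )
    return "REM-TrEMBL" if excluded else "SP-TrEMBL"


print(trembl_section(Candidate("MAKLVTEEQR")))
print(trembl_section(Candidate("MAKLV")))
print(trembl_section(Candidate("MAKLVTEEQR", from_patent=True)))
```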
TrEMBL Statistics
Number of entries (Jan 18, 2017 release)
New entries                        3,031,100
Updated entries                   20,906,527
Unchanged entries                 49,774,254
Total                             73,711,881
Entries with updated sequences           746
With a fragmented AA sequence      8,492,670
With known alternative products            0
Source: http://www.uniprot.org/statistics/TrEMBL
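As a quick consistency check on the two sets of release statistics, the new, updated and unchanged counts can be summed and compared with the stated totals, and the share of reviewed entries in UniProtKB follows directly. The figures below are the ones quoted for the Jan 18, 2017 release; the variable names are only illustrative.

```python
# Entry counts quoted for the Jan 18, 2017 release.
swiss_prot = {"new": 245, "updated": 64_182, "unchanged": 489_047}
trembl = {"new": 3_031_100, "updated": 20_906_527, "unchanged": 49_774_254}

swiss_prot_total = sum(swiss_prot.values())
trembl_total = sum(trembl.values())

# The sums should reproduce the totals given in the tables.
assert swiss_prot_total == 553_474
assert trembl_total == 73_711_881

# Fraction of all UniProtKB entries that are reviewed (Swiss-Prot).
reviewed_share = swiss_prot_total / (swiss_prot_total + trembl_total)
print(f"Reviewed share of UniProtKB: {reviewed_share:.2%}")
```

The reviewed share comes out well under 1%, which illustrates why a computer-annotated supplement is needed alongside manual curation.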
Summary
Source: www.expasy.org
CONCLUSION
• Swiss-Prot has continuously enhanced its format and content to keep pace with the
growing knowledge pool in proteomics while maintaining a high quality of annotation.
• Automated annotation procedures are used for Swiss-Prot in a very conservative
manner.
• The extensive integration of SWISS-PROT with specialized databases enables users
to navigate through current knowledge in the life sciences, providing an insight into
the universe of proteins.
Editor's Notes

  1. Encyclopedia of proteins
  2. Annotation covers items such as the description of the function of a protein, its domain structure, post-translational modifications, variants, etc. TrEMBL covers translated coding sequences except the CDSs already included in SWISS-PROT.
  3. Annotation is mainly found in the comment lines (CC), in the feature table (FT) and in the keyword lines (KW).
  4. In addition, there is a weekly update to TrEMBL called TrEMBLnew. TrEMBLnew is produced from new nucleotide sequences deposited in the EMBL nucleotide sequence database. At each TrEMBL release, the TrEMBLnew entries are processed; any entries redundant against SWISS-PROT/TrEMBL (4) are merged and the remainder then progresses into TrEMBL (5).
  5. Automated annotation procedures are only applied where they allow the achievement of the same level of quality as obtained by manual annotation.