SlideShare a Scribd company logo
1 of 80
Download to read offline
Protein Database
Unit-2nd
Dr. Khalid Rehman Hakeem
Department of Bioresources
University of Kashmir
Biores-111: Bio-informatics
Unit I
1. What is Bioinformatics? Overview of Bioinformatics.
2. Basic concepts and applications of bioinformatics.
3. Bio-informatics related to proteins, amino acids, DNA and RNA
5. Sequence, structure and function.
Unit II
•Bioinformatics databases, introduction, type of databases: Nucleotide sequence databases; primary
nucleotide sequence databases viz., EMBL; GeneBank ; DDBJ
•Protein sequence databases viz., SwissProt/TrEMBL; PIR
•Sequence motif databases: Pfam and. PROSITE
•Protein structure databases: Protein Data Bank; SCOP; CATH
•Other relevant databases e.g., KEGG.
Unit III
•Single sequence and pair-wise alignments
•Scoring matrix: PAM and BLOSUM and Dot Plots
•Heuristic methods: FASTA and BLAST
•Statistics of sequence alignment score: E-Value; P-Value.
•Multiple sequence alignments: ClustalW, PSI-Blast.
Unit IV
• Human Genome Project
• Genbank SNPs, GOG, STSs, and ESTs data bases
• Genomic Data Mining
• Understanding of the principles of the microarray technique, limitations of microarrays and
problems associated with the technique
•Brief introduction to PERL language.
Proteome Bioinformatics/Databases
 1. Protein sequence databases
- SWISS-PROT; TrEMBL
2. Nucleotide sequence databases
- Hidden treasures: EST
3. Pattern and profile databases
- PROSITE; BLOCKS; PRINTS; Pfam
4. 2D-PAGE databases
- SWISS-2DPAGE;WORLD-2DPAGE
5. 3D structural databases
- PDB; DSSP; HSSP
Proteome Bioinformatics/Databases
 6. Post-translational modification databases
- O-GLYCBASE
7. Genomic databases
- OMIM; GDB; MGD; FlyBase
8. Metabolic databases
- ENZYME; KEGG; EMP/WT
9. Interfacing and Integrating Databases
- EXExPASy; SWISS-PROT; Cyber–Encyclopaedia
of the Proteome
Protein Database
 UniPro - protein knowledge database
 Swiss 2DPAGE - 2D PAGE
 Pfam - protein family and domain
 Prosite - protein family and domain
 SMART - protein module
 BLOCK - protein conserved regions
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
The Pfam database is one the most important collections
of information in the world for classifying proteins. The
database categorises 75 per cent of known proteins to form
a library of protein families - a 'periodic table' of biology.
The open access resource was established at the Wellcome
Trust Sanger Institute in 1998. Its vision is to provide a
tool which allows experimental, computational and
evolutionary biologists to classify protein sequences and
answer questions about what they do and how they have
evolved. The Pfam project is led by Dr Alex Bateman at the
Sanger Institute.
Each entry in the Pfam database includes a protein sequence
alignment as well as an accompanying statistical model, called a
hidden Markov model
Pfam :: Home
Pfam :: Home
 Pfam is a large collection of multiple sequence
alignments and hidden Markov models
(HMMs)covering many common protein families.
 Pfam version 26.0 (November, 2011) contains
alignments and models for 13672 protein families,
based on the Swissprot and SP-TrEMBL
protein sequence databases.
HMM: A Hidden Markov Model
 HMM: A Hidden Markov Model,
or HMM, is a statistical model for
any system that can be represented
as a succession of transitions
between discrete states.
•Proteins are generally composed of one or more functional
regions, commonly termed domains. Different combinations of
domains give rise to the diverse range of proteins found in nature.
The identification of domains that occur within proteins can
therefore provide insights into their function.
•There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A
entries are high quality, manually curated families. Although
these Pfam-A entries cover a large proportion of the sequences in
the underlying sequence database, in order to give a more
comprehensive coverage of known proteins we also generate a
supplement using the ADDA (Automatic Domain Decomposition
Algorithm ) database. These automatically generated entries are
called Pfam-B. Although of lower quality, Pfam-B families can be
useful for identifying functionally conserved regions when no
Pfam-A entries are found.
•Pfam also generates higher-level groupings of related
families, known as clans. A clan is a collection of Pfam-A
entries which are related by similarity of sequence, structure or
Protein database
Protein database
Protein database
Protein database
Biotin Synthase
Biotin synthase (BioB) converts dethiobiotin into biotin by
inserting a sulfur atom between C6 and C9 of dethiobiotin
inan S-adenosylmethionine (SAM)-dependent reaction.
Protein database
Protein database
Protein database
Reaction steps common in radical
SAM enzymes
PROSITE is a protein database. It consists of entries describing the
protein families, domains and functional sites as well as amino acid patterns,
signatures, and profiles in them, which are manually curated by a team of the
Swiss Institute of Bioinformatics and tightly integrated into Swiss-Prot protein
annotation.
PROSITE was created in 1988 by Amos Bairoch, who directed the group for
more than 20 years. Since July 2009 the director of the PROSITE, Swiss-Prot
and Vital-IT groups is Ioannis Xenarios.
PROSITE's uses include identifying possible functions of newly discovered
proteins and analysis of known proteins for previously undetermined activity.
Properties from well-studied genes can be propagated to biologically related
organisms, and for different or poorly known genes biochemical functions can be
predicted from similarities.
PROSITE offers tools for protein sequence analysis and motif detection. It is
part of the ExPASy proteomics analysis servers.
PROSITE
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
 Simple
 Modular
 Architecture
 Research
 Tool
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
Protein database
KEGG:
Kyoto Encyclopedia of Genes
and Genomes
KEGG (Kyoto Encyclopedia of Genes and Genomes) is a
collection of online databases dealing with genomes, enzymatic
pathways, and biological chemicals. The PATHWAY database records
networks of molecular interactions in the cells, and variants of them
specific to particular organisms. As of July 2011, KEGG has switched
to a subscription model and access via FTP is no longer free.
The KEGG, the Kyoto Encyclopedia of Genes and Genomes, was
initiated by the Japanese human genome programme in 1995.
According to the developers they consider KEGG to be a "computer
representation" of the biological system. The KEGG database can be
utilized for modeling and simulation, browsing and retrieval of data. It
is a part of the systems biology approach.
KEGG maintains five main databases:
KEGG Atlas
KEGG Pathway
KEGG Genes
KEGG Ligand
KEGG BRITE
Purpose
 Developed at the Kanehisa Laboratory
 Integrates:
 current knowledge of molecular interaction
networks
 information about genes and proteins
 information about chemical compounds
and reactions
THANK YOU

More Related Content

What's hot

Blast and fasta
Blast and fastaBlast and fasta
Blast and fastaALLIENU
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformaticsKAUSHAL SAHU
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPuneet Kulyana
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databasesPranavathiyani G
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentationRida Khalid
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)ShivaniShewale2
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentRamya S
 

What's hot (20)

Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformatics
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Swiss PROT
Swiss PROT Swiss PROT
Swiss PROT
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Protein Data Bank (PDB)
Protein Data Bank (PDB)Protein Data Bank (PDB)
Protein Data Bank (PDB)
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Protein database
Protein databaseProtein database
Protein database
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 

Viewers also liked

Protein databases
Protein databasesProtein databases
Protein databasessarumalay
 
Cytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeCytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeNixon Mendez
 
Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources innocent87
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsMakarand Bhale
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionArindam Ghosh
 
Errors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation SequencingErrors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation SequencingNixon Mendez
 
Addressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambiaAddressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambiaNixon Mendez
 
PERL- Bioperl modules
PERL- Bioperl modulesPERL- Bioperl modules
PERL- Bioperl modulesNixon Mendez
 
Clustering and Visualisation using R programming
Clustering and Visualisation using R programmingClustering and Visualisation using R programming
Clustering and Visualisation using R programmingNixon Mendez
 
Structural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeStructural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeNixon Mendez
 
Protein database ..... of NCBI
Protein database ..... of NCBI Protein database ..... of NCBI
Protein database ..... of NCBI Alagppa University
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)N Poorin
 

Viewers also liked (20)

Protein databases
Protein databasesProtein databases
Protein databases
 
Cytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeCytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScape
 
Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources
 
protein data bank
protein data bankprotein data bank
protein data bank
 
Ddbj
DdbjDdbj
Ddbj
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure Prediction
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
PowerMV
PowerMV PowerMV
PowerMV
 
Errors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation SequencingErrors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation Sequencing
 
Lyme disease
Lyme diseaseLyme disease
Lyme disease
 
Addressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambiaAddressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambia
 
PERL- Bioperl modules
PERL- Bioperl modulesPERL- Bioperl modules
PERL- Bioperl modules
 
Clustering and Visualisation using R programming
Clustering and Visualisation using R programmingClustering and Visualisation using R programming
Clustering and Visualisation using R programming
 
Protein Data Bank
Protein Data BankProtein Data Bank
Protein Data Bank
 
PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Structural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeStructural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its Scope
 
Protein database ..... of NCBI
Protein database ..... of NCBI Protein database ..... of NCBI
Protein database ..... of NCBI
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)
 

Similar to Protein database

Bioinformatics databases
Bioinformatics databasesBioinformatics databases
Bioinformatics databasesgokilaamu
 
Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsClaudine83
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
Essay On Protein Structure Assurance
Essay On Protein Structure AssuranceEssay On Protein Structure Assurance
Essay On Protein Structure AssuranceGinger Schiffli
 
Essay On Protein Structure Assurance
Essay On Protein Structure AssuranceEssay On Protein Structure Assurance
Essay On Protein Structure AssuranceTara Jensen
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsAyeshaYousaf20
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirKAUSHAL SAHU
 
Mapping protein to function
Mapping protein to functionMapping protein to function
Mapping protein to functionAbhik Seal
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein databasechinmayeec
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databasesCharu Sharma
 
Protein structure
Protein structureProtein structure
Protein structurePooja Pawar
 

Similar to Protein database (20)

Bioinformatics databases
Bioinformatics databasesBioinformatics databases
Bioinformatics databases
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomics
 
Protein database
Protein  databaseProtein  database
Protein database
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Essay On Protein Structure Assurance
Essay On Protein Structure AssuranceEssay On Protein Structure Assurance
Essay On Protein Structure Assurance
 
Essay On Protein Structure Assurance
Essay On Protein Structure AssuranceEssay On Protein Structure Assurance
Essay On Protein Structure Assurance
 
Proteomic databases
Proteomic databasesProteomic databases
Proteomic databases
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
 
Mapping protein to function
Mapping protein to functionMapping protein to function
Mapping protein to function
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
NIH-mar2604.rm.ppt
NIH-mar2604.rm.pptNIH-mar2604.rm.ppt
NIH-mar2604.rm.ppt
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein database
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databases
 
Protein structure
Protein structureProtein structure
Protein structure
 

More from Khalid Hakeem

Phytoremediation (Updated Information)
Phytoremediation (Updated Information)Phytoremediation (Updated Information)
Phytoremediation (Updated Information)Khalid Hakeem
 
Research beyond borders sky is the limit
Research beyond borders sky is the limitResearch beyond borders sky is the limit
Research beyond borders sky is the limitKhalid Hakeem
 
RESEARCH TO PUBLICATION: A JOURNEY
RESEARCH TO PUBLICATION: A JOURNEYRESEARCH TO PUBLICATION: A JOURNEY
RESEARCH TO PUBLICATION: A JOURNEYKhalid Hakeem
 
Forest biotechnology khalid ppt.
Forest biotechnology  khalid ppt.Forest biotechnology  khalid ppt.
Forest biotechnology khalid ppt.Khalid Hakeem
 
Flood control by plants
Flood control by plants Flood control by plants
Flood control by plants Khalid Hakeem
 
Scientific writing process
Scientific writing processScientific writing process
Scientific writing processKhalid Hakeem
 
International Mangrove Conference 11-13 Nov. 2014
 International Mangrove Conference  11-13 Nov. 2014 International Mangrove Conference  11-13 Nov. 2014
International Mangrove Conference 11-13 Nov. 2014Khalid Hakeem
 
Proteomics Processes and Applications
Proteomics Processes and ApplicationsProteomics Processes and Applications
Proteomics Processes and ApplicationsKhalid Hakeem
 
How to write great research papers
How to write great research papersHow to write great research papers
How to write great research papersKhalid Hakeem
 
How to write an answer
How to write an answerHow to write an answer
How to write an answerKhalid Hakeem
 
How to write a Wonderful Research paper
How to write a Wonderful Research paperHow to write a Wonderful Research paper
How to write a Wonderful Research paperKhalid Hakeem
 

More from Khalid Hakeem (16)

Phytoremediation (Updated Information)
Phytoremediation (Updated Information)Phytoremediation (Updated Information)
Phytoremediation (Updated Information)
 
Research beyond borders sky is the limit
Research beyond borders sky is the limitResearch beyond borders sky is the limit
Research beyond borders sky is the limit
 
RESEARCH TO PUBLICATION: A JOURNEY
RESEARCH TO PUBLICATION: A JOURNEYRESEARCH TO PUBLICATION: A JOURNEY
RESEARCH TO PUBLICATION: A JOURNEY
 
Forest biotechnology khalid ppt.
Forest biotechnology  khalid ppt.Forest biotechnology  khalid ppt.
Forest biotechnology khalid ppt.
 
Flood control by plants
Flood control by plants Flood control by plants
Flood control by plants
 
Scientific writing process
Scientific writing processScientific writing process
Scientific writing process
 
International Mangrove Conference 11-13 Nov. 2014
 International Mangrove Conference  11-13 Nov. 2014 International Mangrove Conference  11-13 Nov. 2014
International Mangrove Conference 11-13 Nov. 2014
 
Proteomics Processes and Applications
Proteomics Processes and ApplicationsProteomics Processes and Applications
Proteomics Processes and Applications
 
How to write great research papers
How to write great research papersHow to write great research papers
How to write great research papers
 
Khalid, kerala
Khalid, keralaKhalid, kerala
Khalid, kerala
 
Forgrad 2103
Forgrad 2103Forgrad 2103
Forgrad 2103
 
Best paper award
Best paper awardBest paper award
Best paper award
 
How to write an answer
How to write an answerHow to write an answer
How to write an answer
 
Publish or prish
Publish or prishPublish or prish
Publish or prish
 
Rafflesia
RafflesiaRafflesia
Rafflesia
 
How to write a Wonderful Research paper
How to write a Wonderful Research paperHow to write a Wonderful Research paper
How to write a Wonderful Research paper
 

Recently uploaded

CW marking grid Analytical BS - M Ahmad.docx
CW  marking grid Analytical BS - M Ahmad.docxCW  marking grid Analytical BS - M Ahmad.docx
CW marking grid Analytical BS - M Ahmad.docxmarwaahmad357
 
Human brain.. It's parts and function.
Human brain.. It's parts and function. Human brain.. It's parts and function.
Human brain.. It's parts and function. MUKTA MANJARI SAHOO
 
Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.ShwetaHattimare
 
Application of Foraminiferal Ecology- Rahul.pptx
Application of Foraminiferal Ecology- Rahul.pptxApplication of Foraminiferal Ecology- Rahul.pptx
Application of Foraminiferal Ecology- Rahul.pptxRahulVishwakarma71547
 
Bureau of Indian Standards Specification of Shampoo.pptx
Bureau of Indian Standards Specification of Shampoo.pptxBureau of Indian Standards Specification of Shampoo.pptx
Bureau of Indian Standards Specification of Shampoo.pptxkastureyashashree
 
IB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptxIB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptxUalikhanKalkhojayev1
 
Krishi Vigyan Kendras - कृषि विज्ञान केंद्र
Krishi Vigyan Kendras - कृषि विज्ञान केंद्रKrishi Vigyan Kendras - कृषि विज्ञान केंद्र
Krishi Vigyan Kendras - कृषि विज्ञान केंद्रKrashi Coaching
 
Pests of ragi_Identification, Binomics_Dr.UPR
Pests of ragi_Identification, Binomics_Dr.UPRPests of ragi_Identification, Binomics_Dr.UPR
Pests of ragi_Identification, Binomics_Dr.UPRPirithiRaju
 
Applied Biochemistry feedback_M Ahwad 2023.docx
Applied Biochemistry feedback_M Ahwad 2023.docxApplied Biochemistry feedback_M Ahwad 2023.docx
Applied Biochemistry feedback_M Ahwad 2023.docxmarwaahmad357
 
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...marwaahmad357
 
Substances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening TestSubstances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening TestAkashDTejwani
 
Main Exam Applied biochemistry final year
Main Exam Applied biochemistry final yearMain Exam Applied biochemistry final year
Main Exam Applied biochemistry final yearmarwaahmad357
 
Genomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenomeGenomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenomeAjay Kumar Mahato
 
RCPE terms and cycles scenarios as of March 2024
RCPE terms and cycles scenarios as of March 2024RCPE terms and cycles scenarios as of March 2024
RCPE terms and cycles scenarios as of March 2024suelcarter1
 
Pests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPRPests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPRPirithiRaju
 
Pests of tenai_Identification,Binomics_Dr.UPR
Pests of tenai_Identification,Binomics_Dr.UPRPests of tenai_Identification,Binomics_Dr.UPR
Pests of tenai_Identification,Binomics_Dr.UPRPirithiRaju
 
KeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data scienceKeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data scienceLayne Sadler
 
biosynthesis of the cell wall and antibiotics
biosynthesis of the cell wall and antibioticsbiosynthesis of the cell wall and antibiotics
biosynthesis of the cell wall and antibioticsSafaFallah
 
Basic Concepts in Pharmacology in molecular .pptx
Basic Concepts in Pharmacology in molecular  .pptxBasic Concepts in Pharmacology in molecular  .pptx
Basic Concepts in Pharmacology in molecular .pptxVijayaKumarR28
 
Exploration Method’s in Archaeological Studies & Research
Exploration Method’s in Archaeological Studies & ResearchExploration Method’s in Archaeological Studies & Research
Exploration Method’s in Archaeological Studies & ResearchPrachya Adhyayan
 

Recently uploaded (20)

CW marking grid Analytical BS - M Ahmad.docx
CW  marking grid Analytical BS - M Ahmad.docxCW  marking grid Analytical BS - M Ahmad.docx
CW marking grid Analytical BS - M Ahmad.docx
 
Human brain.. It's parts and function.
Human brain.. It's parts and function. Human brain.. It's parts and function.
Human brain.. It's parts and function.
 
Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.
 
Application of Foraminiferal Ecology- Rahul.pptx
Application of Foraminiferal Ecology- Rahul.pptxApplication of Foraminiferal Ecology- Rahul.pptx
Application of Foraminiferal Ecology- Rahul.pptx
 
Bureau of Indian Standards Specification of Shampoo.pptx
Bureau of Indian Standards Specification of Shampoo.pptxBureau of Indian Standards Specification of Shampoo.pptx
Bureau of Indian Standards Specification of Shampoo.pptx
 
IB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptxIB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptx
 
Krishi Vigyan Kendras - कृषि विज्ञान केंद्र
Krishi Vigyan Kendras - कृषि विज्ञान केंद्रKrishi Vigyan Kendras - कृषि विज्ञान केंद्र
Krishi Vigyan Kendras - कृषि विज्ञान केंद्र
 
Pests of ragi_Identification, Binomics_Dr.UPR
Pests of ragi_Identification, Binomics_Dr.UPRPests of ragi_Identification, Binomics_Dr.UPR
Pests of ragi_Identification, Binomics_Dr.UPR
 
Applied Biochemistry feedback_M Ahwad 2023.docx
Applied Biochemistry feedback_M Ahwad 2023.docxApplied Biochemistry feedback_M Ahwad 2023.docx
Applied Biochemistry feedback_M Ahwad 2023.docx
 
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
 
Substances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening TestSubstances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening Test
 
Main Exam Applied biochemistry final year
Main Exam Applied biochemistry final yearMain Exam Applied biochemistry final year
Main Exam Applied biochemistry final year
 
Genomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenomeGenomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenome
 
RCPE terms and cycles scenarios as of March 2024
RCPE terms and cycles scenarios as of March 2024RCPE terms and cycles scenarios as of March 2024
RCPE terms and cycles scenarios as of March 2024
 
Pests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPRPests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPR
 
Pests of tenai_Identification,Binomics_Dr.UPR
Pests of tenai_Identification,Binomics_Dr.UPRPests of tenai_Identification,Binomics_Dr.UPR
Pests of tenai_Identification,Binomics_Dr.UPR
 
KeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data scienceKeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data science
 
biosynthesis of the cell wall and antibiotics
biosynthesis of the cell wall and antibioticsbiosynthesis of the cell wall and antibiotics
biosynthesis of the cell wall and antibiotics
 
Basic Concepts in Pharmacology in molecular .pptx
Basic Concepts in Pharmacology in molecular  .pptxBasic Concepts in Pharmacology in molecular  .pptx
Basic Concepts in Pharmacology in molecular .pptx
 
Exploration Method’s in Archaeological Studies & Research
Exploration Method’s in Archaeological Studies & ResearchExploration Method’s in Archaeological Studies & Research
Exploration Method’s in Archaeological Studies & Research
 

Protein database

  • 1. Protein Database Unit-2nd Dr. Khalid Rehman Hakeem Department of Bioresources University of Kashmir
  • 2. Biores-111: Bio-informatics Unit I 1. What is Bioinformatics? Overview of Bioinformatics. 2. Basic concepts and applications of bioinformatics. 3. Bio-informatics related to proteins, amino acids, DNA and RNA 5. Sequence, structure and function. Unit II •Bioinformatics databases, introduction, type of databases: Nucleotide sequence databases; primary nucleotide sequence databases viz., EMBL; GeneBank ; DDBJ •Protein sequence databases viz., SwissProt/TrEMBL; PIR •Sequence motif databases: Pfam and. PROSITE •Protein structure databases: Protein Data Bank; SCOP; CATH •Other relevant databases e.g., KEGG. Unit III •Single sequence and pair-wise alignments •Scoring matrix: PAM and BLOSUM and Dot Plots •Heuristic methods: FASTA and BLAST •Statistics of sequence alignment score: E-Value; P-Value. •Multiple sequence alignments: ClustalW, PSI-Blast. Unit IV • Human Genome Project • Genbank SNPs, GOG, STSs, and ESTs data bases • Genomic Data Mining • Understanding of the principles of the microarray technique, limitations of microarrays and problems associated with the technique •Brief introduction to PERL language.
  • 3. Proteome Bioinformatics/Databases  1. Protein sequence databases - SWISS-PROT; TrEMBL 2. Nucleotide sequence databases - Hidden treasures: EST 3. Pattern and profile databases - PROSITE; BLOCKS; PRINTS; Pfam 4. 2D-PAGE databases - SWISS-2DPAGE;WORLD-2DPAGE 5. 3D structural databases - PDB; DSSP; HSSP
  • 4. Proteome Bioinformatics/Databases  6. Post-translational modification databases - O-GLYCBASE 7. Genomic databases - OMIM; GDB; MGD; FlyBase 8. Metabolic databases - ENZYME; KEGG; EMP/WT 9. Interfacing and Integrating Databases - EXExPASy; SWISS-PROT; Cyber–Encyclopaedia of the Proteome
  • 5. Protein Database  UniPro - protein knowledge database  Swiss 2DPAGE - 2D PAGE  Pfam - protein family and domain  Prosite - protein family and domain  SMART - protein module  BLOCK - protein conserved regions
  • 17. The Pfam database is one the most important collections of information in the world for classifying proteins. The database categorises 75 per cent of known proteins to form a library of protein families - a 'periodic table' of biology. The open access resource was established at the Wellcome Trust Sanger Institute in 1998. Its vision is to provide a tool which allows experimental, computational and evolutionary biologists to classify protein sequences and answer questions about what they do and how they have evolved. The Pfam project is led by Dr Alex Bateman at the Sanger Institute. Each entry in the Pfam database includes a protein sequence alignment as well as an accompanying statistical model, called a hidden Markov model Pfam :: Home
  • 18. Pfam :: Home  Pfam is a large collection of multiple sequence alignments and hidden Markov models (HMMs)covering many common protein families.  Pfam version 26.0 (November, 2011) contains alignments and models for 13672 protein families, based on the Swissprot and SP-TrEMBL protein sequence databases.
  • 19. HMM: A Hidden Markov Model  HMM: A Hidden Markov Model, or HMM, is a statistical model for any system that can be represented as a succession of transitions between discrete states.
  • 20. •Proteins are generally composed of one or more functional regions, commonly termed domains. Different combinations of domains give rise to the diverse range of proteins found in nature. The identification of domains that occur within proteins can therefore provide insights into their function. •There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A entries are high quality, manually curated families. Although these Pfam-A entries cover a large proportion of the sequences in the underlying sequence database, in order to give a more comprehensive coverage of known proteins we also generate a supplement using the ADDA (Automatic Domain Decomposition Algorithm ) database. These automatically generated entries are called Pfam-B. Although of lower quality, Pfam-B families can be useful for identifying functionally conserved regions when no Pfam-A entries are found. •Pfam also generates higher-level groupings of related families, known as clans. A clan is a collection of Pfam-A entries which are related by similarity of sequence, structure or
  • 25. Biotin Synthase Biotin synthase (BioB) converts dethiobiotin into biotin by inserting a sulfur atom between C6 and C9 of dethiobiotin inan S-adenosylmethionine (SAM)-dependent reaction.
  • 29. Reaction steps common in radical SAM enzymes
  • 30. PROSITE is a protein database. It consists of entries describing the protein families, domains and functional sites as well as amino acid patterns, signatures, and profiles in them, which are manually curated by a team of the Swiss Institute of Bioinformatics and tightly integrated into Swiss-Prot protein annotation. PROSITE was created in 1988 by Amos Bairoch, who directed the group for more than 20 years. Since July 2009 the director of the PROSITE, Swiss-Prot and Vital-IT groups is Ioannis Xenarios. PROSITE's uses include identifying possible functions of newly discovered proteins and analysis of known proteins for previously undetermined activity. Properties from well-studied genes can be propagated to biologically related organisms, and for different or poorly known genes biochemical functions can be predicted from similarities. PROSITE offers tools for protein sequence analysis and motif detection. It is part of the ExPASy proteomics analysis servers. PROSITE
  • 37.  Simple  Modular  Architecture  Research  Tool
  • 77. KEGG: Kyoto Encyclopedia of Genes and Genomes
  • 78. KEGG (Kyoto Encyclopedia of Genes and Genomes) is a collection of online databases dealing with genomes, enzymatic pathways, and biological chemicals. The PATHWAY database records networks of molecular interactions in the cells, and variants of them specific to particular organisms. As of July 2011, KEGG has switched to a subscription model and access via FTP is no longer free. The KEGG, the Kyoto Encyclopedia of Genes and Genomes, was initiated by the Japanese human genome programme in 1995. According to the developers they consider KEGG to be a "computer representation" of the biological system. The KEGG database can be utilized for modeling and simulation, browsing and retrieval of data. It is a part of the systems biology approach. KEGG maintains five main databases: KEGG Atlas KEGG Pathway KEGG Genes KEGG Ligand KEGG BRITE
  • 79. Purpose  Developed at the Kanehisa Laboratory  Integrates:  current knowledge of molecular interaction networks  information about genes and proteins  information about chemical compounds and reactions