STRUCTURAL
DATABASES
PDB, CSD, CATH
INTRODUCTION:
• Structural databases are essential tools for all crystallographic work.
• They are used when producing, solving, refining and publishing the
structure of a new material.
THE COMMON INFORMATION FOUND IN A
STRUCTURAL DATABASE INCLUDES:
• Bibliographic information: author names, journal reference.
• The chemical compound name, formula and oxidation states
of the elements present.
• Number of formula units per unit cell (cell contents).
• Dimensions and symmetry of the unit cell.
• Symmetry of the structure.
• Atomic coordinates, occupancies and thermal parameters.
• Any special features of the experiment used to collect the
diffraction data.
• The structures in the databases have been solved by X-ray,
neutron or electron diffraction on a sample, by NMR, or by
computational modelling.
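The fields listed above could be grouped into a single record along the following lines. This is only an illustrative Python sketch: the class and field names (`UnitCell`, `AtomSite`, `StructureEntry`) are hypothetical and do not follow any database's real schema, and the sample values are placeholders. The volume formula is the standard expression for a general (triclinic) unit cell.

```python
import math
from dataclasses import dataclass, field
from typing import List


@dataclass
class UnitCell:
    """Unit cell dimensions: edges in angstroms, angles in degrees."""
    a: float
    b: float
    c: float
    alpha: float = 90.0
    beta: float = 90.0
    gamma: float = 90.0

    def volume(self) -> float:
        # General (triclinic) cell volume; reduces to a*b*c for 90-degree angles.
        ca = math.cos(math.radians(self.alpha))
        cb = math.cos(math.radians(self.beta))
        cg = math.cos(math.radians(self.gamma))
        return self.a * self.b * self.c * math.sqrt(
            1 - ca * ca - cb * cb - cg * cg + 2 * ca * cb * cg
        )


@dataclass
class AtomSite:
    """One atom: fractional coordinates, occupancy and thermal parameter."""
    label: str
    x: float
    y: float
    z: float
    occupancy: float = 1.0
    thermal_parameter: float = 0.0


@dataclass
class StructureEntry:
    """Hypothetical container for the information listed on the slide."""
    compound_name: str
    formula: str
    authors: List[str]
    journal_reference: str
    formula_units_per_cell: int
    space_group: str
    cell: UnitCell
    atoms: List[AtomSite] = field(default_factory=list)


# Placeholder entry; every value here is invented for illustration only.
example = StructureEntry(
    compound_name="example compound",
    formula="AB",
    authors=["A. Author"],
    journal_reference="Placeholder J. (year) vol, page",
    formula_units_per_cell=4,
    space_group="P 1",
    cell=UnitCell(10.0, 10.0, 10.0),
    atoms=[AtomSite("A1", 0.0, 0.0, 0.0), AtomSite("B1", 0.5, 0.5, 0.5)],
)
cell_volume = example.cell.volume()
```

Keeping the cell as its own small class means derived quantities such as the volume can be computed without touching the bibliographic fields.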
PDB (PROTEIN DATA BANK):
• The Protein Data Bank contains information about the 3D structures of
proteins.
• The structural information for a protein is typically determined by
X-ray crystallography or nuclear magnetic resonance (NMR)
spectroscopy.
• The PDB is overseen by an organisation called the Worldwide
Protein Data Bank (wwPDB).
• It is available at
• www.wwpdb.org
• www.pdbe.org
• www.pdbj.org
• Each entry in the PDB is given a unique identifier called the
PDB ID. It is a 4-character code made up of alphanumeric
characters.
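A quick format check for such identifiers might look like the sketch below. It assumes the commonly described convention that the first character is a digit from 1 to 9 and the remaining three are letters or digits; that detail is not stated on the slide, and the function name `looks_like_pdb_id` is made up for this example.

```python
import re

# Assumed convention: one digit 1-9 followed by three letters or digits.
PDB_ID_PATTERN = re.compile(r"[1-9][A-Za-z0-9]{3}")


def looks_like_pdb_id(text: str) -> bool:
    """Return True if text has the shape of a 4-character PDB ID."""
    return PDB_ID_PATTERN.fullmatch(text) is not None


checks = {candidate: looks_like_pdb_id(candidate)
          for candidate in ["1CBS", "1cbs", "ABCD", "1CB", "1CBS2"]}
```

A check like this only confirms the shape of the string; it says nothing about whether an entry with that ID actually exists.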
PDB FILE FORMAT:
The PDB file format is a standard text format for protein
structure files. It describes how the atoms of the molecules are
arranged in the 3D structure of a protein.
• A file contains hundreds or thousands of lines called
records. Each record type provides a different kind of
information, for example:
• HEADER: This record contains the classification of the molecule,
the date of deposition and the PDB ID of the entry.
• TITLE: This record contains the title of the PDB entry.
• COMPND: This record describes the contents of the entry,
including the protein name.
• SOURCE: This record contains the name of the organism from
which the protein was obtained.
• KEYWDS: This record contains keywords that describe
the protein.
PDB FILE FORMAT:
• EXPDTA: This record contains the experimental method used to
determine the protein structure.
• AUTHOR: This record contains the names of the
contributors who deposited the data in the database.
• REVDAT: This record contains the revision history of the
entry (dates of modification).
• JRNL: This record contains the journal details of the
primary literature describing the structure.
• REMARK: This record contains remarks about the
protein structure.
• DBREF: This record contains cross-references to the protein
in the sequence databases.
PDB FILE FORMAT:
• SEQRES: This record contains the amino acid sequence
of the protein.
• HET: This record contains details about the non-protein
groups (heterogens) in the structure.
• HETNAM: This record contains the compound names of
the non-protein groups.
• HETSYN: This record contains synonyms for the compound
names of the non-protein groups.
• FORMUL: This record contains the chemical formulas of
the non-protein groups.
• HELIX: This record identifies the helical
substructures.
PDB FILE FORMAT:
• LINK: This record identifies inter-residue bonds.
• ATOM: This record contains the atomic coordinates for the
protein residues of the structure.
• HETATM: This record contains the atomic coordinates for
non-protein groups.
• CONECT: This record contains details about the bonds
involving non-protein atoms.
• MASTER: This record contains bookkeeping counts of selected
record types (REMARK, HET, HELIX, CONECT, SEQRES records, etc.).
• END: This record marks the end of the file.
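Because every record starts with its name in the first six columns, a file can be scanned line by line and dispatched on that name. The Python sketch below counts record types and reads one ATOM/HETATM line using the fixed column positions of the PDB format (serial in columns 7-11, atom name in 13-16, residue name in 18-20, chain in 22, residue number in 23-26, x/y/z in 31-54, occupancy in 55-60, temperature factor in 61-66). The function names and the sample values are invented for illustration; the sample lines are built by a helper so that the columns are guaranteed to line up.

```python
from collections import Counter


def record_name(line: str) -> str:
    """Return the record name stored in columns 1-6 of a PDB line."""
    return line[:6].strip()


def count_records(lines) -> Counter:
    """Count how many times each record type occurs."""
    return Counter(record_name(line) for line in lines if line.strip())


def parse_coordinate_record(line: str) -> dict:
    """Read an ATOM or HETATM line by slicing its fixed columns."""
    return {
        "record": line[0:6].strip(),
        "serial": int(line[6:11]),
        "name": line[12:16].strip(),
        "res_name": line[17:20].strip(),
        "chain_id": line[21],
        "res_seq": int(line[22:26]),
        "x": float(line[30:38]),
        "y": float(line[38:46]),
        "z": float(line[46:54]),
        "occupancy": float(line[54:60]),
        "temp_factor": float(line[60:66]),
    }


def make_coordinate_record(record, serial, name, res_name, chain_id,
                           res_seq, x, y, z, occupancy, temp_factor) -> str:
    """Helper for this example only: lay the fields out in PDB columns."""
    return ("{:<6s}{:5d} {:<4s} {:>3s} {:1s}{:4d}    "
            "{:8.3f}{:8.3f}{:8.3f}{:6.2f}{:6.2f}").format(
        record, serial, name, res_name, chain_id, res_seq,
        x, y, z, occupancy, temp_factor)


# Made-up sample lines; the coordinates are placeholders, not real data.
sample_lines = [
    "HEADER    EXAMPLE",
    make_coordinate_record("ATOM", 1, "N", "PRO", "A", 1,
                           16.979, 13.301, 44.555, 1.00, 30.05),
    make_coordinate_record("HETATM", 2, "C1", "REA", "A", 200,
                           21.972, 29.831, 16.739, 1.00, 15.25),
    "END",
]
counts = count_records(sample_lines)
first_atom = parse_coordinate_record(sample_lines[1])
ligand_atom = parse_coordinate_record(sample_lines[2])
```

Slicing by column rather than splitting on whitespace matters here, because neighbouring fields in a PDB line can touch with no space between them.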
THE PDB FORMAT
123456789+123456789+123456789+123456789+123456789+123456789+123456789+123456789+
HEADER    RETINOIC-ACID TRANSPORT                 28-SEP-94   1CBS      1CBS   2
COMPND    CELLULAR RETINOIC-ACID-BINDING PROTEIN TYPE II COMPLEXED      1CBS   3
COMPND   2 WITH ALL-TRANS-RETINOIC ACID (THE PRESUMED PHYSIOLOGICAL     1CBS   4
COMPND   3 LIGAND)                                                      1CBS   5
SOURCE    HUMAN (HOMO SAPIENS)                                          1CBS   6
SOURCE   2 EXPRESSION SYSTEM: (ESCHERICHIA COLI) BL21 (DE3)             1CBS   7
SOURCE   3 PLASMID: PET-3A                                              1CBS   8
SOURCE   4 GENE: HUMAN CRABP-II                                         1CBS   9
AUTHOR    G.J.KLEYWEGT,T.BERGFORS,T.A.JONES                             1CBS  10
REVDAT   1   26-JAN-95 1CBS    0                                        1CBS  11
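The ruler at the top of the listing is there because PDB records are read by column position. As a small sketch, the HEADER line of entry 1CBS can be split into its three fields using the column ranges of the PDB format (classification in columns 11-50, deposition date in 51-59, PDB ID in 63-66). The function names are hypothetical, and the line is rebuilt by a helper so the example does not depend on the spacing surviving copy and paste.

```python
def make_header(classification: str, deposition_date: str, pdb_id: str) -> str:
    """Helper for this example only: lay out a HEADER record in fixed columns."""
    return "HEADER    {:<40s}{:<9s}   {:<4s}".format(
        classification, deposition_date, pdb_id)


def parse_header(line: str) -> dict:
    """Split a HEADER record into classification, date and PDB ID."""
    if not line.startswith("HEADER"):
        raise ValueError("not a HEADER record")
    return {
        "classification": line[10:50].strip(),
        "deposition_date": line[50:59].strip(),
        "pdb_id": line[62:66].strip(),
    }


# Values taken from the HEADER line shown on the slide.
header_line = make_header("RETINOIC-ACID TRANSPORT", "28-SEP-94", "1CBS")
header = parse_header(header_line)
```

The date is left as text here; converting it to a date object would need a decision about how to expand the two-digit year.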
CATH:
• CATH stands for Class, Architecture, Topology and
Homologous superfamily; it is a classification database for proteins.
• It was created by Janet Thornton and colleagues at
University College London.
• It is available at
http://www.biochem.ucl.ac.uk/bsm/cath
• http://www.cathdb.info
• It is a protein classification tool.
IT CONSISTS OF FOUR LEVELS
• Class: It describes the secondary structure content of the
protein (alpha, beta, alpha/beta, etc.).
• Architecture: It describes the gross orientation of
secondary structures. It also gives information about the
folding of polypeptide chains.
• Topology: It deals with the structures formed by
different topological arrangements of secondary
structures. Each topology (fold group) contains one or more
superfamilies of proteins.
• Homologous superfamily: It compares the sequence
and structure of various proteins. It helps to trace the
evolutionary relationships among proteins.
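The four levels are commonly written as a dotted code of four numbers, one per level. That notation is not shown on the slides, so the Python sketch below rests on it as an assumption; the class names in `CLASS_NAMES` cover only the four classic classes, and the names `CathCode`, `parse_cath_code` and `level_prefix` are invented for this example.

```python
from typing import NamedTuple

# Assumed mapping for the four classic CATH classes.
CLASS_NAMES = {
    1: "Mainly Alpha",
    2: "Mainly Beta",
    3: "Alpha Beta",
    4: "Few Secondary Structures",
}

LEVELS = ("class", "architecture", "topology", "homologous_superfamily")


class CathCode(NamedTuple):
    class_number: int
    architecture: int
    topology: int
    homologous_superfamily: int


def parse_cath_code(code: str) -> CathCode:
    """Split a dotted code such as '3.40.50.720' into its four levels."""
    parts = code.split(".")
    if len(parts) != 4:
        raise ValueError("expected four dot-separated numbers")
    return CathCode(*(int(part) for part in parts))


def level_prefix(code: str, level: str) -> str:
    """Return the part of the code down to the given level."""
    depth = LEVELS.index(level) + 1
    return ".".join(code.split(".")[:depth])


example_code = parse_cath_code("3.40.50.720")
example_class = CLASS_NAMES.get(example_code.class_number, "unknown class")
fold_group = level_prefix("3.40.50.720", "topology")
```

Two domains whose codes share the same prefix down to the topology level fall in the same fold group, even if they sit in different homologous superfamilies.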
CATH
• CATH aims to provide official releases of its protein
structure classification every 12 months.
• It is a free, publicly available online resource.
• The latest version of CATH contains 114,215
domains, 2,178 homologous superfamilies and 1,110 fold
groups.
THE CATH SERVER
• CATH has recently set up a server which allows
users to submit the coordinates of a newly
determined structure for automatic classification in
CATH.
• DOMAIN BOUNDARIES AND SEQUENCE COMPARISON
• The server uses the DETECTIVE program, which is good at
identifying multidomain proteins.
• The results from DETECTIVE are returned to the user in
less than a minute.
• Identified domains are scanned against non-identical
representatives from CATH using a global sequence
alignment method.
CATH SERVER
• If the sequence match is 95%, the domain is identical
to one in CATH.
• If the sequence match is less than 30%, the structure
is compared with all the sequence families (S-level).
• ASSESSING STRUCTURAL SIMILARITY:
• TOPSCAN compares the secondary structures with those in each
fold family to identify the possible fold families to which
the new structure belongs.
• Subsequently, a fast version of the structure comparison
program SSAP scans representatives from all these families.
• Structural pairs with an SSAP score above 80 are
possible homologues, while pairs scoring 70-80 may share a
similar structure but have no sequence or functional similarity.
• Finally, the SSAP structural alignment is displayed using a
graphical display package.
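The thresholds quoted above can be read as a simple decision procedure. The Python sketch below encodes only what the slides state (95% and 30% sequence match, SSAP scores of 80 and 70-80); the function names and return strings are made up, whether each boundary is inclusive is an assumption, and the cases the slides do not describe are reported as unspecified rather than guessed.

```python
def sequence_step(match_percent: float) -> str:
    """Interpret the sequence match of a query domain against CATH."""
    if match_percent >= 95:  # assumed inclusive
        return "identical to a domain already in CATH"
    if match_percent < 30:
        return "compare structure with all sequence families (S-level)"
    return "unspecified on the slides"


def ssap_step(ssap_score: float) -> str:
    """Interpret an SSAP structure comparison score."""
    if ssap_score > 80:
        return "possible homologue"
    if 70 <= ssap_score <= 80:  # assumed inclusive at both ends
        return "similar structure, no sequence or functional similarity"
    return "unspecified on the slides"


# Hypothetical inputs, chosen only to exercise each branch.
outcomes = [sequence_step(97.0), sequence_step(22.5), ssap_step(85.0), ssap_step(74.0)]
```

Writing the rules this way makes the gaps visible: the slides give no outcome for a sequence match between 30% and 95%, or for an SSAP score below 70.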
CSD
• The Cambridge Structural Database (CSD) is both a repository
and a validated resource for 3D structural data of
molecules containing carbon and hydrogen.
• It is used to study the structures of organic,
metal-organic and organometallic molecules.
• The entries in the CSD are complementary to those in the
PDB and the Inorganic Crystal Structure Database.
• The data in the CSD are typically obtained by X-ray
crystallography and less frequently by neutron
diffraction.
CSD
• The data in the CSD are submitted by crystallographers and
chemists from all over the world.
• The CSD is maintained by an incorporated company called the
Cambridge Crystallographic Data Centre (CCDC).
• Structures deposited with the CCDC are publicly available for
download at the point of publication.
• The CSD is updated with about 50,000 new structures each
year, which are freely available to support teaching and other
activities.
• The CSD is available at
• www.ccdc.cam.ac.uk
• webcsd.ccdc.cam.ac.uk
STRUCTURAL DATABASE APPLICATIONS
• Prediction
• Analysis
• Mining
• Comparison
• Classification
• Structure refinement
• Database annotation
Structural databases

More Related Content

What's hot

Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbj
KAUSHAL SAHU
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
Hafiz Muhammad Zeeshan Raza
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
Ashwini
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
Alichy Sowmya
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Ramya S
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
Fasta
FastaFasta
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
Hafiz Muhammad Zeeshan Raza
 
BLAST
BLASTBLAST
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
geetikaJethra
 
Ddbj
DdbjDdbj
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
sagrika chugh
 
(Expasy)
(Expasy)(Expasy)
(Expasy)
Mazhar Khan
 
Biological database
Biological databaseBiological database
Biological database
Iqbal college Peringammala TVM
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
benazeer fathima
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
Yogesh Joshi
 
Blast
BlastBlast
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure prediction
Samvartika Majumdar
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
ammar kareem
 

What's hot (20)

Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbj
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Fasta
FastaFasta
Fasta
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
BLAST
BLASTBLAST
BLAST
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Ddbj
DdbjDdbj
Ddbj
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
(Expasy)
(Expasy)(Expasy)
(Expasy)
 
Biological database
Biological databaseBiological database
Biological database
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Blast
BlastBlast
Blast
 
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure prediction
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 

Similar to Structural databases

Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
Vidya Kalaivani Rajkumar
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
Silpa87
 
Bioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt BioinformaticsBioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt Bioinformatics
MohamedHasan816582
 
PDF文档.pdf
PDF文档.pdfPDF文档.pdf
PDF文档.pdf
SanaKhan250785
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
Razzaqe
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
Razzaqe
 
Bioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptBioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.ppt
NaglaaFathy42
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzc
AdiM27
 
Protein database
Protein databaseProtein database
Protein database
Rajpal Choudhary
 
Biological databases
Biological databases Biological databases
Biological databases
SEKHARREDDYAMBATI
 
Analisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.pptAnalisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.ppt
PurwaningtyasKusuman2
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiii
Muhammad Younis
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptors
RAJAN ROLTA
 
Introduction to pdb
Introduction to pdbIntroduction to pdb
Introduction to pdb
Makarand Bhale
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database bhargvi sharma
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Enfin, DAS and BioMart
Enfin, DAS and BioMartEnfin, DAS and BioMart
Enfin, DAS and BioMart
Rafael C. Jimenez
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
geraintduck
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
ShailendraSinghKhich
 

Similar to Structural databases (20)

Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
Bioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt BioinformaticsBioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt Bioinformatics
 
PDF文档.pdf
PDF文档.pdfPDF文档.pdf
PDF文档.pdf
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Bioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptBioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.ppt
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzc
 
Protein database
Protein databaseProtein database
Protein database
 
Biological databases
Biological databases Biological databases
Biological databases
 
Analisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.pptAnalisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.ppt
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiii
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptors
 
Introduction to pdb
Introduction to pdbIntroduction to pdb
Introduction to pdb
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Enfin, DAS and BioMart
Enfin, DAS and BioMartEnfin, DAS and BioMart
Enfin, DAS and BioMart
 
Intro to databases
Intro to databasesIntro to databases
Intro to databases
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 

More from Priyadharshana

Advantages of herbal medicine.
Advantages of herbal medicine.Advantages of herbal medicine.
Advantages of herbal medicine.
Priyadharshana
 
National research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedNational research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).edited
Priyadharshana
 
History of herbal medicine.
History of herbal medicine.History of herbal medicine.
History of herbal medicine.
Priyadharshana
 
Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.
Priyadharshana
 
Pickling
PicklingPickling
Pickling
Priyadharshana
 
Canning
CanningCanning
Chromatography
ChromatographyChromatography
Chromatography
Priyadharshana
 
Climate change causes, effects and prevention
Climate change  causes, effects and preventionClimate change  causes, effects and prevention
Climate change causes, effects and prevention
Priyadharshana
 
Radio immuno assay (priya)
Radio immuno assay (priya)Radio immuno assay (priya)
Radio immuno assay (priya)
Priyadharshana
 
Vaccines
Vaccines Vaccines
Vaccines
Priyadharshana
 
Anorexia
Anorexia Anorexia
Anorexia
Priyadharshana
 
Landslides
LandslidesLandslides
Landslides
Priyadharshana
 

More from Priyadharshana (12)

Advantages of herbal medicine.
Advantages of herbal medicine.Advantages of herbal medicine.
Advantages of herbal medicine.
 
National research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedNational research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).edited
 
History of herbal medicine.
History of herbal medicine.History of herbal medicine.
History of herbal medicine.
 
Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.
 
Pickling
PicklingPickling
Pickling
 
Canning
CanningCanning
Canning
 
Chromatography
ChromatographyChromatography
Chromatography
 
Climate change causes, effects and prevention
Climate change  causes, effects and preventionClimate change  causes, effects and prevention
Climate change causes, effects and prevention
 
Radio immuno assay (priya)
Radio immuno assay (priya)Radio immuno assay (priya)
Radio immuno assay (priya)
 
Vaccines
Vaccines Vaccines
Vaccines
 
Anorexia
Anorexia Anorexia
Anorexia
 
Landslides
LandslidesLandslides
Landslides
 

Recently uploaded

Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
muralinath2
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
Cherry
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 

Recently uploaded (20)

Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 

Structural databases

  • 2. INTRODUCTION: • Structural databases are the essential tools for all crystallographic works. • They are used in the process of producing, solving ,refining and publishing the structure of a new material.
  • 3. THE COMMON INFORMATION FOUND IN THE STRUCTURAL DATABASE INCLUDE: • Bibliographic information- author name, journal reference. • The chemical compound name, formula and oxidation states of the element present. • Number of formula units per unit cell(contents) • Dimension and symmetry of the unit cell. • symmetry of the structure. • Atomic coordinates, occupancies and thermal parameters. • Any special features of the experiment to collect the diffraction data. • The structures in the database have been solved using X-ray, neutron and electron diffraction techniques on sample, computational modelling or by using NMR.
  • 4. PDB:(PROTEIN DATABASES) • Protein database contains the information about 3D structures of the proteins. • The structural information of the protein can be determined by X-ray crystallography or Nuclear magnetic resonance(NMR) spectroscopy methods. • The PDB is overseen by an organisation called World Wide Protein Data Bank,wwPDB. • It is available at • www.wwpdb.org • www.pdbe.org • www.pdbj.org • Each entry in the PDB is provided with a unique identification number called PDB ID.It is a 4 letter identification number which consists of both alpha numeric characters.
  • 5.
  • 6.
  • 7. PDB FILE FORMAT: The PDB file format is the standard file format for protein structure file. It describes how molecules are held together in 3-D Structure of a protein. • The file contain hundreds or thousands of lines called records. Each record provides a different set of information like • HEADER: This reocord contains file name, date of submission and the PDB ID of the molecule. • TITLE: This record contains the title of the PDB entry. • COMPND: This record includes the protein name. • SOURCE: This record contains the name of the organism in which the particular protein is obtained. • KEYWDS: This record contains the keywords that describes about the protein.
  • 8. PDB FILE FORMAT: • EXPDTA: This record contains the method used for the protein structure experiment. • AUTHOR: This record contains the name of the contributors who put the data into the database. • REVDATA: This record contains the revision date of the data related to protein.(Date of modification) • JRNL: This record contains the journal details of the literature about the protein • REMARK: This record contains the remarks about the protein structure. • DBREF: This record contains the reference to the protein in the sequence databases.
  • 9. PDB FILE FORMAT: • SEQRES: This record contains information about the amino acid sequence of protein. • HET: This record contains details about the non protein substances in the protein. • HETNAM: This record contain the compound name of the non protein substances. • HETSYN: This record contains the identical compound name for the non protein substances. • FORMUL: This record contain the chemical formula of the non protein substances. • HELIX: This record holds the recognition of helical substructures.
  • 10. PDB FILE FORMAT: • LINK: This record holds the recognition of inter-residue bonds. • ATOM: This record contains the atomic coordinates for the structure. • HEATM: This record contains the atomic coordinate record for non protein substances. • CONECT: This record contains the details about the bonds involved in non protein atoms. • MASTER: This record contains the details about the number of REMARK records, HET records, HELIX records, CONECT records and SEQRES records, etc. • END: This record represent the end of the file. •
  • 12. THE PDB FORMAT • 123456789+123456789+123456789+123456789+123456789+123456789+123456789+123456789+ • HEADER RETINOIC-ACID TRANSPORT 28-SEP-94 1CBS 1CBS 2 • COMPND CELLULAR RETINOIC-ACID-BINDING PROTEIN TYPE II COMPLEXED 1CBS 3 • COMPND 2 WITH ALL-TRANS-RETINOIC ACID (THE PRESUMED PHYSIOLOGICAL 1CBS 4 • COMPND 3 LIGAND) 1CBS 5 • SOURCE HUMAN (HOMO SAPIENS) 1CBS 6 • SOURCE 2 EXPRESSION SYSTEM: (ESCHERICHIA COLI) BL21 (DE3) 1CBS 7 • SOURCE 3 PLASMID: PET-3A 1CBS 8 • SOURCE 4 GENE: HUMAN CRABP-II 1CBS 9 • AUTHOR G.J.KLEYWEGT,T.BERGFORS,T.A.JONES 1CBS 10 • REVDAT 1 26-JAN-95 1CBS 0 1CBS 11
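The column ruler on this slide hints that each field sits at a fixed position in the line. As a hedged sketch, the HEADER record of the 1CBS example can be rebuilt and read back by column. It assumes the HEADER layout is classification in columns 11-50, deposition date in columns 51-59 and ID code in columns 63-66. The original spacing was lost in this transcript, so the line is regenerated by a hypothetical helper (`make_header`) rather than copied, and the trailing "1CBS 2" style tag shown on the slide is left out.

```python
def make_header(classification, dep_date, id_code):
    """Rebuild a fixed-width HEADER line (assumed column layout, see lead-in)."""
    # Columns 1-10: record name plus padding; 11-50: classification;
    # 51-59: deposition date; 60-62: blank; 63-66: ID code.
    return f"{'HEADER':<10}{classification:<40}{dep_date:<9}   {id_code}"


def parse_header(line):
    """Slice a HEADER line into its three fields by column position."""
    if line[0:6] != "HEADER":
        raise ValueError("not a HEADER record")
    return {
        "classification": line[10:50].strip(),
        "deposition_date": line[50:59].strip(),
        "id_code": line[62:66].strip(),
    }


# Field values taken from the HEADER line shown on the slide.
header_line = make_header("RETINOIC-ACID TRANSPORT", "28-SEP-94", "1CBS")
header = parse_header(header_line)
print(header)
```

Slicing by column rather than splitting on whitespace matters here, because a classification such as RETINOIC-ACID TRANSPORT itself contains a space.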
  • 13. CATH: • CATH stands for Class, Architecture, Topology and Homologous superfamily; it is a hierarchical classification database for protein structures. • It was created by Janet Thornton and colleagues at University College London. • It is available at http://www.biochem.ucl.ac.uk/bsm/cath • http://www.cathdb.info • It is a protein classification tool.
  • 14. IT CONSISTS OF FOUR LEVELS • Class: Groups proteins by their secondary-structure content (mainly alpha, mainly beta, mixed alpha/beta, etc.). • Architecture: Describes the gross orientation of the secondary structures in 3-D space, independent of how they are connected. • Topology: Groups structures by the topological arrangement (order and connectivity) of their secondary structures, that is, by fold family. • Homologous superfamily: Groups proteins whose sequences and structures, when compared, indicate a common ancestor. It helps to trace the evolutionary relationships among proteins.
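The four levels nest inside one another, which can be illustrated with a short Python sketch. It assumes the dotted numeric identifier convention used by CATH (for example 3.40.50.300, one number per level), which is not shown on the slides. The class names in `CLASS_NAMES` follow the commonly used CATH class labels, and the type and function names (`CathNode`, `parse_cath_code`, `level_ids`) are hypothetical.

```python
from typing import NamedTuple

# Commonly used labels for the top-level CATH classes (assumption, see lead-in).
CLASS_NAMES = {
    1: "Mainly Alpha",
    2: "Mainly Beta",
    3: "Alpha Beta",
    4: "Few Secondary Structures",
}


class CathNode(NamedTuple):
    class_: int
    architecture: int
    topology: int
    superfamily: int


def parse_cath_code(code):
    """Split a dotted C.A.T.H identifier into its four numeric levels."""
    parts = code.split(".")
    if len(parts) != 4 or not all(p.isdigit() for p in parts):
        raise ValueError(f"expected four dot-separated numbers, got {code!r}")
    return CathNode(*(int(p) for p in parts))


def level_ids(node):
    """Return the identifier of each level; every level extends its parent."""
    c, a, t, h = node
    return {
        "Class": f"{c}",
        "Architecture": f"{c}.{a}",
        "Topology": f"{c}.{a}.{t}",
        "Homologous superfamily": f"{c}.{a}.{t}.{h}",
    }


node = parse_cath_code("3.40.50.300")  # illustrative identifier
print(CLASS_NAMES.get(node.class_, "unknown class"))
print(level_ids(node))
```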
  • 16. CATH • CATH aims to provide an official release of its protein structure classification every 12 months. • It is a free, publicly available online resource. • The latest version of CATH contains 114,215 domains, 2,178 homologous superfamilies and 1,110 fold groups.
  • 18. THE CATH SERVER • CATH has recently set up a server that allows users to submit the coordinates of a newly determined structure for automatic classification in CATH. • DOMAIN BOUNDARIES AND SEQUENCE COMPARISON • CATH uses a program called DETECTIVE, which is good at identifying multidomain proteins. • The results from DETECTIVE are returned to the user in less than a minute. • The identified domains are scanned against non-identical representatives from CATH using a global sequence alignment method.
  • 19. CATH SERVER • If the sequence match is 95% or higher, the domain is considered identical to one already in CATH. • If the sequence match is less than 30%, the structure is compared with all the sequence families (S-level). • ASSESSING STRUCTURAL SIMILARITY: • TOPSCAN compares the secondary structures with those of each fold family to identify the possible fold families to which the new structure belongs. • Subsequently, a fast version of the structure comparison program SSAP scans representatives from all the families. • Structural pairs with an SSAP score above 80 are possible homologues, while pairs scoring 70-80 are structurally similar but have no sequence or functional similarity. • Finally, the SSAP structural alignment is displayed using a graphical display package.
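The thresholds quoted on this slide can be written down as two small decision functions. This is only a sketch of the rules as stated here, not the server's actual implementation: the function names are hypothetical, "match 95%" is read as "95% or higher", and the cases the slide does not cover (sequence matches between 30% and 95%, SSAP scores below 70) are labelled as unspecified rather than guessed.

```python
def sequence_step(identity_percent):
    """Apply the sequence-match thresholds quoted on the slide."""
    if identity_percent >= 95.0:
        return "identical to a domain already in CATH"
    if identity_percent < 30.0:
        return "compare structure against all sequence families"
    # The slide gives no rule for matches between 30% and 95%.
    return "intermediate match: handling not described on the slide"


def interpret_ssap(score):
    """Apply the SSAP score ranges quoted on the slide."""
    if score > 80.0:
        return "possible homologue"
    if 70.0 <= score <= 80.0:
        return "similar fold without sequence or functional similarity"
    # The slide gives no interpretation for scores below 70.
    return "below the ranges given on the slide"


for identity in (97.0, 55.0, 12.0):
    print(identity, "->", sequence_step(identity))

for score in (85.0, 74.0, 60.0):
    print(score, "->", interpret_ssap(score))
```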
  • 22. CSD • The Cambridge Structural Database (CSD) is both a repository and a validated resource for 3-D structural data of molecules containing carbon and hydrogen. • It is used to study the structures of organic, metal-organic and organometallic molecules. • The entries in the CSD are complementary to those in the PDB and the Inorganic Crystal Structure Database. • The data in the CSD are typically obtained by X-ray crystallography and less frequently by neutron diffraction.
  • 23. CSD • The data in the CSD are submitted by crystallographers and chemists from all over the world. • The CSD is maintained by an incorporated company called the Cambridge Crystallographic Data Centre (CCDC). • The deposited structures are publicly available for download at the point of publication. • The CSD is updated with about 50,000 new structures each year, which are freely available to support teaching and other activities. • The CSD is available at • www.ccdc.cam.ac.uk • webcsd.ccdc.cam.ac.uk