SlideShare a Scribd company logo
1 of 42
Protein Data
Bank
Presented by
Alichy Sowmya
Shekinah Glory
Protein Data
Bank
Presentation deals with:
o What
o Why
o How
o Where
o Who
• The Protein Data Bank (PDB) is a database for the three-dimensional structural data
of large biological molecules, such as proteins and nucleic acids
• The data, typically obtained by X-ray crystallography, NMR spectroscopy, or,
increasingly, cryo-electron microscopy
• The data is freely accessible on the Internet via the websites of its member
organizations (PDBe, PDBj, RCSB, and BMRB)
• The PDB is overseen by an organization called the Worldwide Protein Data Bank,
wwPDB
What is PDB ?
Why did it start ?
Growing
crystallographic
data
Development of
BRAD in 1968
• In 1969, Dr Edgar Meyer began to write software to store atomic coordinates files in
a common format to make them available for geometric and graphical evaluation
(with sponsorship of Dr. Walton Hamilton at Bookhaven National laboratory
• In 1971, one of Dr. Meyer’s programs – SEARCH, enabled networking, that enabled
the researches to access information from database to study protein structures offline
• In 1973, upon Hamilton’s death, Dr. Tom Koetzle took over direction of PDB fo 20
years
How did it start ?
• In 1980s, IUCr guidelines established, number of structures deposited increases and
independent biological databases such as the NDB were established
• In Oct, 1998, PDB was transferred to Research Collaboratory for Structural
Bioinformatics (RCSB), complete transfer since 1999. Dr. Helen M Berman of
Rutgers University was the new director
• In 2003, with the formation of wwPDB, the PDB became an international
organization having three member organizations
• In 2006, the BMRB joined PDB
How did it start ?
Who runs it ?
The Worldwide PDB
(wwPDB) organization
manages the PDB archive and
ensures that the PDB is freely
and publicly available to the
global community
Protein Data Bank
in Europe
Protein Data Bank
Japan
Research Collaboratory for Structural
Bioinformatics Protein Data Bank
Biological Magnetic Resonance
Data Bank
Who runs it ?
Rich information about all PDB entries,
multiple search and browse facilities,
advanced services including PDBePISA,
PDBeFold and PDBeMotif, advanced
visualisation and validation of NMR and EM
structures, tools for bioinformaticians
Who runs it ?
Supports browsing in multiple languages
such as Japanese, Chinese, and Korean;
SeSAW identifies functionally or
evolutionarily conserved motifs by
locating and annotating sequence and
structural similarities, tools for
bioinformaticians, and more
Who runs it ?
Simple and advanced searching for
macromolecules and ligands, tabular
reports, specialized visualization tools,
sequence-structure comparisons,
RCSB PDB Mobile, Molecule of the
Month and other educational resources
at PDB-101, and more
Who runs it ?
Collects NMR data from any experiment and
captures assigned chemical shifts, coupling
constants, and peak lists for a variety of
macromolecules; contains derived annotations
such as hydrogen exchange rates, pKa values,
and relaxation parameters
wwPDB (https://www.wwpdb.org/ )
PDBe ( https://www.ebi.ac.uk/pdbe/ )
PDBj ( https://www.pdbj.org/ )
RCSB (https://www.rcsb.org/ )
BMRB ( http://www.bmrb.wisc.edu/ )
• The PDB is a repository of atomic coordinates and other information describing
proteins and other important biological macromolecules
• Structural biologists use methods such as X-ray crystallography, NMR spectroscopy,
and cryo-electron microscopy to determine the location of each atom relative to each
other in the molecule
• They then deposit this information, which is then annotated and publicly released into
the archive by the wwPDB
How data is collected?
• RCSB PDB website, allow you to search and explore the information under the PDB
header, including information on experimental methods and the chemistry and
biology of the protein
• Once you have found the PDB entries that you are interested in, you may
use visualization programs to allow you to read in the PDB file, display the protein
structure on your computer, download the information and create custom pictures of
it
• These programs also often include analysis tools that allow you to measure distances
and bond angles, and identify interesting structural features
How to retrieve the data ?
• One can search for their protein of interest by using the search bar in the RCSB PDB
website
• It allows one to search either by typing the PDB ID, name of the author (who has
deposited the structure), or the sequence of the protein or any particular ligand of
interest
How to search ?
• PDB ID, is the 4-character unique identifier of every entry in the Protein Data Bank
• A 4-character PDB ID is assigned to each new structure at the time of deposition
• The first character is a numeral in the range 1-9, while the last three characters can be
either numerals (in the range 0-9) or letters (in the range A-Z)
• If the PDB ID of an entry in the Protein Data Bank is known, it is the most direct way
to retrieve it from the database
• However, this can’t be used as an identifier for biomolecules, because several
structures of the same molecule in different enviroments or different conformations
are contained in PDB with different PDB IDs
PDB ID
• One or more PDB IDs can be typed or copied and pasted in the search box. Multiple
IDs can be separated by commas or white space, including line breaks.
• Example:
 Enter 4HHB into the text box next to "PDB ID(s)" and press "Submit Query". The Structure Summary page
for 4HHB will load
 Enter 2HHB, 3HHB, 4HHB into the text box and press "Submit Query". A Query Results Browser page with
a brief summary of the three structures will load. From there, clicking a PDB ID, thumbnail image, or
structure title will load the Structure Summary page for the respective ID
PDB ID
PDB ID
• The data in PDB is usually stored in 3 different file formats
 PDB file format
 mmCIF format
 PDBML
File formats
• mmCIF is the acronym for the macromolecular Crystallographic Information File
• mmCIF is based on a subset of the syntax rules for the Self Defining Text Archive
(STAR) file
• A Dictionary Description Language (DDL) defines the structure of mmCIF
dictionaries
• Dictionaries provide the metadata which define the content of mmCIF data files
• mmCIF data files, dictionaries and DDLs all are expressed in a common syntax
mmCIF
• The Protein Data Bank Markup Language (PDBML) provides a representation of
PDB data in XML format
• The description of this format is provided in XML schema of the PDB Exchange
Data Dictionary
• This schema is produced by direct translation of the PDBx/mmCIF Exchange Data
Dictionary Other data dictionaries used by the PDB have been electronically
translated into XML/XSD schemas
PDBML
• The Protein Data Bank Markup Language (PDBML) provides a representation of
PDB data in XML format
• The description of this format is provided in XML schema of the PDB Exchange
Data Dictionary
• This schema is produced by direct translation of the PDBx/mmCIF Exchange Data
Dictionary Other data dictionaries used by the PDB have been electronically
translated into XML/XSD schemas
PDB file format
How to read PDB file ?
• Sections of an Entry
The following table lists the various sections of a PDB coordinate entry and the
records comprising them:
How to read PDB file ?
• Types of Records
It is possible to group records into categories based upon how often the record type
appears in an entry.
 Single:
There are records that may only appear one time (without continuations) in
a file. It is an error for a duplicate of any of these records to appear in an
entry.
 Once in an entry but exceed the number of columns available:
There are records that conceptually exist only once in an entry, but the
information content may exceed the number of columns available. These
records are therefore continued on subsequent lines.
How to read PDB file ?
• Types of Records
 Multiple:
Most record types appear multiple times, often in groups where the
information is not logically concatenated but is presented in the form of a list.
Many of these record types have a custom serialization that may be used not
only to order the records, but also to connect to other record types.
 Multiple in an entry but exceed the number of columns available:
These records are therefore continued on subsequent lines. The second and
subsequent lines contain a continuation field which is a right-justified integer.
This number increments by one for each additional line of the record, and is
followed by a blank character.
How to read PDB file ?
• Types of Records
 Grouping:
There are three record types used to group other records.
 Other:
The remaining record types have a detailed inner structure.
How to read PDB file ?
• Types of Records
 Single:
How to read PDB file ?
• Types of Records
 Once in an entry but exceed the number of columns available :
How to read PDB file ?
• Types of Records
 Multiple :
How to read PDB file ?
• Types of Records
 Multiple in an entry but exceed the number of columns available :
How to read PDB file ?
• Types of Records
 Grouping :
How to read PDB file ?
• Types of Records
 Other :
JRNL - Literature citation that defines the coordinate set
REMARK - General remarks, some are structured and some are
free form
How to read PDB file ?
• Order of Records:
All records in a PDB coordinate entry must appear in a defined order. Mandatory
record types are present in all entries. When mandatory data are not provided, the
record name must appear in the entry with a NULL indicator. Optional items become
mandatory when certain conditions exist.
How to read PDB file ?
• Order of Records:
How to read PDB file ?
• Order of Records:
How to read PDB file ?
• Order of Records:
Want to learn further ?
PDB-101 is an online portal for teachers, students, and the general public to
promote exploration in the world of proteins and nucleic acids. Learning
about the diverse shapes and functions of these biological macromolecules
helps to understand all aspects of biomedicine and agriculture, from protein
synthesis to health and disease to biological energy.
( http://pdb101.rcsb.org/ )
Protein data bank

More Related Content

What's hot

What's hot (20)

Fasta
FastaFasta
Fasta
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Prosite
PrositeProsite
Prosite
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
protein data bank
protein data bankprotein data bank
protein data bank
 
Composite and Specialized databases
Composite and Specialized databasesComposite and Specialized databases
Composite and Specialized databases
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbj
 

Similar to Protein data bank

April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
National Information Standards Organization (NISO)
 

Similar to Protein data bank (20)

Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Protein sequence databases
Protein sequence databasesProtein sequence databases
Protein sequence databases
 
Protein Data Bank
Protein Data BankProtein Data Bank
Protein Data Bank
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Pharmacoinformatics Database basics(sree)
Pharmacoinformatics Database basics(sree)Pharmacoinformatics Database basics(sree)
Pharmacoinformatics Database basics(sree)
 
Biological databases
Biological databasesBiological databases
Biological databases
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 
Data Base in Bioinformatics.ppt
Data Base in Bioinformatics.pptData Base in Bioinformatics.ppt
Data Base in Bioinformatics.ppt
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
 
Genomic Databases-.pptx
Genomic Databases-.pptxGenomic Databases-.pptx
Genomic Databases-.pptx
 

More from Alichy Sowmya

PHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY EVALUATION OF DECALEPIS HAMILTONII
PHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY  EVALUATION OF DECALEPIS HAMILTONIIPHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY  EVALUATION OF DECALEPIS HAMILTONII
PHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY EVALUATION OF DECALEPIS HAMILTONII
Alichy Sowmya
 

More from Alichy Sowmya (12)

Plant tissue culture
Plant tissue culturePlant tissue culture
Plant tissue culture
 
Probability distribution in R
Probability distribution in RProbability distribution in R
Probability distribution in R
 
Regression analysis in R
Regression analysis in RRegression analysis in R
Regression analysis in R
 
Chemistry development kit
Chemistry development kitChemistry development kit
Chemistry development kit
 
Validation of homology modeling
Validation of homology modelingValidation of homology modeling
Validation of homology modeling
 
Big data in metabolism
Big data in metabolismBig data in metabolism
Big data in metabolism
 
PHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY EVALUATION OF DECALEPIS HAMILTONII
PHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY  EVALUATION OF DECALEPIS HAMILTONIIPHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY  EVALUATION OF DECALEPIS HAMILTONII
PHARMACOGNOSTICAL AND BIOLOGICAL ACTIVITY EVALUATION OF DECALEPIS HAMILTONII
 
SciFinder and its utility in Drug discovery
SciFinder and its utility in Drug discoverySciFinder and its utility in Drug discovery
SciFinder and its utility in Drug discovery
 
Prescription filling record
Prescription filling recordPrescription filling record
Prescription filling record
 
Information science
Information scienceInformation science
Information science
 
Limitations of in silico drug discovery methods
Limitations of in silico drug discovery methodsLimitations of in silico drug discovery methods
Limitations of in silico drug discovery methods
 
Crimean Congo Hemorrhagic fever
Crimean Congo Hemorrhagic feverCrimean Congo Hemorrhagic fever
Crimean Congo Hemorrhagic fever
 

Recently uploaded

Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Dipal Arora
 
Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...
Dipal Arora
 

Recently uploaded (20)

(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
 
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
 
Call Girls Gwalior Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Gwalior Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Bhubaneswar Just Call 9907093804 Top Class Call Girl Service Avail...
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
 

Protein data bank

  • 2. Protein Data Bank Presentation deals with: o What o Why o How o Where o Who
  • 3. • The Protein Data Bank (PDB) is a database for the three-dimensional structural data of large biological molecules, such as proteins and nucleic acids • The data, typically obtained by X-ray crystallography, NMR spectroscopy, or, increasingly, cryo-electron microscopy • The data is freely accessible on the Internet via the websites of its member organizations (PDBe, PDBj, RCSB, and BMRB) • The PDB is overseen by an organization called the Worldwide Protein Data Bank, wwPDB What is PDB ?
  • 4. Why did it start ? Growing crystallographic data Development of BRAD in 1968
  • 5. • In 1969, Dr Edgar Meyer began to write software to store atomic coordinates files in a common format to make them available for geometric and graphical evaluation (with sponsorship of Dr. Walton Hamilton at Bookhaven National laboratory • In 1971, one of Dr. Meyer’s programs – SEARCH, enabled networking, that enabled the researches to access information from database to study protein structures offline • In 1973, upon Hamilton’s death, Dr. Tom Koetzle took over direction of PDB fo 20 years How did it start ?
  • 6. • In 1980s, IUCr guidelines established, number of structures deposited increases and independent biological databases such as the NDB were established • In Oct, 1998, PDB was transferred to Research Collaboratory for Structural Bioinformatics (RCSB), complete transfer since 1999. Dr. Helen M Berman of Rutgers University was the new director • In 2003, with the formation of wwPDB, the PDB became an international organization having three member organizations • In 2006, the BMRB joined PDB How did it start ?
  • 7. Who runs it ? The Worldwide PDB (wwPDB) organization manages the PDB archive and ensures that the PDB is freely and publicly available to the global community Protein Data Bank in Europe Protein Data Bank Japan Research Collaboratory for Structural Bioinformatics Protein Data Bank Biological Magnetic Resonance Data Bank
  • 8. Who runs it ? Rich information about all PDB entries, multiple search and browse facilities, advanced services including PDBePISA, PDBeFold and PDBeMotif, advanced visualisation and validation of NMR and EM structures, tools for bioinformaticians
  • 9. Who runs it ? Supports browsing in multiple languages such as Japanese, Chinese, and Korean; SeSAW identifies functionally or evolutionarily conserved motifs by locating and annotating sequence and structural similarities, tools for bioinformaticians, and more
  • 10. Who runs it ? Simple and advanced searching for macromolecules and ligands, tabular reports, specialized visualization tools, sequence-structure comparisons, RCSB PDB Mobile, Molecule of the Month and other educational resources at PDB-101, and more
  • 11. Who runs it ? Collects NMR data from any experiment and captures assigned chemical shifts, coupling constants, and peak lists for a variety of macromolecules; contains derived annotations such as hydrogen exchange rates, pKa values, and relaxation parameters
  • 17. • The PDB is a repository of atomic coordinates and other information describing proteins and other important biological macromolecules • Structural biologists use methods such as X-ray crystallography, NMR spectroscopy, and cryo-electron microscopy to determine the location of each atom relative to each other in the molecule • They then deposit this information, which is then annotated and publicly released into the archive by the wwPDB How data is collected?
  • 18. • RCSB PDB website, allow you to search and explore the information under the PDB header, including information on experimental methods and the chemistry and biology of the protein • Once you have found the PDB entries that you are interested in, you may use visualization programs to allow you to read in the PDB file, display the protein structure on your computer, download the information and create custom pictures of it • These programs also often include analysis tools that allow you to measure distances and bond angles, and identify interesting structural features How to retrieve the data ?
  • 19. • One can search for their protein of interest by using the search bar in the RCSB PDB website • It allows one to search either by typing the PDB ID, name of the author (who has deposited the structure), or the sequence of the protein or any particular ligand of interest How to search ?
  • 20. • PDB ID, is the 4-character unique identifier of every entry in the Protein Data Bank • A 4-character PDB ID is assigned to each new structure at the time of deposition • The first character is a numeral in the range 1-9, while the last three characters can be either numerals (in the range 0-9) or letters (in the range A-Z) • If the PDB ID of an entry in the Protein Data Bank is known, it is the most direct way to retrieve it from the database • However, this can’t be used as an identifier for biomolecules, because several structures of the same molecule in different enviroments or different conformations are contained in PDB with different PDB IDs PDB ID
  • 21. • One or more PDB IDs can be typed or copied and pasted in the search box. Multiple IDs can be separated by commas or white space, including line breaks. • Example:  Enter 4HHB into the text box next to "PDB ID(s)" and press "Submit Query". The Structure Summary page for 4HHB will load  Enter 2HHB, 3HHB, 4HHB into the text box and press "Submit Query". A Query Results Browser page with a brief summary of the three structures will load. From there, clicking a PDB ID, thumbnail image, or structure title will load the Structure Summary page for the respective ID PDB ID
  • 23. • The data in PDB is usually stored in 3 different file formats  PDB file format  mmCIF format  PDBML File formats
  • 24. • mmCIF is the acronym for the macromolecular Crystallographic Information File • mmCIF is based on a subset of the syntax rules for the Self Defining Text Archive (STAR) file • A Dictionary Description Language (DDL) defines the structure of mmCIF dictionaries • Dictionaries provide the metadata which define the content of mmCIF data files • mmCIF data files, dictionaries and DDLs all are expressed in a common syntax mmCIF
  • 25. • The Protein Data Bank Markup Language (PDBML) provides a representation of PDB data in XML format • The description of this format is provided in XML schema of the PDB Exchange Data Dictionary • This schema is produced by direct translation of the PDBx/mmCIF Exchange Data Dictionary Other data dictionaries used by the PDB have been electronically translated into XML/XSD schemas PDBML
  • 26. • The Protein Data Bank Markup Language (PDBML) provides a representation of PDB data in XML format • The description of this format is provided in XML schema of the PDB Exchange Data Dictionary • This schema is produced by direct translation of the PDBx/mmCIF Exchange Data Dictionary Other data dictionaries used by the PDB have been electronically translated into XML/XSD schemas PDB file format
  • 27. How to read PDB file ? • Sections of an Entry The following table lists the various sections of a PDB coordinate entry and the records comprising them:
  • 28. How to read PDB file ? • Types of Records It is possible to group records into categories based upon how often the record type appears in an entry.  Single: There are records that may only appear one time (without continuations) in a file. It is an error for a duplicate of any of these records to appear in an entry.  Once in an entry but exceed the number of columns available: There are records that conceptually exist only once in an entry, but the information content may exceed the number of columns available. These records are therefore continued on subsequent lines.
  • 29. How to read PDB file ? • Types of Records  Multiple: Most record types appear multiple times, often in groups where the information is not logically concatenated but is presented in the form of a list. Many of these record types have a custom serialization that may be used not only to order the records, but also to connect to other record types.  Multiple in an entry but exceed the number of columns available: These records are therefore continued on subsequent lines. The second and subsequent lines contain a continuation field which is a right-justified integer. This number increments by one for each additional line of the record, and is followed by a blank character.
  • 30. How to read PDB file ? • Types of Records  Grouping: There are three record types used to group other records.  Other: The remaining record types have a detailed inner structure.
  • 31. How to read PDB file ? • Types of Records  Single:
  • 32. How to read PDB file ? • Types of Records  Once in an entry but exceed the number of columns available :
  • 33. How to read PDB file ? • Types of Records  Multiple :
  • 34. How to read PDB file ? • Types of Records  Multiple in an entry but exceed the number of columns available :
  • 35. How to read PDB file ? • Types of Records  Grouping :
  • 36. How to read PDB file ? • Types of Records  Other : JRNL - Literature citation that defines the coordinate set REMARK - General remarks, some are structured and some are free form
  • 37. How to read PDB file ? • Order of Records: All records in a PDB coordinate entry must appear in a defined order. Mandatory record types are present in all entries. When mandatory data are not provided, the record name must appear in the entry with a NULL indicator. Optional items become mandatory when certain conditions exist.
  • 38. How to read PDB file ? • Order of Records:
  • 39. How to read PDB file ? • Order of Records:
  • 40. How to read PDB file ? • Order of Records:
  • 41. Want to learn further ? PDB-101 is an online portal for teachers, students, and the general public to promote exploration in the world of proteins and nucleic acids. Learning about the diverse shapes and functions of these biological macromolecules helps to understand all aspects of biomedicine and agriculture, from protein synthesis to health and disease to biological energy. ( http://pdb101.rcsb.org/ )

Editor's Notes

  1. BRAD – Brookhaven Raster Display