SlideShare a Scribd company logo
1 of 32
Introduction to Databases
INTRODUCTION
DATA
Data is raw, unorganized facts that need to be processed.
Example:- Each student's test score is one piece of data.
INFORMATION
When data is processed, organized, structured or presented in a given context
so as to make it useful, it is called information.
Example:- score of a class or of the average entire school is information that
can be derived from the given data.
Database
 A database is a collection of data in an organized
manner, which is accessible in various ways.
 Biological Databases serve a critical purpose in the
collection and organization of data related to biological
systems.
 They provide a computational support and a user-friendly
interface to a researcher for a meaningful analysis of
biological data.
 A database is a computerized archive used to store and
organize data in such a way that information can be
retrieved easily via a variety of search criteria.
 Databases are composed of computer hardware and software
for data management.
 The chief objective of the development of a database is to
organize data in a set of structured records to enable easy
retrieval of information.
 Each record, also called an entry, should contain a number
of fields that hold the actual data items, for example, fields
for names, phone numbers, addresses, dates.
WHAT ARE THE BIOLOGICAL
DATABASES ???
Different classifications of
databases
 Type of data
 nucleotide sequences
 protein sequences
 proteins sequence patterns or motifs
 macromolecular 3D structure
 gene expression data
 metabolic pathways
Different classifications of
databases….
 Primary or derived databases
 Primary databases: experimental results directly
into database
 Secondary databases: results of analysis of
primary databases
 Aggregate of many databases
 Links to other data items
 Combination of data
 Consolidation of data
Different classifications of
databases….
 Availability
 Publicly available, no restrictions
 Available, but with copyright
 Accessible, but not downloadable
 Academic, but not freely available
 Proprietary, commercial; possibly free for
academics
TYPES OF DATABASES
 Primary Databases
 Secondary Databases
PRIMARY DATABASES
Contains bio-molecular data in its original form.
Experimental results are submitted directly into the database by
researchers, and the data are essentially archival in nature.
Once given a database accession number, the data in primary
databases are never changed.
Examples :- GenBank, EMBL and DDBJ for DNA/RNA sequences,
SWISS-PROT and PIR for protein sequences and PDB for molecular
structures.
GenBank
• Database from NCBI, includes sequences from
publicly available resources.
http://www.ncbi.nlm.nih.gov
/genbank/
15
NCBI and Entrez
 One of the largest and most comprehensive
databases belonging to the NIH – national institute
of health (USA)
 Entrez is the search engine of NCBI
 Search for :
genes, proteins, genomes, structures, diseases,
publications and more.
 http://www.ncbi.nlm.nih.gov/
Genbank
 An annotated collection of all publicly
available nucleotide and proteins
 Set up in 1979 at the LANL (Los Alamos).
 Maintained since 1992 NCBI (Bethesda).
GenBank file format
GenBank file format
EMBL
European Molecular Biological Laboratory
Nucleic acid database from EBI
(European Bioinformatics Institute)
Produced in collaboration with DDBJ and GenBank
Search engine – SRS (Sequence Retrieval System)
http://www.ebi.ac.uk
/
DDBJ
DNA Databank of Japan
Started in 1986 in collaboration with GenBank
Produced and maintained at NIG
(National Institute of Genetics)
http://www.ddbj.nig.ac.jp/
SWISS PROT http://www.ebi.ac.uk/uniprot/
…...
 Annotated sequence database established
in 1986
 Consists of sequence entries of different
lie formats
 Similar format to EMBL
 http://us.expasy.org/sprot/sprot-top.html
PIR
• Protein Information Resource
•A division of National Biomedical Research
•Foundation (NBRF) in U.S.
•One can search for entries or do sequence
similarity search at PIR site.
http://pir.georgetown.edu
/
TrEMBL
Translated European Molecular Biology Laboratory
Computer annotated supplement of SWISS PROT.
Contains all the translations of EMBL nucleotide
sequence entries not yet integrated in SWISS PROT.
http://www.ebi.ac.uk/trembl/
Protein DataBank (PDB)
 Important in solving real problems in molecular
biology
 Protein Databank
 PDB Established in 1972 at Brookhaven National
Laboratory (BNL)
 Sole international repository of macromolecular
structure data
 Moved to Research Collaboratory
for Structural Bioinformatics
http://www.rcsb.org/
PDB: example
HEADER LYASE(OXO-ACID) 01-OCT-91 12CA 12CA 2
COMPND CARBONIC ANHYDRASE /II (CARBONATE DEHYDRATASE) (/HCA II) 12CA 3
SOURCE HUMAN (HOMO SAPIENS) RECOMBINANT PROTEIN 12CA 5
AUTHOR S.K.NAIR,D.W.CHRISTIANSON 12CA 6
REVDAT 1 15-OCT-92 12CA 0 12CA 7
JRNL AUTH S.K.NAIR,T.L.CALDERONE,D.W.CHRISTIANSON,C.A.FIERKE 12CA 8
JRNL TITL ALTERING THE MOUTH OF A HYDROPHOBIC POCKET. 12CA 9
JRNL TITL 2 STRUCTURE AND KINETICS OF HUMAN CARBONIC ANHYDRASE 12CA 10
JRNL TITL 3 /II$ MUTANTS AT RESIDUE VAL-121 12CA 11
JRNL REF J.BIOL.CHEM. V. 266 17320 1991 12CA 12
JRNL REFN ASTM JBCHA3 US ISSN 0021-9258 071 12CA 13
REMARK 1 12CA 14EMARK 3 AUTHORS
HENDRICKSON,KONNERT 12CA 20
REMARK 3 R VALUE 0.170 12CA 21
REMARK 3 RMSD BOND DISTANCES 0.011 ANGSTROMS 12CA 22
REMARK 3 RMSD BOND ANGLES 1.3 DEGREES 12CA 23
REMARK 4 12CA 24
REMARK 4 N-TERMINAL RESIDUES SER 2, HIS 3, HIS 4 AND C-TERMINAL 12CA 25
REMARK 4 RESIDUE LYS 260 WERE NOT LOCATED IN THE DENSITY MAPS AND, 12CA 26
REMARK 4 THEREFORE, NO COORDINATES ARE INCLUDED FOR THESE RESIDUES. 12CA 27
………
COMPOSITE DATABASES
Collection of various primary database sequences
Renders sequence searching highly efficient as it searches
multiple resources
Examples :- NRDB (Non Redundant Database), OWL,
MIPSX, SWISS PROT + TrEMBL
SECONDARY DATABASES
Contains data derived from the results of analysing
primary data
Manually created or automatically generated
Contains more relevant and useful information
structured to specific requirements
Example :- PROSITE, PRINTS, BLOCKS, Pfam
PROSITE
Families of proteins
Can search using regular
expressions
Similar to unix commands
Families exhibit these patterns
So we can search over families
http://ca.expasy.org/
prosite/
BLOCKS
 Motifs/blocks are
created by
automatically
detecting the
most conserved
regions of each
protein family.
PRIMARY VS SECONDARY DATABASES

More Related Content

Similar to Databases.ppt

Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary databaseKAUSHAL SAHU
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanksNithyaNandapal
 
Nucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptxNucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptxkarmandeepkaur7
 
Primary Databases.pptx
Primary Databases.pptxPrimary Databases.pptx
Primary Databases.pptxSwarup Malakar
 
Databases in Bioinformatics
Databases in BioinformaticsDatabases in Bioinformatics
Databases in BioinformaticsMeghaj Mallick
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptxvijayapraba1
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database SystemsDenis Shestakov
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databasesPranavathiyani G
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBioinformaticsCentre
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsRaj Varun
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)R.P MAURYA
 

Similar to Databases.ppt (20)

Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanks
 
Introduction to Biological databases
Introduction to Biological databasesIntroduction to Biological databases
Introduction to Biological databases
 
Nucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptxNucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptx
 
Primary Databases.pptx
Primary Databases.pptxPrimary Databases.pptx
Primary Databases.pptx
 
Databases in Bioinformatics
Databases in BioinformaticsDatabases in Bioinformatics
Databases in Bioinformatics
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database Systems
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
 

More from BlackHunt1

Plant breeding - The past, the present and the future.pptx
Plant breeding - The past, the present and the future.pptxPlant breeding - The past, the present and the future.pptx
Plant breeding - The past, the present and the future.pptxBlackHunt1
 
topic_14_-_genetic_technology.ppt
topic_14_-_genetic_technology.ppttopic_14_-_genetic_technology.ppt
topic_14_-_genetic_technology.pptBlackHunt1
 
Pierce5e_ch21_lecturePPT.ppt
Pierce5e_ch21_lecturePPT.pptPierce5e_ch21_lecturePPT.ppt
Pierce5e_ch21_lecturePPT.pptBlackHunt1
 
Lezione 17- Epigenetics.ppt
Lezione 17- Epigenetics.pptLezione 17- Epigenetics.ppt
Lezione 17- Epigenetics.pptBlackHunt1
 
4_4_lambda_decisions.ppt
4_4_lambda_decisions.ppt4_4_lambda_decisions.ppt
4_4_lambda_decisions.pptBlackHunt1
 
DNA replication_BTL.pptx
DNA replication_BTL.pptxDNA replication_BTL.pptx
DNA replication_BTL.pptxBlackHunt1
 
Gene_Expression.pptx
Gene_Expression.pptxGene_Expression.pptx
Gene_Expression.pptxBlackHunt1
 
_chapter 3.ppt_.ppt
_chapter 3.ppt_.ppt_chapter 3.ppt_.ppt
_chapter 3.ppt_.pptBlackHunt1
 
Bioinformatics&Databases.ppt
Bioinformatics&Databases.pptBioinformatics&Databases.ppt
Bioinformatics&Databases.pptBlackHunt1
 
Presentation A - Using Restriction Enzymes.pptx
Presentation A - Using Restriction Enzymes.pptxPresentation A - Using Restriction Enzymes.pptx
Presentation A - Using Restriction Enzymes.pptxBlackHunt1
 
Recombinant-DNA-Technology.pdf
Recombinant-DNA-Technology.pdfRecombinant-DNA-Technology.pdf
Recombinant-DNA-Technology.pdfBlackHunt1
 

More from BlackHunt1 (13)

Plant breeding - The past, the present and the future.pptx
Plant breeding - The past, the present and the future.pptxPlant breeding - The past, the present and the future.pptx
Plant breeding - The past, the present and the future.pptx
 
topic_14_-_genetic_technology.ppt
topic_14_-_genetic_technology.ppttopic_14_-_genetic_technology.ppt
topic_14_-_genetic_technology.ppt
 
Pierce5e_ch21_lecturePPT.ppt
Pierce5e_ch21_lecturePPT.pptPierce5e_ch21_lecturePPT.ppt
Pierce5e_ch21_lecturePPT.ppt
 
Lezione 17- Epigenetics.ppt
Lezione 17- Epigenetics.pptLezione 17- Epigenetics.ppt
Lezione 17- Epigenetics.ppt
 
slides1.ppt
slides1.pptslides1.ppt
slides1.ppt
 
45931.ppt
45931.ppt45931.ppt
45931.ppt
 
4_4_lambda_decisions.ppt
4_4_lambda_decisions.ppt4_4_lambda_decisions.ppt
4_4_lambda_decisions.ppt
 
DNA replication_BTL.pptx
DNA replication_BTL.pptxDNA replication_BTL.pptx
DNA replication_BTL.pptx
 
Gene_Expression.pptx
Gene_Expression.pptxGene_Expression.pptx
Gene_Expression.pptx
 
_chapter 3.ppt_.ppt
_chapter 3.ppt_.ppt_chapter 3.ppt_.ppt
_chapter 3.ppt_.ppt
 
Bioinformatics&Databases.ppt
Bioinformatics&Databases.pptBioinformatics&Databases.ppt
Bioinformatics&Databases.ppt
 
Presentation A - Using Restriction Enzymes.pptx
Presentation A - Using Restriction Enzymes.pptxPresentation A - Using Restriction Enzymes.pptx
Presentation A - Using Restriction Enzymes.pptx
 
Recombinant-DNA-Technology.pdf
Recombinant-DNA-Technology.pdfRecombinant-DNA-Technology.pdf
Recombinant-DNA-Technology.pdf
 

Recently uploaded

KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 

Recently uploaded (20)

KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

Databases.ppt

  • 3. DATA Data is raw, unorganized facts that need to be processed. Example:- Each student's test score is one piece of data. INFORMATION When data is processed, organized, structured or presented in a given context so as to make it useful, it is called information. Example:- score of a class or of the average entire school is information that can be derived from the given data.
  • 4. Database  A database is a collection of data in an organized manner, which is accessible in various ways.  Biological Databases serve a critical purpose in the collection and organization of data related to biological systems.  They provide a computational support and a user-friendly interface to a researcher for a meaningful analysis of biological data.
  • 5.  A database is a computerized archive used to store and organize data in such a way that information can be retrieved easily via a variety of search criteria.  Databases are composed of computer hardware and software for data management.  The chief objective of the development of a database is to organize data in a set of structured records to enable easy retrieval of information.  Each record, also called an entry, should contain a number of fields that hold the actual data items, for example, fields for names, phone numbers, addresses, dates.
  • 6. WHAT ARE THE BIOLOGICAL DATABASES ???
  • 7.
  • 8. Different classifications of databases  Type of data  nucleotide sequences  protein sequences  proteins sequence patterns or motifs  macromolecular 3D structure  gene expression data  metabolic pathways
  • 9.
  • 10. Different classifications of databases….  Primary or derived databases  Primary databases: experimental results directly into database  Secondary databases: results of analysis of primary databases  Aggregate of many databases  Links to other data items  Combination of data  Consolidation of data
  • 11. Different classifications of databases….  Availability  Publicly available, no restrictions  Available, but with copyright  Accessible, but not downloadable  Academic, but not freely available  Proprietary, commercial; possibly free for academics
  • 12. TYPES OF DATABASES  Primary Databases  Secondary Databases
  • 13. PRIMARY DATABASES Contains bio-molecular data in its original form. Experimental results are submitted directly into the database by researchers, and the data are essentially archival in nature. Once given a database accession number, the data in primary databases are never changed. Examples :- GenBank, EMBL and DDBJ for DNA/RNA sequences, SWISS-PROT and PIR for protein sequences and PDB for molecular structures.
  • 14. GenBank • Database from NCBI, includes sequences from publicly available resources. http://www.ncbi.nlm.nih.gov /genbank/
  • 15. 15 NCBI and Entrez  One of the largest and most comprehensive databases belonging to the NIH – national institute of health (USA)  Entrez is the search engine of NCBI  Search for : genes, proteins, genomes, structures, diseases, publications and more.  http://www.ncbi.nlm.nih.gov/
  • 16. Genbank  An annotated collection of all publicly available nucleotide and proteins  Set up in 1979 at the LANL (Los Alamos).  Maintained since 1992 NCBI (Bethesda).
  • 19.
  • 20. EMBL European Molecular Biological Laboratory Nucleic acid database from EBI (European Bioinformatics Institute) Produced in collaboration with DDBJ and GenBank Search engine – SRS (Sequence Retrieval System) http://www.ebi.ac.uk /
  • 21. DDBJ DNA Databank of Japan Started in 1986 in collaboration with GenBank Produced and maintained at NIG (National Institute of Genetics) http://www.ddbj.nig.ac.jp/
  • 22. SWISS PROT http://www.ebi.ac.uk/uniprot/ …...  Annotated sequence database established in 1986  Consists of sequence entries of different lie formats  Similar format to EMBL  http://us.expasy.org/sprot/sprot-top.html
  • 23. PIR • Protein Information Resource •A division of National Biomedical Research •Foundation (NBRF) in U.S. •One can search for entries or do sequence similarity search at PIR site. http://pir.georgetown.edu /
  • 24. TrEMBL Translated European Molecular Biology Laboratory Computer annotated supplement of SWISS PROT. Contains all the translations of EMBL nucleotide sequence entries not yet integrated in SWISS PROT. http://www.ebi.ac.uk/trembl/
  • 25. Protein DataBank (PDB)  Important in solving real problems in molecular biology  Protein Databank  PDB Established in 1972 at Brookhaven National Laboratory (BNL)  Sole international repository of macromolecular structure data  Moved to Research Collaboratory for Structural Bioinformatics http://www.rcsb.org/
  • 26. PDB: example HEADER LYASE(OXO-ACID) 01-OCT-91 12CA 12CA 2 COMPND CARBONIC ANHYDRASE /II (CARBONATE DEHYDRATASE) (/HCA II) 12CA 3 SOURCE HUMAN (HOMO SAPIENS) RECOMBINANT PROTEIN 12CA 5 AUTHOR S.K.NAIR,D.W.CHRISTIANSON 12CA 6 REVDAT 1 15-OCT-92 12CA 0 12CA 7 JRNL AUTH S.K.NAIR,T.L.CALDERONE,D.W.CHRISTIANSON,C.A.FIERKE 12CA 8 JRNL TITL ALTERING THE MOUTH OF A HYDROPHOBIC POCKET. 12CA 9 JRNL TITL 2 STRUCTURE AND KINETICS OF HUMAN CARBONIC ANHYDRASE 12CA 10 JRNL TITL 3 /II$ MUTANTS AT RESIDUE VAL-121 12CA 11 JRNL REF J.BIOL.CHEM. V. 266 17320 1991 12CA 12 JRNL REFN ASTM JBCHA3 US ISSN 0021-9258 071 12CA 13 REMARK 1 12CA 14EMARK 3 AUTHORS HENDRICKSON,KONNERT 12CA 20 REMARK 3 R VALUE 0.170 12CA 21 REMARK 3 RMSD BOND DISTANCES 0.011 ANGSTROMS 12CA 22 REMARK 3 RMSD BOND ANGLES 1.3 DEGREES 12CA 23 REMARK 4 12CA 24 REMARK 4 N-TERMINAL RESIDUES SER 2, HIS 3, HIS 4 AND C-TERMINAL 12CA 25 REMARK 4 RESIDUE LYS 260 WERE NOT LOCATED IN THE DENSITY MAPS AND, 12CA 26 REMARK 4 THEREFORE, NO COORDINATES ARE INCLUDED FOR THESE RESIDUES. 12CA 27 ………
  • 27. COMPOSITE DATABASES Collection of various primary database sequences Renders sequence searching highly efficient as it searches multiple resources Examples :- NRDB (Non Redundant Database), OWL, MIPSX, SWISS PROT + TrEMBL
  • 28.
  • 29. SECONDARY DATABASES Contains data derived from the results of analysing primary data Manually created or automatically generated Contains more relevant and useful information structured to specific requirements Example :- PROSITE, PRINTS, BLOCKS, Pfam
  • 30. PROSITE Families of proteins Can search using regular expressions Similar to unix commands Families exhibit these patterns So we can search over families http://ca.expasy.org/ prosite/
  • 31. BLOCKS  Motifs/blocks are created by automatically detecting the most conserved regions of each protein family.
  • 32. PRIMARY VS SECONDARY DATABASES