Biological data bioinformatics

Biological Database
Aakifah Amreen H. E
2nd Semester MSc (BT)

Content
• Bioinformatics
• Introduction to database
• Objective of biological Database
• Features of biological Database
• Classification of biological Database
• Conclusion
• References

What is Bioinformatics
Bioinformatics is an interdisciplinary field mainly
involving molecular biology, genetics, computer
science, mathematics and statistics.
It has been defined as a means for analyzing, comparing,
graphically displaying, modeling, storing, searching and
ultimately distributing biological information, which
includes sequence, structure and function.

Introduction
Database
A collection of data organized so that it is easily
accessible in many ways.
Biological Database
A collection of biological data that is structured,
searchable, updated periodically, and cross-referenced.
It stores biological data in electronic form.

Objective of biological Database
• Recognize the various data formats and know their
primary uses
• Know, understand and use all types of sequence
identifiers
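
As a minimal sketch of what recognizing a data format and its identifiers can look like in practice, the Python below reads FASTA-style text (one common sequence format, chosen here as an assumption since the slides do not name one) and splits an "accession.version" style identifier. The record names DEMO0001.2 and DEMO0002, and the helper names parse_fasta and split_version, are made up for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class FastaRecord:
    identifier: str   # first whitespace-delimited token after '>'
    description: str  # remainder of the header line
    sequence: str     # sequence lines joined together


def _build(header: str, chunks: list[str]) -> FastaRecord:
    identifier, _, description = header.partition(" ")
    return FastaRecord(identifier, description.strip(), "".join(chunks))


def parse_fasta(text: str) -> list[FastaRecord]:
    """Parse FASTA-style text into a list of records."""
    records: list[FastaRecord] = []
    header: Optional[str] = None
    chunks: list[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append(_build(header, chunks))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append(_build(header, chunks))
    return records


def split_version(identifier: str) -> tuple[str, Optional[int]]:
    """Split 'ACCESSION.VERSION' into parts; version is None if absent."""
    accession, dot, version = identifier.rpartition(".")
    if dot and version.isdigit():
        return accession, int(version)
    return identifier, None


# Hypothetical records, not real database entries.
EXAMPLE = """>DEMO0001.2 hypothetical example record
ATGGCGTACGTTAGC
TTGACCGGA
>DEMO0002 second made-up record
ATGAAATTT
"""

records = parse_fasta(EXAMPLE)
for record in records:
    print(record.identifier, split_version(record.identifier), len(record.sequence))
```

The version suffix matters because the same accession can point to a revised sequence over time, so tools usually keep both parts.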

Features of biological Databases
• Heterogeneity
• High volume data
• Uncertainty
• Data curation
• Data integration
• Data sharing
• Dynamics

DATA HETEROGENEITY
Availability of diverse, complex data types.
Data types include sequences, graphs, high-dimensional
data, shapes, temporal data, patterns and extracted
feature data.
HIGH VOLUME DATA
In addition to being highly heterogeneous, biological data are
voluminous, supporting comprehensive investigation in
various fields and directions.
UNCERTAINTY
Biological data carry a great deal of uncertainty, as they represent
biological phenomena that are observed and assumed.

DATA INTEGRATION
Across different structural scales, data are collected
from laboratories worldwide, integrated
through a database and made available for use.
DATA SHARING
For inspection by the scientific community
For cross-verification
To prevent repetition and allow validation of data

DYNAMICS
New data are generated every day in laboratories,
and sometimes these new data contradict old data.
It is therefore necessary to develop new organizational
database schemes that can incorporate new data.

Classification of biological Database
Data types
Maintainer status
Data access
Data source
Database design
Organism

1. DATA TYPE
• Sequence database
a. Nucleotide database - GenBank, EMBL
b. Protein database - Swiss-Prot
• Structure database - PDB
• Chemical database - PubChem
• Pathway database - KEGG
• Literature database - PubMed

2. MAINTAINER STATUS
• NCBI, EMBL
• Academic group of scientists
• Commercial company
3. DATA ACCESS
• Publicly available
• Available with copyright
• Browsable but not downloadable
• Restricted

4. DATA SOURCE
a. Primary database
Holds original data submitted directly by researchers.
Ex. Nucleotide - GenBank, EMBL; Protein - UniProt;
Structure - PDB; Literature - PubMed
b. Secondary database
Holds results derived from analysis of primary databases,
either manually curated or generated by automated methods.
Ex. Pfam, PROSITE
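
The relationship between the two kinds of database can be sketched in a few lines of Python: primary records hold the submitted sequences, and a secondary annotation is computed from them. This is only an illustration under stated assumptions. The two protein records are invented, the helper names prosite_to_regex and annotate are hypothetical, and the converter handles only a small subset of PROSITE-style pattern syntax. The pattern used, N-{P}-[ST]-{P}, is the commonly cited N-glycosylation site motif.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a simple PROSITE-style pattern into a Python regex.

    Handles only: '-' separators, x (any residue), [..] allowed sets,
    {..} forbidden sets, (n) or (n,m) repeats, and < / > end anchors.
    """
    parts = []
    for element in pattern.rstrip(".").split("-"):
        element = element.replace("{", "[^").replace("}", "]")  # forbidden residues
        element = element.replace("(", "{").replace(")", "}")   # repeat counts
        element = element.replace("x", ".")                     # any residue
        element = element.replace("<", "^").replace(">", "$")   # sequence ends
        parts.append(element)
    return "".join(parts)


def annotate(primary: dict, pattern: str) -> dict:
    """Return 1-based start positions of every (possibly overlapping) match."""
    # The lookahead lets finditer report overlapping sites.
    regex = re.compile(f"(?=({prosite_to_regex(pattern)}))")
    return {
        accession: [m.start() + 1 for m in regex.finditer(sequence)]
        for accession, sequence in primary.items()
    }


# "Primary" data: made-up protein sequences keyed by made-up accessions.
PRIMARY = {
    "DEMO_P1": "MKNGTALLNKSAP",
    "DEMO_P2": "MKNPTALL",
}

# "Secondary" data: motif positions derived by automated analysis.
secondary = annotate(PRIMARY, "N-{P}-[ST]-{P}")
print(secondary)
```

Real secondary databases add curation, scoring and documentation on top of this kind of automated scan.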

5. DATABASE DESIGN
• Flat files
• Relational database (SQL)
• Object oriented database
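
To make the three designs concrete, the sketch below holds the same two made-up entries as a flat file, as Python objects and as rows in a relational table (using the standard sqlite3 module). The two-letter line tags are loosely modelled on EMBL-style flat files but heavily simplified, so this is not a parser for any real format. The entries, the Entry class and the table layout are all assumptions for illustration.

```python
import sqlite3
from dataclasses import dataclass

# Flat file: plain text, one tagged line per field, '//' ends an entry.
FLAT_FILE = """\
ID   DEMO0001
OS   Escherichia coli
SQ   ATGGCGTACG
//
ID   DEMO0002
OS   Homo sapiens
SQ   ATGAAATTTGGG
//
"""


@dataclass
class Entry:
    """Object-oriented view: one object per database entry."""
    accession: str
    organism: str
    sequence: str


def parse_flat_file(text: str) -> list:
    entries, fields = [], {}
    for line in text.splitlines():
        if line.startswith("//"):
            entries.append(Entry(fields["ID"], fields["OS"], fields["SQ"]))
            fields = {}
        elif line.strip():
            # First two characters are the tag, the rest is the value.
            fields[line[:2]] = line[2:].strip()
    return entries


entries = parse_flat_file(FLAT_FILE)

# Relational view: the same entries as rows, queried with SQL.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entry (accession TEXT PRIMARY KEY, organism TEXT, sequence TEXT)"
)
conn.executemany(
    "INSERT INTO entry VALUES (?, ?, ?)",
    [(e.accession, e.organism, e.sequence) for e in entries],
)
rows = conn.execute(
    "SELECT accession, LENGTH(sequence) FROM entry "
    "WHERE organism = ? ORDER BY accession",
    ("Homo sapiens",),
).fetchall()
print(rows)
```

The flat file is easy to exchange and read, while the relational table makes questions such as "all entries for one organism" a single query.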
6. ORGANISMS
• Bacteria
• Virus
• Humans
• Animals

Conclusion
• A biological database is a collection of data organized
in a systematic manner
• It can be used to check whether new data are similar to
data already present
• It is easily accessible
• Searching it does not consume much time

References
• Introduction to Bioinformatics, Arthur Lesk, Edition
5, Oxford University Press, 2019.
• Essential Bioinformatics, Jin Xiong, Cambridge
University Press, 2006.
