SlideShare a Scribd company logo
1 of 28
Introductiontobioinformaticsand
BiologicalDatabase
By ā€“ Dr. Rajesh Kumar
Department of Botany
MGGAC, Mahe
Whatis bioinformatics?
ā€¢ In biology, bioinformatics is defined as, ā€œ
t
h
e use of
computer to store, retrieve, analyse or predict the
composition or structure of bio-moleculesā€ . Bioinformatics
is the application of computational techniques and
informationtechnology totheorganisationandmanagement
of biological data. Classical bioinformatics deals primarily
withsequenceanalysis.
Aimsofbioinformatics
ļƒ¼Developmentofdatabasecontainingall biological
information.
ļƒ¼Developmentofbettertoolsfordatadesigning,annotation
andmining.
ļƒ¼Designanddevelopmentofdrugsbyusingsimulation
software.
ļƒ¼Designanddevelopmentof softwaretools for protein
structurepredictionfunction,annotationanddockinganalysis.
ļƒ¼Creationanddevelopmentofsoftwaretoimprovetoolsfor
analysingsequencesfortheirfunctionandsimilaritywith
othersequences
Applications of bioinformatics
Applications of
bioinformatis
Gene
therapy
Drug
designing
Antibiotic
resistance
Crop
development
Medicine
biotechnology
Drought
resistance
Evolutionary
studies
Forensic
analysis
Veterinary
science
Wheather
analysisy
Waste
ceanup
Biological databases
ā€¢ Biological data are complex, exception-ridden, vast and
incomplete. Therefore several databases has been created
and interpreted to ensure unambiguous results. A
collection of biological data arranged in computer
readable form that enhances the speed of search and
retrieval and convenient to use is called biological
database. A good database must have updated
information.
Importanceofbiological database
ā€¢ A range of information like biological sequences,
structures, binding sites, metabolic interactions, molecular
action, functional relationships, protein families, motifs
and homologous can be retrieved by using biological
databases.The mainpurposeof abiological databaseis to
store and manage biological data and information in
computerreadableforms.
Typesofbiologicaldatabase
Nucleotidesequence
database
Primarydatabase
Deriveddatabase
secondarydatabase
Specialised
database
Structure
database
Protein structure
database
Genebank
DDBJ
EMBL
Proteinsequence
database
Swis-port
PIR
GenePept
PDB
EBI-MSD
MMDB
Domainand
motifdatabase
Prosite
Blocks
COG
Gene
expression
database
Metabolic
pathway
database
SCOP
CATH
GEO
GXD
MGED
KEGG
EMP
PathDB
TGI
GSOB
GPCRD
Primary databasevs.secondarydatabase
ā€¢ A primarydatabasecontainsonlysequenceorstructural
information.
ā€¢ Thedatabasederivedfromtheanalysisortreatmentof
primarydataaresecondarydatabase.Itisveryimportant
forinterferingproteinfunction.
Examplesofsome primary
biological database
GeneBank
ļƒ¼ Oneof thefastest growing repositories of knownnucleotide sequences,GeneBank
(Genetic Sequence Databank), has a flat file structure. It is an ASCII text file,
readable by both humans and computers. Besides sequence data, GeneBank files
contain information such as accession numbers and gene names, phylogenetic
classificationandreferencestopublishedliterature.
ļƒ¼ This database has been developed and maintained at the NCBI, Bethesda, MD,
USA, asapartofInternationalSequenceDatabaseCollaboration(INSDC).
ļƒ¼ Itis anopenaccesssequencedatabase.
ļƒ¼ It coordinateswithindividual laboratories
EMBL and DDBJ.
and other sequence databases like
Continueā€¦ā€¦ā€¦ā€¦ā€¦ā€¦..
ļƒ¼It is anannotatedcollection of all nucleotide sequencesthatareavailable to
thepublic.
ļƒ¼The nucleotide database was divided into three databases at NCBI:
CoreNucleotide database, Expressed Sequence Tag (EST) and Genome
SurveySequence(GSS).
ļƒ¼CoreNucleotide databasehasmostof thenucleotide sequencesused. It also
enclosesall nucleotiderecordsthatarenotintheEST andGSS databases.
ļƒ¼Submission of sequences to GeneBank can be done using BankIt, Sequin
andtbl2asntools.
EMBL(EuropeanMolecular
Biology Laboratory)
ā€¢ A comprehensive database of DNA and RNA sequences, EMBL
nucleotide sequencedatabaseis collected from scientific literature,
patientoffices andis directly submittedby researchers. EMBL has
been prepared in collaboration with GeneBank (USA) and the
DNA DatabaseofJapan(DDBJ).
ā€¢ Itis establishedin1980.
ā€¢ Itis maintainedbyEBI (EuropeanBioinformatics Institute)
Swiss-Port
ļƒ¼ This is a curatedprotein sequencedatabasethatoffers ahigh level
of integrationwithotherdatabasesandalso hasavery lowlevel of
redundancy. Swiss-Port strives toprovide protein sequenceswith a
high level of annotation (for instance, the description of protein
function, domain structure and post translational modifications,
etc.).
ļƒ¼ It is established in 1986 and maintained collaboratively , since
1987, by the department of Medical Biochemistry of the
UniversityofGenevaandtheEMBL dataLibrary.
ļƒ¼ TrEMBL is a computerā€“annotated supplement of Swiss-Port that
contains all translations of EMBL nucleotide sequence entries,
whichis notyetintegratedinSwiss-Port.
ļƒ¼ Currently Swiss-Port have 0.5 and TrEMBL have 7.6 milliom
sequences.
ProteinInformationResource(PIR)
ā€¢ PIR is an integrated public bioinformatics resource to
support genomic and proteomic research and scientific
studies. Nowadays, PIR offers a wide variety of resources
mainly oriented to assisting the propagation and
consistency of protein annotations like PIRSF, ProClass
andProLINK.
ExamplesofSome Secondary
Biological Database
MotifDatabases
ā€¢ Protein sequence motif is a set of conserved amino acid
residuesthatareimportantfor proteinfunctionandarelocated
within a certain distance from one another
. These motifs
usually provide clues to the functions of otherwise
uncharacterisedproteins.
ā€¢ The PROSITE database consists of documentation entries
describing protein domains, families and functional sites as
wellasassociatedpatternsandprofilestoidentifythem.
ā€¢ PRINT is adatabasefor proteinfingerprints. A fingerprint is a
group of conserved motifs used to characterise a protein
family.
DomainDatabase
ā€¢ A protein domain is an independently folded, structurally
compact unit that forms a steady three- dimensional structure
and shows a certain level of evolutionary conservation.
Typically ,aconserveddomaincontainsoneormoremotifs.
ā€¢ ProDomisaproteindomaindatabaseautomaticallygenerated
fromtheSwiss-PortandTrEMBL sequencedatabase.
ā€¢ SMART is a highly reliable and sensitive tool for
domain identification.
ā€¢ COG is adatabaseandaconvenienttoolformotifanddomain
identification.
3DStructuredatabases
ā€¢ PDB (Protein Data bank) is themainprimary databasefor 3D
structures of biological macromolecules determined by X-ray,
crystallography and NMR. It also accepts experimental data
usedtodeterminethestructuresandhomologymodels.
ā€¢ SCOP (Structural Classification of Protein database) classifies
protein 3D structures in a hierarchical scheme of structure
classes. All the protein structures in PDB are classified her,
andtheupdatednewstructuresaredepositedinPDB.
ā€¢ The CATH database (Class, Architecture, Topology,
Homologous) contains a hierarchical classification of protein
domainstructure.
Protein data bank
ā€¢ PDB (Protein data bank) is a repository for 3D structural
data obtained by x-ray crystallography or NMR
spectroscopyofproteinsandnucleic acids.
ā€¢ Research Collaboratory for Structural Bioinformatics
(RCSB) PDB provides avariety of tools andresourcesfor
studying the structures of biological macromolecules and
their relationship with other sequences, its function and
diseasescausedif any.
Geneexpressiondatabases
ā€¢ GEO or Gene Expression Omnibus is a curated online
resource and a gene expression molecular abundance
repository for gene expression data browsing, query and
retrieval.
ā€¢ GXD or GeneExpressionDatabase is acommunity resource
forgeneexpressioninformation.
ā€¢ MGED or Microarray Gene Expression data contains
microarray data, generated by functional genomics and
proteomicsexperiments.
ā€¢ ArrayExpress from European Bioinformatics Institute is a
repositoryfortranscriptomicsdata.
Metabolicpathwaydatabases
ā€¢ KEGG PATHWAY Databasecontains graphical pathwaymapsfor all
knownmetabolicpathwaysfromvariousorganisms.
ā€¢ EcoCyc is an E. coli database , stores information regarding the
genomeandbiochemical machineryofE.coli.
ā€¢ LIGAND is achemical databasefor enzymereactions attheInstitute
for Chemical Research, Kyoto. It is composite database currently
consisting of the COMPOUND, DRUG, GLYCAN, REACTION,
RPAIR andENZYMEdatabases.
ā€¢ MetaCyc is a non-redundant, experimentally elucidated metabolic
pathwaydatabase.
ā€¢ BRENDA is an enzyme database tat contains information on all
aspectsofenzymesandenzymaticreactions.
Genomedatabases
ā€¢ Genome databases give absolute information on the heritable
properties of an organism. These databases help to identify
genes and predict their functions. A few genome databases
havelinkswithspecificorganismdatabases.
ā€¢ GOLD (Genomes Online Database at the University of
Illinois, USA) contains a list of all thecomplete and ongoing
genomeprojectsworldwide.
ā€¢ Genomes at NCBI (National Centre for Biotechnology
Information,USA).
ā€¢ TIGR database (TDB), at the institute for Genomic Research
atRockeville MD,USA.
Virological databases
ā€¢ A virological database contains all the sequences and
related information of viruses of animals, plants, bacteria,
fungi and archea; for example, the HIV protease
database. A committee called The International
Committee on Taxonomy of Viruses(ICTV) authorises
and organises the taxonomic classification of viruses.
ICTVdB contains taxonomic information for over
thousandsofvirusspecies.
Worldbiodiversitydatabases
ā€¢ Taxonomic databases are built to document all known
species and them available and
worldwide.
accessibl
e databases contain taxonomic
hierarchies,
make
These
species names, synonyms, descriptions,
illustrations and references. For example: CCINFO,
STRAIN andALGAE.
Databaseforvariousmodelorganisms
ā€¢ Escherichia coli- E. coli Genome Centre(Wisconsin university,
USA), TheE.coli index(UniversityofBirmingham, UK)
ā€¢ Arabidopsisthaliana-TAIR (TheArabidopsisInformationResource)
ā€¢ Homosapiens-HumanGenomeResourcesatNCBI, USA
ā€¢ Oryza sativa (rice) -RGP (Rice Genome Research
Programme,Japan)
ā€¢ Drosaphilamelanogaster-FlyBase (DrosophilaGenomeDatabase)
ā€¢ Musmusculus(mouce)-MouceGenomeInformatics
ā€¢ Daniorerio(zebrafish)- ZFIN (Zebrafish InformationNetwork atthe
UniversityofOregon,USA)
ā€¢ S. cerevisiae (Bakers yeast)- SGD ()Yeast Genome Database
at Stanford,USA
AnnotationofGene?????
In molecular biology, genomes make the basic genetic material and
typically consistofDNA. Whereby,genomeincludethegenes(coding)
andnoncoding regions, of interest tous, arethecoding regions asthey
actively influence basic life processes. The genes contain useful
biological information that is required in building up and maintaining
an organism. Gene annotation can be defined merely as the process of
makingnucleotidesequencemeaningful.
Gene annotation involves the process of taking the raw DNA sequence
produced by genomesequencing projects andadding layers of analysis
and interpretation necessary to extracting biologically significant
information andplacing such derived details into context. Annotation is
the process by which pertinent information about these raw DNA
sequencesisaddedtothedatabases.
Accessionnumber
Accession numbers are unique identifiers which
permanently identify sequences in the database.
Accession numbers are assigned and
communicated to authors within twoworking days
ofthereceiptofsubmission.
Introduction to Biological database ppt(1).pptx

More Related Content

Similar to Introduction to Biological database ppt(1).pptx

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEPrashantSharma807
Ā 
Data base in detail
Data base in detailData base in detail
Data base in detailVartika Mishra
Ā 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
Ā 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminarshashi bijapure
Ā 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
Ā 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformaticscontactsoorya
Ā 
Protein database
Protein  databaseProtein  database
Protein databaseKAUSHAL SAHU
Ā 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfoVinitha Nair
Ā 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfoVinitha Nair
Ā 
bioinfomatics
bioinfomaticsbioinfomatics
bioinfomaticsnguyenpg
Ā 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBioinformaticsCentre
Ā 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
Ā 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
Ā 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptxPagudalaSangeetha
Ā 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
Ā 

Similar to Introduction to Biological database ppt(1).pptx (20)

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
Ā 
Biological database
Biological databaseBiological database
Biological database
Ā 
Data base in detail
Data base in detailData base in detail
Data base in detail
Ā 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Ā 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
Ā 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
Ā 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
Ā 
Ncbi
NcbiNcbi
Ncbi
Ā 
Proteins databases
Proteins databasesProteins databases
Proteins databases
Ā 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Ā 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
Ā 
Protein database
Protein  databaseProtein  database
Protein database
Ā 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
Ā 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
Ā 
bioinfomatics
bioinfomaticsbioinfomatics
bioinfomatics
Ā 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
Ā 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
Ā 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
Ā 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
Ā 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
Ā 

More from RAJESHKUMAR428748

Methods of sterilization & totipotency.pdf
Methods of  sterilization & totipotency.pdfMethods of  sterilization & totipotency.pdf
Methods of sterilization & totipotency.pdfRAJESHKUMAR428748
Ā 
sterilization and methods of sterilization
sterilization and methods of sterilizationsterilization and methods of sterilization
sterilization and methods of sterilizationRAJESHKUMAR428748
Ā 
Introduction of climatechangeinindia-210918112730.pdf
Introduction of climatechangeinindia-210918112730.pdfIntroduction of climatechangeinindia-210918112730.pdf
Introduction of climatechangeinindia-210918112730.pdfRAJESHKUMAR428748
Ā 
Production of secondarymetabolites-200422175353.ppt
Production of secondarymetabolites-200422175353.pptProduction of secondarymetabolites-200422175353.ppt
Production of secondarymetabolites-200422175353.pptRAJESHKUMAR428748
Ā 
Introction to Molecular basis of Plant Breeding.pptx
Introction to Molecular basis of Plant Breeding.pptxIntroction to Molecular basis of Plant Breeding.pptx
Introction to Molecular basis of Plant Breeding.pptxRAJESHKUMAR428748
Ā 
Hypothesis - I.pptx Reasearch hyposthesis
Hypothesis - I.pptx Reasearch hyposthesisHypothesis - I.pptx Reasearch hyposthesis
Hypothesis - I.pptx Reasearch hyposthesisRAJESHKUMAR428748
Ā 
20- Tabular & Graphical Presentation of data(UG2017-18).ppt
20- Tabular & Graphical Presentation of data(UG2017-18).ppt20- Tabular & Graphical Presentation of data(UG2017-18).ppt
20- Tabular & Graphical Presentation of data(UG2017-18).pptRAJESHKUMAR428748
Ā 
session1-2.ppt Natural farming and organic farming
session1-2.ppt Natural farming and organic farmingsession1-2.ppt Natural farming and organic farming
session1-2.ppt Natural farming and organic farmingRAJESHKUMAR428748
Ā 
breeders right.ppt for Intellectual Property
breeders right.ppt for Intellectual Propertybreeders right.ppt for Intellectual Property
breeders right.ppt for Intellectual PropertyRAJESHKUMAR428748
Ā 
bioplasticfinal-150313052258-conversion-gate01.pptx
bioplasticfinal-150313052258-conversion-gate01.pptxbioplasticfinal-150313052258-conversion-gate01.pptx
bioplasticfinal-150313052258-conversion-gate01.pptxRAJESHKUMAR428748
Ā 
bioplastics and biotechnology for sustainable future
bioplastics and biotechnology for sustainable futurebioplastics and biotechnology for sustainable future
bioplastics and biotechnology for sustainable futureRAJESHKUMAR428748
Ā 
bioplastics and biotechnology: Green Plastics.pdf
bioplastics and biotechnology: Green Plastics.pdfbioplastics and biotechnology: Green Plastics.pdf
bioplastics and biotechnology: Green Plastics.pdfRAJESHKUMAR428748
Ā 
Better plantsasbioreactors-170225140248.pptx
Better plantsasbioreactors-170225140248.pptxBetter plantsasbioreactors-170225140248.pptx
Better plantsasbioreactors-170225140248.pptxRAJESHKUMAR428748
Ā 
Organic farming concept as pects and prospects
Organic farming concept as pects and prospectsOrganic farming concept as pects and prospects
Organic farming concept as pects and prospectsRAJESHKUMAR428748
Ā 
Traditional Knowledge Digital Library-TKDL
Traditional Knowledge Digital Library-TKDLTraditional Knowledge Digital Library-TKDL
Traditional Knowledge Digital Library-TKDLRAJESHKUMAR428748
Ā 
Application of medicinal plant and its extracts
Application of medicinal plant and its extractsApplication of medicinal plant and its extracts
Application of medicinal plant and its extractsRAJESHKUMAR428748
Ā 
1.pdf An introduction to medicinal plants
1.pdf An introduction to medicinal plants1.pdf An introduction to medicinal plants
1.pdf An introduction to medicinal plantsRAJESHKUMAR428748
Ā 
Nutrient defficiency pdf in greenhouse plants
Nutrient defficiency pdf in greenhouse plantsNutrient defficiency pdf in greenhouse plants
Nutrient defficiency pdf in greenhouse plantsRAJESHKUMAR428748
Ā 
Advatange and disadvatanges of GHT-2-21.pdf
Advatange and disadvatanges of GHT-2-21.pdfAdvatange and disadvatanges of GHT-2-21.pdf
Advatange and disadvatanges of GHT-2-21.pdfRAJESHKUMAR428748
Ā 
nethouse.pdf green house covering material
nethouse.pdf green house covering materialnethouse.pdf green house covering material
nethouse.pdf green house covering materialRAJESHKUMAR428748
Ā 

More from RAJESHKUMAR428748 (20)

Methods of sterilization & totipotency.pdf
Methods of  sterilization & totipotency.pdfMethods of  sterilization & totipotency.pdf
Methods of sterilization & totipotency.pdf
Ā 
sterilization and methods of sterilization
sterilization and methods of sterilizationsterilization and methods of sterilization
sterilization and methods of sterilization
Ā 
Introduction of climatechangeinindia-210918112730.pdf
Introduction of climatechangeinindia-210918112730.pdfIntroduction of climatechangeinindia-210918112730.pdf
Introduction of climatechangeinindia-210918112730.pdf
Ā 
Production of secondarymetabolites-200422175353.ppt
Production of secondarymetabolites-200422175353.pptProduction of secondarymetabolites-200422175353.ppt
Production of secondarymetabolites-200422175353.ppt
Ā 
Introction to Molecular basis of Plant Breeding.pptx
Introction to Molecular basis of Plant Breeding.pptxIntroction to Molecular basis of Plant Breeding.pptx
Introction to Molecular basis of Plant Breeding.pptx
Ā 
Hypothesis - I.pptx Reasearch hyposthesis
Hypothesis - I.pptx Reasearch hyposthesisHypothesis - I.pptx Reasearch hyposthesis
Hypothesis - I.pptx Reasearch hyposthesis
Ā 
20- Tabular & Graphical Presentation of data(UG2017-18).ppt
20- Tabular & Graphical Presentation of data(UG2017-18).ppt20- Tabular & Graphical Presentation of data(UG2017-18).ppt
20- Tabular & Graphical Presentation of data(UG2017-18).ppt
Ā 
session1-2.ppt Natural farming and organic farming
session1-2.ppt Natural farming and organic farmingsession1-2.ppt Natural farming and organic farming
session1-2.ppt Natural farming and organic farming
Ā 
breeders right.ppt for Intellectual Property
breeders right.ppt for Intellectual Propertybreeders right.ppt for Intellectual Property
breeders right.ppt for Intellectual Property
Ā 
bioplasticfinal-150313052258-conversion-gate01.pptx
bioplasticfinal-150313052258-conversion-gate01.pptxbioplasticfinal-150313052258-conversion-gate01.pptx
bioplasticfinal-150313052258-conversion-gate01.pptx
Ā 
bioplastics and biotechnology for sustainable future
bioplastics and biotechnology for sustainable futurebioplastics and biotechnology for sustainable future
bioplastics and biotechnology for sustainable future
Ā 
bioplastics and biotechnology: Green Plastics.pdf
bioplastics and biotechnology: Green Plastics.pdfbioplastics and biotechnology: Green Plastics.pdf
bioplastics and biotechnology: Green Plastics.pdf
Ā 
Better plantsasbioreactors-170225140248.pptx
Better plantsasbioreactors-170225140248.pptxBetter plantsasbioreactors-170225140248.pptx
Better plantsasbioreactors-170225140248.pptx
Ā 
Organic farming concept as pects and prospects
Organic farming concept as pects and prospectsOrganic farming concept as pects and prospects
Organic farming concept as pects and prospects
Ā 
Traditional Knowledge Digital Library-TKDL
Traditional Knowledge Digital Library-TKDLTraditional Knowledge Digital Library-TKDL
Traditional Knowledge Digital Library-TKDL
Ā 
Application of medicinal plant and its extracts
Application of medicinal plant and its extractsApplication of medicinal plant and its extracts
Application of medicinal plant and its extracts
Ā 
1.pdf An introduction to medicinal plants
1.pdf An introduction to medicinal plants1.pdf An introduction to medicinal plants
1.pdf An introduction to medicinal plants
Ā 
Nutrient defficiency pdf in greenhouse plants
Nutrient defficiency pdf in greenhouse plantsNutrient defficiency pdf in greenhouse plants
Nutrient defficiency pdf in greenhouse plants
Ā 
Advatange and disadvatanges of GHT-2-21.pdf
Advatange and disadvatanges of GHT-2-21.pdfAdvatange and disadvatanges of GHT-2-21.pdf
Advatange and disadvatanges of GHT-2-21.pdf
Ā 
nethouse.pdf green house covering material
nethouse.pdf green house covering materialnethouse.pdf green house covering material
nethouse.pdf green house covering material
Ā 

Recently uploaded

mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
Ā 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
Ā 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
Ā 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
Ā 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
Ā 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
Ā 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
Ā 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
Ā 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
Ā 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
Ā 
18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdfssuser54595a
Ā 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
Ā 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
Ā 
call girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļøcall girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø9953056974 Low Rate Call Girls In Saket, Delhi NCR
Ā 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
Ā 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
Ā 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
Ā 

Recently uploaded (20)

TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
Ā 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
Ā 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
Ā 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
Ā 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
Ā 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
Ā 
Model Call Girl in Bikash Puri Delhi reach out to us at šŸ”9953056974šŸ”
Model Call Girl in Bikash Puri  Delhi reach out to us at šŸ”9953056974šŸ”Model Call Girl in Bikash Puri  Delhi reach out to us at šŸ”9953056974šŸ”
Model Call Girl in Bikash Puri Delhi reach out to us at šŸ”9953056974šŸ”
Ā 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
Ā 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
Ā 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
Ā 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
Ā 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Ā 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
Ā 
18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAŠ”Y_INDEX-DM_23-1-final-eng.pdf
Ā 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
Ā 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
Ā 
call girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļøcall girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in Kamla Market (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
Ā 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
Ā 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
Ā 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Ā 

Introduction to Biological database ppt(1).pptx

  • 1. Introductiontobioinformaticsand BiologicalDatabase By ā€“ Dr. Rajesh Kumar Department of Botany MGGAC, Mahe
  • 2. Whatis bioinformatics? ā€¢ In biology, bioinformatics is defined as, ā€œ t h e use of computer to store, retrieve, analyse or predict the composition or structure of bio-moleculesā€ . Bioinformatics is the application of computational techniques and informationtechnology totheorganisationandmanagement of biological data. Classical bioinformatics deals primarily withsequenceanalysis.
  • 3. Aimsofbioinformatics ļƒ¼Developmentofdatabasecontainingall biological information. ļƒ¼Developmentofbettertoolsfordatadesigning,annotation andmining. ļƒ¼Designanddevelopmentofdrugsbyusingsimulation software. ļƒ¼Designanddevelopmentof softwaretools for protein structurepredictionfunction,annotationanddockinganalysis. ļƒ¼Creationanddevelopmentofsoftwaretoimprovetoolsfor analysingsequencesfortheirfunctionandsimilaritywith othersequences
  • 4. Applications of bioinformatics Applications of bioinformatis Gene therapy Drug designing Antibiotic resistance Crop development Medicine biotechnology Drought resistance Evolutionary studies Forensic analysis Veterinary science Wheather analysisy Waste ceanup
  • 5. Biological databases ā€¢ Biological data are complex, exception-ridden, vast and incomplete. Therefore several databases has been created and interpreted to ensure unambiguous results. A collection of biological data arranged in computer readable form that enhances the speed of search and retrieval and convenient to use is called biological database. A good database must have updated information.
  • 6. Importanceofbiological database ā€¢ A range of information like biological sequences, structures, binding sites, metabolic interactions, molecular action, functional relationships, protein families, motifs and homologous can be retrieved by using biological databases.The mainpurposeof abiological databaseis to store and manage biological data and information in computerreadableforms.
  • 8. Primary databasevs.secondarydatabase ā€¢ A primarydatabasecontainsonlysequenceorstructural information. ā€¢ Thedatabasederivedfromtheanalysisortreatmentof primarydataaresecondarydatabase.Itisveryimportant forinterferingproteinfunction.
  • 10. GeneBank ļƒ¼ Oneof thefastest growing repositories of knownnucleotide sequences,GeneBank (Genetic Sequence Databank), has a flat file structure. It is an ASCII text file, readable by both humans and computers. Besides sequence data, GeneBank files contain information such as accession numbers and gene names, phylogenetic classificationandreferencestopublishedliterature. ļƒ¼ This database has been developed and maintained at the NCBI, Bethesda, MD, USA, asapartofInternationalSequenceDatabaseCollaboration(INSDC). ļƒ¼ Itis anopenaccesssequencedatabase. ļƒ¼ It coordinateswithindividual laboratories EMBL and DDBJ. and other sequence databases like Continueā€¦ā€¦ā€¦ā€¦ā€¦ā€¦..
  • 11. ļƒ¼It is anannotatedcollection of all nucleotide sequencesthatareavailable to thepublic. ļƒ¼The nucleotide database was divided into three databases at NCBI: CoreNucleotide database, Expressed Sequence Tag (EST) and Genome SurveySequence(GSS). ļƒ¼CoreNucleotide databasehasmostof thenucleotide sequencesused. It also enclosesall nucleotiderecordsthatarenotintheEST andGSS databases. ļƒ¼Submission of sequences to GeneBank can be done using BankIt, Sequin andtbl2asntools.
  • 12. EMBL(EuropeanMolecular Biology Laboratory) ā€¢ A comprehensive database of DNA and RNA sequences, EMBL nucleotide sequencedatabaseis collected from scientific literature, patientoffices andis directly submittedby researchers. EMBL has been prepared in collaboration with GeneBank (USA) and the DNA DatabaseofJapan(DDBJ). ā€¢ Itis establishedin1980. ā€¢ Itis maintainedbyEBI (EuropeanBioinformatics Institute)
  • 13. Swiss-Port ļƒ¼ This is a curatedprotein sequencedatabasethatoffers ahigh level of integrationwithotherdatabasesandalso hasavery lowlevel of redundancy. Swiss-Port strives toprovide protein sequenceswith a high level of annotation (for instance, the description of protein function, domain structure and post translational modifications, etc.). ļƒ¼ It is established in 1986 and maintained collaboratively , since 1987, by the department of Medical Biochemistry of the UniversityofGenevaandtheEMBL dataLibrary. ļƒ¼ TrEMBL is a computerā€“annotated supplement of Swiss-Port that contains all translations of EMBL nucleotide sequence entries, whichis notyetintegratedinSwiss-Port. ļƒ¼ Currently Swiss-Port have 0.5 and TrEMBL have 7.6 milliom sequences.
  • 14. ProteinInformationResource(PIR) ā€¢ PIR is an integrated public bioinformatics resource to support genomic and proteomic research and scientific studies. Nowadays, PIR offers a wide variety of resources mainly oriented to assisting the propagation and consistency of protein annotations like PIRSF, ProClass andProLINK.
  • 16. MotifDatabases ā€¢ Protein sequence motif is a set of conserved amino acid residuesthatareimportantfor proteinfunctionandarelocated within a certain distance from one another . These motifs usually provide clues to the functions of otherwise uncharacterisedproteins. ā€¢ The PROSITE database consists of documentation entries describing protein domains, families and functional sites as wellasassociatedpatternsandprofilestoidentifythem. ā€¢ PRINT is adatabasefor proteinfingerprints. A fingerprint is a group of conserved motifs used to characterise a protein family.
  • 17. DomainDatabase ā€¢ A protein domain is an independently folded, structurally compact unit that forms a steady three- dimensional structure and shows a certain level of evolutionary conservation. Typically ,aconserveddomaincontainsoneormoremotifs. ā€¢ ProDomisaproteindomaindatabaseautomaticallygenerated fromtheSwiss-PortandTrEMBL sequencedatabase. ā€¢ SMART is a highly reliable and sensitive tool for domain identification. ā€¢ COG is adatabaseandaconvenienttoolformotifanddomain identification.
  • 18. 3DStructuredatabases ā€¢ PDB (Protein Data bank) is themainprimary databasefor 3D structures of biological macromolecules determined by X-ray, crystallography and NMR. It also accepts experimental data usedtodeterminethestructuresandhomologymodels. ā€¢ SCOP (Structural Classification of Protein database) classifies protein 3D structures in a hierarchical scheme of structure classes. All the protein structures in PDB are classified her, andtheupdatednewstructuresaredepositedinPDB. ā€¢ The CATH database (Class, Architecture, Topology, Homologous) contains a hierarchical classification of protein domainstructure.
  • 19. Protein data bank ā€¢ PDB (Protein data bank) is a repository for 3D structural data obtained by x-ray crystallography or NMR spectroscopyofproteinsandnucleic acids. ā€¢ Research Collaboratory for Structural Bioinformatics (RCSB) PDB provides avariety of tools andresourcesfor studying the structures of biological macromolecules and their relationship with other sequences, its function and diseasescausedif any.
  • 20. Geneexpressiondatabases ā€¢ GEO or Gene Expression Omnibus is a curated online resource and a gene expression molecular abundance repository for gene expression data browsing, query and retrieval. ā€¢ GXD or GeneExpressionDatabase is acommunity resource forgeneexpressioninformation. ā€¢ MGED or Microarray Gene Expression data contains microarray data, generated by functional genomics and proteomicsexperiments. ā€¢ ArrayExpress from European Bioinformatics Institute is a repositoryfortranscriptomicsdata.
  • 21. Metabolicpathwaydatabases ā€¢ KEGG PATHWAY Databasecontains graphical pathwaymapsfor all knownmetabolicpathwaysfromvariousorganisms. ā€¢ EcoCyc is an E. coli database , stores information regarding the genomeandbiochemical machineryofE.coli. ā€¢ LIGAND is achemical databasefor enzymereactions attheInstitute for Chemical Research, Kyoto. It is composite database currently consisting of the COMPOUND, DRUG, GLYCAN, REACTION, RPAIR andENZYMEdatabases. ā€¢ MetaCyc is a non-redundant, experimentally elucidated metabolic pathwaydatabase. ā€¢ BRENDA is an enzyme database tat contains information on all aspectsofenzymesandenzymaticreactions.
  • 22. Genomedatabases ā€¢ Genome databases give absolute information on the heritable properties of an organism. These databases help to identify genes and predict their functions. A few genome databases havelinkswithspecificorganismdatabases. ā€¢ GOLD (Genomes Online Database at the University of Illinois, USA) contains a list of all thecomplete and ongoing genomeprojectsworldwide. ā€¢ Genomes at NCBI (National Centre for Biotechnology Information,USA). ā€¢ TIGR database (TDB), at the institute for Genomic Research atRockeville MD,USA.
  • 23. Virological databases ā€¢ A virological database contains all the sequences and related information of viruses of animals, plants, bacteria, fungi and archea; for example, the HIV protease database. A committee called The International Committee on Taxonomy of Viruses(ICTV) authorises and organises the taxonomic classification of viruses. ICTVdB contains taxonomic information for over thousandsofvirusspecies.
  • 24. Worldbiodiversitydatabases ā€¢ Taxonomic databases are built to document all known species and them available and worldwide. accessibl e databases contain taxonomic hierarchies, make These species names, synonyms, descriptions, illustrations and references. For example: CCINFO, STRAIN andALGAE.
  • 25. Databaseforvariousmodelorganisms ā€¢ Escherichia coli- E. coli Genome Centre(Wisconsin university, USA), TheE.coli index(UniversityofBirmingham, UK) ā€¢ Arabidopsisthaliana-TAIR (TheArabidopsisInformationResource) ā€¢ Homosapiens-HumanGenomeResourcesatNCBI, USA ā€¢ Oryza sativa (rice) -RGP (Rice Genome Research Programme,Japan) ā€¢ Drosaphilamelanogaster-FlyBase (DrosophilaGenomeDatabase) ā€¢ Musmusculus(mouce)-MouceGenomeInformatics ā€¢ Daniorerio(zebrafish)- ZFIN (Zebrafish InformationNetwork atthe UniversityofOregon,USA) ā€¢ S. cerevisiae (Bakers yeast)- SGD ()Yeast Genome Database at Stanford,USA
  • 26. AnnotationofGene????? In molecular biology, genomes make the basic genetic material and typically consistofDNA. Whereby,genomeincludethegenes(coding) andnoncoding regions, of interest tous, arethecoding regions asthey actively influence basic life processes. The genes contain useful biological information that is required in building up and maintaining an organism. Gene annotation can be defined merely as the process of makingnucleotidesequencemeaningful. Gene annotation involves the process of taking the raw DNA sequence produced by genomesequencing projects andadding layers of analysis and interpretation necessary to extracting biologically significant information andplacing such derived details into context. Annotation is the process by which pertinent information about these raw DNA sequencesisaddedtothedatabases.
  • 27. Accessionnumber Accession numbers are unique identifiers which permanently identify sequences in the database. Accession numbers are assigned and communicated to authors within twoworking days ofthereceiptofsubmission.