Presented by – SWARUP MALAKAR
A database is a repository of sequences (DNA or amino acid) stored on a
computer, which provides a centralized and homogeneous view of its contents.
Alternatively, it is a vast collection of data pertaining to a specific topic, e.g.,
nucleotide sequences, protein sequences, etc.
Basically, it is an electronic environment.
Databases are at the heart of bioinformatics.
1. Sequence databases: these hold the sequences of both proteins and nucleic
acids.
2. Structural databases: these involve only proteins.
In addition, databases are classified into three categories:
A. Primary databases B. Secondary databases C. Composite databases.
A primary database contains information on the sequence or structure alone, of either
protein or nucleic acid.
Examples: PIR and SWISS-PROT for protein sequences; NCBI (GenBank), EMBL and DDBJ for
nucleotide sequences.
PIR (Protein Information Resource): it provides functionally annotated
protein sequences and structures.
PIR has collaborated with EBI and SIB to establish UniProt (Universal Protein
Resource), the central resource of protein sequence and function.
TREMBL
NCBI (National Center for Biotechnology Information):
- On Nov 4, 1988, the NCBI was established as a division of the National Library of Medicine for the
development of information systems in molecular biology.
- The NCBI is located in Bethesda, Maryland (U.S.A.).
- NCBI maintains GenBank, an annotated collection of publicly available nucleotide and
protein sequences.
- In 1988, the three partners (DDBJ, EMBL and GenBank) of the International Nucleotide
Sequence Database Collaboration met and agreed to use a common format.
i. Maintains collaboration with several NIH institutes, academia, industry and other governmental
agencies.
ii. Develops, distributes, supports and coordinates access to a variety of databases and software for
the scientific and medical communities.
iii. Develops and promotes standards for databases, data deposition and exchange, and biological
nomenclature.
iv. Engages the members of the international scientific community in informatics research and training
through the scientific visitors programs.
Link: https://www.ncbi.nlm.nih.gov/
- In 1992, NCBI took over responsibility for making the GenBank DNA sequence
database available.
- It coordinates with individual laboratories and other sequence
databases, such as those of EMBL and DDBJ.
- Moreover, NCBI has grown to provide other databases in
addition to GenBank.
- GenBank is a comprehensive sequence database that contains
publicly available DNA sequences for more than 119,000
different organisms, obtained through submissions of
sequence data from individual laboratories and batch submissions from
large-scale sequencing projects.
- Daily data exchange with the EMBL Data Library in the UK and
the DNA Data Bank of Japan helps ensure worldwide coverage.
- Developed and maintained by the European Bioinformatics Institute of the European
Molecular Biology Laboratory (EMBL-EBI).
- Provides comprehensive nucleotide sequence information.
- The European Molecular Biology Laboratory (EMBL) Nucleotide Sequence Database is a
comprehensive collection of primary nucleotide sequences maintained at the European
Bioinformatics Institute (EBI).
- Link: http://www.ebi.ac.uk/embl/
- EMBL is supported by 22 member states, four prospect states, and two associate member states.
- The laboratory operates from five sites: the main laboratory in Heidelberg, and
outstations in Hinxton (EBI, in England), Grenoble (France), Hamburg (Germany) and
Monterotondo (near Rome).
- EMBL groups and laboratories perform basic research in molecular biology and molecular
medicine, as well as training for science students and visitors.
- Since 1982 this work has been done in collaboration with GenBank (NCBI, Bethesda, USA)
and the DNA Data Bank of Japan (Mishima).
- For sequence similarity searching, a variety of tools (FASTA and BLAST)
are available that allow external users to compare their own sequences against the data in
the EMBL nucleotide sequence database and other databases.
- The DNA Data Bank of Japan (DDBJ) is a biological database that collects DNA
sequences. It was established in 1986.
- Link: https://www.ddbj.nig.ac.jp
- It is located at the National Institute of Genetics (NIG) in Shizuoka Prefecture,
Japan.
- DDBJ is a member of the International Nucleotide Sequence Database
Collaboration (INSDC).
- It exchanges its data daily with the European Molecular Biology Laboratory at the European
Bioinformatics Institute and with GenBank at the National Center for Biotechnology
Information.
- As a member of INSDC, the DDBJ Center collects nucleotide sequence data and provides
freely available nucleotide sequence data and a supercomputer system to support
research activities in life science.
Features:
- Group 1: biological source of the sequence (source). The "source" feature is
mandatory for all entries in the international nucleotide database. ...
- Group 2: biological function features of the region. ...
- Group 3: differences and/or changes in the sequence data.
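The three feature groups can be illustrated with a small sketch that reads a feature table and sorts its feature keys into groups. Everything here is an assumption made for illustration: the sample table is hand-written in the usual flat-file layout (feature key in column 6, location in column 22) and is not a real entry, and the key-to-group mapping covers only a handful of keys and is not an official list.

```python
# Hypothetical, hand-written feature table (not a real database entry).
SAMPLE_FEATURES = """\
     source          1..300
                     /organism="Raphanus sativus"
                     /mol_type="genomic DNA"
     CDS             10..210
                     /product="hypothetical protein"
     variation       120
                     /replace="t"
"""

# Illustrative assignment of a few feature keys to the three groups.
# This mapping is an assumption for the example, not an official list.
GROUP_OF_KEY = {
    "source": 1,
    "CDS": 2,
    "gene": 2,
    "variation": 3,
    "misc_difference": 3,
}


def parse_features(text):
    """Return a list of (key, location, qualifiers) tuples."""
    features = []
    for line in text.splitlines():
        if not line.strip():
            continue
        if line.startswith("     ") and line[5] != " ":
            # A new feature: the key starts in column 6, the location follows.
            key, location = line.split()[:2]
            features.append((key, location, {}))
        elif features and line.strip().startswith("/"):
            # A qualifier line such as /organism="..."
            name, _, value = line.strip()[1:].partition("=")
            features[-1][2][name] = value.strip('"')
    return features


def group_features(features):
    """Sort feature keys into groups 1, 2 and 3 using GROUP_OF_KEY."""
    groups = {1: [], 2: [], 3: []}
    for key, _location, _qualifiers in features:
        group = GROUP_OF_KEY.get(key)
        if group is not None:
            groups[group].append(key)
    return groups


if __name__ == "__main__":
    parsed = parse_features(SAMPLE_FEATURES)
    groups = group_features(parsed)
    print(groups)
    # Group 1 (source) is mandatory, so an entry without it is incomplete.
    print("has mandatory source feature:", bool(groups[1]))
```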
| Data type | Organism | Accession numbers for annotated sequences (number of entries) | Accession numbers for raw reads |
|---|---|---|---|
| Genome | Radish (Raphanus sativus cv. Aokubi S-h) | WGS: BAOO01000001-BAOO01072909 (72,909 entries); scaffold CON: DF196826-DF236948 (40,123 entries) | DRR012610-DRR012624 |
| | Soybean (Glycine max cv. Enrei) | BBNX02000001-BBNX02108601 (108,601 entries) | DRR021740-DRR021744 |
| | Common marmoset (Callithrix jacchus) | WGS: BBXK01000001-BBXK01109198 (109,198 entries); scaffold CON: DG000097-DG000120 (24 entries); GSS: LB274659-LB427105 (152,447 entries) | DRR036754-DRR036764 |

List of notable data sets released from the DNA Data Bank of Japan (DDBJ) sequence databases from June 2015 to May 2016
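The entry counts quoted with each accession range follow directly from the range endpoints: an accession number is a letter prefix followed by digits, so the number of entries is the difference between the numeric parts plus one. The sketch below checks this for the ranges listed here. The helper names split_accession and count_entries are hypothetical, and the sketch assumes both endpoints share the same prefix and, for WGS accessions, the same two-digit version.

```python
import re


def split_accession(accession):
    """Split an accession such as 'DF196826' into ('DF', 196826)."""
    match = re.fullmatch(r"([A-Z]+)(\d+)", accession)
    if match is None:
        raise ValueError(f"unexpected accession format: {accession!r}")
    return match.group(1), int(match.group(2))


def count_entries(accession_range):
    """Count the accessions in an inclusive range like 'DF196826-DF236948'."""
    first, last = accession_range.split("-")
    first_prefix, first_number = split_accession(first)
    last_prefix, last_number = split_accession(last)
    if first_prefix != last_prefix:
        raise ValueError("range endpoints have different prefixes")
    # For WGS accessions the leading version digits (e.g. '01') are part of
    # the number; they cancel out as long as both endpoints share them.
    return last_number - first_number + 1


if __name__ == "__main__":
    for accession_range in (
        "BAOO01000001-BAOO01072909",
        "DF196826-DF236948",
        "BBNX02000001-BBNX02108601",
        "DG000097-DG000120",
        "LB274659-LB427105",
    ):
        print(accession_range, count_entries(accession_range))
```

The same calculation applied to the raw-read ranges gives 15, 5 and 11 runs respectively.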
- Hosted at the National Institute of Genetics.
- Data come mainly from scientists in Japan, and also from researchers all over the world; DDBJ
shares this nucleotide data with EMBL and GenBank.
- It is officially certified to collect nucleotide sequences from researchers and to issue
internationally recognized accession numbers to data submitters.
- About 99% of the nucleotide data that Japanese researchers contribute to INSDC are submitted through DDBJ.
- This database plays a major role in improving the quality of INSDC.
- Each database entry includes details of the sequence, submitter details, bibliographic
references, biological significance, and the scientific name and taxonomy of the organism.
- Features that identify coding regions, transcription units, mutation sites, etc. are displayed
in a feature table.
Major activities of the database:
- Providing internationally recognized accession numbers to sequences.
- Bioinformatics database management and developing tools for the analysis and visualization of
biological data.
- Conducting courses for beginners to reduce the complexity of biological data analysis.
Primary Databases.pptx