SlideShare a Scribd company logo
1 of 75
DNA Data Bank of Japan (DDBJ)
INTRODUCTION
 The DNA Data Bank of Japan is a public
database of nucleotide sequences established at
the National Institute of Genetics (NIG).
DDBJ, http://www.ddbj. nig.ac.jp
HOMEPAGE OF DDBJ
HISTORY
 Since 1987, the DDBJ has been collecting
annotated nucleotide sequences as its traditional
database service.
 This endeavor has been conducted in
collaboration with GenBank at the National
Center for Biotechnology Information (NCBI)
and with European Molecular Biology
Laboratory (EMBL) at the European
Bioinformatics Institute (EBI). The collaborative
framework is called the International Nucleotide
Sequence Database Collaboration (INSDC).
 DDBJ collects and edit about 20% of the data
released by these 3 International databases.
 DDBJ began data bank activities in 1986 at NIG
and remains the only nucleotide sequence data
bank in Asia. Although DDBJ mainly receives its
data from Japanese researchers, it can accept
data from contributors from any other country.
DATA UNIT
The nucleotide sequence database is a set of
data units called ENTRIES.
In addition to the nucleotide sequence itself, each
entry contains information about the researcher
who determined the sequence plus related
references, organism and gene function and
features.
DIVISIONS OF DDBJ ENTERIES
 DDBJ classifies entries into 21 divisions as
below;
a: TAXONOMIC DIVISIONS
1. HUM Human
2. PRI Primates (other than human)
3. ROD Rodents
4. MAM Mammals (other than primates and rodents)
5. VRT Vertebrates (other than mammals)
6. INV Invertebrates (animals other than vertebrates)
7. PLN Plants, Fungi, Plastids (eukaryotes other than
animals)
8. BCT Bacteria (including both Eubacteria and
Archaea)
9. VRL Viruses
10. PHG Bacteriophages
b: OTHER DIVISIONS
1. PAT Sequence Data Related To Patent Application
2. ENV Sequences Obtained Via Environmental
Sampling Methods
3. SYN Synthetic Constructs; Artificially Constructed
Sequences
4. EST Expressed Sequence Tags; Short Single Pass
Cdna Sequences
5. TSA Transcriptome Shotgun Assemblies;
Assembled mRNA Sequences
6. GSS Genome Survey Sequences; Short Single
Pass Genomic Sequences
7. HTC High Throughput cDNA Sequences
8. HTG High Throughput Genomic Sequences
9. STS Sequence Tagged Sites
10. UNA The Data Not Annotated
11. CON Contig / Constructed
DATA RETREIVAL IN DDBJ
 For data retrieval on DDBJ, click on SEARCH
AND ANALYSIS on homepage.
 A window tab will open, with various searching
options.
1- GETENTRY
 DDBJ annotated/assembled data retrieval by
accession numbers.
 KEYWORD= ACCESSSION NUMBER thus, only
accession no. is used for the sequence search in
this method of data retrieval.
 In the ID box, write down the exact accession no.
of the sequence you want to search.
 The database is by default on
DDBJ/EMBL/GENBANK.
 OUTPUT FORMAT of any type can be chosen
based on user requirement.
 These formats are shown in the picture below:
 Choose any format and click on SEARCH.
 The sequence in formation will open in a new
window.
Flatfile format of DDBJ
Total nt seq FASTA format
CDS amino acid seq FASTA
format
CDS nt seq FASTA format
INSD-XML_v1.4 format
 Same steps are repeated for the search of
PROTEIN SEQUENCE on getentry.
 PROTEIN DATABASE are chosen which are
UNIPROT, PDB, DAD and PATENT.
 Select OUTPUT FORMAT and click on SEARCH.
2- ARSA
 DDBJ annotated/assembled data retrieval by
accession numbers and keywords.
 ACCESSION NUMBERS and KEYWORDS can
both be used for this method of sequence search.
 Put in the keywords or accession no. in the
search bar.
 The more the keywords, the narrower will be the
search.
 A LIST OF ENTERIES based on the search will
be provided below.
 Flatfile, XML and FASTA formats are provided ,
flatfile being the default.
 Click on any sequence or view/download multiple
sequences by selecting more than one sequence.
3- TX SEARCH
 Taxonomy database search of DDBJ.
 Type in the organism name and click on
SEARCH.
 A complete Lineage of the organism will be
given.
4- BLAST
 A blast homepage is also provided on search and
analysis page.
 The BLAST OUTPUT is different in formattinf
from that of provided at NCBI homepage’s
BLAST, the results however are the same.
 The SIGNIFICANT ALIGNMENTS table is first
thing given on DDBJ which is the second on
NCBI BLAST, and the HITS are second on DDBJ
with IDENTIFICATION LINE written below each
hit with its SCORE.
 And lastly the alignment of query with each hit is
given, along with several details.
5- CLUSTALW
 For Multiple alignment and phylogenetic tree-
making, ClustalW is also provided on DDBJ.
 The ClustalW output is almost same, however a
small detail is different.
 In DDBJ , ClustalW only give “*” identifier i.e
only fully conserved sequences are mentioned in
form of symbols.
6- GGGenome
 An ultrafast sequence search, in which you can
type any sequence and it will provide you with the
information from which organism, chromosome
no the sequence belongs to. It will give the exact
base pair no of the sequence too.
 Type in or paste the query sequencein the
search bar,choose the organism and hit
SEARCH.
7- GENDOO
 Functional profiling of gene and disease features
for omics analysis.
 Gendoo provides keywords including diseases,
drugs and biological phenomena related to genes
and diseases of interests.
 Type in the disease/gene,or their IDs to get all
the relevent genes/diseases associated with
them.
DDBJ STATISTICS
 DDBJ statistics gives information about the
releases and their records on DDBJ.
 It also provides useful facts about DDBJ totl data
volume, its contribution in INSDC, proportion of
each division and much more.
 Some important statistical information provided
by DDBJ are:
Latest Release Information
Data Category Distribution At Each
Archive
Organism Ranking By Bases
Journal Ranking By Counts In Flat
File
DATA SUBMISSION
 When you wish to publicize your sequence
through DDBJ, and your sequence
is acceptable for DDBJ, you can submit your
sequence to DDBJ, even if you have no plan to
publication of any research paper related to the
sequence.
 Once released, the nucleotide sequences
submitted to INSDC including DDBJ are available
for everyone.
(A) Nucleotide Sequence Submission
System
 DDBJ generally recommends you to
use Nucleotide Sequence Submission System
(B) Mass Submission System (MSS)
DDBJ recommend the use of MSS if:
 The submission consists of large number of
sequences (entries); greater than 1024,
 The submission involves long (greater than 500
kb) nucleotide sequences which result in a
complex submission containing many features
(greater than 30 in an entry) as in the case of
genome data, or
 The submission cannot be handled by Nucleotide
Sequence Submission System.
Sequence Data Transition
Assignment and Notification of
Accession Number
 We inform an accession number (unique number
assigned by the International Nucleotide
Sequence Database Collaboration) to the Contact
Person whose E-mail address is entered in the
"Contact person E-mail address" field.
This notification is normally sent within five
business days after receipt of the data.
Submitter
 Submitter of the entry is the person who have
responsibility to the submitted data in the entry, in
principle.
Only submitter can update his/her entry. Basically,
submitter takes responsibility to reply inquiry from
DDBJ or DDBJ users about his/her data.
Contact Person
 "Contact person" is the person who is responsible
about the descriptions of the entry and has a duty
as a representative to correspond with DDBJ and
its users."Contact person" has to be one of the
submitters, in principle.
 "Contact person" is the person who will make
contact with DDBJ and its users about the entry,
in principle. So, do not block E-mails from DDBJ.
 When user wishes to contact to the submitter(s)
of an entry of your interest, please contact DDBJ
with the inquiry form with reasons briefly, then we
will forward the message to the submitter(s).
Right of Entry Update
 Only submitters of the entry can update and
modify the entry. After data modification, the
submitter of the entry can also specify either of
immediate release or hold until publication.
However, in principle, if the entry have already
been open to the public, the entry can not restore
hold.
GROWTH IN DDBJ DATA
 When DDBJ first released its nucleotide
sequence database in July 1987, it consisted of
only 66 entries and 108,970 base pairs. In recent
years INSDC databases are increasing at annual
rate of 130-150%.
 Between June 2014 and May 2015, the DDBJ
periodical release increased by 11,879,389
entries and 31,427,753,923 base pairs.
NIG SUPERCOMPUTER
 The NIG supercomputer as a sequence
analytical platform. The DDBJ Center operates
the NIG supercomputer which specializes in
analysis of large-scale sequence data. The NIG
supercomputer offers computational
infrastructure for the construction of DDBJ
databases and analysis services, and provides
researchers with a large-scale data analysis and
supercomputing environment.
 The NIG supercomputer is currently composed of
two computer systems:
(i) the Phase 1 system which was introduced in
2012 .
(ii) the Phase 2 system which went into production
in 2014.

More Related Content

What's hot (20)

EMBL
EMBLEMBL
EMBL
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
Cath
CathCath
Cath
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Prosite
PrositeProsite
Prosite
 
Scop database
Scop databaseScop database
Scop database
 
BLAST
BLASTBLAST
BLAST
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Protein database
Protein databaseProtein database
Protein database
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Biological database
Biological databaseBiological database
Biological database
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 

Similar to DNA data bank of japan (DDBJ)

databases.pptx
databases.pptxdatabases.pptx
databases.pptxifra27
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbgetSurendraKumar338
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acidsvibhakumari12
 
Bioinformatics Lab01.docx
Bioinformatics Lab01.docxBioinformatics Lab01.docx
Bioinformatics Lab01.docxIvankaGetnet
 
Bioinformatics Final Report
Bioinformatics Final ReportBioinformatics Final Report
Bioinformatics Final ReportShruthi Choudary
 
Primary Databases.pptx
Primary Databases.pptxPrimary Databases.pptx
Primary Databases.pptxSwarup Malakar
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdfSrimathideviJ
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS
 

Similar to DNA data bank of japan (DDBJ) (20)

databases.pptx
databases.pptxdatabases.pptx
databases.pptx
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbget
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Intro to databases
Intro to databasesIntro to databases
Intro to databases
 
Genomic Databases-.pptx
Genomic Databases-.pptxGenomic Databases-.pptx
Genomic Databases-.pptx
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acids
 
Article
ArticleArticle
Article
 
Ncbi
NcbiNcbi
Ncbi
 
Bioinformatics Lab01.docx
Bioinformatics Lab01.docxBioinformatics Lab01.docx
Bioinformatics Lab01.docx
 
Bioinformatics Final Report
Bioinformatics Final ReportBioinformatics Final Report
Bioinformatics Final Report
 
Blasta
BlastaBlasta
Blasta
 
Primary Databases.pptx
Primary Databases.pptxPrimary Databases.pptx
Primary Databases.pptx
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdf
 
BLAST
BLASTBLAST
BLAST
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 

Recently uploaded

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 

Recently uploaded (20)

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 

DNA data bank of japan (DDBJ)

  • 1. DNA Data Bank of Japan (DDBJ)
  • 2. INTRODUCTION  The DNA Data Bank of Japan is a public database of nucleotide sequences established at the National Institute of Genetics (NIG). DDBJ, http://www.ddbj. nig.ac.jp
  • 4. HISTORY  Since 1987, the DDBJ has been collecting annotated nucleotide sequences as its traditional database service.  This endeavor has been conducted in collaboration with GenBank at the National Center for Biotechnology Information (NCBI) and with European Molecular Biology Laboratory (EMBL) at the European Bioinformatics Institute (EBI). The collaborative framework is called the International Nucleotide Sequence Database Collaboration (INSDC).
  • 5.
  • 6.  DDBJ collects and edit about 20% of the data released by these 3 International databases.
  • 7.  DDBJ began data bank activities in 1986 at NIG and remains the only nucleotide sequence data bank in Asia. Although DDBJ mainly receives its data from Japanese researchers, it can accept data from contributors from any other country.
  • 8. DATA UNIT The nucleotide sequence database is a set of data units called ENTRIES. In addition to the nucleotide sequence itself, each entry contains information about the researcher who determined the sequence plus related references, organism and gene function and features.
  • 9. DIVISIONS OF DDBJ ENTERIES  DDBJ classifies entries into 21 divisions as below;
  • 10. a: TAXONOMIC DIVISIONS 1. HUM Human 2. PRI Primates (other than human) 3. ROD Rodents 4. MAM Mammals (other than primates and rodents) 5. VRT Vertebrates (other than mammals) 6. INV Invertebrates (animals other than vertebrates) 7. PLN Plants, Fungi, Plastids (eukaryotes other than animals) 8. BCT Bacteria (including both Eubacteria and Archaea) 9. VRL Viruses 10. PHG Bacteriophages
  • 11. b: OTHER DIVISIONS 1. PAT Sequence Data Related To Patent Application 2. ENV Sequences Obtained Via Environmental Sampling Methods 3. SYN Synthetic Constructs; Artificially Constructed Sequences 4. EST Expressed Sequence Tags; Short Single Pass Cdna Sequences 5. TSA Transcriptome Shotgun Assemblies; Assembled mRNA Sequences 6. GSS Genome Survey Sequences; Short Single Pass Genomic Sequences 7. HTC High Throughput cDNA Sequences
  • 12. 8. HTG High Throughput Genomic Sequences 9. STS Sequence Tagged Sites 10. UNA The Data Not Annotated 11. CON Contig / Constructed
  • 13. DATA RETREIVAL IN DDBJ  For data retrieval on DDBJ, click on SEARCH AND ANALYSIS on homepage.  A window tab will open, with various searching options.
  • 14.
  • 15.
  • 16. 1- GETENTRY  DDBJ annotated/assembled data retrieval by accession numbers.  KEYWORD= ACCESSSION NUMBER thus, only accession no. is used for the sequence search in this method of data retrieval.
  • 17.
  • 18.  In the ID box, write down the exact accession no. of the sequence you want to search.  The database is by default on DDBJ/EMBL/GENBANK.  OUTPUT FORMAT of any type can be chosen based on user requirement.
  • 19.  These formats are shown in the picture below:
  • 20.  Choose any format and click on SEARCH.  The sequence in formation will open in a new window.
  • 22. Total nt seq FASTA format
  • 23. CDS amino acid seq FASTA format
  • 24. CDS nt seq FASTA format
  • 26.  Same steps are repeated for the search of PROTEIN SEQUENCE on getentry.  PROTEIN DATABASE are chosen which are UNIPROT, PDB, DAD and PATENT.  Select OUTPUT FORMAT and click on SEARCH.
  • 27.
  • 28. 2- ARSA  DDBJ annotated/assembled data retrieval by accession numbers and keywords.  ACCESSION NUMBERS and KEYWORDS can both be used for this method of sequence search.
  • 29.
  • 30.  Put in the keywords or accession no. in the search bar.  The more the keywords, the narrower will be the search.  A LIST OF ENTERIES based on the search will be provided below.
  • 31.
  • 32.  Flatfile, XML and FASTA formats are provided , flatfile being the default.  Click on any sequence or view/download multiple sequences by selecting more than one sequence.
  • 33. 3- TX SEARCH  Taxonomy database search of DDBJ.  Type in the organism name and click on SEARCH.  A complete Lineage of the organism will be given.
  • 34.
  • 35.
  • 36. 4- BLAST  A blast homepage is also provided on search and analysis page.
  • 37.
  • 38.  The BLAST OUTPUT is different in formattinf from that of provided at NCBI homepage’s BLAST, the results however are the same.  The SIGNIFICANT ALIGNMENTS table is first thing given on DDBJ which is the second on NCBI BLAST, and the HITS are second on DDBJ with IDENTIFICATION LINE written below each hit with its SCORE.
  • 39.
  • 40.
  • 41.  And lastly the alignment of query with each hit is given, along with several details.
  • 42.
  • 43. 5- CLUSTALW  For Multiple alignment and phylogenetic tree- making, ClustalW is also provided on DDBJ.
  • 44.
  • 45.  The ClustalW output is almost same, however a small detail is different.  In DDBJ , ClustalW only give “*” identifier i.e only fully conserved sequences are mentioned in form of symbols.
  • 46.
  • 47. 6- GGGenome  An ultrafast sequence search, in which you can type any sequence and it will provide you with the information from which organism, chromosome no the sequence belongs to. It will give the exact base pair no of the sequence too.
  • 48.
  • 49.  Type in or paste the query sequencein the search bar,choose the organism and hit SEARCH.
  • 50.
  • 51. 7- GENDOO  Functional profiling of gene and disease features for omics analysis.  Gendoo provides keywords including diseases, drugs and biological phenomena related to genes and diseases of interests.
  • 52.
  • 53.  Type in the disease/gene,or their IDs to get all the relevent genes/diseases associated with them.
  • 54.
  • 55. DDBJ STATISTICS  DDBJ statistics gives information about the releases and their records on DDBJ.  It also provides useful facts about DDBJ totl data volume, its contribution in INSDC, proportion of each division and much more.
  • 56.
  • 57.
  • 58.  Some important statistical information provided by DDBJ are:
  • 60. Data Category Distribution At Each Archive
  • 62. Journal Ranking By Counts In Flat File
  • 63. DATA SUBMISSION  When you wish to publicize your sequence through DDBJ, and your sequence is acceptable for DDBJ, you can submit your sequence to DDBJ, even if you have no plan to publication of any research paper related to the sequence.  Once released, the nucleotide sequences submitted to INSDC including DDBJ are available for everyone.
  • 64.
  • 65. (A) Nucleotide Sequence Submission System  DDBJ generally recommends you to use Nucleotide Sequence Submission System
  • 66. (B) Mass Submission System (MSS) DDBJ recommend the use of MSS if:  The submission consists of large number of sequences (entries); greater than 1024,  The submission involves long (greater than 500 kb) nucleotide sequences which result in a complex submission containing many features (greater than 30 in an entry) as in the case of genome data, or  The submission cannot be handled by Nucleotide Sequence Submission System.
  • 68. Assignment and Notification of Accession Number  We inform an accession number (unique number assigned by the International Nucleotide Sequence Database Collaboration) to the Contact Person whose E-mail address is entered in the "Contact person E-mail address" field. This notification is normally sent within five business days after receipt of the data.
  • 69. Submitter  Submitter of the entry is the person who have responsibility to the submitted data in the entry, in principle. Only submitter can update his/her entry. Basically, submitter takes responsibility to reply inquiry from DDBJ or DDBJ users about his/her data.
  • 70. Contact Person  "Contact person" is the person who is responsible about the descriptions of the entry and has a duty as a representative to correspond with DDBJ and its users."Contact person" has to be one of the submitters, in principle.  "Contact person" is the person who will make contact with DDBJ and its users about the entry, in principle. So, do not block E-mails from DDBJ.  When user wishes to contact to the submitter(s) of an entry of your interest, please contact DDBJ with the inquiry form with reasons briefly, then we will forward the message to the submitter(s).
  • 71. Right of Entry Update  Only submitters of the entry can update and modify the entry. After data modification, the submitter of the entry can also specify either of immediate release or hold until publication. However, in principle, if the entry have already been open to the public, the entry can not restore hold.
  • 72. GROWTH IN DDBJ DATA  When DDBJ first released its nucleotide sequence database in July 1987, it consisted of only 66 entries and 108,970 base pairs. In recent years INSDC databases are increasing at annual rate of 130-150%.  Between June 2014 and May 2015, the DDBJ periodical release increased by 11,879,389 entries and 31,427,753,923 base pairs.
  • 73.
  • 74. NIG SUPERCOMPUTER  The NIG supercomputer as a sequence analytical platform. The DDBJ Center operates the NIG supercomputer which specializes in analysis of large-scale sequence data. The NIG supercomputer offers computational infrastructure for the construction of DDBJ databases and analysis services, and provides researchers with a large-scale data analysis and supercomputing environment.
  • 75.  The NIG supercomputer is currently composed of two computer systems: (i) the Phase 1 system which was introduced in 2012 . (ii) the Phase 2 system which went into production in 2014.