DATA RETRIEVAL SYSTEM
Text-based Database Searching
Submitted By:
Dr. Shikha Thakur
Assistant Professor (Guest Faculty)
TCSC
Mumbai
Maharashtra
• The amount of biologically relevant data accessible via the WWW is
increasing at a very rapid rate.
• It is important for scientists to have easy and efficient ways of wading
through the data and finding what is important for their research.
• Knowing how to access and search for information in the database is
essential.
Depending on the type of data at hand, there are
two basic ways of searching:
• Using descriptive words to search text databases.
• Using a nucleotide or protein sequence to search sequence
databases.
Text-based Database Searching
• There are three important data retrieval systems of particular
relevance to molecular biologists:
• Entrez (at NCBI), where entries are retrieved by GI (GenInfo Identifier) or accession number
• Sequence Retrieval System, SRS (at EBI)
• DBGET/LinkDB (at GenomeNet, Japan)
• The advantage of these retrieval systems is that they not only return
matches to a query, but also provide handy pointers to additional
important information in related databases.
Text-based Database Searching
• The three systems differ in the databases they search and the links
they provide to other information.
• In using any of these systems, queries can be as simple as entering
the accession number of a newly published sequence or as complex
as searching multiple database fields for specific terms.
Text-Based Database Searching
• Basic Search Concepts
• Boolean Search – An advanced query that combines two or more terms
with the Boolean operators AND, OR, and NOT; the default is AND.
• Broadening the Search – If the results of a search produce no useful
entries, change or remove terms.
• Narrowing the Search – If the results of a search produce too many
entries, add more specific terms or restrict the search to particular fields.
• Proximity Searching – To search with multiword terms or phrases, place
quotes around the terms.
• Wild Card – A wildcard character (*) prepended or appended to a search
term makes the search less specific, e.g., to look for all authors whose
last name begins with Zav, search using Zav*.
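The search concepts above can be sketched with a toy example. This is only an illustration under stated assumptions: the records, their identifiers, and the helper `hits` are invented here, and none of the three retrieval systems is claimed to work this way internally.

```python
from fnmatch import fnmatchcase

# Invented toy records: each ID maps to the lowercase words indexed for it.
RECORDS = {
    "rec1": {"ornithine", "carbamoyltransferase", "zavala"},
    "rec2": {"ornithine", "decarboxylase", "zavitz"},
    "rec3": {"arginine", "deiminase", "smith"},
}

def hits(term):
    """Return IDs of records with a word matching term (* acts as a wildcard)."""
    return {rid for rid, words in RECORDS.items()
            if any(fnmatchcase(word, term) for word in words)}

both = hits("ornithine") & hits("carbamoyltransferase")    # AND narrows
either = hits("carbamoyltransferase") | hits("deiminase")  # OR broadens
without = hits("ornithine") - hits("decarboxylase")        # NOT excludes
wild = hits("zav*")                                        # wildcard broadens

print(sorted(both), sorted(either), sorted(without), sorted(wild))
```

Adding a term with AND shrinks the result set, while OR and the wildcard enlarge it, which is the practical meaning of narrowing and broadening a search.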
Entrez
• Entrez – is a molecular biology database and retrieval system
developed by the National Center for Biotechnology Information
(NCBI).
• It is an entry point for exploring distinct but integrated databases.
• (http://www.ncbi.nlm.nih.gov/Entrez/)
Entrez
• The Entrez system provides access to:
• Nucleotide sequence databases – GenBank/DDBJ/EMBL
• Protein sequence databases – Swiss-Prot, PIR, PRF, PDB, and translated
protein sequences from DNA sequence databases.
• Genome and chromosome mapping data
• Molecular Modeling Database (MMDB) of 3-D structures.
• Literature database, PubMed – Provides excellent and easy access to
MEDLINE and pre-MEDLINE articles.
• Taxonomy database – Allows retrieval of DNA and protein sequences for
any taxonomic group.
• Specialized Databases – OMIM, dbSNP, UniSTS, etc.
Entrez
• The most valuable feature of Entrez is its exploitation of the concept
of ‘neighbouring’.
• Neighbouring allows related articles in different databases to be linked
to each other, whether or not they are cross-referenced directly.
• Neighbours and links are listed in the order of similarity to the query.
• The similarity is based on pre-computed analysis of sequences,
structures and the literature.
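As a rough sketch of how precomputed neighbours might be listed in order of similarity, consider the following. The table, the entry names, and the scores are invented for illustration; Entrez computes and stores its own similarity data, and this is not its implementation.

```python
# Hypothetical precomputed neighbour table: entry ID -> [(neighbour ID, score)].
# All IDs and scores are made up for this example.
NEIGHBOURS = {
    "entryA": [("entryC", 0.42), ("entryB", 0.91), ("entryD", 0.67)],
}

def neighbours(entry_id, limit=None):
    """Return neighbour IDs of entry_id, most similar first."""
    ranked = sorted(NEIGHBOURS.get(entry_id, []),
                    key=lambda pair: pair[1], reverse=True)
    ids = [neighbour_id for neighbour_id, _score in ranked]
    return ids if limit is None else ids[:limit]

print(neighbours("entryA"))
```

Because the scores are computed ahead of time, answering a neighbour request is only a lookup and a sort, with no sequence or text comparison at query time.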
Entrez
• One particularly useful feature in Entrez is Batch Entrez:
• The ability to retrieve large sets of data based on some criterion and
to download them to a local computer.
• This allows the sequences to be worked on using analytical tools
available on the local computer.
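Preparing the identifier list for a batch retrieval can be sketched as follows. The helper names `read_id_list` and `batches` are invented for this example, and the one-identifier-per-line layout is an assumption about the input file rather than a documented requirement.

```python
def read_id_list(text):
    """Return unique, non-empty identifiers from a one-per-line list, in order."""
    seen, ids = set(), []
    for line in text.splitlines():
        ident = line.strip()
        if ident and ident not in seen:
            seen.add(ident)
            ids.append(ident)
    return ids

def batches(ids, size):
    """Split a list of identifiers into consecutive chunks of at most size items."""
    return [ids[i:i + size] for i in range(0, len(ids), size)]

# The two accession numbers are the ones used elsewhere in these slides.
ids = read_id_list("P04391\nJ00231\n\nP04391\n")
print(ids)
print(batches(ids, 1))
```

Removing blanks and duplicates before submission keeps the request small, and chunking makes it easy to process a long list in pieces on the local computer.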
Entrez Features
1. Entrez Global Query – Search a subset of Entrez databases.
2. Batch Entrez – Upload a file of GI or accession numbers to retrieve
sequences.
3. Making Links Entrez – Linking to PubMed and GenBank.
4. E-Utilities – Entrez programming utilities.
5. LinkOut – External links to related resources.
6. Cubby – Provides a stored-search feature for saving and updating
searches, and lets you customize your LinkOut display.
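The E-Utilities are reached through ordinary URLs, so a request can be sketched by building one. The example below only constructs an EFetch URL and does not contact NCBI; the function name `efetch_url` is invented here, and the parameter values shown (`protein`, `fasta`, `text`) are assumptions that should be checked against the current NCBI documentation and usage policy.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"

def efetch_url(db, ids, rettype="fasta", retmode="text"):
    """Build an EFetch URL for one or more identifiers (no request is sent)."""
    params = {
        "db": db,              # Entrez database to query
        "id": ",".join(ids),   # comma-separated identifiers
        "rettype": rettype,    # requested record type
        "retmode": retmode,    # requested output format
    }
    return f"{EUTILS_BASE}/efetch.fcgi?{urlencode(params)}"

url = efetch_url("protein", ["P04391"])
print(url)
```

Passing the resulting URL to `urllib.request.urlopen` would perform the actual retrieval; that step is left out so the sketch runs without network access.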
SRS
• The Sequence Retrieval System (SRS) – A network browser for
databases in molecular biology.
• It is a powerful sequence information indexing, search and retrieval
system (http://srs.ebi.ac.uk/)
SRS
• SRS is a homogeneous interface to over 80 biological databases
developed at the European Bioinformatics Institute (EBI) at Hinxton,
UK.
• The types of databases included are sequence and sequence-related,
metabolic pathways, transcription factors, application results (e.g.,
BLAST), protein 3D structure, genome, mapping, mutations, and
locus-specific mutations.
• One can access and query their contents and navigate among them.
SRS
• The Web page listing all the databases links each one to a description
page, which includes the date of its last update.
• One can select one or more databases to search before entering the
query.
• Over 30 versions of SRS are currently running on the WWW. Each
includes a different subset of databases and associated analytical
tools.
SRS
• SRS Features:
• SRS databases are well indexed, thus reducing the search time for the
large number of potential databases.
• SRS allows any flat-file database to be indexed to any other. The
advantage is that the derived indices can be searched rapidly, allowing
users to retrieve, link, and access entries from all the interconnected
resources.
• The system has the particular strength that it can be readily
customized to use any defined set of databanks.
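The idea of indexing a flat file so that searches avoid rescanning it can be sketched as below. The two entries, the simplified two-letter tag layout, and the function `build_index` are invented for illustration; this is not SRS's own indexing code or file grammar.

```python
from collections import defaultdict

# Two invented flat-file entries in a simplified "TAG   value" layout.
FLAT_FILE = """\
ID   ENTRY1
OS   Escherichia coli
KW   transferase
//
ID   ENTRY2
OS   Rhizobium meliloti
KW   transferase
//
"""

def build_index(text):
    """Map (field tag, lowercase word) -> set of entry IDs."""
    index = defaultdict(set)
    entry_id = None
    for line in text.splitlines():
        if line == "//":              # end-of-entry marker
            entry_id = None
            continue
        tag, value = line[:2], line[5:].strip()
        if tag == "ID":
            entry_id = value
        elif entry_id is not None:
            for word in value.lower().split():
                index[(tag, word)].add(entry_id)
    return index

index = build_index(FLAT_FILE)
print(sorted(index[("KW", "transferase")]))
print(sorted(index[("OS", "rhizobium")]))
```

Once the index exists, a field-specific query is a dictionary lookup, which is why well-indexed databases reduce search time.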
SRS
• Simple SRS queries
• By accession number
• Query on accession number: J00231
• By a simple author or organism: Ausubel and Rhizobium
• Boolean relations between keywords: and, or, but not
SRS
• Contd…
• Searching by dates: 01-Jan-1995:31-Dec-1995.
• Searching by size: 400:600
• Using hypertext links in an entry: Medline, Swiss-Prot and PDB
entries can be linked from within the EMBL database.
• Display of molecules via RasMol plug-in
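The date and size queries above use a `low:high` range notation. A minimal sketch of interpreting such ranges follows; the entries are invented, the function names are hypothetical, and treating both ends of the range as inclusive is an assumption rather than documented SRS behaviour.

```python
from datetime import date

MONTHS = {name: number for number, name in enumerate(
    ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
     "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"], start=1)}

def parse_date(text):
    """Turn '01-Jan-1995' into a date object."""
    day, month, year = text.split("-")
    return date(int(year), MONTHS[month], int(day))

def parse_size_range(text):
    """Turn '400:600' into a (low, high) pair of integers."""
    low, high = text.split(":")
    return int(low), int(high)

def parse_date_range(text):
    """Turn '01-Jan-1995:31-Dec-1995' into a (start, end) pair of dates."""
    start, end = text.split(":")
    return parse_date(start), parse_date(end)

# Invented entries: (identifier, sequence length, entry date).
ENTRIES = [
    ("E1", 350, date(1995, 3, 14)),
    ("E2", 512, date(1995, 7, 1)),
    ("E3", 580, date(1996, 2, 9)),
]

low, high = parse_size_range("400:600")
start, end = parse_date_range("01-Jan-1995:31-Dec-1995")
matches = [ident for ident, size, day in ENTRIES
           if low <= size <= high and start <= day <= end]
print(matches)
```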
DBGET
• DBGET/LinkDB is an integrated bioinformatics database retrieval
system at GenomeNet, developed by the Institute for Chemical
Research, Kyoto University, and the Human Genome Center of the
University of Tokyo.
DBGET
• DBGET – is used to search and extract entries from a wide range of
molecular biology databases.
• LinkDB – is used to compute links between entries in different
databases.
• It is designed to be a network distributed database system with an
open architecture, which is suitable for incorporating local databases
or establishing a server environment.
• http://www.genome.ad.jp/dbget/
DBGET
• DBGET/LinkDB is integrated with other search tools, such as BLAST,
FASTA and MOTIF, to conduct further retrievals instantly.
• DBGET provides access to about 20 databases, which are queried one
at a time. After querying one of these databases, DBGET presents
links to associated information in addition to the list of results.
• A unique feature of DBGET is its connection with the Kyoto
Encyclopedia of Genes and Genomes (KEGG) database – a database of
metabolic and regulatory pathways.
DBGET
• DBGET has three basic commands (or three basic modes in the Web
version), bfind, bget, and blink, to search and extract database
entries.
• bfind – Searches for entries by keywords.
• bget – Retrieves the database entries specified by the
combination dbname:identifier.
• blink – Searches LinkDB for entries related to a given entry.
• A notable feature of DBGET, different from other text search systems, is
that no keyword indexing is performed when a database is installed or
updated.
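The division of labour between bget and bfind can be sketched with Python stand-ins. These functions only borrow the command names; they are not the real DBGET tools, and the database `toydb` and its entries are invented. The linear scan in `bfind` mirrors the point that no keyword index is built.

```python
# Invented toy databases: dbname -> {identifier: {"definition": text}}.
DATABASES = {
    "toydb": {
        "X0001": {"definition": "ornithine carbamoyltransferase"},
        "X0002": {"definition": "arginine deiminase"},
    },
}

def bget(spec):
    """Retrieve one entry given a 'dbname:identifier' string."""
    dbname, identifier = spec.split(":", 1)
    return DATABASES[dbname][identifier]

def bfind(dbname, keyword):
    """Return identifiers whose definition contains keyword (linear scan, no index)."""
    keyword = keyword.lower()
    return [identifier for identifier, entry in DATABASES[dbname].items()
            if keyword in entry["definition"].lower()]

print(bget("toydb:X0001")["definition"])
print(bfind("toydb", "ornithine"))
```

A typical session would use `bfind` to discover identifiers and then `bget` to pull the full entries.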
DBGET
• Selected fields are extracted and stored in separate files for bfind
searches.
• The lack of keyword indexing is an advantage for rapid database
updates, but sometimes a disadvantage for elaborate searching.
• To supplement bfind, the full text search STAG is provided.
• blink – The LinkDB search. Once entries of interest are found, it can
be used to retrieve related entries in a given database or all databases
in GenomeNet.
Example
• Let’s consider an example to show how each system can be used to
retrieve the SwissProt entry P04391, an ornithine
carbamoyltransferase protein in Escherichia coli.
• In Entrez, enter the accession number P04391 in the protein database
query form and view the entry and its associated links and neighbours.
Example - SRS
• In SRS, first select the SwissProt database, then enter P04391 in the
query form and, once the entry is displayed, search for links to other
related databases.
Example – LinkDB
• However, the fastest way of gathering the related information for this
entry is to search LinkDB.
• By simply entering swissprot:P04391, a list of all links to all the
related databases is displayed.
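A link lookup of this kind can be sketched as a table of entry pairs. The linked identifiers and database names below (`toygenes`, `toystructures`, `toypathways`) are invented placeholders, so the real LinkDB results for swissprot:P04391 will differ; the function `blink` only borrows the command name, and treating links as navigable in both directions is an assumption.

```python
from collections import defaultdict

# Invented link pairs between 'dbname:identifier' entries.
LINKS = [
    ("swissprot:P04391", "toygenes:g1"),
    ("swissprot:P04391", "toystructures:s1"),
    ("toygenes:g1", "toypathways:p1"),
]

def build_link_table(pairs):
    """Map each entry to the set of entries it is directly linked with."""
    table = defaultdict(set)
    for left, right in pairs:
        table[left].add(right)
        table[right].add(left)   # assumption: links can be followed both ways
    return table

def blink(table, entry, target_db=None):
    """Return entries linked to entry, optionally restricted to one database."""
    linked = table.get(entry, set())
    if target_db is not None:
        linked = {item for item in linked if item.split(":", 1)[0] == target_db}
    return sorted(linked)

table = build_link_table(LINKS)
print(blink(table, "swissprot:P04391"))
print(blink(table, "swissprot:P04391", target_db="toygenes"))
```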
Thank You
