SlideShare a Scribd company logo
Welcome to the
presentation
On
BLAST
Introduction
Group Members:
Shahida khatun
MD. Firoz Ahmed
MD. Shariful Islam
Chandrima Das
Shantonu Kumar Roy
Merina Junaki
Ikhtina Afroz
Shanjida Afrin
MST. Shahinur Akter
MD. Ariful Islam Sagar
BLAST
(Basic local alignment search
Tool)
Contents
Definition
Background
Types of BLAST Program
Algorithm
BLAST Input-Output
BLAST search
BLAST Function
Objectives of BLAST
Definition
The Basic Local Alignment Search Tool
(BLAST) for comparing gene and protein
sequences against others in public
databases.
BLAST is a set of sequence comparison
algorithms used to search databases for
optimal local alignments to a query.
Definition
It breaks the query and databases
sequences into fragments and seeks
matches between them.
Nucleic acid/Protein Alignments were
time consuming. Alignments were done
by full alignments by using dynamic
programming. BLAST is 50 times faster
then dynamic programming.
Background
Beginning in the 1970s, scientists began
to accumulate DNA and protein
sequence data at an exponential rate; in
fact, researchers currently have
approximately 97 billion bases
sequenced and over 93 million records.
Amazingly, this sequence data doubles
every 18 months!
Background
Today, one of the most commonly used
tools to examine DNA and protein
sequences is the Basic Local Alignment
Search Tool, also known as BLAST.
BLAST is a computer algorithm that is
available for use online at the National
Center for Biotechnology Information
(NCBI) website and many other sites.
Types of BLAST
 Nucleotide-nucleotide BLAST (blastn)
- This program, given a DNA query,
returns the most similar DNA sequences from
the DNA database that the user specifies.
 Protein-protein BLAST (blastp)
- This program, given a protein query,
returns the most similar protein sequences from
the protein database that the user specifies.
 Position-Specific Iterative BLAST (PSI-
BLAST) (blastpgp)
- This program is used to find distant
relatives of a protein.
Types of BLAST
 Nucleotide 6-frame translation-protein
(blastx)
-This program compares the six-frame
conceptual translation products of a nucleotide
query sequence (both strands) against a protein
sequence database.
 Nucleotide 6-frame translation-nucleotide
6-frame translation (tblastx)
-The purpose of tblastx is to find very
distant relationships between nucleotide
sequences.
Types of BLAST
 Protein-nucleotide 6-frame translation
(tblastn)
-This program compares a protein query
against the all six reading frames of a
nucleotide sequence database.
 Large numbers of query sequences
(megablast)
-When comparing large numbers of input
sequences via the command-line BLAST,
"megablast" is much faster than running BLAST
multiple times.
Types of BLAST
Of these programs, BLASTn and BLASTp are
the most commonly used because they use
direct comparisons, and do not require
translations.
However, since protein sequences are better
conserved evolutionarily than nucleotide
sequences, tBLASTn, tBLASTx, and BLASTx,
produce more reliable and accurate results
when dealing with coding DNA.
BLAST Algorithm
The blast algorithm is fast, accurate and
web-accessible.
It is relatively faster than other sequence
similarity search tools.
Complex BLAST algorithm requires
multiple steps and many parameters.
BLAST Algorithm
An overview of the
BLAST algorithm (a
protein to protein
search) is as follows:
 Remove low-
complexity region or
sequence repeats in
the query sequence.
 Make a k-letter word
list of the query
sequence - Take k=3 for
example, we list the words of
length 3 in the query protein
sequence (k is usually 11 for a
DNA sequence) "sequentially",
until the last letter of the query
sequence is included.
BLAST Algorithm
 List the possible matching words.
 Organize the remaining high-scoring words into an
efficient search tree.
 Repeat step 3 to 4 for each k-letter word in the
query sequence.
 Scan the database sequences for exact matches
with the remaining high-scoring words.
 Extend the exact matches to high-scoring segment
pair (HSP).
BLAST Algorithm
 List all of the HSPs in the database whose score
is high enough to be considered.
 Evaluate the significance of the HSP score.
 Make two or more HSP regions into a longer
alignment.
 Show the gapped Smith-Waterman local
alignments of the query and each of the matched
database sequences.
 Report every match whose expect score is lower
than a threshold parameter E.
BLAST Input-Output
Input
Input sequences
in FASTA or Genbank format.
Output
BLAST output can be delivered in a variety of
formats. These formats include HTML, plain
text, and XML formatting. For NCBI's web-
page, the default format for output is HTML.
 An introduction that tells where the search
occurred and what database and query were
compared
BLAST Output
 A list of the
sequences in the
database containing
segment pairs whose
scores were least
likely to occur by
chance
 Alignments of the
high-scoring segment
pairs showing identical
and similar residues
 A complete list of the
parameter settings
used for the search.
BLAST Output
E-value (expectation value)
 The Expect value (E) is a parameter that
describes the number of hits one can
"expect" to see by chance when searching a
database of a particular size.
 It decreases exponentially as the Score (S) of
the match increases.
 Essentially, the E value describes the random
background noise.
 In general terms the smaller E is the more
likely the match is significant.
BLAST Output
 Default E value for blastn, blastp, blastx
and tblastn is 10
 At this setting, 10 hits with scores equal to
or better than the defined alignment score,
S, are expected to occur by chance. The E-
value can be increased or decreased to
alter the stringency of the search.
 Increase the E value when searching with
a short query, since it is likely to be found
many times by chance in a given database.
BLAST Output
Bit Score
 A bit score is another prominent statistical
indicator used in addition to the E value in
a BLAST output.
 The bit score measures sequence
similarity independent of query sequence
length and database size and is
normalized based on the raw pairwise
alignment score.
BLAST Search
• Go to http://www.ncbi.nlm.nih.gov/
• Select BLAST program
BLAST Search
Selecting the BLAST Database
BLAST Search
 Entering sequence
 Submitting search
BLAST Function
BLAST can be used for several purposes.
These include identifying species,
locating domains, establishing phylogeny,
DNA mapping, and comparison.
Identifying species
-With the use of BLAST, we can
possibly correctly identify a species or find
homologous species. This can be useful, for
example, when we are working with a DNA
sequence from an unknown species.
BLAST Function
Locating domains
- When working with a protein
sequence you can input it into BLAST, to
locate known domains within the sequence of
interest.
Establishing phylogeny
-Using the results received through
BLAST we can create a phylogenetic tree
using the BLAST web-page.
BLAST Function
DNA mapping
-When working with a known species,
and looking to sequence a gene at an
unknown location, BLAST can compare the
chromosomal position of the sequence of
interest, to relevant sequences in the
database
Comparison
-When working with genes, BLAST
can locate common genes in two related
species, and can be used to map
annotations from one organism to another.
Objectives of BLAST
 It is one of the most popular programs for
sequence analysis.
 Enables a researcher to compare a
query sequence with a library or database
of sequence.
 Identify library sequences that resemble
the query sequence above a certain
threshold.
The objective is to find high scoring
ungapped segments among related
sequences.
Objectives of BLAST
 Alignments of the high-scoring segment
pairs showing identical and similar
residues.
 A complete list of the parameter settings
used for the search.
That’s all from our presentation
THANK YOU

More Related Content

What's hot

What's hot (20)

Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformatics
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
BLAST
BLASTBLAST
BLAST
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Rasmol
RasmolRasmol
Rasmol
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Blast
BlastBlast
Blast
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Fasta
FastaFasta
Fasta
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Clustal
ClustalClustal
Clustal
 

Viewers also liked

BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]
BiotechOnline
 
Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)
Asiri Wijesinghe
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformatics
atmapandey
 
Blast Resistant Design
Blast Resistant DesignBlast Resistant Design
Blast Resistant Design
Mithun Pal
 

Viewers also liked (20)

BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]
 
Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
 
Blast fasta 4
Blast fasta 4Blast fasta 4
Blast fasta 4
 
BLAST
BLASTBLAST
BLAST
 
Blast Loading & Its Effect on Structures
Blast Loading & Its Effect on StructuresBlast Loading & Its Effect on Structures
Blast Loading & Its Effect on Structures
 
BLAST and sequence alignment
BLAST and sequence alignmentBLAST and sequence alignment
BLAST and sequence alignment
 
Fasta
FastaFasta
Fasta
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformatics
 
Fasta
FastaFasta
Fasta
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of Nature
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
 
Blast load and its analysis
Blast load and its analysisBlast load and its analysis
Blast load and its analysis
 
Architectural And Structural Design Of Blast Resistant Buildings - PRESENTATION
Architectural And Structural Design Of Blast Resistant Buildings - PRESENTATIONArchitectural And Structural Design Of Blast Resistant Buildings - PRESENTATION
Architectural And Structural Design Of Blast Resistant Buildings - PRESENTATION
 
Blast Resistant Design
Blast Resistant DesignBlast Resistant Design
Blast Resistant Design
 
BLAST
BLASTBLAST
BLAST
 
SandBlast Agent
SandBlast AgentSandBlast Agent
SandBlast Agent
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
BLAST
BLASTBLAST
BLAST
 
K8 bs pa islam
K8 bs pa   islamK8 bs pa   islam
K8 bs pa islam
 

Similar to BLAST (Basic local alignment search Tool)

BLAST : features, types,algorithm, working etc.
BLAST : features, types,algorithm,  working  etc.BLAST : features, types,algorithm,  working  etc.
BLAST : features, types,algorithm, working etc.
Cherry
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
Shruthi Choudary
 

Similar to BLAST (Basic local alignment search Tool) (20)

Blast
BlastBlast
Blast
 
Blasta
BlastaBlasta
Blasta
 
BLAST
BLASTBLAST
BLAST
 
BLAST
BLASTBLAST
BLAST
 
Blast
Blast Blast
Blast
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
 
Lecture 5.pptx
Lecture 5.pptxLecture 5.pptx
Lecture 5.pptx
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignment
 
BLAST Search tool
BLAST Search toolBLAST Search tool
BLAST Search tool
 
Data base searching tool
Data base searching toolData base searching tool
Data base searching tool
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02
 
BLAST : features, types,algorithm, working etc.
BLAST : features, types,algorithm,  working  etc.BLAST : features, types,algorithm,  working  etc.
BLAST : features, types,algorithm, working etc.
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRON
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdfBIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
 

More from Ariful Islam Sagar

Effects of selenium on wheat plants under drought condition
Effects of  selenium on wheat plants under drought conditionEffects of  selenium on wheat plants under drought condition
Effects of selenium on wheat plants under drought condition
Ariful Islam Sagar
 

More from Ariful Islam Sagar (7)

Periodic Table
Periodic TablePeriodic Table
Periodic Table
 
Effects of selenium on wheat plants under drought condition
Effects of  selenium on wheat plants under drought conditionEffects of  selenium on wheat plants under drought condition
Effects of selenium on wheat plants under drought condition
 
September on jessore road
September on jessore roadSeptember on jessore road
September on jessore road
 
Growth dependent identification
Growth dependent identificationGrowth dependent identification
Growth dependent identification
 
Sources of Non-molecular Data Useful in Systematics
Sources of Non-molecular Data Useful in SystematicsSources of Non-molecular Data Useful in Systematics
Sources of Non-molecular Data Useful in Systematics
 
In Vitro Pollination
In Vitro PollinationIn Vitro Pollination
In Vitro Pollination
 
Zygotic Embryo culture
Zygotic Embryo cultureZygotic Embryo culture
Zygotic Embryo culture
 

Recently uploaded

Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
Avinash Rai
 

Recently uploaded (20)

Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General QuizPragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
 
[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation
 
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
 
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxJose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
 
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptxMatatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
 
The Last Leaf, a short story by O. Henry
The Last Leaf, a short story by O. HenryThe Last Leaf, a short story by O. Henry
The Last Leaf, a short story by O. Henry
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
Gyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptxGyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptx
 
Application of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matricesApplication of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matrices
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Research Methods in Psychology | Cambridge AS Level | Cambridge Assessment In...
Research Methods in Psychology | Cambridge AS Level | Cambridge Assessment In...Research Methods in Psychology | Cambridge AS Level | Cambridge Assessment In...
Research Methods in Psychology | Cambridge AS Level | Cambridge Assessment In...
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 

BLAST (Basic local alignment search Tool)

  • 2. Introduction Group Members: Shahida khatun MD. Firoz Ahmed MD. Shariful Islam Chandrima Das Shantonu Kumar Roy Merina Junaki Ikhtina Afroz Shanjida Afrin MST. Shahinur Akter MD. Ariful Islam Sagar
  • 4. Contents Definition Background Types of BLAST Program Algorithm BLAST Input-Output BLAST search BLAST Function Objectives of BLAST
  • 5. Definition The Basic Local Alignment Search Tool (BLAST) for comparing gene and protein sequences against others in public databases. BLAST is a set of sequence comparison algorithms used to search databases for optimal local alignments to a query.
  • 6. Definition It breaks the query and databases sequences into fragments and seeks matches between them. Nucleic acid/Protein Alignments were time consuming. Alignments were done by full alignments by using dynamic programming. BLAST is 50 times faster then dynamic programming.
  • 7. Background Beginning in the 1970s, scientists began to accumulate DNA and protein sequence data at an exponential rate; in fact, researchers currently have approximately 97 billion bases sequenced and over 93 million records. Amazingly, this sequence data doubles every 18 months!
  • 8. Background Today, one of the most commonly used tools to examine DNA and protein sequences is the Basic Local Alignment Search Tool, also known as BLAST. BLAST is a computer algorithm that is available for use online at the National Center for Biotechnology Information (NCBI) website and many other sites.
  • 9. Types of BLAST  Nucleotide-nucleotide BLAST (blastn) - This program, given a DNA query, returns the most similar DNA sequences from the DNA database that the user specifies.  Protein-protein BLAST (blastp) - This program, given a protein query, returns the most similar protein sequences from the protein database that the user specifies.  Position-Specific Iterative BLAST (PSI- BLAST) (blastpgp) - This program is used to find distant relatives of a protein.
  • 10. Types of BLAST  Nucleotide 6-frame translation-protein (blastx) -This program compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against a protein sequence database.  Nucleotide 6-frame translation-nucleotide 6-frame translation (tblastx) -The purpose of tblastx is to find very distant relationships between nucleotide sequences.
  • 11. Types of BLAST  Protein-nucleotide 6-frame translation (tblastn) -This program compares a protein query against the all six reading frames of a nucleotide sequence database.  Large numbers of query sequences (megablast) -When comparing large numbers of input sequences via the command-line BLAST, "megablast" is much faster than running BLAST multiple times.
  • 12. Types of BLAST Of these programs, BLASTn and BLASTp are the most commonly used because they use direct comparisons, and do not require translations. However, since protein sequences are better conserved evolutionarily than nucleotide sequences, tBLASTn, tBLASTx, and BLASTx, produce more reliable and accurate results when dealing with coding DNA.
  • 13. BLAST Algorithm The blast algorithm is fast, accurate and web-accessible. It is relatively faster than other sequence similarity search tools. Complex BLAST algorithm requires multiple steps and many parameters.
  • 14. BLAST Algorithm An overview of the BLAST algorithm (a protein to protein search) is as follows:  Remove low- complexity region or sequence repeats in the query sequence.  Make a k-letter word list of the query sequence - Take k=3 for example, we list the words of length 3 in the query protein sequence (k is usually 11 for a DNA sequence) "sequentially", until the last letter of the query sequence is included.
  • 15. BLAST Algorithm  List the possible matching words.  Organize the remaining high-scoring words into an efficient search tree.  Repeat step 3 to 4 for each k-letter word in the query sequence.  Scan the database sequences for exact matches with the remaining high-scoring words.  Extend the exact matches to high-scoring segment pair (HSP).
  • 16. BLAST Algorithm  List all of the HSPs in the database whose score is high enough to be considered.  Evaluate the significance of the HSP score.  Make two or more HSP regions into a longer alignment.  Show the gapped Smith-Waterman local alignments of the query and each of the matched database sequences.  Report every match whose expect score is lower than a threshold parameter E.
  • 17. BLAST Input-Output Input Input sequences in FASTA or Genbank format. Output BLAST output can be delivered in a variety of formats. These formats include HTML, plain text, and XML formatting. For NCBI's web- page, the default format for output is HTML.  An introduction that tells where the search occurred and what database and query were compared
  • 18. BLAST Output  A list of the sequences in the database containing segment pairs whose scores were least likely to occur by chance  Alignments of the high-scoring segment pairs showing identical and similar residues  A complete list of the parameter settings used for the search.
  • 19. BLAST Output E-value (expectation value)  The Expect value (E) is a parameter that describes the number of hits one can "expect" to see by chance when searching a database of a particular size.  It decreases exponentially as the Score (S) of the match increases.  Essentially, the E value describes the random background noise.  In general terms the smaller E is the more likely the match is significant.
  • 20. BLAST Output  Default E value for blastn, blastp, blastx and tblastn is 10  At this setting, 10 hits with scores equal to or better than the defined alignment score, S, are expected to occur by chance. The E- value can be increased or decreased to alter the stringency of the search.  Increase the E value when searching with a short query, since it is likely to be found many times by chance in a given database.
  • 21. BLAST Output Bit Score  A bit score is another prominent statistical indicator used in addition to the E value in a BLAST output.  The bit score measures sequence similarity independent of query sequence length and database size and is normalized based on the raw pairwise alignment score.
  • 22. BLAST Search • Go to http://www.ncbi.nlm.nih.gov/ • Select BLAST program
  • 23. BLAST Search Selecting the BLAST Database
  • 24. BLAST Search  Entering sequence  Submitting search
  • 25.
  • 26. BLAST Function BLAST can be used for several purposes. These include identifying species, locating domains, establishing phylogeny, DNA mapping, and comparison. Identifying species -With the use of BLAST, we can possibly correctly identify a species or find homologous species. This can be useful, for example, when we are working with a DNA sequence from an unknown species.
  • 27. BLAST Function Locating domains - When working with a protein sequence you can input it into BLAST, to locate known domains within the sequence of interest. Establishing phylogeny -Using the results received through BLAST we can create a phylogenetic tree using the BLAST web-page.
  • 28. BLAST Function DNA mapping -When working with a known species, and looking to sequence a gene at an unknown location, BLAST can compare the chromosomal position of the sequence of interest, to relevant sequences in the database Comparison -When working with genes, BLAST can locate common genes in two related species, and can be used to map annotations from one organism to another.
  • 29. Objectives of BLAST  It is one of the most popular programs for sequence analysis.  Enables a researcher to compare a query sequence with a library or database of sequence.  Identify library sequences that resemble the query sequence above a certain threshold. The objective is to find high scoring ungapped segments among related sequences.
  • 30. Objectives of BLAST  Alignments of the high-scoring segment pairs showing identical and similar residues.  A complete list of the parameter settings used for the search. That’s all from our presentation