SlideShare a Scribd company logo
1 of 27
WELCOME TO OUR PRESENTATION
Name :
Mr. Shaon Bhatta Shuvo
Designation
Lecturer
Department
Department of Computer Science and Engineering
Daffodil International University
BLAST
(Basic local alignment search
Tool)
Contents
Definition
Background
Types of BLAST Program
Algorithm
BLAST Input-Output
BLAST search
BLAST Function
Objectives of BLAST
Definition
The Basic Local Alignment Search Tool
(BLAST) for comparing gene and protein
sequences against others in public
databases.
BLAST is a set of sequence comparison
algorithms used to search databases for
optimal local alignments to a query.
Basic Local Alignment Search Tool
BLAST finds regions of similarity between biological sequences.
The program compares nucleotide or protein sequences to
sequence databases and calculates the statistical significance.
The Basic Local Alignment Search Tool (BLAST) finds regions of
local similarity between sequences. The program compares
nucleotide or protein sequences to sequence databases and
calculates the statistical significance of matches. BLAST can be
used to infer functional and evolutionary relationships between
sequences as well as help identify members of gene families.
Background
Beginning in the 1970s, scientists began
to accumulate DNA and protein
sequence data at an exponential rate; in
fact, researchers currently have
approximately 97 billion bases
sequenced and over 93 million records.
Amazingly, this sequence data doubles
every 18 months!
Background
Today, one of the most commonly used
tools to examine DNA and protein
sequences is the Basic Local Alignment
Search Tool, also known as BLAST.
BLAST is a computer algorithm that is
available for use online at the National
Center for Biotechnology Information
(NCBI) website and many other sites.
Types of BLAST
 Nucleotide-nucleotide BLAST (blastn)
- This program, given a DNA query,
returns the most similar DNA sequences from
the DNA database that the user specifies.
 Protein-protein BLAST (blastp)
- This program, given a protein query,
returns the most similar protein sequences from
the protein database that the user specifies.
 Position-Specific Iterative BLAST (PSI-
BLAST) (blastpgp)
- This program is used to find distant
relatives of a protein.
Types of BLAST
 Nucleotide 6-frame translation-protein
(blastx)
-This program compares the six-frame
conceptual translation products of a nucleotide
query sequence (both strands) against a protein
sequence database.
 Nucleotide 6-frame translation-nucleotide
6-frame translation (tblastx)
-The purpose of tblastx is to find very
distant relationships between nucleotide
sequences.
Types of BLAST
 Protein-nucleotide 6-frame translation
(tblastn)
-This program compares a protein query
against the all six reading frames of a
nucleotide sequence database.
 Large numbers of query sequences
(megablast)
-When comparing large numbers of input
sequences via the command-line BLAST,
"megablast" is much faster than running BLAST
multiple times.
Types of BLAST
Of these programs, BLASTn and BLASTp are
the most commonly used because they use
direct comparisons, and do not require
translations.
However, since protein sequences are better
conserved evolutionarily than nucleotide
sequences, tBLASTn, tBLASTx, and BLASTx,
produce more reliable and accurate results
when dealing with coding DNA.
BLAST Algorithm
The blast algorithm is fast, accurate and
web-accessible.
It is relatively faster than other sequence
similarity search tools.
Complex BLAST algorithm requires
multiple steps and many parameters.
BLAST Algorithm
An overview of the
BLAST algorithm (a
protein to protein
search) is as follows:
 Remove low-
complexity region or
sequence repeats in
the query sequence.
 Make a k-letter word
list of the query
sequence - Take k=3 for
example, we list the words of
length 3 in the query protein
sequence (k is usually 11 for a
DNA sequence) "sequentially",
until the last letter of the query
sequence is included.
BLAST Algorithm
 List the possible matching words.
 Organize the remaining high-scoring words into an
efficient search tree.
 Repeat step 3 to 4 for each k-letter word in the
query sequence.
 Scan the database sequences for exact matches
with the remaining high-scoring words.
 Extend the exact matches to high-scoring segment
pair (HSP).
BLAST Input-Output
Input
Input sequences
in FASTA or Genbank format.
Output
BLAST output can be delivered in a variety of
formats. These formats include HTML, plain
text, and XML formatting. For NCBI's web-
page, the default format for output is HTML.
 An introduction that tells where the search
occurred and what database and query were
compared
BLAST Output
 A list of the
sequences in the
database containing
segment pairs whose
scores were least
likely to occur by
chance
 Alignments of the
high-scoring segment
pairs showing identical
and similar residues
 A complete list of the
parameter settings
used for the search.
BLAST Output
Bit Score
 A bit score is another prominent statistical
indicator used in addition to the E value in
a BLAST output.
 The bit score measures sequence
similarity independent of query sequence
length and database size and is
normalized based on the raw pairwise
alignment score.
BLAST Search
• Go to http://www.ncbi.nlm.nih.gov/
• Select BLAST program
BLAST Search
Selecting the BLAST Database
BLAST Search
 Entering sequence
 Submitting search
BLAST Function
Locating domains
- When working with a protein
sequence you can input it into BLAST, to
locate known domains within the sequence of
interest.
Establishing phylogeny
-Using the results received through
BLAST we can create a phylogenetic tree
using the BLAST web-page.
BLAST Function
DNA mapping
-When working with a known species,
and looking to sequence a gene at an
unknown location, BLAST can compare the
chromosomal position of the sequence of
interest, to relevant sequences in the
database
Comparison
-When working with genes, BLAST
can locate common genes in two related
species, and can be used to map
annotations from one organism to another.
Objectives of BLAST
 It is one of the most popular programs for
sequence analysis.
 Enables a researcher to compare a
query sequence with a library or database
of sequence.
 Identify library sequences that resemble
the query sequence above a certain
threshold.
The objective is to find high scoring
ungapped segments among related
sequences.
THANK YOU

More Related Content

What's hot (20)

Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Molecular probes
Molecular probesMolecular probes
Molecular probes
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
cDNA Library Construction
cDNA Library ConstructioncDNA Library Construction
cDNA Library Construction
 
Restriction Mapping
Restriction MappingRestriction Mapping
Restriction Mapping
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOL
 
Gene cloning strategies
Gene cloning strategiesGene cloning strategies
Gene cloning strategies
 
Ligase enzyme
Ligase enzyme Ligase enzyme
Ligase enzyme
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Needleman-Wunsch Algorithm
Needleman-Wunsch AlgorithmNeedleman-Wunsch Algorithm
Needleman-Wunsch Algorithm
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Blast
BlastBlast
Blast
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
Fasta
FastaFasta
Fasta
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 

Similar to Blasta

BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)Sobia
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234alizain9604
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformaticsatmapandey
 
Lecture 5.pptx
Lecture 5.pptxLecture 5.pptx
Lecture 5.pptxericndunek
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptxPagudalaSangeetha
 
Data base searching tool
Data base searching toolData base searching tool
Data base searching toolNithyaNandapal
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Syed Lokman
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fastaALLIENU
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignmentbarathvaj
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)AnkitTiwari354
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 

Similar to Blasta (20)

Blast
BlastBlast
Blast
 
BLAST
BLASTBLAST
BLAST
 
BLAST
BLASTBLAST
BLAST
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
 
Blast
Blast Blast
Blast
 
BLAST
BLASTBLAST
BLAST
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformatics
 
BLAST
BLASTBLAST
BLAST
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Lecture 5.pptx
Lecture 5.pptxLecture 5.pptx
Lecture 5.pptx
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
 
Data base searching tool
Data base searching toolData base searching tool
Data base searching tool
 
BLAST Search tool
BLAST Search toolBLAST Search tool
BLAST Search tool
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Article
ArticleArticle
Article
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignment
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 

More from Daffodil International University (20)

Bresenham algorithm
Bresenham algorithmBresenham algorithm
Bresenham algorithm
 
Tic Tac Toe
Tic Tac ToeTic Tac Toe
Tic Tac Toe
 
The Waterfall Model & RAD MODEL
 The Waterfall Model &  RAD MODEL The Waterfall Model &  RAD MODEL
The Waterfall Model & RAD MODEL
 
Reinforcement learning
Reinforcement learningReinforcement learning
Reinforcement learning
 
online marketing
online marketingonline marketing
online marketing
 
normalization
normalizationnormalization
normalization
 
Bioinformatics lesson
Bioinformatics lessonBioinformatics lesson
Bioinformatics lesson
 
Bioinformatics lesson
Bioinformatics lessonBioinformatics lesson
Bioinformatics lesson
 
Liver
LiverLiver
Liver
 
Numerical methods
Numerical methodsNumerical methods
Numerical methods
 
stack in assembally language
stack in assembally languagestack in assembally language
stack in assembally language
 
OSI Model
OSI ModelOSI Model
OSI Model
 
Topology
TopologyTopology
Topology
 
Complex number
Complex numberComplex number
Complex number
 
Ahsan Manzil
Ahsan Manzil Ahsan Manzil
Ahsan Manzil
 
Big data
Big dataBig data
Big data
 
Search
SearchSearch
Search
 
Encoders
EncodersEncoders
Encoders
 
Applications of numerical methods
Applications of numerical methodsApplications of numerical methods
Applications of numerical methods
 
Finite difference & interpolation
Finite difference & interpolationFinite difference & interpolation
Finite difference & interpolation
 

Recently uploaded

Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 

Recently uploaded (20)

9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 

Blasta

  • 1. WELCOME TO OUR PRESENTATION
  • 2. Name : Mr. Shaon Bhatta Shuvo Designation Lecturer Department Department of Computer Science and Engineering Daffodil International University
  • 3.
  • 5. Contents Definition Background Types of BLAST Program Algorithm BLAST Input-Output BLAST search BLAST Function Objectives of BLAST
  • 6. Definition The Basic Local Alignment Search Tool (BLAST) for comparing gene and protein sequences against others in public databases. BLAST is a set of sequence comparison algorithms used to search databases for optimal local alignments to a query.
  • 7. Basic Local Alignment Search Tool BLAST finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance. The Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
  • 8.
  • 9. Background Beginning in the 1970s, scientists began to accumulate DNA and protein sequence data at an exponential rate; in fact, researchers currently have approximately 97 billion bases sequenced and over 93 million records. Amazingly, this sequence data doubles every 18 months!
  • 10. Background Today, one of the most commonly used tools to examine DNA and protein sequences is the Basic Local Alignment Search Tool, also known as BLAST. BLAST is a computer algorithm that is available for use online at the National Center for Biotechnology Information (NCBI) website and many other sites.
  • 11. Types of BLAST  Nucleotide-nucleotide BLAST (blastn) - This program, given a DNA query, returns the most similar DNA sequences from the DNA database that the user specifies.  Protein-protein BLAST (blastp) - This program, given a protein query, returns the most similar protein sequences from the protein database that the user specifies.  Position-Specific Iterative BLAST (PSI- BLAST) (blastpgp) - This program is used to find distant relatives of a protein.
  • 12. Types of BLAST  Nucleotide 6-frame translation-protein (blastx) -This program compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against a protein sequence database.  Nucleotide 6-frame translation-nucleotide 6-frame translation (tblastx) -The purpose of tblastx is to find very distant relationships between nucleotide sequences.
  • 13. Types of BLAST  Protein-nucleotide 6-frame translation (tblastn) -This program compares a protein query against the all six reading frames of a nucleotide sequence database.  Large numbers of query sequences (megablast) -When comparing large numbers of input sequences via the command-line BLAST, "megablast" is much faster than running BLAST multiple times.
  • 14. Types of BLAST Of these programs, BLASTn and BLASTp are the most commonly used because they use direct comparisons, and do not require translations. However, since protein sequences are better conserved evolutionarily than nucleotide sequences, tBLASTn, tBLASTx, and BLASTx, produce more reliable and accurate results when dealing with coding DNA.
  • 15. BLAST Algorithm The blast algorithm is fast, accurate and web-accessible. It is relatively faster than other sequence similarity search tools. Complex BLAST algorithm requires multiple steps and many parameters.
  • 16. BLAST Algorithm An overview of the BLAST algorithm (a protein to protein search) is as follows:  Remove low- complexity region or sequence repeats in the query sequence.  Make a k-letter word list of the query sequence - Take k=3 for example, we list the words of length 3 in the query protein sequence (k is usually 11 for a DNA sequence) "sequentially", until the last letter of the query sequence is included.
  • 17. BLAST Algorithm  List the possible matching words.  Organize the remaining high-scoring words into an efficient search tree.  Repeat step 3 to 4 for each k-letter word in the query sequence.  Scan the database sequences for exact matches with the remaining high-scoring words.  Extend the exact matches to high-scoring segment pair (HSP).
  • 18. BLAST Input-Output Input Input sequences in FASTA or Genbank format. Output BLAST output can be delivered in a variety of formats. These formats include HTML, plain text, and XML formatting. For NCBI's web- page, the default format for output is HTML.  An introduction that tells where the search occurred and what database and query were compared
  • 19. BLAST Output  A list of the sequences in the database containing segment pairs whose scores were least likely to occur by chance  Alignments of the high-scoring segment pairs showing identical and similar residues  A complete list of the parameter settings used for the search.
  • 20. BLAST Output Bit Score  A bit score is another prominent statistical indicator used in addition to the E value in a BLAST output.  The bit score measures sequence similarity independent of query sequence length and database size and is normalized based on the raw pairwise alignment score.
  • 21. BLAST Search • Go to http://www.ncbi.nlm.nih.gov/ • Select BLAST program
  • 22. BLAST Search Selecting the BLAST Database
  • 23. BLAST Search  Entering sequence  Submitting search
  • 24. BLAST Function Locating domains - When working with a protein sequence you can input it into BLAST, to locate known domains within the sequence of interest. Establishing phylogeny -Using the results received through BLAST we can create a phylogenetic tree using the BLAST web-page.
  • 25. BLAST Function DNA mapping -When working with a known species, and looking to sequence a gene at an unknown location, BLAST can compare the chromosomal position of the sequence of interest, to relevant sequences in the database Comparison -When working with genes, BLAST can locate common genes in two related species, and can be used to map annotations from one organism to another.
  • 26. Objectives of BLAST  It is one of the most popular programs for sequence analysis.  Enables a researcher to compare a query sequence with a library or database of sequence.  Identify library sequences that resemble the query sequence above a certain threshold. The objective is to find high scoring ungapped segments among related sequences.