SlideShare a Scribd company logo
1 of 24
Basic Local Alignment Search Tool
BLAST
Master In Biotechnology
Subhampreetam
SOA University, BBSR
Can yeast be used as a model organi
sm to study cystic fibrosis?
2
Finding Model Organisms for Study o
f Disease
Model Organisms
• Cystic fibrosis is a genetic disorder tha
t affects humans
– If yeast contain a protein that is related (hom
ologous) to the protein involved in cystic fibro
sis
– Then yeast can be used as a model organi
sm to study this disease
• Study of the protein in yeast will tell us about t
he function of the protein in humans
3
BLAST helps you to find homologo
us genes and proteins
Homologous Proteins (or genes)
• Have a common ancestor (they’re related)
• Have similar structures
• Have similar functions
4
Criteria for considering two sequ
ences to be homologous
• Proteins are homologous if
– Their amino acid sequences are at least 25
% identical
• DNA sequences are homologous if
– they are at least 70% identical
– Note that sequences must be over 100 a.a. (o
r bp) in length
5
Whenever possible, it is better to co
mpare proteins
than to compare genes
What does BLAST do?
BLAST compares sequences
• BLAST takes a query sequence
• Compares it with millions of sequences in the G
enbank databases
– By constructing local alignments
• Lists those that appear to be similar to the query
sequence
– The “hit list”
• Tells you why it thinks they are homologs
– BLAST makes suggestions
– YOU make the conclusions
8
How do I input a query into BLAST?
Choose which “flavor” of BLAST to
use
• BLAST comes in many “flavors”
– Protein BLAST (BLASTp)
• Compares a protein query with sequences in G
enBank protein database
– Nucleotide BLAST (BLASTn)
• Compare nucleotide query with sequences in G
enBank nucleotide database
10
Enter your “query” sequence
• A sequence can be input as a (an)
– FASTA format sequence
– Accession number
– Protein blast can only accept amino acid sequ
ences
11
Choose search set
• Choose which database to search
– Default is non-redundant protein sequence
s (nr)
• Searches all databases that contain protein seq
uences
12
Choose organism
• Default is all organisms represented in
databases
• Use this to limit your search to one org
anism (eg. Yeast)
13
BLAST off!!
• Click on the BLAST button at the botto
m of the page!
14
How do I interpret the results of a BLAST search?
BLAST creates local alignments
• What is a local alignment?
– BLAST looks for similarities between regio
ns of two sequences
16
The BLAST output then describe
s how these aligned regions are
similar
• How long are the aligned segments?
• Did BLAST have to introduce gaps in order to a
lign the segments?
• How similar are the aligned segments?
17
The BLAST Output
The Graphic Display
1. How good is the match?
• Red = excellent!
• Pink = pretty good
• Green = OK, but look at other factors
• Blue = bad
• Black = really bad!
2. How long are the matched segments?
Longer = better
19
The hit list
• BLAST lists the best matches (hits)
– For each hit, BLAST provides:
• Accession number – links to Genbank flatfile
• Description
• “G” = genome link
• E-value
– An indicator of how good a match to the query seque
nce
• Score
– Link to an alignment
20
What is an E-value?
• E-value
– The chance that the match could be rando
m
– The lower the E-value, the more significant
the match
• E = 10-4 is considered the cutoff point
• E = 0 means that the two sequences are statisticall
y identical
21
Most people use the E- value as their
first indication of similarity!
The Alignment
• Look for:
– Long regions of alignment
– With few gaps
– % identity should be >25% for proteins
• (>70% for DNA)
23
BLAST makes suggestions,
You draw the conclusions!
• Look at E-value
• Look at graphic display
• If necessary, look at alignment
• Make your best guess!
24

More Related Content

Similar to Blast subham

blast presentation beevragh muneer.pptx
blast presentation  beevragh muneer.pptxblast presentation  beevragh muneer.pptx
blast presentation beevragh muneer.pptx
home
 

Similar to Blast subham (20)

BLAST
BLASTBLAST
BLAST
 
ppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdfppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdf
 
Introduction to Gene Mining: Part B: How similar are plant and animal version...
Introduction to Gene Mining: Part B: How similar are plant and animal version...Introduction to Gene Mining: Part B: How similar are plant and animal version...
Introduction to Gene Mining: Part B: How similar are plant and animal version...
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
 
O.M.GSEA - An in-depth introduction to gene-set enrichment analysis
O.M.GSEA - An in-depth introduction to gene-set enrichment analysisO.M.GSEA - An in-depth introduction to gene-set enrichment analysis
O.M.GSEA - An in-depth introduction to gene-set enrichment analysis
 
Introduction to Bioinformatics: Part 3
Introduction to Bioinformatics: Part 3Introduction to Bioinformatics: Part 3
Introduction to Bioinformatics: Part 3
 
Blast 2013 1
Blast 2013 1Blast 2013 1
Blast 2013 1
 
2 md2016 annotation
2 md2016 annotation2 md2016 annotation
2 md2016 annotation
 
BLAST
BLASTBLAST
BLAST
 
Blast
BlastBlast
Blast
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data Analysis
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
1 md2016 homology
1 md2016 homology1 md2016 homology
1 md2016 homology
 
Bioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptxBioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptx
 
blast presentation beevragh muneer.pptx
blast presentation  beevragh muneer.pptxblast presentation  beevragh muneer.pptx
blast presentation beevragh muneer.pptx
 
222397 lecture 16 17
222397 lecture 16 17222397 lecture 16 17
222397 lecture 16 17
 
BLAST
BLASTBLAST
BLAST
 
Sequence Analysis
Sequence AnalysisSequence Analysis
Sequence Analysis
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
 

More from Subham Preetam

More from Subham Preetam (20)

Alcohol and its effects on the body
Alcohol and its effects on the bodyAlcohol and its effects on the body
Alcohol and its effects on the body
 
Cryptocurrency A brief History
Cryptocurrency A brief HistoryCryptocurrency A brief History
Cryptocurrency A brief History
 
Basics of Artificial Neural Network
Basics of Artificial Neural Network Basics of Artificial Neural Network
Basics of Artificial Neural Network
 
Fundamentals of Block chain Technology
Fundamentals of Block chain TechnologyFundamentals of Block chain Technology
Fundamentals of Block chain Technology
 
Beneficial use of Microorganism
 Beneficial use of Microorganism Beneficial use of Microorganism
Beneficial use of Microorganism
 
Recent Advancement in science (India part)
Recent Advancement in  science (India part)Recent Advancement in  science (India part)
Recent Advancement in science (India part)
 
Nanotechnology the Advanced Science
Nanotechnology the Advanced ScienceNanotechnology the Advanced Science
Nanotechnology the Advanced Science
 
Chandryaan 2
Chandryaan 2 Chandryaan 2
Chandryaan 2
 
Coronavirus
CoronavirusCoronavirus
Coronavirus
 
Membrane structure
Membrane structureMembrane structure
Membrane structure
 
Silk moth and silk production
Silk moth and silk productionSilk moth and silk production
Silk moth and silk production
 
IMMUNOPATHOLOGY ON SLEEPING SICKNESS:
IMMUNOPATHOLOGY ON SLEEPING SICKNESS: IMMUNOPATHOLOGY ON SLEEPING SICKNESS:
IMMUNOPATHOLOGY ON SLEEPING SICKNESS:
 
Food Packing
Food Packing Food Packing
Food Packing
 
Golden rice technology
Golden rice technologyGolden rice technology
Golden rice technology
 
Immunotherapy
Immunotherapy Immunotherapy
Immunotherapy
 
Biosurfect againt Cancer
Biosurfect againt Cancer Biosurfect againt Cancer
Biosurfect againt Cancer
 
Inhibition effect of Garlic and Onion
Inhibition effect of Garlic and Onion Inhibition effect of Garlic and Onion
Inhibition effect of Garlic and Onion
 
Biosurfectant production
Biosurfectant productionBiosurfectant production
Biosurfectant production
 
Ebola virus
  Ebola virus  Ebola virus
Ebola virus
 
Benifits of ginger
Benifits of gingerBenifits of ginger
Benifits of ginger
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Blast subham

  • 1. Basic Local Alignment Search Tool BLAST Master In Biotechnology Subhampreetam SOA University, BBSR
  • 2. Can yeast be used as a model organi sm to study cystic fibrosis? 2 Finding Model Organisms for Study o f Disease
  • 3. Model Organisms • Cystic fibrosis is a genetic disorder tha t affects humans – If yeast contain a protein that is related (hom ologous) to the protein involved in cystic fibro sis – Then yeast can be used as a model organi sm to study this disease • Study of the protein in yeast will tell us about t he function of the protein in humans 3
  • 4. BLAST helps you to find homologo us genes and proteins Homologous Proteins (or genes) • Have a common ancestor (they’re related) • Have similar structures • Have similar functions 4
  • 5. Criteria for considering two sequ ences to be homologous • Proteins are homologous if – Their amino acid sequences are at least 25 % identical • DNA sequences are homologous if – they are at least 70% identical – Note that sequences must be over 100 a.a. (o r bp) in length 5
  • 6. Whenever possible, it is better to co mpare proteins than to compare genes
  • 8. BLAST compares sequences • BLAST takes a query sequence • Compares it with millions of sequences in the G enbank databases – By constructing local alignments • Lists those that appear to be similar to the query sequence – The “hit list” • Tells you why it thinks they are homologs – BLAST makes suggestions – YOU make the conclusions 8
  • 9. How do I input a query into BLAST?
  • 10. Choose which “flavor” of BLAST to use • BLAST comes in many “flavors” – Protein BLAST (BLASTp) • Compares a protein query with sequences in G enBank protein database – Nucleotide BLAST (BLASTn) • Compare nucleotide query with sequences in G enBank nucleotide database 10
  • 11. Enter your “query” sequence • A sequence can be input as a (an) – FASTA format sequence – Accession number – Protein blast can only accept amino acid sequ ences 11
  • 12. Choose search set • Choose which database to search – Default is non-redundant protein sequence s (nr) • Searches all databases that contain protein seq uences 12
  • 13. Choose organism • Default is all organisms represented in databases • Use this to limit your search to one org anism (eg. Yeast) 13
  • 14. BLAST off!! • Click on the BLAST button at the botto m of the page! 14
  • 15. How do I interpret the results of a BLAST search?
  • 16. BLAST creates local alignments • What is a local alignment? – BLAST looks for similarities between regio ns of two sequences 16
  • 17. The BLAST output then describe s how these aligned regions are similar • How long are the aligned segments? • Did BLAST have to introduce gaps in order to a lign the segments? • How similar are the aligned segments? 17
  • 19. The Graphic Display 1. How good is the match? • Red = excellent! • Pink = pretty good • Green = OK, but look at other factors • Blue = bad • Black = really bad! 2. How long are the matched segments? Longer = better 19
  • 20. The hit list • BLAST lists the best matches (hits) – For each hit, BLAST provides: • Accession number – links to Genbank flatfile • Description • “G” = genome link • E-value – An indicator of how good a match to the query seque nce • Score – Link to an alignment 20
  • 21. What is an E-value? • E-value – The chance that the match could be rando m – The lower the E-value, the more significant the match • E = 10-4 is considered the cutoff point • E = 0 means that the two sequences are statisticall y identical 21
  • 22. Most people use the E- value as their first indication of similarity!
  • 23. The Alignment • Look for: – Long regions of alignment – With few gaps – % identity should be >25% for proteins • (>70% for DNA) 23
  • 24. BLAST makes suggestions, You draw the conclusions! • Look at E-value • Look at graphic display • If necessary, look at alignment • Make your best guess! 24