SlideShare a Scribd company logo
1 of 20
BLAST
BLAST
• Basic Local Alignment Search Tool (BLAST) - a program
that can detect sequence similarity between a Query
sequence and sequences within a database.
• It uses a robust statistical framework that can determine if
the alignment between two sequences is statistically
significant.
Uses
– Helpful in identifying well known genes in a novel
sequence
– Helps to determine the relation between particular gene
or protein with other genes or proteins.
• Navigate to the BLAST main page at
http://blast.ncbi.nlm.nih.gov/Blast.cgi.
• All NCBI BLAST pages have the same header with
four tabs:
Tab Explanation
Home Link to the BLAST home page
Recent
Results
Link to results of the BLAST searches you have
performed in your current browser session
Saved
Strategies
BLAST input forms with the parameters you
have saved to your MyNCBI account
Help List of all BLAST help documentations
Two other sections:
• “Basic BLAST” section contains links to common
BLAST programs.
The different BLAST programs available on the NCBI web server
• “Specialized BLAST‟ section of the NCBI BLAST main
page can be used to align 2 or more sequences using ‘blast
2 sequences’
• Three main criterions
– BLAST program we wish to use
– query sequence we want to interpret
– database we want to search
Other optional parameters such as expect threshold and
other scoring parameters can be used to modify the
behaviour of BLAST.
BLAST search
It has four components:
query,
database,
program,
search purpose/goal.
1. Prepare the query sequence (sequence to be BLAST) you want to
use as raw or in fasta format.
2. Go to NCBI home page available at http://www.ncbi.nlm.nih.gov/
and choose BLAST program.
3. Choose the nucleotide BLAST program.
4. Copy and paste the sequence in the sequence box or alternatively
you can also browse and upload the sequence in fasta format from
your computer (Fig. 5).
5. Select the database you want to search against (preferably ‘Others’
(nr - non redundant).
6. If you wish to search against any specific organism, under
‘Organism’, enter first few characters of the organism’s name and
select the organism for the dropdown list.
4
5
6
7
9
7. Choose the appropriate program under ‘Program Selection’.
MEGABLAST - to identify identical nucleotide sequence.
Discontiguous MEGABLAST – to identify similar nucleotide
sequences, but not identical.
Blastn - to identify related nucleotide sequences from other
organisms (distantly related).
MEGABLAST
Discontiguous MEGABLAST
Blastn
8. Adjust any parameter, if required. Any parameter changed is
highlighted in yellow.
9. Click ‘BLAST’ button at the end of the page.
Interpreting the BLAST result
The BLAST result consists of three major sections:
(1) The Header - containing information about the
query sequence, the database searched, BLAST version
and its release date. It also provides a graphical overview
of the query coverage;
(2) The Descriptions – showing the one-line
descriptions of each hit (sequence found to match in the
database) of the query sequence; provides a quick
overview for browsing;
(3) The Alignments – displaying the pairwise
alignments of the query sequence against each database
sequence matched.
2
3
The header Section of the BLAST result.
The header
1. It is then followed by the graphical output. The query sequence is represented
by the numbered red bar at the top of the Figure.
2. The hits from the database are shown below the red bar; the most similar are
shown closest to the query. Sometimes a thin black line is observed in the bar. This
indicates two High Scoring Pairs (HSP) are from the same sequence.
Perpendicular line (arrow) - the distance between the two HSPs is less
Horizontal line (arrow) - more distance between the two HSPs.
3. The Figure also provides a colour key to represent the match of the hits. The
length of the colour bar indicates the query coverage. Moving the mouse cursor
over the bars displays the definition line above the graphic for that sequence
The Descriptions
4. The One-line descriptions is composed of (a) a brief textual
description , (b) the max score, (c) the total score, (d) the query
coverage, (e) the E value, (f) the maximum identity and the accession
number. Clicking the hyperlinks will open corresponding GenBank
record.
5. The one-line descriptions are useful for people to get a quick
overview of their search results.
Description: Information about the sequence record for a particular hit.
Hit: Matched sequence in the database.
Score: A score is numerical value that describes the overall quality of the
alignment.
Max score: Highest alignment score between query and database sequence
segment.
Total score: Sum of alignment scores of all segments from the same
database sequence that match the query sequence (calculated over all
segments). This score is different from the max score if several parts of the
database sequence match different parts of the query sequence.
Query coverage: Percent of query length that is included in the aligned
segments. This coverage is calculated over all segments.
E value: Number of alignments expected by chance with a particular score
or better. It is derived from P value. The E-value below 10-4 usually indicates
the evidence for homology.
Max identity: BLAST calculates the percentage identity between the query
and the hit in a nucleotide-to-nucleotide alignment. If there are multiple
alignments with a single hit, then only the highest percent identity is shown.
Accession: A unique identifier number of a specific (sequence) GenBank
record.
The alignments
6. The pairwise sequence alignment is preceded by the sequence identifier, the
full definition line, and the length of the matched sequence.
The ‘Range 1:’ indicates the first segment that is matched and the following
values tell the range that is being matched with our query sequence.
7. Next comes the bit score (the raw score is in parentheses) and then the
E value, then the percentage of identities, gaps and finally the strand. A
plus/plus indicates that both the query and subject are in forward strand.
A plus/minus indicates that the query is in reverse strand. This is
because, usually all the sequence in the database is stored only as plus
strand.
8. The ‘Query’ refers to our query sequence and the ‘Sbjct’ refers to the
subject sequence (the match) from the database.
9. By default, a maximum of 100 sequence matches are displayed, which
can be changed on the BLAST page ‘Algorithm parameters’ option.

More Related Content

What's hot (20)

sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
 
Blasta
BlastaBlasta
Blasta
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
Distance based method
Distance based method Distance based method
Distance based method
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
 
Tools of bioinforformatics by kk
Tools of bioinforformatics by kkTools of bioinforformatics by kk
Tools of bioinforformatics by kk
 
Phylogenetic data analysis
Phylogenetic data analysisPhylogenetic data analysis
Phylogenetic data analysis
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOL
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 

Similar to Bioinformatics

Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Syed Lokman
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformaticsatmapandey
 
Lecture 5.pptx
Lecture 5.pptxLecture 5.pptx
Lecture 5.pptxericndunek
 
How the blast work
How the blast workHow the blast work
How the blast workAtai Rabby
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptxPagudalaSangeetha
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234alizain9604
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadfalizain9604
 
Advanced BLAST (BlastP, PSI-BLAST)
Advanced BLAST (BlastP, PSI-BLAST)Advanced BLAST (BlastP, PSI-BLAST)
Advanced BLAST (BlastP, PSI-BLAST)Syed Lokman
 
Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02PILLAI ASWATHY VISWANATH
 
2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekinge2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekingeProf. Wim Van Criekinge
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignmentbarathvaj
 
Bioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searchingBioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searchingProf. Wim Van Criekinge
 
Blast fasta
Blast fastaBlast fasta
Blast fastayaghava
 

Similar to Bioinformatics (20)

Blast
BlastBlast
Blast
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)
 
Blast
BlastBlast
Blast
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformatics
 
Lecture 5.pptx
Lecture 5.pptxLecture 5.pptx
Lecture 5.pptx
 
BLAST
BLASTBLAST
BLAST
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
How the blast work
How the blast workHow the blast work
How the blast work
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
 
Use of NCBI Databases in qPCR Assay Design
Use of NCBI Databases in qPCR Assay DesignUse of NCBI Databases in qPCR Assay Design
Use of NCBI Databases in qPCR Assay Design
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadf
 
Bioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmmBioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmm
 
Advanced BLAST (BlastP, PSI-BLAST)
Advanced BLAST (BlastP, PSI-BLAST)Advanced BLAST (BlastP, PSI-BLAST)
Advanced BLAST (BlastP, PSI-BLAST)
 
Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02
 
2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekinge2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekinge
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignment
 
Bioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searchingBioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searching
 
Blast fasta
Blast fastaBlast fasta
Blast fasta
 

More from Afra Fathima

Radio active labeling
Radio active labelingRadio active labeling
Radio active labelingAfra Fathima
 
Micriobiology & Microbes
Micriobiology & MicrobesMicriobiology & Microbes
Micriobiology & MicrobesAfra Fathima
 
Identification of proteins by 2D gel
Identification of proteins by 2D gelIdentification of proteins by 2D gel
Identification of proteins by 2D gelAfra Fathima
 
HIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHYHIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHYAfra Fathima
 
Ammonium sulphate precipitation
Ammonium sulphate precipitationAmmonium sulphate precipitation
Ammonium sulphate precipitationAfra Fathima
 
Collection of blood, serum & plasma
Collection of blood, serum & plasmaCollection of blood, serum & plasma
Collection of blood, serum & plasmaAfra Fathima
 
Sub cellular fractionation
Sub cellular fractionationSub cellular fractionation
Sub cellular fractionationAfra Fathima
 
Cells of the immune system
Cells of the immune systemCells of the immune system
Cells of the immune systemAfra Fathima
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system Afra Fathima
 
Dinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) methodDinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) methodAfra Fathima
 
Cells of the immune system
Cells of the immune system Cells of the immune system
Cells of the immune system Afra Fathima
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system Afra Fathima
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system Afra Fathima
 
Cells of the immune system
Cells  of the immune systemCells  of the immune system
Cells of the immune systemAfra Fathima
 

More from Afra Fathima (20)

Time management
Time managementTime management
Time management
 
Radio active labeling
Radio active labelingRadio active labeling
Radio active labeling
 
Micriobiology & Microbes
Micriobiology & MicrobesMicriobiology & Microbes
Micriobiology & Microbes
 
Identification of proteins by 2D gel
Identification of proteins by 2D gelIdentification of proteins by 2D gel
Identification of proteins by 2D gel
 
SDS PAGE
SDS PAGESDS PAGE
SDS PAGE
 
HIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHYHIGH PERFORMANCE LIQUID CHROMATOGRAPHY
HIGH PERFORMANCE LIQUID CHROMATOGRAPHY
 
Dialysis
DialysisDialysis
Dialysis
 
Lowry's method
Lowry's  methodLowry's  method
Lowry's method
 
Bradford's method
Bradford's methodBradford's method
Bradford's method
 
Ammonium sulphate precipitation
Ammonium sulphate precipitationAmmonium sulphate precipitation
Ammonium sulphate precipitation
 
Collection of blood, serum & plasma
Collection of blood, serum & plasmaCollection of blood, serum & plasma
Collection of blood, serum & plasma
 
Sub cellular fractionation
Sub cellular fractionationSub cellular fractionation
Sub cellular fractionation
 
Cells of the immune system
Cells of the immune systemCells of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
 
Dinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) methodDinitro salicylic acid (DNSA) method
Dinitro salicylic acid (DNSA) method
 
Cells of the immune system
Cells of the immune system Cells of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune system Cells  of the immune system
Cells of the immune system
 
Cells of the immune system
Cells  of the immune systemCells  of the immune system
Cells of the immune system
 
Sulphadiazine
SulphadiazineSulphadiazine
Sulphadiazine
 

Recently uploaded

ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 

Recently uploaded (20)

ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 

Bioinformatics

  • 2. BLAST • Basic Local Alignment Search Tool (BLAST) - a program that can detect sequence similarity between a Query sequence and sequences within a database. • It uses a robust statistical framework that can determine if the alignment between two sequences is statistically significant. Uses – Helpful in identifying well known genes in a novel sequence – Helps to determine the relation between particular gene or protein with other genes or proteins.
  • 3. • Navigate to the BLAST main page at http://blast.ncbi.nlm.nih.gov/Blast.cgi.
  • 4. • All NCBI BLAST pages have the same header with four tabs: Tab Explanation Home Link to the BLAST home page Recent Results Link to results of the BLAST searches you have performed in your current browser session Saved Strategies BLAST input forms with the parameters you have saved to your MyNCBI account Help List of all BLAST help documentations
  • 5. Two other sections: • “Basic BLAST” section contains links to common BLAST programs. The different BLAST programs available on the NCBI web server
  • 6. • “Specialized BLAST‟ section of the NCBI BLAST main page can be used to align 2 or more sequences using ‘blast 2 sequences’
  • 7. • Three main criterions – BLAST program we wish to use – query sequence we want to interpret – database we want to search Other optional parameters such as expect threshold and other scoring parameters can be used to modify the behaviour of BLAST. BLAST search It has four components: query, database, program, search purpose/goal.
  • 8. 1. Prepare the query sequence (sequence to be BLAST) you want to use as raw or in fasta format. 2. Go to NCBI home page available at http://www.ncbi.nlm.nih.gov/ and choose BLAST program.
  • 9. 3. Choose the nucleotide BLAST program. 4. Copy and paste the sequence in the sequence box or alternatively you can also browse and upload the sequence in fasta format from your computer (Fig. 5). 5. Select the database you want to search against (preferably ‘Others’ (nr - non redundant). 6. If you wish to search against any specific organism, under ‘Organism’, enter first few characters of the organism’s name and select the organism for the dropdown list.
  • 11. 7. Choose the appropriate program under ‘Program Selection’. MEGABLAST - to identify identical nucleotide sequence. Discontiguous MEGABLAST – to identify similar nucleotide sequences, but not identical. Blastn - to identify related nucleotide sequences from other organisms (distantly related). MEGABLAST
  • 13. 8. Adjust any parameter, if required. Any parameter changed is highlighted in yellow. 9. Click ‘BLAST’ button at the end of the page.
  • 14. Interpreting the BLAST result The BLAST result consists of three major sections: (1) The Header - containing information about the query sequence, the database searched, BLAST version and its release date. It also provides a graphical overview of the query coverage; (2) The Descriptions – showing the one-line descriptions of each hit (sequence found to match in the database) of the query sequence; provides a quick overview for browsing; (3) The Alignments – displaying the pairwise alignments of the query sequence against each database sequence matched.
  • 15. 2 3 The header Section of the BLAST result.
  • 16. The header 1. It is then followed by the graphical output. The query sequence is represented by the numbered red bar at the top of the Figure. 2. The hits from the database are shown below the red bar; the most similar are shown closest to the query. Sometimes a thin black line is observed in the bar. This indicates two High Scoring Pairs (HSP) are from the same sequence. Perpendicular line (arrow) - the distance between the two HSPs is less Horizontal line (arrow) - more distance between the two HSPs. 3. The Figure also provides a colour key to represent the match of the hits. The length of the colour bar indicates the query coverage. Moving the mouse cursor over the bars displays the definition line above the graphic for that sequence
  • 17. The Descriptions 4. The One-line descriptions is composed of (a) a brief textual description , (b) the max score, (c) the total score, (d) the query coverage, (e) the E value, (f) the maximum identity and the accession number. Clicking the hyperlinks will open corresponding GenBank record. 5. The one-line descriptions are useful for people to get a quick overview of their search results.
  • 18. Description: Information about the sequence record for a particular hit. Hit: Matched sequence in the database. Score: A score is numerical value that describes the overall quality of the alignment. Max score: Highest alignment score between query and database sequence segment. Total score: Sum of alignment scores of all segments from the same database sequence that match the query sequence (calculated over all segments). This score is different from the max score if several parts of the database sequence match different parts of the query sequence. Query coverage: Percent of query length that is included in the aligned segments. This coverage is calculated over all segments. E value: Number of alignments expected by chance with a particular score or better. It is derived from P value. The E-value below 10-4 usually indicates the evidence for homology. Max identity: BLAST calculates the percentage identity between the query and the hit in a nucleotide-to-nucleotide alignment. If there are multiple alignments with a single hit, then only the highest percent identity is shown. Accession: A unique identifier number of a specific (sequence) GenBank record.
  • 19. The alignments 6. The pairwise sequence alignment is preceded by the sequence identifier, the full definition line, and the length of the matched sequence. The ‘Range 1:’ indicates the first segment that is matched and the following values tell the range that is being matched with our query sequence.
  • 20. 7. Next comes the bit score (the raw score is in parentheses) and then the E value, then the percentage of identities, gaps and finally the strand. A plus/plus indicates that both the query and subject are in forward strand. A plus/minus indicates that the query is in reverse strand. This is because, usually all the sequence in the database is stored only as plus strand. 8. The ‘Query’ refers to our query sequence and the ‘Sbjct’ refers to the subject sequence (the match) from the database. 9. By default, a maximum of 100 sequence matches are displayed, which can be changed on the BLAST page ‘Algorithm parameters’ option.