SlideShare a Scribd company logo
FASTA
Amandeep Singh
Assistant Professor
Department of Biotechnology
GSSDGS Khalsa College Patiala
Introduction
FASTA uses an algorithm for similarity search for nucleotide or protein
sequence from a biological database.
Nucleotide Sequence (Query)
Protein Sequence (Query)
Nucleotide Sequence (Database)
Protein Sequence (Database)
FASTA Algorithm
It start from a Dot-plot or Dot-matrix.
A B C D E F
A
B
M
D
L
F
Second Sequence (Database)
First Sequence
(Query)
Shows regions of similarity
between 2 Sequences
represented as diagonals.
FASTA Algorithm
• FASTA goes a step forward from dot-plot
• It calculates the sum of dots along each diagonal.
• It is a “word” based method.
• It looks for matching “word” or the sequence of patterns called “k-tuple”
Tuple: Finite ordered list of elements
Sequence patterns: 1 or 2 amino acids, or 5 or 6 nucleotides
• Build local alignment using this “word” or “k-tuple”.
• Match identical “word”
• Create diagonals by joining adjacent matches.
• Rescore the highest scoring system using PAM or BLOSUM matrix.
• Best of these scores is called init1.
• Join segments using gaps, the best score from this is called initn.
• Use Dynamic programing (Smith-Waterman algorithm) to create the optimal alignment.
FASTA Algorithm
FASTA Implementation
FASTA3 (https://www.ebi.ac.uk/Tools/sss/fasta/) at the EBI is one of
the most popular FASTA implementations.
FASTA Output
• The Histogram
• The Sequence listing
• The Local alignments
FASTA Output
The Histogram
• First part of FASTA output is Histogram.
• Predicted extreme value is represented by asterisk * symbol
• Actual numbers obtained is represented by equal = sign
• First column: z-opt score
• Second column: number of sequences with these z-opt scores
• Third column: Expected number of alignments
Histogram used to determine, whether statistical theory is valid or not.
• If equal sign follow predicted value  Valid
• If equal sign do not follow predicted value  Invalid
FASTA Output: The Histogram
FASTA Output: The Sequence listing
• Listing of the best scoring sequences in the database.
• Best sequence: reported first
• Worst sequence: reported last
First Column Second
Column
Opt
column
Last
Column
Database Database
accession
number
Database
identifier
Total length
of database
sequence
Final score E-Value
FASTA Output: The Sequence listing
FASTA Output: The Local alignments
Display:
 The local alignment
 Init1 & Initn scores
 E-value
 Opt-score
 Z-score
 Percent identity
Significance of E-Value
• E-Value or Expected value is about number of
alignments hit by chance.
• Smaller the E-value: Less likely a given alignment
occurred by chance.
Variants of FASTA
• FastA - Compares a DNA query sequence to a DNA database, or a
protein query to a protein database, detecting the sequence type
automatically.
• FASTX - Compares a DNA query to a protein database. It may
introduce gaps only between codons.
• FASTY - Compares a DNA query to a protein database, optimizing
gap location, even within codons.
• TFASTA - Compares a protein query to a DNA database.

More Related Content

What's hot

Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
Ashwini
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
naveed ul mushtaq
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Subhranil Bhattacharjee
 
Prosite
PrositeProsite
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
Sangeeta Das
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
RishikaMaji
 
Sequence Analysis
Sequence AnalysisSequence Analysis
Sequence Analysis
DEBPRASAD DUTTA
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
ammar kareem
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
KAUSHAL SAHU
 
Cath
CathCath
Cath
Ramya S
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Ramya S
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
Hafiz Muhammad Zeeshan Raza
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
Yogesh Joshi
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
Sabahat Ali
 
Structural databases
Structural databases Structural databases
Structural databases
Priyadharshana
 
Gen bank
Gen bankGen bank
Protein data bank
Protein data bankProtein data bank
Protein data bank
Alichy Sowmya
 
Introduction to sequence alignment partii
Introduction to sequence alignment partiiIntroduction to sequence alignment partii
Introduction to sequence alignment partii
SumatiHajela
 

What's hot (20)

Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Prosite
PrositeProsite
Prosite
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
 
Sequence Analysis
Sequence AnalysisSequence Analysis
Sequence Analysis
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Cath
CathCath
Cath
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
 
Structural databases
Structural databases Structural databases
Structural databases
 
Gen bank
Gen bankGen bank
Gen bank
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Introduction to sequence alignment partii
Introduction to sequence alignment partiiIntroduction to sequence alignment partii
Introduction to sequence alignment partii
 

Similar to FASTA

Blast fasta
Blast fastaBlast fasta
Blast fastayaghava
 
BLAST AND FASTA.pptx
BLAST AND FASTA.pptxBLAST AND FASTA.pptx
BLAST AND FASTA.pptx
PiyushBehgal1
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformaticsatmapandey
 
Sequence comparison techniques
Sequence comparison techniquesSequence comparison techniques
Sequence comparison techniques
ruchibioinfo
 
BLAST : features, types,algorithm, working etc.
BLAST : features, types,algorithm,  working  etc.BLAST : features, types,algorithm,  working  etc.
BLAST : features, types,algorithm, working etc.
Cherry
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
alizain9604
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
PagudalaSangeetha
 
Database Searching
Database SearchingDatabase Searching
Database Searching
Meghaj Mallick
 
FastA HOMOLOGY SEARCH ALGORITHM
FastA HOMOLOGY SEARCH ALGORITHMFastA HOMOLOGY SEARCH ALGORITHM
FastA HOMOLOGY SEARCH ALGORITHM
Muunda Mudenda
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
AnkitTiwari354
 
Sequence database
Sequence databaseSequence database
Sequence database
Dr.M.Prasad Naidu
 
Blast
BlastBlast
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
Ariful Islam Sagar
 
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdfBIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
sirwansleman
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdf
sriaisvariyasundar
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadf
alizain9604
 
Blast Algorithm
Blast AlgorithmBlast Algorithm
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informatice
zahid6
 

Similar to FASTA (20)

Blast fasta
Blast fastaBlast fasta
Blast fasta
 
BLAST AND FASTA.pptx
BLAST AND FASTA.pptxBLAST AND FASTA.pptx
BLAST AND FASTA.pptx
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformatics
 
Sequence comparison techniques
Sequence comparison techniquesSequence comparison techniques
Sequence comparison techniques
 
BLAST : features, types,algorithm, working etc.
BLAST : features, types,algorithm,  working  etc.BLAST : features, types,algorithm,  working  etc.
BLAST : features, types,algorithm, working etc.
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
 
Mayank
MayankMayank
Mayank
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
Blast 2013 1
Blast 2013 1Blast 2013 1
Blast 2013 1
 
FastA HOMOLOGY SEARCH ALGORITHM
FastA HOMOLOGY SEARCH ALGORITHMFastA HOMOLOGY SEARCH ALGORITHM
FastA HOMOLOGY SEARCH ALGORITHM
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Blast
BlastBlast
Blast
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdfBIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdf
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadf
 
Blast Algorithm
Blast AlgorithmBlast Algorithm
Blast Algorithm
 
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informatice
 

More from Thapar Institute of Engineering & Technology, Patiala, Punjab, India

SDS PAGE
SDS PAGESDS PAGE
Agarose gel electrophoresis
Agarose gel electrophoresisAgarose gel electrophoresis
Preparation and staining of specimens for microscopy
Preparation and staining of specimens for microscopyPreparation and staining of specimens for microscopy
Preparation and staining of specimens for microscopy
Thapar Institute of Engineering & Technology, Patiala, Punjab, India
 
Microbial polysaccharides
Microbial polysaccharidesMicrobial polysaccharides
Organic acids production copy
Organic acids production   copyOrganic acids production   copy
Methods of strain improvement
Methods of strain improvementMethods of strain improvement
Refrigeration
RefrigerationRefrigeration
Patents
PatentsPatents
Vaccines
VaccinesVaccines
Chemical reactions and rancidity of fats
Chemical reactions and rancidity of fatsChemical reactions and rancidity of fats
Characteristics of biological databases
Characteristics of biological databasesCharacteristics of biological databases
Organoleptic properties of proteins
Organoleptic properties of proteinsOrganoleptic properties of proteins
Denaturation of proteins
Denaturation of proteinsDenaturation of proteins
OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man OMIM- Online Mendelian Inheritance in Man
Antigen & antigenicity
Antigen & antigenicityAntigen & antigenicity
Protein Data Bank (PDB)
Protein Data Bank (PDB)Protein Data Bank (PDB)
SWISS-PROT
SWISS-PROTSWISS-PROT
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
Organs of the immune system
Organs of the immune systemOrgans of the immune system

More from Thapar Institute of Engineering & Technology, Patiala, Punjab, India (20)

SDS PAGE
SDS PAGESDS PAGE
SDS PAGE
 
Agarose gel electrophoresis
Agarose gel electrophoresisAgarose gel electrophoresis
Agarose gel electrophoresis
 
Prokaryotic and eukaryotic cell
Prokaryotic and eukaryotic cellProkaryotic and eukaryotic cell
Prokaryotic and eukaryotic cell
 
Preparation and staining of specimens for microscopy
Preparation and staining of specimens for microscopyPreparation and staining of specimens for microscopy
Preparation and staining of specimens for microscopy
 
Microbial polysaccharides
Microbial polysaccharidesMicrobial polysaccharides
Microbial polysaccharides
 
Organic acids production copy
Organic acids production   copyOrganic acids production   copy
Organic acids production copy
 
Methods of strain improvement
Methods of strain improvementMethods of strain improvement
Methods of strain improvement
 
Refrigeration
RefrigerationRefrigeration
Refrigeration
 
Patents
PatentsPatents
Patents
 
Vaccines
VaccinesVaccines
Vaccines
 
Chemical reactions and rancidity of fats
Chemical reactions and rancidity of fatsChemical reactions and rancidity of fats
Chemical reactions and rancidity of fats
 
Characteristics of biological databases
Characteristics of biological databasesCharacteristics of biological databases
Characteristics of biological databases
 
Organoleptic properties of proteins
Organoleptic properties of proteinsOrganoleptic properties of proteins
Organoleptic properties of proteins
 
Denaturation of proteins
Denaturation of proteinsDenaturation of proteins
Denaturation of proteins
 
OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man
 
Antigen & antigenicity
Antigen & antigenicityAntigen & antigenicity
Antigen & antigenicity
 
Protein Data Bank (PDB)
Protein Data Bank (PDB)Protein Data Bank (PDB)
Protein Data Bank (PDB)
 
SWISS-PROT
SWISS-PROTSWISS-PROT
SWISS-PROT
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Organs of the immune system
Organs of the immune systemOrgans of the immune system
Organs of the immune system
 

Recently uploaded

The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
EduSkills OECD
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
bennyroshan06
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
rosedainty
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 

Recently uploaded (20)

The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 

FASTA

  • 1. FASTA Amandeep Singh Assistant Professor Department of Biotechnology GSSDGS Khalsa College Patiala
  • 2. Introduction FASTA uses an algorithm for similarity search for nucleotide or protein sequence from a biological database. Nucleotide Sequence (Query) Protein Sequence (Query) Nucleotide Sequence (Database) Protein Sequence (Database)
  • 3. FASTA Algorithm It start from a Dot-plot or Dot-matrix. A B C D E F A B M D L F Second Sequence (Database) First Sequence (Query) Shows regions of similarity between 2 Sequences represented as diagonals.
  • 4. FASTA Algorithm • FASTA goes a step forward from dot-plot • It calculates the sum of dots along each diagonal. • It is a “word” based method. • It looks for matching “word” or the sequence of patterns called “k-tuple” Tuple: Finite ordered list of elements Sequence patterns: 1 or 2 amino acids, or 5 or 6 nucleotides • Build local alignment using this “word” or “k-tuple”. • Match identical “word” • Create diagonals by joining adjacent matches. • Rescore the highest scoring system using PAM or BLOSUM matrix. • Best of these scores is called init1. • Join segments using gaps, the best score from this is called initn. • Use Dynamic programing (Smith-Waterman algorithm) to create the optimal alignment.
  • 6. FASTA Implementation FASTA3 (https://www.ebi.ac.uk/Tools/sss/fasta/) at the EBI is one of the most popular FASTA implementations.
  • 7. FASTA Output • The Histogram • The Sequence listing • The Local alignments
  • 8. FASTA Output The Histogram • First part of FASTA output is Histogram. • Predicted extreme value is represented by asterisk * symbol • Actual numbers obtained is represented by equal = sign • First column: z-opt score • Second column: number of sequences with these z-opt scores • Third column: Expected number of alignments Histogram used to determine, whether statistical theory is valid or not. • If equal sign follow predicted value  Valid • If equal sign do not follow predicted value  Invalid
  • 9. FASTA Output: The Histogram
  • 10. FASTA Output: The Sequence listing • Listing of the best scoring sequences in the database. • Best sequence: reported first • Worst sequence: reported last First Column Second Column Opt column Last Column Database Database accession number Database identifier Total length of database sequence Final score E-Value
  • 11. FASTA Output: The Sequence listing
  • 12. FASTA Output: The Local alignments Display:  The local alignment  Init1 & Initn scores  E-value  Opt-score  Z-score  Percent identity
  • 13. Significance of E-Value • E-Value or Expected value is about number of alignments hit by chance. • Smaller the E-value: Less likely a given alignment occurred by chance.
  • 14. Variants of FASTA • FastA - Compares a DNA query sequence to a DNA database, or a protein query to a protein database, detecting the sequence type automatically. • FASTX - Compares a DNA query to a protein database. It may introduce gaps only between codons. • FASTY - Compares a DNA query to a protein database, optimizing gap location, even within codons. • TFASTA - Compares a protein query to a DNA database.