SlideShare a Scribd company logo
1 of 9
Lecture 5:- Bioinformatic tools for DNA technologies
Dr. Naulikha Kituyi
Department of Biological Sciences
University of Embu
Biochemistry-
2020
Bioinformatics tools
SELF-TEST QUESTIONS
•Describe the variants of BLAST and their
applications
•Discuss the relevance of sequence alignment
•Discuss the differences between Homology and
similarity
Bioinformatic tools are computer programs that
analyze one or more sequences. There are a
dizzying array of bioinformatic tools that can
analyze sequences to find protein domains
(Pfam), or that can search through databases of
millions of sequences to find ones that are similar
(BLAST) or that can find potential protein-coding
regions (ORF-Finder).
BLAST
• BLAST (Basic Local Alignment Search Tool) is one of the
most widely used tools to gain sequence information.
Finding similarity between DNA and protein sequences
against a database is one of the first things people do when
trying to get immediate information about a sequence of
interest. Doing these searches allows scientists to gain
knowledge about that particular gene’s function. BLAST
finds regions of similarity between the input sequence and
sequences found in its databases.
• Sequence alignment is a way of arranging two or more
sequences (DNA, RNA, or aa.(proteins)) to identify regions
of similar character patterns
• Sequence similarity could be a result of functional,
structural, or evolutionary relationships between the
sequences
• Procedure involves searching for series of identical or
similar characters/patterns in the same order between the
sequences
• Non identical characters aligned as mismatches or
opposite a gap in the other sequence
• Alignment made between a known sequence and unknown
sequence or between two unknown sequences
Why Sequence
Alignment (uses)?:
• Useful in DNA and Protein sequences for:
– Discovering functional information
– Predicting molecular structure
– Discovering evolutionary relationships
• Sequences that are very much alike probably
have:
– Same function
– Similar secondary and 3-D structure (if proteins)
– Shared ancestral sequence (though not always)
• Sequence alignment enables the following:
– Annotation of new sequences
– Modeling of protein structures
– Phylogenetic analysis
The BLAST Algorithm
BLAST - Basic Local Alignment Search Tool
Blast programs use a heuristic search algorithm.. Blast programs were
designed for fast database searching, with minimal sacrifice of
sensitivity for distantly related sequences. The programs search
databases in a special compressed format. Variants of BLAST
BLASTN - Compares a DNA query to a DNA database.Searches
both strands automatically. It is optimized for speed, rather than
sensitivity.
BLASTP - Compares a protein query to a protein database.
BLASTX - Compares a DNA query to a protein database , by
translating the query sequence in the 6 possible frames , and
comparing each against the database (3 reading frames from each
strand of the DNA) searching.
TBLASTN - Compares a protein query to a DNA database, in the 6
possible frames of the database.
TBLASTX - Compares the protein encoded in a DNA query to the
protein encoded in a DNA database, in the 6 6 possible frames of
both query and database sequences.
BLAST2 - Also called advanced BLAST. It can perform gapped
General View of How the BLAST
Program Works
BLAST - The program compares the
query to each sequence in database
using heuristic rules to speed up the
pairwise comparison. It creates
sequence abstraction by listing exact
and similar words. This is done in
advance for each sequence in the
database on the run for a certain query.
BLAST finds similar words between the
query and each database sequence, It
then extends such words to obtain high-
scoring sequence pairs (HSPs). It also
calculates statistics analytically like
FastA does.
Using Blast For Sequence Alignment
Please attempt the following:
Go on to www.ncbi.nlm.nih.gov. and blast
the Dengue virus sequence that we
examined in Lecture 2,( NC_001477)
using blastn. Copy and paste the
sequence accession number into the
search engine, click on search to confirm
the identity of the sequence then click on
the blastn option to see all the search
results with similar sequences. To see
the alignment for the sequences and the
percent identities, click on any of the
sequences like the one highlighted above
and go to alignment.
Terms used in Sequence Comparisons
Terms used in Sequence Comparisons
Homologous- Two related sequences are termed as
homologous to each other. These can be either
orthologs or paralogs. The homologous protein from
two different organisms with similar functions are
termed as orthologs where as homologous protein
with different function in an organism is called as
paralog.
Identity and similarity- The ratio of identical amino acids
residues to the total number of amino acids present in the
entire length of the sequence is termed as identity .
Whereas ratio of similar amino acids in a sequence
relative to the total number of amino acid present is termed
as similarity. The extend of similarity between two amino
acids is calculated with a similarity matrix
• Identity & Similarity:
• Sequence Identity: Exactly the same Amino acid or
nucleotide in the same position
• Sequence Similarity: Content includes identity and
substitutions (aa residues) with similar chemical
properties
• Similarity: A quantifiable property- Two sequences
are similar if order of sequence characters is
recognizably the same and they can be aligned
Global and Local Alignmennt
The Alignment of two query sequences can be
global or local (Figure.4). In global alignment,
the complete length of the protein sequences are
compared to another where as in the case of local
alignment, only a part of the sequence is
compared. The global alignment is used to
classify the protein into different classes where
as local alignment is used to identify the motif or
domain.
Sequence Alignment Example- Homology
10 20 30 40 50 60
HUMAN MNPLLILTFVAAALAAPFDDDDKIVGGYNCEENSVPYQVSLNSGYHFCGGSLINEQWVVS
:. ::::..:.::.: :..:::::::::.: :.:::::::::::::::::::::.:::::
MSALLILALVGAAVAFPLEDDDKIVGGYTCPEHSVPYQVSLNSGYHFCGGSLINDQWVVS
RAT
10 20 30 40 50 60
70 80 90 100 110 120
HUMAN AGHCYKSRIQVRLGEHNIEVLEGNEQFINAAKIIRHPQYDRKTLNNDIMLIKLSSRAVIN
:.::::::::::::::::.::::.::::::::::.::.:. ::::::::::::: . .:
AAHCYKSRIQVRLGEHNINVLEGDEQFINAAKIIKHPNYSSWTLNNDIMLIKLSSPVKLN
RAT
70 80 90 100 110 120
130 140 150 160 170 180
HUMAN ARVSTISLPTAPPATGTKCLISGWGNTASSGADYPDELQCLDAPVLSQAKCEASYPGKIT
:::. ..::.: .::.::::::::: :.:.. :: :::.:::::::: :::.:::.::
ARVAPVALPSACAPAGTQCLISGWGNTLSNGVNNPDLLQCVDAPVLSQADCEAAYPGEIT
RAT
130 140 150 160 170 180
190 200 210 220 230 240
HUMAN SNMFCVGFLEGGKDSCQGDSGGPVVCNGQLQGVVSWGDGCAQKNKPGVYTKVYNYVKWIK
:.:.::::::::::::::::::::::::::::.:::: ::: ..::::::: :.: ::.
RAT SSMICVGFLEGGKDSCQGDSGGPVVCNGQLQGIVSWGYGCALPDNPGVYTKVCNFVGWIQ
190 200 210 220 230 240
HUMAN NTIAAN
.:::::
DTIAAN
RAT
Human (247 aa) vs Rat (246 aa) Trypsin : show 76.4% identity (91.9% similarity) in 246 aa overlap (1-246:1-246) , E(1) < 2e-86
The similarity is statistically significant ( > expected by chance) , so sequences can be considered homologous
Detection of CPG Islands
Detection of methylated-CpG islands in easily accessible biological materials such as
serum has the potential to be useful for the early diagnosis of cancer. Most currently used
methods for detecting methylated-CpG islands are based on sodium bisulfite conversion
of genomic DNA, followed by PCR reactions
CpG Islands
The CpG sites or CG sites are regions of DNA where a cytosine nucleotide is followed by
a guanine nucleotide in the linear sequence of bases along its 5' → 3' direction. CpG sites
occur with high frequency in genomic regions called CpG islands (or CG islands).
Cytosines in CpG dinucleotides can be methylated to form 5-methylcytosines. Enzymes
that add a methyl group are called DNA methyltransferases. In mammals, 70% to 80% of
CpG cytosines are methylated.[1] Methylating the cytosine within a gene can change its
expression, a mechanism that is part of a larger field of science studying gene regulation
that is called epigenetics.
In humans, about 70% of promoters located near the transcription start site of a gene
(proximal promoters) contain a CpG island

More Related Content

Similar to Lecture 5.pptx

Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptxPagudalaSangeetha
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Syed Lokman
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...journal ijrtem
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...IJRTEMJOURNAL
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)Sobia
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformaticsatmapandey
 
Bioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptxBioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptxRanjan Jyoti Sarma
 
Sequencedatabases
SequencedatabasesSequencedatabases
SequencedatabasesAbhik Seal
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignmentbarathvaj
 
BLAST AND FASTA.pptx
BLAST AND FASTA.pptxBLAST AND FASTA.pptx
BLAST AND FASTA.pptxPiyushBehgal1
 

Similar to Lecture 5.pptx (20)

Blasta
BlastaBlasta
Blasta
 
BLAST
BLASTBLAST
BLAST
 
Blast
BlastBlast
Blast
 
Blast
BlastBlast
Blast
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
 
BLAST Search tool
BLAST Search toolBLAST Search tool
BLAST Search tool
 
Sequence alignment.pptx
Sequence alignment.pptxSequence alignment.pptx
Sequence alignment.pptx
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)
 
BLAST
BLASTBLAST
BLAST
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
 
BLAST
BLASTBLAST
BLAST
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformatics
 
Bioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptxBioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptx
 
Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignment
 
BLAST AND FASTA.pptx
BLAST AND FASTA.pptxBLAST AND FASTA.pptx
BLAST AND FASTA.pptx
 

Recently uploaded

Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 

Recently uploaded (20)

Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 

Lecture 5.pptx

  • 1. Lecture 5:- Bioinformatic tools for DNA technologies Dr. Naulikha Kituyi Department of Biological Sciences University of Embu Biochemistry- 2020
  • 2. Bioinformatics tools SELF-TEST QUESTIONS •Describe the variants of BLAST and their applications •Discuss the relevance of sequence alignment •Discuss the differences between Homology and similarity Bioinformatic tools are computer programs that analyze one or more sequences. There are a dizzying array of bioinformatic tools that can analyze sequences to find protein domains (Pfam), or that can search through databases of millions of sequences to find ones that are similar (BLAST) or that can find potential protein-coding regions (ORF-Finder). BLAST • BLAST (Basic Local Alignment Search Tool) is one of the most widely used tools to gain sequence information. Finding similarity between DNA and protein sequences against a database is one of the first things people do when trying to get immediate information about a sequence of interest. Doing these searches allows scientists to gain knowledge about that particular gene’s function. BLAST finds regions of similarity between the input sequence and sequences found in its databases. • Sequence alignment is a way of arranging two or more sequences (DNA, RNA, or aa.(proteins)) to identify regions of similar character patterns • Sequence similarity could be a result of functional, structural, or evolutionary relationships between the sequences • Procedure involves searching for series of identical or similar characters/patterns in the same order between the sequences • Non identical characters aligned as mismatches or opposite a gap in the other sequence • Alignment made between a known sequence and unknown sequence or between two unknown sequences
  • 3. Why Sequence Alignment (uses)?: • Useful in DNA and Protein sequences for: – Discovering functional information – Predicting molecular structure – Discovering evolutionary relationships • Sequences that are very much alike probably have: – Same function – Similar secondary and 3-D structure (if proteins) – Shared ancestral sequence (though not always) • Sequence alignment enables the following: – Annotation of new sequences – Modeling of protein structures – Phylogenetic analysis
  • 4. The BLAST Algorithm BLAST - Basic Local Alignment Search Tool Blast programs use a heuristic search algorithm.. Blast programs were designed for fast database searching, with minimal sacrifice of sensitivity for distantly related sequences. The programs search databases in a special compressed format. Variants of BLAST BLASTN - Compares a DNA query to a DNA database.Searches both strands automatically. It is optimized for speed, rather than sensitivity. BLASTP - Compares a protein query to a protein database. BLASTX - Compares a DNA query to a protein database , by translating the query sequence in the 6 possible frames , and comparing each against the database (3 reading frames from each strand of the DNA) searching. TBLASTN - Compares a protein query to a DNA database, in the 6 possible frames of the database. TBLASTX - Compares the protein encoded in a DNA query to the protein encoded in a DNA database, in the 6 6 possible frames of both query and database sequences. BLAST2 - Also called advanced BLAST. It can perform gapped General View of How the BLAST Program Works BLAST - The program compares the query to each sequence in database using heuristic rules to speed up the pairwise comparison. It creates sequence abstraction by listing exact and similar words. This is done in advance for each sequence in the database on the run for a certain query. BLAST finds similar words between the query and each database sequence, It then extends such words to obtain high- scoring sequence pairs (HSPs). It also calculates statistics analytically like FastA does.
  • 5. Using Blast For Sequence Alignment Please attempt the following: Go on to www.ncbi.nlm.nih.gov. and blast the Dengue virus sequence that we examined in Lecture 2,( NC_001477) using blastn. Copy and paste the sequence accession number into the search engine, click on search to confirm the identity of the sequence then click on the blastn option to see all the search results with similar sequences. To see the alignment for the sequences and the percent identities, click on any of the sequences like the one highlighted above and go to alignment.
  • 6. Terms used in Sequence Comparisons Terms used in Sequence Comparisons Homologous- Two related sequences are termed as homologous to each other. These can be either orthologs or paralogs. The homologous protein from two different organisms with similar functions are termed as orthologs where as homologous protein with different function in an organism is called as paralog. Identity and similarity- The ratio of identical amino acids residues to the total number of amino acids present in the entire length of the sequence is termed as identity . Whereas ratio of similar amino acids in a sequence relative to the total number of amino acid present is termed as similarity. The extend of similarity between two amino acids is calculated with a similarity matrix • Identity & Similarity: • Sequence Identity: Exactly the same Amino acid or nucleotide in the same position • Sequence Similarity: Content includes identity and substitutions (aa residues) with similar chemical properties • Similarity: A quantifiable property- Two sequences are similar if order of sequence characters is recognizably the same and they can be aligned
  • 7. Global and Local Alignmennt The Alignment of two query sequences can be global or local (Figure.4). In global alignment, the complete length of the protein sequences are compared to another where as in the case of local alignment, only a part of the sequence is compared. The global alignment is used to classify the protein into different classes where as local alignment is used to identify the motif or domain.
  • 8. Sequence Alignment Example- Homology 10 20 30 40 50 60 HUMAN MNPLLILTFVAAALAAPFDDDDKIVGGYNCEENSVPYQVSLNSGYHFCGGSLINEQWVVS :. ::::..:.::.: :..:::::::::.: :.:::::::::::::::::::::.::::: MSALLILALVGAAVAFPLEDDDKIVGGYTCPEHSVPYQVSLNSGYHFCGGSLINDQWVVS RAT 10 20 30 40 50 60 70 80 90 100 110 120 HUMAN AGHCYKSRIQVRLGEHNIEVLEGNEQFINAAKIIRHPQYDRKTLNNDIMLIKLSSRAVIN :.::::::::::::::::.::::.::::::::::.::.:. ::::::::::::: . .: AAHCYKSRIQVRLGEHNINVLEGDEQFINAAKIIKHPNYSSWTLNNDIMLIKLSSPVKLN RAT 70 80 90 100 110 120 130 140 150 160 170 180 HUMAN ARVSTISLPTAPPATGTKCLISGWGNTASSGADYPDELQCLDAPVLSQAKCEASYPGKIT :::. ..::.: .::.::::::::: :.:.. :: :::.:::::::: :::.:::.:: ARVAPVALPSACAPAGTQCLISGWGNTLSNGVNNPDLLQCVDAPVLSQADCEAAYPGEIT RAT 130 140 150 160 170 180 190 200 210 220 230 240 HUMAN SNMFCVGFLEGGKDSCQGDSGGPVVCNGQLQGVVSWGDGCAQKNKPGVYTKVYNYVKWIK :.:.::::::::::::::::::::::::::::.:::: ::: ..::::::: :.: ::. RAT SSMICVGFLEGGKDSCQGDSGGPVVCNGQLQGIVSWGYGCALPDNPGVYTKVCNFVGWIQ 190 200 210 220 230 240 HUMAN NTIAAN .::::: DTIAAN RAT Human (247 aa) vs Rat (246 aa) Trypsin : show 76.4% identity (91.9% similarity) in 246 aa overlap (1-246:1-246) , E(1) < 2e-86 The similarity is statistically significant ( > expected by chance) , so sequences can be considered homologous
  • 9. Detection of CPG Islands Detection of methylated-CpG islands in easily accessible biological materials such as serum has the potential to be useful for the early diagnosis of cancer. Most currently used methods for detecting methylated-CpG islands are based on sodium bisulfite conversion of genomic DNA, followed by PCR reactions CpG Islands The CpG sites or CG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear sequence of bases along its 5' → 3' direction. CpG sites occur with high frequency in genomic regions called CpG islands (or CG islands). Cytosines in CpG dinucleotides can be methylated to form 5-methylcytosines. Enzymes that add a methyl group are called DNA methyltransferases. In mammals, 70% to 80% of CpG cytosines are methylated.[1] Methylating the cytosine within a gene can change its expression, a mechanism that is part of a larger field of science studying gene regulation that is called epigenetics. In humans, about 70% of promoters located near the transcription start site of a gene (proximal promoters) contain a CpG island