Species identification.pptx

MaiAnh409544

Phylogenetic analyses using Sanger sequencing
Above all for yeasts
By Walter P. Pfliegler

Links for software and tutorials
• Phylogenetics course
• https://www.ncbi.nlm.nih.gov/Class/NAWBIS/Modules/Phylogenetics/phylo1.html
• MEGA manual
• https://www.megasoftware.net/web_help_7/helpfile.htm#hc_first_time_user.htm

1. Working with Sanger sequences

Sequence analysis
• After PCR, we can have the PCR products sequenced
• What we get is an ABI file that can be opened in, e.g., SnapGene Viewer or Chromas: https://chromas-lite.software.informer.com/2.1/
• The sequence can be corrected
• And saved as a .fasta file
• FASTA files are standard files in sequence analysis. Each record begins with a > followed by an identifier, and then the sequence itself
• FASTA files, like many other files we will use, can be opened in a text editor in Windows or in gedit
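
Because a FASTA file is plain text, it can also be read with a few lines of code. The following is a minimal sketch, not part of the original workflow: the function name parse_fasta and the example record are made up for illustration, and it assumes the identifier is the first word after the > sign.

```python
def parse_fasta(text):
    """Return a dict mapping each record identifier to its sequence."""
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip empty lines
        if line.startswith(">"):
            # Header line: take the first word after '>' as the identifier
            words = line[1:].split()
            current_id = words[0] if words else ""
            records[current_id] = ""
        elif current_id is not None:
            # Sequence line: a record may span several lines
            records[current_id] += line.upper()
    return records


# Hypothetical record, only to show the layout of a FASTA file
example = """>sample1_NL1 forward read
ACGTTGCA
nnacgt
"""
records = parse_fasta(example)
print(records)
```

The sequence lines are joined and upper-cased, so a record split over several lines comes back as one string.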

Chromas
• Peaks represent the signal strength for each labeled nucleotide
• A quality score is shown for each base

• Using Chromas, we can edit, correct, or delete nucleotides
• After saving the ABI file, we can save the sequence in FASTA format
• Note that I never use the original FASTA files provided by the sequencing company. I delete these, as they are ambiguous

Saving into .fasta (and opening it in a text editor)

• The reverse read can be converted into its reverse complement with a click
• After corrections, it can be saved in FASTA format as well
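
What the reverse-complement click does can be sketched in code: every base is replaced by its complement and the result is read backwards. This is only an illustration under simple assumptions. The names COMPLEMENT and reverse_complement are made up here, and only A, C, G, T and N are handled.

```python
# Translation table for the four bases plus N (upper and lower case).
# Assumption: other IUPAC ambiguity codes are NOT handled; they would be
# left unchanged, which is not a correct complement for most of them.
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Complement every base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


print(reverse_complement("ATGCCN"))  # prints NGGCAT
```

Applying the function twice gives back the original sequence, which is a quick sanity check.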

SnapGene Viewer

• I usually align the forward and reverse reads using EMBOSS Needle:
• https://www.ebi.ac.uk/Tools/psa/emboss_needle/nucleotide.html
• I manually add the overhanging region from the reverse read to the forward read (after opening it in a Windows text editor), and correct any ambiguous bases in the forward read if possible
• If there are ambiguous sites or even gaps, I look at the ABI files again. Which read seems more plausible?
• I save the complete, full-length read in the text editor; it is still a FASTA file
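
The idea of adding the overhang of the reverse read to the forward read can be sketched as follows. This is a deliberate simplification and not what EMBOSS Needle does: it looks for the longest exact overlap instead of computing an alignment, so a single mismatch in the overlap makes it fail, and the ABI traces still have to be checked by eye. The function name merge_reads, the min_overlap parameter, and the example sequences are hypothetical, and the reverse read is assumed to be already reverse-complemented.

```python
def merge_reads(forward, reverse_rc, min_overlap=20):
    """Join two reads using the longest exact overlap between the end of
    `forward` and the start of `reverse_rc`. Returns None if no overlap of
    at least `min_overlap` bases is found."""
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest_possible = min(len(forward), len(reverse_rc))
    # Try the longest candidate overlap first, then shorter ones
    for size in range(longest_possible, min_overlap - 1, -1):
        if forward[-size:] == reverse_rc[:size]:
            # Keep the forward read and append only the overhanging part
            return forward + reverse_rc[size:]
    return None


# Toy sequences, far shorter than real reads
merged = merge_reads("AAACCCGGGTTT", "GGGTTTACGT", min_overlap=4)
print(merged)  # prints AAACCCGGGTTTACGT
```

A return value of None is a signal to go back to the pairwise alignment and the chromatograms rather than to lower min_overlap blindly.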

Searching in GenBank
• https://www.ncbi.nlm.nih.gov/nuccore
• You can search for sequences in GenBank and apply filters
• Each sequence has a unique accession number
• Some sequences are genes, some are whole chromosomes, some are just contigs, some are unspecified, etc.

GenBank
• Features are shown
• If you click on FASTA, you will see the sequence, and you can specify a range to be shown
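
The same FASTA view with a range can also be requested without the web page, through the NCBI E-utilities efetch service. This is a different route from the one shown on the slides and is given only as a sketch. The function name genbank_fasta_url and the accession YOUR_ACCESSION are placeholders; the code only builds the request address and does not download anything.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities efetch service
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def genbank_fasta_url(accession, start=None, stop=None):
    """Build an efetch address that returns a nucleotide record as FASTA.
    If both start and stop are given, only that range is requested."""
    params = {
        "db": "nuccore",      # the nucleotide database searched above
        "id": accession,
        "rettype": "fasta",
        "retmode": "text",
    }
    if start is not None and stop is not None:
        params["seq_start"] = start
        params["seq_stop"] = stop
    return EFETCH_URL + "?" + urlencode(params)


# YOUR_ACCESSION is a placeholder, not a real accession number
url = genbank_fasta_url("YOUR_ACCESSION", start=1, stop=600)
print(url)
```

Opening the printed address in a browser, with a real accession number in place of the placeholder, should show the requested part of the sequence as plain text.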

Task
• Now imagine that you have an unknown organism, say, a yeast, and you sequence one of its genes to be able to identify it
• You perform PCR and Sanger sequencing to obtain the sequence of a given gene
• In this example, the gene will be the ribosomal large subunit (LSU) rRNA gene, sequenced with primer NL1
• Download an example .abi file
• Open it in Chromas
• Correct it and save it as .fasta

• We can find out the identity of our sample by using the BLAST algorithm of the NCBI. Basically, this algorithm searches the global database for sequences similar to ours
• https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastn&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome
• Paste your sequence here or upload your FASTA file
• Click BLAST
• Results will appear: which species matches your sample?
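
BLAST results list a percent identity for each hit. As a rough illustration of what that number means, the sketch below counts identical positions in two sequences that are already aligned (same length, gaps written as -). It is not the BLAST algorithm itself, which also has to find the alignment; the function name percent_identity and the toy sequences are made up.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of alignment columns in which both sequences carry the
    same base. Columns containing a gap ('-') never count as identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment: 10 columns, one gap and one mismatch
identity = percent_identity("ACGT-ACGTA", "ACGTTACCTA")
print(identity)  # prints 80.0
```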

• Example .abi files are uploaded as a .rar file to the e-learning website
