SlideShare a Scribd company logo
1 of 1
Download to read offline
ASM2015 Workshop 20: Typing of Bacterial Pathogens in 2015 May 30, 2015
SNP CALLING & OUTBREAK RECONSTRUCTION IN CLONAL PATHOGENS
Tips for successful sequencing
1. Choose the right platform and read length. For identifying SNPs in a monomorphic pathogen,
mapping short reads against a reference genome works well. 150-300bp paired-end reads on an
Illumina platform are common. There may be cases in which you want a de novo assembly or want
to cover long repetitive regions. Running a single isolate of interest on a long-read platform can give
you the scaffold against which short reads can then be mapped.

2. Multiplex appropriately, aiming for at least 50x coverage, if not more. 

3. When sequencing outbreaks, it can be helpful to include a few non-outbreak cases to ensure the
sample labels did not get mixed up during the run.

Bioinformatics advice (with thanks to Nick Loman)
1. First, a little quality control. FastQC can tell you if the reads look good. Trimmomatic can remove
sequencing adaptors, but trimming for base quality isn’t generally needed. BLAST, Kraken, or
Metaphlan will let you know if what you sequenced is what you think it should be. 

2. Decide on your goal: mapping your reads to a reference (perfect for outbreak reconstruction SNP
calling, building a phylogeny of isolates) or creating a de novo genome assembly (useful for trying to
describe a genome in which there may be some interesting biology). 

3. A good reference mapping pipeline would be BWA for alignment, Samtools or VarScan to call
SNPs and indels, a custom script for extracting and filtering SNPs, and then a phylogenetic tree
from a SNP aligment using RaxML or BEAST. For non-clonal pathogens, a recombination filtering
step could use Gubbins or Clonal Frame. There are tools that integrate several steps too, like
Snippy or BRESEQ. 

4. A good reference mapping also depends on having a good reference - it should be closely related
to your organism and should be high-quality - early Sanger genomes work well.

5. If you are just interested in typing from your reads using an existing MLST scheme, or matching
reads to a set of genes - say, antibiotic resistance genes from your organism of interest, use SRST2.

6. Quality SNPs will have decent coverage, will occur on both forward and reverse reads, won’t be
near the ends of reads or contigs, and won’t be densely clumped. Inspecting your SNPs visually is a
good practice.

Interpreting genomic data through the epidemiological lens
1. Nothing in biology makes sense except in light of evolution.

2. Identical sequences don’t necessarily always imply transmission from person to person - there
could be exposure from a common source, or within-host genetic diversity might make it difficult to
identify an infector.

3. Sometimes it’s easier to rule out transmission than rule it in.

4. Information to consider besides shared contact includes: date of symptom onset, date of
diagnosis, date put on treatment, infectiousness, hospitalizations, and duration of infectious period.
By Jennifer Gardy, BC Centre for Disease Control

More Related Content

Recently uploaded

Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Sérgio Sacani
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
jyothisaisri
 
Isolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptxIsolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptx
GOWTHAMIM22
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
University of Hertfordshire
 
MIP Award presentation at the IEEE International Conference on Software Analy...
MIP Award presentation at the IEEE International Conference on Software Analy...MIP Award presentation at the IEEE International Conference on Software Analy...
MIP Award presentation at the IEEE International Conference on Software Analy...
Annibale Panichella
 

Recently uploaded (20)

Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
 
NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.
 
Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243
Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243
Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243
 
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
 
mixotrophy in cyanobacteria: a dual nutritional strategy
mixotrophy in cyanobacteria: a dual nutritional strategymixotrophy in cyanobacteria: a dual nutritional strategy
mixotrophy in cyanobacteria: a dual nutritional strategy
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
 
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
 
Isolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptxIsolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptx
 
Lubrication System in forced feed system
Lubrication System in forced feed systemLubrication System in forced feed system
Lubrication System in forced feed system
 
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
 
Triploidy ...............................pptx
Triploidy ...............................pptxTriploidy ...............................pptx
Triploidy ...............................pptx
 
B lymphocytes, Receptors, Maturation and Activation
B lymphocytes, Receptors, Maturation and ActivationB lymphocytes, Receptors, Maturation and Activation
B lymphocytes, Receptors, Maturation and Activation
 
GBSN - Microbiology (Unit 6) Human and Microbial interaction
GBSN - Microbiology (Unit 6) Human and Microbial interactionGBSN - Microbiology (Unit 6) Human and Microbial interaction
GBSN - Microbiology (Unit 6) Human and Microbial interaction
 
Erythropoiesis- Dr.E. Muralinath-C Kalyan
Erythropoiesis- Dr.E. Muralinath-C KalyanErythropoiesis- Dr.E. Muralinath-C Kalyan
Erythropoiesis- Dr.E. Muralinath-C Kalyan
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
 
IISc Bangalore M.E./M.Tech. courses and fees 2024
IISc Bangalore M.E./M.Tech. courses and fees 2024IISc Bangalore M.E./M.Tech. courses and fees 2024
IISc Bangalore M.E./M.Tech. courses and fees 2024
 
GBSN - Microbiology Lab (Microbiology Lab Safety Procedures)
GBSN -  Microbiology Lab (Microbiology Lab Safety Procedures)GBSN -  Microbiology Lab (Microbiology Lab Safety Procedures)
GBSN - Microbiology Lab (Microbiology Lab Safety Procedures)
 
NuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdfNuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdf
 
MIP Award presentation at the IEEE International Conference on Software Analy...
MIP Award presentation at the IEEE International Conference on Software Analy...MIP Award presentation at the IEEE International Conference on Software Analy...
MIP Award presentation at the IEEE International Conference on Software Analy...
 
Plasmapheresis - Dr. E. Muralinath - Kalyan . C.pptx
Plasmapheresis - Dr. E. Muralinath - Kalyan . C.pptxPlasmapheresis - Dr. E. Muralinath - Kalyan . C.pptx
Plasmapheresis - Dr. E. Muralinath - Kalyan . C.pptx
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

ASM workshop handout

  • 1. ASM2015 Workshop 20: Typing of Bacterial Pathogens in 2015 May 30, 2015 SNP CALLING & OUTBREAK RECONSTRUCTION IN CLONAL PATHOGENS Tips for successful sequencing 1. Choose the right platform and read length. For identifying SNPs in a monomorphic pathogen, mapping short reads against a reference genome works well. 150-300bp paired-end reads on an Illumina platform are common. There may be cases in which you want a de novo assembly or want to cover long repetitive regions. Running a single isolate of interest on a long-read platform can give you the scaffold against which short reads can then be mapped. 2. Multiplex appropriately, aiming for at least 50x coverage, if not more. 3. When sequencing outbreaks, it can be helpful to include a few non-outbreak cases to ensure the sample labels did not get mixed up during the run. Bioinformatics advice (with thanks to Nick Loman) 1. First, a little quality control. FastQC can tell you if the reads look good. Trimmomatic can remove sequencing adaptors, but trimming for base quality isn’t generally needed. BLAST, Kraken, or Metaphlan will let you know if what you sequenced is what you think it should be. 2. Decide on your goal: mapping your reads to a reference (perfect for outbreak reconstruction SNP calling, building a phylogeny of isolates) or creating a de novo genome assembly (useful for trying to describe a genome in which there may be some interesting biology). 3. A good reference mapping pipeline would be BWA for alignment, Samtools or VarScan to call SNPs and indels, a custom script for extracting and filtering SNPs, and then a phylogenetic tree from a SNP aligment using RaxML or BEAST. For non-clonal pathogens, a recombination filtering step could use Gubbins or Clonal Frame. There are tools that integrate several steps too, like Snippy or BRESEQ. 4. A good reference mapping also depends on having a good reference - it should be closely related to your organism and should be high-quality - early Sanger genomes work well. 5. If you are just interested in typing from your reads using an existing MLST scheme, or matching reads to a set of genes - say, antibiotic resistance genes from your organism of interest, use SRST2. 6. Quality SNPs will have decent coverage, will occur on both forward and reverse reads, won’t be near the ends of reads or contigs, and won’t be densely clumped. Inspecting your SNPs visually is a good practice. Interpreting genomic data through the epidemiological lens 1. Nothing in biology makes sense except in light of evolution. 2. Identical sequences don’t necessarily always imply transmission from person to person - there could be exposure from a common source, or within-host genetic diversity might make it difficult to identify an infector. 3. Sometimes it’s easier to rule out transmission than rule it in. 4. Information to consider besides shared contact includes: date of symptom onset, date of diagnosis, date put on treatment, infectiousness, hospitalizations, and duration of infectious period. By Jennifer Gardy, BC Centre for Disease Control