SlideShare a Scribd company logo
Microbiome
Analysis
16S AND METAGENOMICS
‘
Welcome!
Your Tutorial Team:
Me (16S theory)
Mike Hall (16S practical)
Morgan Langille (metagenomics theory and practical)
Special thanks to:
Will Hsiao (CBW presentation)
2
Today’s presentation
CBW “Analysis of
metagenomic data”
3
http://bioinformatics.ca/workshops/2015/analysis-metagenomic-data-2015
Overview
Morning session
1. A brief history of molecules and microbes
2. Why 16S?
3. How 16S analysis is usually done
4. Assumptions
5. Hands-on practical
Afternoon session
1. 16S vs Metagenomics
2. Metagenome Taxonomic Composition
3. Metagenome Functional Composition
4. PICRUSt: Functional Inference
5. Hands-on practical
4
Learning objectives
At the end of the 16S tutorial, you should be able to do the following:
1. Run a simple QIIME analysis of a data set
(https://www.dropbox.com/s/kpte51nm17wav9o/stool_data.zip)
2. Interpret analysis results
3. Understand the limitations of the standard 16S analysis pipeline
5
Defining metagenomics
Microbiome: Attributed to Joshua Lederberg by Hooper and Gordon (2001):
“the collective genome of our indigenous microbes (microflora), the idea
being that a comprehensive genetic view of Homo sapiens as a life-form
should include the genes in our microbiome”
Is also used to mean microbiota, the group of microorganisms found in a
particular setting
(usage varies: be careful and precise!)
Metagenome: Handelsman et al. (1998) “…advances in molecular biology
and eukaryotic genomics, which have laid the groundwork for cloning and
functional analysis of the collective genomes of soil microflora, which we
term the metagenome of the soil.”
Does not encompass marker-gene surveys (e.g., 16S)
This report says it does.
6
Micro-what?
Metagenomics is often defined to encompass only Bacteria and Archaea
(and often Archaea are excluded too!)
Other small things to consider:
◦ Viruses / phages
◦ Microbial eukaryotes
◦ Worms (helminths, nematodes, …)
7
Lukeš et al. (2015) PLoS Pathogens
The dawn of metagenomics
3.5 BYA – the Archaean Eon
16S position 349 (-ish)
?
G A
Archaea Bacteria
8
Aaaaand more recently
t
9
The 16S
ribosomal RNA gene
THE FIRST WORD IN MICROBIAL BIODIVERSITY
10
11
Yarza et al. (2014)
Escherichia coli
ribosome (PDB 4YBB)
So much RNA!
Why 16S?
The “universal phylogenetic marker”
(1) Present in all living organisms
(2) Single copy* (no recombination)
(3) Highly conserved + highly variable regions
(4) Huge reference databases
12
Milestones
13
1990: “proposal for the domains Archaea, Bacteria, and Eucarya”
Milestones
14
Nature (1990)
2002: “…as much as 50% of the total
surface microbial community…”
Milestones
15
PNAS (2006)
Many critical papers
followed (error
filtering, clustering
approaches, …)
Milestones
16
Huttenhower, Gevers et al. (2012)
+ 681 metagenomic samples
16S analysis
HOW IT’S DONE
17
Your basic workflow
Sample
collection
DNA
extraction
Amplification Analysis
18
Sample collection and DNA extraction
Defined protocols exist, many kits (e.g. PowerSoil®)
Need to consider barriers to DNA recovery and PCR (e.g. humic acids
from soil, bile salts from feces)
Additional mechanical approaches (e.g., mechanical lysis of tissues with
bead beating)
Kits and rogue lab DNA can end up in your sample – need to run
negative controls!!
◦ Example from [year redacted]: shocking finding of bacterial DNA in the
[location redacted]! However, [taxonomic group redacted] was a known
frequent contaminant of DNA extraction kits.
19
20
Size fractionation
http://www.jove.com/video/52685/automated-gel-size-selection-to-improve-quality-next-generation
Choosing a PCR strategy
Need to consider:
◦ Correct melting temperature (60-65 degrees C for Illumina
protocol)
◦ DNA sequencing read length (influences choice of primers)
◦ Primer specificity!
◦ Comparability with previous studies?
[Good luck with that]
[but that’s what the Earth Microbiome Project protocol
http://www.earthmicrobiome.org/emp-standard-protocols/16s/
is meant to achieve]
21
Which variable regions to target?
V1-V3 favours Prevotella, Fusobacterium, Streptococcus, Granulicatella, Bacteroides,
Porphyromonas and Treponema
V4-V6 favours Streptococcus, Treponema, Prevotella, Eubacterium, Porphyromonas,
Campylobacter and Enterococcus.
◦ failed to detect Fusobacterium
V7-V9 favours Veillonella, Streptococcus, Eubacterium, Enterococcus, Treponema,
Catonella and Selenomonas.
◦ failed to detect Selenomonas, TM7 and Mycoplasma
22
At least there’s no shortage of options…
23
Detailed in silico evaluation of primers, experimental evaluation of two sets
Heavily biased recovery of Bacteria, Archaea, and missing groups depending on primer
choice.
“Out of the 175 primers and 512 primer pairs checked, only 10 can be recommended as
broad-range primers.”
Amplification
Example: Illumina protocol
24
Analysis
(examples mostly from QIIME)
1. Quality Control
◦ Error checking
2. Sample diversity
◦ Taxonomy agnostic
◦ Taxonomy aware
3. Similarity among samples
4. Associations with metadata/groups (ANOSIM, MRPP)
5. Machine-learning classification
6. Functional prediction
25
26
QIIME Mothur
A python interface to glue together many
programs
Single program with minimal external
dependency
Wrappers for existing programs Reimplementation of popular algorithms
Large number of dependencies / VM
available
Easy to install and setup; work best on single
multi-core server with lots of memory
More scalable Less scalable
Steeper learning curve but more flexible
workflow if you can write your own scripts
Easy to learn but workflow works the best
with built-in tools
http://www.ncbi.nlm.nih.gov/pubmed/2406
0131
http://www.mothur.org/wiki/MiSeq_SOP
Will Hsiao
“Analysis” #1
Quality Control
27
Quality score filtering:
◦ Minimal length of consecutive high-quality bases (as % of total read length)
◦ Maximal number of consecutive low-quality bases
◦ Maximal number of ambiguous bases (N’s)
◦ Minimum Phred quality score
Other quality filtering tools available
◦ Cutadapt (https://github.com/marcelm/cutadapt)
◦ Trimmomatic (http://www.usadellab.org/cms/?page=trimmomatic)
◦ Sickle (https://github.com/najoshi/sickle)
Chimera checking:
◦ UCHIME
28
Sequence quality summary using FASTQC
http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Analysis #2
Within-sample (“alpha”) diversity
To describe the diversity of a sample, you need to know what you are
counting!
Individual sequences?
◦ Most precise, but vulnerable to sequencing error effects – inflation of
diversity
Clusters of sequences?
◦ Operational taxonomic units (OTUs) – 97% sequence identity as the
“species” level of similarity
Taxonomic groups?
◦ It’s always reassuring to put names on things, but taxonomic labels can be
extremely misleading
29
OTU clustering
30
Choose a % identity threshold
97%
Cluster centroids in some order
(e.g., length, abundance) – these
are reference sequences
Continue procedure until all
sequences are clustered OTU
(singletons may be excluded)
Calculate distances between sequences
6%
What’s in a name?
31
Bacteroides
Parabacteroides
Ruminococcus
???
???
???
???
Akkermansia
Taxonomic assignment
Many choices:
BLAST – assign taxonomic label of closest match (simple, possibly too simple)
Phylogenetic placement – e.g. Pplacer (Matsen et al., BMC Bioinformatics
2010)
Machine-learning classification, in particular Naïve Bayes e.g. RDP Classifier,
Wang et al. (2007) BMC Bioinformatics
32
Example RDP Classifier output
33
GD6JEAT01AYGPE Root rootrank 1.0 Bacteria domain 1.0
"Planctomycetes" phylum 1.0 "Planctomycetacia"class 1.0
Planctomycetales order 1.0 Planctomycetaceaefamily 1.0
Schlesneria genus 0.96
GD6JEAT01BEUG6 Root rootrank 1.0 Bacteria domain 1.0
Firmicutes phylum 0.32 Clostridia class 0.26
Clostridiales order 0.23 Ruminococcaceae family 0.22
Anaerotruncus genus 0.19
Includes bootstrap support
Calculating alpha diversity
OTU counts – richness only
Simpson index – probability of sampling two individuals of the same type
Phylogenetic diversity – sum of branch lengths
34
Example: human body-site diversity
35
Huttenhower, Gevers et al. (2012)
Analysis #3
Among-sample (“beta”) diversity
1. Perform pairwise comparisons between all samples to build a
dissimilarity matrix
2. Summarize the matrix using based on major patterns of covariance
or hierarchical similarity
36
Analysis #3
Among-sample (“beta”) diversity
Given a pair of samples (described as e.g. OTU abundance), calculate
their dissimilarity
Beta-diversity measures can be:
◦ non-phylogenetic or phylogenetic
◦ weighted or unweighted
There are a lot of measures!
-Bray-Curtis (weighted, non-phylogenetic)
-Jaccard (unweighted, non-phylogenetic)
-Weighted UniFrac (weighted, phylogenetic)
-…
37
Analysis #3
Among-sample (“beta”) diversity
How similar are the results of different
measures?
CORRELATIONS between calculated
values
38
Parks and Beiko (2013): ISME J
Analysis #3
Among-sample (“beta”) diversity
What to do with a dissimilarity matrix?
39
Yatsunenko et al. (2012) Nature Parks and Beiko (2012) Mol Biol Evol
Ordination
Clustering
Analysis #3
Among-sample (“beta”) diversity
Different beta-diversity measures can
yield dramatically different clusters!
40
Parks and Beiko (2013): ISME J
Analysis #4
Associations with metadata
PERMANOVA: Permutational multivariate analysis of variance
ANOSIM: Rank-based analysis of similarity
Mantel test: Comparison of between-group vs within-group distances
41
Good review: Anderson and Walsh (2013) Ecological Monographs
Example:
Weighted UniFrac distance: root compartment
explains 46.62% of variance (PERMANOVA p<0.001)
Unweighted UniFrac: root compartment explains only
18.07% of variance (PERMANOVA p<0.001); soil type
is more important
Analysis #5
Machine-learning classification
Identify aspects of community structure that are predictive of sample
attributes
Advantages of machine-learning approaches:
◦ Non-linear combinations of variables
◦ Data transformations
◦ Can accommodate many different representations of the data
Disadvantages:
◦ Complex, may “overfit”
◦ Can be time consuming
◦ Obfuscation of predictive rules
42
Random forests
(supervised_learning.py)
43
“…there are only weak and, for the most part, non-significant associations of
particular taxa or overall diversity with the obese human gut that hold true across
different studies. However, using supervised learning with receiver operator
curves to maximize sensitivity and specificity, one can categorize subjects
according to lean and obese states with in some cases considerable accuracy…”
Tree-based classifications
Nested clade analysis
and feature selection
Classification of plaque samples
using support vector machines
44
Ning and Beiko (2015): Microbiome
Analysis #6
Functional prediction
PICRUSt: Langille et al (2013) Nat Biotechnol
45
Morgan can tell you about this…
Assumptions
THAT ARE OFTEN FALSE
46
Do not assume that
#1: 16S is an effective proxy for microbial diversity.
#2: All 16S studies are created equal, with results that are comparable.
#3: Rarefaction is a good idea.
#4: 16S OTUs describe ecologically cohesive units (“species”?).
#5: The 16S tree is the “Tree of Life”.
47
Assumption #1
16S is an effective proxy for microbial diversity.
48
rrnDB: Stoddard et al.
NAR (2014)
Estimating copy number:
Kembel et al. (2012) and
PICRUSt (coming up later)
Variation: Coenye and Vandamme (2003)
Assumption #1
16S is an effective proxy for microbial
diversity.
Alternative marker genes: cpn60, rpoB, …
Smaller reference databases!
Protein-coding genes!
49
Assumption #2
All 16S studies are created equal.
Effects of sequencing platform, V region, amplicon vs metagenomics
50
Tremblay et al. (2015)
Front Microbiol
Assumption #3
Rarefaction is a good idea.
Example of statistics before and after rarefaction:
Loss of statistical power
Random subsampling can increase false-positive differences
Arbitrary minimum library size chosen for downsampling
Alternatives e.g. Negative Binomial fitting (e.g., DeSeq2)
51
McMurdie and Holmes (2014) PLoS Comp Biol
Assumption #4
16S OTUs describe ecologically cohesive units.
52
Distribution of
sequence similarity
(dashed line = OTU threshold)
branch lengths
Nguyen et al. (2016) npj Biofilms and Microbiomes
Assumption #4
16S OTUs describe ecologically cohesive units.
53
Hall et al., in preparation
Same OTU, different temporal patterns
Assumption #4
16S OTUs describe ecologically cohesive units.
54
Many alternatives exist,
including Swarm: Mahé et al.
(2015) PeerJ
Assumption #5
The 16S tree is the “Tree of Life”.
16S is limited for several reasons:
Limited resolving power
Subject to compositional bias
Subject to recombination and lateral
transfer
Models typically applied to protein-
coding genes do not make sense for
noncoding RNA
55
Moving On
ADVENTURES IN “MULTI-OMICS”
56
Multi-omics??
16S can profile the biodiversity of a microbial sample…
But we need the metagenome to shine a light on function…
The metatranscriptome tells us what is expressed under specific
conditions…
And the metaproteome can quantify the relative abundance of different
enzymes…
While the metametabolome focuses on the products of metabolism.
What do we really need?
57
Metagenomic / metatranscriptomic AMD analysis - Hua et al., ISME J (2015)
Draft genomes at MG-RAST
59
Differences in the microbiome between arsenic-
exposed and control mice
16S taxonomic analysis + metametabolomics
Taxonomy
Metabolic
function
Hands on!
LET’S MAKE SCIENCE HAPPEN
60
The Dataset
61
Workflow
1. Retrieve data
2. Cluster sequences
3. Taxonomic classification
4. Phylogenetic tree construction
5. OTU table creation
6. Downstream visualization / analysis
62
FIN
63
Presentations
http://www.slideshare.net/MickWatson/studying-the-microbiome
http://bioinformatics.ca/metagenomics2015module2pptx

More Related Content

What's hot

Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics Senthil Natesan
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
Mrinal Vashisth
 
Metagenomics sk presentation 17.10.2017
Metagenomics sk presentation 17.10.2017 Metagenomics sk presentation 17.10.2017
Metagenomics sk presentation 17.10.2017
SUNILKUMARSAHOO16
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applications
Sham Sadiq
 
Amplicon Sequencing Introduction
Amplicon Sequencing IntroductionAmplicon Sequencing Introduction
Amplicon Sequencing Introduction
Aaron Marc Saunders
 
Protein micro array
Protein micro arrayProtein micro array
Protein micro array
krupa sagar
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
Elufer Akram
 
Transcriptome analysis
Transcriptome analysisTranscriptome analysis
Transcriptome analysis
Divya Srivastava
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
Dr. Naveen Gaurav srivastava
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and tools
KAUSHAL SAHU
 
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
VHIR Vall d’Hebron Institut de Recerca
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final PresentationShruthi Choudary
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
Rida Khalid
 
Studying the microbiome
Studying the microbiomeStudying the microbiome
Studying the microbiome
Mick Watson
 
Proteomics
Proteomics   Proteomics
Proteomics
Mohit Bharti
 
Introduction to 16s r rna sequencing cd genomics
Introduction to 16s r rna sequencing cd genomicsIntroduction to 16s r rna sequencing cd genomics
Introduction to 16s r rna sequencing cd genomics
Dynah Perry
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular marker
KAUSHAL SAHU
 
The next generation sequencing platform of roche 454
The next generation sequencing platform of roche 454The next generation sequencing platform of roche 454
The next generation sequencing platform of roche 454
creativebiogene1
 
Tech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome BrowserTech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome Browser
Hoffman Lab
 

What's hot (20)

Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
Metagenomics sk presentation 17.10.2017
Metagenomics sk presentation 17.10.2017 Metagenomics sk presentation 17.10.2017
Metagenomics sk presentation 17.10.2017
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applications
 
Amplicon Sequencing Introduction
Amplicon Sequencing IntroductionAmplicon Sequencing Introduction
Amplicon Sequencing Introduction
 
Protein micro array
Protein micro arrayProtein micro array
Protein micro array
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Transcriptome analysis
Transcriptome analysisTranscriptome analysis
Transcriptome analysis
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and tools
 
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Studying the microbiome
Studying the microbiomeStudying the microbiome
Studying the microbiome
 
Proteomics
Proteomics   Proteomics
Proteomics
 
Introduction to 16s r rna sequencing cd genomics
Introduction to 16s r rna sequencing cd genomicsIntroduction to 16s r rna sequencing cd genomics
Introduction to 16s r rna sequencing cd genomics
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular marker
 
The next generation sequencing platform of roche 454
The next generation sequencing platform of roche 454The next generation sequencing platform of roche 454
The next generation sequencing platform of roche 454
 
Tech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome BrowserTech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome Browser
 

Viewers also liked

Toast 2015 qiime_talk
Toast 2015 qiime_talkToast 2015 qiime_talk
Toast 2015 qiime_talkTOASTworkshop
 
Esa 2014 qiime
Esa 2014 qiimeEsa 2014 qiime
Esa 2014 qiimeZech Xu
 
Bacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.pptBacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.pptRakesh Kumar
 
Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012gregcaporaso
 
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
jennomics
 
Introduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR GenomicsIntroduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR Genomics
Andrea Telatin
 
UC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomics
UC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomicsUC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomics
UC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomics
Jonathan Eisen
 
The benefits of environment specific curation of the public databases for tax...
The benefits of environment specific curation of the public databases for tax...The benefits of environment specific curation of the public databases for tax...
The benefits of environment specific curation of the public databases for tax...
Aaron Marc Saunders
 
[13.07.07] karst mewe13 dna_extraction_nonotes
[13.07.07] karst mewe13 dna_extraction_nonotes[13.07.07] karst mewe13 dna_extraction_nonotes
[13.07.07] karst mewe13 dna_extraction_nonotessorenkarst
 
Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...
Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...
Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...
Anupam Singh
 
Toast 2015 qiime_talk2
Toast 2015 qiime_talk2Toast 2015 qiime_talk2
Toast 2015 qiime_talk2TOASTworkshop
 
American Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk UniversityAmerican Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk University
mcdonadt
 
Introduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisIntroduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysis
Josh Neufeld
 
[13.09.19] 16S workshop introduction
[13.09.19] 16S workshop introduction[13.09.19] 16S workshop introduction
[13.09.19] 16S workshop introduction
Mads Albertsen
 
Biodiversity
BiodiversityBiodiversity
Biodiversity
Gulati Tutorials
 
Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12Jonathan Eisen
 
16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence AnalysisAbdulrahman Muhammad
 
GLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopGLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics Workshop
Morgan Langille
 

Viewers also liked (20)

Toast 2015 qiime_talk
Toast 2015 qiime_talkToast 2015 qiime_talk
Toast 2015 qiime_talk
 
Esa 2014 qiime
Esa 2014 qiimeEsa 2014 qiime
Esa 2014 qiime
 
Bacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.pptBacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.ppt
 
Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012
 
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
 
Introduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR GenomicsIntroduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR Genomics
 
UC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomics
UC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomicsUC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomics
UC Davis EVE 161 Lecture 7 - rRNA workflows - by Jonathan Eisen @phylogenomics
 
The benefits of environment specific curation of the public databases for tax...
The benefits of environment specific curation of the public databases for tax...The benefits of environment specific curation of the public databases for tax...
The benefits of environment specific curation of the public databases for tax...
 
[13.07.07] karst mewe13 dna_extraction_nonotes
[13.07.07] karst mewe13 dna_extraction_nonotes[13.07.07] karst mewe13 dna_extraction_nonotes
[13.07.07] karst mewe13 dna_extraction_nonotes
 
Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...
Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...
Policy Brief-Costly Disease: How to reduce out of pocket expenditure in Diabe...
 
Toast 2015 qiime_talk2
Toast 2015 qiime_talk2Toast 2015 qiime_talk2
Toast 2015 qiime_talk2
 
American Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk UniversityAmerican Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk University
 
Thesis
ThesisThesis
Thesis
 
16s
16s16s
16s
 
Introduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisIntroduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysis
 
[13.09.19] 16S workshop introduction
[13.09.19] 16S workshop introduction[13.09.19] 16S workshop introduction
[13.09.19] 16S workshop introduction
 
Biodiversity
BiodiversityBiodiversity
Biodiversity
 
Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12
 
16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis
 
GLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopGLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics Workshop
 

Similar to CCBC tutorial beiko

EVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - IntroductionEVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - Introduction
Jonathan Eisen
 
Talk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingTalk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meeting
Jonathan Eisen
 
Introduction-to-Bioinformatics-1.ppt
Introduction-to-Bioinformatics-1.pptIntroduction-to-Bioinformatics-1.ppt
Introduction-to-Bioinformatics-1.ppt
RichardEstradaC
 
Bioinformatics for Computer Scientists.ppt
Bioinformatics for Computer Scientists.pptBioinformatics for Computer Scientists.ppt
Bioinformatics for Computer Scientists.ppt
Abdullah Yousafzai
 
Ensembl Browser Workshop
Ensembl Browser WorkshopEnsembl Browser Workshop
Ensembl Browser Workshop
Denise Carvalho-Silva, PhD
 
Quality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingQuality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic Modeling
Stuti Nayak
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
c.titus.brown
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
Ankit Bhardwaj
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
Sijo A
 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
shashi bijapure
 
2014
20142014
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG: connecting the knowledge community
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 Introduction
Monica Munoz-Torres
 
Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128
GenomeInABottle
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.
Monica Munoz-Torres
 
2014 marine-microbes-grc
2014 marine-microbes-grc2014 marine-microbes-grc
2014 marine-microbes-grcc.titus.brown
 
Diversity Diversity Diversity Diversity ....
Diversity Diversity Diversity Diversity ....Diversity Diversity Diversity Diversity ....
Diversity Diversity Diversity Diversity ....
Jonathan Eisen
 
Open Minds Bring Open Collaborations
Open Minds Bring Open CollaborationsOpen Minds Bring Open Collaborations
Open Minds Bring Open Collaborations
Prashanth N Suravajhala
 
2014 09 30_t1_bioinformatics_wim_vancriekinge
2014 09 30_t1_bioinformatics_wim_vancriekinge2014 09 30_t1_bioinformatics_wim_vancriekinge
2014 09 30_t1_bioinformatics_wim_vancriekinge
Prof. Wim Van Criekinge
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part III
hhalhaddad
 

Similar to CCBC tutorial beiko (20)

EVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - IntroductionEVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - Introduction
 
Talk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingTalk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meeting
 
Introduction-to-Bioinformatics-1.ppt
Introduction-to-Bioinformatics-1.pptIntroduction-to-Bioinformatics-1.ppt
Introduction-to-Bioinformatics-1.ppt
 
Bioinformatics for Computer Scientists.ppt
Bioinformatics for Computer Scientists.pptBioinformatics for Computer Scientists.ppt
Bioinformatics for Computer Scientists.ppt
 
Ensembl Browser Workshop
Ensembl Browser WorkshopEnsembl Browser Workshop
Ensembl Browser Workshop
 
Quality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic ModelingQuality Assessment of Biomedical Metadata using Topic Modeling
Quality Assessment of Biomedical Metadata using Topic Modeling
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
 
2014
20142014
2014
 
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 Introduction
 
Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.
 
2014 marine-microbes-grc
2014 marine-microbes-grc2014 marine-microbes-grc
2014 marine-microbes-grc
 
Diversity Diversity Diversity Diversity ....
Diversity Diversity Diversity Diversity ....Diversity Diversity Diversity Diversity ....
Diversity Diversity Diversity Diversity ....
 
Open Minds Bring Open Collaborations
Open Minds Bring Open CollaborationsOpen Minds Bring Open Collaborations
Open Minds Bring Open Collaborations
 
2014 09 30_t1_bioinformatics_wim_vancriekinge
2014 09 30_t1_bioinformatics_wim_vancriekinge2014 09 30_t1_bioinformatics_wim_vancriekinge
2014 09 30_t1_bioinformatics_wim_vancriekinge
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part III
 

More from beiko

ASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptxASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptx
beiko
 
Beiko cmo gen_epi_monday
Beiko cmo gen_epi_mondayBeiko cmo gen_epi_monday
Beiko cmo gen_epi_monday
beiko
 
Beiko networks 2019_final
Beiko networks 2019_finalBeiko networks 2019_final
Beiko networks 2019_final
beiko
 
Biomedical data
Biomedical dataBiomedical data
Biomedical data
beiko
 
Rob csm2018
Rob csm2018Rob csm2018
Rob csm2018
beiko
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3
beiko
 
GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016
beiko
 
Beiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentationBeiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentation
beiko
 
DCSI presentation 2015
DCSI presentation 2015DCSI presentation 2015
DCSI presentation 2015
beiko
 
2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big data2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big data
beiko
 
Beiko cms final
Beiko cms finalBeiko cms final
Beiko cms final
beiko
 
Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?
beiko
 
Beiko hpcs
Beiko hpcsBeiko hpcs
Beiko hpcs
beiko
 
Gene sharing in microbes: good for the individual, good for the community?
Gene sharing in microbes: good for the individual, good for the community?Gene sharing in microbes: good for the individual, good for the community?
Gene sharing in microbes: good for the individual, good for the community?
beiko
 
Beiko biogeography
Beiko biogeographyBeiko biogeography
Beiko biogeography
beiko
 
2014 04-beiko-biology
2014 04-beiko-biology2014 04-beiko-biology
2014 04-beiko-biology
beiko
 
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"beiko
 
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
beiko
 
Beiko dcsi2013
Beiko dcsi2013Beiko dcsi2013
Beiko dcsi2013
beiko
 
Beiko smbe2013-final
Beiko smbe2013-finalBeiko smbe2013-final
Beiko smbe2013-finalbeiko
 

More from beiko (20)

ASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptxASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptx
 
Beiko cmo gen_epi_monday
Beiko cmo gen_epi_mondayBeiko cmo gen_epi_monday
Beiko cmo gen_epi_monday
 
Beiko networks 2019_final
Beiko networks 2019_finalBeiko networks 2019_final
Beiko networks 2019_final
 
Biomedical data
Biomedical dataBiomedical data
Biomedical data
 
Rob csm2018
Rob csm2018Rob csm2018
Rob csm2018
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3
 
GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016
 
Beiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentationBeiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentation
 
DCSI presentation 2015
DCSI presentation 2015DCSI presentation 2015
DCSI presentation 2015
 
2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big data2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big data
 
Beiko cms final
Beiko cms finalBeiko cms final
Beiko cms final
 
Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?
 
Beiko hpcs
Beiko hpcsBeiko hpcs
Beiko hpcs
 
Gene sharing in microbes: good for the individual, good for the community?
Gene sharing in microbes: good for the individual, good for the community?Gene sharing in microbes: good for the individual, good for the community?
Gene sharing in microbes: good for the individual, good for the community?
 
Beiko biogeography
Beiko biogeographyBeiko biogeography
Beiko biogeography
 
2014 04-beiko-biology
2014 04-beiko-biology2014 04-beiko-biology
2014 04-beiko-biology
 
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
 
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
 
Beiko dcsi2013
Beiko dcsi2013Beiko dcsi2013
Beiko dcsi2013
 
Beiko smbe2013-final
Beiko smbe2013-finalBeiko smbe2013-final
Beiko smbe2013-final
 

Recently uploaded

Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 

Recently uploaded (20)

Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 

CCBC tutorial beiko

  • 2. Welcome! Your Tutorial Team: Me (16S theory) Mike Hall (16S practical) Morgan Langille (metagenomics theory and practical) Special thanks to: Will Hsiao (CBW presentation) 2
  • 3. Today’s presentation CBW “Analysis of metagenomic data” 3 http://bioinformatics.ca/workshops/2015/analysis-metagenomic-data-2015
  • 4. Overview Morning session 1. A brief history of molecules and microbes 2. Why 16S? 3. How 16S analysis is usually done 4. Assumptions 5. Hands-on practical Afternoon session 1. 16S vs Metagenomics 2. Metagenome Taxonomic Composition 3. Metagenome Functional Composition 4. PICRUSt: Functional Inference 5. Hands-on practical 4
  • 5. Learning objectives At the end of the 16S tutorial, you should be able to do the following: 1. Run a simple QIIME analysis of a data set (https://www.dropbox.com/s/kpte51nm17wav9o/stool_data.zip) 2. Interpret analysis results 3. Understand the limitations of the standard 16S analysis pipeline 5
  • 6. Defining metagenomics Microbiome: Attributed to Joshua Lederberg by Hooper and Gordon (2001): “the collective genome of our indigenous microbes (microflora), the idea being that a comprehensive genetic view of Homo sapiens as a life-form should include the genes in our microbiome” Is also used to mean microbiota, the group of microorganisms found in a particular setting (usage varies: be careful and precise!) Metagenome: Handelsman et al. (1998) “…advances in molecular biology and eukaryotic genomics, which have laid the groundwork for cloning and functional analysis of the collective genomes of soil microflora, which we term the metagenome of the soil.” Does not encompass marker-gene surveys (e.g., 16S) This report says it does. 6
  • 7. Micro-what? Metagenomics is often defined to encompass only Bacteria and Archaea (and often Archaea are excluded too!) Other small things to consider: ◦ Viruses / phages ◦ Microbial eukaryotes ◦ Worms (helminths, nematodes, …) 7 Lukeš et al. (2015) PLoS Pathogens
  • 8. The dawn of metagenomics 3.5 BYA – the Archaean Eon 16S position 349 (-ish) ? G A Archaea Bacteria 8
  • 10. The 16S ribosomal RNA gene THE FIRST WORD IN MICROBIAL BIODIVERSITY 10
  • 11. 11 Yarza et al. (2014) Escherichia coli ribosome (PDB 4YBB) So much RNA!
  • 12. Why 16S? The “universal phylogenetic marker” (1) Present in all living organisms (2) Single copy* (no recombination) (3) Highly conserved + highly variable regions (4) Huge reference databases 12
  • 13. Milestones 13 1990: “proposal for the domains Archaea, Bacteria, and Eucarya”
  • 14. Milestones 14 Nature (1990) 2002: “…as much as 50% of the total surface microbial community…”
  • 15. Milestones 15 PNAS (2006) Many critical papers followed (error filtering, clustering approaches, …)
  • 16. Milestones 16 Huttenhower, Gevers et al. (2012) + 681 metagenomic samples
  • 19. Sample collection and DNA extraction Defined protocols exist, many kits (e.g. PowerSoil®) Need to consider barriers to DNA recovery and PCR (e.g. humic acids from soil, bile salts from feces) Additional mechanical approaches (e.g., mechanical lysis of tissues with bead beating) Kits and rogue lab DNA can end up in your sample – need to run negative controls!! ◦ Example from [year redacted]: shocking finding of bacterial DNA in the [location redacted]! However, [taxonomic group redacted] was a known frequent contaminant of DNA extraction kits. 19
  • 21. Choosing a PCR strategy Need to consider: ◦ Correct melting temperature (60-65 degrees C for Illumina protocol) ◦ DNA sequencing read length (influences choice of primers) ◦ Primer specificity! ◦ Comparability with previous studies? [Good luck with that] [but that’s what the Earth Microbiome Project protocol http://www.earthmicrobiome.org/emp-standard-protocols/16s/ is meant to achieve] 21
  • 22. Which variable regions to target? V1-V3 favours Prevotella, Fusobacterium, Streptococcus, Granulicatella, Bacteroides, Porphyromonas and Treponema V4-V6 favours Streptococcus, Treponema, Prevotella, Eubacterium, Porphyromonas, Campylobacter and Enterococcus. ◦ failed to detect Fusobacterium V7-V9 favours Veillonella, Streptococcus, Eubacterium, Enterococcus, Treponema, Catonella and Selenomonas. ◦ failed to detect Selenomonas, TM7 and Mycoplasma 22
  • 23. At least there’s no shortage of options… 23 Detailed in silico evaluation of primers, experimental evaluation of two sets Heavily biased recovery of Bacteria, Archaea, and missing groups depending on primer choice. “Out of the 175 primers and 512 primer pairs checked, only 10 can be recommended as broad-range primers.”
  • 25. Analysis (examples mostly from QIIME) 1. Quality Control ◦ Error checking 2. Sample diversity ◦ Taxonomy agnostic ◦ Taxonomy aware 3. Similarity among samples 4. Associations with metadata/groups (ANOSIM, MRPP) 5. Machine-learning classification 6. Functional prediction 25
  • 26. 26 QIIME Mothur A python interface to glue together many programs Single program with minimal external dependency Wrappers for existing programs Reimplementation of popular algorithms Large number of dependencies / VM available Easy to install and setup; work best on single multi-core server with lots of memory More scalable Less scalable Steeper learning curve but more flexible workflow if you can write your own scripts Easy to learn but workflow works the best with built-in tools http://www.ncbi.nlm.nih.gov/pubmed/2406 0131 http://www.mothur.org/wiki/MiSeq_SOP Will Hsiao
  • 27. “Analysis” #1 Quality Control 27 Quality score filtering: ◦ Minimal length of consecutive high-quality bases (as % of total read length) ◦ Maximal number of consecutive low-quality bases ◦ Maximal number of ambiguous bases (N’s) ◦ Minimum Phred quality score Other quality filtering tools available ◦ Cutadapt (https://github.com/marcelm/cutadapt) ◦ Trimmomatic (http://www.usadellab.org/cms/?page=trimmomatic) ◦ Sickle (https://github.com/najoshi/sickle) Chimera checking: ◦ UCHIME
  • 28. 28 Sequence quality summary using FASTQC http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
  • 29. Analysis #2 Within-sample (“alpha”) diversity To describe the diversity of a sample, you need to know what you are counting! Individual sequences? ◦ Most precise, but vulnerable to sequencing error effects – inflation of diversity Clusters of sequences? ◦ Operational taxonomic units (OTUs) – 97% sequence identity as the “species” level of similarity Taxonomic groups? ◦ It’s always reassuring to put names on things, but taxonomic labels can be extremely misleading 29
  • 30. OTU clustering 30 Choose a % identity threshold 97% Cluster centroids in some order (e.g., length, abundance) – these are reference sequences Continue procedure until all sequences are clustered OTU (singletons may be excluded) Calculate distances between sequences 6%
  • 31. What’s in a name? 31 Bacteroides Parabacteroides Ruminococcus ??? ??? ??? ??? Akkermansia
  • 32. Taxonomic assignment Many choices: BLAST – assign taxonomic label of closest match (simple, possibly too simple) Phylogenetic placement – e.g. Pplacer (Matsen et al., BMC Bioinformatics 2010) Machine-learning classification, in particular Naïve Bayes e.g. RDP Classifier, Wang et al. (2007) BMC Bioinformatics 32
  • 33. Example RDP Classifier output 33 GD6JEAT01AYGPE Root rootrank 1.0 Bacteria domain 1.0 "Planctomycetes" phylum 1.0 "Planctomycetacia"class 1.0 Planctomycetales order 1.0 Planctomycetaceaefamily 1.0 Schlesneria genus 0.96 GD6JEAT01BEUG6 Root rootrank 1.0 Bacteria domain 1.0 Firmicutes phylum 0.32 Clostridia class 0.26 Clostridiales order 0.23 Ruminococcaceae family 0.22 Anaerotruncus genus 0.19 Includes bootstrap support
  • 34. Calculating alpha diversity OTU counts – richness only Simpson index – probability of sampling two individuals of the same type Phylogenetic diversity – sum of branch lengths 34
  • 35. Example: human body-site diversity 35 Huttenhower, Gevers et al. (2012)
  • 36. Analysis #3 Among-sample (“beta”) diversity 1. Perform pairwise comparisons between all samples to build a dissimilarity matrix 2. Summarize the matrix using based on major patterns of covariance or hierarchical similarity 36
  • 37. Analysis #3 Among-sample (“beta”) diversity Given a pair of samples (described as e.g. OTU abundance), calculate their dissimilarity Beta-diversity measures can be: ◦ non-phylogenetic or phylogenetic ◦ weighted or unweighted There are a lot of measures! -Bray-Curtis (weighted, non-phylogenetic) -Jaccard (unweighted, non-phylogenetic) -Weighted UniFrac (weighted, phylogenetic) -… 37
  • 38. Analysis #3 Among-sample (“beta”) diversity How similar are the results of different measures? CORRELATIONS between calculated values 38 Parks and Beiko (2013): ISME J
  • 39. Analysis #3 Among-sample (“beta”) diversity What to do with a dissimilarity matrix? 39 Yatsunenko et al. (2012) Nature Parks and Beiko (2012) Mol Biol Evol Ordination Clustering
  • 40. Analysis #3 Among-sample (“beta”) diversity Different beta-diversity measures can yield dramatically different clusters! 40 Parks and Beiko (2013): ISME J
  • 41. Analysis #4 Associations with metadata PERMANOVA: Permutational multivariate analysis of variance ANOSIM: Rank-based analysis of similarity Mantel test: Comparison of between-group vs within-group distances 41 Good review: Anderson and Walsh (2013) Ecological Monographs Example: Weighted UniFrac distance: root compartment explains 46.62% of variance (PERMANOVA p<0.001) Unweighted UniFrac: root compartment explains only 18.07% of variance (PERMANOVA p<0.001); soil type is more important
  • 42. Analysis #5 Machine-learning classification Identify aspects of community structure that are predictive of sample attributes Advantages of machine-learning approaches: ◦ Non-linear combinations of variables ◦ Data transformations ◦ Can accommodate many different representations of the data Disadvantages: ◦ Complex, may “overfit” ◦ Can be time consuming ◦ Obfuscation of predictive rules 42
  • 43. Random forests (supervised_learning.py) 43 “…there are only weak and, for the most part, non-significant associations of particular taxa or overall diversity with the obese human gut that hold true across different studies. However, using supervised learning with receiver operator curves to maximize sensitivity and specificity, one can categorize subjects according to lean and obese states with in some cases considerable accuracy…”
  • 44. Tree-based classifications Nested clade analysis and feature selection Classification of plaque samples using support vector machines 44 Ning and Beiko (2015): Microbiome
  • 45. Analysis #6 Functional prediction PICRUSt: Langille et al (2013) Nat Biotechnol 45 Morgan can tell you about this…
  • 47. Do not assume that #1: 16S is an effective proxy for microbial diversity. #2: All 16S studies are created equal, with results that are comparable. #3: Rarefaction is a good idea. #4: 16S OTUs describe ecologically cohesive units (“species”?). #5: The 16S tree is the “Tree of Life”. 47
  • 48. Assumption #1 16S is an effective proxy for microbial diversity. 48 rrnDB: Stoddard et al. NAR (2014) Estimating copy number: Kembel et al. (2012) and PICRUSt (coming up later) Variation: Coenye and Vandamme (2003)
  • 49. Assumption #1 16S is an effective proxy for microbial diversity. Alternative marker genes: cpn60, rpoB, … Smaller reference databases! Protein-coding genes! 49
  • 50. Assumption #2 All 16S studies are created equal. Effects of sequencing platform, V region, amplicon vs metagenomics 50 Tremblay et al. (2015) Front Microbiol
  • 51. Assumption #3 Rarefaction is a good idea. Example of statistics before and after rarefaction: Loss of statistical power Random subsampling can increase false-positive differences Arbitrary minimum library size chosen for downsampling Alternatives e.g. Negative Binomial fitting (e.g., DeSeq2) 51 McMurdie and Holmes (2014) PLoS Comp Biol
  • 52. Assumption #4 16S OTUs describe ecologically cohesive units. 52 Distribution of sequence similarity (dashed line = OTU threshold) branch lengths Nguyen et al. (2016) npj Biofilms and Microbiomes
  • 53. Assumption #4 16S OTUs describe ecologically cohesive units. 53 Hall et al., in preparation Same OTU, different temporal patterns
  • 54. Assumption #4 16S OTUs describe ecologically cohesive units. 54 Many alternatives exist, including Swarm: Mahé et al. (2015) PeerJ
  • 55. Assumption #5 The 16S tree is the “Tree of Life”. 16S is limited for several reasons: Limited resolving power Subject to compositional bias Subject to recombination and lateral transfer Models typically applied to protein- coding genes do not make sense for noncoding RNA 55
  • 56. Moving On ADVENTURES IN “MULTI-OMICS” 56
  • 57. Multi-omics?? 16S can profile the biodiversity of a microbial sample… But we need the metagenome to shine a light on function… The metatranscriptome tells us what is expressed under specific conditions… And the metaproteome can quantify the relative abundance of different enzymes… While the metametabolome focuses on the products of metabolism. What do we really need? 57
  • 58. Metagenomic / metatranscriptomic AMD analysis - Hua et al., ISME J (2015) Draft genomes at MG-RAST
  • 59. 59 Differences in the microbiome between arsenic- exposed and control mice 16S taxonomic analysis + metametabolomics Taxonomy Metabolic function
  • 60. Hands on! LET’S MAKE SCIENCE HAPPEN 60
  • 62. Workflow 1. Retrieve data 2. Cluster sequences 3. Taxonomic classification 4. Phylogenetic tree construction 5. OTU table creation 6. Downstream visualization / analysis 62