SlideShare a Scribd company logo
1 of 24
STAMPS 2018,
Shotgun Metagenomics!!
Titus Brown
whoami? -> Titus Brown
• Bioinformatics method development, evaluaton, and
application.
• Strangely passionate about environmental
metagenomics…
• Cross-cutting interests:
• Good (efficient, accurate, remixable) software
development, on the open source model.
• Reproducibility.
• Open science; preprints, blogging, Twitter.
@ctitusbrown on Twitter
http://ivory.idyll.org/blog/
The Plan
•Saturday afternoon: Intro to shotgun sequencing
and metagenome assembly
•Saturday evening: lobster feast!
•Sunday: break!
•Monday morning: binning genomes!
•Monday afternoon: more assembly!
•Monday evening: workflows (and werewolf ?)
•Tuesday morning: k-mers are awesome!
•Tuesday afternoon: Functional analysis!
•Tuesday evening: Capstone!
As usual…
Shotgun sequencing & de novo assembly:
It was the Gest of times, it was the wor
, it was the worst of timZs, it was the
isdom, it was the age of foolisXness
, it was the worVt of times, it was the
mes, it was Ahe age of wisdom, it was th
It was the best of times, it Gas the wor
mes, it was the age of witdom, it was th
isdom, it was tIe age of foolishness
It was the best of times, it was the worst of times, it was the age of wisdom, it was
the age of foolishness
Shotgun sequencing analogy:
feeding books into a paper shredder,
digitizing the shreds, and reconstructing
the book.
Although for books, we often know the language and not just the alphabet 
FASTQ etc.
@SRR606249.17/1
GAGTATGTTCTCATAGAGGTTGGTANNNNT
+
B@BDDFFFHHHHHJIJJJJGHIJHJ####1
@SRR606249.17/2
CGAANNNNNNNNNNNNNNNNNCCTGGCTCA
+
CCCF#################22@GHIJJJ
Shotgun sequencing
“Coverage” is the average number of reads that overlap
each true base in (meta)genome.
Here, the coverage is ~10 – just draw a line straight down from the top
through all of the reads.
Shotgun metagenomics: sequencing
communities.
Shotgun metagenomics
• Collect samples;
• Extract DNA;
• Feed into sequencer;
• Computationally analyze.
Wikipedia: Environmental shotgun
sequencing.png
Goals of shotgun metagenomics
• Expand beyond taxonomic/community structure characterization
possible with 16s;
• Analyze virus, plasmid, strain-level content;
• Evaluate metabolic capacity (e.g. “is nirK present?”)
• Reconstruct genomes from metagenomes, if possible.
16s? Or Shotgun metagenomics?
•Cons vs amplicon:
•lower coverage / more expensive (good? bad?
what are the tradeoffs?)
•much more computationally challenging to
analyze
•Pros vs amplicon:
•different bias (no primers)
•virus/phage can be detected
•function can be more directly detected
•recover (putative) genomes
Shotgun metagenome assembly: reconstruct
original genome by finding overlaps in data
UMD assembly primer (cbcb.umd.edu)
Randomly sequencing DNA, then finding overlaps and inferring true sequence:
Shotgun sequencing & de novo assembly:
It was the Gest of times, it was the wor
, it was the worst of timZs, it was the
isdom, it was the age of foolisXness
, it was the worVt of times, it was the
mes, it was Ahe age of wisdom, it was th
It was the best of times, it Gas the wor
mes, it was the age of witdom, it was th
isdom, it was tIe age of foolishness
It was the best of times, it was the worst of times, it was the age of wisdom, it was
the age of foolishness
Note: Shotgun metagenome data is always
incomplete.
Smaller circles represent (some of)
your actual community.
Blue circle represents what’s in your
sequencing data set.
Shotgun metagenome data may not
contain everything in your
community; may contain strain
variants; may contain “unknown”
microbes.
What can we do with shotgun
metagenomics?
• do taxonomic analysis directly on the reads - (Tuesday!)
• search the reads for genes of interest (function, taxonomy)
• assemble the reads into contigs, longer stretches of DNA - (next few
days)
• annotate contigs with taxonomy, function
• (note distinction between de novo assembly and reference-based assembly)
• cluster contigs together to extract genome bins, aka “metagenome
assembled genomes” - (Monday!)
• compare contig abundances across samples to look for differentially
abundant sequences
• (see Mike Lee’s diagram of All the Tools!)
Important notes on assembly:
•assembly squashes abundance 8:
•assembly ignores complicated regions :(
•assembly is surprisingly accurate and (when
using megahit, at least) computationally
tractable :)
Important notes on quantifying assembled
contigs:
• it’s not clear what tools to use for differential
abundance analysis
• we have a tutorial for quantifying contig
abundance with salmon - see
also notebook
• mapping abundances look wavy
Some open computational research
questions:
•right now assembly based approaches simply
…discard some proportion of the data. we
should figure out a better way.
•what is the value of long reads in shotgun
metagenomics?
Assembly results
% megahit -1 R1.fq.gz -2 R2.fq.gz
…
--- [STAT] 7774 contigs, total 4987609 bp, min 200 bp, max 8658 bp,
avg 642 bp, N50 1049 bp
% head final.contigs.fa
>k141_3 flag=0 multi=20.8090 len=230
TGATCCTGTAGTGACTAACGGACGGATGTAAGCATCTTTTAGACCATTACGTT
CTAACAATTCGTAAGTAATTTGAGTTAGTTGTTCTTCAGAATAATTCAATTTAAT
ATTCATGACTTCTGCTCCATACTTAAGCCTTTTGTAATGCTCATATGACTTAAAA
ATTTTAGTTTCTTCGTTGGTATTGTATGCTCTTATACCTTCGAAAACCCCATTTC
CATAGTGTAAA
Quast results
# contigs (>= 0 bp) 7774
# contigs (>= 1000 bp) 1308
# contigs (>= 5000 bp) 23
# contigs (>= 10000 bp) 0
Total length (>= 0 bp) 4987609
Total length (>= 1000 bp) 2560563
Total length (>= 5000 bp) 133727
Total length (>= 10000 bp) 0
# contigs 2398
Largest contig 8658
Total length 3331977
GC (%) 31.60
N50 1684
Open question: How much should I
sequence?
Open question: How accurate/effective is
functional classification on shotgun metagenome
data?

More Related Content

Similar to Stamps.pptx

2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes
c.titus.brown
 

Similar to Stamps.pptx (20)

2014 ucl
2014 ucl2014 ucl
2014 ucl
 
Sequencing @ BitLab
Sequencing @ BitLabSequencing @ BitLab
Sequencing @ BitLab
 
P
 Systems 
Model 
Optimisation 
by
 Means 
of 
Evolutionary 
Based 
Search
 ...
P
 Systems 
Model 
Optimisation 
by
 Means 
of 
Evolutionary 
Based 
Search
 ...P
 Systems 
Model 
Optimisation 
by
 Means 
of 
Evolutionary 
Based 
Search
 ...
P
 Systems 
Model 
Optimisation 
by
 Means 
of 
Evolutionary 
Based 
Search
 ...
 
Lecture on the annotation of transposable elements
Lecture on the annotation of transposable elementsLecture on the annotation of transposable elements
Lecture on the annotation of transposable elements
 
2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes
 
Cshl minseqe 2013_ouellette
Cshl minseqe 2013_ouelletteCshl minseqe 2013_ouellette
Cshl minseqe 2013_ouellette
 
Scalable Genome Analysis With ADAM
Scalable Genome Analysis With ADAMScalable Genome Analysis With ADAM
Scalable Genome Analysis With ADAM
 
2013 py con awesome big data algorithms
2013 py con awesome big data algorithms2013 py con awesome big data algorithms
2013 py con awesome big data algorithms
 
Theory and practice of graphical population analysis
Theory and practice of graphical population analysisTheory and practice of graphical population analysis
Theory and practice of graphical population analysis
 
Predicting Gene Loss in Plants: Lessons Learned From Laptop-Scale Data
Predicting Gene Loss in Plants: Lessons Learned From Laptop-Scale DataPredicting Gene Loss in Plants: Lessons Learned From Laptop-Scale Data
Predicting Gene Loss in Plants: Lessons Learned From Laptop-Scale Data
 
2014 naples
2014 naples2014 naples
2014 naples
 
40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?
 
2012 stamps-mbl-1
2012 stamps-mbl-12012 stamps-mbl-1
2012 stamps-mbl-1
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 Introduction
 
An introduction to Web Apollo for the Biomphalaria glabatra research community.
An introduction to Web Apollo for the Biomphalaria glabatra research community.An introduction to Web Apollo for the Biomphalaria glabatra research community.
An introduction to Web Apollo for the Biomphalaria glabatra research community.
 
Reproducible research - to infinity
Reproducible research - to infinityReproducible research - to infinity
Reproducible research - to infinity
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
The Impact of Information Technology on Chemistry and Related Sciences
The Impact of Information Technology on Chemistry and Related SciencesThe Impact of Information Technology on Chemistry and Related Sciences
The Impact of Information Technology on Chemistry and Related Sciences
 
Genome Assembly: the art of trying to make one BIG thing from millions of ver...
Genome Assembly: the art of trying to make one BIG thing from millions of ver...Genome Assembly: the art of trying to make one BIG thing from millions of ver...
Genome Assembly: the art of trying to make one BIG thing from millions of ver...
 
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
 

More from aaaa bbb

aassddffgghhjjkklliuuytgvScreening 2.pptx
aassddffgghhjjkklliuuytgvScreening 2.pptxaassddffgghhjjkklliuuytgvScreening 2.pptx
aassddffgghhjjkklliuuytgvScreening 2.pptx
aaaa bbb
 
Biodiversity jkygvkHotspot in India.pptx
Biodiversity jkygvkHotspot in India.pptxBiodiversity jkygvkHotspot in India.pptx
Biodiversity jkygvkHotspot in India.pptx
aaaa bbb
 
uselesshhhhhhhnahihaikajlanaghajsts.pptx
uselesshhhhhhhnahihaikajlanaghajsts.pptxuselesshhhhhhhnahihaikajlanaghajsts.pptx
uselesshhhhhhhnahihaikajlanaghajsts.pptx
aaaa bbb
 
yslides2453189g9oodfineandverygoood.pptx
yslides2453189g9oodfineandverygoood.pptxyslides2453189g9oodfineandverygoood.pptx
yslides2453189g9oodfineandverygoood.pptx
aaaa bbb
 

More from aaaa bbb (7)

aassddffgghhjjkklliuuytgvScreening 2.pptx
aassddffgghhjjkklliuuytgvScreening 2.pptxaassddffgghhjjkklliuuytgvScreening 2.pptx
aassddffgghhjjkklliuuytgvScreening 2.pptx
 
Isolation and screening_Class I - Copy.pptx
Isolation and screening_Class I - Copy.pptxIsolation and screening_Class I - Copy.pptx
Isolation and screening_Class I - Copy.pptx
 
Biodiversity jkygvkHotspot in India.pptx
Biodiversity jkygvkHotspot in India.pptxBiodiversity jkygvkHotspot in India.pptx
Biodiversity jkygvkHotspot in India.pptx
 
uselesshhhhhhhnahihaikajlanaghajsts.pptx
uselesshhhhhhhnahihaikajlanaghajsts.pptxuselesshhhhhhhnahihaikajlanaghajsts.pptx
uselesshhhhhhhnahihaikajlanaghajsts.pptx
 
yslides2453189g9oodfineandverygoood.pptx
yslides2453189g9oodfineandverygoood.pptxyslides2453189g9oodfineandverygoood.pptx
yslides2453189g9oodfineandverygoood.pptx
 
ngs.pptx
ngs.pptxngs.pptx
ngs.pptx
 
Useful.ppt
Useful.pptUseful.ppt
Useful.ppt
 

Recently uploaded

Call Girl Service In Dubai #$# O56521286O #$# Dubai Call Girls
Call Girl Service In Dubai #$# O56521286O #$# Dubai Call GirlsCall Girl Service In Dubai #$# O56521286O #$# Dubai Call Girls
Call Girl Service In Dubai #$# O56521286O #$# Dubai Call Girls
parisharma5056
 
Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)
Business Bay Call Girls || 0529877582 || Call Girls Service in Business Bay Dubai
 
FULL NIGHT — 9999894380 Call Girls In Mahipalpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Mahipalpur | DelhiFULL NIGHT — 9999894380 Call Girls In Mahipalpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Mahipalpur | Delhi
SaketCallGirlsCallUs
 
(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶
(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶
(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶
delhimunirka444
 
FULL NIGHT — 9999894380 Call Girls In Dwarka Mor | Delhi
FULL NIGHT — 9999894380 Call Girls In Dwarka Mor | DelhiFULL NIGHT — 9999894380 Call Girls In Dwarka Mor | Delhi
FULL NIGHT — 9999894380 Call Girls In Dwarka Mor | Delhi
SaketCallGirlsCallUs
 
FULL NIGHT — 9999894380 Call Girls In Badarpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Badarpur | DelhiFULL NIGHT — 9999894380 Call Girls In Badarpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Badarpur | Delhi
SaketCallGirlsCallUs
 
Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)
Business Bay Call Girls || 0529877582 || Call Girls Service in Business Bay Dubai
 
Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...
Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...
Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...
Business Bay Call Girls || 0529877582 || Call Girls Service in Business Bay Dubai
 
FULL NIGHT — 9999894380 Call Girls In Najafgarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Najafgarh | DelhiFULL NIGHT — 9999894380 Call Girls In Najafgarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Najafgarh | Delhi
SaketCallGirlsCallUs
 
Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)
Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)
Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)
Business Bay Call Girls || 0529877582 || Call Girls Service in Business Bay Dubai
 
Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...
Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...
Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...
home
 
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
home
 
UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)
UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)
UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)
Business Bay Call Girls || 0529877582 || Call Girls Service in Business Bay Dubai
 

Recently uploaded (20)

Young⚡Call Girls in Uttam Nagar Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Uttam Nagar Delhi >༒9667401043 Escort ServiceYoung⚡Call Girls in Uttam Nagar Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Uttam Nagar Delhi >༒9667401043 Escort Service
 
Storyboard short: Ferrarius Tries to Sing
Storyboard short: Ferrarius Tries to SingStoryboard short: Ferrarius Tries to Sing
Storyboard short: Ferrarius Tries to Sing
 
AaliyahBell_themist_v01.pdf .
AaliyahBell_themist_v01.pdf             .AaliyahBell_themist_v01.pdf             .
AaliyahBell_themist_v01.pdf .
 
Sirmaur Call Girls Book Now 8617697112 Top Class Pondicherry Escort Service A...
Sirmaur Call Girls Book Now 8617697112 Top Class Pondicherry Escort Service A...Sirmaur Call Girls Book Now 8617697112 Top Class Pondicherry Escort Service A...
Sirmaur Call Girls Book Now 8617697112 Top Class Pondicherry Escort Service A...
 
Call Girl Service In Dubai #$# O56521286O #$# Dubai Call Girls
Call Girl Service In Dubai #$# O56521286O #$# Dubai Call GirlsCall Girl Service In Dubai #$# O56521286O #$# Dubai Call Girls
Call Girl Service In Dubai #$# O56521286O #$# Dubai Call Girls
 
Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 00971588312479 # Call Girl Number In Dubai # (UAE)
 
(INDIRA) Call Girl Jammu Call Now 8617697112 Jammu Escorts 24x7
(INDIRA) Call Girl Jammu Call Now 8617697112 Jammu Escorts 24x7(INDIRA) Call Girl Jammu Call Now 8617697112 Jammu Escorts 24x7
(INDIRA) Call Girl Jammu Call Now 8617697112 Jammu Escorts 24x7
 
GENUINE EscoRtS,Call Girls IN South Delhi Locanto TM''| +91-8377087607
GENUINE EscoRtS,Call Girls IN South Delhi Locanto TM''| +91-8377087607GENUINE EscoRtS,Call Girls IN South Delhi Locanto TM''| +91-8377087607
GENUINE EscoRtS,Call Girls IN South Delhi Locanto TM''| +91-8377087607
 
FULL NIGHT — 9999894380 Call Girls In Mahipalpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Mahipalpur | DelhiFULL NIGHT — 9999894380 Call Girls In Mahipalpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Mahipalpur | Delhi
 
(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶
(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶
(9711106444 )🫦#Sexy Desi Call Girls Noida Sector 4 Escorts Service Delhi 🫶
 
FULL NIGHT — 9999894380 Call Girls In Dwarka Mor | Delhi
FULL NIGHT — 9999894380 Call Girls In Dwarka Mor | DelhiFULL NIGHT — 9999894380 Call Girls In Dwarka Mor | Delhi
FULL NIGHT — 9999894380 Call Girls In Dwarka Mor | Delhi
 
Young⚡Call Girls in Tughlakabad Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Tughlakabad Delhi >༒9667401043 Escort ServiceYoung⚡Call Girls in Tughlakabad Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Tughlakabad Delhi >༒9667401043 Escort Service
 
FULL NIGHT — 9999894380 Call Girls In Badarpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Badarpur | DelhiFULL NIGHT — 9999894380 Call Girls In Badarpur | Delhi
FULL NIGHT — 9999894380 Call Girls In Badarpur | Delhi
 
Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)
Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)
 
Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...
Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...
Pakistani Bur Dubai Call Girls # +971528960100 # Pakistani Call Girls In Bur ...
 
FULL NIGHT — 9999894380 Call Girls In Najafgarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Najafgarh | DelhiFULL NIGHT — 9999894380 Call Girls In Najafgarh | Delhi
FULL NIGHT — 9999894380 Call Girls In Najafgarh | Delhi
 
Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)
Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)
Deira Call Girls # 0588312479 # Call Girls In Deira Dubai ~ (UAE)
 
Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...
Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...
Verified # 971581275265 # Indian Call Girls In Deira By International City Ca...
 
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
 
UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)
UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)
UAE Call Girls # 0528675665 # Independent Call Girls In Dubai ~ (UAE)
 

Stamps.pptx

  • 2. whoami? -> Titus Brown • Bioinformatics method development, evaluaton, and application. • Strangely passionate about environmental metagenomics… • Cross-cutting interests: • Good (efficient, accurate, remixable) software development, on the open source model. • Reproducibility. • Open science; preprints, blogging, Twitter. @ctitusbrown on Twitter http://ivory.idyll.org/blog/
  • 3. The Plan •Saturday afternoon: Intro to shotgun sequencing and metagenome assembly •Saturday evening: lobster feast! •Sunday: break! •Monday morning: binning genomes! •Monday afternoon: more assembly! •Monday evening: workflows (and werewolf ?) •Tuesday morning: k-mers are awesome! •Tuesday afternoon: Functional analysis! •Tuesday evening: Capstone!
  • 5. Shotgun sequencing & de novo assembly: It was the Gest of times, it was the wor , it was the worst of timZs, it was the isdom, it was the age of foolisXness , it was the worVt of times, it was the mes, it was Ahe age of wisdom, it was th It was the best of times, it Gas the wor mes, it was the age of witdom, it was th isdom, it was tIe age of foolishness It was the best of times, it was the worst of times, it was the age of wisdom, it was the age of foolishness
  • 6. Shotgun sequencing analogy: feeding books into a paper shredder, digitizing the shreds, and reconstructing the book. Although for books, we often know the language and not just the alphabet 
  • 8. Shotgun sequencing “Coverage” is the average number of reads that overlap each true base in (meta)genome. Here, the coverage is ~10 – just draw a line straight down from the top through all of the reads.
  • 10. Shotgun metagenomics • Collect samples; • Extract DNA; • Feed into sequencer; • Computationally analyze. Wikipedia: Environmental shotgun sequencing.png
  • 11. Goals of shotgun metagenomics • Expand beyond taxonomic/community structure characterization possible with 16s; • Analyze virus, plasmid, strain-level content; • Evaluate metabolic capacity (e.g. “is nirK present?”) • Reconstruct genomes from metagenomes, if possible.
  • 12. 16s? Or Shotgun metagenomics? •Cons vs amplicon: •lower coverage / more expensive (good? bad? what are the tradeoffs?) •much more computationally challenging to analyze •Pros vs amplicon: •different bias (no primers) •virus/phage can be detected •function can be more directly detected •recover (putative) genomes
  • 13. Shotgun metagenome assembly: reconstruct original genome by finding overlaps in data UMD assembly primer (cbcb.umd.edu) Randomly sequencing DNA, then finding overlaps and inferring true sequence:
  • 14. Shotgun sequencing & de novo assembly: It was the Gest of times, it was the wor , it was the worst of timZs, it was the isdom, it was the age of foolisXness , it was the worVt of times, it was the mes, it was Ahe age of wisdom, it was th It was the best of times, it Gas the wor mes, it was the age of witdom, it was th isdom, it was tIe age of foolishness It was the best of times, it was the worst of times, it was the age of wisdom, it was the age of foolishness
  • 15. Note: Shotgun metagenome data is always incomplete. Smaller circles represent (some of) your actual community. Blue circle represents what’s in your sequencing data set. Shotgun metagenome data may not contain everything in your community; may contain strain variants; may contain “unknown” microbes.
  • 16. What can we do with shotgun metagenomics? • do taxonomic analysis directly on the reads - (Tuesday!) • search the reads for genes of interest (function, taxonomy) • assemble the reads into contigs, longer stretches of DNA - (next few days) • annotate contigs with taxonomy, function • (note distinction between de novo assembly and reference-based assembly) • cluster contigs together to extract genome bins, aka “metagenome assembled genomes” - (Monday!) • compare contig abundances across samples to look for differentially abundant sequences • (see Mike Lee’s diagram of All the Tools!)
  • 17. Important notes on assembly: •assembly squashes abundance 8: •assembly ignores complicated regions :( •assembly is surprisingly accurate and (when using megahit, at least) computationally tractable :)
  • 18. Important notes on quantifying assembled contigs: • it’s not clear what tools to use for differential abundance analysis • we have a tutorial for quantifying contig abundance with salmon - see also notebook • mapping abundances look wavy
  • 19. Some open computational research questions: •right now assembly based approaches simply …discard some proportion of the data. we should figure out a better way. •what is the value of long reads in shotgun metagenomics?
  • 20.
  • 21. Assembly results % megahit -1 R1.fq.gz -2 R2.fq.gz … --- [STAT] 7774 contigs, total 4987609 bp, min 200 bp, max 8658 bp, avg 642 bp, N50 1049 bp % head final.contigs.fa >k141_3 flag=0 multi=20.8090 len=230 TGATCCTGTAGTGACTAACGGACGGATGTAAGCATCTTTTAGACCATTACGTT CTAACAATTCGTAAGTAATTTGAGTTAGTTGTTCTTCAGAATAATTCAATTTAAT ATTCATGACTTCTGCTCCATACTTAAGCCTTTTGTAATGCTCATATGACTTAAAA ATTTTAGTTTCTTCGTTGGTATTGTATGCTCTTATACCTTCGAAAACCCCATTTC CATAGTGTAAA
  • 22. Quast results # contigs (>= 0 bp) 7774 # contigs (>= 1000 bp) 1308 # contigs (>= 5000 bp) 23 # contigs (>= 10000 bp) 0 Total length (>= 0 bp) 4987609 Total length (>= 1000 bp) 2560563 Total length (>= 5000 bp) 133727 Total length (>= 10000 bp) 0 # contigs 2398 Largest contig 8658 Total length 3331977 GC (%) 31.60 N50 1684
  • 23. Open question: How much should I sequence?
  • 24. Open question: How accurate/effective is functional classification on shotgun metagenome data?