Introduction to 16S rRNA gene multivariate analysisJosh Neufeld
Short introductory talk on multivariate statistics for 16S rRNA gene analysis given at the 2nd Soil Metagenomics conference in Braunschweig Germany, December 2013. A previous talk had discussed quality filtering, chimera detection, and clustering algorithms.
Course: Bioinformatics for Biologiacl Researchers (2014).
Session: 3.1- Introduction to Metagenomics. Applications, Approaches and Tools.
Statistics and Bioinformatisc Unit (UEB) from Vall d'Hebron Research Institute (www.vhir.org), Barcelona.
Metagenomics is the study of genetic material recovered directly from environmental samples. Metagenomics is a molecular tool used to analyse DNA acquired from environmental samples, in order to study the community of microorganisms present, without the necessity of obtaining pure cultures.
Introduction to 16S rRNA gene multivariate analysisJosh Neufeld
Short introductory talk on multivariate statistics for 16S rRNA gene analysis given at the 2nd Soil Metagenomics conference in Braunschweig Germany, December 2013. A previous talk had discussed quality filtering, chimera detection, and clustering algorithms.
Course: Bioinformatics for Biologiacl Researchers (2014).
Session: 3.1- Introduction to Metagenomics. Applications, Approaches and Tools.
Statistics and Bioinformatisc Unit (UEB) from Vall d'Hebron Research Institute (www.vhir.org), Barcelona.
Metagenomics is the study of genetic material recovered directly from environmental samples. Metagenomics is a molecular tool used to analyse DNA acquired from environmental samples, in order to study the community of microorganisms present, without the necessity of obtaining pure cultures.
Metagenomics is the study of a collection of genetic material (genomes) from a mixed community of organisms. Metagenomics usually refers to the study of microbial communities.
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun SequencesSurya Saha
Presented at Cornell Symbiosis symposium. Workflow for processing amplicon based 16S/ITS sequences as well as whole genome shotgun sequences are described. Slides include short description and links for each tool.
DISCLAIMER: This is a small subset of tools out there. No disrespect to methods not mentioned.
Sample Prep Solutions for Microbiome ResearchQIAGEN
An accurate molecular analysis of the microbial constituents of a particular community is contingent upon high-yielding and non-biased nucleic acid extraction methodologies. Only by ensuring that all species and classes of microorganisms present in a sample are effectively lysed during extraction will one be able to reliably assess the composition of that sample. An additional challenge faced in nucleic acid extraction is the presence of persistent, co-purifying polymerase inhibitors endogenous to one’s sample. This presentation will focus on nucleic acid extraction tools developed by MO BIO Laboratories that facilitate accurate non-biased community analysis and eliminate common amplification problems via the depletion of endogenous polymerase inhibitors using our patented Inhibitor Removal Technology.
A brief introduction to amplicon sequencing of the 16S rRNA gene for the analysis of microbial diversity. This talk was presented originally at the Workshop: Introduction to Systems Biology, Aalborg Denmark. 2013-10-29
Owing to the fragmented nature of EST reads, it is worthwhile to attempt to organize the reads into assemblies that provide a consensus view of the sampled transcripts.
Such fragmented, EST data or gene sequence data, placed in correct context, and indexed by gene such that all expressed data concerning a single gene is in a single index class, and each index class contains the information for only one gene is an EST cluster.
Metagenomics is the study of a collection of genetic material (genomes) from a mixed community of organisms. Metagenomics usually refers to the study of microbial communities.
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun SequencesSurya Saha
Presented at Cornell Symbiosis symposium. Workflow for processing amplicon based 16S/ITS sequences as well as whole genome shotgun sequences are described. Slides include short description and links for each tool.
DISCLAIMER: This is a small subset of tools out there. No disrespect to methods not mentioned.
Sample Prep Solutions for Microbiome ResearchQIAGEN
An accurate molecular analysis of the microbial constituents of a particular community is contingent upon high-yielding and non-biased nucleic acid extraction methodologies. Only by ensuring that all species and classes of microorganisms present in a sample are effectively lysed during extraction will one be able to reliably assess the composition of that sample. An additional challenge faced in nucleic acid extraction is the presence of persistent, co-purifying polymerase inhibitors endogenous to one’s sample. This presentation will focus on nucleic acid extraction tools developed by MO BIO Laboratories that facilitate accurate non-biased community analysis and eliminate common amplification problems via the depletion of endogenous polymerase inhibitors using our patented Inhibitor Removal Technology.
A brief introduction to amplicon sequencing of the 16S rRNA gene for the analysis of microbial diversity. This talk was presented originally at the Workshop: Introduction to Systems Biology, Aalborg Denmark. 2013-10-29
Owing to the fragmented nature of EST reads, it is worthwhile to attempt to organize the reads into assemblies that provide a consensus view of the sampled transcripts.
Such fragmented, EST data or gene sequence data, placed in correct context, and indexed by gene such that all expressed data concerning a single gene is in a single index class, and each index class contains the information for only one gene is an EST cluster.
New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...Lex Nederbragt
A talk I gave at the Microbiology Research Group (University of Oslo) about new High Throughput Sequencing instruments at the Norwegian Sequencing Centre. I also mentioned future upgrades, and the upcoming nanopore sequencing platform of Oxford nanopore
Next-generation sequencing (NGS) has revolutionized the way we analyze diseases and commercial outfits such as Illumina, Helicos, QIAGEN and Pacific Biosciences have made significant contributions. In addition, the launch of direct-to-consumer genetic testing solutions has dramatically changed the way consumers access genomics data. Until a few years ago, the cost of sequencing was a major bottleneck. Recent developments have reduced the cost from thousands of dollars to a couple of cents per megabase. When did these changes start? What were the changes in the commercial sector in the last 15 years? This infographic is a timeline of the NGS commercial marketplace.
Sequencing, Genome Assembly and the SGN PlatformSurya Saha
This talk was presented at IASRI Pusa on June 13th, 2014.
Centre for Agricultural Bioinformatics
Indian Agricultural Statistics Research Institute
Library Avenue, Pusa, New Delhi - 110012 (INDIA)
http://cabgrid.res.in/cabin/
Phylogenomic methods for comparative evolutionary biology - University Colleg...Joe Parker
Invited research seminar given to MSc students at University College Dublin on 24th October 2013.
I introduce the discipline of phylogenomics - comparative phylogenetic analyses of DNA sequences across genomes - and some of the applications and recent breakthroughs in the field.
As an in-depth case study I explain the methods and significance of our 2013 Nature paper on adaptive genotypic molecular convergence in echolocating mammals.
I then highlight some of the avenues of study on the frontiers of current research.
Molecular QC: Interpreting your Bioinformatics PipelineCandy Smellie
What is the impact of assay failure in your laboratory and how do you monitor for it?
The most heavily degraded samples are not suitable for standard exome coverage: sometimes it’s not even a matter of getting bad sequencing, you might get nothing at all!
FFPE artifacts increase with storage time
Artifacts go against the statistical power of your variant calling analysis
Molecular reference standards help filter out bad mappings and spurious variants
Bioinformatics pipelines allow adding Molecular Reference Standards in your joint variant calling pipeline
Genome In A Bottle Reference Standards are invaluable for validating variant calling analysis
NIST and its collaborators shared datasets created with most NGS technologies
Horizon Diagnostics shared annotated, merged variant calls from NIST for the Ashkenazim Trio
~35K variants are predicted having high or moderate impact within the Trio
GM24385 (Ashkenazim Son) includes 352 small variants with high/moderate impact which are absent in Father and Mother
Routinely monitor the performance of your workflows and assays with independent external controls
Dr. Douglas Marthaler - Use of Next Generation Sequencing for Whole Genome An...John Blue
Use of Next Generation Sequencing for Whole Genome Analysis of Pathogens - Dr. Douglas Marthaler, Veterinary Diagnostic Laboratory, College of Veterinary Medicine, University of Minnesota, from the 2016 Allen D. Leman Swine Conference, September 17-20, 2016, St. Paul, Minnesota, USA.
More presentations at http://www.swinecast.com/2016-leman-swine-conference-material
my students use ideas from my class on business models to develop a business model for ion proton's DNA sequencer. This sequencer uses semiconductor technology to read an organism's DNA sequence and is faster and cheaper than existing sequencers. This presentation describes the value proposition, customer selection, method of value capture and other aspects of a business model for Ion Proton's DNA sequencer
Next Generation Sequencing and its Applications in Medical Research - Frances...Sri Ambati
The so-called “next-generation” sequencing (NGS) technologies allows us, in a short time and in parallel, to sequence massive amounts of DNA, overcoming the limitations of the original Sanger sequencing methods used to sequence the first human genome. NGS technologies have had an enormous impact on biomedical research within a short time frame. This talk will give an overview of these applications with specific examples from Mendelian genomics and cancer research. #h2ony
NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...QIAGEN
This slidedeck discusses the most biologically efficient, cost-effective method for successful NGS. The GeneRead DNA QuantiMIZE Kits enable determination of the optimum conditions for targeted enrichment of DNA isolated from biological samples, while the GeneRead DNAseq Panels V2 allow you to quickly and reliably deep sequence your genes of interest. Applications in translational and clinical research are highlighted.
Course: Bioinformatics for Biomedical Research (2014).
Session: 2.1.1- Next Generation Sequencing. Technologies and Applications. Part I: NGS Introduction and Technology Overview.
Statistics and Bioinformatisc Unit (UEB) & High Technology Unit (UAT) from Vall d'Hebron Research Institute (www.vhir.org), Barcelona.
whole genome analysis
history
needs
steps involved
human genome data
NGS
pyrosequencing
illumina
SOLiD
Ion torrent
PacBio
applications
problems
benefits
Microbiology has experienced a transformation during the last 25 years that has altered microbiologists' view of microorganisms and how to study them. The realization that most microorganisms cannot be grown readily in pure culture forced microbiologists to question their belief that the microbial world had been conquered. We were forced to replace this belief with an acknowledgment of the extent of our ignorance about the range of metabolic and organismal diversity.
Use of DNA barcoding and its role in the plant species/varietal Identifica...Senthil Natesan
Plant DNA barcoding research is shifting beyond performance comparisons of different DNA regions towards practical applications. The main aim of DNA barcoding is to establish a shared community resource of DNA sequences that can be used for organismal identification and taxonomic clarification. This approach was successfully pioneered in animals using a portion of the cytochrome oxidase 1(CO1) mitochondrial gene. In plants, establishing a standardized DNA barcoding system has been more challenging. The studies on cucumis sp for the application of DNA barcode shows the possibility of discrimination at species level not the varietal level using the matK gene barcode. The phylogenetic tree constructed by using matK gene sequences clearly differentiated the species C. sativus and C. melo which will help for the future application in cucumis taxonomy and phylogeny studies
Microbiome Isolation and DNA Enrichment Protocol: Pathogen Detection Webinar ...QIAGEN
This slidedeck presents an easy-to-use workflow that allows selective isolation of microbial DNA from samples that are intrinsically rich in host DNA. This protocol includes steps for efficient depletion of host DNA while providing optimized conditions specific for bacterial lysis. This workflow is also specific for the identification of live bacteria, avoiding false results due to nucleic acids from dead bacteria. Enriched microbial DNA can be directly used in other molecular methods such as whole genome sequencing, qPCR and microarray assays.
Course: Bioinformatics for Biomedical Research (2014).
Session: 2.1.3- Next Generation Sequencing. Technologies and Applications. Part III: NGS Applications II.
Statistics and Bioinformatisc Unit (UEB) & High Technology Unit (UAT) from Vall d'Hebron Research Institute (www.vhir.org), Barcelona.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.jennomics
Presentation at a workshop conducted by the UC Davis Bioinformatics Core Facility: Using the Linux Command Line for Analysis of High Throughput Sequence Data, September 15-19, 2014
Microbiome Identification to Characterization: Pathogen Detection Webinar Ser...QIAGEN
The research community has begun correlating the makeup of individual microbiomes with disorders and diseases such as autism, atherosclerosis, obesity and cancer. To accomplish this, researchers must first identify and characterize these microbial communities. This slidedeck will begin with a general introduction of metagenomics and an overview of experimental strategies. Following this, a comprehensive microbiome assay pipeline will be introduced. We conclude with application-based examples that demonstrate how to identify and characterize microbiome profiles.
Molecular pathology in microbiology and metagenomicsCharithRanatunga
INTRODUCTION
HISTORY
Steps
Analysis
Metagenomic Process
Sequence-based analysis
Function-based analysis
Application of metagenomics
Future Directions of metagenomics
Examples for metagenomics projects
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
Introduction:
RNA interference (RNAi) or Post-Transcriptional Gene Silencing (PTGS) is an important biological process for modulating eukaryotic gene expression.
It is highly conserved process of posttranscriptional gene silencing by which double stranded RNA (dsRNA) causes sequence-specific degradation of mRNA sequences.
dsRNA-induced gene silencing (RNAi) is reported in a wide range of eukaryotes ranging from worms, insects, mammals and plants.
This process mediates resistance to both endogenous parasitic and exogenous pathogenic nucleic acids, and regulates the expression of protein-coding genes.
What are small ncRNAs?
micro RNA (miRNA)
short interfering RNA (siRNA)
Properties of small non-coding RNA:
Involved in silencing mRNA transcripts.
Called “small” because they are usually only about 21-24 nucleotides long.
Synthesized by first cutting up longer precursor sequences (like the 61nt one that Lee discovered).
Silence an mRNA by base pairing with some sequence on the mRNA.
Discovery of siRNA?
The first small RNA:
In 1993 Rosalind Lee (Victor Ambros lab) was studying a non- coding gene in C. elegans, lin-4, that was involved in silencing of another gene, lin-14, at the appropriate time in the
development of the worm C. elegans.
Two small transcripts of lin-4 (22nt and 61nt) were found to be complementary to a sequence in the 3' UTR of lin-14.
Because lin-4 encoded no protein, she deduced that it must be these transcripts that are causing the silencing by RNA-RNA interactions.
Types of RNAi ( non coding RNA)
MiRNA
Length (23-25 nt)
Trans acting
Binds with target MRNA in mismatch
Translation inhibition
Si RNA
Length 21 nt.
Cis acting
Bind with target Mrna in perfect complementary sequence
Piwi-RNA
Length ; 25 to 36 nt.
Expressed in Germ Cells
Regulates trnasposomes activity
MECHANISM OF RNAI:
First the double-stranded RNA teams up with a protein complex named Dicer, which cuts the long RNA into short pieces.
Then another protein complex called RISC (RNA-induced silencing complex) discards one of the two RNA strands.
The RISC-docked, single-stranded RNA then pairs with the homologous mRNA and destroys it.
THE RISC COMPLEX:
RISC is large(>500kD) RNA multi- protein Binding complex which triggers MRNA degradation in response to MRNA
Unwinding of double stranded Si RNA by ATP independent Helicase
Active component of RISC is Ago proteins( ENDONUCLEASE) which cleave target MRNA.
DICER: endonuclease (RNase Family III)
Argonaute: Central Component of the RNA-Induced Silencing Complex (RISC)
One strand of the dsRNA produced by Dicer is retained in the RISC complex in association with Argonaute
ARGONAUTE PROTEIN :
1.PAZ(PIWI/Argonaute/ Zwille)- Recognition of target MRNA
2.PIWI (p-element induced wimpy Testis)- breaks Phosphodiester bond of mRNA.)RNAse H activity.
MiRNA:
The Double-stranded RNAs are naturally produced in eukaryotic cells during development, and they have a key role in regulating gene expression .
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
Richard's entangled aventures in wonderlandRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...Scintica Instrumentation
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed tissue imaging, IVM allows for the ultra-fast high-resolution imaging of cellular processes over time and space and were studied in its natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provide insights into the progression of disease, response to treatments or developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system’s unique features and user-friendly software enables researchers to probe fast dynamic biological processes such as immune cell tracking, cell-cell interaction as well as vascularization and tumor metastasis with exceptional detail. This webinar will also give an overview of IVM being utilized in drug development, offering a view into the intricate interaction between drugs/nanoparticles and tissues in vivo and allows for the evaluation of therapeutic intervention in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancements of novel therapeutic strategies.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
1. Studying the microbiome
Mick Watson
Head of Bioinformatics, Edinburgh Genomics, University of Edinburgh
Research Group Leader, The Roslin Institute, University of Edinburgh
2. Edinburgh Genomics
• Genomics facility based at the University of Edinburgh
• Available for collaborations on an academic, non-profit basis
• Formed from merger of
– ARK-Genomics
– The GenePool
• Funded by three major bio UK research councils
• A range of technologies and expertise available
http://genomics.ed.ac.uk
3. Prevailing theory of the individual
• An individual consists of at least 10x as many bacterial cells as “host” cells *
• Each individual is a “supra-organism”
– a composite of host and microbial cells contribute the functions necessary for the
individual to survive
• The genetic landscape of any individual is a composite of the host genome
and the genomes of the millions of microbial symbionts that live on and
within that individual
• It is clearly important to take a holistic view when examining any animal
phenotype
My focus
• Move from discovery science to applied science
• “What’s there?” “What can we do with it?”
4. • The “ten times” figure
comes from a paper in
1972, and is estimated
from 1g of human faeces
• More modern estimates
range from equal to 100
times!
• American Society for
Microbiology 2014 report
puts the ratio closer to 3:1
• Panel included Peter
Turnbaugh
• There’s still more of them
though….
http://www.bostonglobe.com/ideas/2014/09/13/your-body-mostly-microbes-
actually-have-idea/qlcoKot4wfUXecjeVaFKFN/story.html
5. Microbiome research is undergoing a crisis
Please don’t make things worse
• Crisis 1
– The correlation/causation fallacy. For example….
– Patients with type II diabetes have a different gut microbiome compared
to healthy patients
– Does the microbiome cause diabetes?
– Or do they have a different microbiome because they have diabetes?
(therefore different diet)
• Crisis 2
– A lot of people want to do it, but don’t know how
– Errors, bad experimental design, incorrect conclusions
6. What is the microbiome?
“the ecological community of commensal,
symbiotic, and pathogenic microorganisms that
literally share our body space”
- Joshua Lederberg
Note: includes funghi, protists, archaea, bacteria, algae, viruses etc etc etc
(whisper it: most “microbiome” studies only look at bacteria/archaea)
7. How do we study the microbiome?
• Marker gene vs shotgun metagenomics
• Marker gene
– 16S / 18S / ITS
– Amplify this and compare
• Metagenomics
– Extract all DNA
– Fragment, sequence, interpret
• In theory, the latter least biased*
8. 16S studies are not metagenomics
http://phylogenomics.blogspot.co.uk/2012/08/referring-to-16s-surveys-as.html, http://biomickwatson.wordpress.com/2014/01/12/youre-probably-not-doing-metagenomics/
9. 16S
• Prokaryotic rRNA subunit
• Present in all (?) bacterial/archaeal genomes, contains constant
and hypervariable regions
• Hypervariable regions may give “species specific” signatures
10. 16S process
• Current sequencing technologies can’t sequence whole thing
• Design primers in constant regions and PCR
• Amplify 1 or more hypervariable regions
• Cluster similar sequences into OTUs
• Compare to 16S database and assign phylogenetic group
• Compare abundance across sample groups (QIIME, Mothur)
11. 16S problems
• Some genomes have multiple copies of the 16S gene
• The constant regions aren’t constant
– Design degenerate primers
– Some primers pick up certain groups better than others
– A perfect match primer will amplify better than one containing mis-matches
• The abundances from 16S are wrong, we simply hope that
they are consistently wrong across samples
• Absence really difficult to prove/wrong to assume
• Chimeras, PCR artefacts consisting of 16S gene fragments
from two different molecules
12. • Ashelford KE, Chuzhanova NA, Fry JC, Jones AJ, Weightman AJ. At least 1 in 20 16S rRNA sequence records
currently held in public repositories is estimated to contain substantial anomalies. Appl Environ Microbiol.
2005 71(12):7724-36.
15. Technology Advantages Disadvantages Output per run
Illumina Highly accurate; cheap;
Sequencing: what’s on the market?
industry leader; multiple
platforms
Slower than Ion; short
reads;
HiSeq X Ten: 18Tb
HiSeq X: 1.8Tb
2500:HO 600Gb -> 1Tb
2500:RO: 180Gb
NextSeq: 140Gb
MiSeq: 25Gb
Ion Torrent Fast; cheap machine Very poor on
homopolymers; doesn’t
match Illumina on
throughput
PGM: 2Gb
Proton P1: 10Gb
Proton P2: 30Gb
PacBio Long reads; single molecule High error rate, needs
correction; low
throughput; expensive
machine
300-500Mb
Oxford
Nanopore
MinION
Long reads; single molecule;
cheap; portable
High error rate; unknown
quantity
Unknown
Complete
Genomics
Highly accurate; cheap Limited to human; black
box
Unknown; human
genomes can be purchased
17. 16S sequencing strategy?
• Platform: MiSeq
• Theoretically:
– 2x150bp can sequence ~180bp amplicon
– 2x250bp can sequence ~480bp amplicon
– 2x300bp can sequence ~580bp amplicon
18. Important paper
• Amongst other
things, sequenced
a mock
community with
different
sequencing and
bioinformatics
strategies
• Kozich JJ, Westcott SL, Baxter
NT, Highlander SK, Schloss PD.
Development of a dual-index
sequencing strategy and
curation pipeline for analyzing
amplicon sequence data on
the MiSeq Illumina sequencing
platform. Appl Environ
Microbiol. 2013 S79(17):5112-
20.
19. • Three 16S regions sequenced using 2x250bp
– V4 (~250 bp), V34 (430bp), and V45 regions (~375 bp)
– In the Mock community, there should be 20 OTUs
20. 16S sequencing strategy?
• The only strategy that got close to the correct result is
complete overlap of 2x250bp MiSeq reads
22. Shotgun metagenomics
• Take ecosystem, extract all DNA and sequence it
• Should be unbiased, right?... Right?
• (NB: issues on the next few slides are also issues for
marker gene studies)
23. Extraction protocol
“we found that each DNA
extraction method resulted in
unique community patterns”
“We observed significant differences
in distribution of bacterial taxa
depending on the method.”
24. Storage
“Samples frozen with and without glycerol as cryoprotectant
indicated a major loss of Bacteroidetes in unprotected samples”
25. • In the chicken caecum, bacteroidetes dominate, followed by
firmicutes:
• Nordentoft S et al (2011) The influence of the cage system and colonisation of Salmonella Enteritidis on
the microbial gut flora of laying hens studied by T-RFLP and 454 pyrosequencing. BMC Microbiol. 11:187
26. • In the chicken caecum, firmicutes dominate, few
proteobacteria, no bacteroidetes
• Danzeisen JL et al (2011). Modulations of the chicken cecal microbiome and metagenome in response to
anticoccidial and growth promoter treatment. PLOS ONE. 6(11):e27949.
27. • Did I mention that microbiome research is
undergoing a crisis?
• It gets worse…..
28. Contamination
• Sequenced a pure culture of
Salmonella bongori
• Extracted DNA using different kits
• Did serial dilutions of the pure
culture to assess impact of
contaminating species
29.
30. The kits
• FastDNA Spin Kit For Soil (FP), MoBio UltraClean Microbial
DNA Isolation Kit (MB), QIAmp DNA Stool Mini Kit (QIA) and
PSP Spin Stool DNA Plus kit (PSP)
FP had a stable kit profile dominated by Burkholderia, PSP was dominated by
Bradyrhizobium, while the QIA kit had the most complex mix of bacterial DNA.
Bradyrhizobiaceae, Burkholderiaceae, Chitinophagaceae, Comomonadaceae,
Propionibacteriaceae and Pseudomonadaceae were present in at least three quarters of
the dilutions from PSP, FP and QIA kits. However, relative abundances of taxa at the
Family level varied according to kit: FP was marked by Burkholderiaceae and
Enterobacteriaceae, PSP was marked by Bradyrhizobiaceae and Chitinophagaceae. The
contamination in the QIA kit was relatively diverse in comparison to the other kits, and
included higher proportions of Aerococcaceae, Bacillaceae, Flavobacteriaceae,
Microbacteriaceae, Paenibacillaceae, Planctomycetaceae and Polyangiaceae than the
other kits. Kit MB did not have a distinct contaminant profile and varied from dilution to
dilution due to paucity of reads
31. “These metagenomic results therefore clearly
show that contamination becomes the dominant
feature of sequence data from low biomass
samples, and that the kit used to extract DNA can
have an impact on the observed bacterial
diversity”
32. From Salter et al:
“Tellingly, Laurence et al [1] recently
demonstrated with an in silico
analysis that Bradyrhizobium is a
common contaminant of
sequencing datasets including the
1000 Human Genome Project”
1. Laurence M, Hatzis C, Brash DE.
Common contaminants in next-generation
sequencing that hinder
discovery of low-abundance microbes.
PLoS One. 2014 9(5):e97876.
Adenoids are at the back of the nasal cavity
Bradyrhizobium is a soil bacterium
35. Shotgun metagenomics
• Can assemble
– MetaVelvet, Meta-IDBA, Ray Meta, MetAMOS
– Different techniques for partitioning
• Coverage, sequence composition, connectivity
• MetaWatt, CONCOCT
– Predict genes: Glimmer-MG, FragGenScan
• Use reference
– Kraken, PhyloSift, MetaPhlAn, HUMAnN
36. All-in-one solution
• EBI Metagenomics
• Hunter S, et al. EBI metagenomics--a new resource for the analysis and archiving of
metagenomic data. Nucleic Acids Res. 2014 42(Database issue):D600-6.
39. Conclusions
• I love microbiome research (honestly!)
• Really, incredibly exciting… but….
• Every step counts
• Be very careful, at all stages
• 16S – cheap, biased but effective
• WGS – expensive, information rich, less biased
• Beware contamination, include controls