Next-generation sequencing techniques such as Illumina and 454 pyrosequencing were discussed for applications including microbial genome sequencing and metagenomic profiling of microbial communities from targeted gene markers or shotgun sequencing. Key steps include library preparation, sequencing, and downstream bioinformatics analysis of sequencing data for tasks like genome assembly, gene annotation, and taxonomic classification of microbial taxa.
1. A DNA microarray contains thousands of DNA probes attached to a solid surface in defined locations. Each probe represents a single gene.
2. Sample mRNA is converted to fluorescently labeled cDNA and hybridized to the DNA microarray. The level of fluorescence indicates the expression level of each gene.
3. After washing, the microarray is scanned and analyzed to determine changes in gene expression between control and test samples. This allows high-throughput analysis of gene expression profiles.
The study of the complete set of RNAs (transcriptome) encoded by the genome of a specific cell or organism at a specific time or under a specific set of conditions is called Transcriptomics.
Transcriptomics aims:
I. To catalogue all species of transcripts, including mRNAs, noncoding RNAs and small RNAs.
II. To determine the transcriptional structure of genes, in terms of their start sites, 5′ and 3′ ends, splicing patterns and other post-transcriptional modifications.
III. To quantify the changing expression levels of each transcript during development and under different conditions.
Synthetic biology is the design and construction of novel artificial biological pathways, organisms, or devices. This document discusses synthetic biology tools and applications in pathway engineering, with a focus on plants. It provides examples of introducing the artemisinin biosynthesis pathway into tobacco using a combinatorial supertransformation approach, and engineering yeast to produce the sesquiterpene α-santalene. While synthetic biology has potential applications in health, environment and energy, it also faces risks that must be addressed including unintentional release of modified organisms and dual-use concerns.
The yeast two-hybrid system is used to identify protein-protein interactions. It involves fusing two interacting proteins to a DNA-binding domain and transcriptional activation domain. If the proteins interact, it brings the domains together and activates a reporter gene. This allows identification of novel interactions and domains involved. Some advantages are it occurs in vivo, can find weak interactions, and doesn't require purified proteins. Disadvantages include false positives and some proteins may not fold correctly in yeast.
Metagenomics is the study of genetic material recovered directly from environmental samples. Metagenomics is a molecular tool used to analyse DNA acquired from environmental samples, in order to study the community of microorganisms present, without the necessity of obtaining pure cultures.
1) 16S rRNA sequencing is the gold standard for bacterial identification and can identify novel, rare, or aberrant bacterial strains that other phenotypic methods cannot.
2) The document presents data from sequencing 300 clinical isolates, identifying many new species and some new genera, and providing definitive identifications in 88% of cases.
3) While powerful, 16S rRNA sequencing has some limitations like requiring a pure culture and difficulty differentiating closely related species, and interpretation requires consideration of database issues.
Microarrays allow researchers to analyze gene expression and detect mutations across thousands of genes simultaneously. They consist of miniaturized spots containing DNA, proteins, or other biomolecules immobilized on a solid surface. When a fluorescently labeled sample is applied, only matching molecules will hybridize, allowing for quantification. The main types are DNA microarrays for analyzing gene expression, tissue microarrays for pathology studies, and peptide arrays for protein interactions. DNA microarrays use glass slides coated with specific DNA sequences to analyze gene expression profiles in tissues or cells.
The document describes a seminar on high-throughput sequencing bioinformatics. It discusses analyzing microbiome samples using 16S rRNA sequencing and tools like Mothur and QIIME. It provides an overview of analyzing 16S sequences, including quality filtering, OTU clustering, classification, and diversity analysis. It also outlines running a Mothur tutorial to analyze a mock microbiome dataset from 21 samples using the Mothur MiSeq standard operating procedure.
1. A DNA microarray contains thousands of DNA probes attached to a solid surface in defined locations. Each probe represents a single gene.
2. Sample mRNA is converted to fluorescently labeled cDNA and hybridized to the DNA microarray. The level of fluorescence indicates the expression level of each gene.
3. After washing, the microarray is scanned and analyzed to determine changes in gene expression between control and test samples. This allows high-throughput analysis of gene expression profiles.
The study of the complete set of RNAs (transcriptome) encoded by the genome of a specific cell or organism at a specific time or under a specific set of conditions is called Transcriptomics.
Transcriptomics aims:
I. To catalogue all species of transcripts, including mRNAs, noncoding RNAs and small RNAs.
II. To determine the transcriptional structure of genes, in terms of their start sites, 5′ and 3′ ends, splicing patterns and other post-transcriptional modifications.
III. To quantify the changing expression levels of each transcript during development and under different conditions.
Synthetic biology is the design and construction of novel artificial biological pathways, organisms, or devices. This document discusses synthetic biology tools and applications in pathway engineering, with a focus on plants. It provides examples of introducing the artemisinin biosynthesis pathway into tobacco using a combinatorial supertransformation approach, and engineering yeast to produce the sesquiterpene α-santalene. While synthetic biology has potential applications in health, environment and energy, it also faces risks that must be addressed including unintentional release of modified organisms and dual-use concerns.
The yeast two-hybrid system is used to identify protein-protein interactions. It involves fusing two interacting proteins to a DNA-binding domain and transcriptional activation domain. If the proteins interact, it brings the domains together and activates a reporter gene. This allows identification of novel interactions and domains involved. Some advantages are it occurs in vivo, can find weak interactions, and doesn't require purified proteins. Disadvantages include false positives and some proteins may not fold correctly in yeast.
Metagenomics is the study of genetic material recovered directly from environmental samples. Metagenomics is a molecular tool used to analyse DNA acquired from environmental samples, in order to study the community of microorganisms present, without the necessity of obtaining pure cultures.
1) 16S rRNA sequencing is the gold standard for bacterial identification and can identify novel, rare, or aberrant bacterial strains that other phenotypic methods cannot.
2) The document presents data from sequencing 300 clinical isolates, identifying many new species and some new genera, and providing definitive identifications in 88% of cases.
3) While powerful, 16S rRNA sequencing has some limitations like requiring a pure culture and difficulty differentiating closely related species, and interpretation requires consideration of database issues.
Microarrays allow researchers to analyze gene expression and detect mutations across thousands of genes simultaneously. They consist of miniaturized spots containing DNA, proteins, or other biomolecules immobilized on a solid surface. When a fluorescently labeled sample is applied, only matching molecules will hybridize, allowing for quantification. The main types are DNA microarrays for analyzing gene expression, tissue microarrays for pathology studies, and peptide arrays for protein interactions. DNA microarrays use glass slides coated with specific DNA sequences to analyze gene expression profiles in tissues or cells.
The document describes a seminar on high-throughput sequencing bioinformatics. It discusses analyzing microbiome samples using 16S rRNA sequencing and tools like Mothur and QIIME. It provides an overview of analyzing 16S sequences, including quality filtering, OTU clustering, classification, and diversity analysis. It also outlines running a Mothur tutorial to analyze a mock microbiome dataset from 21 samples using the Mothur MiSeq standard operating procedure.
Microarrays allow researchers to analyze gene expression across thousands of genes simultaneously. DNA probes are arrayed on a small glass or nylon slide, and labeled mRNA from samples is hybridized to the probes. Fluorescent scanning detects which genes are expressed. Data analysis includes normalization, distance metrics, clustering, and visualization to group genes with similar expression profiles and identify patterns of co-regulated genes. Microarrays enable functional genomics studies of development, disease, response to drugs or environmental factors, and more.
The document discusses codon optimization, which is a process that improves gene expression and increases translational efficiency of a gene of interest. It does this by accounting for the codon bias of the host organism. Codons are three nucleotide sequences in RNA that specify which amino acid is needed for protein synthesis. The document also lists some common codon optimization tools like Twistbioscience, Idtdna, and Genscript.
Applications of genomics and proteomics pptIbad khan
Applications of genomics and proteomics ppt
genomics and proteomics ppt
in the field of health genomics and proteomics ppt
oncology ppt
biomedical application of genomics and proteomics ppt
agriculture application of genomics and proteomics ppt
proteomics in agriculture ppt
diagnosis of infectious disease ppt
personalized medicine ppt
The document discusses various "omics" fields of study including genomics, proteomics, metabolomics, and others. It provides definitions and descriptions of each type of omics, focusing on the large sets of biological molecules they each study such as genomes, proteomes, metabolomes, etc. It explains that omics fields examine biological data on a large scale and provide insights into biological processes, functions, and interactions on a systems-wide level.
Microarray technology allows researchers to analyze gene expression levels on a genomic scale. DNA microarrays contain many genes arranged on a slide that can be used to detect differences in gene expression between samples. The microarray workflow involves sample preparation, hybridization of labeled cDNA to the array, image scanning, data normalization and statistical analysis to identify differentially expressed genes between conditions. Multiple testing is a challenge and statistical methods must account for false positives and negatives.
Map-based cloning is a technique used to identify the genetic cause of a mutant phenotype by isolating overlapping DNA segments that progress along the chromosome toward a candidate gene. The process involves initially identifying a marker close to the gene of interest and then saturating the region with additional markers. Large populations are screened to find markers that rarely recombine with the gene. Genomic libraries are screened to find clones containing the markers, and chromosomal walking is used to obtain flanking markers on a single clone. DNA fragments between the markers are tested to rescue the wild-type phenotype and identify the candidate gene.
This document discusses functional genomics and its approaches. It defines functional genomics as the worldwide experimental approach to access the function of genes by using information from structural genomics. The key functional genomics approaches discussed are transcriptomics, proteomics, metabolomics, interactomics, epigenetics, and nutrigenomics. Modern techniques discussed include expressed sequence tags (ESTs), serial analysis of gene expression (SAGE), and microarray analysis.
Shotgun metagenomics sequencing allows researchers to comprehensively sample all genes in organisms present in a complex sample without culturing. This provides insights into bacterial diversity, abundance, and uncultured microbes. Bioinformatics pipelines guide analysis including quality filtering, assembly, binning, gene finding, fingerprinting, and phylogeny/diversity modeling to understand communities. Metagenomics has applications in antibiotic/drug discovery, bioremediation, agriculture, human microbiome mapping, and more. Tools like QIIME, Mothur, MEGAN, and MG-RAST facilitate large-scale metagenomic analysis.
The document provides an overview of plant genome sequence assembly, including:
1) A brief history of sequencing technologies and their improvements over time, from Sanger sequencing to newer technologies producing longer reads.
2) Key steps in a sequencing project including read processing, filtering, and corrections before assembly into contigs and scaffolds using appropriate software.
3) Factors to consider for experimental design and assembly optimization such as sequencing depth, library types, and software choices depending on the genome and data characteristics.
This document provides a summary of a seminar on comparative genomics techniques. It discusses three levels of genome research: structural genomics, functional genomics, and comparative genomics. Comparative genomics involves analyzing and comparing different genomes to study gene content, function, organization, and evolution. Techniques discussed include genome sequencing, mapping, and bioinformatics tools. The document also outlines what can be compared between genomes and how comparative genomics has provided insights into evolution and gene function.
Nanopore DNA sequencing is a fourth generation sequencing technique that involves passing single strands of DNA through a nanopore and detecting changes in electrical current caused by each nucleotide base. There are two main types of nanopores - biological nanopores which are protein channels inserted into membranes, and solid-state nanopores fabricated in thin materials like silicon nitride or graphene. Some examples of biological nanopores used for sequencing are the alpha-hemolysin pore and the MspA pore. Nanopore sequencing has advantages over other techniques in being label-free, capable of very long reads, and requiring low sample amounts. However, challenges remain in slowing DNA translocation for higher resolution and reducing noise in the electrical signals.
Metagenomics is the study of genomes recovered from environmental samples without culturing. It involves extracting DNA from an environmental sample and sequencing the DNA. This allows study of the 99% of microbes that cannot be cultured. Metagenomics has applications in discovering new antibiotics, enzymes, and understanding microbial communities and host interactions. It provides a culture-independent way to access genetic diversity and biotechnological potential from uncultured microorganisms.
This document discusses metagenomics, which is the study of genetic material recovered directly from environmental samples without culturing organisms. It outlines the difference between traditional genomics which studies one organism at a time in culture, versus metagenomics which sequences all DNA in a sample without culturing. The document then covers historical events in metagenomics, techniques used including direct DNA extraction and sequencing or function-based screening, applications such as discovering microbial diversity and novel enzymes, and future directions such as understanding human microbiomes and discovering novel pathways and organisms.
This document discusses the field of metagenomics, which involves directly extracting and sequencing genetic material from environmental samples without culturing individual microbial species. It provides a brief history of metagenomics from early microbiologists in the 17th century to recent large-scale sequencing projects. Methods of metagenomic analysis like sequence-driven and function-driven approaches are described. Applications to studying uncultured symbiotic microbes, extreme environments, and the human gut microbiome are also summarized.
DNA microarray technology allows researchers to analyze gene expression patterns across thousands of genes simultaneously. It involves affixing DNA probes to a solid surface in an orderly array and then measuring which genes are expressed by the level of hybridization with fluorescently labeled cDNA or cRNA from samples. The document discusses the history and principles of microarray techniques, including types such as cDNA and oligonucleotide microarrays. It also covers applications in genomics research and analysis of microarray data.
Probe labeling is defined as sequence use to search the mixture of nucleic acid for molecule containing complementary sequence.In molecular biology, hybridization is a phenomenon in which single-stranded deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) molecules anneal to complementary DNA or RNA
Comparative genomics involves systematically comparing genome sequences from different organisms. It uses computer programs to identify homologous genomic regions and align sequences at the base-pair level. Comparing genomes at different phylogenetic distances can provide insights into gene structure/function, evolution, and characteristics unique to each organism. Key tools for comparative genomics include genome browsers, aligners, and databases that classify orthologous gene clusters conserved across species.
Genome editing methods such as ZFNs, TALENs, and CRISPR/Cas9 use engineered nucleases to create targeted double-stranded breaks in DNA which are then repaired through endogenous cellular processes. These nucleases can be used to modify genomes through techniques like gene knockout, targeted mutation insertion/deletion/correction, and studying gene function. CRISPR/Cas9 uses a guide RNA and Cas9 nuclease to target specific DNA sequences for editing. The four main steps for CRISPR are: 1) selecting target sequences near a PAM site, 2) designing and cloning gRNA, 3) delivering Cas9 and gRNA into cells, and 4) DNA repair after cleavage results in gene modification
This document provides an overview and comparison of popular next-generation sequencing platforms. It discusses the common sequencing pipeline including library preparation, massively parallel sequencing, and bioinformatics analysis. Popular platforms like Roche 454, Illumina, and SOLiD are described in detail focusing on their specific sequencing chemistries and performance characteristics. Newer third-generation platforms such as Ion Torrent, PacBio, and Oxford Nanopore are also introduced. A wide range of NGS applications from whole genome sequencing to RNA-seq are outlined.
This document discusses high-throughput DNA sequencing technologies and their application to genome assembly projects. It provides a brief history of DNA sequencing, from early chemical and chain termination methods to current massively parallel sequencing technologies. It also describes several long-read sequencing technologies, including Pacific Biosciences SMRT sequencing and Oxford Nanopore sequencing. Examples are given of genome projects utilizing these technologies along with short-read sequencing data.
Microarrays allow researchers to analyze gene expression across thousands of genes simultaneously. DNA probes are arrayed on a small glass or nylon slide, and labeled mRNA from samples is hybridized to the probes. Fluorescent scanning detects which genes are expressed. Data analysis includes normalization, distance metrics, clustering, and visualization to group genes with similar expression profiles and identify patterns of co-regulated genes. Microarrays enable functional genomics studies of development, disease, response to drugs or environmental factors, and more.
The document discusses codon optimization, which is a process that improves gene expression and increases translational efficiency of a gene of interest. It does this by accounting for the codon bias of the host organism. Codons are three nucleotide sequences in RNA that specify which amino acid is needed for protein synthesis. The document also lists some common codon optimization tools like Twistbioscience, Idtdna, and Genscript.
Applications of genomics and proteomics pptIbad khan
Applications of genomics and proteomics ppt
genomics and proteomics ppt
in the field of health genomics and proteomics ppt
oncology ppt
biomedical application of genomics and proteomics ppt
agriculture application of genomics and proteomics ppt
proteomics in agriculture ppt
diagnosis of infectious disease ppt
personalized medicine ppt
The document discusses various "omics" fields of study including genomics, proteomics, metabolomics, and others. It provides definitions and descriptions of each type of omics, focusing on the large sets of biological molecules they each study such as genomes, proteomes, metabolomes, etc. It explains that omics fields examine biological data on a large scale and provide insights into biological processes, functions, and interactions on a systems-wide level.
Microarray technology allows researchers to analyze gene expression levels on a genomic scale. DNA microarrays contain many genes arranged on a slide that can be used to detect differences in gene expression between samples. The microarray workflow involves sample preparation, hybridization of labeled cDNA to the array, image scanning, data normalization and statistical analysis to identify differentially expressed genes between conditions. Multiple testing is a challenge and statistical methods must account for false positives and negatives.
Map-based cloning is a technique used to identify the genetic cause of a mutant phenotype by isolating overlapping DNA segments that progress along the chromosome toward a candidate gene. The process involves initially identifying a marker close to the gene of interest and then saturating the region with additional markers. Large populations are screened to find markers that rarely recombine with the gene. Genomic libraries are screened to find clones containing the markers, and chromosomal walking is used to obtain flanking markers on a single clone. DNA fragments between the markers are tested to rescue the wild-type phenotype and identify the candidate gene.
This document discusses functional genomics and its approaches. It defines functional genomics as the worldwide experimental approach to access the function of genes by using information from structural genomics. The key functional genomics approaches discussed are transcriptomics, proteomics, metabolomics, interactomics, epigenetics, and nutrigenomics. Modern techniques discussed include expressed sequence tags (ESTs), serial analysis of gene expression (SAGE), and microarray analysis.
Shotgun metagenomics sequencing allows researchers to comprehensively sample all genes in organisms present in a complex sample without culturing. This provides insights into bacterial diversity, abundance, and uncultured microbes. Bioinformatics pipelines guide analysis including quality filtering, assembly, binning, gene finding, fingerprinting, and phylogeny/diversity modeling to understand communities. Metagenomics has applications in antibiotic/drug discovery, bioremediation, agriculture, human microbiome mapping, and more. Tools like QIIME, Mothur, MEGAN, and MG-RAST facilitate large-scale metagenomic analysis.
The document provides an overview of plant genome sequence assembly, including:
1) A brief history of sequencing technologies and their improvements over time, from Sanger sequencing to newer technologies producing longer reads.
2) Key steps in a sequencing project including read processing, filtering, and corrections before assembly into contigs and scaffolds using appropriate software.
3) Factors to consider for experimental design and assembly optimization such as sequencing depth, library types, and software choices depending on the genome and data characteristics.
This document provides a summary of a seminar on comparative genomics techniques. It discusses three levels of genome research: structural genomics, functional genomics, and comparative genomics. Comparative genomics involves analyzing and comparing different genomes to study gene content, function, organization, and evolution. Techniques discussed include genome sequencing, mapping, and bioinformatics tools. The document also outlines what can be compared between genomes and how comparative genomics has provided insights into evolution and gene function.
Nanopore DNA sequencing is a fourth generation sequencing technique that involves passing single strands of DNA through a nanopore and detecting changes in electrical current caused by each nucleotide base. There are two main types of nanopores - biological nanopores which are protein channels inserted into membranes, and solid-state nanopores fabricated in thin materials like silicon nitride or graphene. Some examples of biological nanopores used for sequencing are the alpha-hemolysin pore and the MspA pore. Nanopore sequencing has advantages over other techniques in being label-free, capable of very long reads, and requiring low sample amounts. However, challenges remain in slowing DNA translocation for higher resolution and reducing noise in the electrical signals.
Metagenomics is the study of genomes recovered from environmental samples without culturing. It involves extracting DNA from an environmental sample and sequencing the DNA. This allows study of the 99% of microbes that cannot be cultured. Metagenomics has applications in discovering new antibiotics, enzymes, and understanding microbial communities and host interactions. It provides a culture-independent way to access genetic diversity and biotechnological potential from uncultured microorganisms.
This document discusses metagenomics, which is the study of genetic material recovered directly from environmental samples without culturing organisms. It outlines the difference between traditional genomics which studies one organism at a time in culture, versus metagenomics which sequences all DNA in a sample without culturing. The document then covers historical events in metagenomics, techniques used including direct DNA extraction and sequencing or function-based screening, applications such as discovering microbial diversity and novel enzymes, and future directions such as understanding human microbiomes and discovering novel pathways and organisms.
This document discusses the field of metagenomics, which involves directly extracting and sequencing genetic material from environmental samples without culturing individual microbial species. It provides a brief history of metagenomics from early microbiologists in the 17th century to recent large-scale sequencing projects. Methods of metagenomic analysis like sequence-driven and function-driven approaches are described. Applications to studying uncultured symbiotic microbes, extreme environments, and the human gut microbiome are also summarized.
DNA microarray technology allows researchers to analyze gene expression patterns across thousands of genes simultaneously. It involves affixing DNA probes to a solid surface in an orderly array and then measuring which genes are expressed by the level of hybridization with fluorescently labeled cDNA or cRNA from samples. The document discusses the history and principles of microarray techniques, including types such as cDNA and oligonucleotide microarrays. It also covers applications in genomics research and analysis of microarray data.
Probe labeling is defined as sequence use to search the mixture of nucleic acid for molecule containing complementary sequence.In molecular biology, hybridization is a phenomenon in which single-stranded deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) molecules anneal to complementary DNA or RNA
Comparative genomics involves systematically comparing genome sequences from different organisms. It uses computer programs to identify homologous genomic regions and align sequences at the base-pair level. Comparing genomes at different phylogenetic distances can provide insights into gene structure/function, evolution, and characteristics unique to each organism. Key tools for comparative genomics include genome browsers, aligners, and databases that classify orthologous gene clusters conserved across species.
Genome editing methods such as ZFNs, TALENs, and CRISPR/Cas9 use engineered nucleases to create targeted double-stranded breaks in DNA which are then repaired through endogenous cellular processes. These nucleases can be used to modify genomes through techniques like gene knockout, targeted mutation insertion/deletion/correction, and studying gene function. CRISPR/Cas9 uses a guide RNA and Cas9 nuclease to target specific DNA sequences for editing. The four main steps for CRISPR are: 1) selecting target sequences near a PAM site, 2) designing and cloning gRNA, 3) delivering Cas9 and gRNA into cells, and 4) DNA repair after cleavage results in gene modification
This document provides an overview and comparison of popular next-generation sequencing platforms. It discusses the common sequencing pipeline including library preparation, massively parallel sequencing, and bioinformatics analysis. Popular platforms like Roche 454, Illumina, and SOLiD are described in detail focusing on their specific sequencing chemistries and performance characteristics. Newer third-generation platforms such as Ion Torrent, PacBio, and Oxford Nanopore are also introduced. A wide range of NGS applications from whole genome sequencing to RNA-seq are outlined.
This document discusses high-throughput DNA sequencing technologies and their application to genome assembly projects. It provides a brief history of DNA sequencing, from early chemical and chain termination methods to current massively parallel sequencing technologies. It also describes several long-read sequencing technologies, including Pacific Biosciences SMRT sequencing and Oxford Nanopore sequencing. Examples are given of genome projects utilizing these technologies along with short-read sequencing data.
Making powerful science: an introduction to NGS and beyondAdamCribbs1
This slide deck is from the Botnar Research Centre introduction to NGS sequencing workshop 2021- an overview of the theoretical concepts behind sequencing are given
The document discusses RNA-seq analysis. It begins with an introduction to Mikael Huss, a bioinformatics scientist, and provides an overview of how genomics, RNA profiles, protein profiles, and interactomics relate within systems biology. The document then discusses how gene expression analysis can provide insights into basic research questions regarding tissue and cell identity, as well as insights into diseases by identifying genes that are over- or under-expressed in patients. Finally, it provides a brief overview of the typical workflow for RNA-seq analysis, which involves mapping RNA sequencing reads to a reference genome or transcriptome.
It contains information about- DNA Sequencing; History and Era sequencing; Next Generation Sequencing- Introduction, Workflow, Illumina/Solexa sequencing, Roche/454 sequencing, Ion Torrent sequencing, ABI-SOLiD sequencing; Comparison between NGS & Sangers and NGS Platforms; Advantages and Applications of NGS; Future Applications of NGS.
Next generation sequencing has revolutionized DNA sequencing by allowing millions of DNA fragments to be sequenced in parallel. This has increased sequencing speed and reduced costs. NGS is now used for applications like genome sequencing, transcriptome analysis, metagenomics, and studying genetic variation. It has provided insights into molecular biology and genomics that were not possible with previous sequencing methods.
The document provides information on PCR and RT-PCR including definitions, components, steps, types, and applications. PCR is described as a technique for amplifying a single DNA template using thermal cycling. It requires a DNA template, primers, Taq polymerase, dNTPs, and buffer. The main steps are denaturation of the DNA, annealing of primers, and elongation. RT-PCR is described as a technique for amplifying RNA using reverse transcriptase to generate cDNA, which is then amplified using PCR. Applications described include disease diagnosis, forensics, paternity testing, and detecting infections.
This document discusses wet-lab considerations for Illumina sequencing data analysis. It describes the typical Illumina sequencing workflow including library preparation, cluster formation, sequencing, and data analysis. It provides details on DNA and RNA input requirements, library construction steps like fragmentation and adapter ligation, and quality control methods. The document also discusses newer sequencing technologies like Pacific Biosciences and Oxford Nanopore sequencing.
This document discusses technical variability in PacBio full-length cDNA sequencing (Iso-Seq). It summarizes the Iso-Seq experimental and informatics pipelines, and analyzes read count variation between technical replicates and tissues. While technical variation is minimal, amplification biases from different enzymes and detection limits remain areas for improvement. Combining Iso-Seq with short-read data may help address these challenges.
The field of next-generation sequencing (NGS) has been experiencing explosive growth over the past several years and shows little sign of slowing down. The increasing capabilities and dramatically lowered costs have expanded NGS's reach beyond that of the human genome into nearly every corner of biological research. An overview of the platforms on the market today, including an assessment of their relative strengths and weaknesses, will be presented. The presentation will conclude with a peek into where the technology is going and what will be available in the future.
The document discusses sources affecting next-generation sequencing (NGS) quality and how to identify problematic NGS samples. It analyzes base sequencing quality, quality trimming, biases from base composition, potential contaminations, and gene content of two samples (A and B). Sample B showed poorer base quality, more unmapped reads, and evidence of Proteobacteria contamination compared to Sample A. Further quality control is recommended to identify issues before assembly.
The document summarizes a study that used Illumina Hi-seq sequencing to analyze taxon diversity in bulk insect samples. The researchers tested two approaches: 1) PCR-based amplification of the COI barcode region followed by Illumina sequencing, and 2) direct shotgun sequencing of total mitochondrial DNA without PCR. Both approaches showed potential for high-throughput environmental barcoding, though methodological improvements are still needed to address issues like taxonomic and biomass biases. The study demonstrates that Illumina sequencing can perform comparably to other platforms for analyzing mixed insect samples and may help solve amplification biases through a PCR-free method.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.Jennifer Shelton
This document summarizes Dr. Tara N. Marriage's RNA-Seq analysis of the cell cycle transcriptome in the multicellular alga Gonium pectorale. RNA was extracted from G. pectorale cells collected hourly across a 24 hour period and pooled into time points corresponding to different cell cycle phases. RNA-Seq libraries were constructed and sequenced, and the reads were mapped and analyzed for differential gene expression. Preliminary results identified over 2400 differentially expressed genes across the cell cycle and hierarchical clustering of expression profiles. Several key cell cycle genes were found to be differentially expressed during mitosis. The analysis is ongoing to further investigate cell cycle regulation and changes contributing to multicellularity in Gonium compared
AGRF in conjunction with EMBL Australia recently organised a workshop at Monash University Clayton. This workshop was targeted at beginners and biologists who are new to analysing Next-Gen Sequencing data. The workshop also aimed to provide users with a snapshot of bioinformatics and data analysis tips on how to begin to analyse project data. An introduction to RNA-seq data analysis was presented by AGRF Senior Bioinformatician Dr. Sonika Tyagi.
Presented: 1st August 2012
NGS has enabled high-throughput genome sequencing and analysis, changing genomic research. Technologies like Roche 454, Solexa/Illumina, and SOLiD allow massively parallel sequencing of genomes. NGS has applications in de novo genome sequencing, resequencing, RNA-seq, ChIP-seq, methylation analysis, and more. It provides advantages over microarrays like detecting novel transcripts, splicing variants, and sequence variations. NGS data requires processing including quality control, mapping, and variant identification to realize its full potential to revolutionize genomic research and medicine.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycleJennifer Shelton
This document summarizes an RNA-Seq analysis of the cell cycle transcriptome in the multicellular green alga Gonium pectorale. RNA was collected from G. pectorale cells across four time points in the cell cycle and sequenced. Differentially expressed genes were identified, including genes related to mitosis that were highly expressed at specific time points. Preliminary hierarchical clustering analysis grouped genes by expression patterns across the cell cycle. The goal is to compare the G. pectorale cell cycle transcriptome to Chlamydomonas to investigate changes in gene expression related to the evolution of multicellularity.
How to cluster and sequence an ngs library (james hadfield160416)James Hadfield
A presentation for people intersted in understanding how Illumina adapter ligation, clustering ands SBS sequencing work. Follow core-genomics http://core-genomics.blogspot.co.uk/
Updated: New High Throughput Sequencing technologies at the Norwegian Sequenc...Lex Nederbragt
Un update of the previous talk with the same title. A talk I gave at the Computational Life Science initiative (University of Oslo) about new High Throughput Sequencing instruments at the Norwegian Sequencing Centre. I also mentioned future upgrades, and the upcoming nanopore sequencing platform of Oxford nanopore.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
1. Next-Generation Sequencing of Microbial
Genomes and Metagenomes
Christine King
Farncombe Metagenomics Facility
Human Microbiome Journal Club
July 13, 2012
7. Which instrument(s) to use?
Read length vs number of reads
Cost per base, per sample, per project (multiplexing?)
Accuracy
Run time, wait time
Application Lengt # Accura Instruments Considerations
h Reads cy
De novo +++ ++ ++ MiSeq, 454, Ion Mix lengths
(small)
De novo +++ +++ ++ HiSeq, 454, Mix lengths, MP
(large) SOLiD
Re-seq ++ ++ ++ MiSeq, Ion Multiplex?
(small)
Re-seq (large) ++ +++ ++ HiSeq, SOLiD Enrichment?
RNA-seq + +++ + Illumina, SOLiD, Ref? Size?
(count) Ion Rare?
8. Library Preparation
Goal: fragments of DNA, each end flanked by adaptor
sequences
Adaptors contain amplification- and sequencing primer
binding sites; platform- and chemistry-specific
Optional: sample-specific barcodes/indexes/MIDs/tags
allow multiplexing during sequencing
Library QC: quantity, size
9. Library Preparation
Library types:
Shotgun (DNA)
May begin with ChIP
May follow with sequence capture
Mate pair (DNA)
Amplicon (DNA)
Total RNA
May enrich for mRNA (poly-A enrichment, rRNA depletion)
Convert to cDNA (then similar to DNA protocols)
Small RNA
RNA ligations, convert to cDNA after
13. Library Preparation: Mate Pair
Begin with large
fragments (e.g. 3kb,
20kb)
Circularize and
fragment again
Illumina: direct ligation
454: Cre/Lox
recombination
Enrich for fragments
containing the junction
Proceed with shotgun
library prep
14. Library Preparation: Mate Pair
Why? Paired
sequences are a known
distance apart;
improves genome
assembly
Note: 454 calls these
“paired end libraries”,
not to be confused with
Illumina’s “paired end
sequencing”!
15. Sequencing: Illumina
Cluster generation
Library fragments hybridize
to oligos on the flow cell
New strand synthesized,
original denatured,
removed
Free end binds to adjacent
oligos (bridge formation)
Complimentary strand
synthesized, denatured
(both tethered to flow cell)
Repeat to form clonal
cluster
Cleave one oligo, denature
to leave ssDNA clusters
~800K clusters/mm^2
16. Sequencing: Illumina
Variety of workflows:
Single- or paired end reads
0, 1, or 2 index reads
17. Sequencing: Illumina
At each cycle, all 4 fluorescently-labeled
nucleotides pass over the flow cell
Each cluster incorporates one nt (terminator) per
cycle
Fluor is imaged, then cleaved
De-block and repeat
18. Sequencing: Illumina
Other terminology:
cBot – accessory instrument that performs cluster
generation
Lanes – divisions (8) of HiSeq and GAIIx flow cells
PhiX – bacteriophage with small, balanced genome; PhiX
library spiked in with samples for QC
Phasing/pre-phasing – nt incorporation falls behind or
jumps ahead on a portion of strands in the cluster and
contributes to noise
Chastity filter – measures signal purity (after intensity
corrections); if the background signal is high, cluster will be
discarded
BaseSpace – cloud computing site for processing MiSeq
data
File format: fastq
19. Sequencing: 454
emPCR: clonal
amplification of
bead-bound library
in microdroplets
Library input
amounts critical!
One molecule per
bead
Titration procedure
20. Sequencing: 454
Library capture:
beads coated with
complimentary oligo
Amplification:
droplet contains
PCR reagents and
the other oligo
Post-PCR: millions
of identical
fragments attached
to the bead
21. Sequencing: 454
Bead Recovery: Enrichment: capture
physical and successfully
chemical disruption amplified beads
using biotinylated
primers + magnetic,
streptavidin beads
24. Sequencing: 454
Pyrosequencing
4 nucleotides flow
separately
If nt
incorporation…PPi...light
APS + PPi (sulfurylase)
ATP
Luciferin + ATP (luciferase)
light + oxyluciferin
Amount of light
proportional to #nt
incorporated
Rinse and repeat with next
nt
25. Sequencing: 454
Camera captures light
emitted from every well
during every nucleotide flow
26. Sequencing: 454
Flowgram: representation of a sequence, based on the
pattern of light emitted from a single well
27. Sequencing: 454
Other terminology:
Lib-L/Lib-A: adapter variants, “ligated” or “annealed”
Titanium chemistry: ~450 bp reads on all instruments
XL+ chemistry: ~700 bp reads on the FLX+ instrument
Flow: one of the four nucleotides flows over the PTP
Cycle: a set of four flows, in order
Valley flow: if number of bases incorporated in a given
read during that flow is uncertain, e.g. 1.5 units of light
(background signal, homopolymers)
File format: sff (standard flowgram format)
28. Sequencing: Ion Torrent
Procedures and
chemistry similar to 454
Instead of PPi, measure
H+ release (pH change)
via semiconductor chip
No expensive camera or
laser required, no
modified nucleotides
29. Sequence Quality
Phred (Q) Probabilit Base Call
Error probabilities
Score y of Error Accuracy determined using
(P)
training sets,
10 1 in 10 90%
platform-specific
20 1 in 100 99%
30 1 in 1K 99.9%
biases
40 1 in 10K 99.99% Expressed as a
50 1 in 100K 99.999% quality value (QV or Q
score) per base
Similar to PHRED
scores:
Q = -10 log10P
P = 10 -Q/10
30. Project 1: Microbial Genome
Considerations: Coverage
Reference genome? Depth (number of
How much coverage times a particular
do I want? base is “covered” by a
read (e.g. 25X)
How big is the
genome Breadth (% of genome
with at least 1X
How much data do I
coverage)
need?
bp needed = genome
size X coverage
Which
instrument/chemistry
configuration to use?
31. Project 1: Microbial Genome
Sample preparation
Isolate high quality (not
degraded) and high purity (no
RNA) gDNA
Verify on a gel
Quantify using dsDNA-specific
dye
Library preparation
Can do this yourself if you like
~ $200 per sample for Nextera
Cheaper protocols
Cheaper in bulk
Barcode compatibility
32. Project 1: Microbial Genome
Library QC
Insertsize confirmed on BioAnalyzer (within
range, no artifacts)
Pool barcoded libraries (normalize based on
PicoGreen quantification)
Absolute quantification of library pools using
qPCR
33. Project 1: Microbial Genome
MiSeq sequencing
Diluteand denature library pool (optimal
concentration requires titration...)
Spike in PhiX library as needed (e.g. 1%)
Prepare and load reagents, flow cell
Basic filtering and de-multiplexing performed
automatically
Download fastq files from BaseSpace
34. Project 1: Microbial Genome
Data processing Assembly:
Additional filtering overlapping reads
Trim the ends are assembled to
Remove PCR eachother based on
duplicates sequence similarity
= contigs
36. Project 2: Microbial Community
Shotgun Targeted
metagenomics metagenomics
Unbiased survey of Limited survey of
community content community content
Random library Targeted loci provide
fragments may excellent taxonomic
provide very little resolution, but may
taxonomic resolution exclude certain taxa
(e.g. conserved,
unknown)
Identify OTUs, classify
Identify genes, by taxonomy
classify by function
37. Project 2: Microbial Community
16S rRNA
Multi-copy gene (1.5
kb)
Conserved and
hypervariable regions
Extensive databases
from known species
38. Project 2: Microbial Community
Considerations: Sample preparation:
Biases in sampling Isolate
DNA
methods, culturing, PCR amplify, purify
DNA isolation, High-fidelity
PCR...replicate polymerase
Available SOPs Barcoded primers
How many reads per No primer dimers!
sample? NormalizePCR
Read length products and pool
matters!
39. Project 2: Microbial Community
454 Sequencing Data processing
emPCR titrations De-multiplexing
with different library Additionalfiltering
input Trim the barcodes,
Bulk emPCR primers
Sequence Check for chimeras
Basic filtering
Collect sff files
40. Project 2: Microbial Community
Clustering
Sequences grouped
by similarity = OTUs
41. Project 2: Microbial Community
Taxonomic
identification
OTUs are classifed by
comparing to known
16S sequences
Level of classification
(e.g. family vs
genus)?
Diversity
Within sample
Between samples