Slides from my session on the SciLifeLab NBIS course "Introduction to Bioinformatics Using NGS Data". Held in Linköping, May 23 2018.
For more information about the course, see https://scilifelab.github.io/courses/ngsintro/1805/
BEST PRACTICE TO MAXIMIZE THROUGHPUT WITH NANOPORE TECHNOLOGY & DE NOVO SEQUE...Baptiste Mayjonade
1) The document discusses best practices for maximizing throughput when using Nanopore technology, including ensuring high purity and integrity of input DNA samples.
2) It describes using Nanopore sequencing to generate de novo reference genomes for genetic lines of Arabidopsis thaliana, with high quality assemblies obtained.
3) Generating long reads with Nanopore allows detection of structural variations between genomes, with the potential to improve genome-wide association mapping.
New Technologies at the Center for Bioinformatics & Functional Genomics at Mi...Andor Kiss
A review of recent molecular biology technologies at the core genomics facility at Miami University (Ohio). The aim of this talk is to introduce the facility's capabilities to faculty, graduate and undergraduate students at MiamiU.
Take your RNA research to the next level with QIAGEN LNA tools!QIAGEN
Download the flyer!
Experience truly exceptional RNA research with QIAGEN's next-generation, LNA®-enhanced tools. LNA (Locked Nucleic Acid) oligos bind with much higher affinity and specificity to RNA targets than standard DNA and RNA oligos – This enables specific and sensitive detection of small RNAs and discrimination between highly similar
sequences.
This document discusses the design, production, and application of Spike-In RNA Variants (SIRVs) created by Lexogen. It describes how SIRVs were modeled after natural mammalian genes and transcripts to include various transcript variants and isoforms. Their sequences were derived from viral genomes and modified to have no homology to other organisms. SIRVs were produced via in vitro transcription and tested in RNA-seq experiments mixed with other RNA controls and samples. They are intended to serve as external controls to help validate bioinformatics pipelines and the detection of transcript variants.
SeqsLab: a high performance genomics data analysis platform based on Apache S...Yun Lung Li
On the way to precision medicine, personal genomics is an essential topic since the genome holds the most fundamental information of individual. NGS is a comprehensive way to explore human genome, but with the intimidating amount of data and computation effort. Atgenomix SeqsLab is a high performance genomics data analysis platform aiming to close the Bio-IT gap.
Big data solution for ngs data analysisYun Lung Li
This document outlines a presentation on big data solutions for NGS data analysis using software containerization and distributed analytics. It discusses Docker containerization and its use at Atgenomix to simplify cluster environments. It also covers NGS genome analysis techniques like read mapping, variant calling, and using Spark and Hadoop for data parallelization. Elasticsearch is introduced for distributed, RESTful search and analytics of variant data.
The document discusses genomic research and sequencing technologies. It provides a history of genomic research from early sequencing methods like Sanger sequencing to modern massively parallel sequencing. It describes several next-generation sequencing platforms, including their read lengths, accuracy, applications, and differences. It emphasizes that data analysis is a major challenge and advises consulting sequencing facilities and having dedicated bioinformaticians for projects.
BEST PRACTICE TO MAXIMIZE THROUGHPUT WITH NANOPORE TECHNOLOGY & DE NOVO SEQUE...Baptiste Mayjonade
1) The document discusses best practices for maximizing throughput when using Nanopore technology, including ensuring high purity and integrity of input DNA samples.
2) It describes using Nanopore sequencing to generate de novo reference genomes for genetic lines of Arabidopsis thaliana, with high quality assemblies obtained.
3) Generating long reads with Nanopore allows detection of structural variations between genomes, with the potential to improve genome-wide association mapping.
New Technologies at the Center for Bioinformatics & Functional Genomics at Mi...Andor Kiss
A review of recent molecular biology technologies at the core genomics facility at Miami University (Ohio). The aim of this talk is to introduce the facility's capabilities to faculty, graduate and undergraduate students at MiamiU.
Take your RNA research to the next level with QIAGEN LNA tools!QIAGEN
Download the flyer!
Experience truly exceptional RNA research with QIAGEN's next-generation, LNA®-enhanced tools. LNA (Locked Nucleic Acid) oligos bind with much higher affinity and specificity to RNA targets than standard DNA and RNA oligos – This enables specific and sensitive detection of small RNAs and discrimination between highly similar
sequences.
This document discusses the design, production, and application of Spike-In RNA Variants (SIRVs) created by Lexogen. It describes how SIRVs were modeled after natural mammalian genes and transcripts to include various transcript variants and isoforms. Their sequences were derived from viral genomes and modified to have no homology to other organisms. SIRVs were produced via in vitro transcription and tested in RNA-seq experiments mixed with other RNA controls and samples. They are intended to serve as external controls to help validate bioinformatics pipelines and the detection of transcript variants.
SeqsLab: a high performance genomics data analysis platform based on Apache S...Yun Lung Li
On the way to precision medicine, personal genomics is an essential topic since the genome holds the most fundamental information of individual. NGS is a comprehensive way to explore human genome, but with the intimidating amount of data and computation effort. Atgenomix SeqsLab is a high performance genomics data analysis platform aiming to close the Bio-IT gap.
Big data solution for ngs data analysisYun Lung Li
This document outlines a presentation on big data solutions for NGS data analysis using software containerization and distributed analytics. It discusses Docker containerization and its use at Atgenomix to simplify cluster environments. It also covers NGS genome analysis techniques like read mapping, variant calling, and using Spark and Hadoop for data parallelization. Elasticsearch is introduced for distributed, RESTful search and analytics of variant data.
The document discusses genomic research and sequencing technologies. It provides a history of genomic research from early sequencing methods like Sanger sequencing to modern massively parallel sequencing. It describes several next-generation sequencing platforms, including their read lengths, accuracy, applications, and differences. It emphasizes that data analysis is a major challenge and advises consulting sequencing facilities and having dedicated bioinformaticians for projects.
The document discusses genomic research and sequencing technologies. It provides a history of genomic research from early sequencing methods like Sanger sequencing to modern massively parallel sequencing. It describes several next-generation sequencing platforms, their read lengths, accuracy, applications, and differences. It emphasizes that data analysis is a major challenge and recommends consulting sequencing facilities and having bioinformaticians available for analysis.
Whole Genome Sequencing - Data Processing and QC at SciLifeLab NGIPhil Ewels
Slides presented at the "Rare Disease Genomics" course held at the Centre for Molecular Medicine (Karolinska Institute, Stockholm, Sweden). Phil Ewels, 4th December 2017.
This document discusses the history and evolution of DNA sequencing technologies. It begins with early manual sequencing methods developed in the 1970s by Sanger and others. Automated Sanger sequencing and the sequencing of larger genomes followed in the 1980s-1990s. Next generation sequencing (NGS) methods were developed starting in 1996 and became commercially available in 2005, enabling massively parallel sequencing. NGS platforms such as 454, Illumina, and SOLiD are discussed. Third generation real-time sequencing methods such as PacBio and nanopore sequencing are also introduced, providing longer read lengths. The document compares key parameters of different sequencing methods such as read length, accuracy, throughput, cost and advantages/disadvantages.
Slides from my talk as part of the NBIS RNA-seq tutorial course. I describe how we process RNA-seq data at the Swedish National Genomics Infrastructure and how our NGI-RNAseq analysis pipeline works. https://github.com/SciLifeLab/NGI-RNAseq
DNA sequencing determines the order of nucleotides in a DNA segment. The first generation methods developed in 1977 were Maxam-Gilbert and Sanger sequencing, with Sanger becoming the most common. Next generation sequencing now includes Illumina, which is the most widely used, as well as PacBio and Nanopore third generation real-time single molecule techniques.
Next-Generation Sequencing an Intro to Tech and Applications: NGS Tech Overvi...QIAGEN
This slidedeck provides a technical overview of DNA/RNA preprocessing, template preparation, sequencing and data analysis. It covers the applications for NGS technologies, including guidelines for how to select the technology that will best address your biological question.
This document summarizes next generation sequencing and bioinformatics analysis pipelines. It discusses the Ion Torrent and Pacific Biosciences sequencing platforms, and provides examples of standard and in-house analysis pipelines for these platforms. It also gives three specific examples of in-house developed pipelines: (1) a local variant database for exome sequencing analysis, (2) de novo genome assembly using long reads from Pacific Biosciences, and (3) a clinical sequencing workflow for detecting mutations in leukemia patients. The document outlines ongoing developments for both sequencing technologies and analysis pipelines.
This document summarizes next generation sequencing and bioinformatics analysis pipelines. It discusses the Ion Torrent and Pacific Biosciences sequencing platforms, and provides examples of standard and in-house analysis pipelines for these platforms. It also gives three specific examples of in-house developed pipelines: (1) a local variant database for exome sequencing analysis, (2) de novo genome assembly using long reads from Pacific Biosciences, and (3) a clinical sequencing workflow for detecting mutations in leukemia patients. The document outlines ongoing developments with the sequencing platforms and analysis methods.
Introduction to Next-Generation Sequencing (NGS) TechnologyQIAGEN
The continuous evolution of NGS technology has led to an enormous diversification in NGS applications and dramatically decreased the costs to sequence a complete human genome.
In this presentation, we will discuss the following major topics:
• Basic overview of NGS sequencing technologies
• Next-generation sequencing workflow
• Spectrum of NGS applications
• QIAGEN universal NGS solutions
Slides from my talk as part of the NBIS ChIP-seq tutorial course. I describe how we process ChIP-seq data at the Swedish National Genomics Infrastructure and how our NGI-ChIPseq analysis pipeline works. https://github.com/SciLifeLab/NGI-ChIPseq
The document summarizes Ion Torrent sequencing technology. It detects hydrogen ions released during DNA polymerization rather than using optics. The sequencing occurs on semiconductor chips patterned through photolithography into wells, each sequencing a different template. As nucleotides are incorporated, hydrogen ions change the pH detected by ion sensors below each well. This allows massively parallel sequencing that is faster, cheaper and simpler than previous technologies.
This document discusses DNA sequencing techniques. It begins by defining DNA sequencing as determining the order of nucleotides in a DNA molecule. It then summarizes 4 generations of sequencing methods: 1) Sanger sequencing from 1977, the predominant early method; 2) Next generation techniques from 2005-2007 including 454, SOLiD, and Illumina that enabled high-throughput short reads; 3) Third generation methods like PacBio and Oxford Nanopore that enable long reads but with lower accuracy; and 4) Some additional terms used in sequencing like oligonucleotides, plasmids, and cloning vectors.
GENA was established in 2005 in Stavanger, Norway to provide DNA analysis services as a supplement to the National Institute of Forensic Medicine. It has a modern, high-capacity laboratory accredited for forensic DNA analysis. GENA provides rapid, high-throughput forensic DNA services through automated and sensitive technology. It is ISO 17025 accredited and experienced in forensic DNA services and research cooperation projects in bacterial sequencing, human disease testing, and aquaculture species identification. It is seeking to expand its DNA testing services and partners in biological testing.
EpiChrom 2019 - Updates in Epigenomics at the NGIPhil Ewels
Slides from my talk at the SciLifeLab EpiChrom 2019 meeting: https://www.scilifelab.se/epichrom-2019/
# New epigenomics services at the National Genomics Infrastructure
A quick walkthrough of new library preparation methods on offer to study epigenetic signals at the National Genomics Infrastructure.
This document provides an overview of nanopore sequencing. It begins by outlining the benefits of nanopore sequencing such as flexibility, single molecule sequencing, and cost-effectiveness. It then discusses the history and concept of nanopore sequencing, which was envisioned in the 1990s and involves using nanopores to sequence DNA strands. The document describes how Oxford Nanopore's technology uses protein nanopores embedded in membranes to detect the electrical signals of DNA as it passes through. It also explains the components involved, such as the biological or solid-state nanopore, array of microscaffolds, and sensor chip.
The document discusses next-generation sequencing (NGS) technologies and NGS targeted re-sequencing. It provides an overview of NGS technologies including their development over time. It then discusses NGS targeted re-sequencing by focusing on specific regions of interest through library enrichment techniques. Finally, it outlines the typical NGS exome sequencing pipeline from sample preparation to data processing, analysis and reporting of results.
Thoughts on the recent announcements by Oxford Nanopore TechnologiesKeith Bradnam
This document summarizes a presentation about announcements from Oxford Nanopore Technologies at the Advances in Genome Biology & Technology meeting. Oxford Nanopore unveiled two new sequencing devices - the desktop-sized GridION and the palm-sized MinION sequencer, which resembles a USB stick. Both devices use nanopore sequencing to directly sequence DNA or RNA in real-time without the need for sample preparation. This technology could potentially sequence an entire human genome in 15 minutes. Oxford Nanopore is aiming for increased sequencing accuracy and lower costs with these portable, easy-to-use sequencers.
Making powerful science: an introduction to NGS and beyondAdamCribbs1
This slide deck is from the Botnar Research Centre introduction to NGS sequencing workshop 2021- an overview of the theoretical concepts behind sequencing are given
Reproducible bioinformatics for everyone: Nextflow & nf-corePhil Ewels
Slides from my talk at the Karolinska Institute in Huddinge (Stockholm, Sweden). June 2022.
General introduction to Nextflow and nf-core, covering what they are and why you should use them!
Reproducible bioinformatics workflows with Nextflow and nf-corePhil Ewels
Slides from my talk at a GenomeWeb webinar, I discuss how we use Nextflow at the SciLifeLab National Genomics Infrastructure and how this led to the founding of the nf-core community project.
More Related Content
Similar to Lecture: NGS at the National Genomics Infrastructure
The document discusses genomic research and sequencing technologies. It provides a history of genomic research from early sequencing methods like Sanger sequencing to modern massively parallel sequencing. It describes several next-generation sequencing platforms, their read lengths, accuracy, applications, and differences. It emphasizes that data analysis is a major challenge and recommends consulting sequencing facilities and having bioinformaticians available for analysis.
Whole Genome Sequencing - Data Processing and QC at SciLifeLab NGIPhil Ewels
Slides presented at the "Rare Disease Genomics" course held at the Centre for Molecular Medicine (Karolinska Institute, Stockholm, Sweden). Phil Ewels, 4th December 2017.
This document discusses the history and evolution of DNA sequencing technologies. It begins with early manual sequencing methods developed in the 1970s by Sanger and others. Automated Sanger sequencing and the sequencing of larger genomes followed in the 1980s-1990s. Next generation sequencing (NGS) methods were developed starting in 1996 and became commercially available in 2005, enabling massively parallel sequencing. NGS platforms such as 454, Illumina, and SOLiD are discussed. Third generation real-time sequencing methods such as PacBio and nanopore sequencing are also introduced, providing longer read lengths. The document compares key parameters of different sequencing methods such as read length, accuracy, throughput, cost and advantages/disadvantages.
Slides from my talk as part of the NBIS RNA-seq tutorial course. I describe how we process RNA-seq data at the Swedish National Genomics Infrastructure and how our NGI-RNAseq analysis pipeline works. https://github.com/SciLifeLab/NGI-RNAseq
DNA sequencing determines the order of nucleotides in a DNA segment. The first generation methods developed in 1977 were Maxam-Gilbert and Sanger sequencing, with Sanger becoming the most common. Next generation sequencing now includes Illumina, which is the most widely used, as well as PacBio and Nanopore third generation real-time single molecule techniques.
Next-Generation Sequencing an Intro to Tech and Applications: NGS Tech Overvi...QIAGEN
This slidedeck provides a technical overview of DNA/RNA preprocessing, template preparation, sequencing and data analysis. It covers the applications for NGS technologies, including guidelines for how to select the technology that will best address your biological question.
This document summarizes next generation sequencing and bioinformatics analysis pipelines. It discusses the Ion Torrent and Pacific Biosciences sequencing platforms, and provides examples of standard and in-house analysis pipelines for these platforms. It also gives three specific examples of in-house developed pipelines: (1) a local variant database for exome sequencing analysis, (2) de novo genome assembly using long reads from Pacific Biosciences, and (3) a clinical sequencing workflow for detecting mutations in leukemia patients. The document outlines ongoing developments for both sequencing technologies and analysis pipelines.
This document summarizes next generation sequencing and bioinformatics analysis pipelines. It discusses the Ion Torrent and Pacific Biosciences sequencing platforms, and provides examples of standard and in-house analysis pipelines for these platforms. It also gives three specific examples of in-house developed pipelines: (1) a local variant database for exome sequencing analysis, (2) de novo genome assembly using long reads from Pacific Biosciences, and (3) a clinical sequencing workflow for detecting mutations in leukemia patients. The document outlines ongoing developments with the sequencing platforms and analysis methods.
Introduction to Next-Generation Sequencing (NGS) TechnologyQIAGEN
The continuous evolution of NGS technology has led to an enormous diversification in NGS applications and dramatically decreased the costs to sequence a complete human genome.
In this presentation, we will discuss the following major topics:
• Basic overview of NGS sequencing technologies
• Next-generation sequencing workflow
• Spectrum of NGS applications
• QIAGEN universal NGS solutions
Slides from my talk as part of the NBIS ChIP-seq tutorial course. I describe how we process ChIP-seq data at the Swedish National Genomics Infrastructure and how our NGI-ChIPseq analysis pipeline works. https://github.com/SciLifeLab/NGI-ChIPseq
The document summarizes Ion Torrent sequencing technology. It detects hydrogen ions released during DNA polymerization rather than using optics. The sequencing occurs on semiconductor chips patterned through photolithography into wells, each sequencing a different template. As nucleotides are incorporated, hydrogen ions change the pH detected by ion sensors below each well. This allows massively parallel sequencing that is faster, cheaper and simpler than previous technologies.
This document discusses DNA sequencing techniques. It begins by defining DNA sequencing as determining the order of nucleotides in a DNA molecule. It then summarizes 4 generations of sequencing methods: 1) Sanger sequencing from 1977, the predominant early method; 2) Next generation techniques from 2005-2007 including 454, SOLiD, and Illumina that enabled high-throughput short reads; 3) Third generation methods like PacBio and Oxford Nanopore that enable long reads but with lower accuracy; and 4) Some additional terms used in sequencing like oligonucleotides, plasmids, and cloning vectors.
GENA was established in 2005 in Stavanger, Norway to provide DNA analysis services as a supplement to the National Institute of Forensic Medicine. It has a modern, high-capacity laboratory accredited for forensic DNA analysis. GENA provides rapid, high-throughput forensic DNA services through automated and sensitive technology. It is ISO 17025 accredited and experienced in forensic DNA services and research cooperation projects in bacterial sequencing, human disease testing, and aquaculture species identification. It is seeking to expand its DNA testing services and partners in biological testing.
EpiChrom 2019 - Updates in Epigenomics at the NGIPhil Ewels
Slides from my talk at the SciLifeLab EpiChrom 2019 meeting: https://www.scilifelab.se/epichrom-2019/
# New epigenomics services at the National Genomics Infrastructure
A quick walkthrough of new library preparation methods on offer to study epigenetic signals at the National Genomics Infrastructure.
This document provides an overview of nanopore sequencing. It begins by outlining the benefits of nanopore sequencing such as flexibility, single molecule sequencing, and cost-effectiveness. It then discusses the history and concept of nanopore sequencing, which was envisioned in the 1990s and involves using nanopores to sequence DNA strands. The document describes how Oxford Nanopore's technology uses protein nanopores embedded in membranes to detect the electrical signals of DNA as it passes through. It also explains the components involved, such as the biological or solid-state nanopore, array of microscaffolds, and sensor chip.
The document discusses next-generation sequencing (NGS) technologies and NGS targeted re-sequencing. It provides an overview of NGS technologies including their development over time. It then discusses NGS targeted re-sequencing by focusing on specific regions of interest through library enrichment techniques. Finally, it outlines the typical NGS exome sequencing pipeline from sample preparation to data processing, analysis and reporting of results.
Thoughts on the recent announcements by Oxford Nanopore TechnologiesKeith Bradnam
This document summarizes a presentation about announcements from Oxford Nanopore Technologies at the Advances in Genome Biology & Technology meeting. Oxford Nanopore unveiled two new sequencing devices - the desktop-sized GridION and the palm-sized MinION sequencer, which resembles a USB stick. Both devices use nanopore sequencing to directly sequence DNA or RNA in real-time without the need for sample preparation. This technology could potentially sequence an entire human genome in 15 minutes. Oxford Nanopore is aiming for increased sequencing accuracy and lower costs with these portable, easy-to-use sequencers.
Making powerful science: an introduction to NGS and beyondAdamCribbs1
This slide deck is from the Botnar Research Centre introduction to NGS sequencing workshop 2021- an overview of the theoretical concepts behind sequencing are given
Reproducible bioinformatics for everyone: Nextflow & nf-corePhil Ewels
Slides from my talk at the Karolinska Institute in Huddinge (Stockholm, Sweden). June 2022.
General introduction to Nextflow and nf-core, covering what they are and why you should use them!
Reproducible bioinformatics workflows with Nextflow and nf-corePhil Ewels
Slides from my talk at a GenomeWeb webinar, I discuss how we use Nextflow at the SciLifeLab National Genomics Infrastructure and how this led to the founding of the nf-core community project.
ELIXIR Proteomics Community - Connection with nf-corePhil Ewels
ELIXIR Proteomics Community and the Nextflow nf-core community - a meeting to discuss the joint efforton the standardization of analytical workflows.
Find out more at https://nf-co.re/
SciLifeLab Coffee & Code, Sept 25th 2020.
An introduction to regular expressions at the SciLifeLab / NGI Sweden "Coffee 'n code" talk. Aimed at people who sort-of-know what regexes are, but find them a bit terrifying..
Watch the talk on YouTube: https://youtu.be/2Yp6kvdUMxM
Nextflow Camp 2019: nf-core tutorial (Updated Feb 2020)Phil Ewels
The nf-core community provides a range of tools to help new users get to grips with nextflow - both by providing complete pipelines that can be used out of the box, and also by helping developers with best practices. Companion tools can create a bare-bones pipeline from a template scattered with TO-DO pointers and CI with linting tools check code quality. Guidelines and documentation help to get nextflow newbies on their feet in no time. Best of all, the nf-core community is always on hand to help.
In this tutorial we discuss the best-practice guidelines developed by the nf-core community, why they're important and give insight into the best tips and tricks for budding nextflow pipeline developers.
----
Updated Feb 2020 to switch TravisCI for GitHub Actions, plus a couple of other tweaks.
The nf-core community provides a range of tools to help new users get to grips with nextflow - both by providing complete pipelines that can be used out of the box, and also by helping developers with best practices. Companion tools can create a bare-bones pipeline from a template scattered with TO-DO pointers and CI with linting tools check code quality. Guidelines and documentation help to get nextflow newbies on their feet in no time. Best of all, the nf-core community is always on hand to help.
In this tutorial we discuss the best-practice guidelines developed by the nf-core community, why they're important and give insight into the best tips and tricks for budding nextflow pipeline developers.
Slides from my talk given at the AWS Loft event in Stockholm, November 2018.
When genomic data is staged for analysis on Amazon S3, researchers have fast access to large volumes of data without needing to download and store their own copies. In this session, you will learn how a researcher at Sweden's SciLifeLab has made reference genome data available in the cloud as an AWS Public Dataset, and how this makes it easier for researchers to do large scale genomic analysis using tools like EMR and AWS Batch.
Talk from the SciLifeLab NGI NovaSeq seminar in September 2018. I describe how differences in illumina sequencing on the new NovaSeq 6000 can affect your sequencing, with illustrated examples from qcfail.com
This document discusses NGS quality control using MultiQC. It provides background on the large volume of sequencing data processed in 2016 at NGI Stockholm, including different sequencing types and the challenges of manual quality control. It then introduces MultiQC as a tool that parses key metrics from analysis results/logs to create a single HTML report summarizing a project. It provides information on how MultiQC works, how to install and run it, and exercises for users. It also discusses customizing MultiQC reports and developing new modules.
Developing Reliable QC at the Swedish National Genomics InfrastructurePhil Ewels
Good quality control procedures are essential for sequencing facilities. The SciLifeLab National Genomics Infrastructure is an accredited facility that processes thousands of samples every month, driving us to develop high-throughput QC procedures. We use a LIMS, a bespoke web system and most recently MultiQC - a tool that I have written to summarise analysis log files and produce reports that visualise key sample metrics.
In this talk I describe how our different systems integrate and how we use MultiQC results for both project level reporting and long term monitoring.
Standardising Swedish genomics analyses using nextflowPhil Ewels
The SciLifeLab National Genomics Infrastructure is one of the largest sequencing facilities in Europe. We are an accredited facility providing library preparation, sequencing, basic analysis and quality control for Swedish research groups. Our sample throughput requires a highly automated and robust bioinformatics platform. Until recently, we had multiple analysis pipelines built with a range of different workflow tools for each data type. This made development work difficult and led to inevitable technical debt. In this talk I will describe how we have migrated to Nextflow for a range of our data types, the difficulties that we faced and how we hope to leverage Nextflow to migrate to the cloud in coming years.
Using effective visual aids is important for getting across your message when describing data. This can be in a presentation, poster or paper. This talk goes through some basic design tips that can help your visual aids look professional and work effectively.
Written for the Enabling Excellence ETN. https://eetraining.wordpress.com/
This document discusses the bioinformatics analysis of ChIP-seq data. It begins with an overview of ChIP-seq experiments and the major steps in processing and analyzing the sequencing data, including quality control, alignment, peak calling, and downstream analyses. Pipelines for automated analysis are described, such as Cluster Flow and Nextflow. The talk emphasizes that there is no single correct approach and the analysis depends on the biological question and experimental design.
The document discusses how the internet and websites work from a technical perspective. It covers how a web address is resolved to a server, the basic components of a webpage like HTML, CSS and images, how databases and templates allow dynamic content, and how cookies are used to store information on a user's browser. Real examples of code are provided to illustrate these concepts. Useful links are also included for hosting and creating websites.
When I was asked to give a companion lecture in support of ‘The Philosophy of Science’ (https://shorturl.at/4pUXz) I decided not to walk through the detail of the many methodologies in order of use. Instead, I chose to employ a long standing, and ongoing, scientific development as an exemplar. And so, I chose the ever evolving story of Thermodynamics as a scientific investigation at its best.
Conducted over a period of >200 years, Thermodynamics R&D, and application, benefitted from the highest levels of professionalism, collaboration, and technical thoroughness. New layers of application, methodology, and practice were made possible by the progressive advance of technology. In turn, this has seen measurement and modelling accuracy continually improved at a micro and macro level.
Perhaps most importantly, Thermodynamics rapidly became a primary tool in the advance of applied science/engineering/technology, spanning micro-tech, to aerospace and cosmology. I can think of no better a story to illustrate the breadth of scientific methodologies and applications at their best.
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...University of Maribor
Slides from talk:
Aleš Zamuda: Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems.
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Inter-Society Networking Panel GRSS/MTT-S/CIS Panel Session: Promoting Connection and Cooperation
https://www.etran.rs/2024/en/home-english/
Authoring a personal GPT for your research and practice: How we created the Q...Leonel Morgado
Thematic analysis in qualitative research is a time-consuming and systematic task, typically done using teams. Team members must ground their activities on common understandings of the major concepts underlying the thematic analysis, and define criteria for its development. However, conceptual misunderstandings, equivocations, and lack of adherence to criteria are challenges to the quality and speed of this process. Given the distributed and uncertain nature of this process, we wondered if the tasks in thematic analysis could be supported by readily available artificial intelligence chatbots. Our early efforts point to potential benefits: not just saving time in the coding process but better adherence to criteria and grounding, by increasing triangulation between humans and artificial intelligence. This tutorial will provide a description and demonstration of the process we followed, as two academic researchers, to develop a custom ChatGPT to assist with qualitative coding in the thematic data analysis process of immersive learning accounts in a survey of the academic literature: QUAL-E Immersive Learning Thematic Analysis Helper. In the hands-on time, participants will try out QUAL-E and develop their ideas for their own qualitative coding ChatGPT. Participants that have the paid ChatGPT Plus subscription can create a draft of their assistants. The organizers will provide course materials and slide deck that participants will be able to utilize to continue development of their custom GPT. The paid subscription to ChatGPT Plus is not required to participate in this workshop, just for trying out personal GPTs during it.
ESPP presentation to EU Waste Water Network, 4th June 2024 “EU policies driving nutrient removal and recycling
and the revised UWWTD (Urban Waste Water Treatment Directive)”
Or: Beyond linear.
Abstract: Equivariant neural networks are neural networks that incorporate symmetries. The nonlinear activation functions in these networks result in interesting nonlinear equivariant maps between simple representations, and motivate the key player of this talk: piecewise linear representation theory.
Disclaimer: No one is perfect, so please mind that there might be mistakes and typos.
dtubbenhauer@gmail.com
Corrected slides: dtubbenhauer.com/talks.html
The debris of the ‘last major merger’ is dynamically youngSérgio Sacani
The Milky Way’s (MW) inner stellar halo contains an [Fe/H]-rich component with highly eccentric orbits, often referred to as the
‘last major merger.’ Hypotheses for the origin of this component include Gaia-Sausage/Enceladus (GSE), where the progenitor
collided with the MW proto-disc 8–11 Gyr ago, and the Virgo Radial Merger (VRM), where the progenitor collided with the
MW disc within the last 3 Gyr. These two scenarios make different predictions about observable structure in local phase space,
because the morphology of debris depends on how long it has had to phase mix. The recently identified phase-space folds in Gaia
DR3 have positive caustic velocities, making them fundamentally different than the phase-mixed chevrons found in simulations
at late times. Roughly 20 per cent of the stars in the prograde local stellar halo are associated with the observed caustics. Based
on a simple phase-mixing model, the observed number of caustics are consistent with a merger that occurred 1–2 Gyr ago.
We also compare the observed phase-space distribution to FIRE-2 Latte simulations of GSE-like mergers, using a quantitative
measurement of phase mixing (2D causticality). The observed local phase-space distribution best matches the simulated data
1–2 Gyr after collision, and certainly not later than 3 Gyr. This is further evidence that the progenitor of the ‘last major merger’
did not collide with the MW proto-disc at early times, as is thought for the GSE, but instead collided with the MW disc within
the last few Gyr, consistent with the body of work surrounding the VRM.
ESR spectroscopy in liquid food and beverages.pptxPRIYANKA PATEL
With increasing population, people need to rely on packaged food stuffs. Packaging of food materials requires the preservation of food. There are various methods for the treatment of food to preserve them and irradiation treatment of food is one of them. It is the most common and the most harmless method for the food preservation as it does not alter the necessary micronutrients of food materials. Although irradiated food doesn’t cause any harm to the human health but still the quality assessment of food is required to provide consumers with necessary information about the food. ESR spectroscopy is the most sophisticated way to investigate the quality of the food and the free radicals induced during the processing of the food. ESR spin trapping technique is useful for the detection of highly unstable radicals in the food. The antioxidant capability of liquid food and beverages in mainly performed by spin trapping technique.
Current Ms word generated power point presentation covers major details about the micronuclei test. It's significance and assays to conduct it. It is used to detect the micronuclei formation inside the cells of nearly every multicellular organism. It's formation takes place during chromosomal sepration at metaphase.
The binding of cosmological structures by massless topological defectsSérgio Sacani
Assuming spherical symmetry and weak field, it is shown that if one solves the Poisson equation or the Einstein field
equations sourced by a topological defect, i.e. a singularity of a very specific form, the result is a localized gravitational
field capable of driving flat rotation (i.e. Keplerian circular orbits at a constant speed for all radii) of test masses on a thin
spherical shell without any underlying mass. Moreover, a large-scale structure which exploits this solution by assembling
concentrically a number of such topological defects can establish a flat stellar or galactic rotation curve, and can also deflect
light in the same manner as an equipotential (isothermal) sphere. Thus, the need for dark matter or modified gravity theory is
mitigated, at least in part.
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills MN
Travis Hills of Minnesota developed a method to convert waste into high-value dry fertilizer, significantly enriching soil quality. By providing farmers with a valuable resource derived from waste, Travis Hills helps enhance farm profitability while promoting environmental stewardship. Travis Hills' sustainable practices lead to cost savings and increased revenue for farmers by improving resource efficiency and reducing waste.
Immersive Learning That Works: Research Grounding and Paths ForwardLeonel Morgado
We will metaverse into the essence of immersive learning, into its three dimensions and conceptual models. This approach encompasses elements from teaching methodologies to social involvement, through organizational concerns and technologies. Challenging the perception of learning as knowledge transfer, we introduce a 'Uses, Practices & Strategies' model operationalized by the 'Immersive Learning Brain' and ‘Immersion Cube’ frameworks. This approach offers a comprehensive guide through the intricacies of immersive educational experiences and spotlighting research frontiers, along the immersion dimensions of system, narrative, and agency. Our discourse extends to stakeholders beyond the academic sphere, addressing the interests of technologists, instructional designers, and policymakers. We span various contexts, from formal education to organizational transformation to the new horizon of an AI-pervasive society. This keynote aims to unite the iLRN community in a collaborative journey towards a future where immersive learning research and practice coalesce, paving the way for innovative educational research and practice landscapes.
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Leonel Morgado
Current descriptions of immersive learning cases are often difficult or impossible to compare. This is due to a myriad of different options on what details to include, which aspects are relevant, and on the descriptive approaches employed. Also, these aspects often combine very specific details with more general guidelines or indicate intents and rationales without clarifying their implementation. In this paper we provide a method to describe immersive learning cases that is structured to enable comparisons, yet flexible enough to allow researchers and practitioners to decide which aspects to include. This method leverages a taxonomy that classifies educational aspects at three levels (uses, practices, and strategies) and then utilizes two frameworks, the Immersive Learning Brain and the Immersion Cube, to enable a structured description and interpretation of immersive learning cases. The method is then demonstrated on a published immersive learning case on training for wind turbine maintenance using virtual reality. Applying the method results in a structured artifact, the Immersive Learning Case Sheet, that tags the case with its proximal uses, practices, and strategies, and refines the free text case description to ensure that matching details are included. This contribution is thus a case description method in support of future comparative research of immersive learning cases. We then discuss how the resulting description and interpretation can be leveraged to change immersion learning cases, by enriching them (considering low-effort changes or additions) or innovating (exploring more challenging avenues of transformation). The method holds significant promise to support better-grounded research in immersive learning.
The technology uses reclaimed CO₂ as the dyeing medium in a closed loop process. When pressurized, CO₂ becomes supercritical (SC-CO₂). In this state CO₂ has a very high solvent power, allowing the dye to dissolve easily.
Lecture: NGS at the National Genomics Infrastructure
1. NGI stockholm
NGI Sweden
Next Generation Sequencing at the
National Genomics Infrastructure
Phil Ewels
phil.ewels@scilifelab.se
Introduction to Bioinformatics Using NGS Data
Linköping, 2018-05-23
4. NGI stockholm
National Genomics Infrastructure
Proteomics
Metabolomics
Single-Cell Biology
Cellular & Molecular Imaging
Molecular Structure
Chemical Biology
Genome Engineering
Diagnostic Development
Drug Discovery & Development
NationalBioinformaticsInfrastructure
DataOffice
SciLifeLab NGI
Technology Platforms
Research Programs
National Genomics Infrastructure
5. NGI stockholm
SciLifeLab NGI
Stockholm Uppsala
Genomics Production SNP&Seq
Uppsala Genome CenterGenomics Applications DevelopmentGenomics Applications Development
National Genomics Infrastructure
6. NGI stockholm
SciLifeLab NGI
Our mission is to offer a
state-of-the-art infrastructure
for massively parallel DNA sequencing
and SNP genotyping, available to
researchers all over Sweden
7. NGI stockholm
SciLifeLab NGI
National resource
State-of-the-art
infrastructure
Guidelines and
support
We provide
guidelines and support
for sample collection, study
design, protocol selection and
bioinformatics analysis
9. NGI stockholm
NGI Organisation
Funding
Staff salaries
Premises and service
contracts
Capital equipment
Host universities
SciLifeLab
VR
KAW
User fees
Reagent costs
NGI Stockholm NGI Uppsala
10. NGI stockholm
Project timeline
Sample QC
Library
preparation,
Sequencing,
Genotyping
Data processing
and primary
analysis
Scientific support
and project
consultation
Data delivery
11. NGI stockholm
Project timeline
Sample QC
Library
preparation,
Sequencing,
Genotyping
Data processing
and primary
analysis
Scientific support
and project
consultation
Data delivery
12. NGI stockholm
Just
Sequencing
Methods offered at NGI
FFPE
Sequencing
10X
Genomics
Nanoporesequencing
ATAC-seq
UserQC
(cheap preps)
Hi-C(ox)Bisulphite
sequencing
RAD-seq
RNA
de novoDNA
Data analysis
pipelines included
13. NGI stockholm
NGI Stockholm
NGI Stockholm Projects in 2017
RNA-Seq
WG Re-Seq
De-Novo
Metagenomics
Targeted Re-Seq
ChIP-Seq
Epigenetics
RAD Seq
0 45 90 135 180
1
13
15
31
58
61
159
177
• RNA-seq is the most common project type
14. NGI stockholm
NGI Stockholm
• RNA-seq is the most common project type
• In total, NGI Sweden processed 1068 NGS projects with
almost 50 000 samples in 2017
NGI Stockholm Samples in 2017
RNA-Seq
WG Re-Seq
De-Novo
Metagenomics
Targeted Re-Seq
ChIP-Seq
Epigenetics
RAD Seq
0 4000 8000 12000 16000
192
180
397
5,909
4,496
211
4,551
15,022
15. NGI stockholm
NGI Stockholm
• Median turn around times from QC passed to data
delivered for 2017
• Sequencing only: 11.5 days
• RNA: 6.5 weeks
• WGS: 8 weeks
https://ngisweden.scilifelab.se/
file/stockholm_dashboard
19. NGI stockholm
Illumina Sequencing
• Largest provider of sequencing technology
• NGS machines use "Sequencing-by-synthesis"
• Developed at the University of Cambridge in 1990s
• Spun into a company called Solexa in 1998
• Solexa acquired by illumina in 2007
• Responsible for vast majority of DNA sequencing
experiments worldwide
30. NGI stockholm
Illumina at NGI
iSeq 100
Coming soon to NGI Uppsala
Small cheap runs
MiSeq Small runs, long reads (2x300bp)
HiSeq 2500 Primary machine for most of NGI's history
HiSeq X
Cheap, high throughput
Only allowed to run WGS with > 15X coverage
NovaSeq 6000
Newest machine, both Stockholm & Uppsala
Will eventually replace HiSeq 2500
31. NGI stockholm
How to choose
• Number of reads required
• How many samples, how deeply sequenced?
• Type of reads required
• Single End / Paired End, length?
• Urgency and cost
• Sharing flow cells with other users
• Best price for the project
32. NGI stockholm
Patterned flow cells
• New type of flow cell
• HiSeq 4000, HiSeq X, NovaSeq
• Single sequence per well
• Higher density, more data
• What's index-hopping?
• ExAmp can mix up index pairs in
tiny fraction of reads
• Avoided with dual unique indexes
33. Patterned flow cells
• Patterned flow cells can give "optical duplicates"
• https://sequencing.qcfail.com/articles/illumina-patterned-flow-
cells-generate-duplicated-sequences/
• Can be treated like regular PCR duplicates
HiSeq 2500 HiSeq 4000
34. NGI stockholm
Two colour chemistry
• Older SBS used four different fluorophores
• One for each nucleotide
• New machines use two
• Faster and cheaper
• NextSeq, NovaSeq, iSeq
• No signal = G
• Can get poly-G if something
goes wrong
https://sequencing.qcfail.com/articles/illumina-2-colour-
chemistry-can-overcall-high-confidence-g-bases/
35.
36. NGI stockholm
PacBio
• Pacific Biosciences - specialists in long reads
• Also uses fluorescent nucleotides
• Polymerases immobilised at the bottom of tiny wells give
off pulses as the nucleotides are incorporated
• Each well is independent, doesn't use sequencing
rounds like illumina
• Can work with much longer DNA fragments
• 250 bp – 60 kb (max ~160 kb)
40. NGI stockholm
PacBio Sequencing
• Long reads are excellent for de-novo genome assembly
and isoform detection
• Output is expensive compared to illumina, but getting
better
• Small genomes are no problem. Larger genomes are
now becoming more feasible.
• New amplification-free enrichment using CRISPR-Cas9
41.
42. NGI stockholm
Oxford Nanopore
• Newest contender in the sequencing world
• Lots of hype and taken several years to become a reality
• Still developing very fast
• Quality, yield and cost changing almost monthly
• High error rates (but better than they used to be)
• Now 2-13% depending on sequencing type
49. NGI stockholm
Oxford Nanopore
• The best technology available for ultra long reads
• Twitter users report getting reads over 1 Mbp long
• "Whale spotting" - finding the longest reads on the end
of the distribution curve
• Price dropping rapidly, but still expensive compared to
illumina
• NGI has 2x MinIONs, hoping for PromethION soon
50.
51. NGI stockholm
Ion Torrent
• Main application
• Microbial and metagenomic sequencing
• Targeted re-sequencing (gene panels)
• Clinical sequencing
• Short, single-end reads
• Fast run times
55. NGI stockholm
Sequencing Type
• No need to remember all of this
• Many considerations, changing all the time
• We are experts - come and speak to us!
support@ngisweden.se
https://ngisweden.scilifelab.se/
57. NGI stockholm
Library Preparation
• All high throughput sequencing requires some kind of
library preparation
• Add adapters for sequencing chemistry
• Adjust DNA fragment lengths
• Incorporate biological signal into sequence
• Add required enzymes
• Different library preps enable different applications
58. NGI stockholm
RNA Sequencing
• Choose a type of RNA
• Protein coding mRNA (poly-A)
• All RNA (rRNA depletion)
• Small RNA
• Choose your question
• Differential gene expression
• Differential isoform detection & quantification
• Fusion gene detection
• Define your limitations
• Low-input material
• Low quality material (eg. FFPE)
60. NGI stockholm
DNA Sequencing
• Choose your question
• SNP, SNV, indel calling
• Structural variant detection
• De-novo genome assembly
• Choose your priorities
• Sequencing accuracy
• Sequencing depth
• Ultra-long reads
• Define your requirements
• Low-input material
• Low quality material (eg. FFPE)
61. NGI stockholm
DNA Sequencing
• Illumina sequencing DNA library prep kits
• Illumina TruSeq DNA PCR Free
• Rubicon ThruPLEX
• Illumina Nextera XT
• Illumina Nextera Flex
• 10X Genomics
• Oxford Nanopore, PacBio, IonTorrent
Best quality
Low input
Cheap (plate format)
Fast and simple
Linked reads
62. NGI stockholm
10X Genomics
• Chromium instrument uses droplet emulsion technology
for nanoliter reaction volumes
• Linked-read sequencing
• Large molecules fragmented in droplets and barcoded
• Normal short-read illumina sequencing used
• Long fragments (20-100+ Kbp) reassembled from barcodes
• Regular illumina sequencing libraries produced
64. NGI stockholm
10X Genomics
• Single cell RNA sequencing
• Thousands of cells captured in droplets
• Each RNA molecule tagged with droplet
barcode
65. NGI stockholm
• Now testing Hi-C in NGI Stockholm
• Proximity ligation assay to detect physical colocation of
DNA fragments within cell nuclei
• Multiple applications for data
• Epigenetics
• De-novo genome assembly
• Structural variation detection
Hi-C
Chr 14
Chr14
66. NGI stockholm
Methylation Sequencing
• Bisulphite sequencing detects Cytosine methylation in
genomic DNA
• Unmethylated Cs converted to Uracil by bisulfites and sequenced as T
• Methylated Cs are protected and sequenced as C
• Oxidative bisulphite informs about hydroxy-methylation
• Current under development at NGI Stockholm
• PacBio and Oxford Nanopore able to detect some
native base modifications
67. NGI stockholm
RAD Sequencing
• Restriction-site Associated DNA sequencing, also
known as GBS (Genotyping By Sequencing)
• Genome fragmented using a restriction enzyme
• Narrow size range purified - same regions of genome for
all individuals
• Allows cheap high-depth variant calling for large
numbers of samples, without a reference genome
• Excellent for population genomics and ecology
69. NGI stockholm
Bioinformatics at NGI
• Raw sequencing data management
• Demultiplexing, data transfers, backups, delivery
• Quality control
• Every project is checked against quality criteria
• Automated analysis pipelines
• Standardised pipelines give reproducible results
• Software development
70. NGI stockholm
NGI Data Handling
Sequencer
Network
storage
Preprocessing
Backup
UPPMAX
(Irma)
UPPMAX
(Grus)
SNIC Supr
Authentication
Your
computer
Your analysis
server
71. NGI stockholm
Grus Deliveries
• UPPMAX tool for NGI data deliveries
• NGI creates a SNIC Supr "delivery project" for each NGI
sequencing project
• Project PI and contact person given access, according
to what was put on the order form
• Email sent with project ID and instructions
• Grus is for secure short term storage only
• Requires two-factor authentication
72. NGI stockholm
Analysis Pipelines
• Initial data analysis for major protocols
• Internal QC and standardised starting point for users
• All software open source and on GitHub
• http://opensource.scilifelab.se/
• http://github.com/SciLifeLab/
• Accredited facility
75. Sarek Somatic
• SNPs, SNVs and indels
• Structural variants
• Heterogeneity, ploidy and CNVs
• Germline and/or Somatic analysis
• Formerly called Cancer Analysis Workflow
https://github.com/SciLifeLab/Sarek
MuTect2
Strelka
FreeBayes
GATK
HaplotypeCaller
MuTect1
ASCAT
Manta
• Tumour/Normal pair WGS analysis
based on GATK best practices
76. Sarek
• Tool split into sub-workflows
• Bash wrapper script runs whole
workflow
• Manuscript submitted this week,
preprint available on bioRxiv
• https://www.biorxiv.org/content/early/
2018/05/09/316976
77. NGI-RNAseq
NGI, April – December 2017
10,227samples processed
131user projects
https://github.com/SciLifeLab/NGI-RNAseq
MIT Licence
Read alignment
Gene counts
Quality Control
Reporting
Raw data
79. NGI stockholm
nf-core
• A community effort to collect a curated set of Nextflow
analysis pipelines
• GitHub organisation to collect pipelines in one place
• No institute-specific branding
• Strict set of guideline requirements
• Automated testing for code style and function
https://nf-co.re
81. NGI stockholm
Quality Control
• Every project has some level of quality control checks
• Sequencing quality
• FastQC, FastQ Screen
• Analysis pipelines give application-specific QC
• Qualimap, RSeQC
• Reporting is done using MultiQC
82. MultiQC
• Reporting tool, parses logs from completed analysis
• Creates single HTML report for all samples & steps in a
project
• Interactive plots for data exploration
• Current version now has 61 supported tools
• Works with anything from tens → thousands of samples
• Highly customisable
86. NGI stockholm
If you have a project
• Visit our order portal
• Create projects
• Request meetings
• Send us an email
https://ngisweden.scilifelab.se
support@ngisweden.se
87. NGI stockholm
Find our tools
• View our open-source
software
• All code available on
GitHub
http://opensource.scilifelab.se
88. Acknowledgements
NGI stockholm
Thanks to:
Max Käller
Olga Vinnere Pettersson
NGI Sweden
Phil Ewels
phil.ewels@scilifelab.se
ewels
tallphil
support@ngisweden.se
http://ngisweden.scilifelab.se
http://opensource.scilifelab.se