SlideShare a Scribd company logo
Single-Cell RNA-seq analysis
Section 1:Overview of Single-Cell RNA-seq
Presented By Alireza Dosutmohammadi
M.Sc. in Bioinformatics, Tarbiat Modares University
Oct 2021
RNA-seq Vs Microarrays
RNA-seq allows profiling the transcripts in a sample in an efficient and cost-
effective way.
RNA-seq allows for an unbiased sampling of all transcripts in a sample, rather
than being limited to a pre-determined set of transcripts
2
with bulk RNA-seq:
we can only estimate the average expression level for each gene across a
population of cells, without regard for the heterogeneity in gene expression
across individual cells of that sample. Therefore, it is insufficient for studying
heterogeneous systems, e.g. early development studies or complex tissues
such as the brain.
Unlike with the bulk approach, with scRNA-seq we can estimate a distribution
of expression levels for each gene across a population of cells.
Bulk RNA-seq Vs Single-Cell RNA-seq
3
Bulk RNA-seq Vs Single-Cell RNA-seq
4
5
Bulk RNA-seq Vs Single-Cell RNA-seq
Single-Cell RNA-seq
6
• Human Cell Atlas (H. sapiens)
• Tabula Muris (M. musculus)
• Fly Cell Atlas (D. melanogaster)
• Cell Atlas of Worm (C. elegans)
• Arabidopsis Root Atlas (A. thaliana)
Single-cell Atlases
7
Sample Preparation Protocols
8
• Tissue dissection and cell dissociating to obtain a suspension of cells.
• Optionally cells may be selected (e.g. based on membrane markers,
fluorescent transgenes or staining dyes).
• Capture single cells into individual reaction containers (e.g. wells or oil
droplets).
• Extracting the RNA from each cell.
• Reverse-transcribing the RNA to more stable cDNA.
• Amplifying the cDNA (either by in vitro transcription or by PCR).
• Preparing the sequencing library with adequate molecular adapters.
• Sequencing, usually with paired-end Illumina protocols.
• Processing the raw data to obtain a count matrix of genes-by-cells
• Carrying several downstream analysis.
Sample Preparation Protocols
9
Sample Preparation Protocols
10
Sample Preparation Protocols
two most important aspects are: cell capture or isolation and transcript
quantification.
11
Sample Preparation Protocols
In tissues where cell dissociation is difficult or in frozen tissue samples, instead
of isolating whole single cells it is possible to instead isolate single nuclei. Apart
from the isolation step, the protocol to prepare single-nuclei sequencing
libraries is similar to that of single-cell protocols. However, nuclear RNA usually
contains a higher proportion of unprocessed RNA, with more of the sequenced
transcripts containing introns.
12
Sample Preparation Protocols
Cell Capturing methods:
• Microtitre-plate-based
• Microfluidic-array-based
• Microfluidic-droplet-based
The strategy determines the throughput of the experiment.
13
Sample Preparation Protocols
Cell Capturing methods:
14
Sample Preparation Protocols
Cell Capturing methods:
Microtitre-plate methods (well-based methods ):
• isolating cells into individual wells of the plate using, for example, pipetting,
microdissection or fluorescent activated cell sorting (FACS).
• Advantage: take pictures of the cells before library preparation.
• Can identify and discard damaged cells or find wells containing doublets
associate information such as cell size and the intensity of any used labels
with the well coordinates.
• The main drawback: they are often low-throughput.
15
Sample Preparation Protocols
Cell Capturing methods:
Microfluidic-array platforms:
• integrated system for capturing cells and for carrying out the reactions
necessary for the library preparations.
• they provide a higher throughput than microtitre-plate-based methods.
• only around 10% of cells are captured in a microfluidic platform
• they are not appropriate if one is dealing with rare cell-types or very small
amounts of input.
• has to be taken with the cell sizes captured by the arrays, as the nanowells
are customised for particular sizes. this may therefore affect the unbiased
sampling of cells in complex tissues.
• the chip is relatively expensive.
16
Sample Preparation Protocols
Cell Capturing methods:
Microfluidic-droplet methods:
• offer the highest throughput.
• They work by encapsulating individual cells inside a nanoliter-sized oil droplet,
together with a bead.
• The bead is loaded with enzymes and other components required to construct
the library.
• Each bead contains a unique barcode which is attached to all of the
sequencing reads originating from that cell.
• all of the droplets can be pooled, sequenced together and the reads can
subsequently be assigned to the cell of origin based on those barcodes.
• Droplet platforms have relatively cheap library preparation costs on the order of
0.05 USD/cell.
17
Sample Preparation Protocols
Transcript Quantification:
• full-length: uniform read coverage across the whole transcript
• tag-based: only capture either the 5’ or 3’ ends
18
Sample Preparation Protocols
Transcript Quantification:
• full-length Protocol:
• identical to what is done in bulk RNA-seq
• Although in theory full-length protocols should provide an even coverage of
transcripts, there can sometimes be biases in the coverage across the gene
body.
• Full-length protocols also allow the detection of splice variants.
19
Sample Preparation Protocols
Transcript Quantification:
• SMART-seq2 is a popular low-throughput method, providing full-length transcript
quantification. It is ideally suited for studying a smaller group of cells in greater
detail.
• 10x Chromium is a popular high-throughput method, using UMIs for transcript
quantification (from either 3’ or 5’ ends). It is ideally suited to study highly
heterogeneous tissues and sample large populations of cells at scale.
20
Sample Preparation Protocols
Transcript Quantification:
• tag-based protocols:
• only one of the ends (3’ or 5’) of the transcript is sequenced.
• 3’ protocols are more commonly used, many protocols now allow sequencing from
either end (e.g. 10x Chromium supports both).
• Advantage of 5’-end sequencing: obtain information about the transcription start site
(TSS), which allows to explore whether there is differential TSS usage across cells.
• Advantage: they can be combined with unique molecular identifiers (UMIs), which
can help improve the accuracy of transcript quantification.
• Unique molecular identifiers (UMIs) are a type of molecular barcoding that provides
error correction and increased accuracy during sequencing. These molecular
barcodes are short sequences used to uniquely tag each molecule in a sample
library.
• Disadvantage: being restricted to one end of the transcript only, it reduces our ability
to unambiguously align reads to a transcript, as well as making it difficult to distinguish
different isoforms.
21
Importance of Single-Cell RNA-seq
22
Comparing different protocols
23
24
Comparing different protocols
25
Comparing different protocols
• Sensitivity: how many genes are detected per cell
• accuracy (e.g. compared to bulk RNA-seq)
• recover all cell types present in a sample.
• low-throughput methods have higher sensitivity compared to high-
throughput methods, such as 10x Chromium.
• low-throughput methods did not capture some of the rarer cell types in their
samples, leading to an incomplete characterisation of the cell population.
26
Comparing different protocols
• if one is interested in characterizing the composition of a heterogeneous
tissue, then a droplet-based method is more appropriate, as it allows a very
large number of cells to be captured in a mostly unbiased manner.
• Full-length transcript quantification will be more appropriate if one is
interested in studying different isoforms, since tagged protocols are much
more limited in this regard. By contrast, UMIs can only be used with tagged
protocols and they can improve gene-level quantification.
• If one is interested in rare cell types (for which known markers are not
available), then more cells need to be sequenced, which will increase the
cost of the experiment.
27
What Protocol Should I Choose?
• take into account when performing scRNA-seq experiments Factors such as:
the cost per cell, how many cells one needs, or how much to sequence
each cell.
• Care has to be taken to avoid biases due to batches being processed at
different times.
Important !
28
How many cells do we need to sample so that we see at least n cells of each
type?
• This depends on the number of cell type present and the diversity, i.e. the
entropy.
• Assume that there are 10 rare cell types, each one present at a fraction of
2% of the total population. If we want to be 95% confident that our sample
contains at least 5 cells from each of those cell types, we need to sample
at least 619 cells in total.
29
Important !
• The main difference between bulk and single cell RNA-seq is that each
sequencing library represents a single cell, instead of a population of cells.
• Another important aspect to take into account are batch effects. These
can be observed even when sequencing the same material using different
technologies, and if not properly normalised, can lead to incorrect
conclusions.
Data challenges
30
31
Data challenges
• if planning an experiment to compare healthy and diseased tissues
from 10 patients each, if only 10 samples can be processed per day, it
is best to do 5 healthy + 5 diseased together each day, rather than
prepare all healthy samples one day and all diseased samples in
another.
• Another consideration is to ensure that there is replication of tissue
samples. For example, when collecting tissue from an organ, it may be
a good idea to take multiple samples from different parts of the organ.
Or consider the time of day when samples/replicates are collected
(due to possible circadian changes in gene expression).
• all the common best practices in experimental design should be taken
into account.
32
Data challenges
33
Data challenges

More Related Content

What's hot

Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1
Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1
Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1
QIAGEN
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
Sukhjinder Singh
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
QIAGEN
 
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
CharupriyaChauhan1
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
VHIR Vall d’Hebron Institut de Recerca
 
Single cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applicationsSingle cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applications
faraharooj
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
University of California, Davis
 
RNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the TranscriptomeRNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the Transcriptome
Sean Davis
 
RNA-Seq
RNA-SeqRNA-Seq
Genome assembly: An Introduction (2016)
Genome assembly: An Introduction (2016)Genome assembly: An Introduction (2016)
Genome assembly: An Introduction (2016)
Sebastian Schmeier
 
New generation sequencing equipments
New generation sequencing equipmentsNew generation sequencing equipments
New generation sequencing equipments
Kalaivani P
 
RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1
BITS
 
Digital Droplet PCR
Digital Droplet PCRDigital Droplet PCR
Digital Droplet PCR
Aashish Patel
 
RNAseq Analysis
RNAseq AnalysisRNAseq Analysis
RNAseq Analysis
Aureliano Bombarely
 
Applications of Single-Cell Sequencing: Innovative and Tailor-Made Services
Applications of Single-Cell Sequencing: Innovative and Tailor-Made ServicesApplications of Single-Cell Sequencing: Innovative and Tailor-Made Services
Applications of Single-Cell Sequencing: Innovative and Tailor-Made Services
InsideScientific
 
Processing Raw scRNA-Seq Sequencing Data
Processing Raw scRNA-Seq Sequencing DataProcessing Raw scRNA-Seq Sequencing Data
Processing Raw scRNA-Seq Sequencing Data
Alireza Doustmohammadi
 
NGS - QC & Dataformat
NGS - QC & Dataformat NGS - QC & Dataformat
NGS - QC & Dataformat
Karan Veer Singh
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processing
mikaelhuss
 
Introduction to Next Generation Sequencing
Introduction to Next Generation SequencingIntroduction to Next Generation Sequencing
Introduction to Next Generation Sequencing
Farid MUSA
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
mikaelhuss
 

What's hot (20)

Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1
Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1
Single-Cell Analysis - Powered by REPLI-g: Single Cell Analysis Series Part 1
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
 
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
Single cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applicationsSingle cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applications
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
 
RNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the TranscriptomeRNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the Transcriptome
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
Genome assembly: An Introduction (2016)
Genome assembly: An Introduction (2016)Genome assembly: An Introduction (2016)
Genome assembly: An Introduction (2016)
 
New generation sequencing equipments
New generation sequencing equipmentsNew generation sequencing equipments
New generation sequencing equipments
 
RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1
 
Digital Droplet PCR
Digital Droplet PCRDigital Droplet PCR
Digital Droplet PCR
 
RNAseq Analysis
RNAseq AnalysisRNAseq Analysis
RNAseq Analysis
 
Applications of Single-Cell Sequencing: Innovative and Tailor-Made Services
Applications of Single-Cell Sequencing: Innovative and Tailor-Made ServicesApplications of Single-Cell Sequencing: Innovative and Tailor-Made Services
Applications of Single-Cell Sequencing: Innovative and Tailor-Made Services
 
Processing Raw scRNA-Seq Sequencing Data
Processing Raw scRNA-Seq Sequencing DataProcessing Raw scRNA-Seq Sequencing Data
Processing Raw scRNA-Seq Sequencing Data
 
NGS - QC & Dataformat
NGS - QC & Dataformat NGS - QC & Dataformat
NGS - QC & Dataformat
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processing
 
Introduction to Next Generation Sequencing
Introduction to Next Generation SequencingIntroduction to Next Generation Sequencing
Introduction to Next Generation Sequencing
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
 

Similar to Overview of Single-Cell RNA-seq

Shotgun (2) metagenomics
Shotgun (2) metagenomicsShotgun (2) metagenomics
Shotgun (2) metagenomics
deepankarshashni
 
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptxMOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
AmosiRichard
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
Puneet Kulyana
 
Sequencing genes and genomes
Sequencing genes and genomesSequencing genes and genomes
Sequencing genes and genomes
sepidehsaroghi
 
Gene Libarries
Gene LibarriesGene Libarries
Gene Libarries
Sabahat Ali
 
Barker immemxi final March 2016
Barker immemxi final March 2016Barker immemxi final March 2016
Barker immemxi final March 2016
IRIDA_community
 
Flow cytometry for cell componenet analysis
Flow cytometry for cell componenet analysisFlow cytometry for cell componenet analysis
Flow cytometry for cell componenet analysis
RAJA GOPAL
 
genetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.pptgenetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.ppt
Umehabiba502674
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
Bia Khan
 
Transcriptomics
TranscriptomicsTranscriptomics
Transcriptomics
Mohammad Hosein Romena
 
Transcriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdfTranscriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdf
shinycthomas
 
Systems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasetsSystems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasets
improvemed
 
Basic molecular genetic studies in atherosclerosis
Basic molecular genetic studies in atherosclerosisBasic molecular genetic studies in atherosclerosis
Basic molecular genetic studies in atherosclerosis
Society for Heart Attack Prevention and Eradication
 
212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis
Society for Heart Attack Prevention and Eradication
 
212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis
SHAPE Society
 
Molecular genetics
Molecular geneticsMolecular genetics
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
S Rasouli
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
priyanka raviraj
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
shinycthomas
 
Genomic library
Genomic libraryGenomic library
Genomic library
Chinnu S Kumar
 

Similar to Overview of Single-Cell RNA-seq (20)

Shotgun (2) metagenomics
Shotgun (2) metagenomicsShotgun (2) metagenomics
Shotgun (2) metagenomics
 
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptxMOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
MOLECULAR AND CYTOGENETIC ANALYSIS -BMLS GENERAL &HBT-1.pptx
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
 
Sequencing genes and genomes
Sequencing genes and genomesSequencing genes and genomes
Sequencing genes and genomes
 
Gene Libarries
Gene LibarriesGene Libarries
Gene Libarries
 
Barker immemxi final March 2016
Barker immemxi final March 2016Barker immemxi final March 2016
Barker immemxi final March 2016
 
Flow cytometry for cell componenet analysis
Flow cytometry for cell componenet analysisFlow cytometry for cell componenet analysis
Flow cytometry for cell componenet analysis
 
genetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.pptgenetic_engineering__unit_3__lecture_1.ppt
genetic_engineering__unit_3__lecture_1.ppt
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
 
Transcriptomics
TranscriptomicsTranscriptomics
Transcriptomics
 
Transcriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdfTranscriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdf
 
Systems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasetsSystems biology for Medicine' is 'Experimental methods and the big datasets
Systems biology for Medicine' is 'Experimental methods and the big datasets
 
Basic molecular genetic studies in atherosclerosis
Basic molecular genetic studies in atherosclerosisBasic molecular genetic studies in atherosclerosis
Basic molecular genetic studies in atherosclerosis
 
212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis
 
212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis212 basic molecular genetic studies in atherosclerosis
212 basic molecular genetic studies in atherosclerosis
 
Molecular genetics
Molecular geneticsMolecular genetics
Molecular genetics
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
Genomic library
Genomic libraryGenomic library
Genomic library
 

More from Alireza Doustmohammadi

Introduction to Applied Machine Learning
Introduction to Applied Machine LearningIntroduction to Applied Machine Learning
Introduction to Applied Machine Learning
Alireza Doustmohammadi
 
OSPREY 3.0: Open-Source Protein Redesign for You
OSPREY 3.0: Open-Source Protein Redesign for YouOSPREY 3.0: Open-Source Protein Redesign for You
OSPREY 3.0: Open-Source Protein Redesign for You
Alireza Doustmohammadi
 
WGCNA: an R package for weighted correlation network analysis
WGCNA: an R package for weighted  correlation network analysisWGCNA: an R package for weighted  correlation network analysis
WGCNA: an R package for weighted correlation network analysis
Alireza Doustmohammadi
 
Introduction to Kaa IoT platform
Introduction to Kaa IoT platformIntroduction to Kaa IoT platform
Introduction to Kaa IoT platform
Alireza Doustmohammadi
 
Speech processing and the induction of spoken language
Speech processing and the induction of spoken languageSpeech processing and the induction of spoken language
Speech processing and the induction of spoken language
Alireza Doustmohammadi
 
Digital data storage technologies
Digital data storage technologiesDigital data storage technologies
Digital data storage technologies
Alireza Doustmohammadi
 
DevOps
DevOpsDevOps
differential expression genes (DEG)
differential expression genes (DEG)differential expression genes (DEG)
differential expression genes (DEG)
Alireza Doustmohammadi
 
Lowest common ancestor (LCA) algorithm
Lowest common ancestor (LCA) algorithmLowest common ancestor (LCA) algorithm
Lowest common ancestor (LCA) algorithm
Alireza Doustmohammadi
 

More from Alireza Doustmohammadi (9)

Introduction to Applied Machine Learning
Introduction to Applied Machine LearningIntroduction to Applied Machine Learning
Introduction to Applied Machine Learning
 
OSPREY 3.0: Open-Source Protein Redesign for You
OSPREY 3.0: Open-Source Protein Redesign for YouOSPREY 3.0: Open-Source Protein Redesign for You
OSPREY 3.0: Open-Source Protein Redesign for You
 
WGCNA: an R package for weighted correlation network analysis
WGCNA: an R package for weighted  correlation network analysisWGCNA: an R package for weighted  correlation network analysis
WGCNA: an R package for weighted correlation network analysis
 
Introduction to Kaa IoT platform
Introduction to Kaa IoT platformIntroduction to Kaa IoT platform
Introduction to Kaa IoT platform
 
Speech processing and the induction of spoken language
Speech processing and the induction of spoken languageSpeech processing and the induction of spoken language
Speech processing and the induction of spoken language
 
Digital data storage technologies
Digital data storage technologiesDigital data storage technologies
Digital data storage technologies
 
DevOps
DevOpsDevOps
DevOps
 
differential expression genes (DEG)
differential expression genes (DEG)differential expression genes (DEG)
differential expression genes (DEG)
 
Lowest common ancestor (LCA) algorithm
Lowest common ancestor (LCA) algorithmLowest common ancestor (LCA) algorithm
Lowest common ancestor (LCA) algorithm
 

Recently uploaded

Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Fernanda Palhano
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
a9qfiubqu
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
ihavuls
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
taqyea
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
VyNguyen709676
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 

Recently uploaded (20)

Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 

Overview of Single-Cell RNA-seq

  • 1. Single-Cell RNA-seq analysis Section 1:Overview of Single-Cell RNA-seq Presented By Alireza Dosutmohammadi M.Sc. in Bioinformatics, Tarbiat Modares University Oct 2021
  • 2. RNA-seq Vs Microarrays RNA-seq allows profiling the transcripts in a sample in an efficient and cost- effective way. RNA-seq allows for an unbiased sampling of all transcripts in a sample, rather than being limited to a pre-determined set of transcripts 2
  • 3. with bulk RNA-seq: we can only estimate the average expression level for each gene across a population of cells, without regard for the heterogeneity in gene expression across individual cells of that sample. Therefore, it is insufficient for studying heterogeneous systems, e.g. early development studies or complex tissues such as the brain. Unlike with the bulk approach, with scRNA-seq we can estimate a distribution of expression levels for each gene across a population of cells. Bulk RNA-seq Vs Single-Cell RNA-seq 3
  • 4. Bulk RNA-seq Vs Single-Cell RNA-seq 4
  • 5. 5 Bulk RNA-seq Vs Single-Cell RNA-seq
  • 7. • Human Cell Atlas (H. sapiens) • Tabula Muris (M. musculus) • Fly Cell Atlas (D. melanogaster) • Cell Atlas of Worm (C. elegans) • Arabidopsis Root Atlas (A. thaliana) Single-cell Atlases 7
  • 9. • Tissue dissection and cell dissociating to obtain a suspension of cells. • Optionally cells may be selected (e.g. based on membrane markers, fluorescent transgenes or staining dyes). • Capture single cells into individual reaction containers (e.g. wells or oil droplets). • Extracting the RNA from each cell. • Reverse-transcribing the RNA to more stable cDNA. • Amplifying the cDNA (either by in vitro transcription or by PCR). • Preparing the sequencing library with adequate molecular adapters. • Sequencing, usually with paired-end Illumina protocols. • Processing the raw data to obtain a count matrix of genes-by-cells • Carrying several downstream analysis. Sample Preparation Protocols 9
  • 11. Sample Preparation Protocols two most important aspects are: cell capture or isolation and transcript quantification. 11
  • 12. Sample Preparation Protocols In tissues where cell dissociation is difficult or in frozen tissue samples, instead of isolating whole single cells it is possible to instead isolate single nuclei. Apart from the isolation step, the protocol to prepare single-nuclei sequencing libraries is similar to that of single-cell protocols. However, nuclear RNA usually contains a higher proportion of unprocessed RNA, with more of the sequenced transcripts containing introns. 12
  • 13. Sample Preparation Protocols Cell Capturing methods: • Microtitre-plate-based • Microfluidic-array-based • Microfluidic-droplet-based The strategy determines the throughput of the experiment. 13
  • 14. Sample Preparation Protocols Cell Capturing methods: 14
  • 15. Sample Preparation Protocols Cell Capturing methods: Microtitre-plate methods (well-based methods ): • isolating cells into individual wells of the plate using, for example, pipetting, microdissection or fluorescent activated cell sorting (FACS). • Advantage: take pictures of the cells before library preparation. • Can identify and discard damaged cells or find wells containing doublets associate information such as cell size and the intensity of any used labels with the well coordinates. • The main drawback: they are often low-throughput. 15
  • 16. Sample Preparation Protocols Cell Capturing methods: Microfluidic-array platforms: • integrated system for capturing cells and for carrying out the reactions necessary for the library preparations. • they provide a higher throughput than microtitre-plate-based methods. • only around 10% of cells are captured in a microfluidic platform • they are not appropriate if one is dealing with rare cell-types or very small amounts of input. • has to be taken with the cell sizes captured by the arrays, as the nanowells are customised for particular sizes. this may therefore affect the unbiased sampling of cells in complex tissues. • the chip is relatively expensive. 16
  • 17. Sample Preparation Protocols Cell Capturing methods: Microfluidic-droplet methods: • offer the highest throughput. • They work by encapsulating individual cells inside a nanoliter-sized oil droplet, together with a bead. • The bead is loaded with enzymes and other components required to construct the library. • Each bead contains a unique barcode which is attached to all of the sequencing reads originating from that cell. • all of the droplets can be pooled, sequenced together and the reads can subsequently be assigned to the cell of origin based on those barcodes. • Droplet platforms have relatively cheap library preparation costs on the order of 0.05 USD/cell. 17
  • 18. Sample Preparation Protocols Transcript Quantification: • full-length: uniform read coverage across the whole transcript • tag-based: only capture either the 5’ or 3’ ends 18
  • 19. Sample Preparation Protocols Transcript Quantification: • full-length Protocol: • identical to what is done in bulk RNA-seq • Although in theory full-length protocols should provide an even coverage of transcripts, there can sometimes be biases in the coverage across the gene body. • Full-length protocols also allow the detection of splice variants. 19
  • 20. Sample Preparation Protocols Transcript Quantification: • SMART-seq2 is a popular low-throughput method, providing full-length transcript quantification. It is ideally suited for studying a smaller group of cells in greater detail. • 10x Chromium is a popular high-throughput method, using UMIs for transcript quantification (from either 3’ or 5’ ends). It is ideally suited to study highly heterogeneous tissues and sample large populations of cells at scale. 20
  • 21. Sample Preparation Protocols Transcript Quantification: • tag-based protocols: • only one of the ends (3’ or 5’) of the transcript is sequenced. • 3’ protocols are more commonly used, many protocols now allow sequencing from either end (e.g. 10x Chromium supports both). • Advantage of 5’-end sequencing: obtain information about the transcription start site (TSS), which allows to explore whether there is differential TSS usage across cells. • Advantage: they can be combined with unique molecular identifiers (UMIs), which can help improve the accuracy of transcript quantification. • Unique molecular identifiers (UMIs) are a type of molecular barcoding that provides error correction and increased accuracy during sequencing. These molecular barcodes are short sequences used to uniquely tag each molecule in a sample library. • Disadvantage: being restricted to one end of the transcript only, it reduces our ability to unambiguously align reads to a transcript, as well as making it difficult to distinguish different isoforms. 21
  • 26. • Sensitivity: how many genes are detected per cell • accuracy (e.g. compared to bulk RNA-seq) • recover all cell types present in a sample. • low-throughput methods have higher sensitivity compared to high- throughput methods, such as 10x Chromium. • low-throughput methods did not capture some of the rarer cell types in their samples, leading to an incomplete characterisation of the cell population. 26 Comparing different protocols
  • 27. • if one is interested in characterizing the composition of a heterogeneous tissue, then a droplet-based method is more appropriate, as it allows a very large number of cells to be captured in a mostly unbiased manner. • Full-length transcript quantification will be more appropriate if one is interested in studying different isoforms, since tagged protocols are much more limited in this regard. By contrast, UMIs can only be used with tagged protocols and they can improve gene-level quantification. • If one is interested in rare cell types (for which known markers are not available), then more cells need to be sequenced, which will increase the cost of the experiment. 27 What Protocol Should I Choose?
  • 28. • take into account when performing scRNA-seq experiments Factors such as: the cost per cell, how many cells one needs, or how much to sequence each cell. • Care has to be taken to avoid biases due to batches being processed at different times. Important ! 28
  • 29. How many cells do we need to sample so that we see at least n cells of each type? • This depends on the number of cell type present and the diversity, i.e. the entropy. • Assume that there are 10 rare cell types, each one present at a fraction of 2% of the total population. If we want to be 95% confident that our sample contains at least 5 cells from each of those cell types, we need to sample at least 619 cells in total. 29 Important !
  • 30. • The main difference between bulk and single cell RNA-seq is that each sequencing library represents a single cell, instead of a population of cells. • Another important aspect to take into account are batch effects. These can be observed even when sequencing the same material using different technologies, and if not properly normalised, can lead to incorrect conclusions. Data challenges 30
  • 32. • if planning an experiment to compare healthy and diseased tissues from 10 patients each, if only 10 samples can be processed per day, it is best to do 5 healthy + 5 diseased together each day, rather than prepare all healthy samples one day and all diseased samples in another. • Another consideration is to ensure that there is replication of tissue samples. For example, when collecting tissue from an organ, it may be a good idea to take multiple samples from different parts of the organ. Or consider the time of day when samples/replicates are collected (due to possible circadian changes in gene expression). • all the common best practices in experimental design should be taken into account. 32 Data challenges