SlideShare a Scribd company logo
1 of 38
Download to read offline
Single Cell Sequencing
for Drug Discovery:
Applications and
Challenges
Sarah Middleton
Computational Biologist, GSK
Image: Quanta Magazine
Single Cell Sequencing
for Drug Discovery:
Applications and
Challenges
Sarah Middleton
Computational Biologist, GSK
Image: Quanta Magazine
Target validation
• Perturbation ‘omics analysis (e.g. CRISPR)
• In vitro / in vivo expression analysis
Target identification
• Genetic disease association (GWAS)
• Disease vs healthy gene expression
• Pathway/network analysis
Pre-clinical development
• Biomarker prediction
• Animal model data analysis
• Patient stratification
Post-market
research
• Drug repurposing
• Combination therapy
Clinical trials
• Patient stratification
• Medical image analysis
Lead optimization &
Candidate selection
• Mechanism of action
• Pharmacodynamics
Compound screening
• Tractability prediction
• Assay development/analysis
• Rational design
Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature
Computational biology in pharma R&D
Target validation
• Perturbation ‘omics analysis (e.g. CRISPR)
• In vitro / in vivo expression analysis
Target identification
• Genetic disease association (GWAS)
• Disease vs healthy gene expression
• Pathway/network analysis
Pre-clinical development
• Biomarker prediction
• Animal model data analysis
• Patient stratification
Post-market
research
• Drug repurposing
• Combination therapy
Clinical trials
• Patient stratification
• Medical image analysis
Lead optimization &
Candidate selection
• Mechanism of action
• Pharmacodynamics
Compound screening
• Tractability prediction
• Assay development/analysis
• Rational design
Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature
Computational biology in pharma R&D
‘Omics data:
• Genome (DNA-seq)
• Transcriptome (RNA-seq;
aka “gene expression”)
• Proteome (mass spec)
• Metabolome, interactome, ...
Gene expression
analysis
(RNA-seq)
Image adapted from: Grun et al. (2015) Nature
“Single cell”
Gene expression
analysis
(RNA-seq)
Image adapted from: Grun et al. (2015) Nature
Millions
of cells
“Bulk”
“Single cell”
Gene expression
analysis
(RNA-seq)
Image adapted from: Grun et al. (2015) Nature
Millions
of cells
“Bulk”
“Single cell”
Cell 1 Cell 2 Cell 3
Image adapted from: Owens, B. (2012) Nature
RNA
RNA
RNA
Image adapted from: Owens, B. (2012) Nature
Reverse Transcription:
~50% capture efficiency
Several other steps:
80-90% efficiency each
Overall: 10-20%
RNA molecules
(~100-300k/cell)
RNA
RNA
RNA
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
Cancer patients
Responders Non-responders
Immunotherapy
(activates immune response
to fight the cancer)
Collect tumor
single cells
before and after
treatment
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
Cancer patients
Responders Non-responders
Immunotherapy
(activates immune response
to fight the cancer)
Collect tumor
single cells
before and after
treatment
Pre-treatment cells
Identify predictive
markers of response
Post-treatment cells
Identify mechanisms of
immunotherapy
resistance
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
CDK4/6 inhibitors as potential combination therapy to improve response
Jerby-Arnon et al. (2018) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Regular
microglia
Disease-specific microglia
Activated; digest
aggregates, plaques
Negative
feedback
(checkpoints)
Scaling up single cell analysis
Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com
1.3 million neurons
from mouse brain
Scaling up single cell analysis
Goal: create a map of all cells in the human body
Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com
1.3 million neurons
from mouse brain
Several trillion cells; hundreds of cell types
Scaling up single cell analysis
Goal: create a map of all cells in the human body
Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com
1.3 million neurons
from mouse brain
Human microbiome
100 trillion microbes/person
Several trillion cells; hundreds of cell types
Scaling up single cell analysis
Wolf et al. (2018) Genome Biology
HDF5 files for large ‘omics data
n > p
n = observations (cells, 100k-1mil+)
p = features (genes, 20k)
https://towardsdatascience.com/deep-learning-for-single-cell-biology-935d45064438; Lin et al. (2017) Nucleic Acids Research; Shaham et al. (2017) Bioinformatics
Autoencoder (AE) for non-linear
dimensionality reduction
PCA AE
Big data has advantages! Neural networks for cell type identification
Residual neural networks for batch effect correction
The missing data problem
Genes
Cells
Truth
Observed
Undercounting & “dropout”
Color = # molecules
van Dijk et al. (2018) Cell
Reverse Transcription:
~50% capture efficiency
Several other steps:
80-90% efficiency each
Overall: 10-20%
RNA molecules
(~100-300k/cell) Dropout
The missing data problem
1. Model missing information
2. Imputation
van Dijk et al. (2018) Cell; Arisdakessian et al. (2018) bioRxiv
Missing data makes cell subtype identification difficult
Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics
Coarse-grained types separate well...
tsne1
tsne2
Missing data makes cell subtype identification difficult
Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics
Coarse-grained types separate well...
tsne1
tsne2
Missing data makes cell subtype identification difficult
Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics
Coarse-grained types separate well...
...But finer subdivisions can be murky
tsne1
tsne2
tsne1
tsne2
T cells
Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol
Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol
T cell subtypes in Asthma
Example: CD4+ T cells from lung
Russ et al. (2013) Frontiers in Genetics
Lung biopsy Blood
CD4+
T cells
CD8+
T cells
CD45+
T cells
Healthy Asthma
T regulatory cells (“Tregs”)
Immune-suppressive functions
 Difference in Treg abundance between healthy and asthma?
 Difference in Treg gene expression between healthy and asthma?
Example: CD4+ T cells from lung
Cells belonging to each donor
tsne1
tsne2
Result: can’t confidently identify sub-types of CD4 T cells
Two problems:
1. Marker only detected in a few cells (likely due to dropouts)
2. Clustering is driven by donor (likely due to batch effects)
Treg marker expression
tsne1
tsne2
Simple approach to improve cell type identification
1. Aggregated info gives a clearer picture
• Individual gene expression is noisy & prone
to dropout
• Combining signal from multiple genes can be
more reliable
2. Biologically relevant genes are better for
clustering
• Variation less dominated by technical/batch
effects
Two concepts that help:
Summed expr of 21
Treg-related genes
Treg
markers
Simple approach to improve cell type identification
A better feature space for cell type
identification
1. Define a set of axes to measure similarity of
each cell to various cell types or functional
states
2. Place cells into the space according to their
scores for each axis & identify clusters
Axis 1: “Treg-ness” = FOXP3 + CTLA4 + IL10 + ...
Axis 4: “Exhaustion” = PDCD1 + CD244 + CD160 + ...
Axis 3: “G2/M Phase” = CDK1 + CCNA2 + CCB1 + ...
Axis 2: “Th1-ness” = TBX21 + CXCR3 + CCR5 + ...
...
Th1-ness
Treg-ness
Exhaustion
“Marker space”
Improved cell type identification
Clusters not driven by patient!
Before
After
tSNE of marker space
tsne1
tsne2
Improved cell type identification
Th1-nessTreg-nessTh17 / Th22-ness
Terminal differentiationAnti-inflammatoryType II Interferon Response
Now we can compare treatment groups to identify:
• Changes in cell type composition
• Changes gene expression within a cell type
Before
After
tSNE of marker space
tsne1
tsne2
Future directions
Cell type,
functional state
Image: Broad Inst

More Related Content

What's hot

Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...tuxette
 
An Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsAn Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsBrittany Lasseigne, Ph.D.
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersJoel Saltz
 
Introduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqIntroduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqTimothy Tickle
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAmol Kunde
 
RNA-seq for DE analysis: detecting differential expression - part 5
RNA-seq for DE analysis: detecting differential expression - part 5RNA-seq for DE analysis: detecting differential expression - part 5
RNA-seq for DE analysis: detecting differential expression - part 5BITS
 
Single cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applicationsSingle cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applicationsfaraharooj
 
RNA Sequencing from Single Cell
RNA Sequencing from Single CellRNA Sequencing from Single Cell
RNA Sequencing from Single CellQIAGEN
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...Torsten Seemann
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsPragya Pai
 
Machine Learning Interpretability / Explainability
Machine Learning Interpretability / ExplainabilityMachine Learning Interpretability / Explainability
Machine Learning Interpretability / ExplainabilityRaouf KESKES
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsElena Sügis
 
Challenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profilingChallenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profilingSenthil Natesan
 

What's hot (20)

Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...
 
An Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsAn Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and Genomics
 
Cancer genome
Cancer genomeCancer genome
Cancer genome
 
Shap
ShapShap
Shap
 
Exome sequence analysis
Exome sequence analysisExome sequence analysis
Exome sequence analysis
 
Explaining the assembly model
Explaining the assembly modelExplaining the assembly model
Explaining the assembly model
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
 
Introduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqIntroduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seq
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
RNA-seq for DE analysis: detecting differential expression - part 5
RNA-seq for DE analysis: detecting differential expression - part 5RNA-seq for DE analysis: detecting differential expression - part 5
RNA-seq for DE analysis: detecting differential expression - part 5
 
Single cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applicationsSingle cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applications
 
RNA Sequencing from Single Cell
RNA Sequencing from Single CellRNA Sequencing from Single Cell
RNA Sequencing from Single Cell
 
RNA-seq Analysis
RNA-seq AnalysisRNA-seq Analysis
RNA-seq Analysis
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in Bioinformatics
 
Machine Learning Interpretability / Explainability
Machine Learning Interpretability / ExplainabilityMachine Learning Interpretability / Explainability
Machine Learning Interpretability / Explainability
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
The Cancer Genome Atlas Update
The Cancer Genome Atlas UpdateThe Cancer Genome Atlas Update
The Cancer Genome Atlas Update
 
Challenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profilingChallenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profiling
 

Similar to Single-Cell Sequencing for Drug Discovery: Applications and Challenges

Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management inscit2006
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesJosef Scheiber
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchFranciscoJAzuajeG
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
PSB 2018 presentation
PSB 2018 presentationPSB 2018 presentation
PSB 2018 presentationQike (Max) Li
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineJoel Saltz
 
Zen and the Art of Data Science Maintenance
Zen and the Art of Data Science MaintenanceZen and the Art of Data Science Maintenance
Zen and the Art of Data Science MaintenanceElsevier
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationNils Gehlenborg
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Sean Ekins
 
Prof. Mark Coles (Oxford University) - Data-driven systems medicine
Prof. Mark Coles (Oxford University) - Data-driven systems medicineProf. Mark Coles (Oxford University) - Data-driven systems medicine
Prof. Mark Coles (Oxford University) - Data-driven systems medicinemntbs1
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judesSean Ekins
 
Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례mothersafe
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Remedy Informatics
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...IJDKP
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...IJDKP
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16Sean Ekins
 

Similar to Single-Cell Sequencing for Drug Discovery: Applications and Challenges (20)

Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use Cases
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical research
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
PSB 2018 presentation
PSB 2018 presentationPSB 2018 presentation
PSB 2018 presentation
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Zen and the Art of Data Science Maintenance
Zen and the Art of Data Science MaintenanceZen and the Art of Data Science Maintenance
Zen and the Art of Data Science Maintenance
 
Mason abrf single_cell_2017
Mason abrf single_cell_2017Mason abrf single_cell_2017
Mason abrf single_cell_2017
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient Stratification
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
 
Prof. Mark Coles (Oxford University) - Data-driven systems medicine
Prof. Mark Coles (Oxford University) - Data-driven systems medicineProf. Mark Coles (Oxford University) - Data-driven systems medicine
Prof. Mark Coles (Oxford University) - Data-driven systems medicine
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judes
 
Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
NTU-2019
NTU-2019NTU-2019
NTU-2019
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
 
Translating next generation sequencing to practice
Translating next generation sequencing to practiceTranslating next generation sequencing to practice
Translating next generation sequencing to practice
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16
 

More from inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Single-Cell Sequencing for Drug Discovery: Applications and Challenges

  • 1. Single Cell Sequencing for Drug Discovery: Applications and Challenges Sarah Middleton Computational Biologist, GSK Image: Quanta Magazine
  • 2.
  • 3. Single Cell Sequencing for Drug Discovery: Applications and Challenges Sarah Middleton Computational Biologist, GSK Image: Quanta Magazine
  • 4. Target validation • Perturbation ‘omics analysis (e.g. CRISPR) • In vitro / in vivo expression analysis Target identification • Genetic disease association (GWAS) • Disease vs healthy gene expression • Pathway/network analysis Pre-clinical development • Biomarker prediction • Animal model data analysis • Patient stratification Post-market research • Drug repurposing • Combination therapy Clinical trials • Patient stratification • Medical image analysis Lead optimization & Candidate selection • Mechanism of action • Pharmacodynamics Compound screening • Tractability prediction • Assay development/analysis • Rational design Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature Computational biology in pharma R&D
  • 5. Target validation • Perturbation ‘omics analysis (e.g. CRISPR) • In vitro / in vivo expression analysis Target identification • Genetic disease association (GWAS) • Disease vs healthy gene expression • Pathway/network analysis Pre-clinical development • Biomarker prediction • Animal model data analysis • Patient stratification Post-market research • Drug repurposing • Combination therapy Clinical trials • Patient stratification • Medical image analysis Lead optimization & Candidate selection • Mechanism of action • Pharmacodynamics Compound screening • Tractability prediction • Assay development/analysis • Rational design Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature Computational biology in pharma R&D ‘Omics data: • Genome (DNA-seq) • Transcriptome (RNA-seq; aka “gene expression”) • Proteome (mass spec) • Metabolome, interactome, ...
  • 6. Gene expression analysis (RNA-seq) Image adapted from: Grun et al. (2015) Nature “Single cell”
  • 7. Gene expression analysis (RNA-seq) Image adapted from: Grun et al. (2015) Nature Millions of cells “Bulk” “Single cell”
  • 8. Gene expression analysis (RNA-seq) Image adapted from: Grun et al. (2015) Nature Millions of cells “Bulk” “Single cell” Cell 1 Cell 2 Cell 3
  • 9. Image adapted from: Owens, B. (2012) Nature RNA RNA RNA
  • 10. Image adapted from: Owens, B. (2012) Nature Reverse Transcription: ~50% capture efficiency Several other steps: 80-90% efficiency each Overall: 10-20% RNA molecules (~100-300k/cell) RNA RNA RNA
  • 11. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology
  • 12. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology Cancer patients Responders Non-responders Immunotherapy (activates immune response to fight the cancer) Collect tumor single cells before and after treatment
  • 13. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology Cancer patients Responders Non-responders Immunotherapy (activates immune response to fight the cancer) Collect tumor single cells before and after treatment Pre-treatment cells Identify predictive markers of response Post-treatment cells Identify mechanisms of immunotherapy resistance
  • 14. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology CDK4/6 inhibitors as potential combination therapy to improve response Jerby-Arnon et al. (2018) Cell
  • 15. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell
  • 16. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell
  • 17. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell
  • 18. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell Regular microglia Disease-specific microglia Activated; digest aggregates, plaques Negative feedback (checkpoints)
  • 19. Scaling up single cell analysis Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com 1.3 million neurons from mouse brain
  • 20. Scaling up single cell analysis Goal: create a map of all cells in the human body Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com 1.3 million neurons from mouse brain Several trillion cells; hundreds of cell types
  • 21. Scaling up single cell analysis Goal: create a map of all cells in the human body Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com 1.3 million neurons from mouse brain Human microbiome 100 trillion microbes/person Several trillion cells; hundreds of cell types
  • 22. Scaling up single cell analysis Wolf et al. (2018) Genome Biology HDF5 files for large ‘omics data
  • 23. n > p n = observations (cells, 100k-1mil+) p = features (genes, 20k)
  • 24. https://towardsdatascience.com/deep-learning-for-single-cell-biology-935d45064438; Lin et al. (2017) Nucleic Acids Research; Shaham et al. (2017) Bioinformatics Autoencoder (AE) for non-linear dimensionality reduction PCA AE Big data has advantages! Neural networks for cell type identification Residual neural networks for batch effect correction
  • 25. The missing data problem Genes Cells Truth Observed Undercounting & “dropout” Color = # molecules van Dijk et al. (2018) Cell Reverse Transcription: ~50% capture efficiency Several other steps: 80-90% efficiency each Overall: 10-20% RNA molecules (~100-300k/cell) Dropout
  • 26. The missing data problem 1. Model missing information 2. Imputation van Dijk et al. (2018) Cell; Arisdakessian et al. (2018) bioRxiv
  • 27. Missing data makes cell subtype identification difficult Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics Coarse-grained types separate well... tsne1 tsne2
  • 28. Missing data makes cell subtype identification difficult Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics Coarse-grained types separate well... tsne1 tsne2
  • 29. Missing data makes cell subtype identification difficult Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics Coarse-grained types separate well... ...But finer subdivisions can be murky tsne1 tsne2 tsne1 tsne2 T cells
  • 30. Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol
  • 31. Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol T cell subtypes in Asthma
  • 32. Example: CD4+ T cells from lung Russ et al. (2013) Frontiers in Genetics Lung biopsy Blood CD4+ T cells CD8+ T cells CD45+ T cells Healthy Asthma T regulatory cells (“Tregs”) Immune-suppressive functions  Difference in Treg abundance between healthy and asthma?  Difference in Treg gene expression between healthy and asthma?
  • 33. Example: CD4+ T cells from lung Cells belonging to each donor tsne1 tsne2 Result: can’t confidently identify sub-types of CD4 T cells Two problems: 1. Marker only detected in a few cells (likely due to dropouts) 2. Clustering is driven by donor (likely due to batch effects) Treg marker expression tsne1 tsne2
  • 34. Simple approach to improve cell type identification 1. Aggregated info gives a clearer picture • Individual gene expression is noisy & prone to dropout • Combining signal from multiple genes can be more reliable 2. Biologically relevant genes are better for clustering • Variation less dominated by technical/batch effects Two concepts that help: Summed expr of 21 Treg-related genes Treg markers
  • 35. Simple approach to improve cell type identification A better feature space for cell type identification 1. Define a set of axes to measure similarity of each cell to various cell types or functional states 2. Place cells into the space according to their scores for each axis & identify clusters Axis 1: “Treg-ness” = FOXP3 + CTLA4 + IL10 + ... Axis 4: “Exhaustion” = PDCD1 + CD244 + CD160 + ... Axis 3: “G2/M Phase” = CDK1 + CCNA2 + CCB1 + ... Axis 2: “Th1-ness” = TBX21 + CXCR3 + CCR5 + ... ... Th1-ness Treg-ness Exhaustion “Marker space”
  • 36. Improved cell type identification Clusters not driven by patient! Before After tSNE of marker space tsne1 tsne2
  • 37. Improved cell type identification Th1-nessTreg-nessTh17 / Th22-ness Terminal differentiationAnti-inflammatoryType II Interferon Response Now we can compare treatment groups to identify: • Changes in cell type composition • Changes gene expression within a cell type Before After tSNE of marker space tsne1 tsne2
  • 38. Future directions Cell type, functional state Image: Broad Inst