SlideShare a Scribd company logo
1 of 35
Download to read offline
PICS: Pathway Informed
Classification System for cancer
analysis using gene expression data
David Craft and Michael Young
MGH Brown Bag
April 12, 2016
Scenario: a patient with an advanced stage cancer who
has failed traditional treatments is told something like the
following:
We can offer you this new drug that was FDA approved 3
years ago. For your type of cancer, the response rate is
about 20% durable response after 2 years, 30% some
shrinkage but not durable, and 50% do not respond at all.
Additionally, the following side effects….
How can we improve on our predictive capability?
Probability of
eradication
Probability of toxicity 1
Probability of sensitivity
to X Gy of radiation
…
“We do not have good models for predicting
patient response to treatment”
Genomic characterization of cancers
Thousands of papers
characterizing genomic
nature of cancer.
Very little of this is in
usage on the front lines of
clinical cancer care.
Some clinical mutation/drug examples:
vemurafenib for BRAFv600 mutations
erlotinib for EGFR mutations
crizotinib for ALK mutations
The “Central Dogma” of biology
DNA
RNA
Protein
Microarrays can measure upwards of 20,000 gene expression
levels.
With typical early phase drug testing trials, patient cohort numbers
are usually much smaller than this (30 - 100 patients).
Plenty of room for misleading correlations.
There has
been some
success with
the “you have
mutation x
therefore
take this
drug”, but …
Cartoon of the central dogma A microarray measuring
RNA levels
Although the central dogma is considered vastly
over-simplified, RNA still a useful “signal”
-James Shapiro, U. of Chicago
Data mining approach without any reference to biological systems.
A biological pathway is a well defined biochemical process that
occurs in living cells and organisms.
1000s have been curated over the years.
From GENES to PATHWAYS
An example pathway from KEGG
KEGG = Kyoto
encyclopedia of
genes and
genomes
Genes pathways
genes
patients
(20,000 x 30) matrix
Biology informed
dimensionality reduction
Pathway scores
Gene expression levels for 12 genes of the pyruvate metabolism
pathway from 156 bladder patients in PRECOG. The first 12
columns are non-cancerous patient samples:
PCA
decomposition of
expression levels
of genes in a
particular pathway
pca 1
pca2
In this example score
for a patient could be
(pca1,pca2)
How to score a pathway?
1) PCA on gene expression levels
2) NTC ...
3) GED ...
3D → 2D
Expression level of gene 1
Expressionlevelofgene2
Expressionlevelofgene3
PCA decomposition
How to score a pathway?
1) PCA on gene expression levels
2) NTC: Compute “distance” in gene expression space from a
patient cancer sample to the mean of normal tissues in the
dataset.
normal samples cancer samples
X
Gene expression for gene 1
Gene
expression for
gene 2
Visual demo of
NTC method for a
pathway with two
genes.
NTC = normal tissue
centroid
How to score a pathway?
1) PCA on gene expression levels
2) NTC: ...
3) GED: Gene expression deviation
Two scores per patient for each
pathway:
One which sums up the positive
gene deviations [genes that
have higher expression
compared to normal samples]
and one which sums up the
negative gene deviations
Gene deviation for
gene g patient p
Expression level of
gene g patient p
Mean expression
level of gene g for
normal samples
Standard deviation
of expression level of
gene g for normal
samples
Kolmogorov-Smirnov to test
difference of two distributions.
If a gene passes then:
How to decide if we should use a particular
pathway (Silhouette score)
pca 1
pca2
1) Use PCA to reduce
dimensionality of
pathway gene
expression levels.
2) For the known groups
“normal” and “cancer”
evaluate for each
sample a silhouette
score. [1 is perfect
separation, <1 is worse
separation].
3) Average all the
silhouette scores for an
overall score. Take the
pathway if that score is
big enough.
Proposed system
PICS: Pathway Informed Classification System
Module 1
dimension
reduction
techniques
KEGG pathway
database
Patient gene
expression levels
Pathway
scores Module 2
Clustering
algorithms
Pathway scores for
multiple patients and
pathways
Pathway-
based
classification
Module 3
Prediction
Machine learning
Pathway
scores
Proposed
treatment
Improved
probabilities
of
a list of
possible side
effects and
tumor cure.
Overallsurvival
time
Improved
predictability with
pathway-based
quantitative
learning.
Standard
separation
1 2
3 4
Clustering methods
K-means or K-medoids
Attribute 1
Attribute2
Clustering methods
K-means or K-medoids Hierarchical clustering
Each row is a
type of avocado
(or, a set of gene
expression
values of a tissue
sample, i.e. a list
of attributes)
Attribute 1
Attribute2
Public data sources
Pathways
KEGG
Reactome
Biocarta
wikipathways
MSigDB
Gene expression
precog @ Stanford
[PREdiction of Clinical Outcomes from
Genomic Profiles]
TCGA / cBioPortal
Result 1: Pan-cancer pathway clusteringPathways
Patients (~500)
Dataset
from
precog@
Stanford
PCA
scoringk = kidney
os = osteosarcoma
(Adult germ cell has 6 normal samples indicated by -)
metaboliccellprocessesImmune&endocrine
Acute myeloid
leukemia
Adult germ
cell
OvarianK OS
Gene expression values for B-cell receptor pathway
Sub-classifying individual cancer types
Example 1
Lung (NSCLC)
pathway scoring:
GED
Lung (NSCLC)
pathway scoring:
GED
Example 2: Pancreas (NTC method)
Group 1 Group 2
Results summary for individual
cancer sets
Example 3:
GBM
(GED)
Example 5:
Suboptimally
debulked
ovarian
(NTC)
The KM separation
from the orig report
Example for a set with no normal tissues
samples: melanoma
Melanoma classification (PCA)
Group 1 (285 patients) Group 2 (185 patients)
Primarily
immune
system
pathways
Primarily
immune
system
pathways
Melanoma classification
Possible additions
Which pathways to include
Pathway topology (SPIA, Paradigm, ...)
Germline DNA and tumor. Multiple tumor
samples
Other assays [RNAseq, proteomics,
mutations, copy number variations,
methylation status, gene fusions, …]
The microbiome
PICS for toxicity/side effect prediction
Thousands consolidated at MsigDB:
H (hallmark gene sets, 50 gene sets)
C1 (positional gene sets, 326 gene sets)
by chromosome: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 X Y
C2 (curated gene sets, 4726 gene sets)
CGP (chemical and genetic perturbations, 3396 gene sets)
CP (Canonical pathways, 1330 gene sets)
CP:BIOCARTA (BioCarta gene sets, 217 gene sets)
CP:KEGG (KEGG gene sets, 186 gene sets)
CP:REACTOME (Reactome gene sets, 674 gene sets)
C3 (motif gene sets, 836 gene sets)
MIR (microRNA targets, 221 gene sets)
TFT (transcription factor targets, 615 gene sets)
C4 (computational gene sets, 858 gene sets)
CGN (cancer gene neighborhoods, 427 gene sets)
CM (cancer modules, 431 gene sets)
C5 (GO gene sets, 1454 gene sets)
BP (GO biological process, 825 gene sets)
CC (GO cellular component, 233 gene sets)
MF (GO molecular function, 396 gene sets)
C6 (oncogenic signatures, 189 gene sets)
C7 (immunologic signatures, 4872 gene sets)
Finding the right balance:
Stats/machine
learning
Biology
Matching drugs/interventions to patients
~70 new cancer
drugs FDA approved
in the last 5 years!
All drugs (not just cancer)
Conclusions
Module 3
Prediction
Machine learning
Pathway
scores
Proposed
treatment
Improved
probabilities
of
a list of
possible side
effects and
tumor cure.
3
Pathway scoring can regularize large dimension gene data.
Pathway scoring as a framework for consolidating new “next
generation” genomic/transcriptome assays.
Questions comments ideas emails strongly welcomed!
Thank you!
Thanks Yotam Drier
and Michael Tolstorukov
for helpful discussions

More Related Content

What's hot

Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013Functional Genomics Data Society
 
Roles of circular rn as and their interactions with micro rnas in human disor...
Roles of circular rn as and their interactions with micro rnas in human disor...Roles of circular rn as and their interactions with micro rnas in human disor...
Roles of circular rn as and their interactions with micro rnas in human disor...Clinical Surgery Research Communications
 
Liangqun ms defense.pptx
Liangqun ms defense.pptxLiangqun ms defense.pptx
Liangqun ms defense.pptxLiangqun Lu
 
Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...
Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...
Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...Clinical Surgery Research Communications
 
Characterization of Novel ctDNA Reference Materials Developed using the Genom...
Characterization of Novel ctDNA Reference Materials Developed using the Genom...Characterization of Novel ctDNA Reference Materials Developed using the Genom...
Characterization of Novel ctDNA Reference Materials Developed using the Genom...Thermo Fisher Scientific
 
Invicta eshre-poster-mitochondrial dna
Invicta eshre-poster-mitochondrial dnaInvicta eshre-poster-mitochondrial dna
Invicta eshre-poster-mitochondrial dnaINVICTA GENETICS
 
BiPday 2014 -- Santorsola Mariangela
BiPday 2014 -- Santorsola MariangelaBiPday 2014 -- Santorsola Mariangela
BiPday 2014 -- Santorsola Mariangelaeventi-ITBbari
 
Biotechnology of High Sensitivity PCR for Oncology Biomarkers
 Biotechnology of High Sensitivity PCR for Oncology Biomarkers Biotechnology of High Sensitivity PCR for Oncology Biomarkers
Biotechnology of High Sensitivity PCR for Oncology BiomarkersKirsten Copren
 
A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...
A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...
A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...Thermo Fisher Scientific
 
Kurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSKurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSSwati Jalgaonkar
 
SVM based prioritization of cancer causing mutations in centromere protein f...
SVM based prioritization of cancer causing mutations in centromere protein f...SVM based prioritization of cancer causing mutations in centromere protein f...
SVM based prioritization of cancer causing mutations in centromere protein f...Ambuj Kumar
 
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...Candy Smellie
 
chromosomal abnormalities by Iqra malik
chromosomal abnormalities by Iqra malik chromosomal abnormalities by Iqra malik
chromosomal abnormalities by Iqra malik hafizaiqramalik
 
The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...
The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...
The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...QIAGEN
 

What's hot (20)

Molecular profiling 2013
Molecular profiling 2013Molecular profiling 2013
Molecular profiling 2013
 
The Cancer Genome Atlas Update
The Cancer Genome Atlas UpdateThe Cancer Genome Atlas Update
The Cancer Genome Atlas Update
 
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
 
RapportHicham
RapportHichamRapportHicham
RapportHicham
 
Roles of circular rn as and their interactions with micro rnas in human disor...
Roles of circular rn as and their interactions with micro rnas in human disor...Roles of circular rn as and their interactions with micro rnas in human disor...
Roles of circular rn as and their interactions with micro rnas in human disor...
 
Liangqun ms defense.pptx
Liangqun ms defense.pptxLiangqun ms defense.pptx
Liangqun ms defense.pptx
 
Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...
Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...
Lnc rna neat1 is involved in temozolomide resistance by regulating mgmt in gl...
 
Characterization of Novel ctDNA Reference Materials Developed using the Genom...
Characterization of Novel ctDNA Reference Materials Developed using the Genom...Characterization of Novel ctDNA Reference Materials Developed using the Genom...
Characterization of Novel ctDNA Reference Materials Developed using the Genom...
 
Invicta eshre-poster-mitochondrial dna
Invicta eshre-poster-mitochondrial dnaInvicta eshre-poster-mitochondrial dna
Invicta eshre-poster-mitochondrial dna
 
BiPday 2014 -- Santorsola Mariangela
BiPday 2014 -- Santorsola MariangelaBiPday 2014 -- Santorsola Mariangela
BiPday 2014 -- Santorsola Mariangela
 
Biotechnology of High Sensitivity PCR for Oncology Biomarkers
 Biotechnology of High Sensitivity PCR for Oncology Biomarkers Biotechnology of High Sensitivity PCR for Oncology Biomarkers
Biotechnology of High Sensitivity PCR for Oncology Biomarkers
 
A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...
A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...
A Next-Generation Sequencing Assay to Estimate Tumor Mutation Load at > 5% Al...
 
Kurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSKurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLS
 
SVM based prioritization of cancer causing mutations in centromere protein f...
SVM based prioritization of cancer causing mutations in centromere protein f...SVM based prioritization of cancer causing mutations in centromere protein f...
SVM based prioritization of cancer causing mutations in centromere protein f...
 
Methyl flyer lo
Methyl flyer loMethyl flyer lo
Methyl flyer lo
 
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
 
Project_702
Project_702Project_702
Project_702
 
chromosomal abnormalities by Iqra malik
chromosomal abnormalities by Iqra malik chromosomal abnormalities by Iqra malik
chromosomal abnormalities by Iqra malik
 
The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...
The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...
The Molecular Analysis on Circulating Tumor Cells to Determine Prognostic and...
 
target identefication
target identeficationtarget identefication
target identefication
 

Similar to PICS: Pathway Informed Classification System for cancer analysis using gene expression data

Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...Gul Muneer
 
CDAC 2018 Boeva analysis chromatin
CDAC 2018 Boeva analysis chromatinCDAC 2018 Boeva analysis chromatin
CDAC 2018 Boeva analysis chromatinMarco Antoniotti
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Elia Brodsky
 
NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...
NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...
NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...European School of Oncology
 
Bioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyBioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyJoaquin Dopazo
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease mapsJoaquin Dopazo
 
Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...
Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...
Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...caijjournal
 
Comparitive genomic hybridisation
Comparitive genomic hybridisationComparitive genomic hybridisation
Comparitive genomic hybridisationnamrathrs87
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...QIAGEN
 
Capacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCC
Capacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCCCapacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCC
Capacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCCMauricio Lema
 
MCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumours
MCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumoursMCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumours
MCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumoursEuropean School of Oncology
 
CDAC 2018 Gonzales-Perez interpretation of cancer genomes
CDAC 2018 Gonzales-Perez interpretation of cancer genomesCDAC 2018 Gonzales-Perez interpretation of cancer genomes
CDAC 2018 Gonzales-Perez interpretation of cancer genomesMarco Antoniotti
 
Identification of the Enrichment Pathways of Catalase in Multiple Primary Tumors
Identification of the Enrichment Pathways of Catalase in Multiple Primary TumorsIdentification of the Enrichment Pathways of Catalase in Multiple Primary Tumors
Identification of the Enrichment Pathways of Catalase in Multiple Primary Tumorsdaranisaha
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNAUlises Urzua
 
Kshivets_WCGIC2023.pdf
Kshivets_WCGIC2023.pdfKshivets_WCGIC2023.pdf
Kshivets_WCGIC2023.pdfOleg Kshivets
 
INMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptx
INMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptxINMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptx
INMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptxResidenteOncologiaMd
 
Construction and Validation of Prognostic Signature Model Based on Metastatic...
Construction and Validation of Prognostic Signature Model Based on Metastatic...Construction and Validation of Prognostic Signature Model Based on Metastatic...
Construction and Validation of Prognostic Signature Model Based on Metastatic...daranisaha
 
Microarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarraysMicroarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarraysayeshasattarsandhu
 

Similar to PICS: Pathway Informed Classification System for cancer analysis using gene expression data (20)

Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...
 
CDAC 2018 Boeva analysis chromatin
CDAC 2018 Boeva analysis chromatinCDAC 2018 Boeva analysis chromatin
CDAC 2018 Boeva analysis chromatin
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
 
NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...
NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...
NY Prostate Cancer Conference - S. Stone - Session 1: Cell cycle progression ...
 
Bioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyBioinformatics in dermato-oncology
Bioinformatics in dermato-oncology
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease maps
 
20160218 hisham toma services
20160218 hisham toma services20160218 hisham toma services
20160218 hisham toma services
 
Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...
Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...
Zinc supplementation may reduce the risk of hepatocellular carcinoma using bi...
 
Lepow Day Poster 2
Lepow Day Poster 2Lepow Day Poster 2
Lepow Day Poster 2
 
Comparitive genomic hybridisation
Comparitive genomic hybridisationComparitive genomic hybridisation
Comparitive genomic hybridisation
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
 
Capacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCC
Capacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCCCapacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCC
Capacitación fuerza de ventas BMS Nivo/Ipi 1ra línea RCC
 
MCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumours
MCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumoursMCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumours
MCO 2011 - Slide 30 - K. Öberg - Spotlight session - Neuroendocrine tumours
 
CDAC 2018 Gonzales-Perez interpretation of cancer genomes
CDAC 2018 Gonzales-Perez interpretation of cancer genomesCDAC 2018 Gonzales-Perez interpretation of cancer genomes
CDAC 2018 Gonzales-Perez interpretation of cancer genomes
 
Identification of the Enrichment Pathways of Catalase in Multiple Primary Tumors
Identification of the Enrichment Pathways of Catalase in Multiple Primary TumorsIdentification of the Enrichment Pathways of Catalase in Multiple Primary Tumors
Identification of the Enrichment Pathways of Catalase in Multiple Primary Tumors
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
Kshivets_WCGIC2023.pdf
Kshivets_WCGIC2023.pdfKshivets_WCGIC2023.pdf
Kshivets_WCGIC2023.pdf
 
INMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptx
INMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptxINMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptx
INMUNOTERPAPIA EN CANCER DE VIAS BILIARES AVANZADO.pptx
 
Construction and Validation of Prognostic Signature Model Based on Metastatic...
Construction and Validation of Prognostic Signature Model Based on Metastatic...Construction and Validation of Prognostic Signature Model Based on Metastatic...
Construction and Validation of Prognostic Signature Model Based on Metastatic...
 
Microarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarraysMicroarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarrays
 

Recently uploaded

VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...narwatsonia7
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...Miss joya
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalorenarwatsonia7
 
Call Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiCall Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiNehru place Escorts
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Miss joya
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...Miss joya
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Nehru place Escorts
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...CALL GIRLS
 
CALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune) Girls Service
CALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune)  Girls ServiceCALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune)  Girls Service
CALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune) Girls ServiceMiss joya
 
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls ServiceKesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Servicemakika9823
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...narwatsonia7
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...narwatsonia7
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Serviceparulsinha
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatorenarwatsonia7
 

Recently uploaded (20)

VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
 
Call Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Yelahanka Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiCall Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
CALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune) Girls Service
CALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune)  Girls ServiceCALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune)  Girls Service
CALL ON ➥9907093804 🔝 Call Girls Baramati ( Pune) Girls Service
 
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls ServiceKesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
 

PICS: Pathway Informed Classification System for cancer analysis using gene expression data

  • 1. PICS: Pathway Informed Classification System for cancer analysis using gene expression data David Craft and Michael Young MGH Brown Bag April 12, 2016
  • 2. Scenario: a patient with an advanced stage cancer who has failed traditional treatments is told something like the following: We can offer you this new drug that was FDA approved 3 years ago. For your type of cancer, the response rate is about 20% durable response after 2 years, 30% some shrinkage but not durable, and 50% do not respond at all. Additionally, the following side effects…. How can we improve on our predictive capability?
  • 3. Probability of eradication Probability of toxicity 1 Probability of sensitivity to X Gy of radiation … “We do not have good models for predicting patient response to treatment”
  • 4. Genomic characterization of cancers Thousands of papers characterizing genomic nature of cancer. Very little of this is in usage on the front lines of clinical cancer care. Some clinical mutation/drug examples: vemurafenib for BRAFv600 mutations erlotinib for EGFR mutations crizotinib for ALK mutations
  • 5. The “Central Dogma” of biology DNA RNA Protein
  • 6. Microarrays can measure upwards of 20,000 gene expression levels. With typical early phase drug testing trials, patient cohort numbers are usually much smaller than this (30 - 100 patients). Plenty of room for misleading correlations. There has been some success with the “you have mutation x therefore take this drug”, but … Cartoon of the central dogma A microarray measuring RNA levels
  • 7. Although the central dogma is considered vastly over-simplified, RNA still a useful “signal” -James Shapiro, U. of Chicago
  • 8. Data mining approach without any reference to biological systems. A biological pathway is a well defined biochemical process that occurs in living cells and organisms. 1000s have been curated over the years. From GENES to PATHWAYS
  • 9. An example pathway from KEGG KEGG = Kyoto encyclopedia of genes and genomes
  • 10. Genes pathways genes patients (20,000 x 30) matrix Biology informed dimensionality reduction
  • 11. Pathway scores Gene expression levels for 12 genes of the pyruvate metabolism pathway from 156 bladder patients in PRECOG. The first 12 columns are non-cancerous patient samples: PCA decomposition of expression levels of genes in a particular pathway pca 1 pca2 In this example score for a patient could be (pca1,pca2)
  • 12. How to score a pathway? 1) PCA on gene expression levels 2) NTC ... 3) GED ... 3D → 2D Expression level of gene 1 Expressionlevelofgene2 Expressionlevelofgene3 PCA decomposition
  • 13. How to score a pathway? 1) PCA on gene expression levels 2) NTC: Compute “distance” in gene expression space from a patient cancer sample to the mean of normal tissues in the dataset. normal samples cancer samples X Gene expression for gene 1 Gene expression for gene 2 Visual demo of NTC method for a pathway with two genes. NTC = normal tissue centroid
  • 14. How to score a pathway? 1) PCA on gene expression levels 2) NTC: ... 3) GED: Gene expression deviation Two scores per patient for each pathway: One which sums up the positive gene deviations [genes that have higher expression compared to normal samples] and one which sums up the negative gene deviations Gene deviation for gene g patient p Expression level of gene g patient p Mean expression level of gene g for normal samples Standard deviation of expression level of gene g for normal samples Kolmogorov-Smirnov to test difference of two distributions. If a gene passes then:
  • 15. How to decide if we should use a particular pathway (Silhouette score) pca 1 pca2 1) Use PCA to reduce dimensionality of pathway gene expression levels. 2) For the known groups “normal” and “cancer” evaluate for each sample a silhouette score. [1 is perfect separation, <1 is worse separation]. 3) Average all the silhouette scores for an overall score. Take the pathway if that score is big enough.
  • 16. Proposed system PICS: Pathway Informed Classification System Module 1 dimension reduction techniques KEGG pathway database Patient gene expression levels Pathway scores Module 2 Clustering algorithms Pathway scores for multiple patients and pathways Pathway- based classification Module 3 Prediction Machine learning Pathway scores Proposed treatment Improved probabilities of a list of possible side effects and tumor cure. Overallsurvival time Improved predictability with pathway-based quantitative learning. Standard separation 1 2 3 4
  • 17. Clustering methods K-means or K-medoids Attribute 1 Attribute2
  • 18. Clustering methods K-means or K-medoids Hierarchical clustering Each row is a type of avocado (or, a set of gene expression values of a tissue sample, i.e. a list of attributes) Attribute 1 Attribute2
  • 19. Public data sources Pathways KEGG Reactome Biocarta wikipathways MSigDB Gene expression precog @ Stanford [PREdiction of Clinical Outcomes from Genomic Profiles] TCGA / cBioPortal
  • 20. Result 1: Pan-cancer pathway clusteringPathways Patients (~500) Dataset from precog@ Stanford PCA scoringk = kidney os = osteosarcoma (Adult germ cell has 6 normal samples indicated by -) metaboliccellprocessesImmune&endocrine Acute myeloid leukemia Adult germ cell OvarianK OS
  • 21. Gene expression values for B-cell receptor pathway
  • 25. Example 2: Pancreas (NTC method) Group 1 Group 2
  • 26. Results summary for individual cancer sets
  • 28. Example 5: Suboptimally debulked ovarian (NTC) The KM separation from the orig report
  • 29. Example for a set with no normal tissues samples: melanoma
  • 30. Melanoma classification (PCA) Group 1 (285 patients) Group 2 (185 patients) Primarily immune system pathways Primarily immune system pathways
  • 32. Possible additions Which pathways to include Pathway topology (SPIA, Paradigm, ...) Germline DNA and tumor. Multiple tumor samples Other assays [RNAseq, proteomics, mutations, copy number variations, methylation status, gene fusions, …] The microbiome PICS for toxicity/side effect prediction Thousands consolidated at MsigDB: H (hallmark gene sets, 50 gene sets) C1 (positional gene sets, 326 gene sets) by chromosome: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 X Y C2 (curated gene sets, 4726 gene sets) CGP (chemical and genetic perturbations, 3396 gene sets) CP (Canonical pathways, 1330 gene sets) CP:BIOCARTA (BioCarta gene sets, 217 gene sets) CP:KEGG (KEGG gene sets, 186 gene sets) CP:REACTOME (Reactome gene sets, 674 gene sets) C3 (motif gene sets, 836 gene sets) MIR (microRNA targets, 221 gene sets) TFT (transcription factor targets, 615 gene sets) C4 (computational gene sets, 858 gene sets) CGN (cancer gene neighborhoods, 427 gene sets) CM (cancer modules, 431 gene sets) C5 (GO gene sets, 1454 gene sets) BP (GO biological process, 825 gene sets) CC (GO cellular component, 233 gene sets) MF (GO molecular function, 396 gene sets) C6 (oncogenic signatures, 189 gene sets) C7 (immunologic signatures, 4872 gene sets) Finding the right balance: Stats/machine learning Biology
  • 33. Matching drugs/interventions to patients ~70 new cancer drugs FDA approved in the last 5 years! All drugs (not just cancer)
  • 34. Conclusions Module 3 Prediction Machine learning Pathway scores Proposed treatment Improved probabilities of a list of possible side effects and tumor cure. 3 Pathway scoring can regularize large dimension gene data. Pathway scoring as a framework for consolidating new “next generation” genomic/transcriptome assays. Questions comments ideas emails strongly welcomed!
  • 35. Thank you! Thanks Yotam Drier and Michael Tolstorukov for helpful discussions