SlideShare a Scribd company logo
1 of 22
Team C
Variational Autoencoder for Anti-Cancer Drug
Response Prediction
Steven Yuan Varus
Dexin
Team C
● Why?
● How?
● What are the results?
● Conclusions
● Future works
Road map
● Need
○ The huge amount of death caused by cancer each year
○ The diversity of cancer and anti-cancer drugs makes it difficult to
provide customized therapy for cancer patients
○ Accurate prediction of drug response is important
● Possibility
○ A large amount of accessible genomic data of cancer cell lines and
molecular data of anti-cancer drugs
○ Machine learning methods that can help extract useful features
from these data & facilitate the analysis
Motivation
● Problem
○ Too hard to get labeled data
○ How to learn from unlabeled data?
● Variational auto-encoder (VAE)
○ Encoder: learns the distribution of lantent vectors of input data
○ Decoder: reconstructs input data with latent vectors sampled
from this distribution
○ Robust unsupervised learning method
Motivation
Related Models
CDRscan(Neural Network) Paccmann(Attention-based)
● VAE model for gene expression data(geneVAE)
● Junction Tree VAE for anti-cancer drug molecular data(JTVAE)
● Multi-Layer Perceptron (MLP) model to produce final prediction
● Baseline model: Support Vector Regression (SVR)
Methods in our paper
GeneVAE
● Encoder & Decoder: Two-
layer fully connected
neural network (Each)
● Normalized input gene
expression data
● Sampled by guassian
distirbution at latent space
GeneVAE
t-SNE result before VAE t-SNE result after VAE
JTVAE
● Graph VAE
○ Fine-grained connectivity
information
○ node-by-node generation
● Tree VAE
○ Tree structure
○ node by functional group
JTVAE
● Similar drugs share
latent vectors
adjacent to each
other in terms of
Euclidean Distance
Multi Layer Perceptron
● Data
○ Gene expression data: Cancer Cell Line Encyclopedia (CCLE) data set
○ Important genes related with cancer: Cancer Genomic Census (CGC)
data set
○ Organic compound molecular structure data: ZINC data set
○ Anti-cancer drug molecular structure data: PubChem data set
○ Drug response data: Genomics of Drug Sensitivity in Cancer (GDSC)
data set
Experiments
Gene expression data in CCLE dataset
Experiments
Gene expression data filtered by CGC dataset
CGC
dataset
● Evaluation: Pearson Correlation (R2 score) & Root Mean Square Error
(RMSE)
● Test specifically on breast cancer
● Model comparison
○ CGC+SVR
○ RAW+SVR
○ CGC+MLP
○ RAW+VAE+MLP
○ CGC+VAE+MLP
Experiments
● R2 score: 0.678
● RMSE: 1.489
CGC+SVR
● R2 score: 0.700
● RMSE: 1.439
CGC+VAE+SVR
breast cancer (baseline)
● R2 score: 0.822 (averagely)
● RMSE: 1.133 (averagely)
CGC+MLP
● R2 score: 0.805 (averagely)
● RMSE: 1.163 (averagely)
RAW+VAE+MLP
breast cancer
● R2 score: 0.830 (averagely)
● RMSE: 1.130 (averagely)
CGC+VAE+MLP (BRCA)
breast cancer vs. pan-cancer
● R2 score: 0.8455 (averagely)
● RMSE: 1.080 (averagely)
CGC+VAE+MLP (PAN)
Comparison
● High-Pearson-Correlation Predition
○ 0.830 R2 score on anti-breast cancer drug prediction (our model)
○ 0.845 R2 score on pan cancer drug prediction (our model)
○ 0.843 R2 score, high score of similar research(CDRscan: Link)
● Data encoded by VAE retains the features of input data.
● Accurate prediction on drugs with similar latent vectors and functional
groups
Conclusions
● Model Improvement
○ Attention-based model
○ Sequence generation model based on Graph Neural Network(GNN)
and Graph Attention Network (GAT)
○ Hyper-parameters tuning(eg. Batch Norm layers)
● Date Improvement
○ Add TCGA cell line data set(Comparability & Crediblity)
○ Better gene subset selection (Protein-protein network propagation)
● Drug response prediction toolkit
Future work
Steven, Yuan, Varus, Dexin
Thank you all!

More Related Content

Similar to Team c final slides

Lab Presentation, iEVORA CpG Biomarkers Overview
Lab Presentation, iEVORA CpG Biomarkers OverviewLab Presentation, iEVORA CpG Biomarkers Overview
Lab Presentation, iEVORA CpG Biomarkers OverviewSean Maden
 
H2O World - H2O for Genomics with Hussam Al-Deen Ashab
H2O World - H2O for Genomics with Hussam Al-Deen AshabH2O World - H2O for Genomics with Hussam Al-Deen Ashab
H2O World - H2O for Genomics with Hussam Al-Deen AshabSri Ambati
 
Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...
Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...
Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...Yan-Wei Lee
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathologynehaSingh1543
 
[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...
[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...
[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...DataScienceConferenc1
 
Adjuvant chemotherapy of breast cancer
Adjuvant chemotherapy of breast cancerAdjuvant chemotherapy of breast cancer
Adjuvant chemotherapy of breast cancerGita Bhat
 
CSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning ProjectCSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning Projectbutest
 
Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례mothersafe
 
Oncotype dx presentation
Oncotype dx presentationOncotype dx presentation
Oncotype dx presentationahmed mjali
 
Design of an Intelligent System for Improving Classification of Cancer Diseases
Design of an Intelligent System for Improving Classification of Cancer DiseasesDesign of an Intelligent System for Improving Classification of Cancer Diseases
Design of an Intelligent System for Improving Classification of Cancer DiseasesMohamed Loey
 
The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...
The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...
The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...QIAGEN
 
brain tumor presentation.pptxbraintumorpresentationonbraintumor
brain tumor presentation.pptxbraintumorpresentationonbraintumorbrain tumor presentation.pptxbraintumorpresentationonbraintumor
brain tumor presentation.pptxbraintumorpresentationonbraintumorNagavelliMadhavi
 
Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...
Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...
Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...University of Malaya
 
Artificial Intelligence in Radiation Oncology
Artificial Intelligence in Radiation OncologyArtificial Intelligence in Radiation Oncology
Artificial Intelligence in Radiation OncologyWookjin Choi
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...QIAGEN
 
Renal Cell Carcinoma Risk Stratification
Renal Cell Carcinoma Risk StratificationRenal Cell Carcinoma Risk Stratification
Renal Cell Carcinoma Risk StratificationDr.Bhavin Vadodariya
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsIncedo
 
Evolution of molecular prognostic testing in ER positive breast cancer
Evolution of molecular prognostic testing in ER positive breast cancerEvolution of molecular prognostic testing in ER positive breast cancer
Evolution of molecular prognostic testing in ER positive breast cancerBell Symposium & MSP Seminar
 

Similar to Team c final slides (20)

Lab Presentation, iEVORA CpG Biomarkers Overview
Lab Presentation, iEVORA CpG Biomarkers OverviewLab Presentation, iEVORA CpG Biomarkers Overview
Lab Presentation, iEVORA CpG Biomarkers Overview
 
H2O World - H2O for Genomics with Hussam Al-Deen Ashab
H2O World - H2O for Genomics with Hussam Al-Deen AshabH2O World - H2O for Genomics with Hussam Al-Deen Ashab
H2O World - H2O for Genomics with Hussam Al-Deen Ashab
 
Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...
Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...
Computer‐Aided Diagnosis of Breast Cancer Using Ensemble Convolutional Neural...
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
 
[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...
[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...
[DigiHealth 22] Budget friendly sample sizes for genomics research - Ognjen M...
 
Adjuvant chemotherapy of breast cancer
Adjuvant chemotherapy of breast cancerAdjuvant chemotherapy of breast cancer
Adjuvant chemotherapy of breast cancer
 
CSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning ProjectCSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning Project
 
Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
 
Oncotype dx presentation
Oncotype dx presentationOncotype dx presentation
Oncotype dx presentation
 
Breast Cancer Prediction - Arwa Marfatia.pptx
Breast Cancer Prediction - Arwa Marfatia.pptxBreast Cancer Prediction - Arwa Marfatia.pptx
Breast Cancer Prediction - Arwa Marfatia.pptx
 
Design of an Intelligent System for Improving Classification of Cancer Diseases
Design of an Intelligent System for Improving Classification of Cancer DiseasesDesign of an Intelligent System for Improving Classification of Cancer Diseases
Design of an Intelligent System for Improving Classification of Cancer Diseases
 
Gene_Identification_Report
Gene_Identification_ReportGene_Identification_Report
Gene_Identification_Report
 
The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...
The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...
The Presence and Persistence of Resistant and Stem Cell-Like Tumor Cells as a...
 
brain tumor presentation.pptxbraintumorpresentationonbraintumor
brain tumor presentation.pptxbraintumorpresentationonbraintumorbrain tumor presentation.pptxbraintumorpresentationonbraintumor
brain tumor presentation.pptxbraintumorpresentationonbraintumor
 
Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...
Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...
Data Science in Healthcare -The University Malaya Medical Centre Breast Cance...
 
Artificial Intelligence in Radiation Oncology
Artificial Intelligence in Radiation OncologyArtificial Intelligence in Radiation Oncology
Artificial Intelligence in Radiation Oncology
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
 
Renal Cell Carcinoma Risk Stratification
Renal Cell Carcinoma Risk StratificationRenal Cell Carcinoma Risk Stratification
Renal Cell Carcinoma Risk Stratification
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Evolution of molecular prognostic testing in ER positive breast cancer
Evolution of molecular prognostic testing in ER positive breast cancerEvolution of molecular prognostic testing in ER positive breast cancer
Evolution of molecular prognostic testing in ER positive breast cancer
 

Recently uploaded

Brand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdfBrand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdftbatkhuu1
 
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Richard Ingilby
 
April 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting GroupApril 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting GroupVbout.com
 
Social Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa
 
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...Benjamin Szturmaj
 
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieBeyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieKent Kubie
 
marketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdfmarketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdfarsathsahil
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdftbatkhuu1
 
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfTOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfasiyahanif9977
 
pptx.marketing strategy of tanishq. pptx
pptx.marketing strategy of tanishq. pptxpptx.marketing strategy of tanishq. pptx
pptx.marketing strategy of tanishq. pptxarsathsahil
 
The Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship DeckThe Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship DeckToluwanimi Balogun
 
Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?elizabethella096
 
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Onlineanilsa9823
 
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Cost-effective tactics for navigating CPC surges
Cost-effective tactics for navigating CPC surgesCost-effective tactics for navigating CPC surges
Cost-effective tactics for navigating CPC surgesPushON Ltd
 

Recently uploaded (20)

Brand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdfBrand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdf
 
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
 
April 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting GroupApril 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting Group
 
Social Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdf
 
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
 
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieBeyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
 
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
 
marketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdfmarketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdf
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdf
 
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfTOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
 
pptx.marketing strategy of tanishq. pptx
pptx.marketing strategy of tanishq. pptxpptx.marketing strategy of tanishq. pptx
pptx.marketing strategy of tanishq. pptx
 
The Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship DeckThe Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship Deck
 
Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?
 
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
 
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
 
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
 
Cost-effective tactics for navigating CPC surges
Cost-effective tactics for navigating CPC surgesCost-effective tactics for navigating CPC surges
Cost-effective tactics for navigating CPC surges
 
SEO Master Class - Steve Wiideman, Wiideman Consulting Group
SEO Master Class - Steve Wiideman, Wiideman Consulting GroupSEO Master Class - Steve Wiideman, Wiideman Consulting Group
SEO Master Class - Steve Wiideman, Wiideman Consulting Group
 
The Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison KaltmanThe Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison Kaltman
 
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAILBUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
 

Team c final slides

  • 1. Team C Variational Autoencoder for Anti-Cancer Drug Response Prediction
  • 3. ● Why? ● How? ● What are the results? ● Conclusions ● Future works Road map
  • 4. ● Need ○ The huge amount of death caused by cancer each year ○ The diversity of cancer and anti-cancer drugs makes it difficult to provide customized therapy for cancer patients ○ Accurate prediction of drug response is important ● Possibility ○ A large amount of accessible genomic data of cancer cell lines and molecular data of anti-cancer drugs ○ Machine learning methods that can help extract useful features from these data & facilitate the analysis Motivation
  • 5. ● Problem ○ Too hard to get labeled data ○ How to learn from unlabeled data? ● Variational auto-encoder (VAE) ○ Encoder: learns the distribution of lantent vectors of input data ○ Decoder: reconstructs input data with latent vectors sampled from this distribution ○ Robust unsupervised learning method Motivation
  • 6. Related Models CDRscan(Neural Network) Paccmann(Attention-based)
  • 7. ● VAE model for gene expression data(geneVAE) ● Junction Tree VAE for anti-cancer drug molecular data(JTVAE) ● Multi-Layer Perceptron (MLP) model to produce final prediction ● Baseline model: Support Vector Regression (SVR) Methods in our paper
  • 8. GeneVAE ● Encoder & Decoder: Two- layer fully connected neural network (Each) ● Normalized input gene expression data ● Sampled by guassian distirbution at latent space
  • 9. GeneVAE t-SNE result before VAE t-SNE result after VAE
  • 10. JTVAE ● Graph VAE ○ Fine-grained connectivity information ○ node-by-node generation ● Tree VAE ○ Tree structure ○ node by functional group
  • 11. JTVAE ● Similar drugs share latent vectors adjacent to each other in terms of Euclidean Distance
  • 13. ● Data ○ Gene expression data: Cancer Cell Line Encyclopedia (CCLE) data set ○ Important genes related with cancer: Cancer Genomic Census (CGC) data set ○ Organic compound molecular structure data: ZINC data set ○ Anti-cancer drug molecular structure data: PubChem data set ○ Drug response data: Genomics of Drug Sensitivity in Cancer (GDSC) data set Experiments
  • 14. Gene expression data in CCLE dataset Experiments Gene expression data filtered by CGC dataset CGC dataset
  • 15. ● Evaluation: Pearson Correlation (R2 score) & Root Mean Square Error (RMSE) ● Test specifically on breast cancer ● Model comparison ○ CGC+SVR ○ RAW+SVR ○ CGC+MLP ○ RAW+VAE+MLP ○ CGC+VAE+MLP Experiments
  • 16. ● R2 score: 0.678 ● RMSE: 1.489 CGC+SVR ● R2 score: 0.700 ● RMSE: 1.439 CGC+VAE+SVR breast cancer (baseline)
  • 17. ● R2 score: 0.822 (averagely) ● RMSE: 1.133 (averagely) CGC+MLP ● R2 score: 0.805 (averagely) ● RMSE: 1.163 (averagely) RAW+VAE+MLP breast cancer
  • 18. ● R2 score: 0.830 (averagely) ● RMSE: 1.130 (averagely) CGC+VAE+MLP (BRCA) breast cancer vs. pan-cancer ● R2 score: 0.8455 (averagely) ● RMSE: 1.080 (averagely) CGC+VAE+MLP (PAN)
  • 20. ● High-Pearson-Correlation Predition ○ 0.830 R2 score on anti-breast cancer drug prediction (our model) ○ 0.845 R2 score on pan cancer drug prediction (our model) ○ 0.843 R2 score, high score of similar research(CDRscan: Link) ● Data encoded by VAE retains the features of input data. ● Accurate prediction on drugs with similar latent vectors and functional groups Conclusions
  • 21. ● Model Improvement ○ Attention-based model ○ Sequence generation model based on Graph Neural Network(GNN) and Graph Attention Network (GAT) ○ Hyper-parameters tuning(eg. Batch Norm layers) ● Date Improvement ○ Add TCGA cell line data set(Comparability & Crediblity) ○ Better gene subset selection (Protein-protein network propagation) ● Drug response prediction toolkit Future work
  • 22. Steven, Yuan, Varus, Dexin Thank you all!