SlideShare a Scribd company logo
Cancer Genomics:
Integrative and Scalable Solutions
in R/Bioconductor
Martin Morgan (PI), Kasper Hansen, Vincent Carey,
Levi Waldron (co-investigators)
May 24, 2018
@leviwaldron1
www.github.com/waldronlab
NCI ITCR #5U24CA180996
Outline
• Infrastructure produced by 5U24CA180996
– Developing Bioconductor for the comprehension
of complex multi’omic investigations of cancer
• How my lab has used this infrastructure
• How others are using this infrastructure
Credit: Marcel Ramos
33 diseases
50 platforms
19 data types
Integrative Representation and
Analysis of Complex Cancer Data Sets
Aim 1
e.g. TCGA
The need for MultiAssayExperiment
Need a core Bioconductor data structure to:
– harmonize single-assay data structures
– relate multiple assays with one set of clinical data
– handle missing and replicate observations
– accommodate ID-based and range-based data
– support on-disk representations of big data
MultiAssayExperiment design
Credit: Marcel Ramos
5
Ramos M, Schiffer L, Re A, Azhar R, Basunia A, Rodriguez C, Chan T, Chapman P, Davis SR, Gomez-Cabrero D, Culhane
AC, Haibe-Kains B, Hansen KD, Kodali H, Louis MS, Mer AS, Riester M, Morgan M, Carey V, Waldron L: Software for the
Integration of Multiomics Experiments in Bioconductor. Cancer Res. 2017, 77:e39–e42.
The MultiAssayExperiment API
Credit:
Marcel Ramos
6
RaggedExperiment
(segmented copy number, mutations)
Credit:
Marcel Ramos
Martin Morgan
Aim 2: data resources
1. curatedTCGAData
– simplified pan-cancer analysis of TCGA
2. MultiAssayExperimentData
– cBioPortal, …
3. curatedMetagenomicData1
– curated data for human microbiome studies
1Pasolli E, Schiffer L, Manghi P, Renson A, Obenchain V, Truong DT, Beghini F, Malik F, Ramos M, Dowd
JB, Huttenhower C, Morgan M, Segata N, Waldron L: Accessible, curated metagenomic data through
ExperimentHub. Nat. Methods 2017, 14:1023–1024.
Aim 3: scalable infrastructure
1. ExperimentHub
– Distribution of curatedTCGAData, etc
2. DelayedArray
– Out-of-memory data representation using HDF5
3. BiocParallel
– Simplified parallel evaluation
ITCR-funded multi’omics framework
ITCR
5U24CA180996
Subtype ambiguity of high-grade
serous ovarian carcinoma (HGSOC)
• HGSOC is a clinically homogeneous disease
with widely varying outcomes
• Multiple studies have “confirmed” the
existence of 4 molecular subtypes
• Our recent work shows most HGSOC tumors
are ambiguous in their classification
Chen GM, Kannan L, Geistlinger L, Kofia V, Safikhani Z, Gendoo DMA, Parmigiani G,
Birrer M, Haibe-Kains B, Waldron L: Consensus on Molecular Subtypes of Ovarian
Cancer. bioRxiv 2017:162685. (In Press, Clinical Cancer Research)
Source of subtype ambiguity?
HGSOC is usually subclonal. Are subtypes:
1. evolved differently in subclones of the same
tumor (“late”)?
2. ambiguous even in single cells (“intrinsic”)?
Inference from copy number and
expression of bulk TCGA tumors
Absolute copy number :
expression subtype
copy number subclonality :
expression subtype
Null hypothesis:
Subtype-associated DNA alterations occur no earlier or later than other alterations
H0 : ρ( SA , SC ) = 0 ρ ... Spearman correlation
Other MultiAssayExperiment Data
Resources
1. BiocOncoTK
– TCGA PanCanAtlas
– Cancer Cell Line Encyclopedia
– TARGET
2. Conquer
– Gene and Tx abundances from single-cell RNAseq
Software using MultiAssayExperiment
Analysis of cancer data
1. GOpro - Pan-cancer gene ontology analysis
2. omicsPrint - Cross ‘omic genetic fingerprinting
Systems biology
1. Hipathia - computation of signal transduction along signaling pathways
2. InTAD - detection of correlation between expressed genes and selected
epigenomic signals, topologically associated domains (TADs)
3. CageR - Precise mapping of transcription start sites and promoterome
mining
4. ELMER - Infer regulatory element landscapes and transcription factor
networks from cancer methylomes
Statistical methods
1. ClassifyR - a framework for cross-validated classification
2. missRows - handling missing individuals in multi’omics data integration
Thank you
ITCR #5U24CA180996 Collaborators (shortlist)
– Martin Morgan, Vincent Carey, Kasper Hansen
– Valerie Obenchain, Lori Shepherd
Lab
16Slides: https://tinyurl.com/LWITCR2018
@_schifferl
@schifferl
Lucas Schiffer, MPH
@M2RUseR
@LiNk-NY
Marcel Ramos, MPH
@lgeistlinger
Ludwig Geistlinger, PhD
@shbrief
Sehyun Oh, PhD

More Related Content

What's hot

Cancer stem cell theory and evidence from colorecatal
Cancer stem cell theory and evidence from colorecatalCancer stem cell theory and evidence from colorecatal
Cancer stem cell theory and evidence from colorecatal
Kareem Ahmed
 
MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...
MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...
MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...European School of Oncology
 
American gut microbiome-2
American gut microbiome-2American gut microbiome-2
American gut microbiome-2
JaclynW
 
Stem cells as a cause of cancers
Stem cells as a cause of cancersStem cells as a cause of cancers
Stem cells as a cause of cancers
Andleeb Sultana
 
Hla genotyping
Hla genotypingHla genotyping
Hla genotyping
Creative Enzymes
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey Nislow
Knome_Inc
 
NHL
NHLNHL
NHL
tttran
 
Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...
Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...
Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...
CrimsonpublishersCancer
 
Cancer immunotherapy market
Cancer immunotherapy marketCancer immunotherapy market
Cancer immunotherapy market
KuicK Research
 
Kurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSKurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSSwati Jalgaonkar
 
Journal of Cancer Science & Research
Journal of Cancer Science & ResearchJournal of Cancer Science & Research
Journal of Cancer Science & Research
Symbiosis Online Publishing
 
The opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancerThe opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancer
Researchsio
 
PBSS sf 10-28-2016 flyer
PBSS sf 10-28-2016 flyerPBSS sf 10-28-2016 flyer
PBSS sf 10-28-2016 flyer
Vinita Gupta
 
Basics of immunotherapy in colorectal cancer
Basics of immunotherapy in colorectal cancerBasics of immunotherapy in colorectal cancer
Basics of immunotherapy in colorectal cancer
Mohamed Abdulla
 
CCRC Clonality Poster
CCRC Clonality PosterCCRC Clonality Poster
CCRC Clonality PosterNathan How
 
Unique Populations - 2019 CURE OM Symposium
Unique Populations - 2019 CURE OM SymposiumUnique Populations - 2019 CURE OM Symposium
Unique Populations - 2019 CURE OM Symposium
Melanoma Research Foundation
 
Identification Of Pancreatic Cancer Stem Cells
Identification Of Pancreatic Cancer Stem CellsIdentification Of Pancreatic Cancer Stem Cells
Identification Of Pancreatic Cancer Stem Cellsbegelfer
 

What's hot (20)

Cancer stem cell theory and evidence from colorecatal
Cancer stem cell theory and evidence from colorecatalCancer stem cell theory and evidence from colorecatal
Cancer stem cell theory and evidence from colorecatal
 
MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...
MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...
MCO 2011 - Slide 18 - M.R. Alison - Joint medics and nurses spotlight session...
 
American gut microbiome-2
American gut microbiome-2American gut microbiome-2
American gut microbiome-2
 
Stem cells as a cause of cancers
Stem cells as a cause of cancersStem cells as a cause of cancers
Stem cells as a cause of cancers
 
genomics upenn publication 1
genomics upenn publication 1genomics upenn publication 1
genomics upenn publication 1
 
Hla genotyping
Hla genotypingHla genotyping
Hla genotyping
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey Nislow
 
poster_Xingzhi2
poster_Xingzhi2poster_Xingzhi2
poster_Xingzhi2
 
NHL
NHLNHL
NHL
 
Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...
Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...
Are we Still Missing the Target in Trying to Prevent and Treat Human Cancers?...
 
Cancer immunotherapy market
Cancer immunotherapy marketCancer immunotherapy market
Cancer immunotherapy market
 
Kurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSKurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLS
 
Journal of Cancer Science & Research
Journal of Cancer Science & ResearchJournal of Cancer Science & Research
Journal of Cancer Science & Research
 
The opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancerThe opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancer
 
PBSS sf 10-28-2016 flyer
PBSS sf 10-28-2016 flyerPBSS sf 10-28-2016 flyer
PBSS sf 10-28-2016 flyer
 
Basics of immunotherapy in colorectal cancer
Basics of immunotherapy in colorectal cancerBasics of immunotherapy in colorectal cancer
Basics of immunotherapy in colorectal cancer
 
CCRC Clonality Poster
CCRC Clonality PosterCCRC Clonality Poster
CCRC Clonality Poster
 
Unique Populations - 2019 CURE OM Symposium
Unique Populations - 2019 CURE OM SymposiumUnique Populations - 2019 CURE OM Symposium
Unique Populations - 2019 CURE OM Symposium
 
Identification Of Pancreatic Cancer Stem Cells
Identification Of Pancreatic Cancer Stem CellsIdentification Of Pancreatic Cancer Stem Cells
Identification Of Pancreatic Cancer Stem Cells
 
Quest for the Magic Bullet
Quest for the Magic BulletQuest for the Magic Bullet
Quest for the Magic Bullet
 

Similar to 2018 05 24-waldron-itcr

NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016
Warren Kibbe
 
Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017
Warren Kibbe
 
ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707
Warren Kibbe
 
Translating Genomes | Personalizing Medicine
Translating Genomes | Personalizing MedicineTranslating Genomes | Personalizing Medicine
Translating Genomes | Personalizing Medicine
Candy Smellie
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data Commons
Warren Kibbe
 
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and TimeVisualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Nils Gehlenborg
 
Data Sharing: Highlights from NCI Experience and Cancer Moonshot Task Force
Data Sharing: Highlights from NCI Experience and Cancer Moonshot Task ForceData Sharing: Highlights from NCI Experience and Cancer Moonshot Task Force
Data Sharing: Highlights from NCI Experience and Cancer Moonshot Task Force
Jerry Lee
 
Converged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data SharingConverged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data Sharing
Warren Kibbe
 
A Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemA Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge System
Warren Kibbe
 
Big data sharing
Big data sharingBig data sharing
Big data sharing
Warren Kibbe
 
TCGC The Clinical Genome Conference 2015
TCGC The Clinical Genome Conference 2015TCGC The Clinical Genome Conference 2015
TCGC The Clinical Genome Conference 2015
Nicole Proulx
 
Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...
Lake Como School of Advanced Studies
 
Amia tb-review-13
Amia tb-review-13Amia tb-review-13
Amia tb-review-13
Russ Altman
 
Question of Quality Conference 2016 - Personalized Cancer Medicine
Question of Quality Conference 2016 - Personalized Cancer MedicineQuestion of Quality Conference 2016 - Personalized Cancer Medicine
Question of Quality Conference 2016 - Personalized Cancer Medicine
HCA Healthcare UK
 
#HCAQofQ Tariq Mughal
#HCAQofQ Tariq Mughal#HCAQofQ Tariq Mughal
#HCAQofQ Tariq Mughal
Rebecca Pullen
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
Joel Saltz
 
Minot ASM NGS 2018-09-26
Minot ASM NGS 2018-09-26Minot ASM NGS 2018-09-26
Minot ASM NGS 2018-09-26
Samuel Minot
 

Similar to 2018 05 24-waldron-itcr (20)

NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016
 
Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017
 
ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707
 
Translating Genomes | Personalizing Medicine
Translating Genomes | Personalizing MedicineTranslating Genomes | Personalizing Medicine
Translating Genomes | Personalizing Medicine
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data Commons
 
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and TimeVisualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
 
Data Sharing: Highlights from NCI Experience and Cancer Moonshot Task Force
Data Sharing: Highlights from NCI Experience and Cancer Moonshot Task ForceData Sharing: Highlights from NCI Experience and Cancer Moonshot Task Force
Data Sharing: Highlights from NCI Experience and Cancer Moonshot Task Force
 
Converged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data SharingConverged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data Sharing
 
A Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemA Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge System
 
Big data sharing
Big data sharingBig data sharing
Big data sharing
 
TCGC The Clinical Genome Conference 2015
TCGC The Clinical Genome Conference 2015TCGC The Clinical Genome Conference 2015
TCGC The Clinical Genome Conference 2015
 
cancer
cancercancer
cancer
 
Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...
 
Amia tb-review-13
Amia tb-review-13Amia tb-review-13
Amia tb-review-13
 
MCR_Article_JW
MCR_Article_JWMCR_Article_JW
MCR_Article_JW
 
Question of Quality Conference 2016 - Personalized Cancer Medicine
Question of Quality Conference 2016 - Personalized Cancer MedicineQuestion of Quality Conference 2016 - Personalized Cancer Medicine
Question of Quality Conference 2016 - Personalized Cancer Medicine
 
#HCAQofQ Tariq Mughal
#HCAQofQ Tariq Mughal#HCAQofQ Tariq Mughal
#HCAQofQ Tariq Mughal
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
 
Minot ASM NGS 2018-09-26
Minot ASM NGS 2018-09-26Minot ASM NGS 2018-09-26
Minot ASM NGS 2018-09-26
 
Plegable pdf
Plegable pdfPlegable pdf
Plegable pdf
 

More from Levi Waldron

Towards Replicable and Genereralizable Genomic Prediction Models
Towards Replicable and Genereralizable Genomic Prediction ModelsTowards Replicable and Genereralizable Genomic Prediction Models
Towards Replicable and Genereralizable Genomic Prediction Models
Levi Waldron
 
The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...
Levi Waldron
 
Multi-omics methods and resources for Bioconductor
Multi-omics methods and resources for BioconductorMulti-omics methods and resources for Bioconductor
Multi-omics methods and resources for Bioconductor
Levi Waldron
 
2017 12-04 euro-biocsig
2017 12-04 euro-biocsig2017 12-04 euro-biocsig
2017 12-04 euro-biocsig
Levi Waldron
 
Why re-use core classes?
Why re-use core classes?Why re-use core classes?
Why re-use core classes?
Levi Waldron
 
Multi-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/BioconductorMulti-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/Bioconductor
Levi Waldron
 

More from Levi Waldron (6)

Towards Replicable and Genereralizable Genomic Prediction Models
Towards Replicable and Genereralizable Genomic Prediction ModelsTowards Replicable and Genereralizable Genomic Prediction Models
Towards Replicable and Genereralizable Genomic Prediction Models
 
The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...
 
Multi-omics methods and resources for Bioconductor
Multi-omics methods and resources for BioconductorMulti-omics methods and resources for Bioconductor
Multi-omics methods and resources for Bioconductor
 
2017 12-04 euro-biocsig
2017 12-04 euro-biocsig2017 12-04 euro-biocsig
2017 12-04 euro-biocsig
 
Why re-use core classes?
Why re-use core classes?Why re-use core classes?
Why re-use core classes?
 
Multi-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/BioconductorMulti-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/Bioconductor
 

Recently uploaded

SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
RASHMI M G
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
sanjana502982
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
alishadewangan1
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 

Recently uploaded (20)

SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 

2018 05 24-waldron-itcr

  • 1. Cancer Genomics: Integrative and Scalable Solutions in R/Bioconductor Martin Morgan (PI), Kasper Hansen, Vincent Carey, Levi Waldron (co-investigators) May 24, 2018 @leviwaldron1 www.github.com/waldronlab NCI ITCR #5U24CA180996
  • 2. Outline • Infrastructure produced by 5U24CA180996 – Developing Bioconductor for the comprehension of complex multi’omic investigations of cancer • How my lab has used this infrastructure • How others are using this infrastructure
  • 3. Credit: Marcel Ramos 33 diseases 50 platforms 19 data types Integrative Representation and Analysis of Complex Cancer Data Sets Aim 1 e.g. TCGA
  • 4. The need for MultiAssayExperiment Need a core Bioconductor data structure to: – harmonize single-assay data structures – relate multiple assays with one set of clinical data – handle missing and replicate observations – accommodate ID-based and range-based data – support on-disk representations of big data
  • 5. MultiAssayExperiment design Credit: Marcel Ramos 5 Ramos M, Schiffer L, Re A, Azhar R, Basunia A, Rodriguez C, Chan T, Chapman P, Davis SR, Gomez-Cabrero D, Culhane AC, Haibe-Kains B, Hansen KD, Kodali H, Louis MS, Mer AS, Riester M, Morgan M, Carey V, Waldron L: Software for the Integration of Multiomics Experiments in Bioconductor. Cancer Res. 2017, 77:e39–e42.
  • 7. RaggedExperiment (segmented copy number, mutations) Credit: Marcel Ramos Martin Morgan
  • 8. Aim 2: data resources 1. curatedTCGAData – simplified pan-cancer analysis of TCGA 2. MultiAssayExperimentData – cBioPortal, … 3. curatedMetagenomicData1 – curated data for human microbiome studies 1Pasolli E, Schiffer L, Manghi P, Renson A, Obenchain V, Truong DT, Beghini F, Malik F, Ramos M, Dowd JB, Huttenhower C, Morgan M, Segata N, Waldron L: Accessible, curated metagenomic data through ExperimentHub. Nat. Methods 2017, 14:1023–1024.
  • 9. Aim 3: scalable infrastructure 1. ExperimentHub – Distribution of curatedTCGAData, etc 2. DelayedArray – Out-of-memory data representation using HDF5 3. BiocParallel – Simplified parallel evaluation
  • 11. Subtype ambiguity of high-grade serous ovarian carcinoma (HGSOC) • HGSOC is a clinically homogeneous disease with widely varying outcomes • Multiple studies have “confirmed” the existence of 4 molecular subtypes • Our recent work shows most HGSOC tumors are ambiguous in their classification Chen GM, Kannan L, Geistlinger L, Kofia V, Safikhani Z, Gendoo DMA, Parmigiani G, Birrer M, Haibe-Kains B, Waldron L: Consensus on Molecular Subtypes of Ovarian Cancer. bioRxiv 2017:162685. (In Press, Clinical Cancer Research)
  • 12. Source of subtype ambiguity? HGSOC is usually subclonal. Are subtypes: 1. evolved differently in subclones of the same tumor (“late”)? 2. ambiguous even in single cells (“intrinsic”)?
  • 13. Inference from copy number and expression of bulk TCGA tumors Absolute copy number : expression subtype copy number subclonality : expression subtype Null hypothesis: Subtype-associated DNA alterations occur no earlier or later than other alterations H0 : ρ( SA , SC ) = 0 ρ ... Spearman correlation
  • 14. Other MultiAssayExperiment Data Resources 1. BiocOncoTK – TCGA PanCanAtlas – Cancer Cell Line Encyclopedia – TARGET 2. Conquer – Gene and Tx abundances from single-cell RNAseq
  • 15. Software using MultiAssayExperiment Analysis of cancer data 1. GOpro - Pan-cancer gene ontology analysis 2. omicsPrint - Cross ‘omic genetic fingerprinting Systems biology 1. Hipathia - computation of signal transduction along signaling pathways 2. InTAD - detection of correlation between expressed genes and selected epigenomic signals, topologically associated domains (TADs) 3. CageR - Precise mapping of transcription start sites and promoterome mining 4. ELMER - Infer regulatory element landscapes and transcription factor networks from cancer methylomes Statistical methods 1. ClassifyR - a framework for cross-validated classification 2. missRows - handling missing individuals in multi’omics data integration
  • 16. Thank you ITCR #5U24CA180996 Collaborators (shortlist) – Martin Morgan, Vincent Carey, Kasper Hansen – Valerie Obenchain, Lori Shepherd Lab 16Slides: https://tinyurl.com/LWITCR2018 @_schifferl @schifferl Lucas Schiffer, MPH @M2RUseR @LiNk-NY Marcel Ramos, MPH @lgeistlinger Ludwig Geistlinger, PhD @shbrief Sehyun Oh, PhD

Editor's Notes

  1. Hi. I’d like to introduce you to MultiAssayExperiment, a framework for the representation and analysis of multi-omics experiments in Bioconductor.
  2. This work was motivated by the need to simplify general statistical analysis and development of bioinformatic tools for a study as complex as the Cancer Genome Atlas, where 33 cancers were assayed on many platforms to generate different types of data, but also to provide a simplified framework for more easily reproducible and less error-prone analysis of simpler experiments involving just a couple of complementary assays and clinical data.
  3. A core data structure was needed to * harmonize existing structures for different types of data, * relate assays with each other and clinical data * handle the reality that such experiments are often incomplete and missing observations on some assays, and also may contain replicates, time series, or matched normal, * accommodate data that are indexed by IDs such as genes and data indexed by genomic ranges, * and support on-disk representations for big data
  4. MultiAssayExperiment addresses these challenges by relating a table of information about subjects, say clinical and pathological data, to a series of genomic data sets of arbitrary shape and even non-tabular data, via a map or a network relating these. This sounds complex and it can be, but from the analyst’s perspective, there is an API that will be familiar to users of R, and that abstracts this complexity from the user. Constructing, accessing, subsetting, data management or manipulation, and combining and reshaping into forms usable by standard tools become straightforward.
  5. To help those wanting to analyze TCGA data, we’ve constructed MultiAssayExperiments for 33 cancer types. Each cancer type is represented by a single object containing all the most commonly used, unrestricted data. These objects are immediately usable, even on most laptops, with the API shown on the previous page.
  6. met Jin Xu from East China Normal University, Shanghai