SlideShare a Scribd company logo
1 of 59
National Cancer Institute
U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES
National Institutes of Health
NCI Informatics
and Genomics
September 2014
Disclaimer
• These views are my own and do not
necessarily reflect those of the NCI
Overview
• National Challenges in Cancer Data
• Disruptive Technologies
• NCI Genomics Data Commons
• NCI Cloud Pilots
• Building a national learning health system
for cancer clinical genomics
National Challenges in Cancer
Informatics
• Lowering barriers to data access,
analysis and modeling for cancer
research
• Integration of data and learning from
basic and clinical research with
cancer care that enable prediction
and improved outcomes
We need:
• Open Science (Open Access, Open Data,
Open Source) and Data Liquidity for the
cancer community
• Semantic interoperability through CDEs
and Case Report Forms mapped to
standards
• Sustainable models for informatics
infrastructure, services, data
Where we are
Disruptive technologies
Getting social
Open access to data
Disruptive Technologies
• Printing
• Steam power
• Transportation
• Electricity
• Antibiotics
• Semiconductors &VLSI design
• http
• High throughput biology
Systems view - end of reductionism?
Precision Oncology
• The era of precision medicine and precision
oncology is predicated on the integration of
research, care, and molecular medicine and
the availability of data for modeling, risk
analysis, and optimal care
How do we re-engineer
translational research policies
that will enable a true learning
healthcare system?
Disruptive Technologies
• Printing
• Steam power
• Transportation
• Electricity
• Antibiotics
• Semiconductors &VLSI design
• http
• High throughput biology
• Ubiquitous computing
Everyone is a data provider
Data immersion
World:
6.6B active mobile contracts
1.9B smart phone contracts
1.1B land lines
World population 7.1B
US:
345M active mobile contracts
287M smart phone contracts
US population 313M
What about social media?
• Social media may be one avenue for
modifying behaviors that result in cancer
• Properly orchestrated, social media can
have dramatic impact on quality of life
for patients and survivors
• It can reach into all segments of our
society, including underserved populations
Public Health
• These three modifiable factors -
infectious disease, smoking, and poor
nutrition and lack of exercise contribute
to at least 50% of our current cancer
burden. And the cost from loss of quality of
life, pain and suffering is incalculable.
Some NCI Big Data activities
• TCGA, TARGET and ICGC
– Cancer Genomics Data Commons
– NCI Cloud Pilots
• Molecular Clinical Trials:
– MPACT, MATCH, Exceptional Responders
Data are accumulating!
From the Second Machine Age
From: The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant
Technologies by Erik Brynjolfsson & Andrew McAfee
Molecular data is Big Data
• Brief trip down memory lane
• Sequencing and the Human Genome
Project
GenBank High Throughput
Genome Sequence (HTGS)
February 12, 2001
HGP outcomes
• $5.6B investment in 2010 dollars
• $800B economic development
• Enabled many basic discoveries, clinical
therapies and diagnostics, and applied
technologies
TCGA history
• About three years post-HGP
• Initiated in 2005
• Collaboration of NHGRI and NCI to
examine GBM, Lung and Ovarian cancer
using genomic techniques in 2006.
• Expanded to 20+ tumor types.
TCGA drivers
• Providing high quality reference sets for
20+ tissue types
• Providing a platform for systems biology
and hypothesis generation
• Providing a test bed for understanding the
real world implications of consent and data
access policies on genomic and clinical
data.
24
Assays and Data Types
25
Focus on TCGA
• TCGA consortium slides
• Thanks to Lou Staudt and Jean Claude
Zenklusen
TCGA –
Lessons from
structural
genomics
Jean Claude Zenklusen,
Ph.D.
Director
TCGA Program Office
National Cancer Institute
The Mutational Burden of Human Cancer
Mike Lawrence and Gaddy Getz
Increasing genomic
complexity
Childhood
cancers
Carcinogens
TCGA Nature 497:67 (2013)
Molecular Subgroups Refine Histological Diagnosis
Of Endometrial Carcinoma
POLE
(ultra-
mutated)
MSI
(hypermutated)
Copy-number low
(endometriod)
Copy-number high
(serous-like)
Histology
Mutations
Per Mb
PolE
MSI / MSH2
Copy #
PTEN
p53
Serous
misdiagnosed
as endometrioid?
Endometrioid
Serous
Histology
TCGA Nature 497:67 (2013)
Molecular Diagnosis of Endometrial Cancer May
Influence Choice of Therapy
POLE
(ultra-
mutated)
MSI
(hypermutated)
Copy-number low
(endometriod)
Copy-number high
(serous-like)
Histology
Mutations
Per Mb
PolE
MSI / MSH2
Copy #
PTEN
p53
Adjuvant
chemotherapy?
Adjuvant
radiotherapy?
Surgery only?
GDC
NCI Cancer Genomics Data Commons
NCI Genomics
Data Commons
Genomic +
clinical data
. . .
GDC
NCI Cancer Genomics Data Commons
NCI Genomics
Data Commons
Genomic +
clinical data
. . .
Cancer
information
donor
GDC
Identify
low-frequency
cancer drivers
Define genomic
determinants of response
to therapy
Compose clinical trial
cohorts sharing
Targeted genetic lesions
Cancer
information
donor
Utility of a Cancer Knowledge Base
Driver for the Cloud Pilots
• An inflection point for TCGA is looming
0
500,000
1,000,000
1,500,000
2,000,000
2,500,000
7/1/09
1/1/10
7/1/10
1/1/11
7/1/11
1/1/12
7/1/12
1/1/13
7/1/13
1/1/14
7/1/14
Gigabytes(GB)
NCI Cloud Pilots
• Funding for up to 3 cloud pilots - 24
month pilots that are meant to inform the
Cancer Genomics Data Commons
– Explore models for cancer genomics APIs
– Explore cloud models for data+analysis
• Announced this week: The Institute for
Systems Biology, The Broad Institute, and
Seven Bridges will be the initial consortium
NCI Cloud Pilots
• A way to move computation to the data
• Sustainable models for providing access
to data
• Reproducible pipelines for QA, variant
calling, knowledge sharing
• Define genomics/phenomics APIs for
discovering new variants contributing to
cancer, enhancing response, modulating
risk
GDC
Relationship of the Cancer Genomics
Data Commons and NCI Cloud Pilots
NCI Cloud
Computational Centers
Periodic
Data Freezes
Search /
retrieve
Analysis
NCI Genomics
Data Commons
Cancer Genomics Cloud Pilots
Institute of Medicine Report
Sept 10, 2013
Delivering High-Quality Cancer Care: Charting
a New Course for System in Crisis
Understanding the outcomes of individual cancer patients as
well as groups of similar patients
1
Capturing data from real-world settings that researchers
can then analyze to generate new knowledge
2
A “Learning” healthcare IT system that learns routinely and
iteratively by analyzing captured data, generating evidence,
and implementing new insights into subsequent care.
3
“Learning IT System”
IOM Report on Cancer Care
Search Prior Knowledge: Enable clinicians to use
previous patients’ experiences to guide future care.
1
Care Team Collaboration: Facilitate a
coordinated cancer care workforce & mechanisms for
easily sharing information with each other.
2
Cancer Research: Improve the evidence base for quality
cancer care by utilizing all of the data captured during real-
world clinical encounters and integrating it with data captured
from other sources.
3
What’s next?
Searching1
Mining2
Prediction3
Can searching
prior knowledge
help future
patients?
Netflix’s Cinematch software analyzes each customer’s film-viewing habits and
recommends other movies.
Can we make a Cinematch
for cancer patients?
Patients like me
• Patients with diagnoses,
symptoms and labs like yours are
eligible for these trials…
• Patient-centered resources…
If we can forecast
the weather, can
we forecast
cancer?
Where is the weather moving?
Doppler & Map Fusion
Animating the Weather
Dimension of time assists in decision making.
What about the future?
Present 5 Hours into Future
What changed?
Equations
Satellite data
Computers
1
2
3
Modeling Tumor Growth
Mathematical model: proliferation
of cells with the potential for
invasion and metastasis
Swanson et al., British Journal of Cancer, 2007: 1-7.
Personalized Tumor Model
Imaging used to seed the model
Personalized Tumor Model
Today Future
Radiation Treatment Effects
L-Q model used to
describe cell killing
New term defines
cell killing
Population
Decision
Support
Rapid Learning Systems
Patient-level data are aggregated to achieve population-based change,
and results are applied to care of individual patients.
Predict
outcomes
Precision Oncology
• The era of precision medicine and precision
oncology is predicated on the integration of
research, care, and molecular medicine and
the availability of data for modeling, risk
analysis, and optimal care
How do we re-engineer
translational research policies
that will enable a true learning
healthcare system?
The future
• Elastic computing ‘clouds’
• Social networks
• Big Data analytics
• Precision medicine
• Measuring health
• Practicing protective medicine
Semantic and
synoptic data
Intervening
before health is
compromised
Learning systems that enable learning
from every cancer patient
Thank you
Warren A. Kibbe
warren.kibbe@nih.gov
NCI Cloud Pilots Drive Cancer Genomics Data Commons

More Related Content

What's hot

Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Warren Kibbe
 
Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Warren Kibbe
 
NCI Cancer Genomics, Open Science and PMI: FAIR
NCI Cancer Genomics, Open Science and PMI: FAIR NCI Cancer Genomics, Open Science and PMI: FAIR
NCI Cancer Genomics, Open Science and PMI: FAIR Warren Kibbe
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016Warren Kibbe
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsWarren Kibbe
 
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingWarren Kibbe
 
National Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingNational Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingWarren Kibbe
 
A Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemA Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemWarren Kibbe
 
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud PilotsC-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud PilotsWarren Kibbe
 
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemNCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemWarren Kibbe
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Joel Saltz
 
Converged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data SharingConverged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data SharingWarren Kibbe
 
SuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision MedicineSuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision MedicineWarren Kibbe
 
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016Warren Kibbe
 
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Jerry Lee
 
CI4CC Moonshot Blue Ribbon Panel Report 20161010
CI4CC Moonshot Blue Ribbon Panel Report 20161010CI4CC Moonshot Blue Ribbon Panel Report 20161010
CI4CC Moonshot Blue Ribbon Panel Report 20161010Warren Kibbe
 
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerExtreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerJoel Saltz
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeJoel Saltz
 

What's hot (20)

Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016
 
Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017
 
NCI Cancer Genomics, Open Science and PMI: FAIR
NCI Cancer Genomics, Open Science and PMI: FAIR NCI Cancer Genomics, Open Science and PMI: FAIR
NCI Cancer Genomics, Open Science and PMI: FAIR
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data Commons
 
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
 
National Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingNational Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data Sharing
 
A Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemA Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge System
 
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud PilotsC-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
 
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemNCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
 
Converged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data SharingConverged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data Sharing
 
Dalton
DaltonDalton
Dalton
 
Dalton presentation
Dalton presentationDalton presentation
Dalton presentation
 
SuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision MedicineSuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision Medicine
 
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
 
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
 
CI4CC Moonshot Blue Ribbon Panel Report 20161010
CI4CC Moonshot Blue Ribbon Panel Report 20161010CI4CC Moonshot Blue Ribbon Panel Report 20161010
CI4CC Moonshot Blue Ribbon Panel Report 20161010
 
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerExtreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
 

Similar to NCI Cloud Pilots Drive Cancer Genomics Data Commons

Clinical Genomics and Medicine
Clinical Genomics and MedicineClinical Genomics and Medicine
Clinical Genomics and MedicineWarren Kibbe
 
EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013Warren Kibbe
 
Data Commons & Data Science Workshop
Data Commons & Data Science WorkshopData Commons & Data Science Workshop
Data Commons & Data Science WorkshopWarren Kibbe
 
Super computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop KeynoteSuper computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop KeynoteWarren Kibbe
 
Enabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceEnabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceOla Spjuth
 
Oncology Discoveries, University of Chicago
Oncology Discoveries, University of ChicagoOncology Discoveries, University of Chicago
Oncology Discoveries, University of Chicagouchicagotech
 
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...D3 Consutling
 
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Conference – iHT2
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineJoel Saltz
 
Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...
Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...
Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...Kate Barlow
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncologyWarren Kibbe
 
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...Warren Kibbe
 

Similar to NCI Cloud Pilots Drive Cancer Genomics Data Commons (17)

Clinical Genomics and Medicine
Clinical Genomics and MedicineClinical Genomics and Medicine
Clinical Genomics and Medicine
 
EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013
 
Data Commons & Data Science Workshop
Data Commons & Data Science WorkshopData Commons & Data Science Workshop
Data Commons & Data Science Workshop
 
Super computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop KeynoteSuper computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop Keynote
 
Sfide della Oncologia Personalizzata
Sfide della Oncologia PersonalizzataSfide della Oncologia Personalizzata
Sfide della Oncologia Personalizzata
 
Enabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceEnabling Translational Medicine with e-Science
Enabling Translational Medicine with e-Science
 
Big data
Big dataBig data
Big data
 
Oncology Discoveries, University of Chicago
Oncology Discoveries, University of ChicagoOncology Discoveries, University of Chicago
Oncology Discoveries, University of Chicago
 
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
 
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...
Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...
Challenges and Opportunities for Digital PCR in the CLIA Laboratory of the Mo...
 
Big data sharing
Big data sharingBig data sharing
Big data sharing
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncology
 
POCkeT: Reducing Cervical Cancer in Peru through GVC analysis
POCkeT: Reducing Cervical Cancer in Peru through GVC analysis POCkeT: Reducing Cervical Cancer in Peru through GVC analysis
POCkeT: Reducing Cervical Cancer in Peru through GVC analysis
 
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...
 
Nov 2014 ouellette_windsor_icgc_final
Nov 2014 ouellette_windsor_icgc_finalNov 2014 ouellette_windsor_icgc_final
Nov 2014 ouellette_windsor_icgc_final
 

NCI Cloud Pilots Drive Cancer Genomics Data Commons

  • 1. National Cancer Institute U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health NCI Informatics and Genomics September 2014
  • 2. Disclaimer • These views are my own and do not necessarily reflect those of the NCI
  • 3. Overview • National Challenges in Cancer Data • Disruptive Technologies • NCI Genomics Data Commons • NCI Cloud Pilots • Building a national learning health system for cancer clinical genomics
  • 4. National Challenges in Cancer Informatics • Lowering barriers to data access, analysis and modeling for cancer research • Integration of data and learning from basic and clinical research with cancer care that enable prediction and improved outcomes
  • 5. We need: • Open Science (Open Access, Open Data, Open Source) and Data Liquidity for the cancer community • Semantic interoperability through CDEs and Case Report Forms mapped to standards • Sustainable models for informatics infrastructure, services, data
  • 6. Where we are Disruptive technologies Getting social Open access to data
  • 7. Disruptive Technologies • Printing • Steam power • Transportation • Electricity • Antibiotics • Semiconductors &VLSI design • http • High throughput biology Systems view - end of reductionism?
  • 8. Precision Oncology • The era of precision medicine and precision oncology is predicated on the integration of research, care, and molecular medicine and the availability of data for modeling, risk analysis, and optimal care How do we re-engineer translational research policies that will enable a true learning healthcare system?
  • 9.
  • 10. Disruptive Technologies • Printing • Steam power • Transportation • Electricity • Antibiotics • Semiconductors &VLSI design • http • High throughput biology • Ubiquitous computing Everyone is a data provider Data immersion World: 6.6B active mobile contracts 1.9B smart phone contracts 1.1B land lines World population 7.1B US: 345M active mobile contracts 287M smart phone contracts US population 313M
  • 11. What about social media? • Social media may be one avenue for modifying behaviors that result in cancer • Properly orchestrated, social media can have dramatic impact on quality of life for patients and survivors • It can reach into all segments of our society, including underserved populations
  • 12. Public Health • These three modifiable factors - infectious disease, smoking, and poor nutrition and lack of exercise contribute to at least 50% of our current cancer burden. And the cost from loss of quality of life, pain and suffering is incalculable.
  • 13. Some NCI Big Data activities • TCGA, TARGET and ICGC – Cancer Genomics Data Commons – NCI Cloud Pilots • Molecular Clinical Trials: – MPACT, MATCH, Exceptional Responders
  • 15. From the Second Machine Age From: The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies by Erik Brynjolfsson & Andrew McAfee
  • 16. Molecular data is Big Data • Brief trip down memory lane • Sequencing and the Human Genome Project
  • 17.
  • 18.
  • 21. HGP outcomes • $5.6B investment in 2010 dollars • $800B economic development • Enabled many basic discoveries, clinical therapies and diagnostics, and applied technologies
  • 22. TCGA history • About three years post-HGP • Initiated in 2005 • Collaboration of NHGRI and NCI to examine GBM, Lung and Ovarian cancer using genomic techniques in 2006. • Expanded to 20+ tumor types.
  • 23. TCGA drivers • Providing high quality reference sets for 20+ tissue types • Providing a platform for systems biology and hypothesis generation • Providing a test bed for understanding the real world implications of consent and data access policies on genomic and clinical data.
  • 24. 24
  • 25. Assays and Data Types 25
  • 26. Focus on TCGA • TCGA consortium slides • Thanks to Lou Staudt and Jean Claude Zenklusen
  • 27. TCGA – Lessons from structural genomics Jean Claude Zenklusen, Ph.D. Director TCGA Program Office National Cancer Institute
  • 28. The Mutational Burden of Human Cancer Mike Lawrence and Gaddy Getz Increasing genomic complexity Childhood cancers Carcinogens
  • 29. TCGA Nature 497:67 (2013) Molecular Subgroups Refine Histological Diagnosis Of Endometrial Carcinoma POLE (ultra- mutated) MSI (hypermutated) Copy-number low (endometriod) Copy-number high (serous-like) Histology Mutations Per Mb PolE MSI / MSH2 Copy # PTEN p53 Serous misdiagnosed as endometrioid? Endometrioid Serous Histology
  • 30. TCGA Nature 497:67 (2013) Molecular Diagnosis of Endometrial Cancer May Influence Choice of Therapy POLE (ultra- mutated) MSI (hypermutated) Copy-number low (endometriod) Copy-number high (serous-like) Histology Mutations Per Mb PolE MSI / MSH2 Copy # PTEN p53 Adjuvant chemotherapy? Adjuvant radiotherapy? Surgery only?
  • 31. GDC NCI Cancer Genomics Data Commons NCI Genomics Data Commons Genomic + clinical data . . .
  • 32. GDC NCI Cancer Genomics Data Commons NCI Genomics Data Commons Genomic + clinical data . . . Cancer information donor
  • 33. GDC Identify low-frequency cancer drivers Define genomic determinants of response to therapy Compose clinical trial cohorts sharing Targeted genetic lesions Cancer information donor Utility of a Cancer Knowledge Base
  • 34. Driver for the Cloud Pilots • An inflection point for TCGA is looming 0 500,000 1,000,000 1,500,000 2,000,000 2,500,000 7/1/09 1/1/10 7/1/10 1/1/11 7/1/11 1/1/12 7/1/12 1/1/13 7/1/13 1/1/14 7/1/14 Gigabytes(GB)
  • 35. NCI Cloud Pilots • Funding for up to 3 cloud pilots - 24 month pilots that are meant to inform the Cancer Genomics Data Commons – Explore models for cancer genomics APIs – Explore cloud models for data+analysis • Announced this week: The Institute for Systems Biology, The Broad Institute, and Seven Bridges will be the initial consortium
  • 36. NCI Cloud Pilots • A way to move computation to the data • Sustainable models for providing access to data • Reproducible pipelines for QA, variant calling, knowledge sharing • Define genomics/phenomics APIs for discovering new variants contributing to cancer, enhancing response, modulating risk
  • 37. GDC Relationship of the Cancer Genomics Data Commons and NCI Cloud Pilots NCI Cloud Computational Centers Periodic Data Freezes Search / retrieve Analysis NCI Genomics Data Commons
  • 39. Institute of Medicine Report Sept 10, 2013 Delivering High-Quality Cancer Care: Charting a New Course for System in Crisis Understanding the outcomes of individual cancer patients as well as groups of similar patients 1 Capturing data from real-world settings that researchers can then analyze to generate new knowledge 2 A “Learning” healthcare IT system that learns routinely and iteratively by analyzing captured data, generating evidence, and implementing new insights into subsequent care. 3
  • 40. “Learning IT System” IOM Report on Cancer Care Search Prior Knowledge: Enable clinicians to use previous patients’ experiences to guide future care. 1 Care Team Collaboration: Facilitate a coordinated cancer care workforce & mechanisms for easily sharing information with each other. 2 Cancer Research: Improve the evidence base for quality cancer care by utilizing all of the data captured during real- world clinical encounters and integrating it with data captured from other sources. 3
  • 43. Netflix’s Cinematch software analyzes each customer’s film-viewing habits and recommends other movies. Can we make a Cinematch for cancer patients?
  • 44. Patients like me • Patients with diagnoses, symptoms and labs like yours are eligible for these trials… • Patient-centered resources…
  • 45. If we can forecast the weather, can we forecast cancer?
  • 46. Where is the weather moving? Doppler & Map Fusion
  • 47. Animating the Weather Dimension of time assists in decision making.
  • 48. What about the future? Present 5 Hours into Future
  • 50.
  • 51. Modeling Tumor Growth Mathematical model: proliferation of cells with the potential for invasion and metastasis Swanson et al., British Journal of Cancer, 2007: 1-7.
  • 52. Personalized Tumor Model Imaging used to seed the model
  • 54. Radiation Treatment Effects L-Q model used to describe cell killing New term defines cell killing
  • 55. Population Decision Support Rapid Learning Systems Patient-level data are aggregated to achieve population-based change, and results are applied to care of individual patients. Predict outcomes
  • 56. Precision Oncology • The era of precision medicine and precision oncology is predicated on the integration of research, care, and molecular medicine and the availability of data for modeling, risk analysis, and optimal care How do we re-engineer translational research policies that will enable a true learning healthcare system?
  • 57. The future • Elastic computing ‘clouds’ • Social networks • Big Data analytics • Precision medicine • Measuring health • Practicing protective medicine Semantic and synoptic data Intervening before health is compromised Learning systems that enable learning from every cancer patient
  • 58. Thank you Warren A. Kibbe warren.kibbe@nih.gov