C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
1. Precision Medicine and Cancer
Big Data in Research
June 18, 2015
Warren Kibbe, PhD
NCI Center for Biomedical Informatics
2. 2
Some Basic Ingredients for Precision Medicine Big Data
Open Science. Supporting Open Access, Open Data, Open Source,
and Data Liquidity for the cancer community
Standardization through CDEs and Case Report Forms
Interoperability by exposing existing knowledge through appropriate
integration of ontologies, vocabularies and taxonomies
Sustainable models for informatics infrastructure, services, data,
metadata, curation
3. 3
Understanding Cancer
Precision medicine will lead to fundamental understanding of the
complex interplay between genetics, epigenetics, nutrition,
environment and clinical presentation
4. 4
New NCI MATCH TRIAL
"Precision Medicine uses genetic
information from a person’s cancer
to determine a patient’s treatment
with a treatment targeted to that
particular genetic abnormality."
5. The Cancer Genome Atlas
A comprehensive effort to accelerate our
understanding of the molecular basis of cancer
6. 6
TCGA: The Cancer Genome Atlas
Launched in 2006 by NCI & NHGRI
Complete characterization of ~35 adult cancers
~20 common cancers at 500 cases each
~15 rare cancers at 50-150 cases each
Copy Number, Gene Expression, Methylation, DNA Sequencing (WGS/WXS), Clinical
data
~11,000 cases
Project ending in 2016
Future projects to use the TCGA infrastructure
Exceptional Responders, ALChEMIST, Clinical Trial Sequencing Program (CTSP), Cancer Driver
Discovery Program (CDDP)
http://cancergenome.nih.gov/
11. NCI Cloud Pilots
The Broad
PI: Gad Getz
Institute for Systems Biology
PI: Ilya Shmulevich
Seven Bridges Genomics
PI: Deniz Kural
12. 12
NCI GDC and the Cloud Pilots
Working together to build common APIs
Working with the Global Alliance for Genomics and Health (GA4GH) to
define the next generation of secure, flexible, meaningful,
interoperable, lightweight interfaces
Competing on the implementation, collaborating on the interface
Aligned with BD2K and serving as a part of the NIH Commons and
working toward shared goals of FAIR (Findable, Accessible,
Interoperable, Reusable)
Exploring and defining sustainable precision medicine information
infrastructure
13. Thank you
Warren Kibbe
warren.kibbe@nih.gov
Thanks to content contributors:
Sherri de Coronado, Gilberto Fragoso, Mark Jensen, Warren Kibbe, Juli
Klemm, Tony Kerlavage, Elizabeth Gillanders and others.
Re: Interoperability, NCI has worked towards that goal for a long time –
Providing a metadata repository and tooling for CDEs and Case Report Forms,
http://dctd.cancer.gov/MajorInitiatives/NCI-sponsored_trials_in_precision_medicine.htm
http://dctd.cancer.gov/MajorInitiatives/NCI-MATCH.pdf
http://www.seminoncol.org/article/S0093-7754%2814%2900122-5/abstract
MATCH is one of several NCI precision medicine initiatives:
The initial set of trials will focus on different questions: (1) Exceptional Responders Initiative—why do a minority of patients with solid tumors or lymphoma respond very well to some drugs even if the majority do not?; (2) NCI MATCH trial—can molecular markers predict response to targeted therapies in patients with advanced cancer resistant to standard treatment?; (3) ALCHEMIST trial—will targeted epidermal growth factor receptor (EGFR) and anaplastic lymphoma kinase (ALK) inhibitors improve survival for adenocarcinoma of the lung in the adjuvant setting?
http://meetinglibrary.asco.org/content/114000071-144
Since the first data sets were published, the number of publications has nearly doubled every year. 60 publications as of the end of February, 2015 already. On track for more than 1000 TCGA publications by the end of 2015
We now understand underlying basic and cancer biology due to the human genome project and the technologies emerging from it. TCGA activities and analyses were built upon the success of the HGP