SlideShare a Scribd company logo
Data Harmonization for a Molecularly
Driven Health System
Warren A. Kibbe, Ph.D.
Professor, Biostats & Bioinformatics
Chief Data Officer, Duke Cancer Institute
warren.kibbe@duke.edu
@wakibbe
#DataSharing
#LearningHealthSystem
#DataHarmonization
Sections
• Learning Health Systems
• Data Commons
• Data Harmonization
The World is Changing
• Pace of Commercialization
• Reach of Markets
• Role of Data
• Change in Healthcare
• Change in Computing
• Societal Changes
Is the US able to keep up?
R&D By Country
US R&D Funding as share of GDP
R&D spending / STEM
How do we continue to innovate?
Data Science
Twitter impacts science
Eric Topol
Changes in Computing
• Converged devices
• Converged IT
• Ubiquity of devices, data, mHealth
2017200220072012
10/23/2001
(~5yrsold)
1/9/2007
(~10yrsold)
iPod(10GBmax)
iPhone(EDGE,16GBmax)
9/16/1999
(~3yrsold)
802.11bWiFi
4/3/2010
(~13yrsold)
iPad(EDGE,64GBmax)
4/23/2005
(~8yrsold)
9/26/2006
(~9yrsold)
7/15/2006
2/7/2007
Google
Drive
4/24/2012
(~15yrsold)7/11/2008
(~11yrsold)
iPhone3G
(16GBmax)
9/12/2012
(~15yrsold)
iPhone5(LTE,128GBmax)
Google
Baseline
3/9/2015
(~18yrsold)
Apple
ResearchKit
HTCVRHeadset
4/5/2016
(~19yrsold)
7/14/2014
(~17yrsold)
NextGen
Courtesy of Jerry Lee, NCI
Changes in Technology
Pace of Technology Adoption
Changes in Commercialization
Changes in Oncology
• Cancer is a grand challenge
• Anatomic vs molecular classification
• Health vs Disease
Understanding Cancer
• Precision medicine will lead to fundamental
understanding of the complex interplay between
genetics, epigenetics, nutrition, environment and
clinical presentation and direct effective,
evidence-based prevention and treatment.
Ramifications across many aspects of health care
IOM
(Now NAM)
Report
2006-11
NAM
Workshops
“Science, informatics, incentives, and
culture are aligned for continuous
improvement and innovation, with best
practices seamlessly embedded in the
delivery process and new knowledge
captured as an integral by-product of the
delivery experience.”
—Institute of Medicine
LEARNING HEALTH SYSTEMS
Another imperative is that such systems
do their work:
• Transparently (how does one learn
without well documented processes?)
• Reproducibly (good practices must
always be repeatable at scale and
scientifically reproducible)
• Only with the above can the science in
“data science” be done with sufficient
rigor
LEARNING HEALTH SYSTEMS
ASSEMBLE
ANALYZE
INTERPRET
FEEDBACK
CHANGE
LEARNING HEALTH SYSTEMS
Learning Health Systems in NEJM
Goals
• Contain rising cost of healthcare
• Maximize the value of care
• Increase public discourse and
marketplace for healthcare
Drivers
• Decision Making is too complex
• Clinical decisions are based on
practice, not evidence
• Inefficiency and waste in healthcare
Human cognitive capacity is constant
Lack of Evidence
EHRs and the Learning Health System
LHS definition
Problems for LHS to solve
Inefficient Healthcare
Poor Health in spite of high expenditures
Curve hasn’t improved
2015
View from 2006
EHRs are now ubiquitous
But evidence-driven decision support
remains a future vision
Hope
Cloud computing, data commons,
service-based computing provide some
powerful tools for solving data access,
data analysis, data analytics, and data
visualization problems at scale,
securely.
Sebastian Thrun
So what is a Data Commons
Commons Topology
Compute Platform: Cloud or HPC
Services: APIs, Containers, Indexing,
Software: Services & Tools
scientific analysis tools/workflows
Data
“Reference” Data Sets
User defined data
DigitalObjectCompliance
App store/User Interface
PaaS
SaaS
IaaS
https://datascience.nih.gov/commons
Commons Compliance
• Treat products of research – data,
methods, papers etc. as digital
objects
• These digital objects exist in a
shared virtual space
• Digital object compliance through
FAIR principles:
– Findable
– Accessible (and usable)
– Interoperable
– Reusable
Data Sharing and the FAIR Principles
FAIR –
Making data
Findable,
Accessible,
Attributable,
Interoperable,
Reusable,
and provide Recognition
Force11 white paper
https://www.force11.org/group/fairgroup/fairprinciples
“The Commons is an effort at
creating a sharing economy and
for building community. We hope
for a more cost effective and
productive research
environment while bringing
people together in a unique
way.“
Phil Bourne
44
Blue Ribbon Panel Report
Cancer Moonshot℠ Blue Ribbon Panel
“The Cancer Moonshot Task Force was
directed to consult with external experts
from relevant scientific sectors, including
the presidentially appointed National
Cancer Advisory Board(NCAB).
A Blue Ribbon Panel of scientific experts
was created to advise the NCAB.”
Vision:
Enable the creation of a Learning Healthcare System for
Cancer, where as a nation we learn from the contributed
knowledge and experience of every cancer patient. As
part of the Cancer Moonshot, we want to unleash the
power of data to enhance, improve, and inform the journey
of every cancer patient from the point of diagnosis
through survivorship.
A National Cancer Data Ecosystem
Cancer Research
Data Commons
SBG CGC
Broad FireCloud ISB CGC
Courtesy NCI-CBIIT
Data Commons Framework – What Is It?
47
Modular Components
Secure user authentication and authorization
Metadata validation and tools
Domain-specific, extensible data models and dictionaries
API and container environment for tools and pipelines
Access to computational workspaces for storing data, tools, and
results
Reusable, expandable
framework for a Data
Commons
Core principles and
structures for a Data
Commons
Set of modular
components that can be
leveraged across Data
Commons
Narrow Middle Architecture (End-to-End Design)
1. AuthN / AuthZ
2. Metadata validation
3. Extensible data model
4. APIs for containers, workflows & tools
5. Workspaces
science outdata in
Courtesy Bob
Grossman, U. Chicago
49
NCI Cancer Research Data Commons (CRDC) - Concept
NCI Scope: “Create a data
science infrastructure necessary
to connect repositories, analytical
tools, and knowledge bases”
Data commons co-locate data,
storage and computing
infrastructure with commonly
used services, tools & apps for
analyzing and sharing data to
create an interoperable resource
for the research community.*
*Robert L. Grossman, Allison Heath, Mark Murphy, Maria Patterson and Walt Wells, A
Case for Data Commons Towards Data Science as a Service, IEEE Computing in Science
and Engineer, 2016. Source of image: The CDIS, GDC, & OCC data commons
infrastructure at the University of Chicago Kenwood Data Center.
50
Data Commons Framework
Clinical Proteomics ImagingGenomics Immuno-
oncology
Animal Models Cancer Biomarkers
NCI Cancer Research
Data Commons
SBG CGC
Broad FireCloud ISB CGC
Elastic
Compute
Query
Visualization
Clinical Proteomics Tumor
Analysis Consortium*
Tool
Deployment
The Cancer Imaging Archive*
TCIA
Web
Interface
APIs Data
Submission
Authentication
& Authorization
Authentication
& Authorization
Data Models &
Dictionaries
Computational
Workspaces
Data Contributors and Consumers
Tool
Repositories
Metadata
Validation
& Tools
Analysis
Courtesy NCI-CBIIT
Gen3 Data Commons
Gen3 Data Commons
Gen3 Data Commons
NCI Genomic Data Commons
NCI Genomic Data Commons
NCI Genomic Data Commons
Data Harmonization
• The process of semantic and
syntactic mapping of data to a set of
definitions, predefined data
elements, data model.
• Validation and Harmonization of
primary and secondary data is crucial
to enable analysis and reuse
Spanning the Semantic Chasm of Despair
Building a Translational Bridge
CD2H
Thanks to Melissa Haendel
Project Highlight: Harmonizing clinical data models
Sentinel
I2b2/ACT
OMOP
PCORNET
▪ Different countries use different “outlets”.
▪ There is a need for travel adapters.
The Solution:
▪ Use a converter between various adapters.
▪ Allow researchers to ask a question once and
receive results from many different sources
Project Highlight: LOINC2HPO
◆ Develop a software tool to map
LOINC codes to HPO terms
◆ Develop software to convert
EHR observations into HPO
terms for use in clinical
research
Steps
Develop a tool for converting LOINC laboratory codes and values into more
phenotypically meaningful language (Human Phenotype Ontology) to allow for
translational interoperability and new analytics
2657-5 “Nitrite [Mass/volume] in Urine” Numeric
20407-3 “Nitrite [Mass/volume] in Urine by Test
strip”
Numeric
32710-6 “Nitrite [Presence] in Urine” Positive/Negati
ve
5802-4 “Nitrite [Presence] in Urine by Test strip” Positive/Negati
ve
50558-6 “Nitrite [Presence] in Urine by
Automated test strip
Positive/Negati
ve
LOINC Outcome
HPO: Nitrituria
INSERT CDE Browser Screenshot?
CIBMTR Center for Cancer
Research
Over 35 NCI Programs, Plus
Cancer Centers and Consortia
GDC
Data Sharing Index
• We need metrics for data, software,
algorithm use, usability, conformance
• Data sharing stimulates science,
innovation, commercialization
• Providing recognition and attribution
to data providers and software &
algorithm builders is critical for a
robust data sharing ecosystem
• Support and measure FAIRness!
Questions?
Warren Kibbe, Ph.D.
warren.kibbe@duke.edu
@wakibbe

More Related Content

What's hot

Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your Company
Robert Grossman
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
Philip Bourne
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
Robert Grossman
 
The Vision for Data @ the NIH
The Vision for Data @ the NIHThe Vision for Data @ the NIH
The Vision for Data @ the NIH
Philip Bourne
 
Darwin ai covid-net mitre
Darwin ai   covid-net mitreDarwin ai   covid-net mitre
Darwin ai covid-net mitre
ianmitch
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
Robert Grossman
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
Philip Bourne
 
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Robert Grossman
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical Research
Robert Grossman
 
Introduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchIntroduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia Research
David De Roure
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
Michel Dumontier
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
Philip Bourne
 
Understanding the Big Data Enterprise
Understanding the Big Data EnterpriseUnderstanding the Big Data Enterprise
Understanding the Big Data Enterprise
Philip Bourne
 
Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail Science
Beth Plale
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
Denodo
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
Robert Grossman
 
Hadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHA
Denodo
 
Trust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEADTrust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEAD
Beth Plale
 
Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human Health
Philip Bourne
 

What's hot (20)

Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your Company
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
The Vision for Data @ the NIH
The Vision for Data @ the NIHThe Vision for Data @ the NIH
The Vision for Data @ the NIH
 
Darwin ai covid-net mitre
Darwin ai   covid-net mitreDarwin ai   covid-net mitre
Darwin ai covid-net mitre
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data Platforms
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical Research
 
Introduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchIntroduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia Research
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Understanding the Big Data Enterprise
Understanding the Big Data EnterpriseUnderstanding the Big Data Enterprise
Understanding the Big Data Enterprise
 
Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail Science
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Hadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHA
 
Trust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEADTrust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEAD
 
Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human Health
 

Similar to Data Harmonization for a Molecularly Driven Health System

Hadoop Enabled Healthcare
Hadoop Enabled HealthcareHadoop Enabled Healthcare
Hadoop Enabled Healthcare
DataWorks Summit
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
DataWorks Summit/Hadoop Summit
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
DataWorks Summit/Hadoop Summit
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Kees van Bochove
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
Vivien Bonazzi
 
Building an Intelligent Biobank to Power Research Decision-Making
Building an Intelligent Biobank to Power Research Decision-MakingBuilding an Intelligent Biobank to Power Research Decision-Making
Building an Intelligent Biobank to Power Research Decision-Making
Denodo
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
Vivien Bonazzi
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncology
Warren Kibbe
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands
Vivien Bonazzi
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
DataWorks Summit
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
Vivien Bonazzi
 
Cri big data
Cri big dataCri big data
Cri big data
Putchong Uthayopas
 
2015 04-18-wilson cg
2015 04-18-wilson cg2015 04-18-wilson cg
2015 04-18-wilson cg
Christopher Wilson
 
Opportunities for HPC in pharma R&D - main deck
Opportunities for HPC in pharma R&D - main deckOpportunities for HPC in pharma R&D - main deck
Opportunities for HPC in pharma R&D - main deck
Pistoia Alliance
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Philip Bourne
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data Ecosystem
Globus
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECAProject
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
James Hendler
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
Carole Goble
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
Michel Dumontier
 

Similar to Data Harmonization for a Molecularly Driven Health System (20)

Hadoop Enabled Healthcare
Hadoop Enabled HealthcareHadoop Enabled Healthcare
Hadoop Enabled Healthcare
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
 
Building an Intelligent Biobank to Power Research Decision-Making
Building an Intelligent Biobank to Power Research Decision-MakingBuilding an Intelligent Biobank to Power Research Decision-Making
Building an Intelligent Biobank to Power Research Decision-Making
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncology
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
 
Cri big data
Cri big dataCri big data
Cri big data
 
2015 04-18-wilson cg
2015 04-18-wilson cg2015 04-18-wilson cg
2015 04-18-wilson cg
 
Opportunities for HPC in pharma R&D - main deck
Opportunities for HPC in pharma R&D - main deckOpportunities for HPC in pharma R&D - main deck
Opportunities for HPC in pharma R&D - main deck
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data Ecosystem
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 

More from Warren Kibbe

CCDI Kibbe Wake Forest University Dec 2023.pptx
CCDI Kibbe Wake Forest University Dec 2023.pptxCCDI Kibbe Wake Forest University Dec 2023.pptx
CCDI Kibbe Wake Forest University Dec 2023.pptx
Warren Kibbe
 
Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023
Warren Kibbe
 
CCDI Overview November 2022
CCDI Overview November 2022CCDI Overview November 2022
CCDI Overview November 2022
Warren Kibbe
 
RADx-UP CDCC Overview November 2022
RADx-UP CDCC Overview November 2022RADx-UP CDCC Overview November 2022
RADx-UP CDCC Overview November 2022
Warren Kibbe
 
CCDI Kibbe Big Data Training May 2022
CCDI Kibbe Big Data Training May 2022CCDI Kibbe Big Data Training May 2022
CCDI Kibbe Big Data Training May 2022
Warren Kibbe
 
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
Warren Kibbe
 
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
Warren Kibbe
 
RADx-UP CDCC presentation for the NIH Disaster Interest Group
RADx-UP CDCC presentation for the NIH Disaster Interest GroupRADx-UP CDCC presentation for the NIH Disaster Interest Group
RADx-UP CDCC presentation for the NIH Disaster Interest Group
Warren Kibbe
 
DCHI webinar on N3C January 2021
DCHI webinar on N3C January 2021DCHI webinar on N3C January 2021
DCHI webinar on N3C January 2021
Warren Kibbe
 
NCATS CTSA N3C
NCATS CTSA N3C NCATS CTSA N3C
NCATS CTSA N3C
Warren Kibbe
 
NAACCR June 2020
NAACCR June 2020NAACCR June 2020
NAACCR June 2020
Warren Kibbe
 
NCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncologyNCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncology
Warren Kibbe
 
ENAR 2020
ENAR 2020ENAR 2020
ENAR 2020
Warren Kibbe
 
ENAR 2020
ENAR 2020ENAR 2020
ENAR 2020
Warren Kibbe
 
Technology and connected health for population science kibbe duke jan 2020
Technology and connected health for population science kibbe duke jan 2020Technology and connected health for population science kibbe duke jan 2020
Technology and connected health for population science kibbe duke jan 2020
Warren Kibbe
 
Super computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop KeynoteSuper computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop Keynote
Warren Kibbe
 
Data sharing Webinar March 2019
Data sharing Webinar March  2019Data sharing Webinar March  2019
Data sharing Webinar March 2019
Warren Kibbe
 
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Warren Kibbe
 
Opportunities for computing in cancer research
Opportunities for computing in cancer researchOpportunities for computing in cancer research
Opportunities for computing in cancer research
Warren Kibbe
 
Opportunities in technology and connected health for population science
Opportunities in technology and connected health for population science Opportunities in technology and connected health for population science
Opportunities in technology and connected health for population science
Warren Kibbe
 

More from Warren Kibbe (20)

CCDI Kibbe Wake Forest University Dec 2023.pptx
CCDI Kibbe Wake Forest University Dec 2023.pptxCCDI Kibbe Wake Forest University Dec 2023.pptx
CCDI Kibbe Wake Forest University Dec 2023.pptx
 
Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023
 
CCDI Overview November 2022
CCDI Overview November 2022CCDI Overview November 2022
CCDI Overview November 2022
 
RADx-UP CDCC Overview November 2022
RADx-UP CDCC Overview November 2022RADx-UP CDCC Overview November 2022
RADx-UP CDCC Overview November 2022
 
CCDI Kibbe Big Data Training May 2022
CCDI Kibbe Big Data Training May 2022CCDI Kibbe Big Data Training May 2022
CCDI Kibbe Big Data Training May 2022
 
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
 
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
 
RADx-UP CDCC presentation for the NIH Disaster Interest Group
RADx-UP CDCC presentation for the NIH Disaster Interest GroupRADx-UP CDCC presentation for the NIH Disaster Interest Group
RADx-UP CDCC presentation for the NIH Disaster Interest Group
 
DCHI webinar on N3C January 2021
DCHI webinar on N3C January 2021DCHI webinar on N3C January 2021
DCHI webinar on N3C January 2021
 
NCATS CTSA N3C
NCATS CTSA N3C NCATS CTSA N3C
NCATS CTSA N3C
 
NAACCR June 2020
NAACCR June 2020NAACCR June 2020
NAACCR June 2020
 
NCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncologyNCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncology
 
ENAR 2020
ENAR 2020ENAR 2020
ENAR 2020
 
ENAR 2020
ENAR 2020ENAR 2020
ENAR 2020
 
Technology and connected health for population science kibbe duke jan 2020
Technology and connected health for population science kibbe duke jan 2020Technology and connected health for population science kibbe duke jan 2020
Technology and connected health for population science kibbe duke jan 2020
 
Super computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop KeynoteSuper computing 19 Cancer Computing Workshop Keynote
Super computing 19 Cancer Computing Workshop Keynote
 
Data sharing Webinar March 2019
Data sharing Webinar March  2019Data sharing Webinar March  2019
Data sharing Webinar March 2019
 
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
 
Opportunities for computing in cancer research
Opportunities for computing in cancer researchOpportunities for computing in cancer research
Opportunities for computing in cancer research
 
Opportunities in technology and connected health for population science
Opportunities in technology and connected health for population science Opportunities in technology and connected health for population science
Opportunities in technology and connected health for population science
 

Recently uploaded

Non-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdfNon-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdf
MedicoseAcademics
 
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptxTriangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Dr. Rabia Inam Gandapore
 
POST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its managementPOST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its management
touseefaziz1
 
Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists
Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists  Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists
Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists
Saeid Safari
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
MedicoseAcademics
 
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
bkling
 
Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...
Sujoy Dasgupta
 
Antiulcer drugs Advance Pharmacology .pptx
Antiulcer drugs Advance Pharmacology .pptxAntiulcer drugs Advance Pharmacology .pptx
Antiulcer drugs Advance Pharmacology .pptx
Rohit chaurpagar
 
Are There Any Natural Remedies To Treat Syphilis.pdf
Are There Any Natural Remedies To Treat Syphilis.pdfAre There Any Natural Remedies To Treat Syphilis.pdf
Are There Any Natural Remedies To Treat Syphilis.pdf
Little Cross Family Clinic
 
Flu Vaccine Alert in Bangalore Karnataka
Flu Vaccine Alert in Bangalore KarnatakaFlu Vaccine Alert in Bangalore Karnataka
Flu Vaccine Alert in Bangalore Karnataka
addon Scans
 
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
VarunMahajani
 
The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...
The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...
The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...
Catherine Liao
 
THOA 2.ppt Human Organ Transplantation Act
THOA 2.ppt Human Organ Transplantation ActTHOA 2.ppt Human Organ Transplantation Act
THOA 2.ppt Human Organ Transplantation Act
DrSathishMS1
 
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdfBENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
DR SETH JOTHAM
 
ANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptxANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptx
Swetaba Besh
 
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptxMaxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Dr. Rabia Inam Gandapore
 
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 UpakalpaniyaadhyayaCharaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Dr KHALID B.M
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
NephroTube - Dr.Gawad
 
Prix Galien International 2024 Forum Program
Prix Galien International 2024 Forum ProgramPrix Galien International 2024 Forum Program
Prix Galien International 2024 Forum Program
Levi Shapiro
 
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
i3 Health
 

Recently uploaded (20)

Non-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdfNon-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdf
 
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptxTriangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
 
POST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its managementPOST OPERATIVE OLIGURIA and its management
POST OPERATIVE OLIGURIA and its management
 
Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists
Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists  Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists
Ozempic: Preoperative Management of Patients on GLP-1 Receptor Agonists
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
 
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
Report Back from SGO 2024: What’s the Latest in Cervical Cancer?
 
Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...Couples presenting to the infertility clinic- Do they really have infertility...
Couples presenting to the infertility clinic- Do they really have infertility...
 
Antiulcer drugs Advance Pharmacology .pptx
Antiulcer drugs Advance Pharmacology .pptxAntiulcer drugs Advance Pharmacology .pptx
Antiulcer drugs Advance Pharmacology .pptx
 
Are There Any Natural Remedies To Treat Syphilis.pdf
Are There Any Natural Remedies To Treat Syphilis.pdfAre There Any Natural Remedies To Treat Syphilis.pdf
Are There Any Natural Remedies To Treat Syphilis.pdf
 
Flu Vaccine Alert in Bangalore Karnataka
Flu Vaccine Alert in Bangalore KarnatakaFlu Vaccine Alert in Bangalore Karnataka
Flu Vaccine Alert in Bangalore Karnataka
 
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
 
The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...
The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...
The POPPY STUDY (Preconception to post-partum cardiovascular function in prim...
 
THOA 2.ppt Human Organ Transplantation Act
THOA 2.ppt Human Organ Transplantation ActTHOA 2.ppt Human Organ Transplantation Act
THOA 2.ppt Human Organ Transplantation Act
 
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdfBENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
BENIGN PROSTATIC HYPERPLASIA.BPH. BPHpdf
 
ANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptxANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF URINARY SYSTEM.pptx
 
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptxMaxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
 
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 UpakalpaniyaadhyayaCharaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
 
Prix Galien International 2024 Forum Program
Prix Galien International 2024 Forum ProgramPrix Galien International 2024 Forum Program
Prix Galien International 2024 Forum Program
 
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
 

Data Harmonization for a Molecularly Driven Health System

  • 1. Data Harmonization for a Molecularly Driven Health System Warren A. Kibbe, Ph.D. Professor, Biostats & Bioinformatics Chief Data Officer, Duke Cancer Institute warren.kibbe@duke.edu @wakibbe #DataSharing #LearningHealthSystem #DataHarmonization
  • 2. Sections • Learning Health Systems • Data Commons • Data Harmonization
  • 3. The World is Changing • Pace of Commercialization • Reach of Markets • Role of Data • Change in Healthcare • Change in Computing • Societal Changes
  • 4. Is the US able to keep up?
  • 6. US R&D Funding as share of GDP
  • 8. How do we continue to innovate?
  • 12. Changes in Computing • Converged devices • Converged IT • Ubiquity of devices, data, mHealth
  • 14. Pace of Technology Adoption
  • 16. Changes in Oncology • Cancer is a grand challenge • Anatomic vs molecular classification • Health vs Disease
  • 17. Understanding Cancer • Precision medicine will lead to fundamental understanding of the complex interplay between genetics, epigenetics, nutrition, environment and clinical presentation and direct effective, evidence-based prevention and treatment. Ramifications across many aspects of health care
  • 20. “Science, informatics, incentives, and culture are aligned for continuous improvement and innovation, with best practices seamlessly embedded in the delivery process and new knowledge captured as an integral by-product of the delivery experience.” —Institute of Medicine LEARNING HEALTH SYSTEMS
  • 21. Another imperative is that such systems do their work: • Transparently (how does one learn without well documented processes?) • Reproducibly (good practices must always be repeatable at scale and scientifically reproducible) • Only with the above can the science in “data science” be done with sufficient rigor LEARNING HEALTH SYSTEMS
  • 24. Goals • Contain rising cost of healthcare • Maximize the value of care • Increase public discourse and marketplace for healthcare
  • 25. Drivers • Decision Making is too complex • Clinical decisions are based on practice, not evidence • Inefficiency and waste in healthcare
  • 28. EHRs and the Learning Health System
  • 30. Problems for LHS to solve
  • 32. Poor Health in spite of high expenditures
  • 34. 2015
  • 36. EHRs are now ubiquitous But evidence-driven decision support remains a future vision
  • 37. Hope Cloud computing, data commons, service-based computing provide some powerful tools for solving data access, data analysis, data analytics, and data visualization problems at scale, securely.
  • 39. So what is a Data Commons
  • 40. Commons Topology Compute Platform: Cloud or HPC Services: APIs, Containers, Indexing, Software: Services & Tools scientific analysis tools/workflows Data “Reference” Data Sets User defined data DigitalObjectCompliance App store/User Interface PaaS SaaS IaaS https://datascience.nih.gov/commons
  • 41. Commons Compliance • Treat products of research – data, methods, papers etc. as digital objects • These digital objects exist in a shared virtual space • Digital object compliance through FAIR principles: – Findable – Accessible (and usable) – Interoperable – Reusable
  • 42. Data Sharing and the FAIR Principles FAIR – Making data Findable, Accessible, Attributable, Interoperable, Reusable, and provide Recognition Force11 white paper https://www.force11.org/group/fairgroup/fairprinciples
  • 43. “The Commons is an effort at creating a sharing economy and for building community. We hope for a more cost effective and productive research environment while bringing people together in a unique way.“ Phil Bourne
  • 44. 44 Blue Ribbon Panel Report Cancer Moonshot℠ Blue Ribbon Panel “The Cancer Moonshot Task Force was directed to consult with external experts from relevant scientific sectors, including the presidentially appointed National Cancer Advisory Board(NCAB). A Blue Ribbon Panel of scientific experts was created to advise the NCAB.”
  • 45. Vision: Enable the creation of a Learning Healthcare System for Cancer, where as a nation we learn from the contributed knowledge and experience of every cancer patient. As part of the Cancer Moonshot, we want to unleash the power of data to enhance, improve, and inform the journey of every cancer patient from the point of diagnosis through survivorship.
  • 46. A National Cancer Data Ecosystem Cancer Research Data Commons SBG CGC Broad FireCloud ISB CGC Courtesy NCI-CBIIT
  • 47. Data Commons Framework – What Is It? 47 Modular Components Secure user authentication and authorization Metadata validation and tools Domain-specific, extensible data models and dictionaries API and container environment for tools and pipelines Access to computational workspaces for storing data, tools, and results Reusable, expandable framework for a Data Commons Core principles and structures for a Data Commons Set of modular components that can be leveraged across Data Commons
  • 48. Narrow Middle Architecture (End-to-End Design) 1. AuthN / AuthZ 2. Metadata validation 3. Extensible data model 4. APIs for containers, workflows & tools 5. Workspaces science outdata in Courtesy Bob Grossman, U. Chicago
  • 49. 49 NCI Cancer Research Data Commons (CRDC) - Concept NCI Scope: “Create a data science infrastructure necessary to connect repositories, analytical tools, and knowledge bases” Data commons co-locate data, storage and computing infrastructure with commonly used services, tools & apps for analyzing and sharing data to create an interoperable resource for the research community.* *Robert L. Grossman, Allison Heath, Mark Murphy, Maria Patterson and Walt Wells, A Case for Data Commons Towards Data Science as a Service, IEEE Computing in Science and Engineer, 2016. Source of image: The CDIS, GDC, & OCC data commons infrastructure at the University of Chicago Kenwood Data Center.
  • 50. 50 Data Commons Framework Clinical Proteomics ImagingGenomics Immuno- oncology Animal Models Cancer Biomarkers NCI Cancer Research Data Commons SBG CGC Broad FireCloud ISB CGC Elastic Compute Query Visualization Clinical Proteomics Tumor Analysis Consortium* Tool Deployment The Cancer Imaging Archive* TCIA Web Interface APIs Data Submission Authentication & Authorization Authentication & Authorization Data Models & Dictionaries Computational Workspaces Data Contributors and Consumers Tool Repositories Metadata Validation & Tools Analysis Courtesy NCI-CBIIT
  • 54. NCI Genomic Data Commons
  • 55. NCI Genomic Data Commons
  • 56. NCI Genomic Data Commons
  • 57. Data Harmonization • The process of semantic and syntactic mapping of data to a set of definitions, predefined data elements, data model. • Validation and Harmonization of primary and secondary data is crucial to enable analysis and reuse
  • 58. Spanning the Semantic Chasm of Despair Building a Translational Bridge CD2H Thanks to Melissa Haendel
  • 59. Project Highlight: Harmonizing clinical data models Sentinel I2b2/ACT OMOP PCORNET ▪ Different countries use different “outlets”. ▪ There is a need for travel adapters. The Solution: ▪ Use a converter between various adapters. ▪ Allow researchers to ask a question once and receive results from many different sources
  • 60. Project Highlight: LOINC2HPO ◆ Develop a software tool to map LOINC codes to HPO terms ◆ Develop software to convert EHR observations into HPO terms for use in clinical research Steps Develop a tool for converting LOINC laboratory codes and values into more phenotypically meaningful language (Human Phenotype Ontology) to allow for translational interoperability and new analytics 2657-5 “Nitrite [Mass/volume] in Urine” Numeric 20407-3 “Nitrite [Mass/volume] in Urine by Test strip” Numeric 32710-6 “Nitrite [Presence] in Urine” Positive/Negati ve 5802-4 “Nitrite [Presence] in Urine by Test strip” Positive/Negati ve 50558-6 “Nitrite [Presence] in Urine by Automated test strip Positive/Negati ve LOINC Outcome HPO: Nitrituria
  • 61. INSERT CDE Browser Screenshot? CIBMTR Center for Cancer Research Over 35 NCI Programs, Plus Cancer Centers and Consortia GDC
  • 62. Data Sharing Index • We need metrics for data, software, algorithm use, usability, conformance • Data sharing stimulates science, innovation, commercialization • Providing recognition and attribution to data providers and software & algorithm builders is critical for a robust data sharing ecosystem • Support and measure FAIRness!