Big Medical Data: Challenge or Potential
BioNRW, Münster, May 21, 2014
Dr. Matthieu-P. Schapranow, Hasso Plattner Institute
Hasso Plattner Institute
Key Facts
■  Founded as a public-private partnership
in 1998 in Potsdam near Berlin, Germany
■  Institute belongs to the
University of Potsdam
■  Ranked 1st in CHE since 2009
■  500 B.Sc. and M.Sc. students
■  10 professors, 150 PhD students
■  Course of study: IT Systems Engineering
Big Medical Data, BioNRW, Münster, Dr. M.-P. Schapranow, May 21, 2014
Hasso Plattner Institute
Enterprise Platform and Integration Concepts Group
  Prof. Dr. h.c. Hasso Plattner
■ Research focuses on the technical aspects of enterprise
software and design of complex applications
□  In-Memory Data Management for Enterprise Applications
□  Enterprise Application Programming Model
□  Scientific Data Management
□  Human-Centered Software Design and Engineering
■ Industry cooperations, e.g. SAP, Siemens, Audi, and EADS
■ Research cooperations, e.g. Stanford, MIT, and Berkeley
■  Partner of Stanford Center for Design Research
■  Partner of MIT in Supply Chain Innovation and CSAIL
■  Partner at UC Berkeley RAD / AMP Lab
■  Partner of SAP AG
The Challenge
Distributed Big Data Sources
■  Human genome/biological data: 600 GB per full genome, 15 PB+ in databases of leading institutes
■  Prescription data: 1.5B records from 10,000 doctors and 10M patients (100 GB)
■  Clinical trials: currently more than 30k recruiting on ClinicalTrials.gov
■  Human proteome: 160M data points (2.4 GB) per sample, >3 TB raw proteome data in ProteomicsDB
■  PubMed database: >23M articles
■  Hospital information systems: often more than 50 GB
■  Medical sensor data: scan of a single organ in 1 s creates 10 GB of raw data
■  Cancer patient records: >160k records at NCT
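To put these volumes in proportion, the following Python sketch derives a few per-item sizes from the figures on this slide. It assumes decimal units (1 GB = 10^9 bytes), which the slide does not state, and the variable names are chosen for illustration only.

```python
# Assumption: decimal units, i.e. 1 MB = 10**6 bytes, 1 GB = 10**9 bytes, 1 PB = 10**15 bytes.
MB = 10**6
GB = 10**9
PB = 10**15

# Human proteome: 160M data points in 2.4 GB per sample.
bytes_per_proteome_point = 2_400 * MB / 160_000_000

# Prescription data: 1.5B records in 100 GB.
bytes_per_prescription_record = 100 * GB / 1_500_000_000

# Genome data: how many 600 GB genomes fit into 15 PB?
genomes_in_15_pb = (15 * PB) // (600 * GB)

print(f"{bytes_per_proteome_point:.0f} bytes per proteome data point")
print(f"{bytes_per_prescription_record:.1f} bytes per prescription record")
print(f"{genomes_in_15_pb} full genomes in 15 PB")
```

Under these assumptions a single data point or record is small; the challenge comes from the number of items and from the fact that the sources are distributed.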
Our Approach
In-Memory Technology
■  Combined column and row store
■  Map/Reduce
■  Single and multi-tenancy
■  Lightweight compression
■  Insert only for time travel
■  Real-time replication
■  Working on integers
■  SQL interface on columns and rows
■  Active/passive data store
■  Minimal projections
■  Group key
■  Reduction of software layers
■  Dynamic multi-threading
■  Bulk load of data
■  Object-relational mapping
■  Text retrieval and extraction engine
■  No aggregate tables
■  Data partitioning
■  Any attribute as index
■  No disk
■  On-the-fly extensibility
■  Analytics on historical data
■  Multi-core/parallelization
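Several of the building blocks named on this slide (column store, lightweight compression, working on integers) are commonly realized with dictionary encoding. The Python sketch below illustrates that general technique under this assumption; it is not the platform's implementation, and the function names and the sample column are hypothetical.

```python
from typing import List, Tuple


def dictionary_encode(column: List[str]) -> Tuple[List[str], List[int]]:
    """Replace each value by the integer position of that value in a sorted dictionary."""
    dictionary = sorted(set(column))
    position = {value: idx for idx, value in enumerate(dictionary)}
    return dictionary, [position[value] for value in column]


def scan_equals(dictionary: List[str], codes: List[int], needle: str) -> List[int]:
    """Return the row ids whose value equals needle, comparing integers only."""
    if needle not in dictionary:
        return []
    code = dictionary.index(needle)
    return [row for row, c in enumerate(codes) if c == code]


# Hypothetical sample column: one gene symbol per row.
genes = ["BRAF", "KRAS", "BRAF", "EGFR", "KRAS", "BRAF"]
dictionary, codes = dictionary_encode(genes)
rows = scan_equals(dictionary, codes, "BRAF")
print(dictionary, codes, rows)
```

Each distinct string is stored once, and the scan touches only a compact list of integers, which is the idea behind combining a column layout with lightweight compression.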
Our Vision
Personalized Medicine
High-Performance In-Memory Genome Project
[Figure: architecture of the High-Performance In-Memory Genome Project. Labels in the diagram: In-Memory Database; Extensions; App Store; Access Control; Billing; Statistical Tools; Analytical Tools; ...; Data (Genome Data, Genome Metadata, Pathways, Publications, Pipeline Models, Drugs and Interactions); Drug Response Analysis; Pathway Topology Analysis; Medical Knowledge Cockpit; Oncolyzer; Cohort Analysis; Clinical Trial Assessment]
High-Performance In-Memory Genome Project
Medical Knowledge Cockpit
■  Search for affected genes in distributed and
heterogeneous data sources
■  Immediate exploration of relevant information,
such as
□  Gene descriptions,
□  Molecular impact and related pathways,
□  Scientific publications, and
□  Suitable clinical trials.
■  No manual searching for hours or days:
In-memory technology translates searching into
interactive finding!
Automatic clinical trial matching built on text analysis features
Unified access to structured and unstructured data sources
Medical Knowledge Cockpit
Seamless Integration of Patient Specifics
■  Google-like user interface for searching data
■  Seamless integration of individual EMR data
■  Search various sources for biomarkers, literature, and diseases
Medical Knowledge Cockpit
Publications
■  In-place preview of relevant data, such as publications and publication
metadata
■  Incorporating individual filter settings, e.g. additional search terms
Medical Knowledge Cockpit
Publications
■  Interactively explore relevant publications, e.g. PDFs
■  Improved ease of exploration, e.g. by highlighted medical terms and
relevant concepts
Medical Knowledge Cockpit
Latest Clinical Trials
■  Personalized clinical trials, e.g. by incorporating patient specifics
■  Classification of internal/external trials based on treating institute
Medical Knowledge Cockpit
Pathway Topology Analysis
■  Today, searching in pathways is limited to asking whether a certain
element is contained
■  Integrated >1.5k pathways from international
sources, e.g. KEGG, HumanCyc, and
WikiPathways, into HANA
■  Implemented graph-based topology exploration
and ranking based on patient specifics
■  Enables interactive identification of possible
dysfunctions affecting the course of a therapy
before its start
Unified access to multiple formerly disjoint data sources
Pathway analysis of genetic variants with graph engine
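Graph-based topology exploration goes beyond a containment check by following the edges that lead away from an affected element. The Python sketch below illustrates the idea with a breadth-first search and a simple ranking. It is plain Python rather than the graph engine mentioned on the slide, the ranking criterion (number of impacted elements) is an assumption, and the pathway names, element names, and edges are toy data, not curated biology.

```python
from collections import deque
from typing import Dict, List, Tuple

Graph = Dict[str, List[str]]  # element -> elements it acts on


def downstream_elements(pathway: Graph, start: str) -> Dict[str, int]:
    """Breadth-first search: every element reachable from start, with its hop distance."""
    distance = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for successor in pathway.get(node, []):
            if successor not in distance:
                distance[successor] = distance[node] + 1
                queue.append(successor)
    del distance[start]
    return distance


def rank_pathways(pathways: Dict[str, Graph], affected: List[str]) -> List[Tuple[str, int]]:
    """Rank pathways by how many elements are affected or lie downstream of an affected one.

    The scoring rule is a hypothetical stand-in for "ranking based on patient specifics".
    """
    scores = []
    for name, graph in pathways.items():
        nodes = set(graph) | {t for targets in graph.values() for t in targets}
        impacted = set()
        for element in affected:
            if element in nodes:
                impacted.add(element)
                impacted.update(downstream_elements(graph, element))
        scores.append((name, len(impacted)))
    return sorted(scores, key=lambda item: (-item[1], item[0]))


# Toy data: abstract elements and edges, for illustration only.
pathways = {
    "toy_pathway_1": {"A": ["B"], "B": ["C"], "C": ["D"]},
    "toy_pathway_2": {"X": ["Y"], "B": ["Z"]},
}
ranking = rank_pathways(pathways, affected=["B"])
print(ranking)
```

A containment check would report both toy pathways equally, since both contain element B; the traversal additionally shows how far the effect can propagate in each.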
Medical Knowledge Cockpit
Search in Structured and Unstructured Medical Data
■  Text analysis feature extended by medical
terminology
□  Genes (122,975 + 186,771 synonyms)
□  Medical terms and categories (98,886 diseases
+ 48,561 synonyms, 47 categories)
□  Pharmaceutical ingredients (7,099 + 5,561
synonyms)
■  Indexed ClinicalTrials.gov database (145k trials,
30,138 recruiting)
■  Extracted, e.g., 320k genes, 161k ingredients,
30k periods
■  Select all studies based on multiple filters in less
than 500ms
Clinical trial matching using text analysis features
Unified access to structured and unstructured data sources
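Dictionary-based text analysis of this kind can be pictured as mapping every synonym found in free text to one canonical term and then filtering on the extracted terms. The Python sketch below illustrates this under simplifying assumptions: the mini-dictionary, the trial records, and their identifiers are hypothetical, and whitespace-level tokenization stands in for a real text analysis engine.

```python
import re
from typing import Dict, List, Set

# Hypothetical mini-dictionary: canonical gene symbol -> synonyms.
GENE_SYNONYMS: Dict[str, List[str]] = {
    "ERBB2": ["HER2", "NEU"],
    "EGFR": ["ERBB1", "HER1"],
}


def build_lookup(synonyms: Dict[str, List[str]]) -> Dict[str, str]:
    """Map every lower-cased name (canonical or synonym) to its canonical term."""
    lookup = {}
    for canonical, alternatives in synonyms.items():
        for name in [canonical, *alternatives]:
            lookup[name.lower()] = canonical
    return lookup


def extract_terms(text: str, lookup: Dict[str, str]) -> Set[str]:
    """Return the canonical terms mentioned in the text."""
    tokens = re.findall(r"[A-Za-z0-9]+", text.lower())
    return {lookup[token] for token in tokens if token in lookup}


def select_trials(trials: List[dict], gene: str, recruiting_only: bool = True) -> List[str]:
    """Combine a structured filter (recruiting) with a text-derived filter (gene)."""
    lookup = build_lookup(GENE_SYNONYMS)
    return [
        trial["id"]
        for trial in trials
        if (trial["recruiting"] or not recruiting_only)
        and gene in extract_terms(trial["description"], lookup)
    ]


# Hypothetical trial records, not taken from any registry.
trials = [
    {"id": "T1", "recruiting": True, "description": "Study in HER2-positive breast cancer"},
    {"id": "T2", "recruiting": False, "description": "ERBB2 amplification in gastric tumors"},
    {"id": "T3", "recruiting": True, "description": "EGFR inhibitor in lung cancer"},
]
print(select_trials(trials, "ERBB2"))
print(select_trials(trials, "ERBB2", recruiting_only=False))
```

In a production setting the extraction would run once at indexing time, so that selecting studies by multiple filters becomes a lookup on precomputed columns.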
Drug Response Analysis and Prediction
Data Sources
Collect Patient Data
Sequence Tumor
Conduct Xenograft Experiments
Metadata, e.g. smoking status, tumor classification, and age
Experiment Results, e.g. medication effectiveness obtained from wet laboratory
Genome Data, e.g. raw DNA data and genetic variants
Computational Biology
Tumor Similarity and Clustering
Visual Data Exploration
Perform Manual Data Exploration and Analysis
Drug Response Analysis and Prediction
Interactive Data Exploration
■  Drug response depends on individual genetic
variants of tumors
■  Challenge: Identification of relevant genetic
variants and their impact on drug response is an
ongoing research activity, e.g. Xenograft models
■  Exploration of experiment results is time-
consuming and Excel-driven
■  In-memory technology enables interactive
exploration of experiment data to leverage new
scientific insights
Interactive analysis of correlations between drugs and genetic variants
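One simple way to explore such a relationship is to compare the mean drug response of samples that carry a given variant with the mean of those that do not. The Python sketch below shows this comparison as an illustration only; the sample records, variant and drug names, and response values are hypothetical, and a group-mean difference is an assumed stand-in for whatever statistics the project applies.

```python
from statistics import mean
from typing import Dict, List


def response_by_variant(samples: List[dict], variant: str, drug: str) -> Dict[str, float]:
    """Compare the mean response to a drug between carriers and non-carriers of a variant."""
    tested = [s for s in samples if drug in s["response"]]
    carriers = [s["response"][drug] for s in tested if variant in s["variants"]]
    others = [s["response"][drug] for s in tested if variant not in s["variants"]]
    if not carriers or not others:
        raise ValueError("need at least one tested sample in each group")
    return {
        "carriers": mean(carriers),
        "non_carriers": mean(others),
        "difference": mean(carriers) - mean(others),
    }


# Hypothetical xenograft samples; response values are made-up percentages.
samples = [
    {"id": "X1", "variants": {"v1"}, "response": {"drugA": 80.0}},
    {"id": "X2", "variants": {"v1", "v2"}, "response": {"drugA": 60.0}},
    {"id": "X3", "variants": {"v2"}, "response": {"drugA": 20.0}},
    {"id": "X4", "variants": set(), "response": {"drugA": 40.0}},
]
summary = response_by_variant(samples, variant="v1", drug="drugA")
print(summary)
```

Running the same comparison for every variant and drug pair is the kind of repetitive query that benefits from interactive response times instead of spreadsheet work.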
High-Performance In-Memory Genome Project
Innovations start here
■  Strong partners combine their expertise to
come up with innovative solutions for
personalized medicine
■  Experts from interdisciplinary fields join
globally distributed teams
■  Internationally accepted research institutes
guarantee academic validity
■  What about you? → Get in contact with us!
Interdisciplinary Design Thinking Teams
You?
What to take home?
Test-drive it yourself: http://we.AnalyzeGenomes.com
  For researchers
■  Enable real-time analysis of medical data
■  Automatic assessment of data, e.g. scan of pathways to identify
cellular impact of mutations
■  Combined free-text search in publications, diagnosis, and EMR
data, i.e. structured and unstructured data
  For clinicians
■  Preventive diagnostics to identify risk patients early
■  Indicate pharmacokinetic correlations
■  Scan for similar patient cases, e.g. to evaluate therapy success
  For patients
■  Identify relevant clinical trials and medical experts
■  Start most appropriate therapy as early as possible
Keep in contact with us!
Hasso Plattner Institute
Enterprise Platform & Integration Concepts
Dr. Matthieu-P. Schapranow
August-Bebel-Str. 88
14482 Potsdam, Germany
schapranow@hpi.uni-potsdam.de
http://we.analyzegenomes.com/
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

BioNRW: Big Medical Data: Challenge or Potential

  • 1. Big Medical Data: Challenge or Potential BioNRW, Münster, May 21, 2014 Dr. Matthieu-P. Schapranow, Hasso Plattner Institute
  • 2. Hasso Plattner Institute: Key Facts
      ■ Founded as a public-private partnership in 1998 in Potsdam near Berlin, Germany
      ■ Institute belongs to the University of Potsdam
      ■ Ranked 1st in CHE since 2009
      ■ 500 B.Sc. and M.Sc. students
      ■ 10 professors, 150 PhD students
      ■ Course of study: IT Systems Engineering
      Big Medical Data, BioNRW, Münster, Dr. M.-P. Schapranow, May 21, 2014
  • 3. Hasso Plattner Institute: Enterprise Platform and Integration Concepts Group, Prof. Dr. h.c. Hasso Plattner
      ■ Research focuses on the technical aspects of enterprise software and design of complex applications
        □ In-Memory Data Management for Enterprise Applications
        □ Enterprise Application Programming Model
        □ Scientific Data Management
        □ Human-Centered Software Design and Engineering
      ■ Industry cooperations, e.g. SAP, Siemens, Audi, and EADS
      ■ Research cooperations, e.g. Stanford, MIT, and Berkeley
      Partner of Stanford Center for Design Research; partner of MIT in Supply Chain Innovation and CSAIL; partner at UC Berkeley RAD / AMP Lab; partner of SAP AG
  • 4. The Challenge: Distributed Big Data Sources
      ■ Human genome/biological data: 600GB per full genome; 15PB+ in databases of leading institutes
      ■ Prescription data: 1.5B records from 10,000 doctors and 10M patients (100 GB)
      ■ Clinical trials: currently more than 30k recruiting on ClinicalTrials.gov
      ■ Human proteome: 160M data points (2.4GB) per sample; >3TB raw proteome data in ProteomicsDB
      ■ PubMed database: >23M articles
      ■ Hospital information systems: often more than 50GB
      ■ Medical sensor data: scan of a single organ in 1s creates 10GB of raw data
      ■ Cancer patient records: >160k records at NCT
  • 5. Our Approach: In-Memory Technology
      Building blocks shown on the slide: Combined column and row store; Map/Reduce; Single and multi-tenancy; Lightweight compression; Insert only for time travel; Real-time replication; Working on integers; SQL interface on columns and rows; Active/passive data store; Minimal projections; Group key; Reduction of software layers; Dynamic multi-threading; Bulk load of data; Object-relational mapping; Text retrieval and extraction engine; No aggregate tables; Data partitioning; Any attribute as index; No disk; On-the-fly extensibility; Analytics on historical data; Multi-core/parallelization
  • 6. Our Vision: Personalized Medicine
  • 7. High-Performance In-Memory Genome Project
      Diagram labels: In-Memory Database; Extensions; App Store; Access Control; Billing; Statistical Tools; Data; Genome Data; Pathways; Genome Metadata; Publications; Pipeline Models; Analytical Tools; ...; Drugs and Interactions; Drug Response Analysis; Pathway Topology Analysis; Medical Knowledge Cockpit; Oncolyzer; Cohort Analysis; Clinical Trial Assessment
  • 8. High-Performance In-Memory Genome Project: Medical Knowledge Cockpit
      ■ Search for affected genes in distributed and heterogeneous data sources
      ■ Immediate exploration of relevant information, such as
        □ Gene descriptions,
        □ Molecular impact and related pathways,
        □ Scientific publications, and
        □ Suitable clinical trials.
      ■ No manual searching for hours or days: In-memory technology translates searching into interactive finding!
      Automatic clinical trial matching built on text analysis features
      Unified access to structured and unstructured data sources
  • 9. Medical Knowledge Cockpit: Seamless Integration of Patient Specifics
      ■ Google-like user interface for searching data
      ■ Seamless integration of individual EMR data
      ■ Search various sources for biomarkers, literature, and diseases
  • 10. Medical Knowledge Cockpit: Publications
      ■ In-place preview of relevant data, such as publications and publication metadata
      ■ Incorporates individual filter settings, e.g. additional search terms
  • 11. Medical Knowledge Cockpit: Publications
      ■ Interactively explore relevant publications, e.g. PDFs
      ■ Improved ease of exploration, e.g. by highlighted medical terms and relevant concepts
  • 12. Medical Knowledge Cockpit: Latest Clinical Trials
      ■ Personalized clinical trials, e.g. by incorporating patient specifics
      ■ Classification of internal/external trials based on treating institute
  • 13. Medical Knowledge Cockpit: Pathway Topology Analysis
      ■ Today, search in pathways is limited to “is a certain element contained?”
      ■ Integrated >1.5k pathways from international sources, e.g. KEGG, HumanCyc, and WikiPathways, into HANA
      ■ Implemented graph-based topology exploration and ranking based on patient specifics
      ■ Enables interactive identification of possible dysfunctions affecting the course of a therapy before its start
      Unified access to multiple formerly disjoint data sources
      Pathway analysis of genetic variants with graph engine
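The graph-based exploration and ranking on slide 13 could look roughly like the following Python sketch. It is not the HANA graph engine or the authors' implementation: the pathway, its edges, and the function names `downstream` and `rank_pathways` are hypothetical, and the ranking rule (count of elements at or downstream of a mutated gene) is an assumption chosen only to illustrate the idea.

```python
from collections import deque

# Hypothetical, heavily simplified pathway: an edge A -> B means "A acts on B".
PATHWAY = {
    "EGFR": ["KRAS"],
    "KRAS": ["BRAF"],
    "BRAF": ["MEK1"],
    "MEK1": ["ERK2"],
    "ERK2": [],
}

def downstream(graph, gene):
    """Breadth-first search: every element reachable from `gene`."""
    seen, queue = set(), deque([gene])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

def rank_pathways(pathways, patient_variants):
    """Rank pathways by the number of elements at or downstream of a mutated gene."""
    scores = {}
    for name, graph in pathways.items():
        affected = set()
        for gene in patient_variants:
            if gene in graph:
                affected |= {gene} | downstream(graph, gene)
        scores[name] = len(affected)
    # Highest score first; ties keep insertion order because sorted() is stable.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `rank_pathways({"MAPK": PATHWAY, "other": {"TP53": []}}, ["KRAS"])` puts the toy "MAPK" pathway first, because the mutated gene and three downstream elements fall inside it. This goes beyond a containment check: it asks what a variant can influence, not only whether the element is present.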
  • 14. Medical Knowledge Cockpit: Search in Structured and Unstructured Medical Data
      ■ Text analysis feature extended with medical terminology
        □ Genes (122,975 + 186,771 synonyms)
        □ Medical terms and categories (98,886 diseases + 48,561 synonyms, 47 categories)
        □ Pharmaceutical ingredients (7,099 + 5,561 synonyms)
      ■ Indexed clinicaltrials.gov database (145k trials / 30,138 recruiting)
      ■ Extracted, e.g., 320k genes, 161k ingredients, 30k periods
      ■ Select all studies based on multiple filters in less than 500ms
      Clinical trial matching using text analysis features
      Unified access to structured and unstructured data sources
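Selecting studies by several filters at once, as described on slide 14, can be illustrated with a small inverted index in Python. This is a sketch under stated assumptions, not the text analysis engine from the talk: the trial ids, the `gene:` / `ingredient:` / `status:` term prefixes, and the functions `build_index` and `select` are all hypothetical.

```python
from collections import defaultdict

# Hypothetical records: (trial id, recruiting?, terms extracted by text analysis)
TRIALS = [
    ("T-001", True, {"gene:BRAF", "ingredient:vemurafenib"}),
    ("T-002", False, {"gene:BRAF"}),
    ("T-003", True, {"gene:EGFR", "ingredient:erlotinib"}),
]

def build_index(trials):
    """Map each extracted term to the set of trial ids that mention it."""
    index = defaultdict(set)
    for trial_id, recruiting, terms in trials:
        for term in terms:
            index[term].add(trial_id)
        if recruiting:
            index["status:recruiting"].add(trial_id)
    return index

def select(index, *filters):
    """Return the trial ids matching every filter (set intersection)."""
    if not filters:
        return set()
    result = set(index.get(filters[0], set()))
    for term in filters[1:]:
        result &= index.get(term, set())
    return result
```

With `index = build_index(TRIALS)`, the call `select(index, "gene:BRAF", "status:recruiting")` returns only the recruiting BRAF trial. Each filter is a lookup followed by a set intersection, so no trial text is scanned at query time.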
  • 15. Drug Response Analysis and Prediction: Data Sources
      ■ Collect Patient Data
      ■ Sequence Tumor
      ■ Conduct Xenograft Experiments
      ■ Metadata, e.g. smoking status, tumor classification, and age
      ■ Experiment Results, e.g. medication effectiveness obtained from wet laboratory
      ■ Genome Data, e.g. raw DNA data and genetic variants
      ■ Computational Biology
      ■ Tumor Similarity and Clustering
      ■ Visual Data Exploration
      ■ Perform Manual Data Exploration and Analysis
  • 16. Drug Response Analysis and Prediction: Interactive Data Exploration
      ■ Drug response depends on individual genetic variants of tumors
      ■ Challenge: Identification of relevant genetic variants and their impact on drug response is an ongoing research activity, e.g. Xenograft models
      ■ Exploration of experiment results is time-consuming and Excel-driven
      ■ In-memory technology enables interactive exploration of experiment data to gain new scientific insights
      Interactive analysis of correlations between drugs and genetic variants
  • 17. High-Performance In-Memory Genome Project
      Diagram labels: ?; In-Memory Database; Extensions; App Store; Access Control; Billing; Statistical Tools; Data; Genome Data; Pathways; Genome Metadata; Publications; Pipeline Models; Analytical Tools; ...; Drugs and Interactions; Drug Response Analysis; Pathway Topology Analysis; Medical Knowledge Cockpit; Oncolyzer; Cohort Analysis; Clinical Trial Assessment
  • 18. High-Performance In-Memory Genome Project: Innovations start here
      ■ Strong partners combine their expertise to come up with innovative solutions for personalized medicine
      ■ Experts from interdisciplinary fields join globally distributed teams
      ■ Internationally accepted research institutes guarantee academic validity
      ■ What about you? → Get in contact with us!
      Interdisciplinary Design Thinking Teams: You?
  • 19. What to take home? Test-drive it yourself: http://we.AnalyzeGenomes.com
      For researchers
      ■ Enable real-time analysis of medical data
      ■ Automatic assessment of data, e.g. scan of pathways to identify cellular impact of mutations
      ■ Combined free-text search in publications, diagnosis, and EMR data, i.e. structured and unstructured data
      For clinicians
      ■ Preventive diagnostics to identify risk patients early
      ■ Indicate pharmacokinetic correlations
      ■ Scan for similar patient cases, e.g. to evaluate therapy success
      For patients
      ■ Identify relevant clinical trials and medical experts
      ■ Start most appropriate therapy as early as possible
  • 20. Keep in contact with us!
      Dr. Matthieu-P. Schapranow
      Hasso Plattner Institute, Enterprise Platform & Integration Concepts
      August-Bebel-Str. 88, 14482 Potsdam, Germany
      schapranow@hpi.uni-potsdam.de
      http://we.analyzegenomes.com/