SlideShare a Scribd company logo
Analyze Genomes: A Federated In-Memory Database Computing
Platform Enabling Real-time Analysis of Big Medical Data
Dr. Matthieu-P. Schapranow
SAPPHIRE, Orlando, USA
May 17, 2016
■  Online: Visit we.analyzegenomes.com for latest research
results, slides, videos, tools, and publications
■  Offline: High-Performance In-Memory Genome Data Analysis:
In-Memory Data Management Research, Springer,
ISBN: 978-3-319-03034-0, 2014
■  In Person: Join us for Intel Tech Talks at SAPPHIRE booth 625 daily!
□  May 17 12.30pm: A Federated In-Memory Database Computing Platform Enabling
Real-time Analysis of Big Medical Data
□  May 18 12.30pm: In-Memory Apps for Next Generation Life Sciences Research
□  May 19 11.30am: In-Memory Apps Supporting Precision Medicine
Where to find additional information?
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
2
Indirect Interaction
Direct Interaction
C linician PatientResearcher
Pharm aceutical
Com pany
H ealthcare
Providers
H ospital
Research
Center
Laboratory
Patient
Advocacy
G roup
Intelligent Healthcare Networks in the 21st Century?
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
3
Indirect Interaction
Direct Interaction
C linician PatientResearcher
Pharm aceutical
Com pany
H ealthcare
Providers
H ospital
Research
Center
Laboratory
Patient
Advocacy
G roup
Intelligent Healthcare Networks in the 21st Century?
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
4
Indirect Interaction
Direct Interaction
C linician PatientResearcher
Pharm aceutical
Com pany
H ealthcare
Providers
H ospital
Research
Center
Laboratory
Patient
Advocacy
G roup
Intelligent Healthcare Networks in the 21st Century!
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
5
■  Patients
□  Individual anamnesis, family history, and background
□  Require fast access to individualized therapy
■  Clinicians
□  Identify root and extent of disease using laboratory tests
□  Evaluate therapy alternatives, adapt existing therapy
■  Researchers
□  Conduct laboratory work, e.g. analyze patient samples
□  Create new research findings and come-up with treatment alternatives
The Setting
Actors in Oncology
Schapranow, SAPPHIRE,
May 17, 2016
6
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
IT Challenges
Distributed Heterogeneous Data Sources
7
Human genome/biological data
600GB per full genome
15PB+ in databases of leading institutes
Prescription data
1.5B records from 10,000 doctors and
10M Patients (100 GB)
Clinical trials
Currently more than 30k
recruiting on ClinicalTrials.gov
Human proteome
160M data points (2.4GB) per sample
>3TB raw proteome data in ProteomicsDB
PubMed database
>23M articles
Hospital information systems
Often more than 50GB
Medical sensor data
Scan of a single organ in 1s
creates 10GB of raw dataCancer patient records
>160k records at NCT A Federated In-
Memory Database
Computing Platform
for Big Medical Data
Schapranow, SAPPHIRE,
May 17, 2016
Schapranow, SAPPHIRE,
May 17, 2016
Our Approach
Analyze Genomes: Real-time Analysis of Big Medical Data
8
In-Memory Database
Extensions for Life Sciences
Data Exchange,
App Store
Access Control,
Data Protection
Fair Use
Statistical
Tools
Real-time
Analysis
App-spanning
User Profiles
Combined and Linked Data
Genome
Data
Cellular
Pathways
Genome
Metadata
Research
Publications
Pipeline and
Analysis Models
Drugs and
Interactions
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
Drug Response
Analysis
Pathway Topology
Analysis
Medical
Knowledge CockpitOncolyzer
Clinical Trial
Recruitment
Cohort
Analysis
...
Indexed
Sources
Combined column
and row store
Map/Reduce Single and
multi-tenancy
Lightweight
compression
Insert only
for time travel
Real-time
replication
Working on
integers
SQL interface on
columns and rows
Active/passive
data store
Minimal
projections
Group key Reduction of
software layers
Dynamic multi-
threading
Bulk load
of data
Object-
relational
mapping
Text retrieval
and extraction engine
No aggregate
tables
Data partitioning Any attribute
as index
No disk
On-the-fly
extensibility
Analytics on
historical data
Multi-core/
parallelization
Our Technology
In-Memory Database Technology
+
++
+
+
P
v
+++
t
SQL
x
x
T
disk
9
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
Where are all those Clouds go to?
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
10
Gartner's 2014 Hype Cycle for Emerging Technologies
■  Requirements
□  Real-time data analysis
□  Maintained software
■  Restrictions
□  Data privacy
□  Data locality
□  Volume of “big medical data”
■  Solution?
□  Federated In-Memory Database System vs. Cloud Computing
Software Requirements in Life Sciences
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
11
Approach I:
Multiple Cloud Service Providers
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
12
Local System
C loud
Synchronization
Service
R
Local Storage
Local
Synchronization
Service
R
Shared
C loud
Storage
Site A
Local System
R
Local Storage
Local
Synchronization
Service
Site B
C loud
Synchronization
Service
Shared
C loud
Storage
R
Cloud Provider
Site A
C loud Provider
Site B
Approach II:
A Single Service Provider
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
13
Cloud
Synchronization
Service
Shared
Cloud
Storage
Site A Site BCloud Provider
Cloud System
R R
Multiple Sites Forming the
Federated In-Memory Database System (FIMDB)
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
14
Federated In-M em ory D atabase System
M aster Data and
Shared Algorithm s
Site A Site BCloud Provider
Cloud IM D B
Instance
Local IM DB
Instance
Sensitive D ata,
e.g. Patient Data
R
Local IM DB
Instance
Sensitive Data,
e.g. Patient D ata
R
FIMDB: Cloud Service Provider
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
15
Site B
Federated In-M em ory
D atabase Instance,
Algorithm s, and
Applications M anaged
by Service Provider
CloudService
Provider
Site A
FIMDB
A.1
FIMDB
A.2
FIMDB
A.3
FIMDB
A.4
FIMDB
A.5
FIMDB
B.1
FIMDB
B.2
FIMDB
B.3
FIMDB
C.1
Federated In-M em ory
Database Instances
M aster Data
M anaged by
Service Provider
Sensitive D ata
reside at Site
■  Change of cloud computing paradigm:
Transfer (small) algorithms to (big) data
■  In-Memory Database (IMDB)
□  Landscape of IMDB nodes
□  Stored IMDB procedures and algorithms
□  Master data for applications
■  In-Memory File System (IMDBfs)
□  Integration of file-based tools
□  Managed services directory
□  OS binaries compiled and statically linked for
individual platforms
1.  Establish site-to-site VPN connection b/w site and cloud service
provider
2.  Mount remote services directory
3.  Install and configure local IMDB instance from services directory
4.  Subscribe to and configure selected managed services
FIMDB: Setup of a New Client
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
16
■  Data partitioning protects sensitive data by
storing it on local hardware resources only
■  Supports parallel query execution, i.e. reduced
processing time
■  Efficient use of existing hardware resources
FIMDB: Incorporating Local Compute Resources
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
17
■  Brings algorithms to data
■  Forms a single database across individual sites and locations
■  Master data managed by service provider whilst sensitive data resides locally
What to Take Home?
Test it Yourself: AnalyzeGenomes.com
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
18
Pros Cons
Single database license Complex operation
Easy to consume services Time-consuming infrastructure setup
Query propagation by IMDB
Only a single source of truth
■  Online: Visit we.analyzegenomes.com for latest research
results, slides, videos, tools, and publications
■  Offline: High-Performance In-Memory Genome Data Analysis:
In-Memory Data Management Research, Springer,
ISBN: 978-3-319-03034-0, 2014
■  In Person: Join us for Intel Tech Talks at SAPPHIRE booth 625 daily!
□  May 17 12.30pm: A Federated In-Memory Database Computing Platform Enabling
Real-time Analysis of Big Medical Data
□  May 18 12.30pm: In-Memory Apps for Next Generation Life Sciences Research
□  May 19 11.30am: In-Memory Apps Supporting Precision Medicine
Where to find additional information?
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
19
Keep in contact with us!
Dr. Matthieu-P. Schapranow
Program Manager E-Health & Life Sciences
Hasso Plattner Institute
August-Bebel-Str. 88
14482 Potsdam, Germany
schapranow@hpi.de
http://we.analyzegenomes.com/
Schapranow, SAPPHIRE,
May 17, 2016
A Federated In-
Memory Database
Computing Platform
for Big Medical Data
20

More Related Content

What's hot

"When time matters..."
"When time matters...""When time matters..."
"When time matters..."
Matthieu Schapranow
 
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Matthieu Schapranow
 
In-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineIn-Memory Apps for Precision Medicine
In-Memory Apps for Precision Medicine
Matthieu Schapranow
 
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthAnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
Matthieu Schapranow
 
In-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineIn-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems Medicine
Matthieu Schapranow
 
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticePatient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Matthieu Schapranow
 
Analyze Genomes: A Federated In-Memory Database System For Life Sciences
Analyze Genomes: A Federated In-Memory Database System For Life SciencesAnalyze Genomes: A Federated In-Memory Database System For Life Sciences
Analyze Genomes: A Federated In-Memory Database System For Life Sciences
Matthieu Schapranow
 
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
Matthieu Schapranow
 
A Platform for Integrated Genome Data Analysis
A Platform for Integrated Genome Data AnalysisA Platform for Integrated Genome Data Analysis
A Platform for Integrated Genome Data Analysis
Matthieu Schapranow
 
A Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesA Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life Sciences
Matthieu Schapranow
 
Big Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and ChallengesBig Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and Challenges
Matthieu Schapranow
 
How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?
Matthieu Schapranow
 
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
Matthieu Schapranow
 
BioNRW: Big Medical Data: Challenge or Potential
BioNRW: Big Medical Data: Challenge or PotentialBioNRW: Big Medical Data: Challenge or Potential
BioNRW: Big Medical Data: Challenge or Potential
Matthieu Schapranow
 
Big Data in Life Sciences
Big Data in Life SciencesBig Data in Life Sciences
Big Data in Life Sciences
Matthieu Schapranow
 
Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?
Matthieu Schapranow
 
AI in Oncology
AI in OncologyAI in Oncology
AI in Oncology
Matthieu Schapranow
 
Analyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisAnalyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response Analysis
Matthieu Schapranow
 
Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?
Matthieu Schapranow
 
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Matthieu Schapranow
 

What's hot (20)

"When time matters..."
"When time matters...""When time matters..."
"When time matters..."
 
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
 
In-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineIn-Memory Apps for Precision Medicine
In-Memory Apps for Precision Medicine
 
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthAnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
 
In-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineIn-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems Medicine
 
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticePatient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
 
Analyze Genomes: A Federated In-Memory Database System For Life Sciences
Analyze Genomes: A Federated In-Memory Database System For Life SciencesAnalyze Genomes: A Federated In-Memory Database System For Life Sciences
Analyze Genomes: A Federated In-Memory Database System For Life Sciences
 
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
 
A Platform for Integrated Genome Data Analysis
A Platform for Integrated Genome Data AnalysisA Platform for Integrated Genome Data Analysis
A Platform for Integrated Genome Data Analysis
 
A Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesA Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life Sciences
 
Big Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and ChallengesBig Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and Challenges
 
How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?
 
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
 
BioNRW: Big Medical Data: Challenge or Potential
BioNRW: Big Medical Data: Challenge or PotentialBioNRW: Big Medical Data: Challenge or Potential
BioNRW: Big Medical Data: Challenge or Potential
 
Big Data in Life Sciences
Big Data in Life SciencesBig Data in Life Sciences
Big Data in Life Sciences
 
Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?
 
AI in Oncology
AI in OncologyAI in Oncology
AI in Oncology
 
Analyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisAnalyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response Analysis
 
Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?
 
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
 

Viewers also liked

Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?
Matthieu Schapranow
 
UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...
UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...
UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...
CTSI at UCSF
 
Modern technology in medical science
Modern technology in medical scienceModern technology in medical science
Modern technology in medical science
yashmurthy
 
The Latest Advancements in Medical Technologies
The Latest Advancements in Medical TechnologiesThe Latest Advancements in Medical Technologies
The Latest Advancements in Medical Technologies
IETSwindon
 
019 traditional medicine
019 traditional medicine019 traditional medicine
019 traditional medicine
Mohammed Muneer
 
Advancement in medical technology
Advancement in medical technologyAdvancement in medical technology
Advancement in medical technology
Cokhy Fasha
 
Traditional medicine of india
Traditional medicine of indiaTraditional medicine of india
Traditional medicine of india
Anis Farah Syafiqah Ab Razak
 
10 Tech Trends in Healthcare
10 Tech Trends in Healthcare10 Tech Trends in Healthcare
10 Tech Trends in Healthcare
Vala Afshar
 

Viewers also liked (8)

Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?
 
UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...
UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...
UCSF Informatics Day 2014 - Wylie Burke, "Bioethical Issues in Genomics and E...
 
Modern technology in medical science
Modern technology in medical scienceModern technology in medical science
Modern technology in medical science
 
The Latest Advancements in Medical Technologies
The Latest Advancements in Medical TechnologiesThe Latest Advancements in Medical Technologies
The Latest Advancements in Medical Technologies
 
019 traditional medicine
019 traditional medicine019 traditional medicine
019 traditional medicine
 
Advancement in medical technology
Advancement in medical technologyAdvancement in medical technology
Advancement in medical technology
 
Traditional medicine of india
Traditional medicine of indiaTraditional medicine of india
Traditional medicine of india
 
10 Tech Trends in Healthcare
10 Tech Trends in Healthcare10 Tech Trends in Healthcare
10 Tech Trends in Healthcare
 

Similar to Analyze Genomes: A Federated In-memory Database Computing Platform enabling real-time Analysis of Big Medical Data

NewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big DataNewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big Data
Annie Pettit, Research Methodologist
 
Processing of Big Medical Data in Personalized Medicine: Challenge or Potential
Processing of Big Medical Data in Personalized Medicine: Challenge or PotentialProcessing of Big Medical Data in Personalized Medicine: Challenge or Potential
Processing of Big Medical Data in Personalized Medicine: Challenge or Potential
Matthieu Schapranow
 
Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI
Matthieu Schapranow
 
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Matthieu Schapranow
 
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Matthieu Schapranow
 
Gaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical DataGaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical Data
Matthieu Schapranow
 
Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data
SAP Technology
 
Biothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web servicesBiothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web services
Chunlei Wu
 
SAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data AnalysisSAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data Analysis
SAP Technology
 
In-memory Applications for Oncology
In-memory Applications for OncologyIn-memory Applications for Oncology
In-memory Applications for Oncology
Matthieu Schapranow
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
Bonnie Hurwitz
 
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
Dr. Haxel Consult
 
Open PHACTS MIOSS may 2016
Open PHACTS MIOSS may 2016Open PHACTS MIOSS may 2016
Open PHACTS MIOSS may 2016
open_phacts
 
Top 10 data science technologies
Top 10 data science technologiesTop 10 data science technologies
Top 10 data science technologies
Brainware University
 
Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021
Sanjay Padhi, Ph.D
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
DATA360US
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision Medicine
Matthieu Schapranow
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision Medicine
Matthieu Schapranow
 
MongoDB in the Healthcare Enterprise
MongoDB in the Healthcare EnterpriseMongoDB in the Healthcare Enterprise
MongoDB in the Healthcare Enterprise
MongoDB
 
The State of the Data Warehouse in 2017 and Beyond
The State of the Data Warehouse in 2017 and BeyondThe State of the Data Warehouse in 2017 and Beyond
The State of the Data Warehouse in 2017 and Beyond
SingleStore
 

Similar to Analyze Genomes: A Federated In-memory Database Computing Platform enabling real-time Analysis of Big Medical Data (20)

NewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big DataNewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big Data
 
Processing of Big Medical Data in Personalized Medicine: Challenge or Potential
Processing of Big Medical Data in Personalized Medicine: Challenge or PotentialProcessing of Big Medical Data in Personalized Medicine: Challenge or Potential
Processing of Big Medical Data in Personalized Medicine: Challenge or Potential
 
Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI
 
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
 
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
 
Gaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical DataGaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical Data
 
Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data
 
Biothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web servicesBiothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web services
 
SAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data AnalysisSAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data Analysis
 
In-memory Applications for Oncology
In-memory Applications for OncologyIn-memory Applications for Oncology
In-memory Applications for Oncology
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
 
Open PHACTS MIOSS may 2016
Open PHACTS MIOSS may 2016Open PHACTS MIOSS may 2016
Open PHACTS MIOSS may 2016
 
Top 10 data science technologies
Top 10 data science technologiesTop 10 data science technologies
Top 10 data science technologies
 
Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision Medicine
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision Medicine
 
MongoDB in the Healthcare Enterprise
MongoDB in the Healthcare EnterpriseMongoDB in the Healthcare Enterprise
MongoDB in the Healthcare Enterprise
 
The State of the Data Warehouse in 2017 and Beyond
The State of the Data Warehouse in 2017 and BeyondThe State of the Data Warehouse in 2017 and Beyond
The State of the Data Warehouse in 2017 and Beyond
 

Recently uploaded

Mental Health and Physical Wellbeing.pdf
Mental Health and Physical Wellbeing.pdfMental Health and Physical Wellbeing.pdf
Mental Health and Physical Wellbeing.pdf
shindesupriya013
 
Fit to Fly PCR Covid Testing at our Clinic Near You
Fit to Fly PCR Covid Testing at our Clinic Near YouFit to Fly PCR Covid Testing at our Clinic Near You
Fit to Fly PCR Covid Testing at our Clinic Near You
NX Healthcare
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
40fortunate
 
FACIAL NERVE
FACIAL NERVEFACIAL NERVE
FACIAL NERVE
aditigupta1117
 
HUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COM
HUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COMHUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COM
HUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COM
priyabhojwani1200
 
Innovative Minds France's Most Impactful Healthcare Leaders.pdf
Innovative Minds France's Most Impactful Healthcare Leaders.pdfInnovative Minds France's Most Impactful Healthcare Leaders.pdf
Innovative Minds France's Most Impactful Healthcare Leaders.pdf
eurohealthleaders
 
NURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPT
NURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPTNURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPT
NURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPT
blessyjannu21
 
National Rural Health Mission(NRHM).pptx
National Rural Health Mission(NRHM).pptxNational Rural Health Mission(NRHM).pptx
National Rural Health Mission(NRHM).pptx
Jyoti Chand
 
R3 Stem Cell Therapy: A New Hope for Women with Ovarian Failure
R3 Stem Cell Therapy: A New Hope for Women with Ovarian FailureR3 Stem Cell Therapy: A New Hope for Women with Ovarian Failure
R3 Stem Cell Therapy: A New Hope for Women with Ovarian Failure
R3 Stem Cell
 
Data-Driven Dispensing- Rise of AI in Pharmacies.pdf
Data-Driven Dispensing- Rise of AI in Pharmacies.pdfData-Driven Dispensing- Rise of AI in Pharmacies.pdf
Data-Driven Dispensing- Rise of AI in Pharmacies.pdf
Jasper Colin
 
Friendly Massage in Ajman - Malayali Kerala Spa Ajman
Friendly Massage in Ajman - Malayali Kerala Spa AjmanFriendly Massage in Ajman - Malayali Kerala Spa Ajman
Friendly Massage in Ajman - Malayali Kerala Spa Ajman
Malayali Kerala Spa Ajman
 
PrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and EngagementPrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and Engagement
PrudentRx Program
 
CHAPTER 1 SEMESTER V COMMUNICATION TECHNIQUES FOR CHILDREN.pdf
CHAPTER 1 SEMESTER V  COMMUNICATION TECHNIQUES FOR CHILDREN.pdfCHAPTER 1 SEMESTER V  COMMUNICATION TECHNIQUES FOR CHILDREN.pdf
CHAPTER 1 SEMESTER V COMMUNICATION TECHNIQUES FOR CHILDREN.pdf
Sachin Sharma
 
nurs fpx 4050 assessment 4 final care coordination plan.pdf
nurs fpx 4050 assessment 4 final care coordination plan.pdfnurs fpx 4050 assessment 4 final care coordination plan.pdf
nurs fpx 4050 assessment 4 final care coordination plan.pdf
Carolyn Harker
 
Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...
Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...
Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...
DrDevTaneja1
 
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSONNEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
SHAMIN EABENSON
 
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DR Jag Mohan Prajapati
 
English Drug and Alcohol Commissioners June 2024.pptx
English Drug and Alcohol Commissioners June 2024.pptxEnglish Drug and Alcohol Commissioners June 2024.pptx
English Drug and Alcohol Commissioners June 2024.pptx
MatSouthwell1
 
VEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdf
VEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdfVEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdf
VEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdf
Vedanta A
 
leprosy Case detection and diagnosis.pptx
leprosy Case detection and diagnosis.pptxleprosy Case detection and diagnosis.pptx
leprosy Case detection and diagnosis.pptx
habtegirma
 

Recently uploaded (20)

Mental Health and Physical Wellbeing.pdf
Mental Health and Physical Wellbeing.pdfMental Health and Physical Wellbeing.pdf
Mental Health and Physical Wellbeing.pdf
 
Fit to Fly PCR Covid Testing at our Clinic Near You
Fit to Fly PCR Covid Testing at our Clinic Near YouFit to Fly PCR Covid Testing at our Clinic Near You
Fit to Fly PCR Covid Testing at our Clinic Near You
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
 
FACIAL NERVE
FACIAL NERVEFACIAL NERVE
FACIAL NERVE
 
HUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COM
HUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COMHUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COM
HUMAN BRAIN.pptx.PRIYA BHOJWANI@GAMIL.COM
 
Innovative Minds France's Most Impactful Healthcare Leaders.pdf
Innovative Minds France's Most Impactful Healthcare Leaders.pdfInnovative Minds France's Most Impactful Healthcare Leaders.pdf
Innovative Minds France's Most Impactful Healthcare Leaders.pdf
 
NURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPT
NURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPTNURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPT
NURSING MANAGEMENT OF PATIENT WITH EMPHYSEMA .PPT
 
National Rural Health Mission(NRHM).pptx
National Rural Health Mission(NRHM).pptxNational Rural Health Mission(NRHM).pptx
National Rural Health Mission(NRHM).pptx
 
R3 Stem Cell Therapy: A New Hope for Women with Ovarian Failure
R3 Stem Cell Therapy: A New Hope for Women with Ovarian FailureR3 Stem Cell Therapy: A New Hope for Women with Ovarian Failure
R3 Stem Cell Therapy: A New Hope for Women with Ovarian Failure
 
Data-Driven Dispensing- Rise of AI in Pharmacies.pdf
Data-Driven Dispensing- Rise of AI in Pharmacies.pdfData-Driven Dispensing- Rise of AI in Pharmacies.pdf
Data-Driven Dispensing- Rise of AI in Pharmacies.pdf
 
Friendly Massage in Ajman - Malayali Kerala Spa Ajman
Friendly Massage in Ajman - Malayali Kerala Spa AjmanFriendly Massage in Ajman - Malayali Kerala Spa Ajman
Friendly Massage in Ajman - Malayali Kerala Spa Ajman
 
PrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and EngagementPrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and Engagement
 
CHAPTER 1 SEMESTER V COMMUNICATION TECHNIQUES FOR CHILDREN.pdf
CHAPTER 1 SEMESTER V  COMMUNICATION TECHNIQUES FOR CHILDREN.pdfCHAPTER 1 SEMESTER V  COMMUNICATION TECHNIQUES FOR CHILDREN.pdf
CHAPTER 1 SEMESTER V COMMUNICATION TECHNIQUES FOR CHILDREN.pdf
 
nurs fpx 4050 assessment 4 final care coordination plan.pdf
nurs fpx 4050 assessment 4 final care coordination plan.pdfnurs fpx 4050 assessment 4 final care coordination plan.pdf
nurs fpx 4050 assessment 4 final care coordination plan.pdf
 
Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...
Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...
Digital Health in India_Health Informatics Trained Manpower _DrDevTaneja_15.0...
 
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSONNEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
 
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
 
English Drug and Alcohol Commissioners June 2024.pptx
English Drug and Alcohol Commissioners June 2024.pptxEnglish Drug and Alcohol Commissioners June 2024.pptx
English Drug and Alcohol Commissioners June 2024.pptx
 
VEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdf
VEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdfVEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdf
VEDANTA AIR AMBULANCE SERVICES IN REWA AT A COST-EFFECTIVE PRICE.pdf
 
leprosy Case detection and diagnosis.pptx
leprosy Case detection and diagnosis.pptxleprosy Case detection and diagnosis.pptx
leprosy Case detection and diagnosis.pptx
 

Analyze Genomes: A Federated In-memory Database Computing Platform enabling real-time Analysis of Big Medical Data

  • 1. Analyze Genomes: A Federated In-Memory Database Computing Platform Enabling Real-time Analysis of Big Medical Data Dr. Matthieu-P. Schapranow SAPPHIRE, Orlando, USA May 17, 2016
  • 2. ■  Online: Visit we.analyzegenomes.com for latest research results, slides, videos, tools, and publications ■  Offline: High-Performance In-Memory Genome Data Analysis: In-Memory Data Management Research, Springer, ISBN: 978-3-319-03034-0, 2014 ■  In Person: Join us for Intel Tech Talks at SAPPHIRE booth 625 daily! □  May 17 12.30pm: A Federated In-Memory Database Computing Platform Enabling Real-time Analysis of Big Medical Data □  May 18 12.30pm: In-Memory Apps for Next Generation Life Sciences Research □  May 19 11.30am: In-Memory Apps Supporting Precision Medicine Where to find additional information? Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 2
  • 3. Indirect Interaction Direct Interaction C linician PatientResearcher Pharm aceutical Com pany H ealthcare Providers H ospital Research Center Laboratory Patient Advocacy G roup Intelligent Healthcare Networks in the 21st Century? Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 3
  • 4. Indirect Interaction Direct Interaction C linician PatientResearcher Pharm aceutical Com pany H ealthcare Providers H ospital Research Center Laboratory Patient Advocacy G roup Intelligent Healthcare Networks in the 21st Century? Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 4
  • 5. Indirect Interaction Direct Interaction C linician PatientResearcher Pharm aceutical Com pany H ealthcare Providers H ospital Research Center Laboratory Patient Advocacy G roup Intelligent Healthcare Networks in the 21st Century! Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 5
  • 6. ■  Patients □  Individual anamnesis, family history, and background □  Require fast access to individualized therapy ■  Clinicians □  Identify root and extent of disease using laboratory tests □  Evaluate therapy alternatives, adapt existing therapy ■  Researchers □  Conduct laboratory work, e.g. analyze patient samples □  Create new research findings and come-up with treatment alternatives The Setting Actors in Oncology Schapranow, SAPPHIRE, May 17, 2016 6 A Federated In- Memory Database Computing Platform for Big Medical Data
  • 7. IT Challenges Distributed Heterogeneous Data Sources 7 Human genome/biological data 600GB per full genome 15PB+ in databases of leading institutes Prescription data 1.5B records from 10,000 doctors and 10M Patients (100 GB) Clinical trials Currently more than 30k recruiting on ClinicalTrials.gov Human proteome 160M data points (2.4GB) per sample >3TB raw proteome data in ProteomicsDB PubMed database >23M articles Hospital information systems Often more than 50GB Medical sensor data Scan of a single organ in 1s creates 10GB of raw dataCancer patient records >160k records at NCT A Federated In- Memory Database Computing Platform for Big Medical Data Schapranow, SAPPHIRE, May 17, 2016
  • 8. Schapranow, SAPPHIRE, May 17, 2016 Our Approach Analyze Genomes: Real-time Analysis of Big Medical Data 8 In-Memory Database Extensions for Life Sciences Data Exchange, App Store Access Control, Data Protection Fair Use Statistical Tools Real-time Analysis App-spanning User Profiles Combined and Linked Data Genome Data Cellular Pathways Genome Metadata Research Publications Pipeline and Analysis Models Drugs and Interactions A Federated In- Memory Database Computing Platform for Big Medical Data Drug Response Analysis Pathway Topology Analysis Medical Knowledge CockpitOncolyzer Clinical Trial Recruitment Cohort Analysis ... Indexed Sources
  • 9. Combined column and row store Map/Reduce Single and multi-tenancy Lightweight compression Insert only for time travel Real-time replication Working on integers SQL interface on columns and rows Active/passive data store Minimal projections Group key Reduction of software layers Dynamic multi- threading Bulk load of data Object- relational mapping Text retrieval and extraction engine No aggregate tables Data partitioning Any attribute as index No disk On-the-fly extensibility Analytics on historical data Multi-core/ parallelization Our Technology In-Memory Database Technology + ++ + + P v +++ t SQL x x T disk 9 Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data
  • 10. Where are all those Clouds go to? Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 10 Gartner's 2014 Hype Cycle for Emerging Technologies
  • 11. ■  Requirements □  Real-time data analysis □  Maintained software ■  Restrictions □  Data privacy □  Data locality □  Volume of “big medical data” ■  Solution? □  Federated In-Memory Database System vs. Cloud Computing Software Requirements in Life Sciences Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 11
  • 12. Approach I: Multiple Cloud Service Providers Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 12 Local System C loud Synchronization Service R Local Storage Local Synchronization Service R Shared C loud Storage Site A Local System R Local Storage Local Synchronization Service Site B C loud Synchronization Service Shared C loud Storage R Cloud Provider Site A C loud Provider Site B
  • 13. Approach II: A Single Service Provider Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 13 Cloud Synchronization Service Shared Cloud Storage Site A Site BCloud Provider Cloud System R R
  • 14. Multiple Sites Forming the Federated In-Memory Database System (FIMDB) Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 14 Federated In-M em ory D atabase System M aster Data and Shared Algorithm s Site A Site BCloud Provider Cloud IM D B Instance Local IM DB Instance Sensitive D ata, e.g. Patient Data R Local IM DB Instance Sensitive Data, e.g. Patient D ata R
  • 15. FIMDB: Cloud Service Provider Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 15 Site B Federated In-M em ory D atabase Instance, Algorithm s, and Applications M anaged by Service Provider CloudService Provider Site A FIMDB A.1 FIMDB A.2 FIMDB A.3 FIMDB A.4 FIMDB A.5 FIMDB B.1 FIMDB B.2 FIMDB B.3 FIMDB C.1 Federated In-M em ory Database Instances M aster Data M anaged by Service Provider Sensitive D ata reside at Site ■  Change of cloud computing paradigm: Transfer (small) algorithms to (big) data ■  In-Memory Database (IMDB) □  Landscape of IMDB nodes □  Stored IMDB procedures and algorithms □  Master data for applications ■  In-Memory File System (IMDBfs) □  Integration of file-based tools □  Managed services directory □  OS binaries compiled and statically linked for individual platforms
  • 16. 1.  Establish site-to-site VPN connection b/w site and cloud service provider 2.  Mount remote services directory 3.  Install and configure local IMDB instance from services directory 4.  Subscribe to and configure selected managed services FIMDB: Setup of a New Client Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 16
  • 17. ■  Data partitioning protects sensitive data by storing it on local hardware resources only ■  Supports parallel query execution, i.e. reduced processing time ■  Efficient use of existing hardware resources FIMDB: Incorporating Local Compute Resources Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 17
  • 18. ■  Brings algorithms to data ■  Forms a single database across individual sites and locations ■  Master data managed by service provider whilst sensitive data resides locally What to Take Home? Test it Yourself: AnalyzeGenomes.com Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 18 Pros Cons Single database license Complex operation Easy to consume services Time-consuming infrastructure setup Query propagation by IMDB Only a single source of truth
  • 19. ■  Online: Visit we.analyzegenomes.com for latest research results, slides, videos, tools, and publications ■  Offline: High-Performance In-Memory Genome Data Analysis: In-Memory Data Management Research, Springer, ISBN: 978-3-319-03034-0, 2014 ■  In Person: Join us for Intel Tech Talks at SAPPHIRE booth 625 daily! □  May 17 12.30pm: A Federated In-Memory Database Computing Platform Enabling Real-time Analysis of Big Medical Data □  May 18 12.30pm: In-Memory Apps for Next Generation Life Sciences Research □  May 19 11.30am: In-Memory Apps Supporting Precision Medicine Where to find additional information? Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 19
  • 20. Keep in contact with us! Dr. Matthieu-P. Schapranow Program Manager E-Health & Life Sciences Hasso Plattner Institute August-Bebel-Str. 88 14482 Potsdam, Germany schapranow@hpi.de http://we.analyzegenomes.com/ Schapranow, SAPPHIRE, May 17, 2016 A Federated In- Memory Database Computing Platform for Big Medical Data 20