© 2017 MapR Technologies
Big Data in Healthcare
Carol McDonald
@caroljmcdonald
The Motivation for Big Data: Poor ROI
•  USA spends a lot more per capita
•  US health system ranks last among eleven countries (OECD)
–  healthy lives, access, quality, efficiency
Who Knew Healthcare could be so complicated?
Value Based Care & Value Based Reimbursement
Incentives for Technology:
•  Improve coordination and outcomes
•  Shift from fee-for-service to value-based, data-driven incentives
The Data
Where is the Big Data Opportunity?
According to the McKinsey Global Institute, the big data opportunity lies in:
•  Claims
–  utilization of care
•  Pharmaceutical
–  clinical trials
•  Clinical Data
–  Electronic Medical Records
•  Patient Behavior and Population Health
Figure: data sources (lab, EMR / EHR, doctor’s notes, claims, images, HL7, social media)
Building a Healthcare Data Lake on MapR
Data lake sources:
•  Claims: historical procedures, co-morbidities (prof & inst.)
•  Clinical: lab results, vital signs, ...
•  Pharmacy: prescriptions, adherence
•  EMR: Electronic Medical Records, images & text
•  Logs and notes: Dr. notes, customer call logs, emails
•  3rd party: licensing, death master, …
•  Additional data: CB Header data, social, ...
Big Data Use Cases
Patient Data Management
Figure: Patient 360 view
•  Sources: lab, EMR / EHR, doctor’s notes, claims, images, HL7, social media
•  MapR Converged Data Platform: analyzed unstructured data
•  Users: analysts, providers
Reducing Fraud, Waste, and Abuse with Big Data Analytics
•  Healthcare fraud: more than $60 billion per year
•  UnitedHealthcare:
–  2200% ROI using MapR for fraud
•  Medicare/Medicaid prevented more than $210.7 million in fraud in one year
Figure: EDI claim → machine learning model → fraud score
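The claim → model → fraud score flow on this slide can be sketched as below. This is only an illustration under stated assumptions, not the UnitedHealthcare or MapR model: the feature names, weights, bias, and review threshold are all hypothetical, and a real system would learn its weights from labelled historical claims.

```python
import math

# Hypothetical weights for a logistic model; a real model would be trained
# on labelled historical claims rather than hand-written like this.
WEIGHTS = {"amount_zscore": 1.2, "claims_last_30d": 0.15, "out_of_network": 0.8}
BIAS = -3.0


def fraud_score(claim_features):
    """Return a score in (0, 1); higher means more suspicious."""
    z = BIAS + sum(WEIGHTS[name] * claim_features.get(name, 0.0) for name in WEIGHTS)
    return 1.0 / (1.0 + math.exp(-z))


def flag_for_review(claim_features, threshold=0.5):
    """Route a claim to manual review when its score reaches the threshold."""
    return fraud_score(claim_features) >= threshold


# Two made-up claims: one routine, one with unusual amount and frequency.
routine = {"amount_zscore": 0.1, "claims_last_30d": 2, "out_of_network": 0}
unusual = {"amount_zscore": 3.0, "claims_last_30d": 10, "out_of_network": 1}
```

Calling `flag_for_review(unusual)` returns `True` while `flag_for_review(routine)` returns `False`, so only the unusual claim would be sent for review.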
Predictive Analytics to Improve Outcomes
• Early Diagnosis of sepsis, CHF
• Predicting risk of readmission
• Matching treatments
Early Detection of Congestive Heart Failure
Sun, Jimeng, Large-scale Patient Similarity Learning for health analytics, Georgia Tech
Predictive Analytics/ Machine Learning
•  Aetna Labs predict future risk of metabolic syndrome
–  https://www.healthcare-informatics.com/article/how-aetna-using-big-data-give-patients-personalized-care
•  Optum Labs data from 150 million patient records gives insight about what works best
–  http://www.modernhealthcare.com/article/20150926/MAGAZINE/309269979
Real Time Monitoring and Alerts
Figure: medical devices → streams → dashboards, global analytics & alerting
Why combine IoT with Machine Learning?
•  Cheaper sensors and machine learning are making it possible for doctors to rapidly apply smart medicine to their patients’ cases
–  https://www.wsj.com/articles/the-smart-medicine-solution-to-the-health-care-crisis-1499443449
Why combine IoT with Machine Learning?
•  A Stanford team has shown that a machine-learning model can identify arrhythmias from an EKG better than an expert
–  https://www.technologyreview.com/s/608234/the-machines-are-getting-ready-to-play-doctor/
Applying Machine Learning to Live Patient Data
–  https://www.healthitoutcomes.com/doc/applying-machine-learning-to-live-data-0001
Real Time Monitoring Potential
•  CDC: chronic diseases, such as heart disease, are the major causes of sickness and health care costs in the nation
•  McKinsey: better management of congestive heart failure could reduce treatment costs by a billion dollars annually
Why combine IoT with Machine Learning?
•  Connected care ensuring quicker sepsis treatment:
–  Blood pressure, pulse rates and oxygen levels from monitoring devices combined with machine learning to provide alerts
–  http://www.computerweekly.com/news/450422258/Putting-sepsis-algorithms-into-electronic-patient-records
Solution Architecture
What Do We Need to Do?
Data Sources (images, ...) → Collect Data → Process Data → Store Data → Serve Data
Collect the Data with NFS mounted on MapR-XD
•  Data Ingest:
–  File based: NFS with MapR-FS
•  Move hot data to $$ storage
•  Move cold data to cheaper MapR-XD
Figure labels: Collect Data; data sources (images, RDBMS, data warehouse); NFS; $$$ storage; MapR-FS (unlimited inexpensive storage)
Collect the Events with MapR Streams
Figure: producers → Kafka API → MapR Streams (on MapR-FS) → Kafka API → consumers
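The producer and consumer roles on this slide can be sketched with a small in-memory stand-in. It is not the Kafka API or MapR Streams: the `Topic` and `Consumer` classes, the CRC32 key hashing, and the patient keys are hypothetical and only illustrate that messages are routed to partitions by key and that each consumer keeps its own read position.

```python
from collections import defaultdict
import zlib


class Topic:
    """In-memory stand-in for a partitioned topic (not the real Kafka API)."""

    def __init__(self, num_partitions=3):
        self.partitions = [[] for _ in range(num_partitions)]

    def send(self, key, value):
        # Messages with the same key always land in the same partition,
        # which preserves per-key ordering.
        index = zlib.crc32(key.encode("utf-8")) % len(self.partitions)
        self.partitions[index].append((key, value))
        return index


class Consumer:
    """Tracks its own read offset per partition, independent of other consumers."""

    def __init__(self, topic):
        self.topic = topic
        self.offsets = defaultdict(int)

    def poll(self):
        records = []
        for index, partition in enumerate(self.topic.partitions):
            start = self.offsets[index]
            records.extend(partition[start:])
            self.offsets[index] = len(partition)
        return records


# Producers publish device readings keyed by a made-up patient id.
topic = Topic()
for reading in (72, 75, 71):
    topic.send("patient-17", reading)
topic.send("patient-42", 88)

# Two independent consumers read the same topic at their own pace.
dashboard = Consumer(topic)
alerting = Consumer(topic)
first_poll = dashboard.poll()
```

Because each consumer owns its offsets, `alerting.poll()` still returns all four records after `dashboard` has already read them.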
Batch processing
Figure: Collect Data → MapR-FS → Process Data
•  Spark: parallel processing, high throughput, fast
•  Hive, Pig, MapReduce: slower, but can be simpler for batch file processing
Apache Spark Distributed Datasets
Figure: a distributed dataset partitioned across executors on three nodes (P1 and P3, P2, P4)
Partition 1: 8213034705, 95, 2.927373, jake7870, 0…
Partition 2: 8213034705, 115, 2.943484, Davidbresler2, 1…
Partition 3: 8213034705, 100, 2.951285, gladimacowgirl, 58…
Partition 4: 8213034705, 117, 2.998947, daysrus, 95…
•  Data read into Memory Cache
•  Partitioned across a cluster
•  Operated on in parallel
•  Cached in memory for iterations
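The partition-then-process-in-parallel idea can be sketched in plain Python. This is not the Spark API: the thread pool stands in for executors, the `partition` and `max_bid` helpers are hypothetical, and treating the second field of each sample row as a bid amount is an assumption, since the slide does not label the columns.

```python
from concurrent.futures import ThreadPoolExecutor

# The four sample rows from the slide, truncated to the visible fields.
# Column meanings are assumptions; the slide does not label them.
rows = [
    ("8213034705", 95.0, 2.927373, "jake7870"),
    ("8213034705", 115.0, 2.943484, "Davidbresler2"),
    ("8213034705", 100.0, 2.951285, "gladimacowgirl"),
    ("8213034705", 117.0, 2.998947, "daysrus"),
]


def partition(data, num_partitions):
    """Split data round-robin into num_partitions lists."""
    return [data[i::num_partitions] for i in range(num_partitions)]


def max_bid(part):
    """Per-partition work: the highest value of the second field in this partition."""
    return max((row[1] for row in part), default=float("-inf"))


partitions = partition(rows, 4)

# Each partition is processed independently, then partial results are combined.
with ThreadPoolExecutor(max_workers=4) as pool:
    partial_results = list(pool.map(max_bid, partitions))

highest_bid = max(partial_results)
```

The same shape (split, map per partition, combine) is what lets the work spread across a cluster; keeping `partitions` in memory between passes corresponds to the caching bullet.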
Streaming Data
Process Data: stream processing
•  Scalable, high-throughput stream processing of live data
Figure: raw → enriched → alerts
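The raw → enriched → alerts stages can be sketched as chained Python generators, so each event flows through the pipeline as it arrives. Everything specific here is hypothetical: the field names, the patient lookup table, and the pulse limit of 120, which is an illustrative number and not a clinical threshold.

```python
def enrich(raw_events, patients):
    """Attach patient context to each raw device reading."""
    for event in raw_events:
        patient = patients.get(event["patient_id"], {})
        yield {**event, "ward": patient.get("ward", "unknown")}


def alerts(enriched_events, max_pulse=120):
    """Emit an alert for each reading above the (illustrative) pulse limit."""
    for event in enriched_events:
        if event["pulse"] > max_pulse:
            yield f"ALERT ward={event['ward']} patient={event['patient_id']} pulse={event['pulse']}"


# Hypothetical reference data and device readings.
patients = {"p1": {"ward": "ICU"}, "p2": {"ward": "cardiology"}}
raw = [
    {"patient_id": "p1", "pulse": 88},
    {"patient_id": "p2", "pulse": 131},
    {"patient_id": "p3", "pulse": 125},
]

alert_list = list(alerts(enrich(iter(raw), patients)))
```

In a real deployment each stage would read from and write to its own stream topic; the generators only show the order of the steps.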
Streaming Analytics
Store the Data with MapR-DB
Figure: three tables of rows (Key, colB, colC), one per key range
Fast reads and writes by key! Data is automatically partitioned by key range!
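Key-range partitioning can be sketched as follows. This is a toy model, not the MapR-DB API: the `KeyRangeTable` class, the fixed split points, and the row keys are hypothetical, and the automatic splitting the slide mentions is not modelled.

```python
import bisect


class KeyRangeTable:
    """Routes each row to a partition by comparing its key with sorted split points."""

    def __init__(self, split_points):
        # Split points ["h", "p"] give three ranges:
        # keys < "h", keys from "h" up to (not including) "p", keys >= "p".
        self.split_points = sorted(split_points)
        self.partitions = [{} for _ in range(len(self.split_points) + 1)]

    def _partition_for(self, key):
        return bisect.bisect_right(self.split_points, key)

    def put(self, key, row):
        self.partitions[self._partition_for(key)][key] = row

    def get(self, key):
        # Only the one partition that owns the key range is consulted.
        return self.partitions[self._partition_for(key)].get(key)


table = KeyRangeTable(split_points=["h", "p"])
table.put("adams_1975", {"colB": "lab", "colC": "normal"})
table.put("jones_1982", {"colB": "claim", "colC": "paid"})
table.put("smith_1990", {"colB": "rx", "colC": "filled"})
```

A read such as `table.get("jones_1982")` touches a single partition, which is why lookups by key stay fast as the table grows.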
Store Lots of Data with NoSQL MapR-DB
Storage model:
•  RDBMS: normalized schema → joins for queries can cause a bottleneck
•  MapR-DB: de-normalized schema → data that is read together is stored together
Figure: three tables of rows (Key, colB, colC)
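The difference between the two storage models can be sketched with plain dictionaries and lists. The patient and lab data are made up, and the function names are hypothetical; the point is only that the normalized layout needs a join at read time while the de-normalized layout answers with one lookup by key.

```python
# Normalized: two "tables" that must be joined at query time.
patients = {"p1": {"name": "A. Example"}}
lab_results = [
    {"patient_id": "p1", "test": "glucose", "value": 5.4},
    {"patient_id": "p1", "test": "sodium", "value": 140},
    {"patient_id": "p2", "test": "glucose", "value": 6.1},
]


def labs_normalized(patient_id):
    """Join by scanning the lab table for rows that match the patient."""
    patient = patients[patient_id]
    rows = [r for r in lab_results if r["patient_id"] == patient_id]
    return {"name": patient["name"], "labs": [(r["test"], r["value"]) for r in rows]}


# De-normalized: one row per patient key, with the labs nested inside it.
patient_docs = {
    "p1": {"name": "A. Example", "labs": [("glucose", 5.4), ("sodium", 140)]},
}


def labs_denormalized(patient_id):
    """A single lookup by key returns everything that is read together."""
    return patient_docs[patient_id]
```

Both functions return the same record for `"p1"`; the de-normalized version trades some duplication on write for a cheaper read.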
What is Drill?
•  SQL engine on “everything”
•  Files: JSON, CSV, Parquet
•  Structured formats, e.g. Parquet
•  Ecosystem components: HBase, MapR-DB, Hive
•  Schema optional
•  Interactive response times
Apache Drill Architecture
•  massively parallel processing execution engine
•  distributed query processing
What Do We Need to Do?
Data Sources → Collect Data → Process Data → Store Data → Serve Data
Figure labels: MapR-FS, stream topic
Customer Data Lakes
MapR Healthcare Customers
•  Delivers clinical intelligence to healthcare providers
•  Sepsis control based on real-time patient data
•  Genomic data platform
•  Research grant analysis
•  80+ use cases; FWA, …
•  Genomics analysis
•  Radiology analytics
•  Customized solutions for value-based care
Customers named on the slide: MRI manufacturer, Novartis
MapR Healthcare Architecture
Data Lake Architectures
•  Agile, self-service data exploration
•  ETL into operational reporting formats (e.g., Parquet)
•  Multi-tenancy: job/data placement control, volumes
•  Access controls: file, table, column, column family, doc, sub-doc levels
•  Sources: labs, claims, pharmacy, EHR
•  Auditing: compliance, analyze user accesses
•  Snapshots: track data lineage and history
•  Table replication: global multi-master, business continuity
MapR Converged Data Platform: enterprise storage (MapR-FS), database (MapR-DB), event streaming (MapR Streams)
•  MapR-DB: time series, structured data, JSON
•  MapR-XD: unstructured data, NFS / raw files
•  MapR Event Streams: real-time event data
Valence Health
Population health SaaS for 85,000 doctors and 135 hospitals
•  3,000 inbound data feeds
–  Labs, EHR, claims…
Business Problem:
•  ETL for 20 million lab records took 22 hours to process.
Solution with MapR:
•  With NFS, 20 million lab records now take 20 minutes, with less hardware
•  https://www.cioreview.com/news/valence-health-cuts-down-processing-time-and-drives-customer-satisfaction-with-mapr-nid-11084-cid-15.html
UnitedHealthcare Optum
MapR data lake: a single platform to analyze claims, prescriptions, ...
•  NFS to ingest 1 million claims, 10 terabytes per day
•  2200% ROI from machine learning for payment integrity
•  Machine learning for improving outcomes: diabetes, reducing readmissions…
Baptist Health South Florida
Problem:
•  Oracle too expensive for big data
•  Need a common data platform for patient history
Solution:
1.  MapR data lake
2.  Offload cold data from Oracle ($$) to MapR over NFS
3.  Integration with EMR
4.  Admission/readmission prediction
5.  Early sepsis detection/notification
6.  Real-time monitoring
Use Case: Streaming System of Record for Healthcare
Objective:
•  Build a flexible, secure healthcare information exchange
Challenges:
•  Many different data models
•  Security and privacy issues
•  HIPAA compliance
Solution: Streaming System of Record for Healthcare
•  Solution:
–  Streaming system of record
•  Secure
•  Immutable
•  Rewindable
•  Auditable
•  Materialized views continuously computed
•  Selective cross-data-center replication
Figure: Streaming System of Record. Labels: applications, API, microservices, stream topic with records 1 to 6, materialized views (search, graph DB, JSON, HBase)
Streaming System of Record for Healthcare
Case Study: Liaison Technologies
Figure labels: raw data, API, apps, pre-processor, document log (MapR-FS), CEP, workflows, microservices, and materialized views in key/value MapR-DB, a search engine, a graph DB, and a time series DB
MapR-ES as Immutable Log
MapR Event Streams (MapR-ES)
•  Immutable log for all data ingested or consumed.
•  Events become the system of record, processed by consumers based on their permissions.
MapR-ES powers compliance-ready lineage:
•  Immutability. MapR-ES throws no data away.
•  Auditing. Who wrote/read events?
•  Rewind. What was the status of data two days ago?
•  Replay. Rebuild derivative data stores.
Auditors want to see:
•  Data lineage. Where data came from, how it got there.
•  Audit logging. Who wrote to, updated, or read the data.
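The rewind and replay properties can be sketched with an append-only list. This is an illustration only, not the MapR-ES API: the `EventLog` class, the `materialize` helper, the integer timestamps, and the patient keys are all hypothetical, and events are assumed to be appended in timestamp order.

```python
class EventLog:
    """Append-only log: events are never updated or deleted."""

    def __init__(self):
        self._events = []

    def append(self, timestamp, key, value):
        # Assumes callers append in non-decreasing timestamp order.
        self._events.append((timestamp, key, value))

    def replay(self, until=None):
        """Yield events in order, optionally stopping at a past timestamp (rewind)."""
        for timestamp, key, value in self._events:
            if until is not None and timestamp > until:
                break
            yield timestamp, key, value


def materialize(log, until=None):
    """Rebuild a derived key/value view by replaying the log from the start."""
    view = {}
    for _, key, value in log.replay(until):
        view[key] = value
    return view


log = EventLog()
log.append(1, "patient-17/status", "admitted")
log.append(2, "patient-42/status", "admitted")
log.append(3, "patient-17/status", "discharged")

current_view = materialize(log)        # replay everything
view_at_t2 = materialize(log, until=2)  # rewind: state as of timestamp 2
```

Because the log itself is never modified, a derived store can be dropped and rebuilt at any time, and the state at an earlier point can be answered by replaying only up to that point.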
Q&A
@mapr
https://www.mapr.com/blog/author/carol-mcdonald
Engage with us!
mapr-technologies

More Related Content

What's hot

Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Carol McDonald
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
 
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Carol McDonald
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
Carol McDonald
 
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Carol McDonald
 
Apache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision TreesApache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision Trees
Carol McDonald
 
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Carol McDonald
 
Streaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka APIStreaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka API
Carol McDonald
 
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DBAnalyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Carol McDonald
 
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
MapR Technologies
 
When Streaming Becomes Strategic
When Streaming Becomes StrategicWhen Streaming Becomes Strategic
When Streaming Becomes Strategic
MapR Technologies
 
NoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBNoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DB
MapR Technologies
 
Deep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningDeep Learning vs. Cheap Learning
Deep Learning vs. Cheap Learning
MapR Technologies
 
MapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data Platform
MapR Technologies
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
Ian Downard
 
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
MapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
MapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
MapR Technologies
 
Introduction to machine learning with GPUs
Introduction to machine learning with GPUsIntroduction to machine learning with GPUs
Introduction to machine learning with GPUs
Carol McDonald
 

What's hot (20)

Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
 
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
 
Apache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision TreesApache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision Trees
 
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
 
Streaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka APIStreaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka API
 
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DBAnalyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
 
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
 
When Streaming Becomes Strategic
When Streaming Becomes StrategicWhen Streaming Becomes Strategic
When Streaming Becomes Strategic
 
NoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBNoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DB
 
Deep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningDeep Learning vs. Cheap Learning
Deep Learning vs. Cheap Learning
 
MapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data Platform
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
 
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Introduction to machine learning with GPUs
Introduction to machine learning with GPUsIntroduction to machine learning with GPUs
Introduction to machine learning with GPUs
 

Similar to How Big Data is Reducing Costs and Improving Outcomes in Health Care

Genome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data StyleGenome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data Style
Julius Remigio, CBIP
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
DataWorks Summit
 
Hitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-researchHitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-researchHitachi Vantara
 
Baptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big DataBaptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big Data
MapR Technologies
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
MapR Technologies
 
Data analytics - May 2016
Data analytics - May 2016Data analytics - May 2016
Data analytics - May 2016
Mark Yunger
 
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareDennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
Innovation Agency
 
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Patrick Van Renterghem
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentation
elasticdave
 
20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design PatternsAllen Day, PhD
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
ExtraHop Networks
 
Scientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitScientific Application Development and Early results on Summit
Scientific Application Development and Early results on Summit
Ganesan Narayanasamy
 
AI Super computer update
AI Super computer update AI Super computer update
AI Super computer update
Ganesan Narayanasamy
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
VMware Tanzu
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
eRic Choo
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
DATA360US
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
Geoffrey Fox
 
Chip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochureChip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochure
Marco van der Hart
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
Laura Berry
 
Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from Pivotal
VMware Tanzu Korea
 

Similar to How Big Data is Reducing Costs and Improving Outcomes in Health Care (20)

Genome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data StyleGenome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data Style
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
 
Hitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-researchHitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-research
 
Baptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big DataBaptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big Data
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
Data analytics - May 2016
Data analytics - May 2016Data analytics - May 2016
Data analytics - May 2016
 
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareDennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
 
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentation
 
20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
 
Scientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitScientific Application Development and Early results on Summit
Scientific Application Development and Early results on Summit
 
AI Super computer update
AI Super computer update AI Super computer update
AI Super computer update
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Chip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochureChip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochure
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
 
Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from Pivotal
 

How Big Data is Reducing Costs and Improving Outcomes in Health Care

  • 1. Big Data in Healthcare. Carol McDonald, @caroljmcdonald. © 2017 MapR Technologies
  • 2. The Motivation for Big Data: Poor ROI • USA spends a lot more per capita • US health system ranks last among eleven countries (OECD) – healthy lives, access, quality, efficiency
  • 3. Who Knew Healthcare Could Be So Complicated?
  • 4. Value Based Care & Value Based Reimbursement. Incentives for Technology: • Improve coordination and outcomes • Shifting from fee-for-service • to value-based, data-driven incentives
  • 5. The Data
  • 6. Where is the Big Data Opportunity? McKinsey Global Institute
  • 7. Where is the Big Data Opportunity? According to McKinsey Global Institute, the big data opportunity: • Claims – utilization of care • Pharmaceutical – clinical trials • Clinical Data – Electronic Medical Records • Patient Behavior and Population Health. Data sources shown: lab, EMR / EHR, doctor's notes, claims, images, HL7, social media
  • 8. Building a Healthcare Data Lake on MapR. Data Lake inputs: • Claims: historical procedures, co-morbidities (prof & inst.) • Clinical: lab results, vital signs, ... • Pharmacy: prescriptions, adherence • EMR: Electronic Medical Records, images & text • Logs and Notes: Dr. notes, customer call logs, emails • 3rd Party: licensing, death master, … • Additional Data: CB Header data, Social, ...
  • 9. Big Data Use Cases
  • 10. Patient Data Management. Diagram labels: data sources (Lab, EMR / EHR, Doctor's notes, Claims, Images, HL7, Social Media); MapR Converged Data Platform; Analyzed Unstructured Data; Patient 360 View; Analysts; Providers
  • 11. Reducing Fraud, Waste and Abuse with Big Data Analytics • Healthcare fraud: >$60 billion per year • UnitedHealthcare: – 2200% ROI using MapR for fraud • Medicare/Medicaid prevented >$210.7 million in fraud in 1 year. Diagram labels: Machine Learning Model, EDI Claim, Fraud Score
  • 12. Predictive Analytics to Improve Outcomes • Early diagnosis of sepsis, CHF • Predicting risk of readmission • Matching treatments. Figure: Early Detection of Congestive Heart Failure (Sun, Jimeng, Large-scale Patient Similarity Learning for health analytics, Georgia Tech)
  • 13. Predictive Analytics / Machine Learning • Aetna Labs predict future risk of metabolic syndrome – https://www.healthcare-informatics.com/article/how-aetna-using-big-data-give-patients-personalized-care • Optum Labs data from 150 million patient records gives insight about what works best – http://www.modernhealthcare.com/article/20150926/MAGAZINE/309269979
  • 14. Real Time Monitoring and Alerts. Diagram labels: Medical Devices; Stream (three streams); Dashboards; Global Analytics & Alerting
  • 15. Why combine IoT with Machine Learning? • Cheaper sensors and machine learning are making it possible for doctors to rapidly apply smart medicine to their patients' cases – https://www.wsj.com/articles/the-smart-medicine-solution-to-the-health-care-crisis-1499443449
  • 16. Why combine IoT with Machine Learning? • A Stanford team has shown that a machine-learning model can identify arrhythmias from an EKG better than an expert – https://www.technologyreview.com/s/608234/the-machines-are-getting-ready-to-play-doctor/
  • 17. Applying Machine Learning to Live Patient Data – https://www.healthitoutcomes.com/doc/applying-machine-learning-to-live-data-0001
  • 18. Real Time Monitoring Potential • CDC: chronic diseases, such as heart disease, are the major causes of sickness and health care costs in the nation • McKinsey: better management of congestive heart failure could reduce treatment costs by a billion dollars annually
  • 19. Why combine IoT with Machine Learning? • Connected care ensuring quicker sepsis treatment: – Blood pressure, pulse rates and oxygen levels from monitoring devices combined with machine learning to provide alerts – http://www.computerweekly.com/news/450422258/Putting-sepsis-algorithms-into-electronic-patient-records
  • 20. Solution Architecture
  • 21. What Do We Need to Do? Pipeline stages shown: Data Sources (images), Collect Data, Process Data, Store Data, Serve Data, with the technology for each stage marked "?"
  • 22. Collect the Data with NFS mounted on MapR-XD • Data Ingest: – File based: NFS with MapR-FS • Move hot data to $$ storage • Move cold data to cheaper MapR-XD. Diagram labels: Data Sources (images); RDBMS; Data Warehouse; $$$ Storage; NFS; Collect Data; MapR-FS; Unlimited Inexpensive Storage
  • 23. Collect the Events with MapR Streams. Diagram labels: Producers; Kafka API; MapR-FS; Kafka API; Consumers
  • 24. Batch processing (Collect Data, MapR-FS, Process Data) • Spark: parallel processing, high throughput, fast • Hive, Pig, MapReduce: slower but can be simpler for batch file processing
  • 25. Apache Spark Distributed Datasets • Data read into memory cache • Partitioned across a cluster • Operated on in parallel • Cached in memory for iterations. Diagram: a distributed dataset partitioned across executors on three nodes (P4; P1 and P3; P2). Sample rows: Partition 1: 8213034705, 95, 2.927373, jake7870, 0…… Partition 2: 8213034705, 115, 2.943484, Davidbresler2, 1…. Partition 3: 8213034705, 100, 2.951285, gladimacowgirl, 58… Partition 4: 8213034705, 117, 2.998947, daysrus, 95….
  • 26. Stream processing (Streaming Data, Process Data) • Scalable, high-throughput stream processing of live data. Stream labels: raw, enriched, alerts
  • 27. Streaming Analytics
  • 28. Store the Data with MapR-DB • Fast reads and writes by key! • Data is automatically partitioned by key range! Diagram: three key ranges, each holding a table with columns Key, colB, colC
  • 29. Store Lots of Data with NoSQL MapR-DB. Storage model: • RDBMS: normalized schema → joins for queries can cause a bottleneck • MapR-DB: de-normalized schema → data that is read together is stored together. Diagram: tables with columns Key, colB, colC
  • 30. What is Drill? • SQL engine on "everything" • Files: JSON, CSV, Parquet • Structured formats, e.g. Parquet • Ecosystem components: HBase, MapR-DB, Hive • Schema optional • Interactive response times
  • 31. Apache Drill Architecture • Massively parallel processing execution engine • Distributed query processing
  • 32. What Do We Need to Do? Pipeline stages shown: Data Sources, Collect Data, Process Data, Store Data, Serve Data. Diagram labels: MapR-FS; Stream Topic
  • 33. Customer Data Lakes
  • 34. MapR Healthcare Customers. Use cases shown: • Delivers clinical intelligence to healthcare providers • Sepsis control based on real time patient data • Genomic data platform • Research grant analysis • 80+ use cases; FWA, … • Genomics analysis • Radiology analytics • Customized solutions for value-based care • MRI manufacturer • Novartis
  • 35. MapR Healthcare Architecture
  • 36. Data Lake Architectures. Sources: Labs, Claims, Pharmacy, EHR. MapR Converged Data Platform (Enterprise Storage: MapR-FS; Database: MapR-DB; Event Streaming: MapR Streams): • Agile, self-service data exploration • ETL into operational reporting formats (e.g., Parquet) • Multi-tenancy: job/data placement control, volumes • Access controls: file, table, column, column family, doc, sub-doc levels • Auditing: compliance, analyze user accesses • Snapshots: track data lineage and history • Table Replication: global multi-master, business continuity • MapR-DB: time series, structured data, JSON • MapR-XD: unstructured data, NFS / raw files • MapR Event Streams: real-time event data
  • 37. Valence Health. Population Health SaaS for 85,000 doctors, 135 hospitals • 3,000 inbound data feeds – Labs, EHR, claims… Business Problem: • ETL for 20 million lab records took 22 hours to process. Solution with MapR: • With NFS, 20 million lab records now take 20 minutes with less hardware • https://www.cioreview.com/news/valence-health-cuts-down-processing-time-and-drives-customer-satisfaction-with-mapr-nid-11084-cid-15.html
  • 38. UnitedHealthcare Optum. MapR Data Lake: single platform to analyze claims, prescriptions.. • NFS to ingest 1 million claims, 10 terabytes per day • 2200% ROI: machine learning for Payment Integrity • Machine learning for improving outcomes: diabetes, reduce readmissions…
  • 39. Baptist Health South Florida. Problem: • Oracle too expensive for big data • Need a common data platform for patient history. Solution: 1. MapR data lake 2. Offload cold data from Oracle $$ NFS to MapR 3. Integration with EMR 4. Admission/readmission prediction 5. Early sepsis detection/notification 6. Real time monitoring
  • 40. Use Case: Streaming System of Record for Healthcare • Objective: – Build a flexible, secure healthcare information exchange. Challenges: • Many different data models • Security and privacy issues • HIPAA compliance
  • 41. Solution: Streaming System of Record for Healthcare • Streaming system of record: secure, immutable, rewindable, auditable • Materialized views continuously computed • Selective cross data center replication. Diagram labels: Stream Topic records (6 5 4 3 2 1); Applications; API; Micro Services; Materialized Views (Search, Graph DB, JSON, HBase)
  • 42. Streaming System of Record for Healthcare. Case Study: Liaison Technologies. MapR-ES as Immutable Log. MapR Event Streams (MapR-ES): • Immutable log for all data ingested or consumed. • Events become system of record, processed by consumers based on their permissions. MapR-ES powers compliance-ready lineage: • Immutability. MapR-ES throws no data away. • Auditing. Who wrote/read events? • Rewind. What was the status of data two days ago? • Replay. Rebuild derivative data stores. Auditors want to see: • Data lineage. Where data came from, how it got there. • Audit logging. Who wrote to, updated, or read the data. Diagram labels: Raw Data; App; API; pre-processor; log; Document Log (MapR-FS); CEP; micro services; workflows feeding materialized views (Key/Value MapR-DB, Search Engine, Graph DB, Time Series DB)
  • 43. Q&A. Engage with us! @mapr, mapr-technologies, https://www.mapr.com/blog/author/carol-mcdonald
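
The streaming system of record on slides 41 and 42 rests on three ideas: an append-only log, rewind to a past point in time, and replay to rebuild derived stores. The following is a minimal in-memory sketch of those ideas in plain Python, not MapR-ES or the Kafka API. The names `Event`, `ImmutableLog`, `latest_by_key`, and the `patient-1` / `heart_rate` sample data are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Event:
    """One record in the log (hypothetical shape, for illustration only)."""
    offset: int       # position in the log, assigned on append
    timestamp: float  # caller-supplied event time
    key: str          # e.g. a patient identifier
    value: dict       # event payload


class ImmutableLog:
    """Append-only in-memory log: events are never updated or deleted."""

    def __init__(self) -> None:
        self._events: List[Event] = []

    def append(self, timestamp: float, key: str, value: dict) -> Event:
        # Copy the payload so later changes by the caller do not alter the log.
        event = Event(len(self._events), timestamp, key, dict(value))
        self._events.append(event)
        return event

    def read(self, from_offset: int = 0,
             until_timestamp: Optional[float] = None) -> List[Event]:
        """Replay from an offset; until_timestamp gives a rewound view."""
        return [
            e for e in self._events[from_offset:]
            if until_timestamp is None or e.timestamp <= until_timestamp
        ]


def latest_by_key(events: List[Event]) -> Dict[str, dict]:
    """Materialized view: the last value seen per key, rebuilt by replay."""
    view: Dict[str, dict] = {}
    for e in events:
        view[e.key] = e.value
    return view


# Usage: append events, then build a current view and a rewound view.
log = ImmutableLog()
log.append(1.0, "patient-1", {"heart_rate": 72})
log.append(2.0, "patient-2", {"heart_rate": 88})
log.append(3.0, "patient-1", {"heart_rate": 110})

current = latest_by_key(log.read())                     # replay everything
as_of_2 = latest_by_key(log.read(until_timestamp=2.0))  # rewind to time 2.0
```

Because the view is derived only from the log, it can be dropped and rebuilt at any time, which is the property the slides rely on for lineage and auditing. A production system would add durable storage, access control, and per-consumer offsets, none of which are modeled here.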