SlideShare a Scribd company logo
1 of 27
Download to read offline
#datapopupseattle
Using Spark in Healthcare
Predictive Analytics in the OR
Denny Lee
Data Scientist and Evangelist, databricks.com
databricks
#datapopupseattle
UNSTRUCTURED
Data Science POP-UP in Seattle
www.dominodatalab.com
D
Produced by Domino Data Lab
Domino’s enterprise data science platform is used
by leading analytical organizations to increase
productivity, enable collaboration, and publish
models into production faster.
Using Spark in Healthcare
Predictive Analytics in the OR
Denny Lee
June 15th, 2015
Collaboration
• Working with Ayad Shammout
• Director of Business Intelligence, Beth Israel Deaconess
Medical Center
• Helped build highly available / disaster recovery
infrastructure for BIDMC
2
About Me
• Technology Evangelist, Databricks
• Former Sr. Director of Data Sciences Eng, Concur
• Helped bring Hadoop onto Windows and Azure
3
About Databricks
• Founded by Apache Spark Creators
• Largest contributor to Spark project, committed to
keeping Spark 100% open source
• Databricks is an end-to-end hosted platform
4
5
$15-$20 / minute for a
basic surgical procedure
Time is an OR's most valuable resource
Lack of OR availability
means loss of patient
OR efficiency differs depending on the

OR staffing and allocation (8, 10, 13, or 16h),
not the workload (i.e. cases)
6
“You are not going to get the elephant to shrink or change its size. You
need to face the fact that the elephant is 8 OR tall and 11hr wide”
Steven Shafer, MD
7
Operating Room
Better utilization =
Better profit margins
Reduce support and
maintenance costs
Medical Staff
Better utilization =
Better profit margins
Better medical staff
efficiencies = Better
outcomes
Patients
Shorter wait times
and less cancellations
Better medical staff
efficiencies = Better
outcomes
Develop Predictive Model
• Develop a predictive model that would identify
available OR time two weeks in advance.
• Allow us to confirm wait list cases two weeks in advance,
instead of when the blocks normally release four days
out.
8
Forecast OR Schedule
• Case load three weeks in advance
• Book more cases weeks in advance to prevent under-
utilization
• Reduce staff overtime and idle time
9
Background
• Three surgical pools
• GYN, urology, general surgery, colorectal, surgical
oncology
• Eyes, plastics, ENT
• Orthopedics, podiatry
• Currently built using SQL Server Data Mining
10
11
12
13
demoOR Block Scheduling
Extract History data and run linear regression with SGD
with multiple variables
X
X
X
X
X
X
X
14
OR Schedule Report (example)
15
Why the model is working
• Can coordinate waitlist scheduling logistics with physicians and
patients within two weeks of the surgery
• Plan staff scheduling and resources so there are less last-minute
staffing issues for nursing and anesthesia
• Utilization metrics are showing us where we can maximize our
elective surgical schedule and level demand
#datapopupseattle
@datapopup
#datapopupseattle
#datapopupseattle
Thank You To Our Sponsors
Thank you.
Formoreinformation,pleasecontactdenny@databricks.com

More Related Content

What's hot

Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)Ashleigh Kades
 
Opencity Inc Hacking Health Waterloo cafe April 2016
Opencity Inc Hacking Health Waterloo cafe April 2016Opencity Inc Hacking Health Waterloo cafe April 2016
Opencity Inc Hacking Health Waterloo cafe April 2016Opencity Inc
 
Critical Success Factors for A Data Analytics Initiative
Critical Success Factors for A Data Analytics InitiativeCritical Success Factors for A Data Analytics Initiative
Critical Success Factors for A Data Analytics InitiativeSasken Technologies Ltd.
 
Transforming Business with Intelligent Data
Transforming Business with Intelligent DataTransforming Business with Intelligent Data
Transforming Business with Intelligent Dataashbhatia
 
Prof. Neil O'Hare, Director of Informatics, St. James's Hospital
Prof. Neil O'Hare, Director of Informatics, St. James's HospitalProf. Neil O'Hare, Director of Informatics, St. James's Hospital
Prof. Neil O'Hare, Director of Informatics, St. James's HospitalIMSTA
 
Technology Careers at Elsevier
Technology Careers at ElsevierTechnology Careers at Elsevier
Technology Careers at Elseviervyshnavin
 
Day-3, Mr. Satyajit Dwivedi analytics for decision making
Day-3, Mr. Satyajit Dwivedi analytics for decision makingDay-3, Mr. Satyajit Dwivedi analytics for decision making
Day-3, Mr. Satyajit Dwivedi analytics for decision makingIPPAI
 
MedStack: 500 Demo Day Batch 21
MedStack: 500 Demo Day Batch 21MedStack: 500 Demo Day Batch 21
MedStack: 500 Demo Day Batch 21500 Startups
 
Panel on the future of Electronic Health Records
Panel on the future of Electronic Health RecordsPanel on the future of Electronic Health Records
Panel on the future of Electronic Health RecordsKarim Keshavjee
 
Philips john huffman
Philips john huffmanPhilips john huffman
Philips john huffmanBigDataExpo
 
500’s Demo Day Batch 14 >> ZendyHealth
500’s Demo Day Batch 14 >> ZendyHealth500’s Demo Day Batch 14 >> ZendyHealth
500’s Demo Day Batch 14 >> ZendyHealth500 Startups
 
Let’s make a difference
Let’s make a differenceLet’s make a difference
Let’s make a differenceIswardy Morni
 
Project Management in the Healthcare Research Domain
Project Management in the Healthcare Research DomainProject Management in the Healthcare Research Domain
Project Management in the Healthcare Research Domainsriharshagunnam
 
David Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scanDavid Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scanDavid Cocker
 
HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...
HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...
HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...HARMONY Alliance
 
Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...
Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...
Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...Databricks
 
Learning to Rank with Apache Spark
Learning to Rank with Apache SparkLearning to Rank with Apache Spark
Learning to Rank with Apache SparkAnna Bladzich
 
LucidAct - Batch 25 Demo Day
LucidAct - Batch 25 Demo DayLucidAct - Batch 25 Demo Day
LucidAct - Batch 25 Demo Day500 Startups
 

What's hot (20)

Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
 
Opencity Inc Hacking Health Waterloo cafe April 2016
Opencity Inc Hacking Health Waterloo cafe April 2016Opencity Inc Hacking Health Waterloo cafe April 2016
Opencity Inc Hacking Health Waterloo cafe April 2016
 
Critical Success Factors for A Data Analytics Initiative
Critical Success Factors for A Data Analytics InitiativeCritical Success Factors for A Data Analytics Initiative
Critical Success Factors for A Data Analytics Initiative
 
Spark & IoT
Spark & IoTSpark & IoT
Spark & IoT
 
Transforming Business with Intelligent Data
Transforming Business with Intelligent DataTransforming Business with Intelligent Data
Transforming Business with Intelligent Data
 
Prof. Neil O'Hare, Director of Informatics, St. James's Hospital
Prof. Neil O'Hare, Director of Informatics, St. James's HospitalProf. Neil O'Hare, Director of Informatics, St. James's Hospital
Prof. Neil O'Hare, Director of Informatics, St. James's Hospital
 
Technology Careers at Elsevier
Technology Careers at ElsevierTechnology Careers at Elsevier
Technology Careers at Elsevier
 
Day-3, Mr. Satyajit Dwivedi analytics for decision making
Day-3, Mr. Satyajit Dwivedi analytics for decision makingDay-3, Mr. Satyajit Dwivedi analytics for decision making
Day-3, Mr. Satyajit Dwivedi analytics for decision making
 
MedStack: 500 Demo Day Batch 21
MedStack: 500 Demo Day Batch 21MedStack: 500 Demo Day Batch 21
MedStack: 500 Demo Day Batch 21
 
Panel on the future of Electronic Health Records
Panel on the future of Electronic Health RecordsPanel on the future of Electronic Health Records
Panel on the future of Electronic Health Records
 
Philips john huffman
Philips john huffmanPhilips john huffman
Philips john huffman
 
500’s Demo Day Batch 14 >> ZendyHealth
500’s Demo Day Batch 14 >> ZendyHealth500’s Demo Day Batch 14 >> ZendyHealth
500’s Demo Day Batch 14 >> ZendyHealth
 
Let’s make a difference
Let’s make a differenceLet’s make a difference
Let’s make a difference
 
Project Management in the Healthcare Research Domain
Project Management in the Healthcare Research DomainProject Management in the Healthcare Research Domain
Project Management in the Healthcare Research Domain
 
David Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scanDavid Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scan
 
HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...
HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...
HARMONY Alliance speaker during EAPM2017: Egge van der Poel about big data in...
 
Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...
Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...
Learning to Rank with Apache Spark: A Case Study in Production Machine Learni...
 
Learning to Rank with Apache Spark
Learning to Rank with Apache SparkLearning to Rank with Apache Spark
Learning to Rank with Apache Spark
 
LucidAct - Batch 25 Demo Day
LucidAct - Batch 25 Demo DayLucidAct - Batch 25 Demo Day
LucidAct - Batch 25 Demo Day
 
Ipsen Big Data
Ipsen  Big DataIpsen  Big Data
Ipsen Big Data
 

Similar to Using Spark in Healthcare Predictive Analytics in the OR - Data Science Pop-up Seattle

Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)U.S. News Healthcare of Tomorrow
 
10 Things to Consider When Building a CTMS Business Case
10 Things to Consider When Building a CTMS Business Case10 Things to Consider When Building a CTMS Business Case
10 Things to Consider When Building a CTMS Business CasePerficient, Inc.
 
Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...
Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...
Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...Databricks
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementNick Lynch
 
The Evolution of Drug Development and Market Access via Connected Data-Driven...
The Evolution of Drug Development and Market Access via Connected Data-Driven...The Evolution of Drug Development and Market Access via Connected Data-Driven...
The Evolution of Drug Development and Market Access via Connected Data-Driven...PAREXEL International
 
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big DataBruce Kozuma
 
Health Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis GroupHealth Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis GroupJames Karis
 
Children's Mercy Patient Progression Hub - HIT December 2023
Children's Mercy Patient Progression Hub - HIT December 2023Children's Mercy Patient Progression Hub - HIT December 2023
Children's Mercy Patient Progression Hub - HIT December 2023KC Digital Drive
 
Kim bonett cv_2016
Kim bonett cv_2016Kim bonett cv_2016
Kim bonett cv_2016koko bonet
 
Delivering improved patient outcomes through advanced analytics 6.26.18
Delivering improved patient outcomes through advanced analytics 6.26.18Delivering improved patient outcomes through advanced analytics 6.26.18
Delivering improved patient outcomes through advanced analytics 6.26.18Cloudera, Inc.
 
Top 10 Reasons Why You Need a CTMS
Top 10 Reasons Why You Need a CTMSTop 10 Reasons Why You Need a CTMS
Top 10 Reasons Why You Need a CTMSPerficient
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchDataWorks Summit/Hadoop Summit
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchDataWorks Summit/Hadoop Summit
 
Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...
Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...
Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...Justin Campbell
 
AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...
AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...
AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...CTSI at UCSF
 
Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...
Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...
Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...Oksana Gologorskaya
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Denodo
 
Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...
Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...
Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...Data IQ Argentina
 

Similar to Using Spark in Healthcare Predictive Analytics in the OR - Data Science Pop-up Seattle (20)

Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
Hitting the Sweet Spot with Predictive Analytics (Michael Draugelis)
 
10 Things to Consider When Building a CTMS Business Case
10 Things to Consider When Building a CTMS Business Case10 Things to Consider When Building a CTMS Business Case
10 Things to Consider When Building a CTMS Business Case
 
Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...
Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...
Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict...
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
 
The Evolution of Drug Development and Market Access via Connected Data-Driven...
The Evolution of Drug Development and Market Access via Connected Data-Driven...The Evolution of Drug Development and Market Access via Connected Data-Driven...
The Evolution of Drug Development and Market Access via Connected Data-Driven...
 
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
 
Health Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis GroupHealth Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis Group
 
Children's Mercy Patient Progression Hub - HIT December 2023
Children's Mercy Patient Progression Hub - HIT December 2023Children's Mercy Patient Progression Hub - HIT December 2023
Children's Mercy Patient Progression Hub - HIT December 2023
 
Kim bonett cv_2016
Kim bonett cv_2016Kim bonett cv_2016
Kim bonett cv_2016
 
Delivering improved patient outcomes through advanced analytics 6.26.18
Delivering improved patient outcomes through advanced analytics 6.26.18Delivering improved patient outcomes through advanced analytics 6.26.18
Delivering improved patient outcomes through advanced analytics 6.26.18
 
Trial io pcori doc v1
Trial io pcori doc v1Trial io pcori doc v1
Trial io pcori doc v1
 
Attune Lab Information System
Attune Lab Information SystemAttune Lab Information System
Attune Lab Information System
 
Top 10 Reasons Why You Need a CTMS
Top 10 Reasons Why You Need a CTMSTop 10 Reasons Why You Need a CTMS
Top 10 Reasons Why You Need a CTMS
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...
Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...
Galen healthcare solutions Healthcare Information Technology 2017 Year in Rev...
 
AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...
AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...
AMIA Joint Summits 2017: Building Research Data Mart from UCSF OMOP Database ...
 
Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...
Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...
Epic EMR to OMOP CDM to Clinical Research Data Mart: an Unmaintained Road or ...
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
 
Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...
Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...
Toma de decisiones impulsada por datos en radiología: Rochester Regional Heal...
 

More from Domino Data Lab

What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...Domino Data Lab
 
The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...Domino Data Lab
 
Racial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops dataRacial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops dataDomino Data Lab
 
Data Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itData Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itDomino Data Lab
 
Supporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentationSupporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentationDomino Data Lab
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryDomino Data Lab
 
Summertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile VirusSummertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile VirusDomino Data Lab
 
GeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data ScienceGeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data ScienceDomino Data Lab
 
Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)Domino Data Lab
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at ScaleDomino Data Lab
 
How I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataHow I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataDomino Data Lab
 
Software Engineering for Data Scientists
Software Engineering for Data ScientistsSoftware Engineering for Data Scientists
Software Engineering for Data ScientistsDomino Data Lab
 
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Domino Data Lab
 
Building Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technologyBuilding Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technologyDomino Data Lab
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsDomino Data Lab
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino Data Lab
 
The Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data ScienceThe Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data ScienceDomino Data Lab
 
Fuzzy Matching to the Rescue
Fuzzy Matching to the RescueFuzzy Matching to the Rescue
Fuzzy Matching to the RescueDomino Data Lab
 
How to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical FeaturesHow to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical FeaturesDomino Data Lab
 

More from Domino Data Lab (20)

What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...What's in your workflow? Bringing data science workflows to business analysis...
What's in your workflow? Bringing data science workflows to business analysis...
 
The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...The Proliferation of New Database Technologies and Implications for Data Scie...
The Proliferation of New Database Technologies and Implications for Data Scie...
 
Racial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops dataRacial Bias in Policing: an analysis of Illinois traffic stops data
Racial Bias in Policing: an analysis of Illinois traffic stops data
 
Data Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itData Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using it
 
Supporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentationSupporting innovation in insurance with randomized experimentation
Supporting innovation in insurance with randomized experimentation
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive Industry
 
Summertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile VirusSummertime Analytics: Predicting E. coli and West Nile Virus
Summertime Analytics: Predicting E. coli and West Nile Virus
 
GeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data ScienceGeoViz: A Canvas for Data Science
GeoViz: A Canvas for Data Science
 
Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)Doing your first Kaggle (Python for Big Data sets)
Doing your first Kaggle (Python for Big Data sets)
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at Scale
 
How I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataHow I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked Data
 
Software Engineering for Data Scientists
Software Engineering for Data ScientistsSoftware Engineering for Data Scientists
Software Engineering for Data Scientists
 
Making Big Data Smart
Making Big Data SmartMaking Big Data Smart
Making Big Data Smart
 
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...
 
Building Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technologyBuilding Data Analytics pipelines in the cloud using serverless technology
Building Data Analytics pipelines in the cloud using serverless technology
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science Tools
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...
 
The Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data ScienceThe Role and Importance of Curiosity in Data Science
The Role and Importance of Curiosity in Data Science
 
Fuzzy Matching to the Rescue
Fuzzy Matching to the RescueFuzzy Matching to the Rescue
Fuzzy Matching to the Rescue
 
How to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical FeaturesHow to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical Features
 

Recently uploaded

Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...PrithaVashisht1
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationGiorgio Carbone
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxDwiAyuSitiHartinah
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)Data & Analytics Magazin
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.JasonViviers2
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityAggregage
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024Becky Burwell
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?sonikadigital1
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxVenkatasubramani13
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptaigil2
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerPavel Šabatka
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructuresonikadigital1
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Guido X Jansen
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introductionsanjaymuralee1
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best PracticesDataArchiva
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Vladislav Solodkiy
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionajayrajaganeshkayala
 

Recently uploaded (17)

Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - Presentation
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptx
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayer
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructure
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introduction
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual intervention
 

Using Spark in Healthcare Predictive Analytics in the OR - Data Science Pop-up Seattle

  • 1. #datapopupseattle Using Spark in Healthcare Predictive Analytics in the OR Denny Lee Data Scientist and Evangelist, databricks.com databricks
  • 2. #datapopupseattle UNSTRUCTURED Data Science POP-UP in Seattle www.dominodatalab.com D Produced by Domino Data Lab Domino’s enterprise data science platform is used by leading analytical organizations to increase productivity, enable collaboration, and publish models into production faster.
  • 3. Using Spark in Healthcare Predictive Analytics in the OR Denny Lee June 15th, 2015
  • 4. Collaboration • Working with Ayad Shammout • Director of Business Intelligence, Beth Israel Deaconess Medical Center • Helped build highly available / disaster recovery infrastructure for BIDMC 2
  • 5. About Me • Technology Evangelist, Databricks • Former Sr. Director of Data Sciences Eng, Concur • Helped bring Hadoop onto Windows and Azure 3
  • 6. About Databricks • Founded by Apache Spark Creators • Largest contributor to Spark project, committed to keeping Spark 100% open source • Databricks is an end-to-end hosted platform 4
  • 7. 5 $15-$20 / minute for a basic surgical procedure Time is an OR's most valuable resource Lack of OR availability means loss of patient OR efficiency differs depending on the
 OR staffing and allocation (8, 10, 13, or 16h), not the workload (i.e. cases)
  • 8. 6 “You are not going to get the elephant to shrink or change its size. You need to face the fact that the elephant is 8 OR tall and 11hr wide” Steven Shafer, MD
  • 9. 7 Operating Room Better utilization = Better profit margins Reduce support and maintenance costs Medical Staff Better utilization = Better profit margins Better medical staff efficiencies = Better outcomes Patients Shorter wait times and less cancellations Better medical staff efficiencies = Better outcomes
  • 10. Develop Predictive Model • Develop a predictive model that would identify available OR time two weeks in advance. • Allow us to confirm wait list cases two weeks in advance, instead of when the blocks normally release four days out. 8
  • 11. Forecast OR Schedule • Case load three weeks in advance • Book more cases weeks in advance to prevent under- utilization • Reduce staff overtime and idle time 9
  • 12. Background • Three surgical pools • GYN, urology, general surgery, colorectal, surgical oncology • Eyes, plastics, ENT • Orthopedics, podiatry • Currently built using SQL Server Data Mining 10
  • 13. 11
  • 14. 12
  • 15. 13 demoOR Block Scheduling Extract History data and run linear regression with SGD with multiple variables
  • 16. X
  • 17. X
  • 18. X
  • 19. X
  • 20. X
  • 21. X
  • 22. X
  • 24. 15 Why the model is working • Can coordinate waitlist scheduling logistics with physicians and patients within two weeks of the surgery • Plan staff scheduling and resources so there are less last-minute staffing issues for nursing and anesthesia • Utilization metrics are showing us where we can maximize our elective surgical schedule and level demand