This document discusses using predictive analytics in the operating rooms (ORs) at Beth Israel Deaconess Medical Center. It describes developing a predictive model that identifies available OR time two weeks in advance, so that waitlisted cases and staff can be scheduled sooner. Building the model from historical OR data using linear regression with stochastic gradient descent could also help forecast case load three weeks out, allowing improved OR utilization, reduced staff overtime and idle time, shorter patient wait times, and fewer cancellations.
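As a rough illustration of the modeling approach named above, here is a minimal sketch using MLlib's SGD-based linear regression (the RDD API of the Spark 1.x era; it was removed in Spark 3.x). The feature layout and column meanings are illustrative assumptions, not the actual BIDMC schema.

# Minimal sketch: forecast unused OR block minutes with SGD-based linear
# regression from the RDD-era MLlib API. All features below are assumptions.
from pyspark import SparkContext
from pyspark.mllib.regression import LabeledPoint, LinearRegressionWithSGD

sc = SparkContext(appName="or-block-forecast")

# Hypothetical history rows: (unused_minutes, [day_of_week, pool_id, scheduled_cases, block_hours])
history = sc.parallelize([
    (120.0, [1.0, 0.0, 4.0, 8.0]),
    (30.0,  [3.0, 1.0, 7.0, 10.0]),
    (240.0, [5.0, 2.0, 2.0, 13.0]),
])
training = history.map(lambda r: LabeledPoint(r[0], r[1]))

model = LinearRegressionWithSGD.train(training, iterations=100, step=0.01)
print(model.predict([2.0, 1.0, 5.0, 10.0]))  # predicted unused minutes for a future block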
Using Spark in Healthcare Predictive Analytics in the OR - Data Science Pop-u... (Domino Data Lab)
The prevailing issue with Operating Room (OR) scheduling in a hospital setting is that it is difficult to schedule and predict available OR block times. This leads to empty, unused operating rooms and longer waits for patients needing procedures. Using multivariate linear regression, we will show how hospitals can predict available OR block times with Spark MLlib, resulting in better OR utilization and shorter wait times for patients. Presented by Denny Lee, Data Scientist and Evangelist at Databricks.
Regulating the Workload of Your Clinical Research Coordinator (CRC) (TrialJoin)
A CRC (clinical research coordinator) is one of the most important people in a clinical trial. He or she is in charge of conducting the trial under the guidance of the PI (principal investigator). As the person responsible for coordinating all on-site activities, a CRC can sometimes carry a huge workload. This can quickly become a big problem, not only for the CRC but for everyone else involved in the study.
Determining the right workload for your CRC is one of the most important actions that you (as a site owner or PI) should take. Ideally, a CRC should work 4 to 6 effective hours per day. However, there will be periods when the CRC has to work more than 8 hours and periods when he or she will be free most of the day. In this article, we'll help you find the right balance so that your CRC isn't overworked or underworked.
Building a Data Warehouse at Clover (PDF) (Otis Anderson)
A brief tour of why we focused on building out a data warehouse early on at Clover, and why we think the Data Science function has room to grow in health insurance.
A review of some of the pitfalls in planning local practice development programmes, and suggestions for how to produce a comprehensive and coherent plan that will achieve meaningful goals.
In our "Sitter Usage" webinar, you can expect to learn about the challenges of providing a patient with a sitter and the labor costs associated with it.
Nash Analytics™ has the capability to monitor how often PCTs, RNs and other support staff are pulled out of staffing to sit with a patient. We’ll use this webinar to show how to:
-Track sitter usage
-Review reports that track hours and cost
-Identify units that can benefit from alternative sitter options
-Determine when it's best to “hard wire” sitters into the budget
-Review alternatives to sitters as outlined in a Medscape article.
ehCOS: Global Pioneer in the development of "Next-Generation Electronic Healt... (everis / ehCOS)
Gartner, in "Market Trends: Vertical-Specific Software Will Be the Heart of New Global Healthcare Bodies," highlights key aspects of the future EHR: modular, flexible, open, and ready to incorporate the technological trends that will shape the coming decades. In this white paper, we explain why ehCOS CLINIC has emerged as a new-generation EHR today.
young or old, rich or poor, brain or heart, limbs or lungs
ISCHAEMIA is an unsolved problem.
We are on a mission to set up a world class Diagnostic, Treatment and Research Center in Calcutta for Ischaemic Diseases.
Improve inclusion exclusion criteria to safeguard successful patient recruitm... (Kunal Sampat)
9 strategies to develop excellent inclusion-exclusion criteria for your clinical trial protocol.
Each strategy is actionable, and you can start implementing it today.
Optimize your protocol for increased site satisfaction and faster patient recruitment.
Stop wasting money on clinical trials that don't enroll because you don't have fully vetted inclusion-exclusion criteria.
New Drug Application - How to Speed Up FDA Approval (TrialJoin)
We all know that sponsors invest a lot of money in animal and human clinical studies to test the safety and efficacy of a new drug. Sites are chosen to conduct the trials and, in the end, gather data for further analysis. But what's the purpose of it all? What happens after the trials end?
After a trial ends, the sponsor determines whether it went well enough to submit a New Drug Application (NDA) to the FDA. An NDA is submitted so the FDA can approve the new investigational product for the market. Here we'll discuss how the FDA reviews an NDA and ways you can increase your chances of approval.
Introduction to a panel of architects, public health professionals, and civic leaders about designing for health. Hosted by the American Institute of Architects, Washington, DC, on October 8, 2014.
IBM Insight 2014 session (4152) - Accelerating Insights in Healthcare with “B... (Alex Zeltov)
Accelerating insights in healthcare with "Big Data" and Hadoop: a use-case description of Hadoop at IBC (Independence Blue Cross), presented by Alex Zeltov and Darwin Leung of IBC.
Hospital Readmission Reduction: How Important are Follow Up Calls? (Hint: Very) (SironaHealth)
Starting in 2012, the Centers for Medicare and Medicaid Services (CMS) will begin withholding payments for potentially avoidable readmissions. This presentation reviews these new regulations, what causes excessive readmissions, and how hospitals can positively impact patient health by reaching out 24-72 hours after discharge.
Predicting Hospital Readmission Using Cascading (Cascading)
Michael Covert will examine how Healthcare Providers are finding ways to use Big Data analytics to reduce readmission rates and improve operational efficiency while complying with regulatory mandates.
Big Data, CEP and IoT: Redefining Healthcare Information Systems and Analytics (Tauseef Naquishbandi)
Big Data is a term encompassing techniques to capture, process, analyze, and visualize potentially large datasets within time frames not achievable with standard technologies.
It refers to the ability to crunch vast collections of information, analyze it instantly, and draw from it sometimes profoundly surprising conclusions.
Big data solutions can help stakeholders personalize care, engage patients, reduce variability and costs, and improve quality of health delivery.
Big data analytics can also contribute to providing a rich context to shape many areas of health care like analysis of effects, side-effects of drugs, genome analysis etc.
Medicine of the Future—The Transformation from Reactive to Proactive (P4) Med... (Ryan Squire)
Medicine of the Future—The Transformation from Reactive to Proactive (P4) Medicine as presented at the Ohio State University Medical Center Personalized Health Care National Conference.
Leroy Hood, MD, PhD, is the president and founder of the Institute of Systems Biology. Dr. Hood is a member of the National Academy of Sciences, the American Philosophical Society, the American Academy of Arts and Sciences, the Institute of Medicine and the National Academy of Engineering. His professional career began at Caltech where he and his colleagues pioneered four instruments — the DNA gene sequencer and synthesizer and the protein synthesizer and sequencer — which comprise the technological foundation for contemporary molecular biology. In particular, the DNA sequencer played a crucial role in contributing to the successful mapping of the human genome during the 1990s.
http://www.systemsbiology.org/Scientists_and_Research
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie... (Cloudera, Inc.)
Apache Hadoop, an open-source platform, is increasingly gaining adoption within organizations trying to draw insight from all the big data being generated. Hadoop, and a handful of open-source tools that complement it, are promising to make gigantic and diverse datasets easily and economically available for quick analysis. A burgeoning partner ecosystem is also essential to helping organizations turn big data into business value.
QCon São Paulo: Real-Time Analytics with Spark Streaming (Paco Nathan)
"Real-Time Analytics with Spark Streaming" presented at QCon São Paulo, 2015-03-26
http://qconsp.com/presentation/real-time-analytics-spark-streaming
This talk presents an overview of Spark and its history and applications, then focuses on the Spark Streaming component used for real-time analytics. We compare it with earlier frameworks such as MillWheel and Storm, and explore industry motivations for open-source micro-batch streaming at scale.
The talk will include demos for streaming apps that include machine-learning examples. We also consider public case studies of production deployments at scale.
We’ll review the use of open-source sketch algorithms and probabilistic data structures that get leveraged in streaming – for example, the trade-off of 4% error bounds on real-time metrics for two orders of magnitude reduction in required memory footprint of a Spark app.
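To make that trade-off concrete, here is a small hedged sketch (not from the talk) using Spark SQL's approx_count_distinct, which is backed by a HyperLogLog++ sketch; the rsd parameter requests a relative error similar to the 4% bound quoted above. The event data is synthetic.

from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("sketch-demo").getOrCreate()

# Synthetic click events: (page_id, user_id)
events = spark.createDataFrame(
    [(i % 100, "user_%d" % (i % 5000)) for i in range(20000)],
    ["page_id", "user_id"],
)

# rsd=0.04 asks for ~4% relative standard deviation; the underlying
# HyperLogLog++ sketch stays a few KB regardless of true cardinality.
uniques = events.groupBy("page_id").agg(
    F.approx_count_distinct("user_id", rsd=0.04).alias("approx_unique_users")
)
uniques.show(5)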
Introduction to Big Data Analytics: Batch, Real-Time, and the Best of Both Wo... (WSO2)
In this webinar, Srinath Perera, director of research at WSO2, will discuss:
Big data landscape: concepts, use cases, and technologies
Real-time analytics with WSO2 CEP
Batch analytics with WSO2 BAM
Combining batch and real-time analytics
Introducing WSO2 Machine Learner
This is the presentation I gave to the HIMSS Management Engineering and Process Improvement (ME-PI) Community on the use of predictive analytics in healthcare.
Evaluating Big Data Predictive Analytics Platforms (Teradata Aster)
Mike Gualtieri, Principal Analyst, Forrester Research, presents at the Big Analytics Roadshow, 2012 in New York City on December 12, 2012
Presentation title: Evaluating Big Data Predictive Analytics Platforms
Abstract: Great. You have Big Data. Now what? You have to analyze it to find game-changing predictive models that you can use to make smart decisions, reduce risk, or deliver breakthrough customer experiences. Big Data Predictive Analytics solutions are software and/or hardware solutions that allow firms to discover, evaluate, optimize, and deploy predictive models by analyzing big data sources. In this session, Forrester Principal Analyst Mike Gualtieri will discuss the key criteria you should use to evaluate Big Data Predictive Analytics platforms to meet your specific needs.
Children's Mercy Patient Progression Hub - HIT December 2023 (KC Digital Drive)
These slides were presented at the December 2023 meeting of the KC Digital Drive Health Innovation Team.
This presentation focuses on Children's Mercy's innovative use of data. Bill Saltmarsh, MBA, Vice President and Chief Data Officer, says, "We are using data to create value for our patients, their families, and our community. We believe that the key to delivering that value is contingent upon our ability to capture, safeguard, and derive novel insights from our data. It is also contingent upon our ability to take advantage of advanced analytical methods and technologies, including the use of Artificial Intelligence. One example of this type of innovation is our Patient Progression Hub, which is enabling us to improve the connected care experience for our patients by consistently providing the right information to the right people at the right time."
Bill leads the Data Intelligence Team for Children's Mercy, which includes groups such as Data Science, Clinical Reporting and Analytics, Data Platform Engineering, and Data Governance. Before joining Children's Mercy in March of 2023, he led data teams at ResMed and Pluralsight.
The Attune LIS is a leading cloud-based lab information system that integrates all departments and centers spread across locations on a stable and secure platform, giving decision makers a unified picture of their business. Attune LIS improves accuracy while accelerating turnaround time (TAT) and helping scale the business rapidly.
What is the best Healthcare Data Warehouse Model for Your Organization? (Health Catalyst)
Join Steve Barlow as he addresses the strengths and weaknesses of each of the following three primary Data Model approaches for data warehousing in healthcare:
1. Enterprise Data Model
2. Independent Data Marts
3. Late-binding Solutions
Speaker Presentation from U.S. News Healthcare of Tomorrow leadership summit, Nov. 1-3, 2017 in Washington, DC. Find out more about this forum at www.usnewshot.com.
Staffing Decision-Making Using Simulation Modeling (Alexander Kolker)
The use of Management Engineering methodology for staffing decision-making:
• Part 1 - Quality and Cost: Outpatient Flu Clinic
• Part 2 - Quality and Cost: Optimal PACU Nursing Staffing
• Summary of Fundamental Management Engineering
10 Things to Consider When Building a CTMS Business Case (Perficient, Inc.)
Sponsors and research organizations are often tasked with building a business case for a clinical trial management system (CTMS) before they even evaluate the various solutions in the marketplace.
After multiple successful Oracle Siebel CTMS implementations, Perficient has identified 10 ways you can benefit from a CTMS solution.
In this slideshare we share information that you can leverage as you develop a business case for a CTMS.
We also demonstrate the two most popular CTMS benefits and corresponding features.
Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predict... (Databricks)
The prevailing issue with Operating Room (OR) scheduling in a hospital setting is that it is difficult to schedule and predict available OR block times. This leads to empty, unused operating rooms and longer waits for patients needing procedures. In this three-part session, Ayad Shammout and Denny Lee will show:
1) How we tried to solve this problem using traditional DW techniques
2) How we took advantage of the DW capabilities in Apache Spark and easily transitioned to Spark MLlib so we could more easily predict available OR block times, resulting in better OR utilization and shorter wait times for patients
3) Some of the key learnings we had when migrating from DW to Spark
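For flavor only (not the session's actual code), a DW-style rollup of block utilization could be expressed directly in Spark SQL like this; the or_blocks table and its columns are assumptions for the sketch.

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("or-dw-rollup").getOrCreate()

# Assumed historical table of OR blocks with usage minutes.
spark.read.parquet("/data/or_blocks").createOrReplaceTempView("or_blocks")

utilization = spark.sql("""
    SELECT surgical_pool,
           date_trunc('week', block_date)          AS week,
           SUM(used_minutes) / SUM(block_minutes)  AS utilization
    FROM or_blocks
    GROUP BY surgical_pool, date_trunc('week', block_date)
""")
utilization.show()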
eHealth Summit: "How a mathematical patient flow modelling study can eliminat... (3GDR)
Slides from National eHealth Summit, 30 Sept 2015 at Carton House, Kildare: Professor Gary Courtney, Lead, National Acute Medicine Programme (NAMP).
#eHealthSummit15
http://www.ehealthsummit.ie
http://mhealthinsight.com/2015/09/25/mhealth-insights-from-the-ehealth-summit/
Conducting a Summative Study of EHR Usability: Case Study (UXPA Boston)
At last year's conference, a group of us explored the complexity involved in evaluating the usability of Electronic Health Records: the wide range of user profiles and characteristics, a seemingly infinite number of tasks, and challenges in obtaining realistic data while respecting HIPAA regulations. In December, the usability team at athenahealth conducted a summative usability study of [product]. In this case study, Kris will discuss how the team navigated the challenges of summative EHR evaluation. Topics include task selection, recruiting, metric selection, logistics, and lessons learned.
Health Care: Cost Reductions through Data Insights - The Data Analysis Group (James Karis)
An overview of the cost reduction opportunities for a Health Care provider. These opportunities can be identified, quantified and optimised through data-driven insights. The slide pack also provides a strategic overview of how one would set up such a project within a large organisation, whilst mitigating patient-care concerns.
Nearly the Holy Grail – Clinical Portals for Faster, Better and Borderless Care (NHSScotlandEvent)
This session explores the piloting of a clinical portal giving clinicians across southern Scotland instant electronic access to patient data in order to deliver better, faster care.
North Tees and Hartlepool NHS Foundation Trust look to O2’s Casebook 3 to support Electronic Patient Records. NHS Trusts around the country face increasing challenges related to an ageing and increasing population. The balance of budgets and resources, against employee morale and patient care, has meant the need to explore how technology can be used to drive efficiencies and maximise clinician-patient facing time rather than admin time. Many NHS Trusts currently run at a deficit, and many are facing challenges and obstructions to achieving their performance targets, which means they are unable to access certain exemplar funding.
FPGA-Based Acceleration Architecture for Spark SQL with Qi Xie and Quanfu Wang (Spark Summit)
In this session we will present a configurable FPGA-based Spark SQL acceleration architecture. It targets leveraging FPGAs' highly parallel computing capability to accelerate Spark SQL queries and, thanks to FPGAs' higher power efficiency than CPUs, to lower power consumption at the same time. The architecture consists of SQL query decomposition algorithms and fine-grained FPGA-based engine units that perform basic computations of substring, arithmetic, and logic operations. Using the SQL query decomposition algorithm, we are able to decompose a complex SQL query into basic operations, each fed into an engine unit according to its pattern. SQL engine units are highly configurable and can be chained together to perform complex Spark SQL queries, so that one SQL query is transformed into a hardware pipeline. We will present performance benchmark results comparing queries on the FPGA-based Spark SQL acceleration architecture (XEON E5 plus FPGA) with Spark SQL queries on XEON E5 alone, showing 10X ~ 100X improvement, and we will demonstrate one SQL query workload from a real customer.
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M... (Spark Summit)
In this talk, we'll present techniques for visualizing large-scale machine learning systems in Spark. These are techniques employed by Netflix to understand and refine the machine learning models behind Netflix's famous recommender systems, which are used to personalize the Netflix experience for its 99 million members around the world. Essential to these techniques is Vegas, a new OSS Scala library that aims to be the "missing Matplotlib" for Spark/Scala. We'll talk about the design of Vegas and its usage in Scala notebooks to visualize machine learning models.
This presentation introduces how we design and implement a real-time processing platform using the latest Spark Structured Streaming framework to intelligently transform production lines in the manufacturing industry. A traditional production line has a variety of isolated structured, semi-structured, and unstructured data, such as sensor data, machine screen output, log output, and database records. There are two main data scenarios: 1) picture and video data, low frequency but large in size; 2) continuous high-frequency data, small per record but very large in total volume, such as vibration data used to detect equipment quality. These data have the characteristics of streaming data: real-time, volatile, bursty, disordered, and unbounded. Making effective real-time decisions to retrieve value from these data is critical to smart manufacturing. The latest Spark Structured Streaming framework greatly lowers the bar for building highly scalable and fault-tolerant streaming applications. Thanks to Spark, we were able to build a low-latency, high-throughput, and reliable operating system covering data acquisition, transmission, analysis, and storage. An actual user case proved that the system meets the needs of real-time decision-making. The system greatly enhances predictive fault repair and production-line material tracking efficiency, and can reduce the labor needed for the production lines by about half.
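As a hedged sketch of the kind of pipeline described (the source path, schema, and alert threshold are assumptions), a Structured Streaming job that windows high-frequency sensor data and flags anomalies could look like this:

from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StructField, StringType, DoubleType, TimestampType

spark = SparkSession.builder.appName("line-monitor").getOrCreate()

schema = StructType([
    StructField("machine_id", StringType()),
    StructField("ts", TimestampType()),
    StructField("vibration", DoubleType()),
])

alerts = (
    spark.readStream.schema(schema).json("/stream/sensors")  # assumed landing path
    .withWatermark("ts", "1 minute")
    .groupBy(F.window("ts", "30 seconds"), "machine_id")
    .agg(F.avg("vibration").alias("avg_vibration"))
    .where("avg_vibration > 0.8")  # assumed threshold for equipment wear
)

query = alerts.writeStream.outputMode("append").format("console").start()
query.awaitTermination()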
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra (Spark Summit)
As common sense would suggest, weather has a definite impact on traffic. But how much? And under what circumstances? Can we improve traffic (congestion) prediction given weather data? Predictive traffic is envisioned to significantly impact how drivers plan their day by alerting users before they travel, finding the best times to travel, and, over time, learning from new IoT data such as road conditions and incidents. This talk will cover the traffic prediction work conducted jointly by IBM and the traffic data provider. As part of this work, we conducted a case study over five large metropolitan areas in the US, 2.58 billion traffic records, and 262 million weather records, to quantify the boost in traffic prediction accuracy from weather data. We will provide an overview of our lambda architecture, with Apache Spark used to build prediction models from weather and traffic data, and Spark Streaming used to score the models and provide real-time traffic predictions. The talk will also cover a suite of extensions to Spark for analyzing geospatial and temporal patterns in traffic and weather data, as well as the machine learning algorithms used with the Spark framework. Initial results of this work were presented at the National Association of Broadcasters meeting in Las Vegas in April 2017, and there is work underway to scale the system to provide predictions in over 100 cities. The audience will learn about our experience scaling Spark in offline and streaming modes, building statistical and deep-learning pipelines with Spark, and techniques for working with geospatial and time-series data.
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem... (Spark Summit)
Graph is on the rise and it’s time to start learning about scalable graph analytics! In this session we will go over two Spark-based Graph Analytics frameworks: Tinkerpop and GraphFrames. While both frameworks can express very similar traversals, they have different performance characteristics and APIs. In this Deep-Dive by example presentation, we will demonstrate some common traversals and explain how, at a Spark level, each traversal is actually computed under the hood! Learn both the fluent Gremlin API as well as the powerful GraphFrame Motif api as we show examples of both simultaneously. No need to be familiar with Graphs or Spark for this presentation as we’ll be explaining everything from the ground up!
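For readers new to the motif API mentioned above, here is a minimal GraphFrames sketch (assuming the graphframes package is available, e.g. via --packages) that finds directed triangles; the toy graph is illustrative.

from pyspark.sql import SparkSession
from graphframes import GraphFrame

spark = SparkSession.builder.appName("motif-demo").getOrCreate()

v = spark.createDataFrame([("a", "Alice"), ("b", "Bob"), ("c", "Carol")], ["id", "name"])
e = spark.createDataFrame([("a", "b"), ("b", "c"), ("c", "a")], ["src", "dst"])
g = GraphFrame(v, e)

# Motif syntax: named vertices and edges form a pattern to match against the graph.
triangles = g.find("(x)-[e1]->(y); (y)-[e2]->(z); (z)-[e3]->(x)")
triangles.show()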
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ... (Spark Summit)
Building accurate machine learning models has been an art of data scientists, i.e., algorithm selection, hyperparameter tuning, feature selection, and so on. Recently, efforts to break through these "black arts" have begun. In cooperation with our partner, NEC Laboratories America, we have developed a Spark-based automatic predictive modeling system. The system automatically searches for the best algorithm, parameters, and features without any manual work. In this talk, we will share how the automation system is designed to exploit the attractive advantages of Spark. An evaluation with real open data demonstrates that our system can explore hundreds of predictive models and discover the most accurate ones in minutes on an Ultra High Density Server, which employs 272 CPU cores, 2 TB of memory, and 17 TB of SSD in a 3U chassis. We will also share open challenges in learning such a massive number of models on Spark, particularly from reliability and stability standpoints. This talk will cover the presentation already shown at Spark Summit SF'17 (#SFds5), but from a more technical perspective.
Apache Spark and Tensorflow as a Service with Jim Dowling (Spark Summit)
In Sweden, from the RISE ICE Data Center at www.hops.site, we are providing researchers both Spark-as-a-Service and, more recently, Tensorflow-as-a-Service as part of the Hops platform. In this talk, we examine the different ways in which Tensorflow can be included in Spark workflows, from batch to streaming to structured streaming applications. We will analyse the different frameworks for integrating Spark with Tensorflow, from Tensorframes to TensorflowOnSpark to Databricks' Deep Learning Pipelines. We introduce the different programming models supported and highlight the importance of cluster support for managing different versions of Python libraries on behalf of users. We will also present cluster management support for sharing GPUs, including Mesos and YARN (in Hops Hadoop). Finally, we will perform a live demonstration of training and inference for a TensorflowOnSpark application written in Jupyter that can read data from either HDFS or Kafka, transform the data in Spark, and train a deep neural network on Tensorflow. We will show how to debug the application using both the Spark UI and Tensorboard, and how to examine logs and monitor training.
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library... (Spark Summit)
With the rapid growth of available datasets, it is imperative to have good tools for extracting insight from big data. The Spark ML library has excellent support for performing at-scale data processing and machine learning experiments, but more often than not, Data Scientists find themselves struggling with issues such as: low level data manipulation, lack of support for image processing, text analytics and deep learning, as well as the inability to use Spark alongside other popular machine learning libraries. To address these pain points, Microsoft recently released The Microsoft Machine Learning Library for Apache Spark (MMLSpark), an open-source machine learning library built on top of SparkML that seeks to simplify the data science process and integrate SparkML Pipelines with deep learning and computer vision libraries such as the Microsoft Cognitive Toolkit (CNTK) and OpenCV. With MMLSpark, Data Scientists can build models with 1/10th of the code through Pipeline objects that compose seamlessly with other parts of the SparkML ecosystem. In this session, we explore some of the main lessons learned from building MMLSpark. Join us if you would like to know how to extend Pipelines to ensure seamless integration with SparkML, how to auto-generate Python and R wrappers from Scala Transformers and Estimators, how to integrate and use previously non-distributed libraries in a distributed manner and how to efficiently deploy a Spark library across multiple platforms.
Next CERN Accelerator Logging Service with Jakub Wozniak (Spark Summit)
The Next Accelerator Logging Service (NXCALS) is a new Big Data project at CERN aiming to replace the existing Oracle-based service.
The main purpose of the system is to store and present Controls/Infrastructure related data gathered from thousands of devices in the whole accelerator complex.
The data is used to operate the machines, improve their performance and conduct studies for new beam types or future experiments.
During this talk, Jakub will speak about NXCALS requirements and the design choices that led to the selected architecture based on Hadoop and Spark. He will present the Ingestion API, the abstractions behind the Meta-data Service, and the Spark-based Extraction API, where simple changes to the schema handling greatly improved the overall usability of the system. The system itself is not CERN-specific and can be of interest to other companies or institutes confronted with similar Big Data problems.
Powering a Startup with Apache Spark with Kevin Kim (Spark Summit)
At Between (a mobile app for couples with 20M downloads globally), Spark powers everything from daily batches for extracting metrics to analysis and dashboards. Spark is widely used by engineers and data analysts at Between; thanks to its performance and extensibility, data operations have become extremely efficient. The entire team, including biz dev, global operations, and design, enjoys the data results, so Spark is empowering the whole company toward data-driven operation and thinking. Kevin, co-founder and data team leader at Between, will present how things are going at Between. Listeners will learn how a small and agile team lives with data (how we build the organization, culture, and technical base).
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—... (Spark Summit)
In many cases, Big Data becomes just another buzzword because of the lack of tools that can support both the technological requirements for developing and deploying projects and the fluency of communication between the different profiles of people involved in the projects.
In this talk, we will present Moriarty, a set of tools for fast prototyping of Big Data applications that can be deployed in an Apache Spark environment. These tools support the creation of Big Data workflows using existing functional blocks or the creation of new functional blocks. The created workflow can then be deployed on a Spark infrastructure and used through a REST API.
For a better understanding of Moriarty, the prototyping process, and the way it hides the Spark environment from Big Data users and developers, we will present it together with a couple of examples: one based on Industry 4.0 success cases and another on a logistics success case.
How Nielsen Utilized Databricks for Large-Scale Research and Development with... (Spark Summit)
Large-scale testing of new data products or enhancements to existing products in a research and development environment can be a technical challenge for data scientists. In some cases, tools available to data scientists lack production-level capacity, whereas other tools do not provide the algorithms needed to run the methodology. At Nielsen, the Databricks platform provided a solution to both of these challenges. This breakout session will cover a specific Nielsen business case where two methodology enhancements were developed and tested at large-scale using the Databricks platform. Development and large-scale testing of these enhancements would not have been possible using standard database tools.
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov... (Spark Summit)
Data lineage tracking is one of the significant problems that financial institutions face when using modern big data tools. This presentation describes Spline – a data lineage tracking and visualization tool for Apache Spark. Spline captures and stores lineage information from internal Spark execution plans and visualizes it in a user-friendly manner.
Goal Based Data Production with Sim Simeonov (Spark Summit)
Since the invention of SQL and relational databases, data production has been about specifying how data is transformed through queries. While Apache Spark can certainly be used as a general distributed query engine, the power and granularity of Spark’s APIs enables a revolutionary increase in data engineering productivity: goal-based data production. Goal-based data production concerns itself with specifying WHAT the desired result is, leaving the details of HOW the result is achieved to a smart data warehouse running on top of Spark. That not only substantially increases productivity, but also significantly expands the audience that can work directly with Spark: from developers and data scientists to technical business users. With specific data and architecture patterns spanning the range from ETL to machine learning data prep and with live demos, this session will demonstrate how Spark users can gain the benefits of goal-based data production.
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le... (Spark Summit)
Have you imagined a simple machine learning solution able to prevent revenue leakage and monitor your distributed application? To answer this question, we offer a practical and simple machine learning solution to create an intelligent monitoring application based on simple data analysis using Apache Spark MLlib. Our application uses linear regression models to make predictions and check whether the platform is experiencing any operational problems that could result in revenue losses. The application monitors distributed systems and provides notifications describing the problem detected, so users can act quickly to avoid serious problems that directly impact the company's revenue, reducing the time to action. We will present an architecture for not only a monitoring system but also an active actor in our outage recoveries. At the end of the presentation you will have access to our training program source code and will be able to adapt and implement it in your company. This solution has already helped prevent about US$3 million in losses in the last year.
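A hedged sketch of that monitoring idea (metric names and the 3-sigma alert rule are assumptions, not the talk's exact method): fit a linear model of the expected metric, then flag observations whose residual is too large.

from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.regression import LinearRegression

spark = SparkSession.builder.appName("revenue-monitor").getOrCreate()

# Hypothetical hourly platform metrics.
df = spark.createDataFrame(
    [(9.0, 100.0, 1020.0), (10.0, 110.0, 1105.0), (11.0, 95.0, 960.0)],
    ["hour", "transactions", "revenue"],
)

assembled = VectorAssembler(
    inputCols=["hour", "transactions"], outputCol="features"
).transform(df)
model = LinearRegression(featuresCol="features", labelCol="revenue").fit(assembled)

# Alert when observed revenue deviates more than 3 sigma from the prediction.
sigma = model.summary.rootMeanSquaredError
alerts = model.transform(assembled).where(
    F.abs(F.col("revenue") - F.col("prediction")) > 3 * sigma
)
alerts.show()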
Getting Ready to Use Redis with Apache Spark with Dvir Volk (Spark Summit)
Getting Ready to Use Redis with Apache Spark is a technical tutorial designed to address integrating Redis with an Apache Spark deployment to increase the performance of serving complex decision models. To set the context for the session, we start with a quick introduction to Redis and the capabilities it provides. We cover the basic data types provided by Redis and cover the module system. Using an ad-serving use case, we look at how Redis can improve the performance and reduce the cost of using complex ML models in production. Attendees will be guided through the key steps of setting up and integrating Redis with Spark, including how to train a model using Spark and then load and serve it using Redis, as well as how to work with the Spark Redis module. The capabilities of the Redis Machine Learning Module (redis-ml) will be discussed, focusing primarily on decision trees and regression (linear and logistic), with code examples to demonstrate how to use these features. At the end of the session, developers should feel confident building a prototype/proof-of-concept application using Redis and Spark. Attendees will understand how Redis complements Spark and how to use Redis to serve complex ML models with high performance.
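A minimal sketch of that pairing, assuming the spark-redis connector is on the classpath (its DataFrame source is "org.apache.spark.sql.redis"); host, table, and key names are illustrative.

from pyspark.sql import SparkSession

spark = (
    SparkSession.builder.appName("redis-serving")
    .config("spark.redis.host", "localhost")
    .config("spark.redis.port", "6379")
    .getOrCreate()
)

# Hypothetical per-user ad scores produced by a Spark-trained model.
scores = spark.createDataFrame([("u1", 0.91), ("u2", 0.17)], ["user_id", "score"])

# Write scores to Redis so a low-latency serving layer can look them up by key.
(
    scores.write.format("org.apache.spark.sql.redis")
    .option("table", "ad_scores")
    .option("key.column", "user_id")
    .mode("overwrite")
    .save()
)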
Deduplication and Author-Disambiguation of Streaming Records via Supervised M... (Spark Summit)
Here we present a general supervised framework for record deduplication and author disambiguation via Spark. This work differentiates itself in three ways:
- The application of Databricks and AWS makes this a scalable implementation. Compute resources are considerably lower than with traditional legacy technology using big boxes 24/7. Scalability is crucial, as Elsevier's Scopus data, the biggest scientific abstract repository, covers roughly 250 million authorships from 70 million abstracts spanning a few hundred years.
- We create a fingerprint for each piece of content using deep learning and/or word2vec algorithms to expedite pairwise similarity calculation. These encoders substantially reduce compute time while maintaining semantic similarity (unlike traditional TF-IDF or predefined taxonomies). We will briefly discuss how to optimize word2vec training with high parallelization. Moreover, we show how these encoders can be used to derive a standard representation for all our entities, such as documents, authors, users, journals, etc. This standard representation can simplify the recommendation problem into a pairwise similarity search and hence offer a basic recommender for cross-product applications where we may not have a dedicated recommender engine designed.
- Traditional author-disambiguation or record-deduplication algorithms are batch processes with little to no training data. However, we have roughly 25 million authorships that are manually curated or corrected upon user feedback. Hence it is crucial to maintain historical profiles, and we have developed a machine learning implementation to deal with data streams and process them in mini-batches or one document at a time. We will discuss how to measure the accuracy of such a system, how to tune it, and how to process the raw pairwise-similarity output into final clusters. Lessons learned from this talk can help all sorts of companies that want to integrate their data or deduplicate their user/customer/product databases.
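To illustrate the fingerprint idea (only a sketch; the column names and similarity check are assumptions), Spark ML's Word2Vec can embed each record, and candidate pairs can then be compared by cosine similarity:

import numpy as np
from pyspark.sql import SparkSession
from pyspark.ml.feature import Tokenizer, Word2Vec

spark = SparkSession.builder.appName("dedup-fingerprints").getOrCreate()

docs = spark.createDataFrame(
    [(1, "deep learning for record linkage"),
     (2, "record linkage with deep learning"),
     (3, "matrix computations on spark")],
    ["id", "title"],
)

tokens = Tokenizer(inputCol="title", outputCol="words").transform(docs)
w2v = Word2Vec(vectorSize=50, minCount=0, inputCol="words", outputCol="vec")
fingerprints = w2v.fit(tokens).transform(tokens)  # one dense vector per record

# Compare two candidate records by cosine similarity of their fingerprints.
a, b = [r["vec"].toArray() for r in fingerprints.orderBy("id").limit(2).collect()]
print(float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))))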
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization... (Spark Summit)
The use of large-scale machine learning and data mining methods is becoming ubiquitous in many application domains, ranging from business intelligence and bioinformatics to self-driving cars. These methods heavily rely on matrix computations, and it is hence critical to make these computations scalable and efficient. Matrix computations are often complex and involve multiple steps that need to be optimized and sequenced properly for efficient execution. This work presents new efficient and scalable matrix processing and optimization techniques based on Spark. The proposed techniques estimate the sparsity of intermediate matrix-computation results and optimize communication costs. An evaluation plan generator for complex matrix computations is introduced, as well as a distributed plan optimizer that exploits dynamic cost-based analysis and rule-based heuristics. The result of a matrix operation will often serve as an input to another matrix operation, thus defining the matrix data dependencies within a matrix program. The matrix query plan generator produces query execution plans that minimize memory usage and communication overhead by partitioning the matrix based on the data dependencies in the execution plan. We implemented the proposed matrix techniques inside Spark SQL and optimize the matrix execution plan based on the Spark SQL Catalyst. We conduct case studies on a series of ML models and matrix computations with special features on different datasets: PageRank, GNMF, BFGS, sparse matrix chain multiplications, and a biological data analysis. The open-source library ScaLAPACK and the array-based database SciDB are used for performance evaluation. Our experiments are performed on six real-world datasets: social network data (e.g., soc-pokec, cit-Patents, LiveJournal), Twitter2010, Netflix recommendation data, and a 1000 Genomes Project sample. Experiments demonstrate that our proposed techniques achieve up to an order-of-magnitude performance improvement.
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Opendatabay - Open Data Marketplace.pptx (Opendatabay)
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay's AI-driven features streamline the data workflow. Finding the data you need shouldn't be complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with dedicated, AI-generated synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Adjusting primitives for graph : SHORT REPORT / NOTES (Subhajit Sahu)
Notes on adjusting primitives for graph algorithms, such as PageRank. Compressed Sparse Row (CSR) is an adjacency-list-based graph representation.
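A tiny illustration of the CSR layout (plain Python, independent of the repository's actual code): adjacency lists packed into one flat edge array plus per-vertex offsets.

# Example digraph with 4 vertices and edges 0->1, 0->2, 1->2, 3->0.
offsets = [0, 2, 3, 3, 4]  # offsets[v]..offsets[v+1] delimit v's neighbours
edges = [1, 2, 2, 0]       # all adjacency lists concatenated

def neighbours(v):
    return edges[offsets[v]:offsets[v + 1]]

print(neighbours(0))  # [1, 2]
print(neighbours(2))  # [] -- vertex 2 has no out-edges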
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
StarCompliance is a leading firm specializing in the recovery of stolen cryptocurrency. Our comprehensive services are designed to assist individuals and organizations in navigating the complex process of fraud reporting, investigation, and fund recovery. We combine cutting-edge technology with expert legal support to provide a robust solution for victims of crypto theft.
Our Services Include:
Reporting to Tracking Authorities:
We immediately notify all relevant centralized exchanges (CEX), decentralized exchanges (DEX), and wallet providers about the stolen cryptocurrency. This ensures that the stolen assets are flagged as scam transactions, making it impossible for the thief to use them.
Assistance with Filing Police Reports:
We guide you through the process of filing a valid police report. Our support team provides detailed instructions on which police department to contact and helps you complete the necessary paperwork within the critical 72-hour window.
Launching the Refund Process:
Our team of experienced lawyers can initiate lawsuits on your behalf and represent you in various jurisdictions around the world. They work diligently to recover your stolen funds and ensure that justice is served.
At StarCompliance, we understand the urgency and stress involved in dealing with cryptocurrency theft. Our dedicated team works quickly and efficiently to provide you with the support and expertise needed to recover your assets. Trust us to be your partner in navigating the complexities of the crypto world and safeguarding your investments.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23... (John Andrews)
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
2. About Ayad Shammout
• Director of Business Intelligence, Beth Israel Deaconess Medical Center
• Helped build highly available / disaster recovery infrastructure for BIDMC
3. About Denny Lee
• Technology Evangelist, Databricks
• Former Sr. Director of Data Sciences Engineering, Concur
• Helped bring Hadoop onto Windows and Azure
4. About Databricks
• Founded by Apache Spark creators
• Largest contributor to the Spark project, committed to keeping Spark 100% open source
• Databricks is an end-to-end hosted platform
5. Time is an OR's most valuable resource
• $15-$20 / minute for a basic surgical procedure
• Lack of OR availability means loss of patients
• OR efficiency differs depending on the OR staffing and allocation (8, 10, 13, or 16 h), not the workload (i.e., cases)
6. "You are not going to get the elephant to shrink or change its size. You need to face the fact that the elephant is 8 OR tall and 11hr wide" - Steven Shafer, MD
7. Operating Room
• Better utilization = better profit margins
• Reduced support and maintenance costs
Medical Staff
• Better utilization = better profit margins
• Better medical staff efficiencies = better outcomes
Patients
• Shorter wait times and fewer cancellations
• Better medical staff efficiencies = better outcomes
8. Develop Predictive Model
• Develop a predictive model that would identify available OR time two weeks in advance.
• Allow us to confirm wait-list cases two weeks in advance, instead of when the blocks normally release four days out.
9. Forecast OR Schedule
• Case load three weeks in advance
• Book more cases weeks in advance to prevent under-utilization
• Reduce staff overtime and idle time
10. Background
• Three surgical pools:
  – GYN, urology, general surgery, colorectal, surgical oncology
  – Eyes, plastics, ENT
  – Orthopedics, podiatry
• Currently built using SQL Server Data Mining
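Not from the deck itself, but a hedged sketch of how such a per-pool forecast could be expressed on Spark's DataFrame-based ML API; the input table and feature columns are assumptions for illustration.

from pyspark.sql import SparkSession
from pyspark.ml import Pipeline
from pyspark.ml.feature import StringIndexer, VectorAssembler
from pyspark.ml.regression import LinearRegression

spark = SparkSession.builder.appName("or-pool-forecast").getOrCreate()

blocks = spark.read.parquet("/data/or_blocks")  # assumed historical block table

pipeline = Pipeline(stages=[
    StringIndexer(inputCol="surgical_pool", outputCol="pool_idx"),
    VectorAssembler(inputCols=["pool_idx", "day_of_week", "scheduled_cases"],
                    outputCol="features"),
    LinearRegression(featuresCol="features", labelCol="unused_minutes"),
])
model = pipeline.fit(blocks)  # predictions feed the two-week wait-list confirmations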
22. Why the model is working
• Can coordinate waitlist scheduling logistics with physicians and patients within two weeks of the surgery
• Plan staff scheduling and resources so there are fewer last-minute staffing issues for nursing and anesthesia
• Utilization metrics are showing us where we can maximize our elective surgical schedule and level demand