SlideShare a Scribd company logo
Data Mining for
Diabetes Readmission Prediction
Team Evolution
Yi Chun Chien, Xiayu Zeng, Hong Zhang, Yixi Zhang
Agenda
 Abstract
 Introduction
 Data Description
 Problem Statement
 Methodology
 Discussions and Results
 Conclusions
2
Abstract
Background: Alarmingly high risk of readmissions in the US
Goal: discover factors contributing to hospitals readmissions
Methods: C5.0 Decision Tree, Quest, Neural Network and Bayesian Network
Results: emergency readmissions occur most frequently
Conclusions: effective prediction on readmissions enables hospitals to
identify and target patients at the highest risk
3
Introduction
 Topic: Diabetes Readmission Prediction
 What Is Readmission Rate
 A hospitalization that occurs within 30 days after a discharge
 Why Is Readmission Important
 Reduce cost of care and medical disputes
 Improve patients’ safety and health
4
Data Description
 Data source: The data is from the Center
for Clinical and Translational Research,
Virginia Commonwealth University
The dataset represents 10 years (1999-
2008) of clinical care at 130 US
hospitals and integrated delivery
networks. It includes101,766 instances
and 55 features representing patient and
hospital outcomes.
Link:
http://www.cioslab.vcu.edu/index.html
Attribute Description
Admission type
patient's admission type: emergency, urgent, elective,
newborn, not available, etc.
Time in hospital Integer number of days between admission and discharge
Number of lab
procedures Number of lab tests performed during the encounter
Number of outpatient
visits
Number of outpatient visits of the patient in the year
preceding the encounter
Number of emergency
visits
Number of emergency visits of the patient in the year
preceding the encounter
Number of inpatient
visits
Number of inpatient visits of the patient in the year
preceding the encounter
Number of diagnoses Number of diagnoses entered to the system
A1c test result The range of the result or if the test was not taken
Diabetes medications
Whether there was any diabetic medication prescribed or
not
Readmitted Days to inpatient readmission
Main attributes :
5
Methodology
 Business Outlook
 Data
Processing
 Variable
Selection
 Accuracy
Comparison
 Conclusion
 Model Building
Problem
Statement
Data
Pre-mining
Perform
AnalysisEvaluation
6
Problem Statement
 Identify the major factors that contribute to hospital readmissions
 Measure the influence of every attribute
 Compare accuracy of each model
7
0%
20%
40%
60%
80%
100%
As-IS Now
11%
49%
89%
51%
Patient Distribution
Readmission < 30 days Non-within 30 days
Data Pre-mining
 Re-categorize Readmission group from 3 groups (<30, >30 days and No)
to 2 groups (Readmission <30 days and Non-within 30 days)
 Review all variables’ distribution to ensure data quality
 Make Readmission group’s sample ratio be 1:1
 Review each variable’s relationship with Readmission
 Review numeric variables’ correlation
8
Variable’s relationship with Readmission
 Remove irrelevant variables
(EX: patient ID, payer code)
 Delete unknown value
 Eliminate variables only
has majority value
9
Variable’s relationship with Readmission
 Found some variables have
different behavior between
readmission group
 Diagnoses Number
 Admission Type ID
 Inpatient Number
10
Review numeric variables’ correlation
 No correlation between
numeric variables’
correlation
 Variables are independent
11
Perform Analysis
 Input variables: 14; Output variable: Readmission group
 Total sample size: 23,154
 Partition: 70% on Training; 30% on Testing
 Build models from (1) Decision Tree Analysis: C5.0 & QUEST
(2) Apply Neural Network Methodology
(3) Perform Bayes’ Analysis
12
13Model Stream
Hidden Layer=5 Hidden Layer=8
Neural Network - Comparison 14
Hidden layer = 5 Hidden layer = 8
13
Neural Network 15
Training data Testing data
Model Performance:
 66% accuracy in testing
dataset
Predictor Importance:
Top 5:
 Number of inpatient visits
 Admission type
 Admission source
 Discharge disposition
 Number of emergency visits
Why Use Bayesian Network
• Have probability for reference
• Allow variables’ dependency
• Well suited for categorical variables
16
Interpreting the Results
• Overall Model:
Training accuracy: 70.67%
Testing accuracy: 69.51%
• Model on “Yes” results:
Training accuracy: 71.35%
Testing accuracy: 69.54%
19Decision Tree: C5.0
Model Performance:
 65% accuracy in testing
dataset
Predictor Importance:
Top 5:
 Admission type
 Admission source
 Discharge disposition
 Number of inpatient visits
 Number of lab procedures
20Decision Tree: C5.0
Decision tree (part)
Decision Tree: QUEST
 What’s QUEST: Quick, Unbiased and Efficient Statistical Tree
 Advantage:
 Easily handle categorical predictor variables with many categories
 Use imputation to deal with missing values
 It provides linear splits using Fisher's LDA method
 How to improve accuracy
 Boosting: Randomly re-sample N dataset -> create
N trees -> optimal by weighted average
 Bagging: optimal by aggregated average
21
Decision Tree: QUEST
 Adopt Boosting method in QUEST
 Model Performance:
 72% accuracy in testing dataset
 Variable Importance:
 Top 5: Admission Type,
Admission Source,
Number of inpatient visits
Discharge Deposition,
Number of diagnoses
22
Training data Testing data
Model Evaluation 23
QUEST C5.0 Neural Net Bayesian
Admission Type Admission Type Inpatient # Discharge Disposition
Admission Source Admission Source Admission Type Admission Source
Inpatient # Discharge Disposition Admission Source Inpatient #
Discharge Disposition Inpatient # Discharge Disposition Lab Procedures #
Diagonose # Lab Procedures # Emergency # Admission Type
Top Important Variables:
 QUEST has the best accuracy
• Top 4 variables: Admission Type, Admission
Source, number of inpatient visits and
Discharge disposition.
NUMBER OF INPATIENT VISITS
(Number of inpatient visits of the patient in the year
preceding the encounter)
From the graph, it’s shown that number of inpatient visits
potentially increases the risk of readmission. For patients
who stayed in a hospital over 15 times in a year, they
would be 100% readmitted to the hospital within 30 days.
Inpatient treatment facilities should better provide
additional and more specialized medical care to reduce
their readmission rate.
Evaluation of top 4 important attributes
25ADMISSION TYPE
(1.Emergency, 2.Urgent, 3.Elective,
4.Newborn, 5.Not Available, 6.NULL,
7.Trauma Center, 8.Not Mapped )
ADMISSION SOURCE
(physician referral, emergency room,
and transfer from a hospital, etc.)
DISCHARGE DISPOSITION
(discharged to home, expired, and Hospice
/ home, etc. )
Decision Tree by JMP
Decision tree(part)
27
27Conclusion
• Data pre-mining is of upmost importance in improving the model accuracy
(1%72%);
• The readmission groups are related to admission source,admission type,
discharge disposition and number of inpatient visits;
• The readmission groups do not solely depend on any single variable, but the
interactions of related variables;
• Instead of tracking all 55 attributes, hospitals are suggested to focus on number
of patient’s inpatient visits, admission source, admission type, discharge
disposition;
• Hospitals are advised to concern not only inpatient treatment but also continuing
care after discharge.

More Related Content

What's hot

Power BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle CloudPower BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle Cloud
Kellyn Pot'Vin-Gorman
 
Data Discovery and Visualization
Data Discovery and VisualizationData Discovery and Visualization
Data Discovery and Visualization
Dr. Neil Brittliff
 
Search: Probabilistic Information Retrieval
Search: Probabilistic Information RetrievalSearch: Probabilistic Information Retrieval
Search: Probabilistic Information Retrieval
Vipul Munot
 
BIG DATA and USE CASES
BIG DATA and USE CASESBIG DATA and USE CASES
BIG DATA and USE CASES
Bhaskara Reddy Sannapureddy
 
Power BI: Tips and Tricks
Power BI: Tips and TricksPower BI: Tips and Tricks
Power BI: Tips and Tricks
GlobalLogic Ukraine
 
Big data and the Healthcare Sector
Big data and the Healthcare Sector Big data and the Healthcare Sector
Big data and the Healthcare Sector
Chris Groves
 
DATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSINGDATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSING
Ahtesham Ullah khan
 
Columnar Databases (1).pptx
Columnar Databases (1).pptxColumnar Databases (1).pptx
Columnar Databases (1).pptx
ssuser55cbdb
 
User Case of Migration from MicroStrategy to Power BI
 User Case of Migration from MicroStrategy to Power BI User Case of Migration from MicroStrategy to Power BI
User Case of Migration from MicroStrategy to Power BI
GreenM
 
Practo Ray.com for linkedin
Practo Ray.com for linkedinPracto Ray.com for linkedin
Practo Ray.com for linkedin
Dr Rahul Deshpande
 
Big Data applications in Health Care
Big Data applications in Health CareBig Data applications in Health Care
Big Data applications in Health Care
Leo Barella
 
Data Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File ManualData Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File Manual
Nitin Bhasin
 
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; KamberChapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
error007
 
Data Modeling with Power BI
Data Modeling with Power BIData Modeling with Power BI
Data Modeling with Power BI
Raul Martin Sarachaga Diaz
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop Introduction
Jayant Mukherjee
 
Chapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data MiningChapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data Mining
Izwan Nizal Mohd Shaharanee
 
Fraud Detection in Insurance with Machine Learning for WARTA - Artur Suchwalko
Fraud Detection in Insurance with Machine Learning for WARTA - Artur SuchwalkoFraud Detection in Insurance with Machine Learning for WARTA - Artur Suchwalko
Fraud Detection in Insurance with Machine Learning for WARTA - Artur Suchwalko
Institute of Contemporary Sciences
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
DataWorks Summit/Hadoop Summit
 
Data Mining: Concepts and Techniques — Chapter 2 —
Data Mining:  Concepts and Techniques — Chapter 2 —Data Mining:  Concepts and Techniques — Chapter 2 —
Data Mining: Concepts and Techniques — Chapter 2 —
Salah Amean
 
Association Analysis
Association AnalysisAssociation Analysis
Association Analysis
guest0edcaf
 

What's hot (20)

Power BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle CloudPower BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle Cloud
 
Data Discovery and Visualization
Data Discovery and VisualizationData Discovery and Visualization
Data Discovery and Visualization
 
Search: Probabilistic Information Retrieval
Search: Probabilistic Information RetrievalSearch: Probabilistic Information Retrieval
Search: Probabilistic Information Retrieval
 
BIG DATA and USE CASES
BIG DATA and USE CASESBIG DATA and USE CASES
BIG DATA and USE CASES
 
Power BI: Tips and Tricks
Power BI: Tips and TricksPower BI: Tips and Tricks
Power BI: Tips and Tricks
 
Big data and the Healthcare Sector
Big data and the Healthcare Sector Big data and the Healthcare Sector
Big data and the Healthcare Sector
 
DATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSINGDATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSING
 
Columnar Databases (1).pptx
Columnar Databases (1).pptxColumnar Databases (1).pptx
Columnar Databases (1).pptx
 
User Case of Migration from MicroStrategy to Power BI
 User Case of Migration from MicroStrategy to Power BI User Case of Migration from MicroStrategy to Power BI
User Case of Migration from MicroStrategy to Power BI
 
Practo Ray.com for linkedin
Practo Ray.com for linkedinPracto Ray.com for linkedin
Practo Ray.com for linkedin
 
Big Data applications in Health Care
Big Data applications in Health CareBig Data applications in Health Care
Big Data applications in Health Care
 
Data Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File ManualData Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File Manual
 
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; KamberChapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
 
Data Modeling with Power BI
Data Modeling with Power BIData Modeling with Power BI
Data Modeling with Power BI
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop Introduction
 
Chapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data MiningChapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data Mining
 
Fraud Detection in Insurance with Machine Learning for WARTA - Artur Suchwalko
Fraud Detection in Insurance with Machine Learning for WARTA - Artur SuchwalkoFraud Detection in Insurance with Machine Learning for WARTA - Artur Suchwalko
Fraud Detection in Insurance with Machine Learning for WARTA - Artur Suchwalko
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
 
Data Mining: Concepts and Techniques — Chapter 2 —
Data Mining:  Concepts and Techniques — Chapter 2 —Data Mining:  Concepts and Techniques — Chapter 2 —
Data Mining: Concepts and Techniques — Chapter 2 —
 
Association Analysis
Association AnalysisAssociation Analysis
Association Analysis
 

Viewers also liked

Predicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNetPredicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNet
Salford Systems
 
Classifying Readmissions of Diabetic Patient Encounters
Classifying Readmissions of Diabetic Patient EncountersClassifying Readmissions of Diabetic Patient Encounters
Classifying Readmissions of Diabetic Patient Encounters
Mayur Srinivasan
 
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Salford Systems
 
Readmission of Diabetes Patients Report
Readmission of Diabetes Patients ReportReadmission of Diabetes Patients Report
Readmission of Diabetes Patients Report
Hong Lu
 
Text Mining in Social Network
Text Mining in Social NetworkText Mining in Social Network
Text Mining in Social Network
Yi Chun (Nancy) Chien
 
Text Mining in Jeb Bush’s Email and Social Network
Text Mining in Jeb Bush’s Email and Social NetworkText Mining in Jeb Bush’s Email and Social Network
Text Mining in Jeb Bush’s Email and Social Network
Yi Chun (Nancy) Chien
 
2015 Sport Analysis for March Madness
2015 Sport Analysis for March Madness2015 Sport Analysis for March Madness
2015 Sport Analysis for March Madness
Yi Chun (Nancy) Chien
 
Rio reel 2015 illustration
Rio reel 2015 illustrationRio reel 2015 illustration
Rio reel 2015 illustration
Rio Li
 
Health care data - survivial analysis, draft
Health care data - survivial analysis, draftHealth care data - survivial analysis, draft
Health care data - survivial analysis, draft
Habet Madoyan
 
REGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATE
REGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATEREGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATE
REGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATE
Chaoyi WU
 
Churn Modeling-For-Mobile-Telecommunications
Churn Modeling-For-Mobile-Telecommunications Churn Modeling-For-Mobile-Telecommunications
Churn Modeling-For-Mobile-Telecommunications
Salford Systems
 
Analysis Of A Binary Outcome Variable
Analysis Of A Binary Outcome VariableAnalysis Of A Binary Outcome Variable
Analysis Of A Binary Outcome Variable
Arthur8898
 
Case Study: American Family Insurance Best Practices for Automating Guidewire...
Case Study: American Family Insurance Best Practices for Automating Guidewire...Case Study: American Family Insurance Best Practices for Automating Guidewire...
Case Study: American Family Insurance Best Practices for Automating Guidewire...
CA Technologies
 
5 Vital Tips to Help Reduce Readmissions in Hospitals
5 Vital Tips to Help Reduce Readmissions in Hospitals5 Vital Tips to Help Reduce Readmissions in Hospitals
5 Vital Tips to Help Reduce Readmissions in Hospitals
Juran Global
 
Presentation at UHC Annual Meeting
Presentation at UHC  Annual MeetingPresentation at UHC  Annual Meeting
Presentation at UHC Annual Meeting
Joel Saltz
 
Car accident repairshops
Car accident repairshopsCar accident repairshops
Car accident repairshops
Yi Chun (Nancy) Chien
 
Improve Your Regression with CART and RandomForests
Improve Your Regression with CART and RandomForestsImprove Your Regression with CART and RandomForests
Improve Your Regression with CART and RandomForests
Salford Systems
 
Learn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hrLearn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hr
QuestionPro
 
Predictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big dataPredictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big data
Arthur Charpentier
 
Decision tree and random forest
Decision tree and random forestDecision tree and random forest
Decision tree and random forest
Lippo Group Digital
 

Viewers also liked (20)

Predicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNetPredicting Hospital Readmission Using TreeNet
Predicting Hospital Readmission Using TreeNet
 
Classifying Readmissions of Diabetic Patient Encounters
Classifying Readmissions of Diabetic Patient EncountersClassifying Readmissions of Diabetic Patient Encounters
Classifying Readmissions of Diabetic Patient Encounters
 
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
Applied Multivariable Modeling in Public Health: Use of CART and Logistic Reg...
 
Readmission of Diabetes Patients Report
Readmission of Diabetes Patients ReportReadmission of Diabetes Patients Report
Readmission of Diabetes Patients Report
 
Text Mining in Social Network
Text Mining in Social NetworkText Mining in Social Network
Text Mining in Social Network
 
Text Mining in Jeb Bush’s Email and Social Network
Text Mining in Jeb Bush’s Email and Social NetworkText Mining in Jeb Bush’s Email and Social Network
Text Mining in Jeb Bush’s Email and Social Network
 
2015 Sport Analysis for March Madness
2015 Sport Analysis for March Madness2015 Sport Analysis for March Madness
2015 Sport Analysis for March Madness
 
Rio reel 2015 illustration
Rio reel 2015 illustrationRio reel 2015 illustration
Rio reel 2015 illustration
 
Health care data - survivial analysis, draft
Health care data - survivial analysis, draftHealth care data - survivial analysis, draft
Health care data - survivial analysis, draft
 
REGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATE
REGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATEREGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATE
REGRESSION ANALYSIS ON HEALTH INSURANCE COVERAGE RATE
 
Churn Modeling-For-Mobile-Telecommunications
Churn Modeling-For-Mobile-Telecommunications Churn Modeling-For-Mobile-Telecommunications
Churn Modeling-For-Mobile-Telecommunications
 
Analysis Of A Binary Outcome Variable
Analysis Of A Binary Outcome VariableAnalysis Of A Binary Outcome Variable
Analysis Of A Binary Outcome Variable
 
Case Study: American Family Insurance Best Practices for Automating Guidewire...
Case Study: American Family Insurance Best Practices for Automating Guidewire...Case Study: American Family Insurance Best Practices for Automating Guidewire...
Case Study: American Family Insurance Best Practices for Automating Guidewire...
 
5 Vital Tips to Help Reduce Readmissions in Hospitals
5 Vital Tips to Help Reduce Readmissions in Hospitals5 Vital Tips to Help Reduce Readmissions in Hospitals
5 Vital Tips to Help Reduce Readmissions in Hospitals
 
Presentation at UHC Annual Meeting
Presentation at UHC  Annual MeetingPresentation at UHC  Annual Meeting
Presentation at UHC Annual Meeting
 
Car accident repairshops
Car accident repairshopsCar accident repairshops
Car accident repairshops
 
Improve Your Regression with CART and RandomForests
Improve Your Regression with CART and RandomForestsImprove Your Regression with CART and RandomForests
Improve Your Regression with CART and RandomForests
 
Learn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hrLearn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hr
 
Predictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big dataPredictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big data
 
Decision tree and random forest
Decision tree and random forestDecision tree and random forest
Decision tree and random forest
 

Similar to Data mining for diabetes readmission

Data mining for diabetes readmission final
Data mining for diabetes readmission finalData mining for diabetes readmission final
Data mining for diabetes readmission final
Xiayu (Carol) Zeng
 
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cPredicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
Damian R. Mingle, MBA
 
How Allina Health Uses Analytics to Transform Care - HAS Session 16
How Allina Health Uses Analytics to Transform Care - HAS Session 16How Allina Health Uses Analytics to Transform Care - HAS Session 16
How Allina Health Uses Analytics to Transform Care - HAS Session 16
Health Catalyst
 
Bigtown simulation model
Bigtown simulation modelBigtown simulation model
Bigtown simulation model
NHS Improving Quality
 
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
Health Catalyst
 
Critical Appraisal - Quantitative SS.pptx
Critical Appraisal - Quantitative SS.pptxCritical Appraisal - Quantitative SS.pptx
Critical Appraisal - Quantitative SS.pptx
Mrs S Sen
 
Emerging Technologies and the Quality of Care
Emerging Technologies and the Quality of CareEmerging Technologies and the Quality of Care
Emerging Technologies and the Quality of Care
CBA Consórcio Brasileiro de Acreditação
 
Advanced Laboratory Analytics — A Disruptive Solution for Health Systems
Advanced Laboratory Analytics — A Disruptive Solution for Health SystemsAdvanced Laboratory Analytics — A Disruptive Solution for Health Systems
Advanced Laboratory Analytics — A Disruptive Solution for Health Systems
Viewics
 
Certified Nurse Leader (CNL) Capstone Project
Certified Nurse Leader (CNL) Capstone ProjectCertified Nurse Leader (CNL) Capstone Project
Certified Nurse Leader (CNL) Capstone Project
bdcw
 
Managing Readmissions: The Key to Impacting Your Bottom Line
Managing Readmissions: The Key to Impacting Your Bottom LineManaging Readmissions: The Key to Impacting Your Bottom Line
Managing Readmissions: The Key to Impacting Your Bottom Line
Fathom: A Digital Marketing Agency
 
MEDICAL AUDIT 3rd Ed.pptx
MEDICAL AUDIT 3rd Ed.pptxMEDICAL AUDIT 3rd Ed.pptx
MEDICAL AUDIT 3rd Ed.pptx
YezekaelSebastianQui
 
CPHQ Exam Sample Questions By Dr.Mahboob Khan Phd
CPHQ Exam Sample Questions By Dr.Mahboob Khan PhdCPHQ Exam Sample Questions By Dr.Mahboob Khan Phd
CPHQ Exam Sample Questions By Dr.Mahboob Khan Phd
Healthcare consultant
 
Measure For Improvement
Measure For ImprovementMeasure For Improvement
Measure For Improvement
Musa Abu Sbeih
 
Measure the Improvement
Measure the  ImprovementMeasure the  Improvement
Measure the Improvement
Musa Abu Sbeih
 
Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...
Marion Sills
 
Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6
Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6
Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6
Health Catalyst
 
Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10
Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10
Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10
Leonard Davis Institute of Health Economics
 
Decision Support System to Evaluate Patient Readmission Risk
Decision Support System to Evaluate Patient Readmission RiskDecision Support System to Evaluate Patient Readmission Risk
Decision Support System to Evaluate Patient Readmission Risk
Avishek Choudhury
 
PCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement Program
PCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement ProgramPCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement Program
PCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement Program
Julie Champagne
 
Introduction to Healthcare Analytics
Introduction to Healthcare Analytics Introduction to Healthcare Analytics
Introduction to Healthcare Analytics
Experfy
 

Similar to Data mining for diabetes readmission (20)

Data mining for diabetes readmission final
Data mining for diabetes readmission finalData mining for diabetes readmission final
Data mining for diabetes readmission final
 
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cPredicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
 
How Allina Health Uses Analytics to Transform Care - HAS Session 16
How Allina Health Uses Analytics to Transform Care - HAS Session 16How Allina Health Uses Analytics to Transform Care - HAS Session 16
How Allina Health Uses Analytics to Transform Care - HAS Session 16
 
Bigtown simulation model
Bigtown simulation modelBigtown simulation model
Bigtown simulation model
 
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
The Imperative of Linking Clinical and Financial Data to Improve Outcomes - H...
 
Critical Appraisal - Quantitative SS.pptx
Critical Appraisal - Quantitative SS.pptxCritical Appraisal - Quantitative SS.pptx
Critical Appraisal - Quantitative SS.pptx
 
Emerging Technologies and the Quality of Care
Emerging Technologies and the Quality of CareEmerging Technologies and the Quality of Care
Emerging Technologies and the Quality of Care
 
Advanced Laboratory Analytics — A Disruptive Solution for Health Systems
Advanced Laboratory Analytics — A Disruptive Solution for Health SystemsAdvanced Laboratory Analytics — A Disruptive Solution for Health Systems
Advanced Laboratory Analytics — A Disruptive Solution for Health Systems
 
Certified Nurse Leader (CNL) Capstone Project
Certified Nurse Leader (CNL) Capstone ProjectCertified Nurse Leader (CNL) Capstone Project
Certified Nurse Leader (CNL) Capstone Project
 
Managing Readmissions: The Key to Impacting Your Bottom Line
Managing Readmissions: The Key to Impacting Your Bottom LineManaging Readmissions: The Key to Impacting Your Bottom Line
Managing Readmissions: The Key to Impacting Your Bottom Line
 
MEDICAL AUDIT 3rd Ed.pptx
MEDICAL AUDIT 3rd Ed.pptxMEDICAL AUDIT 3rd Ed.pptx
MEDICAL AUDIT 3rd Ed.pptx
 
CPHQ Exam Sample Questions By Dr.Mahboob Khan Phd
CPHQ Exam Sample Questions By Dr.Mahboob Khan PhdCPHQ Exam Sample Questions By Dr.Mahboob Khan Phd
CPHQ Exam Sample Questions By Dr.Mahboob Khan Phd
 
Measure For Improvement
Measure For ImprovementMeasure For Improvement
Measure For Improvement
 
Measure the Improvement
Measure the  ImprovementMeasure the  Improvement
Measure the Improvement
 
Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...
 
Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6
Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6
Making Healthcare Waste Reduction and Patient Safety Actionable - HAS Session 6
 
Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10
Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10
Addressing Crowding by Smoothing Occupancy at Children's Hospitals 4.14.10
 
Decision Support System to Evaluate Patient Readmission Risk
Decision Support System to Evaluate Patient Readmission RiskDecision Support System to Evaluate Patient Readmission Risk
Decision Support System to Evaluate Patient Readmission Risk
 
PCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement Program
PCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement ProgramPCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement Program
PCMH: Part 4 – Learn How to Start or Improve Your Quality Improvement Program
 
Introduction to Healthcare Analytics
Introduction to Healthcare Analytics Introduction to Healthcare Analytics
Introduction to Healthcare Analytics
 

Recently uploaded

一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 

Recently uploaded (20)

一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 

Data mining for diabetes readmission

  • 1. Data Mining for Diabetes Readmission Prediction Team Evolution Yi Chun Chien, Xiayu Zeng, Hong Zhang, Yixi Zhang
  • 2. Agenda  Abstract  Introduction  Data Description  Problem Statement  Methodology  Discussions and Results  Conclusions 2
  • 3. Abstract Background: Alarmingly high risk of readmissions in the US Goal: discover factors contributing to hospitals readmissions Methods: C5.0 Decision Tree, Quest, Neural Network and Bayesian Network Results: emergency readmissions occur most frequently Conclusions: effective prediction on readmissions enables hospitals to identify and target patients at the highest risk 3
  • 4. Introduction  Topic: Diabetes Readmission Prediction  What Is Readmission Rate  A hospitalization that occurs within 30 days after a discharge  Why Is Readmission Important  Reduce cost of care and medical disputes  Improve patients’ safety and health 4
  • 5. Data Description  Data source: The data is from the Center for Clinical and Translational Research, Virginia Commonwealth University The dataset represents 10 years (1999- 2008) of clinical care at 130 US hospitals and integrated delivery networks. It includes101,766 instances and 55 features representing patient and hospital outcomes. Link: http://www.cioslab.vcu.edu/index.html Attribute Description Admission type patient's admission type: emergency, urgent, elective, newborn, not available, etc. Time in hospital Integer number of days between admission and discharge Number of lab procedures Number of lab tests performed during the encounter Number of outpatient visits Number of outpatient visits of the patient in the year preceding the encounter Number of emergency visits Number of emergency visits of the patient in the year preceding the encounter Number of inpatient visits Number of inpatient visits of the patient in the year preceding the encounter Number of diagnoses Number of diagnoses entered to the system A1c test result The range of the result or if the test was not taken Diabetes medications Whether there was any diabetic medication prescribed or not Readmitted Days to inpatient readmission Main attributes : 5
  • 6. Methodology  Business Outlook  Data Processing  Variable Selection  Accuracy Comparison  Conclusion  Model Building Problem Statement Data Pre-mining Perform AnalysisEvaluation 6
  • 7. Problem Statement  Identify the major factors that contribute to hospital readmissions  Measure the influence of every attribute  Compare accuracy of each model 7
  • 8. 0% 20% 40% 60% 80% 100% As-IS Now 11% 49% 89% 51% Patient Distribution Readmission < 30 days Non-within 30 days Data Pre-mining  Re-categorize Readmission group from 3 groups (<30, >30 days and No) to 2 groups (Readmission <30 days and Non-within 30 days)  Review all variables’ distribution to ensure data quality  Make Readmission group’s sample ratio be 1:1  Review each variable’s relationship with Readmission  Review numeric variables’ correlation 8
  • 9. Variable’s relationship with Readmission  Remove irrelevant variables (EX: patient ID, payer code)  Delete unknown value  Eliminate variables only has majority value 9
  • 10. Variable’s relationship with Readmission  Found some variables have different behavior between readmission group  Diagnoses Number  Admission Type ID  Inpatient Number 10
  • 11. Review numeric variables’ correlation  No correlation between numeric variables’ correlation  Variables are independent 11
  • 12. Perform Analysis  Input variables: 14; Output variable: Readmission group  Total sample size: 23,154  Partition: 70% on Training; 30% on Testing  Build models from (1) Decision Tree Analysis: C5.0 & QUEST (2) Apply Neural Network Methodology (3) Perform Bayes’ Analysis 12
  • 14. Hidden Layer=5 Hidden Layer=8 Neural Network - Comparison 14 Hidden layer = 5 Hidden layer = 8
  • 15. 13 Neural Network 15 Training data Testing data Model Performance:  66% accuracy in testing dataset Predictor Importance: Top 5:  Number of inpatient visits  Admission type  Admission source  Discharge disposition  Number of emergency visits
  • 16. Why Use Bayesian Network • Have probability for reference • Allow variables’ dependency • Well suited for categorical variables 16
  • 18. • Overall Model: Training accuracy: 70.67% Testing accuracy: 69.51% • Model on “Yes” results: Training accuracy: 71.35% Testing accuracy: 69.54%
  • 19. 19Decision Tree: C5.0 Model Performance:  65% accuracy in testing dataset Predictor Importance: Top 5:  Admission type  Admission source  Discharge disposition  Number of inpatient visits  Number of lab procedures
  • 21. Decision Tree: QUEST  What’s QUEST: Quick, Unbiased and Efficient Statistical Tree  Advantage:  Easily handle categorical predictor variables with many categories  Use imputation to deal with missing values  It provides linear splits using Fisher's LDA method  How to improve accuracy  Boosting: Randomly re-sample N dataset -> create N trees -> optimal by weighted average  Bagging: optimal by aggregated average 21
  • 22. Decision Tree: QUEST  Adopt Boosting method in QUEST  Model Performance:  72% accuracy in testing dataset  Variable Importance:  Top 5: Admission Type, Admission Source, Number of inpatient visits Discharge Deposition, Number of diagnoses 22 Training data Testing data
  • 23. Model Evaluation 23 QUEST C5.0 Neural Net Bayesian Admission Type Admission Type Inpatient # Discharge Disposition Admission Source Admission Source Admission Type Admission Source Inpatient # Discharge Disposition Admission Source Inpatient # Discharge Disposition Inpatient # Discharge Disposition Lab Procedures # Diagonose # Lab Procedures # Emergency # Admission Type Top Important Variables:  QUEST has the best accuracy • Top 4 variables: Admission Type, Admission Source, number of inpatient visits and Discharge disposition.
  • 24. NUMBER OF INPATIENT VISITS (Number of inpatient visits of the patient in the year preceding the encounter) From the graph, it’s shown that number of inpatient visits potentially increases the risk of readmission. For patients who stayed in a hospital over 15 times in a year, they would be 100% readmitted to the hospital within 30 days. Inpatient treatment facilities should better provide additional and more specialized medical care to reduce their readmission rate. Evaluation of top 4 important attributes
  • 25. 25ADMISSION TYPE (1.Emergency, 2.Urgent, 3.Elective, 4.Newborn, 5.Not Available, 6.NULL, 7.Trauma Center, 8.Not Mapped ) ADMISSION SOURCE (physician referral, emergency room, and transfer from a hospital, etc.) DISCHARGE DISPOSITION (discharged to home, expired, and Hospice / home, etc. )
  • 26. Decision Tree by JMP Decision tree(part)
  • 27. 27 27Conclusion • Data pre-mining is of upmost importance in improving the model accuracy (1%72%); • The readmission groups are related to admission source,admission type, discharge disposition and number of inpatient visits; • The readmission groups do not solely depend on any single variable, but the interactions of related variables; • Instead of tracking all 55 attributes, hospitals are suggested to focus on number of patient’s inpatient visits, admission source, admission type, discharge disposition; • Hospitals are advised to concern not only inpatient treatment but also continuing care after discharge.

Editor's Notes

  1. From the perspective of number_inpatient, we can see that when number_inpatient is less than 3, the readimission rate is relatively low. By contrast, when it is higher or equal to three, the readmission rate becomes very high. And it even goes to 100 percent when it is larger than or equals to 14.So, the hospitals should aimed at increasing the number of inpatient visits of the patient in the year preceding the encounter to increase the readmission rate of the patients, thus realizing their business goal.