SlideShare a Scribd company logo
1 of 9
PROJECT NAME:
HEALTHCARE COST ANALYSIS
Trainer Name: Deepti Miyan Gupta
Submitted By: Prasann Prem
A nationwide survey of hospital costs conducted by the US
Agency for Healthcare consists of hospital records of inpatient
samples. The analysis is being done on data records of the
hospital in Wisconsin and our main aim is to predict the
healthcare cost.
BUSINESS
PROBLEM
2
DATAOVERVIEW:
* According to a Survey
3
 Dataset dimensions:
Total no of rows: 151
Total no of columns: 6
 Dependent variable :
TOTCHG – Actual total healthcare cost
 Independent variables:
AGE: Value ranges from 0 to 17
FEMALE: If yes, then ‘1’ else ‘0’
LOS: Length of stay ranged from 0 to 41
RACE: 6 unique race valued 1 to 6
APRDRG: All Patient Refined Diagnosis Related Groups
BUSINESS SOLUTION:
4
 Since the data has multiple independent variables and has
continuous values, we will use Multi Linear Regression (MLR)
Algorithm.
 We will split the dataset in 70:30 ratio and train the model by
70% of data and predict the healthcare cost on 30% data.
MODEL INTERPRETATION:
5
 Setting the significant value at 95%, we looked for
variables with p-value < 0.05 to find out the
significant variables.
 We found that AGE, LOS AND APRDRG have only
p-value less than 0.05, and thus these are our
significant variables.
 Therefore, we will rebuild our model using only
these significant variables.
 Also, while looking at slope(Estimate) value, we
found the following relation between independent
and dependent variable:
1. AGE and LOS are directly proportional to TOTCHG
2. FEMALE, RACE & APRDRG are inversely
proportional to TOTCHG
MODEL RE-BUILDING:
6
 Using only significant variables from our last model, we
re-built the lm model and found that all the variables
have p-value less than 0.05.
 Thus, the model built with these variables are our final
model to predict our problem
 While looking at slope(Estimate) value, we found the
following relation between independent and dependent
variable:
1. AGE and LOS are directly proportional to TOTCHG
2. APRDRG is inversely proportional to TOTCHG
 Also, we got our R-squared value = 0.4434. This
means our current data and independent variables are
able to explain 44.34% of dependent variable only.
PREDICTION RESULTS:
7
 Based on our final model, we predicted the total
healthcare cost (predtest) and calculated residual
value and mean squared error (MSE).
 Our MSE value turned out to be 3825548.
 Also, we plotted our actual healthcare cost (TOTCHG)
against predicted healthcare cost (predtest) to see the
trend between them.
CONCLUSION:
8
 Our MSE value equals to 3825548, which is very high
and signifies the low accuracy of our predicted result.
 The plotted graph also shows the inaccuracy between
actual (TOTCHG) and predicted (predtest) healthcare
cost.
 We also calculated the R-squared value which turned
out to be only 44.34%
Thus, we can conclude that our current model is
insufficient to predict healthcare cost accurately.
This is because our dataset is very small and
current variables are not able to completely
explain our dependent variable.
With a rich dataset having more features and
information, we are likely to get a good result
with the same model.
THANK
YOU
Prasann Prem
prasannprem@live.com
+91-8006869552
DATA ANALYST

More Related Content

What's hot

StatVignette01-Correlation_06_15_2020.pptx
StatVignette01-Correlation_06_15_2020.pptxStatVignette01-Correlation_06_15_2020.pptx
StatVignette01-Correlation_06_15_2020.pptxSERC at Carleton College
 
DATA SCIENCE - Outlier detection and treatment_ sachin pathania
DATA SCIENCE - Outlier detection and treatment_ sachin pathaniaDATA SCIENCE - Outlier detection and treatment_ sachin pathania
DATA SCIENCE - Outlier detection and treatment_ sachin pathaniaSachin Pathania
 
Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation Qasim Raza
 
Dependence Techniques
Dependence Techniques Dependence Techniques
Dependence Techniques Hasnain Khan
 
Interpretation and analysis
Interpretation and analysisInterpretation and analysis
Interpretation and analysisjavaidji
 
Assignment 3
Assignment 3  Assignment 3
Assignment 3 NURINCHO
 
ECON104RoughDraft1
ECON104RoughDraft1ECON104RoughDraft1
ECON104RoughDraft1John Nguyen
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis pptDavid Jaison
 
2.3 the simple regression model
2.3 the simple regression model2.3 the simple regression model
2.3 the simple regression modelRegmi Milan
 

What's hot (19)

StatVignette01-Correlation_06_15_2020.pptx
StatVignette01-Correlation_06_15_2020.pptxStatVignette01-Correlation_06_15_2020.pptx
StatVignette01-Correlation_06_15_2020.pptx
 
DATA SCIENCE - Outlier detection and treatment_ sachin pathania
DATA SCIENCE - Outlier detection and treatment_ sachin pathaniaDATA SCIENCE - Outlier detection and treatment_ sachin pathania
DATA SCIENCE - Outlier detection and treatment_ sachin pathania
 
Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation
 
Correlation
CorrelationCorrelation
Correlation
 
Regression project
Regression projectRegression project
Regression project
 
Dependence Techniques
Dependence Techniques Dependence Techniques
Dependence Techniques
 
Correlation
CorrelationCorrelation
Correlation
 
Regression
RegressionRegression
Regression
 
Lisrel
LisrelLisrel
Lisrel
 
Bias in Research Methods
Bias in Research Methods Bias in Research Methods
Bias in Research Methods
 
Interpretation and analysis
Interpretation and analysisInterpretation and analysis
Interpretation and analysis
 
Research meathodology
Research meathodologyResearch meathodology
Research meathodology
 
Data Applied: Correlation
Data Applied: CorrelationData Applied: Correlation
Data Applied: Correlation
 
Correlation analysis
Correlation analysis  Correlation analysis
Correlation analysis
 
R - Multiple Regression
R - Multiple RegressionR - Multiple Regression
R - Multiple Regression
 
Assignment 3
Assignment 3  Assignment 3
Assignment 3
 
ECON104RoughDraft1
ECON104RoughDraft1ECON104RoughDraft1
ECON104RoughDraft1
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
 
2.3 the simple regression model
2.3 the simple regression model2.3 the simple regression model
2.3 the simple regression model
 

Similar to Wisconsin hospital - Healthcare Cost Prediction

Add slides
Add slidesAdd slides
Add slidesRupa D
 
Statistical analysis of Multiple and Logistic Regression
Statistical analysis of Multiple and Logistic RegressionStatistical analysis of Multiple and Logistic Regression
Statistical analysis of Multiple and Logistic RegressionSindhujanDhayalan
 
Econometrics project
Econometrics projectEconometrics project
Econometrics projectShubham Joon
 
Hypothesis Tests in R Programming
Hypothesis Tests in R ProgrammingHypothesis Tests in R Programming
Hypothesis Tests in R ProgrammingAtacan Garip
 
Predicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian VallesPredicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian VallesAdrián Vallés
 
Introduction to Econometrics for under gruadute class.pptx
Introduction to Econometrics for under gruadute class.pptxIntroduction to Econometrics for under gruadute class.pptx
Introduction to Econometrics for under gruadute class.pptxtadegebreyesus
 
Atharva_Joshis_Presentation_on_Regression.pptx
Atharva_Joshis_Presentation_on_Regression.pptxAtharva_Joshis_Presentation_on_Regression.pptx
Atharva_Joshis_Presentation_on_Regression.pptxAtharva Joshi
 
Meta-Analysis in Ayurveda
Meta-Analysis in AyurvedaMeta-Analysis in Ayurveda
Meta-Analysis in AyurvedaAyurdata
 
Statistical distributions
Statistical distributionsStatistical distributions
Statistical distributionsTanveerRehman4
 
statistical learning theory
statistical learning theorystatistical learning theory
statistical learning theoryHarshKumar943076
 
Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...
Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...
Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...Madeleine Organ
 
2014 IIAG Imputation Assessments
2014 IIAG Imputation Assessments2014 IIAG Imputation Assessments
2014 IIAG Imputation AssessmentsDr Lendy Spires
 
Trochim, W. M. K. (2006). Internal validity.httpwww.socialres
Trochim, W. M. K. (2006). Internal validity.httpwww.socialresTrochim, W. M. K. (2006). Internal validity.httpwww.socialres
Trochim, W. M. K. (2006). Internal validity.httpwww.socialrescurranalmeta
 
Boost model accuracy of imbalanced covid 19 mortality prediction
Boost model accuracy of imbalanced covid 19 mortality predictionBoost model accuracy of imbalanced covid 19 mortality prediction
Boost model accuracy of imbalanced covid 19 mortality predictionBindhuBhargaviTalasi
 

Similar to Wisconsin hospital - Healthcare Cost Prediction (20)

Add slides
Add slidesAdd slides
Add slides
 
200994363
200994363200994363
200994363
 
Statistical analysis of Multiple and Logistic Regression
Statistical analysis of Multiple and Logistic RegressionStatistical analysis of Multiple and Logistic Regression
Statistical analysis of Multiple and Logistic Regression
 
B025209013
B025209013B025209013
B025209013
 
Econometrics project
Econometrics projectEconometrics project
Econometrics project
 
Hypothesis Tests in R Programming
Hypothesis Tests in R ProgrammingHypothesis Tests in R Programming
Hypothesis Tests in R Programming
 
Wine.Final.Project.MJv3
Wine.Final.Project.MJv3Wine.Final.Project.MJv3
Wine.Final.Project.MJv3
 
Predicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian VallesPredicting breast cancer: Adrian Valles
Predicting breast cancer: Adrian Valles
 
Introduction to Econometrics for under gruadute class.pptx
Introduction to Econometrics for under gruadute class.pptxIntroduction to Econometrics for under gruadute class.pptx
Introduction to Econometrics for under gruadute class.pptx
 
Atharva_Joshis_Presentation_on_Regression.pptx
Atharva_Joshis_Presentation_on_Regression.pptxAtharva_Joshis_Presentation_on_Regression.pptx
Atharva_Joshis_Presentation_on_Regression.pptx
 
Meta-Analysis in Ayurveda
Meta-Analysis in AyurvedaMeta-Analysis in Ayurveda
Meta-Analysis in Ayurveda
 
Statistical distributions
Statistical distributionsStatistical distributions
Statistical distributions
 
statistical learning theory
statistical learning theorystatistical learning theory
statistical learning theory
 
Sem with amos ii
Sem with amos iiSem with amos ii
Sem with amos ii
 
Two sample t-test
Two sample t-testTwo sample t-test
Two sample t-test
 
Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...
Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...
Predicting Hospital Productivity from Capacity Metrics - A Linear Regression ...
 
2014 IIAG Imputation Assessments
2014 IIAG Imputation Assessments2014 IIAG Imputation Assessments
2014 IIAG Imputation Assessments
 
mining
miningmining
mining
 
Trochim, W. M. K. (2006). Internal validity.httpwww.socialres
Trochim, W. M. K. (2006). Internal validity.httpwww.socialresTrochim, W. M. K. (2006). Internal validity.httpwww.socialres
Trochim, W. M. K. (2006). Internal validity.httpwww.socialres
 
Boost model accuracy of imbalanced covid 19 mortality prediction
Boost model accuracy of imbalanced covid 19 mortality predictionBoost model accuracy of imbalanced covid 19 mortality prediction
Boost model accuracy of imbalanced covid 19 mortality prediction
 

Recently uploaded

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 

Recently uploaded (20)

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 

Wisconsin hospital - Healthcare Cost Prediction

  • 1. PROJECT NAME: HEALTHCARE COST ANALYSIS Trainer Name: Deepti Miyan Gupta Submitted By: Prasann Prem
  • 2. A nationwide survey of hospital costs conducted by the US Agency for Healthcare consists of hospital records of inpatient samples. The analysis is being done on data records of the hospital in Wisconsin and our main aim is to predict the healthcare cost. BUSINESS PROBLEM 2
  • 3. DATAOVERVIEW: * According to a Survey 3  Dataset dimensions: Total no of rows: 151 Total no of columns: 6  Dependent variable : TOTCHG – Actual total healthcare cost  Independent variables: AGE: Value ranges from 0 to 17 FEMALE: If yes, then ‘1’ else ‘0’ LOS: Length of stay ranged from 0 to 41 RACE: 6 unique race valued 1 to 6 APRDRG: All Patient Refined Diagnosis Related Groups
  • 4. BUSINESS SOLUTION: 4  Since the data has multiple independent variables and has continuous values, we will use Multi Linear Regression (MLR) Algorithm.  We will split the dataset in 70:30 ratio and train the model by 70% of data and predict the healthcare cost on 30% data.
  • 5. MODEL INTERPRETATION: 5  Setting the significant value at 95%, we looked for variables with p-value < 0.05 to find out the significant variables.  We found that AGE, LOS AND APRDRG have only p-value less than 0.05, and thus these are our significant variables.  Therefore, we will rebuild our model using only these significant variables.  Also, while looking at slope(Estimate) value, we found the following relation between independent and dependent variable: 1. AGE and LOS are directly proportional to TOTCHG 2. FEMALE, RACE & APRDRG are inversely proportional to TOTCHG
  • 6. MODEL RE-BUILDING: 6  Using only significant variables from our last model, we re-built the lm model and found that all the variables have p-value less than 0.05.  Thus, the model built with these variables are our final model to predict our problem  While looking at slope(Estimate) value, we found the following relation between independent and dependent variable: 1. AGE and LOS are directly proportional to TOTCHG 2. APRDRG is inversely proportional to TOTCHG  Also, we got our R-squared value = 0.4434. This means our current data and independent variables are able to explain 44.34% of dependent variable only.
  • 7. PREDICTION RESULTS: 7  Based on our final model, we predicted the total healthcare cost (predtest) and calculated residual value and mean squared error (MSE).  Our MSE value turned out to be 3825548.  Also, we plotted our actual healthcare cost (TOTCHG) against predicted healthcare cost (predtest) to see the trend between them.
  • 8. CONCLUSION: 8  Our MSE value equals to 3825548, which is very high and signifies the low accuracy of our predicted result.  The plotted graph also shows the inaccuracy between actual (TOTCHG) and predicted (predtest) healthcare cost.  We also calculated the R-squared value which turned out to be only 44.34% Thus, we can conclude that our current model is insufficient to predict healthcare cost accurately. This is because our dataset is very small and current variables are not able to completely explain our dependent variable. With a rich dataset having more features and information, we are likely to get a good result with the same model.