Project presentation of
“HEALTH PREDICTION ANALYSIS USING DATA MINING”
Presented by
Name                         Roll no
Kritika Ashok Rane           28
Ashish Ravindra Salve        30
Ashwini Dhananjay Sawant     31
Under Guidance Of
Prof. J. P. Patil
• Why data mining is needed in healthcare
1. Applying data mining to healthcare has many positive and even life-saving outcomes.
2. Data mining is the process of discovering useful patterns in the vast quantities of information created by the digitization of everything, which is consolidated and analyzed by specific technologies.
3. The costs of treatment are much higher than they should be, and they have been rising for the past 20 years. Clearly, we need some smart, data-driven thinking in this area.
• How it will affect patients' health and wealth
1. Applied to healthcare, data mining uses the health data of a population (or of a particular individual) and can potentially help to prevent epidemics, cure disease, cut down costs, etc.
2. Data mining has become an increasingly pervasive activity in all areas of medical science research and has resulted in the discovery of useful hidden patterns in massive databases. With data mining techniques, physicians can identify and diagnose potential cases more quickly.
Rural areas have relatively few healthcare facilities, and doctor
availability is very poor. There may, however, be people with reasonably
sufficient knowledge of pharmaceuticals who can treat a patient in an
emergency if they know what is likely to happen to the patient. So we
need something that predicts, in less time, what will happen to the
patient and so helps to save the patient's life. Our main motive for
doing this project is to help needy people.
• Hospital management
The ability to detect anomalous behaviour from purchase, usage and other transactional information has made data mining a key tool in a variety of organizations for detecting fraudulent claims, inappropriate prescriptions and other abnormal behavioural patterns.
• Healthcare management
The healthcare industry today generates large amounts of complex data about patients, hospital resources, disease diagnosis, electronic patient records, medical devices, etc. This data is a key resource to be processed and analyzed for knowledge extraction that supports cost savings and decision making.
• Pharmaceutical industry
When there is no decisive evidence favouring a particular treatment option, new treatment plans can be effectively suggested based on the patient's profile, history, physical examination and diagnosis, and on previous treatment patterns.
• Personalized treatment planning
Healthcare organizations make customer relationship management decisions, physicians identify effective treatments and best practices, and patients receive better and more affordable healthcare services.
• Prediction of diseases
• Monitoring patients' vital signs, and many more
• Personalized medicine
One of the top goals is to create a personalised treatment plan based on individual biology.
• Predictive analytics and preventive measures
Prevention is always better than cure. For the healthcare industry, it also happens to save a lot of money.
• The ultimate EHR (electronic health record), also referred to as EMR (electronic medical record)
This precious file would contain every piece of information about a patient's health, would always be up to date, and could be shared across any network.
• Disease modelling and mapping
One of the flashiest uses of data science in the past few years has been in tracking diseases and finding ways to halt or prevent them.
• Reduce fraud and enhance security
This particular industry is 200% more likely to experience data breaches than any other industry, because personal data is extremely valuable and profitable on black markets.
Applying data mining in healthcare has many positive and even
life-saving outcomes. So we are going to develop a model which predicts
the health of a patient from the patient's medical history, stored as
an EHR in the database of the hospital or healthcare organization, so
that specialists can predict the disease in less time and give the
patient the proper treatment.
“Application of Big Data in Medical Science Brings Revolution in Managing Health Care of Humans”
by Dr. Gagandeep Jagdev (2015)
Domain: Big data
Task Performed:
• Personalized treatment planning
• Assisted diagnosis
• Fraud detection
• Monitoring patient vital signs
• Digitization of data
Technology Used:
• First stage: mapping
• Intermediate stages: shuffling
• Final stage: reducing
Provide Information About:
• Various diseases
• Patient treatment
• Hospital management info
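The three stages listed under "Technology Used" (mapping, shuffling, reducing) can be sketched in a few lines of plain Python. This is only an illustration of the general pattern, not code from the paper: the records, the function names and the "count diagnoses" task are all hypothetical.

```python
from collections import defaultdict

# Hypothetical (patient_id, diagnosis) records; not taken from the paper.
records = [
    ("p1", "diabetes"),
    ("p2", "heart disease"),
    ("p3", "diabetes"),
    ("p4", "asthma"),
    ("p5", "heart disease"),
    ("p6", "diabetes"),
]

def map_stage(record):
    """Mapping: turn one input record into a (key, value) pair."""
    _patient_id, diagnosis = record
    return (diagnosis, 1)

def shuffle_stage(pairs):
    """Shuffling: group together all values that share the same key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_stage(groups):
    """Reducing: collapse each group of values into a single result."""
    return {key: sum(values) for key, values in groups.items()}

mapped = [map_stage(r) for r in records]
counts = reduce_stage(shuffle_stage(mapped))
print(counts)
```

In a real big-data system the three stages run in parallel across many machines; here they run one after another in a single process only to show the data flow.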
“Data Mining Applications in Healthcare Sector” by M. Sumanth
Domain: Data mining
Task Performed:
• Treatment management
• Healthcare management
• Customer relationship management
• Fraud and abuse
• Medical device industry
• Pharmaceutical industry
• Systems biology
• Hospital management
Technology Used: not applicable
Provide Information About:
• Patient treatment
“Hybrid Approach for Heart Disease Detection Using Clustering and ANN”
by Neha Chikshe, Tejasweeta Dixit, Rashmi Gore, Prerana Akade (2016)
Domain: Clustering and ANN
Task Performed: Prediction of heart disease
Technology Used:
• Clustering
• Neural networks
• Naive Bayes
• Decision tree
• K-nearest neighbour
Provide Information About: Heart disease
“Analysis of Data Mining Techniques for Healthcare Decision Support System Using Liver Disorder Dataset”
by Tapas Ranjan Baitharu, Subhendu Kumar Pani (2016)
Domain: Data mining, ANN
Task Performed: Liver disease prediction analysis
Technology Used:
• Naïve Bayes
• Multilayer Perceptron
• ZeroR
Provide Information About:
Comparison of various algorithms
• Hardware Requirements:
1. 1.5 GHz dual-core CPU
2. 4 GB RAM
3. 1024x768 minimum screen resolution
4. 10 GB of hard disk space
• Software Requirements:
1. Microsoft Windows 7 or later
2. XAMPP web server
3. Text editor (Notepad)
4. Anaconda
We are using the following algorithms to implement
this project:
• Decision tree
• Logistic regression
• K-nearest neighbours
• Naive Bayes classifier
Decision trees are commonly used in operations research, specifically in
decision analysis, to help identify a strategy most likely to reach a goal, but
are also a popular tool in data mining.
Example:-
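As a minimal sketch of the idea (not the project's own code), the snippet below finds the single best split for one feature, which is the step a decision tree repeats at every node. It assumes Gini impurity as the split criterion; the feature values and labels are hypothetical toy data.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 means a pure group)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def best_split(values, labels):
    """Return (threshold, weighted_gini) for the best rule 'value <= threshold'."""
    best = (None, float("inf"))
    # Skip the largest value: splitting there would leave the right side empty.
    for t in sorted(set(values))[:-1]:
        left = [l for v, l in zip(values, labels) if v <= t]
        right = [l for v, l in zip(values, labels) if v > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if score < best[1]:
            best = (t, score)
    return best

def majority(labels):
    """Most common label in a group; used as the prediction of a leaf."""
    return Counter(labels).most_common(1)[0][0]

# Hypothetical toy data: maximum heart rate and disease label (0 absent, 1 present).
max_heart_rate = [172, 160, 158, 150, 132, 125, 120, 108]
disease = [0, 0, 0, 0, 1, 1, 1, 1]

threshold, impurity = best_split(max_heart_rate, disease)
left_label = majority([l for v, l in zip(max_heart_rate, disease) if v <= threshold])
right_label = majority([l for v, l in zip(max_heart_rate, disease) if v > threshold])

def predict(value):
    """One-level tree: follow the branch chosen by the learned threshold."""
    return left_label if value <= threshold else right_label

print(threshold, impurity, predict(110), predict(165))
```

A full decision tree applies the same search recursively to each branch and over all features, stopping when a group is pure or too small to split.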
• Logistic regression is a classification algorithm that is used where
the response variable is categorical. The idea of logistic regression is
to find a relationship between the features and the probability of a
particular outcome.
• Why we are using logistic regression:
With binary classification, let ‘x’ be some feature and ‘y’ be the
output, which can be either 0 or 1. So, based on training data, we can
predict whether a patient's records indicate that the person has the
disease or not.
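A minimal sketch of this relationship for a single feature: the model estimates the probability of y = 1 as sigmoid(w*x + b) and learns w and b from training data. The learning rate, epoch count, 0.5 decision threshold and the toy data (a standardised feature) are assumptions made for illustration, not values from the project.

```python
import math

def sigmoid(z):
    """Map any real number to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.1, epochs=2000):
    """Fit p(y=1|x) = sigmoid(w*x + b) by batch gradient descent on the log-loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Prediction error for every training example with the current w and b.
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        w -= lr * sum(e * x for e, x in zip(errors, xs)) / n
        b -= lr * sum(errors) / n
    return w, b

# Hypothetical training data: x is a standardised feature, y is 0 or 1.
xs = [-2.0, -1.5, -1.0, -0.5, 0.5, 1.0, 1.5, 2.0]
ys = [0, 0, 0, 1, 0, 1, 1, 1]

w, b = fit_logistic(xs, ys)

def predict(x, threshold=0.5):
    """Return (probability of disease, predicted class 0 or 1)."""
    p = sigmoid(w * x + b)
    return p, int(p >= threshold)

print(predict(-2.0), predict(2.0))
```

The output is a probability first and a class second, which is why the method suits a yes/no question such as disease present or absent.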
A naive Bayes classifier is an algorithm that uses Bayes' theorem to classify objects.
Naive Bayes classifiers assume strong, or naive, independence between the attributes of
data points. Naive Bayes is also known as simple Bayes or independence Bayes.
Formula:-
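The formula meant here is presumably Bayes' theorem combined with the naive independence assumption. In its standard form, for a class C and attribute values x_1, ..., x_n (symbols chosen here for illustration):

```latex
P(C \mid x_1, \dots, x_n)
  = \frac{P(C) \prod_{i=1}^{n} P(x_i \mid C)}{P(x_1, \dots, x_n)}
\qquad
\hat{C} = \arg\max_{C} \; P(C) \prod_{i=1}^{n} P(x_i \mid C)
```

The denominator is the same for every class, so the classifier only needs to compare the numerators and pick the class with the largest one.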
K-nearest neighbours (KNN) is a simple algorithm that stores all available
cases and classifies new cases based on a similarity measure (e.g., distance
functions). KNN has been used in statistical estimation and pattern
recognition.
Example:-
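A minimal sketch of the algorithm, assuming Euclidean distance as the similarity measure and k = 3. The training cases (age, resting blood pressure) and their labels are hypothetical toy values.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest stored cases."""
    # Distance from the query to every stored case, nearest first.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical stored cases: (age, resting blood pressure) -> 0 absent, 1 present.
train_X = [(45, 120), (50, 125), (48, 118), (62, 150), (65, 160), (70, 155)]
train_y = [0, 0, 0, 1, 1, 1]

print(knn_predict(train_X, train_y, (47, 122)))
print(knn_predict(train_X, train_y, (66, 152)))
```

Because the vote depends on raw distances, features measured on very different scales are usually rescaled first so that no single attribute dominates.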
• Programming Languages:
1. HTML & CSS: for designing the interface
2. PHP: for connecting the user interface with the database
3. Python: for mining data and generating results
4. SQL: for database management
• Python Libraries Required:
1. Pandas: for data manipulation and analysis
2. Seaborn: for data visualization
3. scikit-learn (sklearn): for data mining
• We used the Statlog (Heart) Data Set from the UCI Machine Learning Repository.
This data source contains 13 attributes:
1. age
2. sex
3. chest pain type (4 values)
4. resting blood pressure
5. serum cholesterol in mg/dl
6. fasting blood sugar > 120 mg/dl
7. resting electrocardiographic results (values 0, 1, 2)
8. maximum heart rate achieved
9. exercise induced angina
10. oldpeak = ST depression induced by exercise relative to rest
11. the slope of the peak exercise ST segment
12. number of major vessels (0-3) colored by fluoroscopy
13. thal: 3 = normal; 6 = fixed defect; 7 = reversible defect
Attribute types:
Real: 1, 4, 5, 8, 10, 12
Ordered: 11
Binary: 2, 6, 9
Nominal: 7, 3, 13
Variable to be predicted:
Absence (1) or presence (2) of heart disease
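The layout above can be turned into a small parsing sketch. It assumes each record is one line of 14 whitespace-separated numbers (the 13 attributes followed by the target); the short column names are our own labels and the two sample lines are made-up values in that layout, not rows copied from the dataset.

```python
# Short column names are our own labels for the 13 attributes plus the target.
COLUMNS = [
    "age", "sex", "chest_pain", "rest_bp", "cholesterol", "fasting_bs",
    "rest_ecg", "max_hr", "exercise_angina", "oldpeak", "slope",
    "vessels", "thal", "target",
]

# Attribute types as listed above (1-based attribute numbers).
ATTRIBUTE_TYPES = {
    "real": [1, 4, 5, 8, 10, 12],
    "ordered": [11],
    "binary": [2, 6, 9],
    "nominal": [7, 3, 13],
}

def parse_row(line):
    """Parse one whitespace-separated record into a dict keyed by column name."""
    values = [float(v) for v in line.split()]
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} values, got {len(values)}")
    row = dict(zip(COLUMNS, values))
    # The dataset codes absence as 1 and presence as 2; remap to 0 and 1.
    row["target"] = int(row["target"]) - 1
    return row

# Made-up sample lines in the assumed layout (not real dataset rows).
sample_lines = [
    "63 1 4 140 260 0 2 120 1 2.0 2 1 7 2",
    "45 0 2 120 210 0 0 165 0 0.0 1 0 3 1",
]
rows = [parse_row(line) for line in sample_lines]
print(rows[0]["age"], rows[0]["target"], rows[1]["target"])
```

Remapping the target to 0 and 1 keeps it consistent with the binary output described for logistic regression.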
EHR creation interface
Doctor dashboard showing all records created by the doctor.
Doctor dashboard showing the predictive result for a patient, including the patient's reports.
Patient dashboard showing the predictive result and report to the patient.
• We observed that each algorithm has its own unique properties, which
determine its accuracy.
• So we used an election approach to state the results: the system bases
its prediction on the mean of the results from the various algorithms,
which helps to give a more accurate and appropriate result and also
makes the application more robust.
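The election approach can be sketched as follows, assuming each algorithm reports a probability that the disease is present and that a mean of 0.5 or more counts as "present". The model names, the probabilities and the threshold are hypothetical; the project's actual combination code is not shown in the slides.

```python
def ensemble_predict(probabilities, threshold=0.5):
    """Average the per-model probabilities and turn the mean into a 0/1 decision."""
    mean_p = sum(probabilities.values()) / len(probabilities)
    return mean_p, int(mean_p >= threshold)

# Hypothetical outputs of the four algorithms for one patient.
model_outputs = {
    "decision_tree": 1.0,
    "logistic_regression": 0.72,
    "knn": 0.6,
    "naive_bayes": 0.4,
}

mean_p, label = ensemble_predict(model_outputs)
print(round(mean_p, 2), label)
```

Here one model disagrees with the other three, but the averaged result still follows the majority, which is the robustness the election approach aims for.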
• In this system, we used different data mining algorithms and
calculated their accuracy, which is given above. We observed that
each algorithm has its own unique properties, which determine its
accuracy. So the system bases its prediction on the mean of the
results from the various algorithms, which helps to give a more
accurate and appropriate result.
• M. Sumanth. “Data Mining Applications in Healthcare Sector”.
• Neha Chikshe, Tejasweeta Dixit, Rashmi Gore, Prerana Akade (2016). “Hybrid Approach for Heart Disease Detection Using Clustering and ANN”. IJRITCC, Jan 2016, Volume 4, Issue 1.
• Dr. Gagandeep Jagdev (2015). “Application of Big Data in Medical Science Brings Revolution in Managing Health Care of Humans”. IJEEE, Jan 2015, Volume 2, Spl. Issue 1.
• Tapas Ranjan Baitharu, Subhendu Kumar Pani (2016). “Analysis of Data Mining Techniques for Healthcare Decision Support System Using Liver Disorder Dataset”.
Books:
Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani. “An Introduction to Statistical Learning”. Springer.
Websites:
1. www.ijritcc.com/index.php/ijritcc/article/view/1718, accessed on 17/08/18
2. www.slideshare.net/madallapallisumanth/data mininginhealthcaresector, accessed on 17/08/18
3. www.issuu.com/ijeeeapm/docs/id77, accessed on 17/08/18
4. www.immagic.com/eLibrary/ARCHIVES/GENERAL/WIKIPEDI/W1120615B.pdf, accessed on 31/08/18
5. www.Wikipedia.com, accessed on 05/10/18
6. www.saedsayad.com, accessed on 10/10/18
7. www.scikit-learn.org/stable/modules/generated/sklearn.metrics.confusionmatrix.html, accessed on 9/01/19
8. www.python.org, accessed on 15/01/19
9. www.archive.ics.uci.edu/ml/datasets/Heart + Disease, accessed on 28/01/19


HEALTH PREDICTION ANALYSIS USING DATA MINING

  • 1. Project presentation of “ HEALTH PREDICTION ANALYSIS USING DATA MINING “ Presented by Name Roll no Kritika Ashok Rane. 28 Ashish Ravindra Salve. 30 Ashwini Dhananjay Sawant. 31 Under Guidance Of Prof. J. P. Patil
  • 2.  What is Need of data mining in healthcare. 1. The application of Data mining healthcare has a lot of positive and also life-saving outcomes. 2. Data mining refers to the vast quantities of information created by the digitization of everything, that gets consolidated and analyzed by specific technologies. 3. The costs of treatments are much higher than they should be, and they have been rising for the past 20 years. Clearly, we need some smart, data-driven thinking in this area.  How it will affect on patient’s health and wealth. 1. Applied to healthcare, it will use specific health data of a population (or of a particular individual) and potentially help to prevent epidemics, cure disease, cut down costs, etc. 2. Data mining has become an increasingly pervasive activity in all areas medical science research. Data mining has resulted in tha discovery of useful hidden patterns from massive databases. By using data mining techniques finally physicians need to know how quickly identify and diagnose potential cases.
  • 3. Rural Areas get relatively less healthcare facilities and doctor availability is very poor but there may be some people who might have relatively sufficient knowledge about pharmaceuticals and they can treat the patient in urgent basis using their knowledge of what may happen to the patient . So we need something that predict what will happen to the patient in less time and save patient’s life. So our main moto of doing this project is to help the needy people.
  • 4.  Hospital management The ability to detect anomalous behaviour based on purchase, usage and other transactional behaviour information has made data mining a key tool in variety of organizations to detect fraudulent claims, inappropriate prescriptions and other abnormal behavioural patterns.  Healthcare management Healthcare industry today generates large amounts of complex data about patients, hospitals resources, disease diagnosis, electronic patient records, medical devices etc. The large amounts of data is a key resource to be processed and analyzed for knowledge extraction that enables support for cost-savings and decision making  Pharmaceutical industry When there is no dispositive evidence favouring a particular treatment option Based on patients’ profile, history, physical examination, diagnosis and utilizing previous treatment patterns, new treatment plans can be effectively suggested  Personalized treatment planning Healthcare organizations make customer relationship management decisions, Physicians identify effective treatments and best practices, and Patients receive better and more affordable healthcare services  Prediction of diseases  Monitor patient’s vital signs and many more….
  • 5.  Personalized Medicine One of the top goal is to create a personalised treatment plan based on individual biology.  Predictive Analytics And Preventive Measures Prevention is always better than cure. For the healthcare industry, it also happens to save a lot of money.  The Ultimate EHR ( electronic health record ) also referred as EMR (electronic medical record) This precious file would contain every piece of information about patient’s health, would always be up to date could be shared across any network.  Disease Modelling and Mapping One of the flashiest uses of data science in the past few years has been in tracking, finding ways ton halt or prevent diseases.  Reduce Fraud And Enhance Security This particular industry is 200% more likely to experience data breaches than any other industry because personal data is extremely valuable and profitable in black markets.
  • 6. The application of Data Mining in healthcare has a lot of positive and also life-saving outcomes. So we are going to develop a model which predicts the health of the patient through the medical history of patient stored in the database as EHR in the hospital or particular healthcare organization so the specialists can predict the disease in less time and should give the proper treatment to the patient.
  • 7.
  • 8. “Application of big data in medical science Brings revolution in managing health Care of humans” by Dr. Gagandeep Jagdev (2015) Domain Big data Task Performed  Personalized treatment planning  Assisted diagnosis  Fraud detection  Monitor patient vital signs  Digitization of data Technology Used  First stage: mapping  Intermediate stages: shuffling  Final stage: reducing Provide Information About  Various disease  Patient treatment  Hospital management info
  • 9. “Data Mining Applications in Healthcare Sector” by M. Sumanth Domain Data mining Task Performed  Treatment management  Healthcare management  Customer relationship management  Fraud and abuse  Medical device industry  Pharmaceutical industry  System biology  Hospital management Technology Used --na-- Provide Information About  Patient treatment
  • 10. “Hybrid Approach for Heart Disease Detection Using Clustering and ANN” by Neha Chikshe, Tejasweeta Dixit, Rashmi Gore, Prerana Akade (2016) Domain Clustering and ANN Task Performed Prediction Of heart Disease Technology Used  Clustering  Neural networks navie bayes  Decision tree  K- nearest neighbour Provide Information About Heart disease
  • 11. “Analysis of Data mining techniques for healthcare decision support system using liver disorder dataset” by Tapas Rajan Baitharu, Subhendu Kumar Pani (2016) Domain Data mining, ANN Task Performed Liver disease prediction Analysis Technology Used • Naïve Bayes • Multiplier Perceptron • ZeroR Provide Information About Comparison of various algorithms
  • 12.  Hardware Requirement: 1. 1.5 gigahertz (GHz) dual-core C.P.U. 2. 4 GB RAM 3. 1024x768 minimum screen resolution 4. 10GB Of hard disk space  Software Requirements: 1. Microsoft Windows 7+ 2. Xampp web server 3. text editor (notepad) 4. Anaconda
  • 13. We Are using following algorithms for implementation of this project: • Decision tree • Logistic regression • K-nearest neighbours • Naive Bayes classifier
  • 14. Decision trees are commonly used in operations research, specifically in decision analysis, to help identify the strategy most likely to reach a goal, and they are also a popular tool in data mining.
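As a minimal sketch of how a decision tree chooses a split, the snippet below scores candidate thresholds on one numeric feature by weighted Gini impurity and keeps the best one. The function names (`gini`, `best_threshold`) and the toy age values are illustrative assumptions, not taken from the project.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 means the group is pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def best_threshold(values, labels):
    """Return (threshold, weighted_gini) for the best split 'value <= threshold'."""
    best = (None, float("inf"))
    # Skip the largest value: splitting there would leave the right side empty.
    for t in sorted(set(values))[:-1]:
        left = [y for x, y in zip(values, labels) if x <= t]
        right = [y for x, y in zip(values, labels) if x > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if score < best[1]:
            best = (t, score)
    return best

# Hypothetical toy data: age and disease label (0 = absent, 1 = present).
ages = [35, 42, 50, 58, 63, 70]
disease = [0, 0, 0, 1, 1, 1]
threshold, impurity = best_threshold(ages, disease)
```

A full tree repeats this search on each resulting group, over every feature, until the groups are pure or a depth limit is reached.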
  • 15. Logistic regression is a classification algorithm used when the response variable is categorical. Its idea is to find a relationship between the features and the probability of a particular outcome. Why we are using logistic regression: in binary classification, let ‘x’ be some feature and ‘y’ be the output, which can be either 0 or 1. Based on the training data, we can then predict whether a patient's records indicate that the person has the disease or not.
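The relationship between a feature and the probability of an outcome can be sketched for a single feature as below. The weight and bias are hypothetical values chosen for illustration; in practice they would be learned from the training data.

```python
import math

def sigmoid(z):
    """Map any real number to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(x, weight, bias):
    """Probability that y = 1 for a single feature value x."""
    return sigmoid(weight * x + bias)

def predict(x, weight, bias, cutoff=0.5):
    """Binary prediction: 1 (disease present) if the probability reaches the cutoff."""
    return 1 if predict_proba(x, weight, bias) >= cutoff else 0

# Hypothetical parameters, not fitted to any real data.
weight, bias = 1.5, -1.0
high_risk = predict(2.0, weight, bias)
low_risk = predict(0.0, weight, bias)
```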
  • 16. A naive Bayes classifier is an algorithm that uses Bayes' theorem to classify objects. Naive Bayes classifiers assume strong, or naive, independence between the attributes of data points. Naive Bayes is also known as simple Bayes or independence Bayes.
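The formula shown on the slide is not reproduced in this transcript. For reference, the standard statement of Bayes' theorem for a class c and attributes x_1, ..., x_n, together with the naive independence assumption and the resulting classification rule, is:

```latex
P(c \mid x_1, \dots, x_n) = \frac{P(c)\, P(x_1, \dots, x_n \mid c)}{P(x_1, \dots, x_n)},
\qquad
P(x_1, \dots, x_n \mid c) = \prod_{i=1}^{n} P(x_i \mid c),
\qquad
\hat{c} = \arg\max_{c} \; P(c) \prod_{i=1}^{n} P(x_i \mid c)
```

The denominator is the same for every class, so it can be dropped when choosing the most probable class.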
  • 17. K-nearest neighbours (KNN) is a simple algorithm that stores all available cases and classifies new cases based on a similarity measure (e.g., distance functions). KNN has been used in statistical estimation and pattern recognition.
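A minimal sketch of this idea, assuming Euclidean distance as the similarity measure and a majority vote among the k closest stored cases. The toy points (age, maximum heart rate) and their labels are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest stored cases."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical stored cases: (age, maximum heart rate) with 0 = absent, 1 = present.
points = [(40, 180), (45, 170), (60, 120), (65, 110), (50, 150)]
labels = [0, 0, 1, 1, 0]
older_patient = knn_predict(points, labels, (62, 115))
younger_patient = knn_predict(points, labels, (42, 175))
```

Because the vote depends on raw distances, features on different scales are usually normalized first so that no single attribute dominates.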
  • 18. Programming languages: 1. HTML & CSS: for designing the interface 2. PHP: for connecting the user interface with the database 3. Python: for mining data and generating results 4. SQL: for database management. Python libraries required: 1. Pandas: for data manipulation and analysis 2. Seaborn: for data visualization 3. Sklearn (scikit-learn): for data mining
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25. We used the Statlog (Heart) Data Set from the UCI Machine Learning Repository. This data source contains 13 attributes: 1. age 2. sex 3. chest pain type (4 values) 4. resting blood pressure 5. serum cholesterol in mg/dl 6. fasting blood sugar > 120 mg/dl
  • 26. 7. resting electrocardiographic results (values 0, 1, 2) 8. maximum heart rate achieved 9. exercise-induced angina 10. oldpeak = ST depression induced by exercise relative to rest 11. the slope of the peak exercise ST segment 12. number of major vessels (0-3) colored by fluoroscopy 13. thal: 3 = normal; 6 = fixed defect; 7 = reversible defect
  • 27. Attribute types: real: 1, 4, 5, 8, 10, 12; ordered: 11; binary: 2, 6, 9; nominal: 3, 7, 13. Variable to be predicted: absence (1) or presence (2) of heart disease.
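As a rough sketch of preparing one record for the classifiers, the snippet below splits a row into the 13 attributes and recodes the target from 1/2 to 0/1. The column names, the whitespace-separated layout, and the sample row are assumptions made for illustration; they are not taken from the project's code or data.

```python
# Hypothetical column names for the 13 attributes, in the order listed above.
COLUMNS = [
    "age", "sex", "chest_pain_type", "resting_bp", "serum_cholesterol",
    "fasting_blood_sugar", "resting_ecg", "max_heart_rate", "exercise_angina",
    "oldpeak", "slope", "major_vessels", "thal",
]

def parse_record(line):
    """Split one whitespace-separated row into (features, label).

    The last value is the target: 1 = absence, 2 = presence of heart disease.
    It is recoded to 0 = absence, 1 = presence.
    """
    values = [float(v) for v in line.split()]
    if len(values) != len(COLUMNS) + 1:
        raise ValueError(f"expected {len(COLUMNS) + 1} values, got {len(values)}")
    target = int(values[-1])
    if target not in (1, 2):
        raise ValueError(f"unexpected target value: {target}")
    features = dict(zip(COLUMNS, values[:-1]))
    return features, target - 1

# Made-up sample row for illustration only.
features, label = parse_record("63 1 1 145 233 1 2 150 0 2.3 3 0 6 2")
```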
  • 28. EHR creation interface. Doctor dashboard showing all records created by the doctor.
  • 29. Doctor dashboard showing the patient's predictive result to the doctor, including their reports. Patient dashboard showing the predictive result and report to the patient.
  • 30.
  • 31. We observed that each algorithm has its own properties that determine its accuracy. We therefore used an election approach to state the results: the system bases its prediction on the mean of the results from the various algorithms, which helps to give a more accurate and appropriate result and makes the application more robust.
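One way to read this election approach is sketched below: each algorithm reports a probability of disease, and the system averages them before applying a cutoff. The function name `mean_vote`, the 0.5 cutoff, and the per-model outputs are hypothetical; the slides state only that the mean of the results is used.

```python
def mean_vote(model_outputs, cutoff=0.5):
    """Average the per-model disease probabilities and apply a cutoff.

    Returns (mean_probability, label) with label 1 = presence, 0 = absence.
    """
    if not model_outputs:
        raise ValueError("at least one model output is required")
    mean = sum(model_outputs.values()) / len(model_outputs)
    return mean, (1 if mean >= cutoff else 0)

# Hypothetical outputs for one patient, not real results.
outputs = {
    "decision_tree": 1.0,
    "logistic_regression": 0.75,
    "knn": 0.5,
    "naive_bayes": 0.25,
}
mean_probability, label = mean_vote(outputs)
```

Averaging softens the effect of any single model's mistake, which is the robustness the slide refers to.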
  • 32. In this system, we used different data mining algorithms and calculated their accuracy, which is given above. We observed that each algorithm has its own properties that determine its accuracy, so the system bases its prediction on the mean of the results from the various algorithms, which helps to give a more accurate and appropriate result.
  • 33. 1. M. Sumanth. “Data Mining Applications in Healthcare Sector”. 2. Neha Chikshe, Tejasweeta Dixit, Rashmi Gore, Prerana Akade (2016). “Hybrid Approach for Heart Disease Detection Using Clustering and ANN”. IJRITCC, Jan 2016, Volume 4, Issue 1. 3. Dr. Gagandeep Jagdev (2015). “Application of Big Data in Medical Science Brings Revolution in Managing Health Care of Humans”. IJEEE, Jan 2015, Volume 2, Spl. Issue 1. 4. Tapas Ranjan Baitharu, Subhendu Kumar Pani (2016). “Analysis of Data Mining Techniques for Healthcare Decision Support System Using Liver Disorder Dataset”. Books: Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani. “An Introduction to Statistical Learning”. Springer.
  • 34. Websites: 1. www.ijritcc.com/index.php/ijritcc/article/view/1718, accessed on 17/08/18 2. www.slideshare.net/madallapallisumanth/data mininginhealthcaresector, accessed on 17/08/18 3. www.issuu.com/ijeeeapm/docs/id77, accessed on 17/08/18 4. www.immagic.com/eLibrary/ARCHIVES/GENERAL/WIKIPEDI/W1120615B.pdf, accessed on 31/08/18 5. www.Wikipedia.com, accessed on 05/10/18 6. www.saedsayad.com, accessed on 10/10/18 7. www.scikit-learn.org/stable/modules/generated/sklearn.metrics.confusionmatrix.html, accessed on 9/01/19 8. www.python.org, accessed on 15/01/19 9. www.archive.ics.uci.edu/ml/datasets/Heart + Disease, accessed on 28/01/19