SlideShare a Scribd company logo
1 of 19
Kallam Harandha Reddy
Institute Of Technology
(Autonomous)
Heart Disease
Prediction System
using Exploratory
Data Analysis
Mentored By-
Mr.Nagarjuna
Submitted By-
P.Pravallika(CSE/576)
N.Divya(CSE/570)
Y.Joshua(CSE/5A6)
N.MukeshKrishna(CSE/569)
TABLE OF CONTENTS:
1. Abstract
2. Introduction
3. Existing System
4. Proposed System
5. System Specifications
6. System Architecture
7. Problem Statement
8. Data Collection
9. Technology Used
10. Data Cleaning
11. Library Used
12. Tableau Dashboard
13. Conclusion
Abstract
● With an opulence of data , healthcare is being developed by the
application of machine learning.
● Cardiovascular disease is one of the most fatal conditions in the
present world. In case of heart diseases, the correct diagnosis in early
stage is important as time is the very important factor.
● We are building a Heart Disease Prediction system to predict the
chance of heart disease , for this we use different algorithms
like Logistic Regression and Random Forests by giving age, gender,
blood pressure etc as input. As a output it gives the chance of getting
heart disease.
Introduction
Heart disease predictor is an offline platform designed and developed to
explore the path of machine learning . The goal is to predict the health of a
patient from collective data, so as to be able to detect configurations at risk
for the patient, and therefore, in cases requiring emergency medical
assistance, alert the appropriate medical staff of the situation of the latter.
We initially have a dataset collecting information of many patients with which
we are able to conclude the results into a complete form and can predict data
precisely. The results of the predictions, derived from the predictive models
generated by machine learning, will be presented through several distinct
graphical interfaces according to the datasets considered. We will then bring
criticism as to the scope of our results.
Existing System
● Diagnosis of the disease solely depends upon the docter’s intuition and patient’s
records.
● Researchers made use of several data mining techniques that are accessible to help
the specialists or physicians identify the heart disease. One of them is Naïve Bayes
algorithm.
● The disadvantages of this prediction are, cardiovascular disease results are
not accurate , cannot handle enormous datasets for patient records.
Disadvantages:
● This practice leads to unwanted biases ,errors and excessive medical costs which
effects the quality of service provided to patients.
● There are many ways that a medical misdiagnosis can represent itself .
Proposed System
● Machine learning techniques are used to increase the accuracy rate.
● In machine learning technique we can use the following algorithms on
huge datasets to predict the heart disease.
● 1.Logistic Regression
● 2.Random Forest
● Logistic Regression algorithm is used to improve the accuracy of the
system.
● By using this ,the proposed system acts as a decision support system
and will predict the chances of heart diseases.
System Specifications
Software requirements:
● OS : Windows
● Python IDE : Python 2.7.x and above Anaconda IDE
● Setup tools and pip to be installed for 3.6.x and above
Hardware requirements:
● RAM : 4GB and Higher
● Processor : Intel i3 and above
● Hard Disk : 500GB: Minimum
System Architecture
System Architecture
Problem Statement
Machine learning allows building models to quickly analyze data and
deliver results, leveraging the historical and real-time data, with machine
learning that will help healthcare service providers to make better
decisions on patient’s disease diagnosis.
By analyzing the data we can predict the occurrence of the disease in our
project. This intelligent system for disease prediction plays a major role in
controlling the disease and maintaining the good health status of people
by predicting accurate disease risk.
Machine learning algorithms can also be helpful in providing vital
statistics, real-time data and advanced analytics in terms of the patient’s
disease, lab test results, blood pressure, family history, clinical trial data,
etc., to doctors.
Data Collection
Data has been collected from Kaggle.
Data collection is the process of gathering and Measuring information from countless different
sources. In order to use the datawe collect to develop practical Artificial Intelligence (AI) amd
Machine learning solutions it must be collected and stored in a way that makes sense for the
business problem at hand
What is Kaggle?
KAGGLE is an online community of data scientists and machine learners, owned by Google
LLC. Kaggle allows users to find and publish data sets, explore and build models in a web
based data science environment, work with other data scientists and other machine learning
engineers and enter data competitions to solve data science challenges
Attributes in Dataset
● Age
● Sex
● Chest Pain (CP)
● Blood Pressure (Trestbps)
● Cholestrol (Chol)Major
● Fasting Blood Sugar (fbs)
● Heart Rate (Thalach)
● Resting electrocardiographic results(Rest
● Exercise induced angina (Exang)
● Expression (oldpeak)
● Slope
● vessels (Ca)
Testing Technologies
Anaconda(Python) - Anaconda is a free and open-source distribution of the
Python and R programming languages for scientific computing, that aims to
simplify package management and deployment.
Jupyter Notebook - The Jupyter Notebook is an open-source web application that
allows you to create and share documents that contain live code, equations,
visualizations and narrative text. Uses include: data cleaning and transformation,
numerical simulation, statistical modeling, data visualization, machine learning,
and much more.
Data Cleaning
Data Cleaning is essentially the task of removing errors and anomalies or
replacing observed values with the true values from data to get more values in
analytics .
METHODS
● Get Rid of Extra Spaces.
● Select and Treat All Blank Cells.
● Convert Numbers Stored as Text into Numbers.
● Remove Duplicates.
● Highlight Errors.
● Change Text to Lower/Upper/Proper Case.
● Spell Check.
● Delete all Formatting.
Libraries Used
1. Pandas- is a software library written for the Python programming
language for data manipulation and analysis. In particular, it offers data
structures and operations for manipulating numerical tables and time series
pandas is a Python package providing fast, flexible, and expressive data
structures designed to make working with “relational” or “labeled” data
both easy and intuitive. It aims to be the fundamental high-level building
block for doing practical, real world data analysis in Python.
2. .NumPy- NumPy is a library for the Python programming language,
adding support for large, multi-dimensional arrays and matrices, along with
a large collection of high-level mathematical functions to operate on these
arrays.
Tableau Dashboard
● Tableau is one of the business intelligence software used to analyse
data and visualize the insights in the form of graph and charts.
● User can develop and share an interactive dashboard which shows the
hidden pattern, trends, density and variation of data.
● Tableau uses centroid-based k-means clustering algorithm that divides
the data into K-number of clusters.
● Dashboards are created with the data set after applying K-means
algorithm.
● It provides visual appealing clusters in order to predict the occurrence
of heart disease from the given dataset.
Dataset in Tableau:
Conclusion
● The models we used to predict the probability of having heart disease are
Logistic regression,Random forest as they are more accurate in numerical
variables. The model accuracy is 85 % in test and train data sets. This model will
would be used in medical field as it can predict the heart diseases .
● Heart stroke and vascular disease are the major cause of disability and
premature death. Chest pain is the key to recognize the heart disease. In this
work, the heart diseases are predicted by considering major factors with four
types of chest pain. K-means clustering is one of the simplest and popular
unsupervised machine learning algorithms. Here the datasets is clustered and
based upon the clusters the happening of chest pain is predicted. The role of
exploratory data using tableau provided a visual appealing and accurate
clustering experience.
18
THANK YOU

More Related Content

Similar to heart disease predction using machiine learning

Multiple Disease Prediction System
Multiple Disease Prediction SystemMultiple Disease Prediction System
Multiple Disease Prediction SystemIRJET Journal
 
Heart Disease Prediction using Machine Learning Algorithms
Heart Disease Prediction using Machine Learning AlgorithmsHeart Disease Prediction using Machine Learning Algorithms
Heart Disease Prediction using Machine Learning AlgorithmsIRJET Journal
 
IRJET- Web-based Application to Detect Heart Attack using Machine Learning
IRJET- Web-based Application to Detect Heart Attack using Machine LearningIRJET- Web-based Application to Detect Heart Attack using Machine Learning
IRJET- Web-based Application to Detect Heart Attack using Machine LearningIRJET Journal
 
Comparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease PredictionComparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease PredictionIRJET Journal
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
 
A STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODS
A STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODSA STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODS
A STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODSIRJET Journal
 
PREDICTION OF HEART DISEASE USING LOGISTIC REGRESSION
PREDICTION OF HEART DISEASE USING LOGISTIC REGRESSIONPREDICTION OF HEART DISEASE USING LOGISTIC REGRESSION
PREDICTION OF HEART DISEASE USING LOGISTIC REGRESSIONIRJET Journal
 
IRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease PredictionIRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease PredictionIRJET Journal
 
Predicting Heart Disease Using Machine Learning Algorithms.
Predicting Heart Disease Using Machine Learning Algorithms.Predicting Heart Disease Using Machine Learning Algorithms.
Predicting Heart Disease Using Machine Learning Algorithms.IRJET Journal
 
Prediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A ReviewPrediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A ReviewIRJET Journal
 
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET Journal
 
DESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUE
DESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUEDESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUE
DESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUEIRJET Journal
 
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET-  	  Role of Different Data Mining Techniques for Predicting Heart DiseaseIRJET-  	  Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET- Role of Different Data Mining Techniques for Predicting Heart DiseaseIRJET Journal
 
Risk Of Heart Disease Prediction Using Machine Learning
Risk Of Heart Disease Prediction Using Machine LearningRisk Of Heart Disease Prediction Using Machine Learning
Risk Of Heart Disease Prediction Using Machine LearningIRJET Journal
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksIAEME Publication
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksIAEME Publication
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksIAEME Publication
 
Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...IJAAS Team
 
Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...
Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...
Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...IRJET Journal
 

Similar to heart disease predction using machiine learning (20)

Multiple Disease Prediction System
Multiple Disease Prediction SystemMultiple Disease Prediction System
Multiple Disease Prediction System
 
Heart Disease Prediction using Machine Learning Algorithms
Heart Disease Prediction using Machine Learning AlgorithmsHeart Disease Prediction using Machine Learning Algorithms
Heart Disease Prediction using Machine Learning Algorithms
 
Short story.pptx
Short story.pptxShort story.pptx
Short story.pptx
 
IRJET- Web-based Application to Detect Heart Attack using Machine Learning
IRJET- Web-based Application to Detect Heart Attack using Machine LearningIRJET- Web-based Application to Detect Heart Attack using Machine Learning
IRJET- Web-based Application to Detect Heart Attack using Machine Learning
 
Comparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease PredictionComparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease Prediction
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
A STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODS
A STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODSA STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODS
A STUDY OF THE LITERATURE ON CARDIOVASCULAR DISEASE PREDICTION METHODS
 
PREDICTION OF HEART DISEASE USING LOGISTIC REGRESSION
PREDICTION OF HEART DISEASE USING LOGISTIC REGRESSIONPREDICTION OF HEART DISEASE USING LOGISTIC REGRESSION
PREDICTION OF HEART DISEASE USING LOGISTIC REGRESSION
 
IRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease PredictionIRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease Prediction
 
Predicting Heart Disease Using Machine Learning Algorithms.
Predicting Heart Disease Using Machine Learning Algorithms.Predicting Heart Disease Using Machine Learning Algorithms.
Predicting Heart Disease Using Machine Learning Algorithms.
 
Prediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A ReviewPrediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A Review
 
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
 
DESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUE
DESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUEDESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUE
DESIGN AND IMPLEMENTATION OF CARDIAC DISEASE USING NAIVE BAYES TECHNIQUE
 
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET-  	  Role of Different Data Mining Techniques for Predicting Heart DiseaseIRJET-  	  Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
 
Risk Of Heart Disease Prediction Using Machine Learning
Risk Of Heart Disease Prediction Using Machine LearningRisk Of Heart Disease Prediction Using Machine Learning
Risk Of Heart Disease Prediction Using Machine Learning
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...
 
Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...
Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...
Comparative Analysis of Heart Disease Prediction Models: Unveiling the Most A...
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

heart disease predction using machiine learning

  • 1. Kallam Harandha Reddy Institute Of Technology (Autonomous)
  • 2. Heart Disease Prediction System using Exploratory Data Analysis Mentored By- Mr.Nagarjuna Submitted By- P.Pravallika(CSE/576) N.Divya(CSE/570) Y.Joshua(CSE/5A6) N.MukeshKrishna(CSE/569)
  • 3. TABLE OF CONTENTS: 1. Abstract 2. Introduction 3. Existing System 4. Proposed System 5. System Specifications 6. System Architecture 7. Problem Statement 8. Data Collection 9. Technology Used 10. Data Cleaning 11. Library Used 12. Tableau Dashboard 13. Conclusion
  • 4. Abstract ● With an opulence of data , healthcare is being developed by the application of machine learning. ● Cardiovascular disease is one of the most fatal conditions in the present world. In case of heart diseases, the correct diagnosis in early stage is important as time is the very important factor. ● We are building a Heart Disease Prediction system to predict the chance of heart disease , for this we use different algorithms like Logistic Regression and Random Forests by giving age, gender, blood pressure etc as input. As a output it gives the chance of getting heart disease.
  • 5. Introduction Heart disease predictor is an offline platform designed and developed to explore the path of machine learning . The goal is to predict the health of a patient from collective data, so as to be able to detect configurations at risk for the patient, and therefore, in cases requiring emergency medical assistance, alert the appropriate medical staff of the situation of the latter. We initially have a dataset collecting information of many patients with which we are able to conclude the results into a complete form and can predict data precisely. The results of the predictions, derived from the predictive models generated by machine learning, will be presented through several distinct graphical interfaces according to the datasets considered. We will then bring criticism as to the scope of our results.
  • 6. Existing System ● Diagnosis of the disease solely depends upon the docter’s intuition and patient’s records. ● Researchers made use of several data mining techniques that are accessible to help the specialists or physicians identify the heart disease. One of them is Naïve Bayes algorithm. ● The disadvantages of this prediction are, cardiovascular disease results are not accurate , cannot handle enormous datasets for patient records. Disadvantages: ● This practice leads to unwanted biases ,errors and excessive medical costs which effects the quality of service provided to patients. ● There are many ways that a medical misdiagnosis can represent itself .
  • 7. Proposed System ● Machine learning techniques are used to increase the accuracy rate. ● In machine learning technique we can use the following algorithms on huge datasets to predict the heart disease. ● 1.Logistic Regression ● 2.Random Forest ● Logistic Regression algorithm is used to improve the accuracy of the system. ● By using this ,the proposed system acts as a decision support system and will predict the chances of heart diseases.
  • 8. System Specifications Software requirements: ● OS : Windows ● Python IDE : Python 2.7.x and above Anaconda IDE ● Setup tools and pip to be installed for 3.6.x and above Hardware requirements: ● RAM : 4GB and Higher ● Processor : Intel i3 and above ● Hard Disk : 500GB: Minimum
  • 10. Problem Statement Machine learning allows building models to quickly analyze data and deliver results, leveraging the historical and real-time data, with machine learning that will help healthcare service providers to make better decisions on patient’s disease diagnosis. By analyzing the data we can predict the occurrence of the disease in our project. This intelligent system for disease prediction plays a major role in controlling the disease and maintaining the good health status of people by predicting accurate disease risk. Machine learning algorithms can also be helpful in providing vital statistics, real-time data and advanced analytics in terms of the patient’s disease, lab test results, blood pressure, family history, clinical trial data, etc., to doctors.
  • 11. Data Collection Data has been collected from Kaggle. Data collection is the process of gathering and Measuring information from countless different sources. In order to use the datawe collect to develop practical Artificial Intelligence (AI) amd Machine learning solutions it must be collected and stored in a way that makes sense for the business problem at hand What is Kaggle? KAGGLE is an online community of data scientists and machine learners, owned by Google LLC. Kaggle allows users to find and publish data sets, explore and build models in a web based data science environment, work with other data scientists and other machine learning engineers and enter data competitions to solve data science challenges
  • 12. Attributes in Dataset ● Age ● Sex ● Chest Pain (CP) ● Blood Pressure (Trestbps) ● Cholestrol (Chol)Major ● Fasting Blood Sugar (fbs) ● Heart Rate (Thalach) ● Resting electrocardiographic results(Rest ● Exercise induced angina (Exang) ● Expression (oldpeak) ● Slope ● vessels (Ca)
  • 13. Testing Technologies Anaconda(Python) - Anaconda is a free and open-source distribution of the Python and R programming languages for scientific computing, that aims to simplify package management and deployment. Jupyter Notebook - The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more.
  • 14. Data Cleaning Data Cleaning is essentially the task of removing errors and anomalies or replacing observed values with the true values from data to get more values in analytics . METHODS ● Get Rid of Extra Spaces. ● Select and Treat All Blank Cells. ● Convert Numbers Stored as Text into Numbers. ● Remove Duplicates. ● Highlight Errors. ● Change Text to Lower/Upper/Proper Case. ● Spell Check. ● Delete all Formatting.
  • 15. Libraries Used 1. Pandas- is a software library written for the Python programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. 2. .NumPy- NumPy is a library for the Python programming language, adding support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
  • 16. Tableau Dashboard ● Tableau is one of the business intelligence software used to analyse data and visualize the insights in the form of graph and charts. ● User can develop and share an interactive dashboard which shows the hidden pattern, trends, density and variation of data. ● Tableau uses centroid-based k-means clustering algorithm that divides the data into K-number of clusters. ● Dashboards are created with the data set after applying K-means algorithm. ● It provides visual appealing clusters in order to predict the occurrence of heart disease from the given dataset.
  • 18. Conclusion ● The models we used to predict the probability of having heart disease are Logistic regression,Random forest as they are more accurate in numerical variables. The model accuracy is 85 % in test and train data sets. This model will would be used in medical field as it can predict the heart diseases . ● Heart stroke and vascular disease are the major cause of disability and premature death. Chest pain is the key to recognize the heart disease. In this work, the heart diseases are predicted by considering major factors with four types of chest pain. K-means clustering is one of the simplest and popular unsupervised machine learning algorithms. Here the datasets is clustered and based upon the clusters the happening of chest pain is predicted. The role of exploratory data using tableau provided a visual appealing and accurate clustering experience. 18