SlideShare a Scribd company logo
1 of 4
Download to read offline
IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org
ISSN (e): 2250-3021, ISSN (p): 2278-8719
Vol. 04, Issue 07 (July. 2014), ||V3|| PP 36-39
International organization of Scientific Research 36 | P a g e
Comparative study of Data Mining Approaches for prediction
Heart Diseases
Dr.Hari Ganesh S1
, Gajenthiran M2
1
Asst. Professor, 2
M.Phil. Scholar,
Department of Computer Applications,
Bishop Heber College (Autonomous),
Trichirappalli-620 017
Abstract: - Data mining is the process of finding useful and relevant information from the databases. There are
several types of data mining techniques are available. Association Rule, Classification, Neural Networks,
Clusteringare some of the most important data mining techniques. Data mining process may take important role
in Health care Industries. Most commonly the data mining process is used in health care industries for the
process of prediction of diseases. This paper analysis the Heart Disease prediction approaches using
classification technique. Here we are using three different kinds of classifiers named Naïve Bayes, Decision
Table, and J48. The data set which is used here is taken from the UCI repository.Weka tool is used.
Keywords: - Heart disease prediction, Classification, Decision Table, Naïve Bayes.
I. INTRODUCTION
Data mining is the process of finding useful and relevant information from the various types of databases. The
process of extraction of the hidden predictive information from the databases is also called as Data mining.[1]
Nowadays data mining may take important role in several types of fields like Medical, Science, Railway etc.
Association rule, Classification is one of the type of widely using data mining technique for prediction of heart
diseases.
1.1 Classification
Classification is the process of categorization the data which are in the databases for its most effective
and efficient use. For example, data can be broken down according to its topical content, file type, operating
platform, average file size megabytes or gigabytes, when it was created, when it was last accessed or modified,
which person or department last accessed or modified it, and which personnel or departments use it the most. A
well-planned data classification system makes essential data easy to find. There are several types of
classification models are available. But here we are using four different kinds of classification models. They are
Naïve Bayes, Decision Tree, and J48.
1.2 Heart Diseases
Heart disease is one of the type of disease which will affects the operations of the heart. Nowadays
Heart disease is the major reason for deaths. There are several kinds of factors which increases the risk of Heart
Diseases. Here we are using fourteen kinds of factors for the purpose predicting the heart disease. But the
following are considered as important reasons for heart diseases.
 Age
 Blood Pressure
 Smoking habit
 Cholesterol
 Exercise and weight
 Blood Sugar
 Depression
The Heart Diseases are mostly caused by the above factors. The Survey result of World Health
Organization says 12 million deaths are occurred worldwide, every year due to Heart diseases. If we want to
find the solution for this problem, there are several kinds of prediction processes are needed. This paper will
analyse various kinds of Heart Disease Prediction Techniques.
Comparative study of Data Mining Approaches for prediction Heart Diseases
International organization of Scientific Research 37 | P a g e
1.3 Weka Tool:
WEKA stands for Waikato Environment for Knowledge Learning. It was developed by the University of
Waikato, New Zealand. WEKA supports many data mining tasks such as re-processing, classification,
clustering, regression and future selection to name a few[2].The workflow of WEKA would be follows:
Naïve Bayes:
Naïve Bayes is a simple probabilistic classifier based on applying Bayes theorem with strong
independence assumptions. A more descriptive term for underlying probability model would be “independent
feature model”. In simple terms, a Naïve Bayes classifier assumes that the presence of a particular feature of a
class is unrelated to the presence of any other feature.[3] Depending on the precise nature of the probability
model, Naïve Bayes classifiers can be trained very sufficiently in a supervised learning setting. In many
practical applications, parameter estimation for Naïve Bayes models uses the method of maximum likelihood.
1.5. Decision Table:
The Decision tree is a structure that includes root node, branch and leaf node. Each internal node
denotes a test on attribute, each branch denotes the outcome of test and each leaf node holds the class label [4].
The topmost node in the tree is the root node. There are three main advantages of decision tree.
 It does not require any domain knowledge.
 It is easy to assimilate by human.
 Learning and classification steps of decision tree are simple and fast.
1.6 J48:
The J48 Decision tree classifier follows the following simple algorithm. In order to classify a new item,
it first needs to create a decision tree based on the attribute values of the available training data [5]. So,
whenever it encounters a set of items (training set) it identifies the attribute that discriminates the various
instances most clearly. This feature that ableto tell us most about the data instances so that we can classify them
the best is said to have the highest information gain. Now, among the possible values of this feature, if there is
any value for which there is no ambiguity, that is, for which the data instances falling within its category have
the same value for that target variable, then we terminate that branch and assign to it the target value that we
have obtained [6].
II. DATA DESCRIPTION
The data are collected from UCI repository. The objective of this data set is to analysis the Heart
Disease based on the given attributes. The data set consists of 13 attributes that are used to predict the Heart
Disease. The detail descriptions of the attributes are given as below.
Table 2.1: Attributes for Heart Disease
No Name of the Attribute Description
1 Age Patient’s age
2 Sex Male/Female
3 Cp Constrictive Pericarditis
4 Fbs Fasting Blood Sugar
5 Trestbps Resting Blood Sugar
6 Restecg Resting Electrocardiograph Results
7 Thalach Maximum Heart Attack achieved.
8 Exang Exercise Induced Angina.
9 Oldpeak ST depression by exercise relative to
Comparative study of Data Mining Approaches for prediction Heart Diseases
International organization of Scientific Research 38 | P a g e
rest.
10 Slope The slope of the peak exercise st
segment.
11 Chol Cholesterol
12 Ca Number of major vessels (0-3) colored
by fluoroscopy.
13 Num Diagnosis of heart disease.
The attributes are given based on data types. The data set is based on the numeric and nominal data type.
III. EXPERIMENTAL RESULTS
The given three types of algorithms like Naïve Bayes, Decision Table, and J48 are applied on the
Heart Disease data set in WEKA and the performance of the algorithm are given based various factors. The
performance can be obtained based on the time taken to build the tree and correctly classified instances.
Table 3.1 Time taken by the algorithms
Name of the
Algorithm
Time Taken to build the
decision tree
Naïve Bayes 0.01 seconds
Decision Table 0.05 seconds
J48 0.03 seconds
Fig 3.1 Performance of the Algorithms based on the time taken
X-Axis: Classification of Algorithms
Y-Axis: Time Range
The dataset consists of 303 instances and they are applied as a test case in the classification algorithms.
The performance of the algorithms can be known from the instances that are correctly classified. The instances
which are correctly classified using the WEKA tool can be given as below,
Table 3.2 Number of instances correctly classified
Name of the
Algorithm
Number of correct
instance
Accuracy
Naïve Bayes 253 83.4%
Decision Table 231 76.2%
J48 235 77.5%
0
0.01
0.02
0.03
0.04
0.05
0.06
Performance Analysis
Naïve Bayes
Decision
Table
J48
Comparative study of Data Mining Approaches for prediction Heart Diseases
International organization of Scientific Research 39 | P a g e
IV. DISCUSSION
The above three algorithms predicts the class label. The final output will be patterns which are used to
find out whether the person is affected by the Heart Disease or not. A Confusion Matrix is a useful visualization
tool for analyzing the classifier accuracy. Structure of the confusion matrix can be given as below
Table 4.1 Structure of the Confusion Matrix
TP TN
FP FN
Where
 TP is True Positive: Heart Disease patients correctly identified as Heart Disease.
 FP is False Positive: Healthy people incorrectly identified as Heart Disease.
 TN is True Negative: Healthy people correctly identified as healthy.
 FN is False Negative: Heart Disease patients incorrectly identified as healthy.
The Confusion Matrix for the classification algorithms such as Naïve Bayes, Decision Table and J48 can be
given as follows based on the execution of the algorithm using WEKA tool.
Table 4.2 Confusion Matrix for Naïve Bayes
143 22
28 110
Table 4.3 Confusion Matrix for Decision Table
133 32
40 98
Table 4.4 Confusion Matrix for J48
137 28
40 98
V. CONCLUSION
Data mining plays a major role in extracting the hidden information in the medical data base. The data
pre-processing is used in order to improve the quality of the data. This model is built based as a test case on the
UCI repository dataset. The experiment has been successfully performed with several data mining classification
techniques and it is found that the Naïve Bayes algorithm gives a better performance over the supplied data set
with the accuracy of 83.4%. It is believed that the data mining can significantly help in the Heart Disease
research and ultimately improve the quality of health care of Heart Disease patients. It can also be implemented
using several classification techniques.
REFERENCES
[1] Dharminder Kumar, Deepak Bhardwaj“Rise of Data Mining: Current and Future Application”,
International Journal of Computer Science, Vol- 8, Issue 5, September-2011.
[2] ZdravkoMarkovIngridRussell“An Introduction to the WEKA Data Mining System”, Zdravko Markov
Central Connecticut State University.
[3] VikasChaurasia, SaurabhPal“Early Prediction of Heart disease using data mining
techniques”,Carib.j.SciTech,2013,Vol.1,208-217
[4] NidhiBhatla, KiranJyoti“An Analysis of Heart Disease Prediction using Different Data Mining
Techniques”,International Journal of Engineering Research & Technology (IJERT).
[5] Jay Gholap“Performance Tuning Of J48”,Dept. of Computer Engineering College of Engineering, Pune,
Maharashtra, India.
[6] Shadab Adam Pattekari and AsmaParveen “Prediction System for Heart Disease Using Naive Bayes”,
International Journal of Advanced Computer and Mathematical Sciences. Vol 3, Issue 3, 2012, pp 290-
294.

More Related Content

What's hot

A Heart Disease Prediction Model using Decision Tree
A Heart Disease Prediction Model using Decision TreeA Heart Disease Prediction Model using Decision Tree
A Heart Disease Prediction Model using Decision TreeIOSR Journals
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSivagowry Shathesh
 
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseA Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseijtsrd
 
IRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health RecordIRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health RecordIRJET Journal
 
Decision Tree Models for Medical Diagnosis
Decision Tree Models for Medical DiagnosisDecision Tree Models for Medical Diagnosis
Decision Tree Models for Medical Diagnosisijtsrd
 
Propose a Enhanced Framework for Prediction of Heart Disease
Propose a Enhanced Framework for Prediction of Heart DiseasePropose a Enhanced Framework for Prediction of Heart Disease
Propose a Enhanced Framework for Prediction of Heart DiseaseIJERA Editor
 
Classification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining TechniquesClassification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining TechniquesLovely Professional University
 
Heart disease prediction
Heart disease predictionHeart disease prediction
Heart disease predictionAriful Haque
 
IRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning AlgorithmIRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning AlgorithmIRJET Journal
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction systemSWAMI06
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksIAEME Publication
 
Chronic Kidney Disease Prediction
Chronic Kidney Disease PredictionChronic Kidney Disease Prediction
Chronic Kidney Disease PredictionRajandeep Gill
 
Psdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heartPsdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heartZTech Proje
 
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive SurveyPrognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Surveyijtsrd
 
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHMHEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHMamiteshg
 
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET-  	  Role of Different Data Mining Techniques for Predicting Heart DiseaseIRJET-  	  Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET- Role of Different Data Mining Techniques for Predicting Heart DiseaseIRJET Journal
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Surveyrahulmonikasharma
 
IRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction SystemIRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction SystemIRJET Journal
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSivagowry Shathesh
 

What's hot (20)

A Heart Disease Prediction Model using Decision Tree
A Heart Disease Prediction Model using Decision TreeA Heart Disease Prediction Model using Decision Tree
A Heart Disease Prediction Model using Decision Tree
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseA Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
 
IRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health RecordIRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
 
Decision Tree Models for Medical Diagnosis
Decision Tree Models for Medical DiagnosisDecision Tree Models for Medical Diagnosis
Decision Tree Models for Medical Diagnosis
 
Propose a Enhanced Framework for Prediction of Heart Disease
Propose a Enhanced Framework for Prediction of Heart DiseasePropose a Enhanced Framework for Prediction of Heart Disease
Propose a Enhanced Framework for Prediction of Heart Disease
 
Classification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining TechniquesClassification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining Techniques
 
Heart disease prediction
Heart disease predictionHeart disease prediction
Heart disease prediction
 
IRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning AlgorithmIRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning Algorithm
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction system
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
Chronic Kidney Disease Prediction
Chronic Kidney Disease PredictionChronic Kidney Disease Prediction
Chronic Kidney Disease Prediction
 
Psdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heartPsdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heart
 
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive SurveyPrognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
 
Final ppt
Final pptFinal ppt
Final ppt
 
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHMHEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
 
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET-  	  Role of Different Data Mining Techniques for Predicting Heart DiseaseIRJET-  	  Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
 
IRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction SystemIRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction System
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 

Similar to E04733639

A Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction TechniquesA Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction Techniquesijtsrd
 
Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEditor IJCATR
 
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Editor IJCATR
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...Editor IJCATR
 
IRJET- Disease Prediction and Doctor Recommendation System
IRJET-  	  Disease Prediction and Doctor Recommendation SystemIRJET-  	  Disease Prediction and Doctor Recommendation System
IRJET- Disease Prediction and Doctor Recommendation SystemIRJET Journal
 
IRJET- The Prediction of Heart Disease using Naive Bayes Classifier
IRJET- The Prediction of Heart Disease using Naive Bayes ClassifierIRJET- The Prediction of Heart Disease using Naive Bayes Classifier
IRJET- The Prediction of Heart Disease using Naive Bayes ClassifierIRJET Journal
 
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...IRJET Journal
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...mlaij
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...mlaij
 
Comparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease PredictionComparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease PredictionIRJET Journal
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAIRCC Publishing Corporation
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISijcsit
 
Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...IRJET Journal
 
Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction Systemijtsrd
 
Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...ijtsrd
 
APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...
APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...
APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...IJCSEA Journal
 
IRJET- Feature Selection and Classifier Accuracy of Data Mining Algorithms
IRJET- Feature Selection and Classifier Accuracy of Data Mining AlgorithmsIRJET- Feature Selection and Classifier Accuracy of Data Mining Algorithms
IRJET- Feature Selection and Classifier Accuracy of Data Mining AlgorithmsIRJET Journal
 
IRJET - Comparative Study of Cardiovascular Disease Detection Algorithms
IRJET - Comparative Study of Cardiovascular Disease Detection AlgorithmsIRJET - Comparative Study of Cardiovascular Disease Detection Algorithms
IRJET - Comparative Study of Cardiovascular Disease Detection AlgorithmsIRJET Journal
 
prediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsprediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsINFOGAIN PUBLICATION
 
Prediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A ReviewPrediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A ReviewIRJET Journal
 

Similar to E04733639 (20)

A Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction TechniquesA Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction Techniques
 
Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication Networks
 
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
 
IRJET- Disease Prediction and Doctor Recommendation System
IRJET-  	  Disease Prediction and Doctor Recommendation SystemIRJET-  	  Disease Prediction and Doctor Recommendation System
IRJET- Disease Prediction and Doctor Recommendation System
 
IRJET- The Prediction of Heart Disease using Naive Bayes Classifier
IRJET- The Prediction of Heart Disease using Naive Bayes ClassifierIRJET- The Prediction of Heart Disease using Naive Bayes Classifier
IRJET- The Prediction of Heart Disease using Naive Bayes Classifier
 
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
 
Comparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease PredictionComparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease Prediction
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 
Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...
 
Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction System
 
Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...
 
APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...
APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...
APPLYING MACHINE LEARNING TECHNIQUES TO FIND IMPORTANT ATTRIBUTES FOR HEART F...
 
IRJET- Feature Selection and Classifier Accuracy of Data Mining Algorithms
IRJET- Feature Selection and Classifier Accuracy of Data Mining AlgorithmsIRJET- Feature Selection and Classifier Accuracy of Data Mining Algorithms
IRJET- Feature Selection and Classifier Accuracy of Data Mining Algorithms
 
IRJET - Comparative Study of Cardiovascular Disease Detection Algorithms
IRJET - Comparative Study of Cardiovascular Disease Detection AlgorithmsIRJET - Comparative Study of Cardiovascular Disease Detection Algorithms
IRJET - Comparative Study of Cardiovascular Disease Detection Algorithms
 
prediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsprediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithms
 
Prediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A ReviewPrediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A Review
 

More from IOSR-JEN

More from IOSR-JEN (20)

C05921721
C05921721C05921721
C05921721
 
B05921016
B05921016B05921016
B05921016
 
A05920109
A05920109A05920109
A05920109
 
J05915457
J05915457J05915457
J05915457
 
I05914153
I05914153I05914153
I05914153
 
H05913540
H05913540H05913540
H05913540
 
G05913234
G05913234G05913234
G05913234
 
F05912731
F05912731F05912731
F05912731
 
E05912226
E05912226E05912226
E05912226
 
D05911621
D05911621D05911621
D05911621
 
C05911315
C05911315C05911315
C05911315
 
B05910712
B05910712B05910712
B05910712
 
A05910106
A05910106A05910106
A05910106
 
B05840510
B05840510B05840510
B05840510
 
I05844759
I05844759I05844759
I05844759
 
H05844346
H05844346H05844346
H05844346
 
G05843942
G05843942G05843942
G05843942
 
F05843238
F05843238F05843238
F05843238
 
E05842831
E05842831E05842831
E05842831
 
D05842227
D05842227D05842227
D05842227
 

Recently uploaded

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

E04733639

  • 1. IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 07 (July. 2014), ||V3|| PP 36-39 International organization of Scientific Research 36 | P a g e Comparative study of Data Mining Approaches for prediction Heart Diseases Dr.Hari Ganesh S1 , Gajenthiran M2 1 Asst. Professor, 2 M.Phil. Scholar, Department of Computer Applications, Bishop Heber College (Autonomous), Trichirappalli-620 017 Abstract: - Data mining is the process of finding useful and relevant information from the databases. There are several types of data mining techniques are available. Association Rule, Classification, Neural Networks, Clusteringare some of the most important data mining techniques. Data mining process may take important role in Health care Industries. Most commonly the data mining process is used in health care industries for the process of prediction of diseases. This paper analysis the Heart Disease prediction approaches using classification technique. Here we are using three different kinds of classifiers named Naïve Bayes, Decision Table, and J48. The data set which is used here is taken from the UCI repository.Weka tool is used. Keywords: - Heart disease prediction, Classification, Decision Table, Naïve Bayes. I. INTRODUCTION Data mining is the process of finding useful and relevant information from the various types of databases. The process of extraction of the hidden predictive information from the databases is also called as Data mining.[1] Nowadays data mining may take important role in several types of fields like Medical, Science, Railway etc. Association rule, Classification is one of the type of widely using data mining technique for prediction of heart diseases. 1.1 Classification Classification is the process of categorization the data which are in the databases for its most effective and efficient use. For example, data can be broken down according to its topical content, file type, operating platform, average file size megabytes or gigabytes, when it was created, when it was last accessed or modified, which person or department last accessed or modified it, and which personnel or departments use it the most. A well-planned data classification system makes essential data easy to find. There are several types of classification models are available. But here we are using four different kinds of classification models. They are Naïve Bayes, Decision Tree, and J48. 1.2 Heart Diseases Heart disease is one of the type of disease which will affects the operations of the heart. Nowadays Heart disease is the major reason for deaths. There are several kinds of factors which increases the risk of Heart Diseases. Here we are using fourteen kinds of factors for the purpose predicting the heart disease. But the following are considered as important reasons for heart diseases.  Age  Blood Pressure  Smoking habit  Cholesterol  Exercise and weight  Blood Sugar  Depression The Heart Diseases are mostly caused by the above factors. The Survey result of World Health Organization says 12 million deaths are occurred worldwide, every year due to Heart diseases. If we want to find the solution for this problem, there are several kinds of prediction processes are needed. This paper will analyse various kinds of Heart Disease Prediction Techniques.
  • 2. Comparative study of Data Mining Approaches for prediction Heart Diseases International organization of Scientific Research 37 | P a g e 1.3 Weka Tool: WEKA stands for Waikato Environment for Knowledge Learning. It was developed by the University of Waikato, New Zealand. WEKA supports many data mining tasks such as re-processing, classification, clustering, regression and future selection to name a few[2].The workflow of WEKA would be follows: Naïve Bayes: Naïve Bayes is a simple probabilistic classifier based on applying Bayes theorem with strong independence assumptions. A more descriptive term for underlying probability model would be “independent feature model”. In simple terms, a Naïve Bayes classifier assumes that the presence of a particular feature of a class is unrelated to the presence of any other feature.[3] Depending on the precise nature of the probability model, Naïve Bayes classifiers can be trained very sufficiently in a supervised learning setting. In many practical applications, parameter estimation for Naïve Bayes models uses the method of maximum likelihood. 1.5. Decision Table: The Decision tree is a structure that includes root node, branch and leaf node. Each internal node denotes a test on attribute, each branch denotes the outcome of test and each leaf node holds the class label [4]. The topmost node in the tree is the root node. There are three main advantages of decision tree.  It does not require any domain knowledge.  It is easy to assimilate by human.  Learning and classification steps of decision tree are simple and fast. 1.6 J48: The J48 Decision tree classifier follows the following simple algorithm. In order to classify a new item, it first needs to create a decision tree based on the attribute values of the available training data [5]. So, whenever it encounters a set of items (training set) it identifies the attribute that discriminates the various instances most clearly. This feature that ableto tell us most about the data instances so that we can classify them the best is said to have the highest information gain. Now, among the possible values of this feature, if there is any value for which there is no ambiguity, that is, for which the data instances falling within its category have the same value for that target variable, then we terminate that branch and assign to it the target value that we have obtained [6]. II. DATA DESCRIPTION The data are collected from UCI repository. The objective of this data set is to analysis the Heart Disease based on the given attributes. The data set consists of 13 attributes that are used to predict the Heart Disease. The detail descriptions of the attributes are given as below. Table 2.1: Attributes for Heart Disease No Name of the Attribute Description 1 Age Patient’s age 2 Sex Male/Female 3 Cp Constrictive Pericarditis 4 Fbs Fasting Blood Sugar 5 Trestbps Resting Blood Sugar 6 Restecg Resting Electrocardiograph Results 7 Thalach Maximum Heart Attack achieved. 8 Exang Exercise Induced Angina. 9 Oldpeak ST depression by exercise relative to
  • 3. Comparative study of Data Mining Approaches for prediction Heart Diseases International organization of Scientific Research 38 | P a g e rest. 10 Slope The slope of the peak exercise st segment. 11 Chol Cholesterol 12 Ca Number of major vessels (0-3) colored by fluoroscopy. 13 Num Diagnosis of heart disease. The attributes are given based on data types. The data set is based on the numeric and nominal data type. III. EXPERIMENTAL RESULTS The given three types of algorithms like Naïve Bayes, Decision Table, and J48 are applied on the Heart Disease data set in WEKA and the performance of the algorithm are given based various factors. The performance can be obtained based on the time taken to build the tree and correctly classified instances. Table 3.1 Time taken by the algorithms Name of the Algorithm Time Taken to build the decision tree Naïve Bayes 0.01 seconds Decision Table 0.05 seconds J48 0.03 seconds Fig 3.1 Performance of the Algorithms based on the time taken X-Axis: Classification of Algorithms Y-Axis: Time Range The dataset consists of 303 instances and they are applied as a test case in the classification algorithms. The performance of the algorithms can be known from the instances that are correctly classified. The instances which are correctly classified using the WEKA tool can be given as below, Table 3.2 Number of instances correctly classified Name of the Algorithm Number of correct instance Accuracy Naïve Bayes 253 83.4% Decision Table 231 76.2% J48 235 77.5% 0 0.01 0.02 0.03 0.04 0.05 0.06 Performance Analysis Naïve Bayes Decision Table J48
  • 4. Comparative study of Data Mining Approaches for prediction Heart Diseases International organization of Scientific Research 39 | P a g e IV. DISCUSSION The above three algorithms predicts the class label. The final output will be patterns which are used to find out whether the person is affected by the Heart Disease or not. A Confusion Matrix is a useful visualization tool for analyzing the classifier accuracy. Structure of the confusion matrix can be given as below Table 4.1 Structure of the Confusion Matrix TP TN FP FN Where  TP is True Positive: Heart Disease patients correctly identified as Heart Disease.  FP is False Positive: Healthy people incorrectly identified as Heart Disease.  TN is True Negative: Healthy people correctly identified as healthy.  FN is False Negative: Heart Disease patients incorrectly identified as healthy. The Confusion Matrix for the classification algorithms such as Naïve Bayes, Decision Table and J48 can be given as follows based on the execution of the algorithm using WEKA tool. Table 4.2 Confusion Matrix for Naïve Bayes 143 22 28 110 Table 4.3 Confusion Matrix for Decision Table 133 32 40 98 Table 4.4 Confusion Matrix for J48 137 28 40 98 V. CONCLUSION Data mining plays a major role in extracting the hidden information in the medical data base. The data pre-processing is used in order to improve the quality of the data. This model is built based as a test case on the UCI repository dataset. The experiment has been successfully performed with several data mining classification techniques and it is found that the Naïve Bayes algorithm gives a better performance over the supplied data set with the accuracy of 83.4%. It is believed that the data mining can significantly help in the Heart Disease research and ultimately improve the quality of health care of Heart Disease patients. It can also be implemented using several classification techniques. REFERENCES [1] Dharminder Kumar, Deepak Bhardwaj“Rise of Data Mining: Current and Future Application”, International Journal of Computer Science, Vol- 8, Issue 5, September-2011. [2] ZdravkoMarkovIngridRussell“An Introduction to the WEKA Data Mining System”, Zdravko Markov Central Connecticut State University. [3] VikasChaurasia, SaurabhPal“Early Prediction of Heart disease using data mining techniques”,Carib.j.SciTech,2013,Vol.1,208-217 [4] NidhiBhatla, KiranJyoti“An Analysis of Heart Disease Prediction using Different Data Mining Techniques”,International Journal of Engineering Research & Technology (IJERT). [5] Jay Gholap“Performance Tuning Of J48”,Dept. of Computer Engineering College of Engineering, Pune, Maharashtra, India. [6] Shadab Adam Pattekari and AsmaParveen “Prediction System for Heart Disease Using Naive Bayes”, International Journal of Advanced Computer and Mathematical Sciences. Vol 3, Issue 3, 2012, pp 290- 294.