SlideShare a Scribd company logo
1 of 5
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 206
Comparative Analysis of Data Mining Classification Techniques for
Heart Disease Prediction
Dhara Mehta1, Nirali Varnagar2
1Student, Dept. of Computer Engineering, Atmiya Institute of Technology and Science, Rajkot, Gujarat, India
2Assistent Professor, Dept. of Computer Engineering, Atmiya Institute of Technology and Science, Rajkot,
Gujarat, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - Data Mining is the most advanced technology in
the field of computer science. It is used to discover the fruitful
information from the large amount of data, by extracting the
frequent patterns generating on database. Medical Science is
also using the techniques of data mining. In medicalfield, data
mining techniques can be used to predict the risk level of a
particular disease. In this research work, we have compared
the accuracy levels of data mining classifiers. We compared
most trending classifiers like Naïve Bayes, SVM (Support
Vector Machine), K-Star, J48 and Random Forest. In order to
examine these classifiers, we have used Data Mining tool
WEKA. We used the heart disease dataset of UCI repository.
Among all the classifiers, we got the highest accuracy in SVM
classifier by 84.07%. This paper contains the introduction of
data mining technology, its techniques and classifiers used to
predict the accuracy.
Key Words: Data Mining, Heart Disease,
Classification Techniques, WEKA, Predication.
1. INTRODUCTION
The cardiovascular disease is the biggest cause of heart
failure. Heart is an important constituent to live in human
body. If heart does not work properly, it maycausethedeath
of a patient. . High blood pressure, abnormal cholesterol,
diabetes, smoking are the major causes for heart disease[3].
The doctors and experts generally use Electrocardiogram
(ECG) for diagnosis of a patient. The hospitals and
organizations keep the records of all the patients in the
databases which remain unusable. But by applying data
mining techniquesto extractknowledgeableinformation can
help in checkup and diagnosis of a patient. This ultimately
helps in prevention and detection of cause of health disease.
Data Mining is one of the sensational fields of machine
learning which extracts hidden information from the large
databases [7]. So researchers canusedata miningtechniques
in solving these kinds of problems. The database of any
hospitals keeps records of patient’s age, sex, Bloodpressure,
cholesterol, ECG result etc. By applying data mining
classification techniques, we can predict the risk level of
Heart Disease. In this paper, there are five classification
techniques like Naïve Bayes, SVM, K-star, J48 and Random
Forest are used to analyze the accuracy.
2. Knowledge Discovery from Database
KDD refers to the process of generating useful information
from the large amount of database [1, 10]. KDD is a step by
step process of deriving knowledge.
Fig -1: Procedure of KDD
2.1 Data Cleaning
It refers to the ability to understand and correct the data
to get accurate analysis.
2.2 Data Integration
Data integration is the data preprocessing technique to
merge the data from multiple data sources into a single
database.
2.3 Data Selection and Transformation
Data selection is the process where thedata necessary are
selected from database. The data is to be transformed in
Data Cleaning
Data Integration
Data Selection and
Transformation
Data Mining
Pattern
Evaluation
Knowledge
Representation
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 207
appropriate form in whichdata miningoperationsisgoingto
be done. Sometimes, transformation process is performed
before data selection.
2.4 Data Mining
It is an important phase wherethedata miningalgorithms
is applied on dataset to generate the pattern andknowledge.
2.5 Pattern Evaluation
The final step of data mining process includes deriving
relevant information hidden in the dataset.
2.6 Knowledge Representation
Knowledge representation is about representing the
knowledge retrieved from the data mining steps. It can be
represented in terms of trees, tables, charts, matrix etc.
3. Data Mining Techniques
Data Mining techniques are use to get meaning
information from the raw data. One of the most important
steps in data mining is to choose the appropriate technique.
There are many data mining techniques that can be used for
different requirements.Someimportanttechniquesarelisted
below.
3.1 Association
Association is one of the data mining techniques which is
use to unwrap the hidden relationship between unrelated
data of relational database [1]. Association rule works on the
basis of occurrence of one particular object based on
occurrence of another object. It helps in analyzing the
customer’s behavior on shopping basket. Multilevel
association rule, multidimensional association rule and
quantitative association rule are the types of association.
3.2 Classification
Classification is a technique used to assign an item/items
to the collection of target classes. The main goal of
classification techniquesistopredictaccurately.Theaimisto
predict each class accuratelyfor each and every class of data.
In our case classificationtechniquesisusedtopredicttherisk
level of heart disease by high, medium and low.
3.3 Clustering
Clustering is the process of grouping a set of objects into
classes with similar objects. A cluster is a set of objects
having similar characteristics to one another. It means the
objects of one cluster can be treated as a whole. However it
can be said as a data compression. Clustering analysis is a
usual activity like as we learn to differentiate between cat
and dog, by repeatedly improving schemes of clustering.
Clustering can be done by different methods like partition,
hierarchical agglomerative, density based, grid based and
model based.
3.4 Regression
Regression and classification are used for prediction
analysis but the difference is that regression is used to
predictnumericandcontinuousvalueswhereasclassification
can also be used for discrete data categories. The types of
regression techniques are simple regression, multiple
regression, hierarchical regression and set wise regression.
3.5 Prediction
Prediction is a data mining techniques about predicting
the futurebehavior of an item based on currentandpastdata
to generate a prediction model. Types of prediction
techniques are linear regression, multivariate linear
regression.
4. Related Work
In this analysis task,wehaveusedWEKA3.8tool for
comparative analysis. WEKA contains machine learning
algorithms for data classification. WEKA contains tools for
classification, clustering and association. In this experiment
there is an analysis for the accuracyofdifferentclassification
algorithms and find the best suitable algorithm for heart
disease prediction. We have used the Naïve Bayes, Support
Vector Machine, K-Star, J48,RandomForestclassifierson the
heart disease dataset.
The experiment has been conducted on the Heart
disease dataset available on the UCI Repository. The dataset
contains 270 instances and 14 attributes [5]. Table 1 shows
number of attributes and its description available on UCI’s
Heart Disease dataset.
Table -1: Attributes Affected for Heart Disease
No
.
Attribute Name Type of
Attribute
Attribute
Description
1 age Numerical Age
2 Sex Numerical Sex
3 chest Numerical Chest pain type
4 resting_blood_p
ressure
Numerical Resting blood
pressure
5 Serum_cholesto
ral
Numerical Serum
Cholestoral in
mg/dl
6 Fasting_blood_s Numerical Fasting blood
sugar> 120
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 208
ugar mg/dl
7 Resting_electro
cardiographic_r
esults
Numerical Resting
electrocardiogra
phic
results(0,1,2)
8 Maximum_heart
_rate_achieved
Numerical Maximum heart
rate achieved
9 Exercise_induce
d_angina
Numerical Exercise induced
angina
10 Oldpeak Numerical Oldpeak = ST
depression
induced by
exercise relative
to rest
11 slope Numerical The slope of the
peak exercise ST
segment
12 Number_of_maj
or_vessels
Numerical Number of major
vessels(0-3)
colored by
flourosopt
13 Thal Numerical 3 = normal; 6 =
fixed defect; 7 =
reversible defect
14 class Nominal class
4.1 Classification Techniques Used in Experiment
4.1.1 Naïve Bayes Classifiers
It is based on Bayes’ theorem classification technique
which havingassumptionofindependenceamong predictors
[3]. This technique assumes that the presence of one feature
in a particular class is not related to the presence of any
other feature. Naïve Bayes model is very easy and quick to
build and can be used for a large dataset [11].
p(c|x): The posterior probability for class (c, target)
given predictor (x, attributes).
p(c) : The prior probability of class.
p (x|c) : The probability for predictor given class.
p(x) : The prior probability of predictor.
4.1.2 Support Vector Machine
Support Vector Machine (SVM) is a supervised machine
learning algorithm used for classification as well as
regression. In this algorithm, we plot data item as a point in
n-dimensional graph where each point is being a value of a
particular coordinate. Then classification algorithm is being
used to fine hyper-plane that differentiates 2 classes with
maximum margin.
4.1.3 K-star
K-star algorithm work on the problem,” Each new
instance is being compared with the existing one by using a
distance metric, and the closest instance will be used to
assign a class to the newer one”[6].
4.1.4 J48
It is a one type of decision tree algorithm used as a
classification algorithm. This algorithm is based on divide
and conquers approach [3]. J48 divides the data into a sub
range, based on existing attributes values available in
training dataset.
4.1.5 Random Forest
Random forestisa supervisedmachinelearningalgorithm
which creates forest and makes it random. The forest is an
ensemble of decision trees, mostly trained with “training
method” [12]. In simple word, Random forest algorithm
generates multiple decision trees and merges themtogether
to get the maximum accuracy and immutable prediction.
4.2 Performance Measures
4.2.1 Accuracy
Accuracy refers to the quality ofpredictionindata mining.
Accuracy and be predicted by the formula:
Accuracy=
Accuracy=
4.2.2 ROC
ROC is receiver Operating Characteristics. ROC curve is a
graphical representation of classifier’s performance.
4.2.3 Kappa Statistics
The Kappa Statistics is used to compare the accuracy of
the current system with the random system.
k= P (a)-P (e)/ (1-P (e))
P (a) = agreement percentage
P (e) = agreement chances
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 209
If the value of k is 0then it means that there is a chance of
agreement. If the value of k is 1 then we can say that
agreement is in a tolerable range of classifier.
4.2.4 RMSE
Root Mean Squared Error is defined as the difference
between predicted value and observed value.
Ei=
P (i,j) = predicted value
i = fitness
Tj = fitness applicability target value for j
4.3 Result Analysis
In this experiment, there are five classifiers are used in
WEKA tool namely Naïve Bayes, Support Vector Machine, K-
Star, J48 and Random Forest. The results generated by these
classifiers are shown in table 2. The maximum accuracy is
derived by the classifier Support Vector Machine in 0.24
seconds.
Table -2: Result Analysis from WEKA
Classifier
Accuracy
ROC
Kappa
Statistic
RMSE
Time(sec
)
Naïve Bayes 83.70 0.898 0.6683 0.3598 0.02
SVM 84.07 0.837 0.6762 0.3991 0.24
K Star 75.18 0.8939 0.4937 0.4438 0
J48 76.67 0.744 0.5271 0.4601 0.14
Random
Forest
81.48 0.899 0.6244 0.3587 0.35
4.3.1 Classifiers Accuracy
Chart 1 shows the graph of accuracy for different
classifiers. It shows that Naïve Bayes and SVM gives the best
accuracy by 83.70% and 84.07% respectively.
Chart -1: Classifiers Accuracy
4.3.2 ROC and Time
ROC curve shows good results for SVM and Random
Forest whereas poor results for K-Star and J48.Howeverthe
time taken by K-Star and J48 is lesser than SVM andRandom
Forest.
Chart -2: ROC and Time to build Model
4.3.3 K Statistic and RMSE
Chart 3 shows the better values of Kappa statistics for
Naïve Bayes and SVM with lesser error as RMSE.
Chart -3: Kappa Statistic and RMSE
4.3.4 Accuracy Measures
Some other measures are also considered for prediction
or classification. TP Rate, FP Rate, Precision, Recall and F-
measure are majorly used measures.
TP Rate is the true positive rate which denotes the
number of instances correctly classified and FP Rate is the
false positive rate which denotes the number of instances
that are wrongly classified.
Precision only considers positive cases as P=tp/ (tp+fp).
Where tp denotes true positive rate and fp denotes false
positive rate.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 210
The recall represents the proportion of real positive
values which is denoted by R=tp/ (tp+fn).
F-measure is the combination of precision and recall. F-
measure =2*((precision*recall) / precision+recall).
Table -3: Major Values for Accuracy Measure
Classifier
Name
TP
Rate
FP
Rate
Precision Recall F-
Measure
Naïve
Bayes
0.847 0.172 0.837 0.837 0.837
SVM 0.841 0.167 0.841 0.841 0.840
K-star 0.752 0.262 0.751 0.752 0.751
J48 0.767 0.240 0.766 0.767 0.767
Random
Forest
0.815 0.191 0.815 0.815 0.815
5. CONCLUSION
In this research work, we have examined data mining
classifiers on heart disease dataset. In comparison of 5
classifiers namely Naïve Bayes,SVM,K-Star,J48andRandom
Forest; we got the highest accuracy by SVM classifier with
84.07%. The classifiers have been also compared by ROC, K-
statistics and RMSE. In future better classifiers can be
developed in order to get higher accuracy for more datasets
of any particular disease.
REFERENCES
[1] Narender Kumar, Sabita Khatri, “Implementing WEKA
for medical data classification and early disease
prediction”, In 3rd IEEE Conference on Computational
IntelligenceandCommunicationTechnology(IEEE-CICT
2017)
[2] Meenal Saini, Niyati Baliyan, Vineeta Bassi, “Prediction
of Heart Disease Severity with Hybrid Data Mining”
2017 2nd International Conference on
Telecommunication and Networks(TEL-NET 2017)
[3] Theresa Princy. R, J. Thomas, “Human Heart Disease
Prediction System using Data Mining Techniques” 2016
International Conference on Circuit, Power and
Computing Technologies [ICCPCT]
[4] Salma Banu N.K and Suma Swamy, “Prediction of Heart
Disease at early stage using Data Mining and Big Data
Analytics : A Survey” 2016 International Conference on
Electrical, Electronics, Communication, Computer and
Optimization Techniques.
[5] Amita Malav, Kalyani Kadam and Pooja Kamat,
“PREDICTION OF HEART DISEASE USINGK-MEANSand
Artificial NEURAL NETWORKasHYBRIDAPPROACHTO
IMPROVE ACCURACY ” International Journal of
Engineering and Technology(IJET) Vol 9 No 4-Sep 2017
[6] Rashmi Madhukar Jadhav and Ms. Roshani Ade,
“Analytical Approach to Predictive Disease Diagnosis
using K-nn and Kstar” International Journal ofComputer
Science and Information Technology, Vol. 6(2), 2015,
1876-1879
[7] S. M. M. Hasan, A. Mamun, M. P. Uddin and M. A. Hossain,
“Comparetive Analysis Of Classification Approach for
Heart Disease Prediction” IEEE-2018
[8] K. Prasanna Lakshmi and Dr. C. R. K. Reddy, “Fast Rule-
Based Heart Disease Prediction using Associative
Classification Mining” IEEE International Conferenceon
Computer, Communication and Control (IC4-2015)
[9] Mrs.S.Radhimeenakshi, “Classification andPredictionof
Heart Disease Risk Using Data Mining Techniques of
Support Vector Machine and Artificial Neural Network”
IEEE-2016
[10] Hamidrza Ashrafi Esfahani, Morteza Ghazanfari,
“Cardiovascular diseasedetectionusinga new ensemble
classifier” 2017 IEEE International conference on
knowledge-Based Engineering and Innovation (KBEI)
[11] M. A. Jabbar, Shirina Samreen, “HeartDiseasePrediction
System based on hidden naïve bayes classifier” IEEE-
2016 International Conference on Circuits, Controls,
Communications and Computing (I4C)
[12] Theresa Princy R and J. Thomas, “Human Heart Disease
Prediction System using Data Mining Techniques” 2016
International Conference on Circuit, Power and
Computing Technologies (IEEE-March 2016)

More Related Content

What's hot

IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET Journal
 
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET Journal
 
IRJET- GDPS - General Disease Prediction System
IRJET- GDPS - General Disease Prediction SystemIRJET- GDPS - General Disease Prediction System
IRJET- GDPS - General Disease Prediction SystemIRJET Journal
 
Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...IRJET Journal
 
Study of Data Mining Methods and its Applications
Study of  Data Mining Methods and its ApplicationsStudy of  Data Mining Methods and its Applications
Study of Data Mining Methods and its ApplicationsIRJET Journal
 
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop ClusterIRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop ClusterIRJET Journal
 
An efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysisAn efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysisjournalBEEI
 
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET Journal
 
Medic - Artificially Intelligent System for Healthcare Services ...
Medic - Artificially Intelligent System for Healthcare Services              ...Medic - Artificially Intelligent System for Healthcare Services              ...
Medic - Artificially Intelligent System for Healthcare Services ...IRJET Journal
 
IRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare ApplicationsIRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare ApplicationsIRJET Journal
 
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONMULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONIJDKP
 
Comparative Study of Classification Method on Customer Candidate Data to Pred...
Comparative Study of Classification Method on Customer Candidate Data to Pred...Comparative Study of Classification Method on Customer Candidate Data to Pred...
Comparative Study of Classification Method on Customer Candidate Data to Pred...IJECEIAES
 
The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...
The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...
The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...IJECEIAES
 
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...IRJET Journal
 
Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...
Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...
Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...ijtsrd
 
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREE
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREEA ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREE
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREEijcsa
 

What's hot (20)

IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
 
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current Approaches
 
IRJET- GDPS - General Disease Prediction System
IRJET- GDPS - General Disease Prediction SystemIRJET- GDPS - General Disease Prediction System
IRJET- GDPS - General Disease Prediction System
 
Ez36937941
Ez36937941Ez36937941
Ez36937941
 
Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...
 
50120130406032
5012013040603250120130406032
50120130406032
 
V34132136
V34132136V34132136
V34132136
 
Study of Data Mining Methods and its Applications
Study of  Data Mining Methods and its ApplicationsStudy of  Data Mining Methods and its Applications
Study of Data Mining Methods and its Applications
 
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop ClusterIRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
 
An efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysisAn efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysis
 
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN AlgorithmIRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Prediction of Heart Disease using RNN Algorithm
 
Medic - Artificially Intelligent System for Healthcare Services ...
Medic - Artificially Intelligent System for Healthcare Services              ...Medic - Artificially Intelligent System for Healthcare Services              ...
Medic - Artificially Intelligent System for Healthcare Services ...
 
IRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare ApplicationsIRJET- Advances in Data Mining: Healthcare Applications
IRJET- Advances in Data Mining: Healthcare Applications
 
COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...
COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...
COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...
 
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONMULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
 
Comparative Study of Classification Method on Customer Candidate Data to Pred...
Comparative Study of Classification Method on Customer Candidate Data to Pred...Comparative Study of Classification Method on Customer Candidate Data to Pred...
Comparative Study of Classification Method on Customer Candidate Data to Pred...
 
The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...
The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...
The Analysis of Performace Model Tiered Artificial Neural Network for Assessm...
 
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
 
Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...
Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...
Comparative Study on Machine Learning Algorithms for Network Intrusion Detect...
 
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREE
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREEA ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREE
A ROBUST APPROACH FOR DATA CLEANING USED BY DECISION TREE
 

Similar to IRJET- Comparative Analysis of Data Mining Classification Techniques for Heart Disease Prediction

PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNINGPREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNINGIRJET Journal
 
IRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning AlgorithmIRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning AlgorithmIRJET Journal
 
Heart Disease Prediction Using Random Forest Algorithm
Heart Disease Prediction Using Random Forest AlgorithmHeart Disease Prediction Using Random Forest Algorithm
Heart Disease Prediction Using Random Forest AlgorithmIRJET Journal
 
Smart Health Guide App
Smart Health Guide AppSmart Health Guide App
Smart Health Guide AppIRJET Journal
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYEditor IJMTER
 
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNINGPARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNINGIRJET Journal
 
Network Intrusion Detection System Based on Modified Random Forest Classifier...
Network Intrusion Detection System Based on Modified Random Forest Classifier...Network Intrusion Detection System Based on Modified Random Forest Classifier...
Network Intrusion Detection System Based on Modified Random Forest Classifier...IRJET Journal
 
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET Journal
 
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET Journal
 
TERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTIONTERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTIONIRJET Journal
 
IRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction SystemIRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction SystemIRJET Journal
 
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHM
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHMCARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHM
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHMIRJET Journal
 
Heart Disease Prediction Using Multi Feature and Hybrid Approach
Heart Disease Prediction Using Multi Feature and Hybrid ApproachHeart Disease Prediction Using Multi Feature and Hybrid Approach
Heart Disease Prediction Using Multi Feature and Hybrid ApproachIRJET Journal
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...IRJET Journal
 
IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...
IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...
IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...IRJET Journal
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET Journal
 
Diabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning AlgorithmsDiabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning AlgorithmsIRJET Journal
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETEditor IJMTER
 
Data mining techniques
Data mining techniquesData mining techniques
Data mining techniqueseSAT Journals
 

Similar to IRJET- Comparative Analysis of Data Mining Classification Techniques for Heart Disease Prediction (20)

PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNINGPREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
 
Ijetcas14 338
Ijetcas14 338Ijetcas14 338
Ijetcas14 338
 
IRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning AlgorithmIRJET- Predicting Heart Disease using Machine Learning Algorithm
IRJET- Predicting Heart Disease using Machine Learning Algorithm
 
Heart Disease Prediction Using Random Forest Algorithm
Heart Disease Prediction Using Random Forest AlgorithmHeart Disease Prediction Using Random Forest Algorithm
Heart Disease Prediction Using Random Forest Algorithm
 
Smart Health Guide App
Smart Health Guide AppSmart Health Guide App
Smart Health Guide App
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
 
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNINGPARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
 
Network Intrusion Detection System Based on Modified Random Forest Classifier...
Network Intrusion Detection System Based on Modified Random Forest Classifier...Network Intrusion Detection System Based on Modified Random Forest Classifier...
Network Intrusion Detection System Based on Modified Random Forest Classifier...
 
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
 
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
 
TERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTIONTERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTION
 
IRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction SystemIRJET- Heart Disease Prediction System
IRJET- Heart Disease Prediction System
 
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHM
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHMCARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHM
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHM
 
Heart Disease Prediction Using Multi Feature and Hybrid Approach
Heart Disease Prediction Using Multi Feature and Hybrid ApproachHeart Disease Prediction Using Multi Feature and Hybrid Approach
Heart Disease Prediction Using Multi Feature and Hybrid Approach
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
 
IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...
IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...
IRJET-A Hybrid Intrusion Detection Technique based on IRF & AODE for KDD-CUP ...
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data Mining
 
Diabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning AlgorithmsDiabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning Algorithms
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
 
Data mining techniques
Data mining techniquesData mining techniques
Data mining techniques
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 

Recently uploaded (20)

Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 

IRJET- Comparative Analysis of Data Mining Classification Techniques for Heart Disease Prediction

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 206 Comparative Analysis of Data Mining Classification Techniques for Heart Disease Prediction Dhara Mehta1, Nirali Varnagar2 1Student, Dept. of Computer Engineering, Atmiya Institute of Technology and Science, Rajkot, Gujarat, India 2Assistent Professor, Dept. of Computer Engineering, Atmiya Institute of Technology and Science, Rajkot, Gujarat, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - Data Mining is the most advanced technology in the field of computer science. It is used to discover the fruitful information from the large amount of data, by extracting the frequent patterns generating on database. Medical Science is also using the techniques of data mining. In medicalfield, data mining techniques can be used to predict the risk level of a particular disease. In this research work, we have compared the accuracy levels of data mining classifiers. We compared most trending classifiers like Naïve Bayes, SVM (Support Vector Machine), K-Star, J48 and Random Forest. In order to examine these classifiers, we have used Data Mining tool WEKA. We used the heart disease dataset of UCI repository. Among all the classifiers, we got the highest accuracy in SVM classifier by 84.07%. This paper contains the introduction of data mining technology, its techniques and classifiers used to predict the accuracy. Key Words: Data Mining, Heart Disease, Classification Techniques, WEKA, Predication. 1. INTRODUCTION The cardiovascular disease is the biggest cause of heart failure. Heart is an important constituent to live in human body. If heart does not work properly, it maycausethedeath of a patient. . High blood pressure, abnormal cholesterol, diabetes, smoking are the major causes for heart disease[3]. The doctors and experts generally use Electrocardiogram (ECG) for diagnosis of a patient. The hospitals and organizations keep the records of all the patients in the databases which remain unusable. But by applying data mining techniquesto extractknowledgeableinformation can help in checkup and diagnosis of a patient. This ultimately helps in prevention and detection of cause of health disease. Data Mining is one of the sensational fields of machine learning which extracts hidden information from the large databases [7]. So researchers canusedata miningtechniques in solving these kinds of problems. The database of any hospitals keeps records of patient’s age, sex, Bloodpressure, cholesterol, ECG result etc. By applying data mining classification techniques, we can predict the risk level of Heart Disease. In this paper, there are five classification techniques like Naïve Bayes, SVM, K-star, J48 and Random Forest are used to analyze the accuracy. 2. Knowledge Discovery from Database KDD refers to the process of generating useful information from the large amount of database [1, 10]. KDD is a step by step process of deriving knowledge. Fig -1: Procedure of KDD 2.1 Data Cleaning It refers to the ability to understand and correct the data to get accurate analysis. 2.2 Data Integration Data integration is the data preprocessing technique to merge the data from multiple data sources into a single database. 2.3 Data Selection and Transformation Data selection is the process where thedata necessary are selected from database. The data is to be transformed in Data Cleaning Data Integration Data Selection and Transformation Data Mining Pattern Evaluation Knowledge Representation
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 207 appropriate form in whichdata miningoperationsisgoingto be done. Sometimes, transformation process is performed before data selection. 2.4 Data Mining It is an important phase wherethedata miningalgorithms is applied on dataset to generate the pattern andknowledge. 2.5 Pattern Evaluation The final step of data mining process includes deriving relevant information hidden in the dataset. 2.6 Knowledge Representation Knowledge representation is about representing the knowledge retrieved from the data mining steps. It can be represented in terms of trees, tables, charts, matrix etc. 3. Data Mining Techniques Data Mining techniques are use to get meaning information from the raw data. One of the most important steps in data mining is to choose the appropriate technique. There are many data mining techniques that can be used for different requirements.Someimportanttechniquesarelisted below. 3.1 Association Association is one of the data mining techniques which is use to unwrap the hidden relationship between unrelated data of relational database [1]. Association rule works on the basis of occurrence of one particular object based on occurrence of another object. It helps in analyzing the customer’s behavior on shopping basket. Multilevel association rule, multidimensional association rule and quantitative association rule are the types of association. 3.2 Classification Classification is a technique used to assign an item/items to the collection of target classes. The main goal of classification techniquesistopredictaccurately.Theaimisto predict each class accuratelyfor each and every class of data. In our case classificationtechniquesisusedtopredicttherisk level of heart disease by high, medium and low. 3.3 Clustering Clustering is the process of grouping a set of objects into classes with similar objects. A cluster is a set of objects having similar characteristics to one another. It means the objects of one cluster can be treated as a whole. However it can be said as a data compression. Clustering analysis is a usual activity like as we learn to differentiate between cat and dog, by repeatedly improving schemes of clustering. Clustering can be done by different methods like partition, hierarchical agglomerative, density based, grid based and model based. 3.4 Regression Regression and classification are used for prediction analysis but the difference is that regression is used to predictnumericandcontinuousvalueswhereasclassification can also be used for discrete data categories. The types of regression techniques are simple regression, multiple regression, hierarchical regression and set wise regression. 3.5 Prediction Prediction is a data mining techniques about predicting the futurebehavior of an item based on currentandpastdata to generate a prediction model. Types of prediction techniques are linear regression, multivariate linear regression. 4. Related Work In this analysis task,wehaveusedWEKA3.8tool for comparative analysis. WEKA contains machine learning algorithms for data classification. WEKA contains tools for classification, clustering and association. In this experiment there is an analysis for the accuracyofdifferentclassification algorithms and find the best suitable algorithm for heart disease prediction. We have used the Naïve Bayes, Support Vector Machine, K-Star, J48,RandomForestclassifierson the heart disease dataset. The experiment has been conducted on the Heart disease dataset available on the UCI Repository. The dataset contains 270 instances and 14 attributes [5]. Table 1 shows number of attributes and its description available on UCI’s Heart Disease dataset. Table -1: Attributes Affected for Heart Disease No . Attribute Name Type of Attribute Attribute Description 1 age Numerical Age 2 Sex Numerical Sex 3 chest Numerical Chest pain type 4 resting_blood_p ressure Numerical Resting blood pressure 5 Serum_cholesto ral Numerical Serum Cholestoral in mg/dl 6 Fasting_blood_s Numerical Fasting blood sugar> 120
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 208 ugar mg/dl 7 Resting_electro cardiographic_r esults Numerical Resting electrocardiogra phic results(0,1,2) 8 Maximum_heart _rate_achieved Numerical Maximum heart rate achieved 9 Exercise_induce d_angina Numerical Exercise induced angina 10 Oldpeak Numerical Oldpeak = ST depression induced by exercise relative to rest 11 slope Numerical The slope of the peak exercise ST segment 12 Number_of_maj or_vessels Numerical Number of major vessels(0-3) colored by flourosopt 13 Thal Numerical 3 = normal; 6 = fixed defect; 7 = reversible defect 14 class Nominal class 4.1 Classification Techniques Used in Experiment 4.1.1 Naïve Bayes Classifiers It is based on Bayes’ theorem classification technique which havingassumptionofindependenceamong predictors [3]. This technique assumes that the presence of one feature in a particular class is not related to the presence of any other feature. Naïve Bayes model is very easy and quick to build and can be used for a large dataset [11]. p(c|x): The posterior probability for class (c, target) given predictor (x, attributes). p(c) : The prior probability of class. p (x|c) : The probability for predictor given class. p(x) : The prior probability of predictor. 4.1.2 Support Vector Machine Support Vector Machine (SVM) is a supervised machine learning algorithm used for classification as well as regression. In this algorithm, we plot data item as a point in n-dimensional graph where each point is being a value of a particular coordinate. Then classification algorithm is being used to fine hyper-plane that differentiates 2 classes with maximum margin. 4.1.3 K-star K-star algorithm work on the problem,” Each new instance is being compared with the existing one by using a distance metric, and the closest instance will be used to assign a class to the newer one”[6]. 4.1.4 J48 It is a one type of decision tree algorithm used as a classification algorithm. This algorithm is based on divide and conquers approach [3]. J48 divides the data into a sub range, based on existing attributes values available in training dataset. 4.1.5 Random Forest Random forestisa supervisedmachinelearningalgorithm which creates forest and makes it random. The forest is an ensemble of decision trees, mostly trained with “training method” [12]. In simple word, Random forest algorithm generates multiple decision trees and merges themtogether to get the maximum accuracy and immutable prediction. 4.2 Performance Measures 4.2.1 Accuracy Accuracy refers to the quality ofpredictionindata mining. Accuracy and be predicted by the formula: Accuracy= Accuracy= 4.2.2 ROC ROC is receiver Operating Characteristics. ROC curve is a graphical representation of classifier’s performance. 4.2.3 Kappa Statistics The Kappa Statistics is used to compare the accuracy of the current system with the random system. k= P (a)-P (e)/ (1-P (e)) P (a) = agreement percentage P (e) = agreement chances
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 209 If the value of k is 0then it means that there is a chance of agreement. If the value of k is 1 then we can say that agreement is in a tolerable range of classifier. 4.2.4 RMSE Root Mean Squared Error is defined as the difference between predicted value and observed value. Ei= P (i,j) = predicted value i = fitness Tj = fitness applicability target value for j 4.3 Result Analysis In this experiment, there are five classifiers are used in WEKA tool namely Naïve Bayes, Support Vector Machine, K- Star, J48 and Random Forest. The results generated by these classifiers are shown in table 2. The maximum accuracy is derived by the classifier Support Vector Machine in 0.24 seconds. Table -2: Result Analysis from WEKA Classifier Accuracy ROC Kappa Statistic RMSE Time(sec ) Naïve Bayes 83.70 0.898 0.6683 0.3598 0.02 SVM 84.07 0.837 0.6762 0.3991 0.24 K Star 75.18 0.8939 0.4937 0.4438 0 J48 76.67 0.744 0.5271 0.4601 0.14 Random Forest 81.48 0.899 0.6244 0.3587 0.35 4.3.1 Classifiers Accuracy Chart 1 shows the graph of accuracy for different classifiers. It shows that Naïve Bayes and SVM gives the best accuracy by 83.70% and 84.07% respectively. Chart -1: Classifiers Accuracy 4.3.2 ROC and Time ROC curve shows good results for SVM and Random Forest whereas poor results for K-Star and J48.Howeverthe time taken by K-Star and J48 is lesser than SVM andRandom Forest. Chart -2: ROC and Time to build Model 4.3.3 K Statistic and RMSE Chart 3 shows the better values of Kappa statistics for Naïve Bayes and SVM with lesser error as RMSE. Chart -3: Kappa Statistic and RMSE 4.3.4 Accuracy Measures Some other measures are also considered for prediction or classification. TP Rate, FP Rate, Precision, Recall and F- measure are majorly used measures. TP Rate is the true positive rate which denotes the number of instances correctly classified and FP Rate is the false positive rate which denotes the number of instances that are wrongly classified. Precision only considers positive cases as P=tp/ (tp+fp). Where tp denotes true positive rate and fp denotes false positive rate.
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 210 The recall represents the proportion of real positive values which is denoted by R=tp/ (tp+fn). F-measure is the combination of precision and recall. F- measure =2*((precision*recall) / precision+recall). Table -3: Major Values for Accuracy Measure Classifier Name TP Rate FP Rate Precision Recall F- Measure Naïve Bayes 0.847 0.172 0.837 0.837 0.837 SVM 0.841 0.167 0.841 0.841 0.840 K-star 0.752 0.262 0.751 0.752 0.751 J48 0.767 0.240 0.766 0.767 0.767 Random Forest 0.815 0.191 0.815 0.815 0.815 5. CONCLUSION In this research work, we have examined data mining classifiers on heart disease dataset. In comparison of 5 classifiers namely Naïve Bayes,SVM,K-Star,J48andRandom Forest; we got the highest accuracy by SVM classifier with 84.07%. The classifiers have been also compared by ROC, K- statistics and RMSE. In future better classifiers can be developed in order to get higher accuracy for more datasets of any particular disease. REFERENCES [1] Narender Kumar, Sabita Khatri, “Implementing WEKA for medical data classification and early disease prediction”, In 3rd IEEE Conference on Computational IntelligenceandCommunicationTechnology(IEEE-CICT 2017) [2] Meenal Saini, Niyati Baliyan, Vineeta Bassi, “Prediction of Heart Disease Severity with Hybrid Data Mining” 2017 2nd International Conference on Telecommunication and Networks(TEL-NET 2017) [3] Theresa Princy. R, J. Thomas, “Human Heart Disease Prediction System using Data Mining Techniques” 2016 International Conference on Circuit, Power and Computing Technologies [ICCPCT] [4] Salma Banu N.K and Suma Swamy, “Prediction of Heart Disease at early stage using Data Mining and Big Data Analytics : A Survey” 2016 International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques. [5] Amita Malav, Kalyani Kadam and Pooja Kamat, “PREDICTION OF HEART DISEASE USINGK-MEANSand Artificial NEURAL NETWORKasHYBRIDAPPROACHTO IMPROVE ACCURACY ” International Journal of Engineering and Technology(IJET) Vol 9 No 4-Sep 2017 [6] Rashmi Madhukar Jadhav and Ms. Roshani Ade, “Analytical Approach to Predictive Disease Diagnosis using K-nn and Kstar” International Journal ofComputer Science and Information Technology, Vol. 6(2), 2015, 1876-1879 [7] S. M. M. Hasan, A. Mamun, M. P. Uddin and M. A. Hossain, “Comparetive Analysis Of Classification Approach for Heart Disease Prediction” IEEE-2018 [8] K. Prasanna Lakshmi and Dr. C. R. K. Reddy, “Fast Rule- Based Heart Disease Prediction using Associative Classification Mining” IEEE International Conferenceon Computer, Communication and Control (IC4-2015) [9] Mrs.S.Radhimeenakshi, “Classification andPredictionof Heart Disease Risk Using Data Mining Techniques of Support Vector Machine and Artificial Neural Network” IEEE-2016 [10] Hamidrza Ashrafi Esfahani, Morteza Ghazanfari, “Cardiovascular diseasedetectionusinga new ensemble classifier” 2017 IEEE International conference on knowledge-Based Engineering and Innovation (KBEI) [11] M. A. Jabbar, Shirina Samreen, “HeartDiseasePrediction System based on hidden naïve bayes classifier” IEEE- 2016 International Conference on Circuits, Controls, Communications and Computing (I4C) [12] Theresa Princy R and J. Thomas, “Human Heart Disease Prediction System using Data Mining Techniques” 2016 International Conference on Circuit, Power and Computing Technologies (IEEE-March 2016)