SlideShare a Scribd company logo
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 3, Ver. IV (May-Jun. 2016), PP 87-91
www.iosrjournals.org
DOI: 10.9790/0661-1803048791 www.iosrjournals.org 87 | Page
Heart Disease Detection using EKSTRAP Clustering with
Statistical and Distance based Classifiers
Terence Johnson1
, Dr. S. K. Singh2
, Vaishnavi Kamat3
, Aishwarya Joshi4
,
Lester D‟Souza4
, Poohar Amonkar4
, Devyani Joshi4
, Anirudha Kulkarni4
1
(Ph.D Scholar, Information Technology Department, AMET University, India)
1
(Asst. Prof., Computer Engineering Department, AITD, Goa University, India)
2
(Head, Information Technology Department, TCSC, Mumbai University, India)
3
(Asst. Prof., Computer Engineering Department, AITD, Goa University, India)
4
(Students, Computer Engineering Department, AITD, Goa University, India)
Abstract : The heart is the most important organ in the human body which pumps blood to various parts of the
body. If there is inefficient circulation of blood in body organs like brain will suffer. If heart stops pumping
blood it results in death. An individual’s life is very much dependent on how efficiently the heart works. Using
data mining technique proposed in this paper we are trying to detect if a patient has heart disease or not. The
system uses 13 attributes like age, gender, blood pressure, cholesterol etc to detect the same. The system uses a
hybrid technique which uses Enhanced K STRAnge Points(EKSTRAP) clustering algorithm , output of which is
given to different classifiers like statistical –Naïve Bayes classifier and Distance Based – MSDC (Modified
Simple Distance Classifier ).
Keywords: Enhanced K Strange, clustering, Heart Disease, Naïve Bayes Classifier
I. Introduction
When we have large pre-existing dataset, we can examine it and generate new information previously
not observed by processing the data. For processing we can use different data mining techniques, in this paper
clustering and classification techniques are used [1]. Clustering is an unsupervised process of grouping similar
objects from a given dataset. The similarity is determined using many techniques like Euclidian Distance [2].
Clustering is achieved by the Enhanced k strange points clustering algorithm [3]. Classification on the contrary
is a supervised technique to check, to which group the point belongs to given the groups. Statistical
classification [4] and Distance based classification [5] techniques are used. Naïve Bayes [6], a simple
probabilistic classifier is the statistical classifier used and the Modified Simple Distance classifier is the
Distance based classifier used in this paper. Heart Disease Detection [7] tells whether the new data object has
heart disease or not based on the training data given to the classifier. The training data is obtained by clustering.
The dataset used is cleveland dataset which is taken from the UCI Repository. The block diagram below
explains the flow of the implemented system. Entire dataset is grouped into classes using clustering algorithm.
The output of the clustering algorithm along with the new tuple is given to the classifier which then detects to
which class the new tuple belongs.
Fig.1.Block Diagram
The clustering and classification is done based on the following attributes:-
1. Age-Age in years.
2. Gender-
a. 1 is male
b. 0 is female.
3. Cp –Chest Pain type
a. value 1-Typical angina
b. value 2-Atyical angina
c. value 3-Non Anginal pain
d. value 4 -Asymptomatic
Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers
DOI: 10.9790/0661-1803048791 www.iosrjournals.org 88 | Page
4. trestbps-resting blood pressure(in mm Hg on admission to the hospital)
5. chol-(Cholestrol) serum cholesterol in mg/dl.
6. fbs- (fasting blood sugar)
a. if >120 mg/dl then 1
b. else 0
7. restecg-resting electrocardiographic results.
a. Value 0 –normal
b. Value 1-having ST-T wave abnormality
c. Value2 –showing probable or definite left ventricular hypertrophy
8. Thalach-maximum heart rate achieved
9. Exang- exercise induced angina
a. 1-yes
b. 0-no
10. Oldpeak-ST depression induced by exercise relative to rest
11. Slope-the slope of the peak exercise ST segment.
a. Value1-Upsloping
b. Value2-Flat
c. Value3-downsloping
12. Ca-number of major vessels(0-3) colored by flouroscopy
13. Thal-
a. 3-Normal
b. 6-Fixed Detect
c. 7-reverable detect
14. Num-diagnosis of heart disease
II. Motivation
If the clustering accuracy is improved then the classification result will improve. The existing system
uses k means for clustering. K Means has many limitations. When the number of attributes increase the number
of iterations increases to converge and sometimes it may not converge, so we may have to forcefully stop the
computation using stopping criteria„t‟ (maximum number of iterations ). This may lead to wrong clusters. So to
improve the efficiency of the classifier we replaced k means with enhanced k strange points clustering
algorithm. The benefit of using EKSTRAP (Enhanced K STRAnge Points clustering algorithm) is that there is
no need of iterations so it converges faster. And even when the number of attributes is more, the performance of
EKSTRAP (Enhanced K STRAnge Points clustering algorithm) is better.
III. Proposed Methodology
Here we have replaced K-Means with EKSTRAP (Enhanced K STRAnge Points clustering algorithm).
Input:-Heart Disease Dataset.
Output: - Tells if the new tuple falls in cluster 0 or cluster 1.
Step 1:-Dataset is given as input to Enhanced k strange points clustering algorithm, which will form two
clusters, class 0 and class 1.
Cluster 0- No Heart disease.
Cluster 1-Has Heart disease.
This will act as training data.
Step 2 :- Using this training data, whenever a new tuple is given using Naïve Bayes classifier we try to detect to
which class the newly entered tuple belongs.
Enhanced K strange points clustering algorithm (EKSTRAP)
Input:-Number of clusters required (N) and dataset.
Output:-Set of N clusters.
Step 1:- Find the point which will be at the minimum distance from origin “min” using Euclidian Distance.
Step 2:-Find the point which will be at the maximum distance from” min” call it “max”.
Step 3:-Now find the strange point “strange” which will at maximum distance from “min” and “max”.
Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers
DOI: 10.9790/0661-1803048791 www.iosrjournals.org 89 | Page
Step 4:-This Strange point may not be equidistant from “min” and “max” so we make it equidistant by applying
one of the following formulae.
If point is near to “min”:- Str=Str(prev)+Xm[max-str(prev)]/[N-1]
If point is near to “max”:- Str=min+Xm[max-min]/[N-1]
Xm ranges from 1,2,3,…..to N-2.
Step 5:- Repeat step 4 until all N strange points are located.
Step 6:- Assign remaining points from dataset to these N different clusters.
Step 7:-Output the N clusters.
IV. Experimental Results
Clustering: The entire dataset contains 297 tuples. For this dataset after clustering using K-Means,
Class 0 contains 186 tuples and Class 1 contains 111. This result was compared to the actual dataset, in Class 0,
110 tuples were clustered correctly and in Class 1, 61. The same dataset when given to EKSTRAP Clustering
Algorithm, Class 0 contains 193 tuples and Class 1 contains 104 .This result when compared to the actual
dataset 144 tuples in Class 0 and 88 in Class 1 were correctly clustered. The actual dataset contains 160 in Class
0 and 137 tuples in Class 1.
Table.1. Clustering Output
Algorithm Used Class 0 Class 1
Actual 160 137
K-Means 186 111
EKSTRAP 193 104
Tabel.2. Clustering Accuracy
Algorithm Used Class 0 Class 1
K-Means 68.75% 44.53%
EKSTRAP 90.00% 64.23%
Statistical Classifier:
Naïve Bayes Classifier: The Naive Bayes is a conditional probability system. Given a unlabeled tuple
to be classified, represented by a vector x=(x1,…..,xn) representing some n independent dimension, it assigns to
this the tuple probabilities
for each of K possible categories
The issue with the above design is that if the number of dimensions n is large or if a dimension can take
on a large number of values, then building such a model on probability tables is not viable. We therefore
reframe the model to make it more compliant. Using Bayes' theorem, the conditional probability can be
decomposed as
In simple terms, using Bayesian probability jargon, the above equation can be written as
For classification 10 tuples were removed, these 10 and the remaining 287 tuples were clustered
separately using K-Means and EKSTRAP Clustering Algorithm. The clusters formed by the 287 tuples were
given to the Naive Bayes Classification as training data. Then these tuples were given to the classification
algorithm. When the 287 tuples were clustered, Class 0 had 180 tuples and Class1 had 107 tuples using k-
means, whereas using EKSTRAP, class 0 had 185 and class 1 had 102.
Table.3. Classification Output (K-Means)
k-Means: Clustering o/p Classification o/p using k-means Comment
Class 0 Class 1 Wrong
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers
DOI: 10.9790/0661-1803048791 www.iosrjournals.org 90 | Page
Class 1 Class 1 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 1 Class 0 Wrong
Table.4. Classification Output(EKSTRAP)
EKSTRAP: Clustering o/p Classification o/p using EKSTRAP Comment
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 0 Class 1 Wrong
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Table.5. Classification Accuracy
Training Data Accuracy
o/p of K-Means 80%
o/p of EKSTRAP 90%
Distance based Classifier:
Modified Simple Distance Classifier: In the simple distance classifier the means of each of the classes
are computed and the distance of the unlabelled tuple from each of these means is calculated. The unlabelled
tuple is assigned to the class whose mean is closest to the unlabelled tuple.
In the modified simple distance classifier the minimum of one class and the maximum distance from
the minimum of another class is computed. If there are three classes a point in the third class is found which is
farthest from the minimum and the maximum of the earlier two classes. The distance of the unlabelled tuple
from each of these farthest points is calculated. The unlabelled tuple is assigned to the class having the
minimum if distance from the unlabelled tuple to the minimum is lesser that its distance to the maximum and
third farthest point.
If however the distance of the unlabelled tuple is lesser to the maximum than to the minimum and 3rd
farthest point, it is assigned to the class having the maximum. Similarly, if the distance of the unlabelled tuple is
lesser to the 3rd
farthest point than to the minimum and maximum, it is assigned to the class having the 3rd
farthest point.
For classification 25 tuples were extracted from the dataset and these 25 and the remaining 272 tuples
were clustered separately using K-Means and EKSTRAP Clustering Algorithm. The clusters formed by the 272
tuples were given to the Simple Distance Classifier as training data. Then these tuples were given one by one to
the classification algorithm.
When the 272 tuples were clustered, Class 0 had 110 tuples and Class1 had 177 tuples using k-means, whereas
using EKSTRAP, class 0 had 178 and class 1 had 94.
Table.6. Classification Output (EKSTRAP)
EKSTRAP: Clustering o/p Classification o/p using EKSTRAP Comment
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers
DOI: 10.9790/0661-1803048791 www.iosrjournals.org 91 | Page
Class 1 Class 1 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Class 0 Class 0 Correct
Class 1 Class 1 Correct
Class 0 Class 0 Correct
Table.7. Modified Simple Distance Classification Accuracy
Training Data Accuracy
o/p of EKSTRAP 100%
V. Conclusion
In this paper it was proved that improving the clustering algorithm, the output of classification can be
improved. The existing system used k-means clustering algorithm which was replaced by EKSTRAP (Enhanced
K STRAnge Points) clustering algorithm. The accuracy of the algorithms are 68.75% for class 0, 44.53% for
class 1 for k means and 90.00% for class 0 and 64.23% for class 1 using EKSTRAP. EKSTRAP clustering
algorithm eliminates the randomness used while selecting the centroid for k means. The accuracy of the Naïve
Bayes classifier also improves, as it classifies 8 out of the 10 tuples correctly using k-means output as training
data whereas while using EKSTRAP clustering algorithm it classifies 9 out of the 10 tuples correctly. The
accuracy with the modified simple distance classifier is also 100% as it correctly classifies 15 out of the 15 class
0 tuples and 10 out of the 10 class 1 tuples accurately. Hence the proposed system accuracy is good. In our
future work, this can further enhanced and expanded. For detecting heart disease the number of attributes can be
increased.
References
[1]. David Hand, Heikki Mannila and Padhraic Smyth, principles of data mining, Cambridge, MA: MIT Press, (2001).
[2]. Jiawei Han and Micheline Kamber, data mining - concepts and techniques, Elsevier, Second Edition, Original ISBN: 978-1-
55860-901-3, Indian Reprint ISBN: 978-81-3120535-8
[3]. Johnson Terence,Dr.Santosh Kumar Singh,” Enhanced K-Strange Points Clustering Algorithm” International Conference on
Emerging Information Technology and Engineering Solutions(EITES 2015),78-1-4799-1838-6/15,IEEE, DOI:10.1109/
EITES.2015.14,pp 32-37.
[4]. Kuncheva L, “On the equivalence between fuzzy and statistical classifiers” Int. J. of Uncertainity, Fuzziness and Knowledge
based Systems. Vol. 4, No. 3, 1996, pp 245-253
[5]. Margaret Dunham , data mining – introductory and advanced topics, Pearson Education 2006.ISBN:
8177587854, 9788177587852
[6]. N. Friedman, D. Geiger, and Goldszmidt M. Bayesian network classifiers. Machine Learning, 29:131–163, 1997
[7]. Rucha Shinde , Sandhya Arjun ,Priyanka Patil and Prof Jayashree Waghmare “An Intelligent Heart Disease Prediction System
Using K-Means Clustering and Naive Bayes Algorithm” ,International Journal of computer science and Information Technology
Vol 6 (1) 2015 637-639

More Related Content

What's hot

A Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic RegressionA Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic Regression
ijtsrd
 
Chronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine LearningChronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine Learning
IJCSIS Research Publications
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
ijcsit
 
Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...
IJAAS Team
 
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseA Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
ijtsrd
 
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
cscpconf
 
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
SUJIT SHIBAPRASAD MAITY
 

What's hot (7)

A Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic RegressionA Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic Regression
 
Chronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine LearningChronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine Learning
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 
Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...Disease prediction in big data healthcare using extended convolutional neural...
Disease prediction in big data healthcare using extended convolutional neural...
 
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseA Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
 
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
 
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
 

Viewers also liked

S0102297102
S0102297102S0102297102
S0102297102
IOSR Journals
 
G017633943
G017633943G017633943
G017633943
IOSR Journals
 
Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...
Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...
Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...
IOSR Journals
 
The Role of Customer Listening Ability and Perceived Justice in Service Recov...
The Role of Customer Listening Ability and Perceived Justice in Service Recov...The Role of Customer Listening Ability and Perceived Justice in Service Recov...
The Role of Customer Listening Ability and Perceived Justice in Service Recov...
IOSR Journals
 
K0457278
K0457278K0457278
K0457278
IOSR Journals
 
Structural elucidation, Identification, quantization of process related impur...
Structural elucidation, Identification, quantization of process related impur...Structural elucidation, Identification, quantization of process related impur...
Structural elucidation, Identification, quantization of process related impur...
IOSR Journals
 
Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...
Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...
Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...
IOSR Journals
 
Effect of Regular Exercise on Prolactin Secretion: A Pilot Study
Effect of Regular Exercise on Prolactin Secretion: A Pilot StudyEffect of Regular Exercise on Prolactin Secretion: A Pilot Study
Effect of Regular Exercise on Prolactin Secretion: A Pilot Study
IOSR Journals
 
Effect of Poling Field and Non-linearity in Quantum Breathers in Ferroelectrics
Effect of Poling Field and Non-linearity in Quantum Breathers in FerroelectricsEffect of Poling Field and Non-linearity in Quantum Breathers in Ferroelectrics
Effect of Poling Field and Non-linearity in Quantum Breathers in Ferroelectrics
IOSR Journals
 
Investigation of General Equation That Best Describe the Relationship between...
Investigation of General Equation That Best Describe the Relationship between...Investigation of General Equation That Best Describe the Relationship between...
Investigation of General Equation That Best Describe the Relationship between...
IOSR Journals
 
Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...
Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...
Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...
IOSR Journals
 
P01254123131
P01254123131P01254123131
P01254123131
IOSR Journals
 
J012637178
J012637178J012637178
J012637178
IOSR Journals
 
E010513037
E010513037E010513037
E010513037
IOSR Journals
 
I018135565
I018135565I018135565
I018135565
IOSR Journals
 
J010126267
J010126267J010126267
J010126267
IOSR Journals
 
Field Programmable Gate Array for Data Processing in Medical Systems
Field Programmable Gate Array for Data Processing in Medical SystemsField Programmable Gate Array for Data Processing in Medical Systems
Field Programmable Gate Array for Data Processing in Medical Systems
IOSR Journals
 
Congestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing ProtocolCongestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing Protocol
IOSR Journals
 
Wireless Network Security Architecture with Blowfish Encryption Model
Wireless Network Security Architecture with Blowfish Encryption ModelWireless Network Security Architecture with Blowfish Encryption Model
Wireless Network Security Architecture with Blowfish Encryption Model
IOSR Journals
 
D010121832
D010121832D010121832
D010121832
IOSR Journals
 

Viewers also liked (20)

S0102297102
S0102297102S0102297102
S0102297102
 
G017633943
G017633943G017633943
G017633943
 
Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...
Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...
Histological effects of pre-exposure prophylactic consumption of sulfa drugs ...
 
The Role of Customer Listening Ability and Perceived Justice in Service Recov...
The Role of Customer Listening Ability and Perceived Justice in Service Recov...The Role of Customer Listening Ability and Perceived Justice in Service Recov...
The Role of Customer Listening Ability and Perceived Justice in Service Recov...
 
K0457278
K0457278K0457278
K0457278
 
Structural elucidation, Identification, quantization of process related impur...
Structural elucidation, Identification, quantization of process related impur...Structural elucidation, Identification, quantization of process related impur...
Structural elucidation, Identification, quantization of process related impur...
 
Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...
Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...
Assessment of Amcon’s Role in Resuscitating the Banking Sector during the Glo...
 
Effect of Regular Exercise on Prolactin Secretion: A Pilot Study
Effect of Regular Exercise on Prolactin Secretion: A Pilot StudyEffect of Regular Exercise on Prolactin Secretion: A Pilot Study
Effect of Regular Exercise on Prolactin Secretion: A Pilot Study
 
Effect of Poling Field and Non-linearity in Quantum Breathers in Ferroelectrics
Effect of Poling Field and Non-linearity in Quantum Breathers in FerroelectricsEffect of Poling Field and Non-linearity in Quantum Breathers in Ferroelectrics
Effect of Poling Field and Non-linearity in Quantum Breathers in Ferroelectrics
 
Investigation of General Equation That Best Describe the Relationship between...
Investigation of General Equation That Best Describe the Relationship between...Investigation of General Equation That Best Describe the Relationship between...
Investigation of General Equation That Best Describe the Relationship between...
 
Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...
Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...
Synthesis, growth of semiorganic TGDCC (Tetra Glycine Dihydrated Calcium Chlo...
 
P01254123131
P01254123131P01254123131
P01254123131
 
J012637178
J012637178J012637178
J012637178
 
E010513037
E010513037E010513037
E010513037
 
I018135565
I018135565I018135565
I018135565
 
J010126267
J010126267J010126267
J010126267
 
Field Programmable Gate Array for Data Processing in Medical Systems
Field Programmable Gate Array for Data Processing in Medical SystemsField Programmable Gate Array for Data Processing in Medical Systems
Field Programmable Gate Array for Data Processing in Medical Systems
 
Congestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing ProtocolCongestion Control in Manets Using Hybrid Routing Protocol
Congestion Control in Manets Using Hybrid Routing Protocol
 
Wireless Network Security Architecture with Blowfish Encryption Model
Wireless Network Security Architecture with Blowfish Encryption ModelWireless Network Security Architecture with Blowfish Encryption Model
Wireless Network Security Architecture with Blowfish Encryption Model
 
D010121832
D010121832D010121832
D010121832
 

Similar to O1803048791

Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Seval Çapraz
 
Data mining for the masses, Chapter 6, Using R as an alternative to Rapidminer
Data mining for the masses, Chapter 6, Using R as an alternative to RapidminerData mining for the masses, Chapter 6, Using R as an alternative to Rapidminer
Data mining for the masses, Chapter 6, Using R as an alternative to Rapidminer
Ulrik Hørlyk Hjort
 
Hybrid PSO-SA algorithm for training a Neural Network for Classification
Hybrid PSO-SA algorithm for training a Neural Network for ClassificationHybrid PSO-SA algorithm for training a Neural Network for Classification
Hybrid PSO-SA algorithm for training a Neural Network for Classification
IJCSEA Journal
 
AI-Lec20 Clustering I - Kmean.pptx
AI-Lec20 Clustering I - Kmean.pptxAI-Lec20 Clustering I - Kmean.pptx
AI-Lec20 Clustering I - Kmean.pptx
Syed Ejaz
 
Lecture_3_k-mean-clustering.ppt
Lecture_3_k-mean-clustering.pptLecture_3_k-mean-clustering.ppt
Lecture_3_k-mean-clustering.ppt
SyedNahin1
 
Detection of heart diseases by data mining
Detection of heart diseases by data miningDetection of heart diseases by data mining
Detection of heart diseases by data mining
Abheepsa Pattnaik
 
K mean-clustering
K mean-clusteringK mean-clustering
K mean-clustering
Afzaal Subhani
 
Statistics Notes
Statistics NotesStatistics Notes
Statistics Notes
Leah Morgan
 
Comparative Study of Data Mining Classification Algorithms in Heart Disease P...
Comparative Study of Data Mining Classification Algorithms in Heart Disease P...Comparative Study of Data Mining Classification Algorithms in Heart Disease P...
Comparative Study of Data Mining Classification Algorithms in Heart Disease P...
paperpublications3
 
CS 402 DATAMINING AND WAREHOUSING -PROBLEMS
CS 402 DATAMINING AND WAREHOUSING -PROBLEMSCS 402 DATAMINING AND WAREHOUSING -PROBLEMS
CS 402 DATAMINING AND WAREHOUSING -PROBLEMS
NIMMYRAJU
 
Classification of medical datasets using back propagation neural network powe...
Classification of medical datasets using back propagation neural network powe...Classification of medical datasets using back propagation neural network powe...
Classification of medical datasets using back propagation neural network powe...
IJECEIAES
 
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
IRJET Journal
 
Pneumothorax Detection Using Deep Convolutional Neural Networks
Pneumothorax Detection Using Deep Convolutional Neural NetworksPneumothorax Detection Using Deep Convolutional Neural Networks
Pneumothorax Detection Using Deep Convolutional Neural Networks
Michael Sebetich
 
An Experimental Study of Diabetes Disease Prediction System Using Classificat...
An Experimental Study of Diabetes Disease Prediction System Using Classificat...An Experimental Study of Diabetes Disease Prediction System Using Classificat...
An Experimental Study of Diabetes Disease Prediction System Using Classificat...
IOSRjournaljce
 
Outlier detection by Ueda's method
Outlier detection by Ueda's methodOutlier detection by Ueda's method
Outlier detection by Ueda's method
POOJA PATIL
 
Clustering & classification
Clustering & classificationClustering & classification
Clustering & classification
Jamshed Khan
 
Spss tutorial-cluster-analysis
Spss tutorial-cluster-analysisSpss tutorial-cluster-analysis
Spss tutorial-cluster-analysis
Animesh Kumar
 
Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019 Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019
Rakibul Hasan Pranto
 
Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...
Alexander Decker
 
An Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal ClustersAn Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal Clusters
IJCSEA Journal
 

Similar to O1803048791 (20)

Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
 
Data mining for the masses, Chapter 6, Using R as an alternative to Rapidminer
Data mining for the masses, Chapter 6, Using R as an alternative to RapidminerData mining for the masses, Chapter 6, Using R as an alternative to Rapidminer
Data mining for the masses, Chapter 6, Using R as an alternative to Rapidminer
 
Hybrid PSO-SA algorithm for training a Neural Network for Classification
Hybrid PSO-SA algorithm for training a Neural Network for ClassificationHybrid PSO-SA algorithm for training a Neural Network for Classification
Hybrid PSO-SA algorithm for training a Neural Network for Classification
 
AI-Lec20 Clustering I - Kmean.pptx
AI-Lec20 Clustering I - Kmean.pptxAI-Lec20 Clustering I - Kmean.pptx
AI-Lec20 Clustering I - Kmean.pptx
 
Lecture_3_k-mean-clustering.ppt
Lecture_3_k-mean-clustering.pptLecture_3_k-mean-clustering.ppt
Lecture_3_k-mean-clustering.ppt
 
Detection of heart diseases by data mining
Detection of heart diseases by data miningDetection of heart diseases by data mining
Detection of heart diseases by data mining
 
K mean-clustering
K mean-clusteringK mean-clustering
K mean-clustering
 
Statistics Notes
Statistics NotesStatistics Notes
Statistics Notes
 
Comparative Study of Data Mining Classification Algorithms in Heart Disease P...
Comparative Study of Data Mining Classification Algorithms in Heart Disease P...Comparative Study of Data Mining Classification Algorithms in Heart Disease P...
Comparative Study of Data Mining Classification Algorithms in Heart Disease P...
 
CS 402 DATAMINING AND WAREHOUSING -PROBLEMS
CS 402 DATAMINING AND WAREHOUSING -PROBLEMSCS 402 DATAMINING AND WAREHOUSING -PROBLEMS
CS 402 DATAMINING AND WAREHOUSING -PROBLEMS
 
Classification of medical datasets using back propagation neural network powe...
Classification of medical datasets using back propagation neural network powe...Classification of medical datasets using back propagation neural network powe...
Classification of medical datasets using back propagation neural network powe...
 
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
Disease Classification using ECG Signal Based on PCA Feature along with GA & ...
 
Pneumothorax Detection Using Deep Convolutional Neural Networks
Pneumothorax Detection Using Deep Convolutional Neural NetworksPneumothorax Detection Using Deep Convolutional Neural Networks
Pneumothorax Detection Using Deep Convolutional Neural Networks
 
An Experimental Study of Diabetes Disease Prediction System Using Classificat...
An Experimental Study of Diabetes Disease Prediction System Using Classificat...An Experimental Study of Diabetes Disease Prediction System Using Classificat...
An Experimental Study of Diabetes Disease Prediction System Using Classificat...
 
Outlier detection by Ueda's method
Outlier detection by Ueda's methodOutlier detection by Ueda's method
Outlier detection by Ueda's method
 
Clustering & classification
Clustering & classificationClustering & classification
Clustering & classification
 
Spss tutorial-cluster-analysis
Spss tutorial-cluster-analysisSpss tutorial-cluster-analysis
Spss tutorial-cluster-analysis
 
Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019 Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019
 
Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...Comparative study of artificial neural network based classification for liver...
Comparative study of artificial neural network based classification for liver...
 
An Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal ClustersAn Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal Clusters
 

More from IOSR Journals

A011140104
A011140104A011140104
A011140104
IOSR Journals
 
M0111397100
M0111397100M0111397100
M0111397100
IOSR Journals
 
L011138596
L011138596L011138596
L011138596
IOSR Journals
 
K011138084
K011138084K011138084
K011138084
IOSR Journals
 
J011137479
J011137479J011137479
J011137479
IOSR Journals
 
I011136673
I011136673I011136673
I011136673
IOSR Journals
 
G011134454
G011134454G011134454
G011134454
IOSR Journals
 
H011135565
H011135565H011135565
H011135565
IOSR Journals
 
F011134043
F011134043F011134043
F011134043
IOSR Journals
 
E011133639
E011133639E011133639
E011133639
IOSR Journals
 
D011132635
D011132635D011132635
D011132635
IOSR Journals
 
C011131925
C011131925C011131925
C011131925
IOSR Journals
 
B011130918
B011130918B011130918
B011130918
IOSR Journals
 
A011130108
A011130108A011130108
A011130108
IOSR Journals
 
I011125160
I011125160I011125160
I011125160
IOSR Journals
 
H011124050
H011124050H011124050
H011124050
IOSR Journals
 
G011123539
G011123539G011123539
G011123539
IOSR Journals
 
F011123134
F011123134F011123134
F011123134
IOSR Journals
 
E011122530
E011122530E011122530
E011122530
IOSR Journals
 
D011121524
D011121524D011121524
D011121524
IOSR Journals
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Neo4j
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 

Recently uploaded (20)

Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Artificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic WarfareArtificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic Warfare
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 

O1803048791

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 3, Ver. IV (May-Jun. 2016), PP 87-91 www.iosrjournals.org DOI: 10.9790/0661-1803048791 www.iosrjournals.org 87 | Page Heart Disease Detection using EKSTRAP Clustering with Statistical and Distance based Classifiers Terence Johnson1 , Dr. S. K. Singh2 , Vaishnavi Kamat3 , Aishwarya Joshi4 , Lester D‟Souza4 , Poohar Amonkar4 , Devyani Joshi4 , Anirudha Kulkarni4 1 (Ph.D Scholar, Information Technology Department, AMET University, India) 1 (Asst. Prof., Computer Engineering Department, AITD, Goa University, India) 2 (Head, Information Technology Department, TCSC, Mumbai University, India) 3 (Asst. Prof., Computer Engineering Department, AITD, Goa University, India) 4 (Students, Computer Engineering Department, AITD, Goa University, India) Abstract : The heart is the most important organ in the human body which pumps blood to various parts of the body. If there is inefficient circulation of blood in body organs like brain will suffer. If heart stops pumping blood it results in death. An individual’s life is very much dependent on how efficiently the heart works. Using data mining technique proposed in this paper we are trying to detect if a patient has heart disease or not. The system uses 13 attributes like age, gender, blood pressure, cholesterol etc to detect the same. The system uses a hybrid technique which uses Enhanced K STRAnge Points(EKSTRAP) clustering algorithm , output of which is given to different classifiers like statistical –Naïve Bayes classifier and Distance Based – MSDC (Modified Simple Distance Classifier ). Keywords: Enhanced K Strange, clustering, Heart Disease, Naïve Bayes Classifier I. Introduction When we have large pre-existing dataset, we can examine it and generate new information previously not observed by processing the data. For processing we can use different data mining techniques, in this paper clustering and classification techniques are used [1]. Clustering is an unsupervised process of grouping similar objects from a given dataset. The similarity is determined using many techniques like Euclidian Distance [2]. Clustering is achieved by the Enhanced k strange points clustering algorithm [3]. Classification on the contrary is a supervised technique to check, to which group the point belongs to given the groups. Statistical classification [4] and Distance based classification [5] techniques are used. Naïve Bayes [6], a simple probabilistic classifier is the statistical classifier used and the Modified Simple Distance classifier is the Distance based classifier used in this paper. Heart Disease Detection [7] tells whether the new data object has heart disease or not based on the training data given to the classifier. The training data is obtained by clustering. The dataset used is cleveland dataset which is taken from the UCI Repository. The block diagram below explains the flow of the implemented system. Entire dataset is grouped into classes using clustering algorithm. The output of the clustering algorithm along with the new tuple is given to the classifier which then detects to which class the new tuple belongs. Fig.1.Block Diagram The clustering and classification is done based on the following attributes:- 1. Age-Age in years. 2. Gender- a. 1 is male b. 0 is female. 3. Cp –Chest Pain type a. value 1-Typical angina b. value 2-Atyical angina c. value 3-Non Anginal pain d. value 4 -Asymptomatic
  • 2. Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers DOI: 10.9790/0661-1803048791 www.iosrjournals.org 88 | Page 4. trestbps-resting blood pressure(in mm Hg on admission to the hospital) 5. chol-(Cholestrol) serum cholesterol in mg/dl. 6. fbs- (fasting blood sugar) a. if >120 mg/dl then 1 b. else 0 7. restecg-resting electrocardiographic results. a. Value 0 –normal b. Value 1-having ST-T wave abnormality c. Value2 –showing probable or definite left ventricular hypertrophy 8. Thalach-maximum heart rate achieved 9. Exang- exercise induced angina a. 1-yes b. 0-no 10. Oldpeak-ST depression induced by exercise relative to rest 11. Slope-the slope of the peak exercise ST segment. a. Value1-Upsloping b. Value2-Flat c. Value3-downsloping 12. Ca-number of major vessels(0-3) colored by flouroscopy 13. Thal- a. 3-Normal b. 6-Fixed Detect c. 7-reverable detect 14. Num-diagnosis of heart disease II. Motivation If the clustering accuracy is improved then the classification result will improve. The existing system uses k means for clustering. K Means has many limitations. When the number of attributes increase the number of iterations increases to converge and sometimes it may not converge, so we may have to forcefully stop the computation using stopping criteria„t‟ (maximum number of iterations ). This may lead to wrong clusters. So to improve the efficiency of the classifier we replaced k means with enhanced k strange points clustering algorithm. The benefit of using EKSTRAP (Enhanced K STRAnge Points clustering algorithm) is that there is no need of iterations so it converges faster. And even when the number of attributes is more, the performance of EKSTRAP (Enhanced K STRAnge Points clustering algorithm) is better. III. Proposed Methodology Here we have replaced K-Means with EKSTRAP (Enhanced K STRAnge Points clustering algorithm). Input:-Heart Disease Dataset. Output: - Tells if the new tuple falls in cluster 0 or cluster 1. Step 1:-Dataset is given as input to Enhanced k strange points clustering algorithm, which will form two clusters, class 0 and class 1. Cluster 0- No Heart disease. Cluster 1-Has Heart disease. This will act as training data. Step 2 :- Using this training data, whenever a new tuple is given using Naïve Bayes classifier we try to detect to which class the newly entered tuple belongs. Enhanced K strange points clustering algorithm (EKSTRAP) Input:-Number of clusters required (N) and dataset. Output:-Set of N clusters. Step 1:- Find the point which will be at the minimum distance from origin “min” using Euclidian Distance. Step 2:-Find the point which will be at the maximum distance from” min” call it “max”. Step 3:-Now find the strange point “strange” which will at maximum distance from “min” and “max”.
  • 3. Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers DOI: 10.9790/0661-1803048791 www.iosrjournals.org 89 | Page Step 4:-This Strange point may not be equidistant from “min” and “max” so we make it equidistant by applying one of the following formulae. If point is near to “min”:- Str=Str(prev)+Xm[max-str(prev)]/[N-1] If point is near to “max”:- Str=min+Xm[max-min]/[N-1] Xm ranges from 1,2,3,…..to N-2. Step 5:- Repeat step 4 until all N strange points are located. Step 6:- Assign remaining points from dataset to these N different clusters. Step 7:-Output the N clusters. IV. Experimental Results Clustering: The entire dataset contains 297 tuples. For this dataset after clustering using K-Means, Class 0 contains 186 tuples and Class 1 contains 111. This result was compared to the actual dataset, in Class 0, 110 tuples were clustered correctly and in Class 1, 61. The same dataset when given to EKSTRAP Clustering Algorithm, Class 0 contains 193 tuples and Class 1 contains 104 .This result when compared to the actual dataset 144 tuples in Class 0 and 88 in Class 1 were correctly clustered. The actual dataset contains 160 in Class 0 and 137 tuples in Class 1. Table.1. Clustering Output Algorithm Used Class 0 Class 1 Actual 160 137 K-Means 186 111 EKSTRAP 193 104 Tabel.2. Clustering Accuracy Algorithm Used Class 0 Class 1 K-Means 68.75% 44.53% EKSTRAP 90.00% 64.23% Statistical Classifier: Naïve Bayes Classifier: The Naive Bayes is a conditional probability system. Given a unlabeled tuple to be classified, represented by a vector x=(x1,…..,xn) representing some n independent dimension, it assigns to this the tuple probabilities for each of K possible categories The issue with the above design is that if the number of dimensions n is large or if a dimension can take on a large number of values, then building such a model on probability tables is not viable. We therefore reframe the model to make it more compliant. Using Bayes' theorem, the conditional probability can be decomposed as In simple terms, using Bayesian probability jargon, the above equation can be written as For classification 10 tuples were removed, these 10 and the remaining 287 tuples were clustered separately using K-Means and EKSTRAP Clustering Algorithm. The clusters formed by the 287 tuples were given to the Naive Bayes Classification as training data. Then these tuples were given to the classification algorithm. When the 287 tuples were clustered, Class 0 had 180 tuples and Class1 had 107 tuples using k- means, whereas using EKSTRAP, class 0 had 185 and class 1 had 102. Table.3. Classification Output (K-Means) k-Means: Clustering o/p Classification o/p using k-means Comment Class 0 Class 1 Wrong Class 0 Class 0 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct
  • 4. Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers DOI: 10.9790/0661-1803048791 www.iosrjournals.org 90 | Page Class 1 Class 1 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Class 1 Class 0 Wrong Table.4. Classification Output(EKSTRAP) EKSTRAP: Clustering o/p Classification o/p using EKSTRAP Comment Class 1 Class 1 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 0 Class 1 Wrong Class 0 Class 0 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Table.5. Classification Accuracy Training Data Accuracy o/p of K-Means 80% o/p of EKSTRAP 90% Distance based Classifier: Modified Simple Distance Classifier: In the simple distance classifier the means of each of the classes are computed and the distance of the unlabelled tuple from each of these means is calculated. The unlabelled tuple is assigned to the class whose mean is closest to the unlabelled tuple. In the modified simple distance classifier the minimum of one class and the maximum distance from the minimum of another class is computed. If there are three classes a point in the third class is found which is farthest from the minimum and the maximum of the earlier two classes. The distance of the unlabelled tuple from each of these farthest points is calculated. The unlabelled tuple is assigned to the class having the minimum if distance from the unlabelled tuple to the minimum is lesser that its distance to the maximum and third farthest point. If however the distance of the unlabelled tuple is lesser to the maximum than to the minimum and 3rd farthest point, it is assigned to the class having the maximum. Similarly, if the distance of the unlabelled tuple is lesser to the 3rd farthest point than to the minimum and maximum, it is assigned to the class having the 3rd farthest point. For classification 25 tuples were extracted from the dataset and these 25 and the remaining 272 tuples were clustered separately using K-Means and EKSTRAP Clustering Algorithm. The clusters formed by the 272 tuples were given to the Simple Distance Classifier as training data. Then these tuples were given one by one to the classification algorithm. When the 272 tuples were clustered, Class 0 had 110 tuples and Class1 had 177 tuples using k-means, whereas using EKSTRAP, class 0 had 178 and class 1 had 94. Table.6. Classification Output (EKSTRAP) EKSTRAP: Clustering o/p Classification o/p using EKSTRAP Comment Class 0 Class 0 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct
  • 5. Heart Disease Detection Using EKSTRAP Clustering With Statistical And Distance Based Classifiers DOI: 10.9790/0661-1803048791 www.iosrjournals.org 91 | Page Class 1 Class 1 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Class 0 Class 0 Correct Class 1 Class 1 Correct Class 0 Class 0 Correct Table.7. Modified Simple Distance Classification Accuracy Training Data Accuracy o/p of EKSTRAP 100% V. Conclusion In this paper it was proved that improving the clustering algorithm, the output of classification can be improved. The existing system used k-means clustering algorithm which was replaced by EKSTRAP (Enhanced K STRAnge Points) clustering algorithm. The accuracy of the algorithms are 68.75% for class 0, 44.53% for class 1 for k means and 90.00% for class 0 and 64.23% for class 1 using EKSTRAP. EKSTRAP clustering algorithm eliminates the randomness used while selecting the centroid for k means. The accuracy of the Naïve Bayes classifier also improves, as it classifies 8 out of the 10 tuples correctly using k-means output as training data whereas while using EKSTRAP clustering algorithm it classifies 9 out of the 10 tuples correctly. The accuracy with the modified simple distance classifier is also 100% as it correctly classifies 15 out of the 15 class 0 tuples and 10 out of the 10 class 1 tuples accurately. Hence the proposed system accuracy is good. In our future work, this can further enhanced and expanded. For detecting heart disease the number of attributes can be increased. References [1]. David Hand, Heikki Mannila and Padhraic Smyth, principles of data mining, Cambridge, MA: MIT Press, (2001). [2]. Jiawei Han and Micheline Kamber, data mining - concepts and techniques, Elsevier, Second Edition, Original ISBN: 978-1- 55860-901-3, Indian Reprint ISBN: 978-81-3120535-8 [3]. Johnson Terence,Dr.Santosh Kumar Singh,” Enhanced K-Strange Points Clustering Algorithm” International Conference on Emerging Information Technology and Engineering Solutions(EITES 2015),78-1-4799-1838-6/15,IEEE, DOI:10.1109/ EITES.2015.14,pp 32-37. [4]. Kuncheva L, “On the equivalence between fuzzy and statistical classifiers” Int. J. of Uncertainity, Fuzziness and Knowledge based Systems. Vol. 4, No. 3, 1996, pp 245-253 [5]. Margaret Dunham , data mining – introductory and advanced topics, Pearson Education 2006.ISBN: 8177587854, 9788177587852 [6]. N. Friedman, D. Geiger, and Goldszmidt M. Bayesian network classifiers. Machine Learning, 29:131–163, 1997 [7]. Rucha Shinde , Sandhya Arjun ,Priyanka Patil and Prof Jayashree Waghmare “An Intelligent Heart Disease Prediction System Using K-Means Clustering and Naive Bayes Algorithm” ,International Journal of computer science and Information Technology Vol 6 (1) 2015 637-639