SlideShare a Scribd company logo
1 of 8
Download to read offline
http://www.iaeme.com/IJCET/index.asp 25 editor@iaeme.com
International Journal of Computer Engineering and Technology (IJCET)
Volume 10, Issue 04, July-August 2019, pp. 25-32, Article ID: IJCET_10_04_004
Available online at http://www.iaeme.com/ijcet/issues.asp?JType=IJCET&VType=10&IType=4
Journal Impact Factor (2019): 10.5167 (Calculated by GISI) www.jifactor.com
ISSN Print: 0976-6367 and ISSN Online: 0976–6375
© IAEME Publication
PREDICTION OF DIABETES MELLITUS USING
MACHINE LEARNING TECHNIQUES
Temesgen Abera Asfaw
Department of Computer Science & Engineering,
College of Engineering, Jawaharlal Technological University, Telangana, India
ABSTRACT
Diabetes mellitus is a common disease caused by a set of metabolic ailments
where the sugar stages over drawn-out period is very high. It touches diverse organs
of the human body which therefore harm a huge number of the body's system, in
precise the blood strains and nerves. Early prediction in such disease can be exact
and save human life. To achieve the goal, this research work mainly discovers
numerous factors associated to this disease using machine learning techniques.
Machine learning methods provide effectual outcome to extract knowledge by building
predicting models from diagnostic medical datasets together from the diabetic
patients. Quarrying knowledge from such data can be valuable to predict diabetic
patients. In this research, six popular used machine learning techniques, namely
Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision
Tree (DT), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) are
compared in order to get outstanding machine learning techniques to forecast diabetic
mellitus. Our new outcome shows that Support Vector Machine (SVM) achieved
higher accuracy compared to other machine learning techniques.
Key words: machine learning, C4.5 Decision Tree, Support Vector Machine, Logistic
Regression, Naive Bayes, K-Nearest Neighbor, and Random Forest, diabetes.
Cite this Article: Temesgen Abera Asfaw, Prediction of Diabetes Mellitus Using
Machine Learning Techniques, International Journal of Computer Engineering and
Technology 10(4), 2019, pp. 25-32.
http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=10&IType=4
1. INTRODUCTION
Diabetic is a disease that affects the hormone insulin, follow-on in abnormal metabolism of
carbohydrates and advance steps of sugar in the blood. This great blood sugar affects several
organs of the human body which in turn complicates many source of the body, in precise the
blood strains and nerves. The details of diabetic is not nevertheless totally exposed, many
researchers supposed that both hereditary elements and environmental effects are complex
therein. As exposed by the International Diabetes Federation, extent of people having diabetes
mellitus stretched 382 million out of 2013 [1] that take 6.6% of the world's total adult
population. According to the world healthcare medical data it has been probable that diabetic
patients will be increased up to 490 billion within the year 2030 [2]. Furthermore, diabetic is
Temesgen Abera Asfaw
http://www.iaeme.com/IJCET/index.asp 26 editor@iaeme.com
imaginably independent causal factor to micro-vascular entanglements. Diabetic patients are
maybe more incapable against a hoisted risk of micro-vascular damage, in this way long term
difficulty of cardio-vascular disease is the leading reason of death. This micro-vascular harm
and hasty cardio vascular disease ultimately quick to retinopathy, nephropathy and
neuropathy [3].Early prediction of such disease can be controlled over the diseases and save
human life. To accomplish this goal, this research work mainly discovers the early prediction
of diabetes by taking into account various risk factors related to this disease. For the willpower
of the study we gathered diagnostic dataset having 16 attributes diabetic of 2000 patients.
These attributes are age, diet, hyper-tension, problem in vision, genetic etc. In later part, we
debate about these attributes with their conforming values. Based on these attributes, we
figure prediction model by means of various machine learning techniques to predict diabetes
mellitus. Machine learning techniques provide well-organized result to extract knowledge by
making predicting models from diagnostic medical datasets composed from the diabetic
patients. Haul out knowledge from such data can be beneficial to predict diabetic patients.
Innumerable machine learning techniques have the knack to predict diabetes mellitus. Though
it is very difficult to select the best technique to predict based on such attributes. Thus for the
determination of the study, we deal six popular machine learning algorithms, namely Support
Vector Machine (SVM), Naive Bayes (NB), K-Nearest Neighbor (KNN) ,Logistic Regression
(LR) ,Random Forest (RF) and C4.5 decision tree (DT), on adult population data to predict
diabetic mellitus.
2. RELATED WORK
Various researchers have been shown revisions in the area of diabetic by using machine
learning techniques to extract knowledge from existing medical data. For illustration,
ALjumah et. al. [4] established a predictive analysis model using support vector machine
algorithm. In [5], Kavakiotis et al. used 10 fold cross validation as evaluation method in three
different algorithms, including Logistic regression, Naive Bayes, and SVM, where SVM on
condition that better performance and accuracy of 84 % than other algorithms. In [6], Zheng
et al. applied Random Forest, KNN, Naive Bayes, SVM, decision tree and logistic regression
to predict diabetes mellitus at early stage, where cleaning criteria can be improved. Swarupa
et al. [7] applied J48, ANN, KNN, ZeroR and NB on various diabetes dataset. Pradeep et al.
[8] applied Random Forest, KNN, SVM and J48 where J48 shows better performance than
others. The classification algorithms did not assess using cross validation method. To predict
and control diabetes mellitus Huang et al [9] conversed three data mining methods, including
IB1, Naïve Bayes and C4.5 in the year of 2000 to 2004. By smearing feature selection
technique, the performance of IB1 and Naive Bayes provided better result. In [10], Xue-Hui
Meng et al. used three different data mining techniques ANN, Logistic regression, and J48 to
predict the diabetic diseases using real world data sets. Finally it was concluded as J48
performs better accuracy than others. In this work, we examine real diagnostic medical data
based on numerous risk factors using popular machine learning classification techniques to
assess their performance for predicting diabetes mellitus.
3. METHODOLOGY
In order to accomplish our goal study methodology includes of few stages, which are accrual
of diabetes dataset with the applicable attributes of the patients, preprocessing the numeric
value attributes, to smear several machine learning techniques and conforming predictive
analysis employing such data. In the subsequent, we fleetingly confer these stages.
Prediction of Diabetes Mellitus Using Machine Learning Techniques
http://www.iaeme.com/IJCET/index.asp 27 editor@iaeme.com
3.1. Dataset and Attributes
In this work, datasets have been collected among the Pima Indian female population near
Phoenix, Arizona. This particular dataset has been widely used in machine learning
experiments and is currently available through the UCI repository of standard datasets. This
population has been studied continuously by the National Institute of Diabetes, Digestive and
Kidney. UCI repository contains 768 instances of observations and total 9 attributes with no
missing values reported. Data sets contains 8 particular variables which were considered high
risk factors for the occurrence of diabetes, like number of times pregnant, plasma glucose
concentration at 2 hour in an oral glucose tolerance test (OGTT), diastolic blood pressure, 2
hour serum insulin, body mass index, diabetes pedigree. All the patients in this datasets are
female at least 21 years old living near Phoenix, Arizona. All attributes are numeric values
except class is nominal type. Attributes name and types are shown in table 1.
Table 1 Dataset Description
No Name of attributes Type
1 Number of times pregnant Numeric
2 Plasma glucose concentration a 2 hours in an oral
glucose tolerance test Numeric
Numeric
3 Diastolic blood pressure Numeric
4 Triceps skin fold thickness Numeric
5 2 hour serum insulin Numeric
6 Body mass index Numeric
7 Diabetes pedigree function Numeric
Figure 1 Sample dataset
It is good to check the correlations between the attributes. From the output graph below,
the red around the diagonal suggests that attributes are correlated with each other. The yellow
and green patches suggest some moderate correlation and the blue boxes show negative
correlations as shown below fig. 2.
Temesgen Abera Asfaw
http://www.iaeme.com/IJCET/index.asp 28 editor@iaeme.com
Figure 2 Attributes Correlation
Next, we visualize the data using density plots to get a sense of the data distribution. From
the outputs below, you can see the data shows a general Gaussian distribution below fig 3.
Figure 3 Density plots
Figure 4 Histogram
Prediction of Diabetes Mellitus Using Machine Learning Techniques
http://www.iaeme.com/IJCET/index.asp 29 editor@iaeme.com
According to Fig. 5 below, after defining the problem we collect the relevant data from
the Diagnostic Data Storage. We then preprocess the data for the purpose of building the
prediction model. After that we apply various machine learning techniques discussed above
on the training dataset. Finally, test dataset is used to measure the performance of the
techniques in order to choose the best classifier for predicting diabetes mellitus.
Figure 5 Shows an overview of the overall process of our work.
3.2. Apply Machine Learning Techniques
After the data has been ready for demonstrating, we use six popular machine learning
techniques to predict diabetes mellitus. Later we offer an overview of these techniques.
Support Vector Machines: is widely classification technique proposed by J. Platt et. al. [11].
A Support Vector Machine is known by characterizing the data by separating a hyper plane.
SVM detaches entities in specified classes. It can also recognize and classify instances which
are not supported by data. SVM is not caring in the distribution of acquiring data of each
class. The one delay of this algorithm is to execute regression analysis to produce a linear
function and another extension is learning to rank elements to yield classification for
individual elements.
Naive Bayes: is a popular probabilistic classification technique proposed by John et. al. [12].
Naive Bayes also called Bayesian theorem is a simple, effective and commonly used machine
learning classifier. The algorithm calculates probabilistic results by counting the regularity
and combines the value given in data set. By using Bayesian theorem, it adopts that all
attributes are independent and based on variable values of classes. In real world solicitation,
the conditional independence hypothesis rarely holds true and gives well and more
sophisticate classifier results.
K-Nearest Neighbor Algorithm: K-nearest neighbor is simple regression and classification
algorithm that used non parametric method proposed by Aha et. al. [13]. The algorithm trains
all usable attributes and classifies new query based on their likeness measure. To determine
the distance from point of interest to points in training data set it uses tree like data structure.
The attribute is classified by its neighbors.
Temesgen Abera Asfaw
http://www.iaeme.com/IJCET/index.asp 30 editor@iaeme.com
Decision Tree: is a tree that provides powerful classification techniques to predict diabetes
mellitus. Every discrete area and feature of the domain is called a class. An input feature of
the class attribute is labeled with the internal node in a tree. The leaf node of the tree is
labeled by attribute and each attribute associated with a target value. There are some popular
decision tree algorithms are accessible to classify diabetic data in machine learning
techniques, including ID3, C4.5 , J48, C5, CART and CHAID. C4.5 provides extended
features of ID3 decision tree algorithm proposed by Ross Quinlan et. al. [14]. C4.5 decision
tree uses same training data as ID3, in which learned function is introduced. The learning
method can be used to diagnose medical data to predict the value of the decision attribute.
Logistic regression: is a probabilistic statistical model for investigative a dataset in which
there are one or more independent variables that govern a result. In logistic regression, the
dependent variable is binary that means 1 as (TRUE, patient, etc.) or 0 as (FALSE, healthy,
etc.). Logistic regression produces the coefficients of a procedure to predict a log it
adjustment of the probability of occurrence of the characteristic of nosiness.
Random forest: is collaborative classification scheme based on Decision Tree. At the
training stage, it produces a massive number of trees and creates a forest of Decision Trees.
At the testing stage, each tree of the forest predicts a class label for each data. When each tree
predicts a class label, then the final decision for each test data depends on widely held voting.
Which class label gets the majority of votes this label accepts to be the correct label allotted to
the test data. This process is continual for each of data in the dataset.
4. EXPERIMENTAL RESULTS AND DISCUSSION
To conduct the experiment six popular used machine learning techniques, namely Random
Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision Tree (DT), K-
Nearest Neighbor (KNN), and Support Vector Machine (SVM) are were used. Machine
learning techniques were implemented in pycharm 3.6. An experimental result shows that the
performance of SVM is significantly superior to other machine learning techniques for the
classification of diabetic data. The experimental results could assist health care to take early
prevention and make better clinical decisions to control diabetes and thus save human life. To
take into account additional attributes and evaluation for further analysis is our future work.
Figure 6 Comparison result of Various Machine Learning Techniques
Prediction of Diabetes Mellitus Using Machine Learning Techniques
http://www.iaeme.com/IJCET/index.asp 31 editor@iaeme.com
In order to assess the performance of various machine learning techniques, we have
showed the prediction results in Fig. 6 on the basis of accuracy. The figure shows the results
of studies such as LR, SVM, NB, KNN, RF and C4.5 and SVM accomplishes better results
than other classifiers to predict diabetes mellitus. According to Fig. 6, SVM achieves 96.4%
on this dataset, which is greater than other learning techniques. This experimental result
provides indication that SVM performs acceptable on medical datasets for the determination
of predicting diabetes mellitus based on numerous risk factors, deliberated in the earlier
section.
Table 2 Accuracy Results of Various Machine Learning Techniques
Classification Algorithms Accuracy
Support Vector Machine (SVM) 96.4%
Decision Tree (DT) 95.8%
Random Forest (RF) 95.3%
Naïve Bayes (NB) 76.0%
Logistic Regression (LR) 77.6%
K-Nearest Neighbors ( KNN) 78.8%
Overall, we have selected the best machine learning technique to predict diabetes mellitus
to achieve high performance, based on the evaluation criteria discuss above. All the
techniques mentioned over are estimated on an unseen testing diabetic dataset. The technique
which accomplishes the highest performance in terms of accuracy is considered to be the best
choice. Based on Fig. 6, it can be observed that SVM achieved the better accuracy of 96.4 %
to predict diabetes mellitus utilizing a given medical dataset.
5. CONCLUSION
In this work, we have investigated the early prediction of diabetes by taking into account
several risk factors related to this disease using machine learning techniques To predict
diabetes mellitus efficiently, we have done our investigation using six popular machine
learning algorithms, namely Support Vector Machine (SVM), Naive Bayes (NB), K-Nearest
Neighbor (KNN), Logistic Regression (LR.), Random Forest (RF.) and C4.5 decision tree, on
adult population data to predict diabetes mellitus. The technique which accomplishes the
highest performance in terms of accuracy is considered to be the best choice. Based on Fig.6,
it can be observed that SVM achieved the better accuracy of 96.4 % to predict diabetes
mellitus utilizing a given medical dataset.
REFERENCES
[1] V., A. K. and R., C. 2013. Classification of Diabetes Disease Using Support Vector
Machine. International Journal of Engineering Research and Applications. 3, (April.
2013), 1797-1801.
[2] Carlo, B G., Valeria, M. and Jesús, D. C. 2011. The impact of diabetes mellitus on
healthcare costs in Italy. Expert review of pharmacoeconomics & outcomes research. 11,
(Dec. 2011),709-19.
[3] Nahla B., Andrew et al. 2010. Intelligible support vector machines for diagnosis of
diabetes mellitus. Information Technology in Biomedicine, IEEE Transactions. 14, (July.
2010), 1114-20.
Temesgen Abera Asfaw
http://www.iaeme.com/IJCET/index.asp 32 editor@iaeme.com
[4] Abdullah A. Aljumah et al., Application of data mining: Diabetes health care in young and
old patients, Journal of King Saud University - Computer and Information Sciences,
Volume 25, Issue 2,
[5] July 2013, Pages 127-136
[6] Kavakiotis, Ioannis, Olga Tsave, AthanasiosSalifoglou, NicosMaglaveras,
IoannisVlahavas, and IoannaChouvarda. "Machine learning and data mining methods in
diabetes research." Computational and structural biotechnology journal (2017).
[7] Zheng, Tao et al. "A machine learning-based framework to identify type 2 diabetes
through electronic health records." International journal of medical informatics 97 (2017):
120- 127.
[8] Rani, A. Swarupa, and S. Jyothi. "Performance analysis of classification algorithms under
different datasets." In Computing for Sustainable Global Development (INDIACom),
2016 3rd International Conference on, pp. 1584-1589. IEEE, 2016.
[9] Kandhasamy, J. Pradeep, and S. Balamurali. "Performance analysis of classifier models to
predict diabetes mellitus." Procedia Computer Science 47 (2015): 45-51.
[10] Y. Huang, P. McCullagh, N. Black, R. Harper, Feature selection and classification model
construction on type 2 diabetic patients’data, Artificial Intelligence in Medicine 41 (3)
(2015) 251–262.
[11] Meng, X. H., Huang, Y. X., Rao, D. P., Zhang, Q., & Liu, Q. (2013). Comparison of three
data mining models for predicting diabetes or prediabetes by risk factors. The Kaohsiung
journal of medical sciences, 29(2), 93-99.
[12] Platt, John C. "12 fast training of support vector machines using sequential minimal
optimization." Advances in kernel methods (1999): 185-208.
[13] John, George H., and Pat Langley. "Estimating continuous distributions in Bayesian
classifiers." Proceedings of the Eleventh conference on Uncertainty in artificial
intelligence. Morgan Kaufmann Publishers Inc., 1995.
[14] Aha, David W., Dennis Kibler, and Marc K. Albert. "Instance-based learning algorithms."
Machine learning 6.1 (1991): 37-66.
[15] Ross Quinlan (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann
Publishers, San Mateo, CA.

More Related Content

What's hot

HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHMHEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHMamiteshg
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSivagowry Shathesh
 
Heart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine LearningHeart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine Learningmohdshoaibuddin1
 
Diabetes Prediction Using Machine Learning
Diabetes Prediction Using Machine LearningDiabetes Prediction Using Machine Learning
Diabetes Prediction Using Machine Learningjagan477830
 
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.SUJIT SHIBAPRASAD MAITY
 
Project on disease prediction
Project on disease predictionProject on disease prediction
Project on disease predictionKOYELMAJUMDAR1
 
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptxCardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptxTaminul Islam
 
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseA Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseijtsrd
 
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning AlgorithmsIRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning AlgorithmsIRJET Journal
 
Prediction of Diabetes using Probability Approach
Prediction of Diabetes using Probability ApproachPrediction of Diabetes using Probability Approach
Prediction of Diabetes using Probability ApproachIRJET Journal
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction systemSWAMI06
 
Machine Learning for Disease Prediction
Machine Learning for Disease PredictionMachine Learning for Disease Prediction
Machine Learning for Disease PredictionMustafa Oğuz
 
Prediction of cardiovascular disease with machine learning
Prediction of cardiovascular disease with machine learningPrediction of cardiovascular disease with machine learning
Prediction of cardiovascular disease with machine learningPravinkumar Landge
 
DISEASE PREDICTION SYSTEM USING DATA MINING
DISEASE PREDICTION SYSTEM USING  DATA MININGDISEASE PREDICTION SYSTEM USING  DATA MINING
DISEASE PREDICTION SYSTEM USING DATA MININGshivaniyadav112
 
Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction Systemijtsrd
 
Diabetes prediciton model ppt.ppt
Diabetes prediciton model ppt.pptDiabetes prediciton model ppt.ppt
Diabetes prediciton model ppt.pptsatvikpatil5
 
Diabetes prediction with r(using knn)
Diabetes prediction with r(using knn)Diabetes prediction with r(using knn)
Diabetes prediction with r(using knn)tanujoshi98
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISijcsit
 

What's hot (20)

Final ppt
Final pptFinal ppt
Final ppt
 
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHMHEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 
Heart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine LearningHeart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine Learning
 
Diabetes Prediction Using Machine Learning
Diabetes Prediction Using Machine LearningDiabetes Prediction Using Machine Learning
Diabetes Prediction Using Machine Learning
 
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
 
Project on disease prediction
Project on disease predictionProject on disease prediction
Project on disease prediction
 
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptxCardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptx
 
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBaseA Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
A Heart Disease Prediction Model using Logistic Regression By Cleveland DataBase
 
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning AlgorithmsIRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
 
Prediction of Diabetes using Probability Approach
Prediction of Diabetes using Probability ApproachPrediction of Diabetes using Probability Approach
Prediction of Diabetes using Probability Approach
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction system
 
Machine Learning for Disease Prediction
Machine Learning for Disease PredictionMachine Learning for Disease Prediction
Machine Learning for Disease Prediction
 
Prediction of cardiovascular disease with machine learning
Prediction of cardiovascular disease with machine learningPrediction of cardiovascular disease with machine learning
Prediction of cardiovascular disease with machine learning
 
Stroke Prediction
Stroke PredictionStroke Prediction
Stroke Prediction
 
DISEASE PREDICTION SYSTEM USING DATA MINING
DISEASE PREDICTION SYSTEM USING  DATA MININGDISEASE PREDICTION SYSTEM USING  DATA MINING
DISEASE PREDICTION SYSTEM USING DATA MINING
 
Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction System
 
Diabetes prediciton model ppt.ppt
Diabetes prediciton model ppt.pptDiabetes prediciton model ppt.ppt
Diabetes prediciton model ppt.ppt
 
Diabetes prediction with r(using knn)
Diabetes prediction with r(using knn)Diabetes prediction with r(using knn)
Diabetes prediction with r(using knn)
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 

Similar to PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES

Prognosis of Diabetes by Performing Data Mining of HbA1c
 Prognosis of Diabetes by Performing Data Mining of HbA1c Prognosis of Diabetes by Performing Data Mining of HbA1c
Prognosis of Diabetes by Performing Data Mining of HbA1cIJCSIS Research Publications
 
Diabetes Prediction Using ML
Diabetes Prediction Using MLDiabetes Prediction Using ML
Diabetes Prediction Using MLIRJET Journal
 
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...IRJET Journal
 
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...IRJET Journal
 
Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...
Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...
Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...IRJET Journal
 
Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...
Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...
Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...ijtsrd
 
Machine learning approach for predicting heart and diabetes diseases using da...
Machine learning approach for predicting heart and diabetes diseases using da...Machine learning approach for predicting heart and diabetes diseases using da...
Machine learning approach for predicting heart and diabetes diseases using da...IAESIJAI
 
PREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMS
PREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMSPREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMS
PREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMSAIRCC Publishing Corporation
 
Prediction of Anemia using Machine Learning Algorithms
Prediction of Anemia using Machine Learning AlgorithmsPrediction of Anemia using Machine Learning Algorithms
Prediction of Anemia using Machine Learning AlgorithmsAIRCC Publishing Corporation
 
ML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early StageML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early StageIRJET Journal
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...IAEMEPublication
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...IAEME Publication
 
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...BRNSSPublicationHubI
 
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...IJARIIT
 
AN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEM
AN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEMAN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEM
AN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEMijaia
 
Improving the performance of k nearest neighbor algorithm for the classificat...
Improving the performance of k nearest neighbor algorithm for the classificat...Improving the performance of k nearest neighbor algorithm for the classificat...
Improving the performance of k nearest neighbor algorithm for the classificat...IAEME Publication
 
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...ijaia
 
DIABETES PREDICTOR USING ENSEMBLE TECHNIQUE
DIABETES PREDICTOR USING ENSEMBLE TECHNIQUEDIABETES PREDICTOR USING ENSEMBLE TECHNIQUE
DIABETES PREDICTOR USING ENSEMBLE TECHNIQUEIRJET Journal
 
IRJET - Prediction and Detection of Diabetes using Machine Learning
IRJET - Prediction and Detection of Diabetes using Machine LearningIRJET - Prediction and Detection of Diabetes using Machine Learning
IRJET - Prediction and Detection of Diabetes using Machine LearningIRJET Journal
 
Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...ijtsrd
 

Similar to PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES (20)

Prognosis of Diabetes by Performing Data Mining of HbA1c
 Prognosis of Diabetes by Performing Data Mining of HbA1c Prognosis of Diabetes by Performing Data Mining of HbA1c
Prognosis of Diabetes by Performing Data Mining of HbA1c
 
Diabetes Prediction Using ML
Diabetes Prediction Using MLDiabetes Prediction Using ML
Diabetes Prediction Using ML
 
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
 
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
 
Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...
Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...
Implementation of a Web Application to Foresee and Pretreat Diabetes Mellitus...
 
Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...
Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...
Development of a Hybrid Dynamic Expert System for the Diagnosis of Peripheral...
 
Machine learning approach for predicting heart and diabetes diseases using da...
Machine learning approach for predicting heart and diabetes diseases using da...Machine learning approach for predicting heart and diabetes diseases using da...
Machine learning approach for predicting heart and diabetes diseases using da...
 
PREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMS
PREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMSPREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMS
PREDICTION OF ANEMIA USING MACHINE LEARNING ALGORITHMS
 
Prediction of Anemia using Machine Learning Algorithms
Prediction of Anemia using Machine Learning AlgorithmsPrediction of Anemia using Machine Learning Algorithms
Prediction of Anemia using Machine Learning Algorithms
 
ML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early StageML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early Stage
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
 
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
 
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
 
AN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEM
AN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEMAN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEM
AN IMPROVED MODEL FOR CLINICAL DECISION SUPPORT SYSTEM
 
Improving the performance of k nearest neighbor algorithm for the classificat...
Improving the performance of k nearest neighbor algorithm for the classificat...Improving the performance of k nearest neighbor algorithm for the classificat...
Improving the performance of k nearest neighbor algorithm for the classificat...
 
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
 
DIABETES PREDICTOR USING ENSEMBLE TECHNIQUE
DIABETES PREDICTOR USING ENSEMBLE TECHNIQUEDIABETES PREDICTOR USING ENSEMBLE TECHNIQUE
DIABETES PREDICTOR USING ENSEMBLE TECHNIQUE
 
IRJET - Prediction and Detection of Diabetes using Machine Learning
IRJET - Prediction and Detection of Diabetes using Machine LearningIRJET - Prediction and Detection of Diabetes using Machine Learning
IRJET - Prediction and Detection of Diabetes using Machine Learning
 
Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...
 

More from IAEME Publication

IAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME Publication
 
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...IAEME Publication
 
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSIAEME Publication
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSIAEME Publication
 
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSDETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSIAEME Publication
 
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSIAEME Publication
 
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOVOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOIAEME Publication
 
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IAEME Publication
 
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYVISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYIAEME Publication
 
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...IAEME Publication
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEIAEME Publication
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...IAEME Publication
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...IAEME Publication
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...IAEME Publication
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...IAEME Publication
 
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...IAEME Publication
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...IAEME Publication
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...IAEME Publication
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...IAEME Publication
 
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTA MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTIAEME Publication
 

More from IAEME Publication (20)

IAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdf
 
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
 
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
 
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSDETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
 
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
 
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOVOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
 
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
 
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYVISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
 
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
 
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
 
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTA MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
 

Recently uploaded

Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwaitjaanualu31
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdfKamal Acharya
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationBhangaleSonal
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsvanyagupta248
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxchumtiyababu
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxSCMS School of Architecture
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
Wadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxWadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxNadaHaitham1
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdfKamal Acharya
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersMairaAshraf6
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxmaisarahman1
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network DevicesChandrakantDivate1
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 

Recently uploaded (20)

Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Wadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxWadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptx
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to Computers
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 

PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES

  • 1. http://www.iaeme.com/IJCET/index.asp 25 editor@iaeme.com International Journal of Computer Engineering and Technology (IJCET) Volume 10, Issue 04, July-August 2019, pp. 25-32, Article ID: IJCET_10_04_004 Available online at http://www.iaeme.com/ijcet/issues.asp?JType=IJCET&VType=10&IType=4 Journal Impact Factor (2019): 10.5167 (Calculated by GISI) www.jifactor.com ISSN Print: 0976-6367 and ISSN Online: 0976–6375 © IAEME Publication PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES Temesgen Abera Asfaw Department of Computer Science & Engineering, College of Engineering, Jawaharlal Technological University, Telangana, India ABSTRACT Diabetes mellitus is a common disease caused by a set of metabolic ailments where the sugar stages over drawn-out period is very high. It touches diverse organs of the human body which therefore harm a huge number of the body's system, in precise the blood strains and nerves. Early prediction in such disease can be exact and save human life. To achieve the goal, this research work mainly discovers numerous factors associated to this disease using machine learning techniques. Machine learning methods provide effectual outcome to extract knowledge by building predicting models from diagnostic medical datasets together from the diabetic patients. Quarrying knowledge from such data can be valuable to predict diabetic patients. In this research, six popular used machine learning techniques, namely Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision Tree (DT), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) are compared in order to get outstanding machine learning techniques to forecast diabetic mellitus. Our new outcome shows that Support Vector Machine (SVM) achieved higher accuracy compared to other machine learning techniques. Key words: machine learning, C4.5 Decision Tree, Support Vector Machine, Logistic Regression, Naive Bayes, K-Nearest Neighbor, and Random Forest, diabetes. Cite this Article: Temesgen Abera Asfaw, Prediction of Diabetes Mellitus Using Machine Learning Techniques, International Journal of Computer Engineering and Technology 10(4), 2019, pp. 25-32. http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=10&IType=4 1. INTRODUCTION Diabetic is a disease that affects the hormone insulin, follow-on in abnormal metabolism of carbohydrates and advance steps of sugar in the blood. This great blood sugar affects several organs of the human body which in turn complicates many source of the body, in precise the blood strains and nerves. The details of diabetic is not nevertheless totally exposed, many researchers supposed that both hereditary elements and environmental effects are complex therein. As exposed by the International Diabetes Federation, extent of people having diabetes mellitus stretched 382 million out of 2013 [1] that take 6.6% of the world's total adult population. According to the world healthcare medical data it has been probable that diabetic patients will be increased up to 490 billion within the year 2030 [2]. Furthermore, diabetic is
  • 2. Temesgen Abera Asfaw http://www.iaeme.com/IJCET/index.asp 26 editor@iaeme.com imaginably independent causal factor to micro-vascular entanglements. Diabetic patients are maybe more incapable against a hoisted risk of micro-vascular damage, in this way long term difficulty of cardio-vascular disease is the leading reason of death. This micro-vascular harm and hasty cardio vascular disease ultimately quick to retinopathy, nephropathy and neuropathy [3].Early prediction of such disease can be controlled over the diseases and save human life. To accomplish this goal, this research work mainly discovers the early prediction of diabetes by taking into account various risk factors related to this disease. For the willpower of the study we gathered diagnostic dataset having 16 attributes diabetic of 2000 patients. These attributes are age, diet, hyper-tension, problem in vision, genetic etc. In later part, we debate about these attributes with their conforming values. Based on these attributes, we figure prediction model by means of various machine learning techniques to predict diabetes mellitus. Machine learning techniques provide well-organized result to extract knowledge by making predicting models from diagnostic medical datasets composed from the diabetic patients. Haul out knowledge from such data can be beneficial to predict diabetic patients. Innumerable machine learning techniques have the knack to predict diabetes mellitus. Though it is very difficult to select the best technique to predict based on such attributes. Thus for the determination of the study, we deal six popular machine learning algorithms, namely Support Vector Machine (SVM), Naive Bayes (NB), K-Nearest Neighbor (KNN) ,Logistic Regression (LR) ,Random Forest (RF) and C4.5 decision tree (DT), on adult population data to predict diabetic mellitus. 2. RELATED WORK Various researchers have been shown revisions in the area of diabetic by using machine learning techniques to extract knowledge from existing medical data. For illustration, ALjumah et. al. [4] established a predictive analysis model using support vector machine algorithm. In [5], Kavakiotis et al. used 10 fold cross validation as evaluation method in three different algorithms, including Logistic regression, Naive Bayes, and SVM, where SVM on condition that better performance and accuracy of 84 % than other algorithms. In [6], Zheng et al. applied Random Forest, KNN, Naive Bayes, SVM, decision tree and logistic regression to predict diabetes mellitus at early stage, where cleaning criteria can be improved. Swarupa et al. [7] applied J48, ANN, KNN, ZeroR and NB on various diabetes dataset. Pradeep et al. [8] applied Random Forest, KNN, SVM and J48 where J48 shows better performance than others. The classification algorithms did not assess using cross validation method. To predict and control diabetes mellitus Huang et al [9] conversed three data mining methods, including IB1, Naïve Bayes and C4.5 in the year of 2000 to 2004. By smearing feature selection technique, the performance of IB1 and Naive Bayes provided better result. In [10], Xue-Hui Meng et al. used three different data mining techniques ANN, Logistic regression, and J48 to predict the diabetic diseases using real world data sets. Finally it was concluded as J48 performs better accuracy than others. In this work, we examine real diagnostic medical data based on numerous risk factors using popular machine learning classification techniques to assess their performance for predicting diabetes mellitus. 3. METHODOLOGY In order to accomplish our goal study methodology includes of few stages, which are accrual of diabetes dataset with the applicable attributes of the patients, preprocessing the numeric value attributes, to smear several machine learning techniques and conforming predictive analysis employing such data. In the subsequent, we fleetingly confer these stages.
  • 3. Prediction of Diabetes Mellitus Using Machine Learning Techniques http://www.iaeme.com/IJCET/index.asp 27 editor@iaeme.com 3.1. Dataset and Attributes In this work, datasets have been collected among the Pima Indian female population near Phoenix, Arizona. This particular dataset has been widely used in machine learning experiments and is currently available through the UCI repository of standard datasets. This population has been studied continuously by the National Institute of Diabetes, Digestive and Kidney. UCI repository contains 768 instances of observations and total 9 attributes with no missing values reported. Data sets contains 8 particular variables which were considered high risk factors for the occurrence of diabetes, like number of times pregnant, plasma glucose concentration at 2 hour in an oral glucose tolerance test (OGTT), diastolic blood pressure, 2 hour serum insulin, body mass index, diabetes pedigree. All the patients in this datasets are female at least 21 years old living near Phoenix, Arizona. All attributes are numeric values except class is nominal type. Attributes name and types are shown in table 1. Table 1 Dataset Description No Name of attributes Type 1 Number of times pregnant Numeric 2 Plasma glucose concentration a 2 hours in an oral glucose tolerance test Numeric Numeric 3 Diastolic blood pressure Numeric 4 Triceps skin fold thickness Numeric 5 2 hour serum insulin Numeric 6 Body mass index Numeric 7 Diabetes pedigree function Numeric Figure 1 Sample dataset It is good to check the correlations between the attributes. From the output graph below, the red around the diagonal suggests that attributes are correlated with each other. The yellow and green patches suggest some moderate correlation and the blue boxes show negative correlations as shown below fig. 2.
  • 4. Temesgen Abera Asfaw http://www.iaeme.com/IJCET/index.asp 28 editor@iaeme.com Figure 2 Attributes Correlation Next, we visualize the data using density plots to get a sense of the data distribution. From the outputs below, you can see the data shows a general Gaussian distribution below fig 3. Figure 3 Density plots Figure 4 Histogram
  • 5. Prediction of Diabetes Mellitus Using Machine Learning Techniques http://www.iaeme.com/IJCET/index.asp 29 editor@iaeme.com According to Fig. 5 below, after defining the problem we collect the relevant data from the Diagnostic Data Storage. We then preprocess the data for the purpose of building the prediction model. After that we apply various machine learning techniques discussed above on the training dataset. Finally, test dataset is used to measure the performance of the techniques in order to choose the best classifier for predicting diabetes mellitus. Figure 5 Shows an overview of the overall process of our work. 3.2. Apply Machine Learning Techniques After the data has been ready for demonstrating, we use six popular machine learning techniques to predict diabetes mellitus. Later we offer an overview of these techniques. Support Vector Machines: is widely classification technique proposed by J. Platt et. al. [11]. A Support Vector Machine is known by characterizing the data by separating a hyper plane. SVM detaches entities in specified classes. It can also recognize and classify instances which are not supported by data. SVM is not caring in the distribution of acquiring data of each class. The one delay of this algorithm is to execute regression analysis to produce a linear function and another extension is learning to rank elements to yield classification for individual elements. Naive Bayes: is a popular probabilistic classification technique proposed by John et. al. [12]. Naive Bayes also called Bayesian theorem is a simple, effective and commonly used machine learning classifier. The algorithm calculates probabilistic results by counting the regularity and combines the value given in data set. By using Bayesian theorem, it adopts that all attributes are independent and based on variable values of classes. In real world solicitation, the conditional independence hypothesis rarely holds true and gives well and more sophisticate classifier results. K-Nearest Neighbor Algorithm: K-nearest neighbor is simple regression and classification algorithm that used non parametric method proposed by Aha et. al. [13]. The algorithm trains all usable attributes and classifies new query based on their likeness measure. To determine the distance from point of interest to points in training data set it uses tree like data structure. The attribute is classified by its neighbors.
  • 6. Temesgen Abera Asfaw http://www.iaeme.com/IJCET/index.asp 30 editor@iaeme.com Decision Tree: is a tree that provides powerful classification techniques to predict diabetes mellitus. Every discrete area and feature of the domain is called a class. An input feature of the class attribute is labeled with the internal node in a tree. The leaf node of the tree is labeled by attribute and each attribute associated with a target value. There are some popular decision tree algorithms are accessible to classify diabetic data in machine learning techniques, including ID3, C4.5 , J48, C5, CART and CHAID. C4.5 provides extended features of ID3 decision tree algorithm proposed by Ross Quinlan et. al. [14]. C4.5 decision tree uses same training data as ID3, in which learned function is introduced. The learning method can be used to diagnose medical data to predict the value of the decision attribute. Logistic regression: is a probabilistic statistical model for investigative a dataset in which there are one or more independent variables that govern a result. In logistic regression, the dependent variable is binary that means 1 as (TRUE, patient, etc.) or 0 as (FALSE, healthy, etc.). Logistic regression produces the coefficients of a procedure to predict a log it adjustment of the probability of occurrence of the characteristic of nosiness. Random forest: is collaborative classification scheme based on Decision Tree. At the training stage, it produces a massive number of trees and creates a forest of Decision Trees. At the testing stage, each tree of the forest predicts a class label for each data. When each tree predicts a class label, then the final decision for each test data depends on widely held voting. Which class label gets the majority of votes this label accepts to be the correct label allotted to the test data. This process is continual for each of data in the dataset. 4. EXPERIMENTAL RESULTS AND DISCUSSION To conduct the experiment six popular used machine learning techniques, namely Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision Tree (DT), K- Nearest Neighbor (KNN), and Support Vector Machine (SVM) are were used. Machine learning techniques were implemented in pycharm 3.6. An experimental result shows that the performance of SVM is significantly superior to other machine learning techniques for the classification of diabetic data. The experimental results could assist health care to take early prevention and make better clinical decisions to control diabetes and thus save human life. To take into account additional attributes and evaluation for further analysis is our future work. Figure 6 Comparison result of Various Machine Learning Techniques
  • 7. Prediction of Diabetes Mellitus Using Machine Learning Techniques http://www.iaeme.com/IJCET/index.asp 31 editor@iaeme.com In order to assess the performance of various machine learning techniques, we have showed the prediction results in Fig. 6 on the basis of accuracy. The figure shows the results of studies such as LR, SVM, NB, KNN, RF and C4.5 and SVM accomplishes better results than other classifiers to predict diabetes mellitus. According to Fig. 6, SVM achieves 96.4% on this dataset, which is greater than other learning techniques. This experimental result provides indication that SVM performs acceptable on medical datasets for the determination of predicting diabetes mellitus based on numerous risk factors, deliberated in the earlier section. Table 2 Accuracy Results of Various Machine Learning Techniques Classification Algorithms Accuracy Support Vector Machine (SVM) 96.4% Decision Tree (DT) 95.8% Random Forest (RF) 95.3% Naïve Bayes (NB) 76.0% Logistic Regression (LR) 77.6% K-Nearest Neighbors ( KNN) 78.8% Overall, we have selected the best machine learning technique to predict diabetes mellitus to achieve high performance, based on the evaluation criteria discuss above. All the techniques mentioned over are estimated on an unseen testing diabetic dataset. The technique which accomplishes the highest performance in terms of accuracy is considered to be the best choice. Based on Fig. 6, it can be observed that SVM achieved the better accuracy of 96.4 % to predict diabetes mellitus utilizing a given medical dataset. 5. CONCLUSION In this work, we have investigated the early prediction of diabetes by taking into account several risk factors related to this disease using machine learning techniques To predict diabetes mellitus efficiently, we have done our investigation using six popular machine learning algorithms, namely Support Vector Machine (SVM), Naive Bayes (NB), K-Nearest Neighbor (KNN), Logistic Regression (LR.), Random Forest (RF.) and C4.5 decision tree, on adult population data to predict diabetes mellitus. The technique which accomplishes the highest performance in terms of accuracy is considered to be the best choice. Based on Fig.6, it can be observed that SVM achieved the better accuracy of 96.4 % to predict diabetes mellitus utilizing a given medical dataset. REFERENCES [1] V., A. K. and R., C. 2013. Classification of Diabetes Disease Using Support Vector Machine. International Journal of Engineering Research and Applications. 3, (April. 2013), 1797-1801. [2] Carlo, B G., Valeria, M. and Jesús, D. C. 2011. The impact of diabetes mellitus on healthcare costs in Italy. Expert review of pharmacoeconomics & outcomes research. 11, (Dec. 2011),709-19. [3] Nahla B., Andrew et al. 2010. Intelligible support vector machines for diagnosis of diabetes mellitus. Information Technology in Biomedicine, IEEE Transactions. 14, (July. 2010), 1114-20.
  • 8. Temesgen Abera Asfaw http://www.iaeme.com/IJCET/index.asp 32 editor@iaeme.com [4] Abdullah A. Aljumah et al., Application of data mining: Diabetes health care in young and old patients, Journal of King Saud University - Computer and Information Sciences, Volume 25, Issue 2, [5] July 2013, Pages 127-136 [6] Kavakiotis, Ioannis, Olga Tsave, AthanasiosSalifoglou, NicosMaglaveras, IoannisVlahavas, and IoannaChouvarda. "Machine learning and data mining methods in diabetes research." Computational and structural biotechnology journal (2017). [7] Zheng, Tao et al. "A machine learning-based framework to identify type 2 diabetes through electronic health records." International journal of medical informatics 97 (2017): 120- 127. [8] Rani, A. Swarupa, and S. Jyothi. "Performance analysis of classification algorithms under different datasets." In Computing for Sustainable Global Development (INDIACom), 2016 3rd International Conference on, pp. 1584-1589. IEEE, 2016. [9] Kandhasamy, J. Pradeep, and S. Balamurali. "Performance analysis of classifier models to predict diabetes mellitus." Procedia Computer Science 47 (2015): 45-51. [10] Y. Huang, P. McCullagh, N. Black, R. Harper, Feature selection and classification model construction on type 2 diabetic patients’data, Artificial Intelligence in Medicine 41 (3) (2015) 251–262. [11] Meng, X. H., Huang, Y. X., Rao, D. P., Zhang, Q., & Liu, Q. (2013). Comparison of three data mining models for predicting diabetes or prediabetes by risk factors. The Kaohsiung journal of medical sciences, 29(2), 93-99. [12] Platt, John C. "12 fast training of support vector machines using sequential minimal optimization." Advances in kernel methods (1999): 185-208. [13] John, George H., and Pat Langley. "Estimating continuous distributions in Bayesian classifiers." Proceedings of the Eleventh conference on Uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc., 1995. [14] Aha, David W., Dennis Kibler, and Marc K. Albert. "Instance-based learning algorithms." Machine learning 6.1 (1991): 37-66. [15] Ross Quinlan (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo, CA.