SlideShare a Scribd company logo
1 of 8
Download to read offline
International Journal of Advances in Applied Sciences (IJAAS)
Vol. 9, No. 2, June 2020, pp. 85~92
ISSN: 2252-8814, DOI: 10.11591/ijaas.v9.i2.pp85-92  85
Journal homepage: http://ijaas.iaescore.com
Disease prediction in big data healthcare using extended
convolutional neural network techniques
Asadi Srinivasulu, Asadi Pushpa
Data Analytics Research Laboratory, Sree Vidyanikethan Engineering College, India
Article Info ABSTRACT
Article history:
Received May 3, 2019
Revised Feb 2, 2020
Accepted Mar 14, 2020
Diabetes Mellitus is one of the growing fatal diseases all over the world.
It leads to complications that include heart disease, stroke, and nerve disease,
kidney damage. So, Medical Professionals want a reliable prediction system
to diagnose Diabetes. To predict the diabetes at earlier stage, different
machine learning techniques are useful for examining the data from different
sources and valuable knowledge is synopsized. So, mining the diabetes data
in an efficient way is a crucial concern. In this project, a medical dataset has
been accomplished to predict the diabetes. The R-Studio and Pypark software
was employed as a statistical computing tool for diagnosing diabetes.
The PIMA Indian database was acquired from UCI repository will be used
for analysis. The dataset was studied and analyzed to build an effective
model that predicts and diagnoses the diabetes disease earlier.
Keywords:
CNN,
Diabetes,
Neural networks,
RNN
SVM, This is an open access article under the CC BY-SA license.
Corresponding Author:
Asadi Srinivasulu,
Data Analytics Research Laboratory,
Sree Vidyanikethan Engineering College,
Sree Sainath Nagar, Tirupati, Andhra Pradesh 517102, India.
Email: srinu.asadi@gmail.com
1. INTRODUCTION
As we know that the growth in technology helps the computers to produce huge amount of data.
Additionally, such advancements and innovations in the medical database management systems generate
large volumes of medical data. Healthcare industry contains very large and sensitive data. This data needs to
be treated very careful to get benefitted from it. Diabetic Mellitus is a set of associated diseases in which
the human body is unable to control the quantity of sugar in the blood. It results in high sugar levels in blood,
may be as the body does not produce sufficient insulin, or may because cells do not react to the produced
insulin. The focus is to develop the prediction models by using certain machine learning algorithms.
The Machine Learning is an application of artificial intelligence as it helps the computer to learn on its own.
The two classification of ML are supervised and unsupervised. The Supervised learning calculation utilizes
the past experience to influence expectations on new or inconspicuous information while unsupervised
calculations to can draw derivations from datasets. Machine learning algorithms are:
Supervised learning techniques:
Classification
The procedure of finding the obscure information of the class name which is utilizing recent known
information is called as class mark which is intern called as classification. The following are Popular
Classification Algorithms:
 Random forest
 ISSN: 2252-8814
Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92
86
 SVM
 K-Nearest neighbors
 Decision tree
 Naïve Bayes
Regression
A supervised learning algorithm such as classification which finds the relationship between some
independent variables with some dependent variables isn called Regression. The popular Regression
algorithms are:
 Simple Linear Regression
 Multiple Linear Regression
 Logistic Regression
 Polynomial Regression
 Linear Discriminant Analysis (LDA)
Unsupervised Learning techniques:
Clustering
The process which classifies the similar objects into groups called as clustering mechanism. Some
of the clustering techniques are:
 K-means clustering
 Hierarchical clustering
R studio
An Integrated Development Environment (IDE) for R programming language which was founded
by Jjallaire is called as R Studio. The command line that R Studio uses is interpreter. R studio used for
statistical computing and graphics. R Studio is having many built-in packages so it can manipulate huge
dataset for analysis.
2. LITERATURE REVIEW
The usage of big data for predicting diabetes has been conducted in many researches. Table 1
display researches in the field and also critique given for each research papers. Considering the critique and
notes of each published research, this research will propose a new model for resolving problems from
previous research.
Tabel 1. Review of related research
No. Paper Author(s) Name of the Journal Methods Findings Notes/Critique
1. Predicting
Diabetes in
Medical
Datasets Using
Machine
Learning
Techniques
Uswa Ali Zia,
Dr. Naeem
Khan.
International Journal
of Scientific &
Engineering
Research
(IJSER).
Boot
strapping
resampling
technique to
enhance the
accuracy and
then applying
i. Naive
Bayes,
ii.Decision
Trees
iii.k-Nearest
Neighbors
(k-NN)
After Bootstrapping
Accuracy:
i.NaiveBayes-
74.89%
ii.Decision Trees-
94.44%
iii. k-NN(for k=1)
93.79%
4. k-NN(for k=3) -
76.79%
i.Plan to use further
more advanced
classifiers such as
Neural Networks.
ii. It should consider
some other important
factors that are related
to gestational
diabetes, like
metabolic syndrome,
family history, habit
of smoking, lazy
routines, some dietary
patterns etc.
2. Prediction of
Diabetes Using
Data Mining
Techniques
FikirteGirma,
Woldemichael,
Sumitra
Menaria
International
Conference on
Trends in Electronics
and Informatics
(ICOEI)
i. Back
Propagation
Algorithm
ii. J48
Algorithm
iii. Naïve
Bayes
Classifier
iv. Support
Vector
Machine.
Back Propagation
Algorithm has
Accuracy-83.11%
Sensitivity- 86.53%
Specificity-76%
i. Increment the
accuracy of the
algorithms.
Int J Adv Appl Sci ISSN: 2252-8814 
Disease prediction in big data healthcare using extended convolutional neural network… (Asadi Srinivasulu)
87
Table continued
3. Diabetes
Disease
Prediction
Using Data
Mining
Deeraj Shetty,
Kishor Rit,
Sohail Shaikh,
Nikita Patils
International
Conference on
Innovations in
Information,
Embedded and
Communication
Systems (ICIIECS).
i. Naïve
Bayes
ii. k-NN
algorithms
prediction of the
disease will be
done with the help
of Bayesian
algorithm and KNN
algorithm and
analyze them by
taking various
attributes of
diabetes.
i. Increment the
accuracy of the
algorithms.
ii. So Working on
some more attributes
which is used to
tackle the diabetes
even more.
4. Classification
of
Diabetic
Patients by
using Efficient
Prediction
from Big Data
using R Studio
K. Sharmila,
Dr. S.A. Vetha
Manickam
International Journal
of Advanced
Engineering
Research and Science
(IJAERS).
Decision tree i. Using R, the
dataset is analyzed
and the correlation
coefficient for two
attributes is
calculated.
ii. Decision Tree is
used to predict the
type of Diabetes.
Possibility of
developing efficient
predictive models
using the information
from the analysis
which is already
carried out.
5. Diagnosis of
diabetes using
Classification
mining
Techniques
AiswaryaIyer,
S. Jeyalatha
and Ronak
Sumbaly
International Journal
of Data Mining &
Knowledge
Management Process
(IJDKP).
Decision tree
Naïve Bayes.
J48 Cross
validation-74.8698
%
J48 Percentage
Split-76.9565 %
Naive Bayes-
79.5652 %
i.In future the work,
planned to be
gathering the
information from
different locales over
the world.
ii.This work can be
improved and
extended for the
automation of
diabetes analysis.
6. An Disease
Diagnosis
using Data
Mining
Techniques
And Empirical
study.
M. Deepika,
Dr. K.
Kalaiselvi
The 2nd
International
Conference on
Inventive
Communication and
Computational
Technologies
(ICICCT).
i.Artificial
Neural
Network
ii.Decision
Tree
iii. Logistic
Regression
iv. Naïve
Bayes
v. SVM
Artificial Neural
Network: 73.23%
Logistic Regression
:76.13%
Decision Tree
:77.87%
Efficient and
Accurate classifier
can be developed.
3. PROPOSED SYSTEM
We propose a classification model with boosted accuracy to predict the diabetic patient. In this
model, we have employed different machine learning techniques are using like classification, regression and
clustering. The major focus is to increase the accuracy by using resample technique on a benchmark well
renowned PIMA diabetes dataset that was acquired from UCI machine learning repository, having eight
attributes and one class label. The proposed framework is shown in Figure 1. The description of each phase is
mentioned.
3.1. Data selection
Data selection is a process in which the most relevant data is selected from a specific domain to
derive values that are informative and facilitate learning. PIMA diabetes dataset having 8 attributes that are
used to predict the diabetes at earlier stage. This dataset is obtained from UCI repository.
3.2. Data pre-processing
Data pre-processing is a Machine Learning technique that includes changing crude information into
reasonable configuration. It includes Data Cleaning, Data Integration, Data Transformation, and
Data Discretization.
3.3. Feature extraction through principle component analysis
Feature Extraction on the dataset to determine the most suitable set of attributes that can help
achieve better classification. The set of attributes suggested by the PCA are termed as feature vector.
Feature reduction or dimensionality reduction will be benefitted us by reducing the computation and
space complexity.
 ISSN: 2252-8814
Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92
88
3.4. Resampling Filter
The supervised Resample filter is applied to the pre-processed dataset. Re-sampling is a series of
methods used to reconstruct your sample data sets, including training sets and validation sets. In this study,
Boot strapping resampling technique to enhance the accuracy.
Figure 1. Proposed system for diabetes prediction system
4. MACHINE LEARNING TECHNIQUES
4.1. Classification
4.1.1. Random forest
The outfit learning technique used for the classification and regression that operates by constructing
the multitude of decision trees at training time and outputting the class i.e mode of the classes or
the regression of the individual trees. Irregular choice woods right for choice trees propensity which is used
for over fitting on to their preparation set.
Int J Adv Appl Sci ISSN: 2252-8814 
Disease prediction in big data healthcare using extended convolutional neural network… (Asadi Srinivasulu)
89
4.1.2. Support vector machine (SVM)
SVM is a division of Supervised Learning Algorithm. The strategy used to perform regression,
classification and outlier detection of data.SVM will be grouping the information dependent that on the hyper
plane. The hyper plane is used to totally isolate the two classes in the best way and the most extreme edge
hyper plane ought to be picked as a best separator. The two types SVM Classifiers that are been used are
used are: Linear Classifier and Non-Linear Classifier.
4.1.3. Decision tree
The algorithm which is mainly used to produce a classification on training data and regression
model into a tree structure is called as Decision tree algorithm, it is based on previous data to classify/predict
class or target variables of future/new data with the help of decision rules or decision trees. Decision tree can
be useful for both numerical and categorical data. The tree in which the root node in each level is a starting
point or the best splitting attribute in that position which helps to test on an attribute is called as complete
decision tree. The yield of the test will create branches. Leaf hub will go about as a last class mark or target
variable to characterize/foresee the new information. Arrangement rules are attracted from root to leaf.
4.1.4. Naïve bayes
The algorithm performs classification tasks in the field of ML are called as Naïve Bayes. It can
perform classification very well on the dataset even it has huge records with multi class and binary class
classification problems. The application of Naive Bayes is mainly to text analysis and Natural Language
Processing. It works based on conditional probability. It can be represented (1).
𝑃(𝑀|𝑁) =
𝑃( 𝑀| 𝑁)𝑃(𝑀)
𝑃(𝑁)
(1)
Here M and N are two events and, P(M|N) is the conditional probability of M given N.P(M) is
the probability of M. P(N) is the probability of N. P (N|M) is the conditional probability of N given M.
4.1.5. K-nearest neighbors
The supervised classifier which is a best choice for K-NN is called as k-Nearest Neighbor. It is
a best choice for the classification of k-NN kind of problems. In order to predict the target label of a test data,
KNN which finds distance between nearest training data class labels and new test data point in the presence
of K value? KNN uses K variable value between 0 to 10 normally.
4.2. Regression
4.2.1. Simple linear regression
The linear Regression algorithm which explains the relationship between independent and
dependent variables to predict the values of the dependent variable is called as Simple Linear Regression
algorithm. Simple regression uses one independent variable. The simple linear regression model is
represented (2).
y= (b0 +b1x) (2)
Here, x(independent variable) and y (dependant variable) are two factors involved in simple linear
regression analysis. Also, b0 is the Y-intercept and b1 is the Slope.
4.2.2. Multiple linear regressions
It explains the relationship between two or more independent variables and a dependent variable to
predict the values of the dependent variable. It uses two or more independent variables. Dependent variable
has a continuous and independent variable has discrete or continuous values. The multiple linear regression
model is represented as (3)
y= (p0 +p1x1+p2x2+…+pnxn) (3)
Here x1, x2... xn (independent variable) and y (dependant variable) are two factors involved in
multiple linear regression analysis. Also b0 is the y-intercept and p1, p2… pn is the slope.
 ISSN: 2252-8814
Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92
90
4.2.3. Logistic regression
The predictive analysis which is used for the dependent variable is categorical called as Logistical
Regression. Logistical Regression explains the relationship between one dependent variable and one or more
independent variables. The various types of Logistic Regression are:
 Multinomial Logistic Regression (many)
 Binary Logistic Regression (two)
 Ordinal Logistic Regression (1)
The categorical response has only two possible outcomes. Multinomial Logistic Regression has
three or more outcomes without ordering whereas Ordinal Logistic Regression has three or more outcomes
with ordering.
4.2.4. Polynomial regression
The form of regression analysis which explains the relationship between the independent variable
and dependent variable as an nth degree polynomial is called as polynomial regression. It fits a non-linear
relationship between the value of independent variable and conditional mean of dependent variable. It is
represented as (4).
x = a + b * y ^ n (4)
Here p is Dependent Variable, q is Independent Variable and n is Degree.
It is used to fit the data very well when the data is below and above the regression model. It
minimizes the cost function and provides optimum result on the regression.
4.2.5. Linear discriminant analysis
The process of using various data items and applying different functions to that set to analyze
classes of objects or items separately is called Linear Discriminant Analysis. Image Recognition and
Predictive analytics use this Linear Discriminant Analysis
4.3. Clustering
4.3.1. K-means clustering
The unsupervised machine learning algorithm which is used to solve clustering problems by
classifying the dataset into a number of clusters k (group of similar objects), which defines the number of
clusters which is assumed before classifying the dataset.
4.3.2. Hierarchical clustering
The type of clustering algorithm which is used to build a hierarchy of clusters is called hierarchical
clustering. The two types of Hierarchical Clustering are:
4.3.3. Agglomerative clustering
It is used to group objects into clusters based on their similarity. The result obtained at last is a tree
representation of objects called Dendrogram.
4.3.4. Divisive analysis
This is a best down methodology where all perceptions begin in one bunch, and parts are performed
recursively as one moves down the pecking order. A hierarchical clustering is often represented as
a dendrogram. Each cluster will be representing with centroids. Distance will be calculated by using linkage.
5. RESULTS AND ANALYSIS
Indian diabetes dataset named PIMA were used for analysis for this study. It consists of eight
independent attributes and one independent class attribute. The study was implemented by R programming
language using R Studio. Machine learning algorithms like classification (Decision Tree, Naïve Bayes, k-NN
and Random Forest), regression (linear, multiple, logistic, LDA) and clustering (k-means, hierarchical
agglomerative) are used to predict the diabetics disease in early stages as shown in Table 1. Measure
Performance model by using accuracy as shown in Figure 2.
Int J Adv Appl Sci ISSN: 2252-8814 
Disease prediction in big data healthcare using extended convolutional neural network… (Asadi Srinivasulu)
91
Table1. Predictive analysis of machine learning algorithms
S. No Algorithm Accuracy
1 Random forest 83%
2. Decision tree 77%
3. SVM 92%
4. Naïve Bayes 86%
5. K-NN 91%
6. Simple linear regression 98%
7. Logistic regression 88%
8. LDA 88%
9. k-Means 81%
10. Hierarchical agglomerative 74%
Figure 2. Comparison of accuracy of various algorithms
6. CONCLUSION AND FUTURE WORK
Deep Learning and Data mining plays an important role in various fields such as Artificial
Intelligence (AI) and Machine Learning (ML), Database Systems and more. The core objective is to enhance
the accuracy of predictive model. This PIMA dataset will increase the accuracy of almost all algorithms but
the SVM and linear regression leads over others. In future many advanced deep learning techniques will be
used to increasing the accuracy of the algorithms.
REFERENCES
[1] Usma Niyaz et al, “Advances in Deep Learning Techniques for Medical Image Analysis” 5th IEEE International
Conference on Parallel, Distributed and Grid Computing (PDGC-2018), 978-1-5386-6026-3/18/$31©2018 IEEE,
20-22 Dec, 2018, Solan, India.
[2] Y. LeCun, L. Bottou, Y. Bengio and P. Haffner, “Gradient-Based Learning Applied to Document Recognition,”
Proceedings of IEEE, vol. 86, pp. 2278-2324.
[3] Leslie N. Smith, “Cyclical Learning Rates for Training Neural Networks,” 2017 IEEE Winter Conference on
Applications of Computer Vision (WACV), pp. 464–472, 2017.
[4] J. Wang, L. Perez, “The Effectiveness of Data Augmentation in Image Classification using Deep Learning,”
Computing Research Repository (CoRR) - arXiv, 2017.
[5] N. Srivastava, G. Hinton, A. Krizhevsky and I. Sutskever and R. Salakhutdinov, “Dropout: A Simple Way to
Prevent Neural Networks from Overfitting,” Journal of Machine Learning Research (JMLR), vol.15,
pp. 1929-1958, 2014.
[6] M. S. Al-tarawneh, “Lung Cancer Detection Using Image Processing Techniques,” Leonardo Electronic Journal of
Practices and Technologies (LEJPT), no. 20, pp. 147–158, 2012.
[7] Abhishek S. Sambyal, Asha T., “Knowledge Abstraction from Textural Features of Brain MRI Images for
Diagnosing Brain Tumor using Statistical Techniques and Associative Classification,” 2016 International
Conference on Systems in Medicine and Biology, IIT Kharagpur, 2016.
[8] A. Krizhevsky, I. Sutskever and G. E. Hinton, “ImageNet Classification with Deep Convolutional Neural
Networks,” NIPS’12 Proceedings of the 25th International Conference on Neural Information Processing Systems,
vol. 1, pp. 1097–1105.
 ISSN: 2252-8814
Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92
92
[9] K. Simonyan and A. Zisserman, “Very Deep Convolutional Network for Large-Scale Image Recognition,”
International Conference on Learning Representations (ICLR), 2015.
[10] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, “Going
deeper with convolutions,”2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1–9,
2015.
[11] K. He, X. Zhang, S. Ren, J. Sun, “Deep Residual Learning for Image Recognition,” 2016 IEEE Conference on
Computer Vision and Pattern Recognition (CVPR), pp.770-778, 2016.
[12] O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional Networks for Biomedical Image Segmentation,”
MICCAI 2015, pp. 234?241, 2015.
[13] F. Milletari, N. Navab, and S. A. Ahmadi, “V-Net: Fully Convolutional Networks for Volumetric Medical Image
Segmentation,” 2016 Fourth International Conference on 3D Vision, pp.565-571, 2016.
[14] E. Gibson, W. Li, C. Sudre, L. Fidon, D. I. Shakir, G. Wang Z. Eaton- Rosen, R. Gray, T. Doel, Y. Hu, T. Whyntie,
P. Nachev, M. Modat D. C. Barratt, S. Ourselin, M. Jorge Cardoso and T. Vercauteren, “NiftyNet: a deep-learning
platform for medical imaging,” Computer Methods and Programs in Biomedicine, vol. 158, pp. 113 - 122, 2018.
[15] S. Jgou, M. Drozdzal, D. Vzquez, A. Romero, and Y. Bengio, “The One Hundred Layers Tiramisu: Fully
Convolutional DenseNets for Semantic Segmentation,” arXiv:1611.09326v2 [cs.CV],2016.
[16] W. Chen, Y. Zhang, J. He, Y. Qiao, Y. Chen, H. Shi, X. Tang, “W-net: Bridged U-net for 2D Medical Image
Segmentation,” arXiv:1807.04459v1 [cs.CV] 12 Jul 2018.
[17] G. Yang, H. Jing, “Multiple Convolutional Neural Network for Feature Extraction,” International Conference on
Intelligent Computing (ICIC), 2015.
[18] J. Ding, A. Li, Z. Hu, and L. Wang, “Accurate Pulmonary Nodule Detection in Computed Tomography Images
Using Deep Convolutional Neural Networks,” Computing Research Repository (CoRR) - arXiv, 2017.
[19] Q. Song, L. Zhao, X. Luo, and X. Dou, “Using Deep Learning for Classification of Lung Nodules on Computed
Tomography Images,” Journal of Healthcare Engineering, vol. 2017, 2017.
[20] H. Chougrad, H. Zouaki, O. Alheyane “Convolutional Neural Networks for Breast Cancer Screening: Transfer
Learning with Exponential Decay,”- arXiv, 2017.
[21] A. Esteva, B. Kuprel, R. A. Novoa, J. Ko, S. M. Swetter, H. M. Blau& S. Thrun, “Dermatologist-level
classification of skin cancer with deep neural networks,” Nature, vol. 542, pp. 115–118, 2017.
[22] E. Sert, S. Ertekin, U. Halici, “Ensemble of Convolutional Neural Networks for Classification of Breast
Microcalcification from Mammograms,” 2017 39th Annual International Conference of the IEEE Engineering in
Medicine and Biology Society (EMBC), pp. 689–692, 2017.
[23] N. C. F. Codella, Q. B. Nguyen, S. Pankanti, D. Gutman, B. Helba, A. Halpern, J. R. Smith, “Deep learning
ensembles for melanoma recognition in dermoscopy images,” Computing Research Repository (CoRR), vol.
abs/1610.04662, 2016.
[24] K. J. Geras, S. Wolfson, Y. Shen, S. Gene Kim, L. Moy, K. Cho, “High-Resolution Breast Cancer Screening with
Multi-View Deep Convolutional Neural Networks,” Computing Research Repository (CoRR) - arXiv, 2017.

More Related Content

What's hot

76 s201915
76 s20191576 s201915
76 s201915IJRAT
 
IRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health RecordIRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health RecordIRJET Journal
 
Psdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heartPsdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heartZTech Proje
 
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...Sivagowry Shathesh
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSivagowry Shathesh
 
IRJET - Chronic Kidney Disease Prediction using Data Mining and Machine Learning
IRJET - Chronic Kidney Disease Prediction using Data Mining and Machine LearningIRJET - Chronic Kidney Disease Prediction using Data Mining and Machine Learning
IRJET - Chronic Kidney Disease Prediction using Data Mining and Machine LearningIRJET Journal
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesIJRES Journal
 
Heart Disease Prediction Using Associative Relational Classification Techniq...
Heart Disease Prediction Using Associative Relational  Classification Techniq...Heart Disease Prediction Using Associative Relational  Classification Techniq...
Heart Disease Prediction Using Associative Relational Classification Techniq...IJMER
 
IRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET Journal
 
prediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsprediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsINFOGAIN PUBLICATION
 
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...IRJET Journal
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISijcsit
 
An efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysisAn efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysisjournalBEEI
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksIAEME Publication
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Surveyrahulmonikasharma
 
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning AlgorithmsIRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning AlgorithmsIRJET Journal
 
Framework for efficient transformation for complex medical data for improving...
Framework for efficient transformation for complex medical data for improving...Framework for efficient transformation for complex medical data for improving...
Framework for efficient transformation for complex medical data for improving...IJECEIAES
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...Editor IJCATR
 
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes Technique
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes TechniqueIRJET- Chronic Kidney Disease Prediction based on Naive Bayes Technique
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes TechniqueIRJET Journal
 

What's hot (19)

76 s201915
76 s20191576 s201915
76 s201915
 
IRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health RecordIRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
IRJET- Heart Failure Risk Prediction using Trained Electronic Health Record
 
Psdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heartPsdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heart
 
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
PSO-An Intellectual Technique for Feature Reduction on Heart Malady Anticipat...
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 
IRJET - Chronic Kidney Disease Prediction using Data Mining and Machine Learning
IRJET - Chronic Kidney Disease Prediction using Data Mining and Machine LearningIRJET - Chronic Kidney Disease Prediction using Data Mining and Machine Learning
IRJET - Chronic Kidney Disease Prediction using Data Mining and Machine Learning
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
 
Heart Disease Prediction Using Associative Relational Classification Techniq...
Heart Disease Prediction Using Associative Relational  Classification Techniq...Heart Disease Prediction Using Associative Relational  Classification Techniq...
Heart Disease Prediction Using Associative Relational Classification Techniq...
 
IRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET- Medical Data Mining
IRJET- Medical Data Mining
 
prediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithmsprediction of heart disease using machine learning algorithms
prediction of heart disease using machine learning algorithms
 
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 
An efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysisAn efficient feature selection algorithm for health care data analysis
An efficient feature selection algorithm for health care data analysis
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
 
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning AlgorithmsIRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
 
Framework for efficient transformation for complex medical data for improving...
Framework for efficient transformation for complex medical data for improving...Framework for efficient transformation for complex medical data for improving...
Framework for efficient transformation for complex medical data for improving...
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
 
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes Technique
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes TechniqueIRJET- Chronic Kidney Disease Prediction based on Naive Bayes Technique
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes Technique
 

Similar to Disease prediction in big data healthcare using extended convolutional neural network techniques

IRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease PredictionIRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease PredictionIRJET Journal
 
An efficient stacking based NSGA-II approach for predicting type 2 diabetes
An efficient stacking based NSGA-II approach for predicting  type 2 diabetesAn efficient stacking based NSGA-II approach for predicting  type 2 diabetes
An efficient stacking based NSGA-II approach for predicting type 2 diabetesIJECEIAES
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...IAEMEPublication
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...IAEME Publication
 
IRJET- Disease Analysis and Giving Remedies through an Android Application
IRJET- Disease Analysis and Giving Remedies through an Android ApplicationIRJET- Disease Analysis and Giving Remedies through an Android Application
IRJET- Disease Analysis and Giving Remedies through an Android ApplicationIRJET Journal
 
Multi Disease Detection using Deep Learning
Multi Disease Detection using Deep LearningMulti Disease Detection using Deep Learning
Multi Disease Detection using Deep LearningIRJET Journal
 
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop ClusterIRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop ClusterIRJET Journal
 
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...IJARIIT
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetIRJET Journal
 
IRJET- Predicting Diabetes Disease using Effective Classification Techniques
IRJET-  	  Predicting Diabetes Disease using Effective Classification TechniquesIRJET-  	  Predicting Diabetes Disease using Effective Classification Techniques
IRJET- Predicting Diabetes Disease using Effective Classification TechniquesIRJET Journal
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
 
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET Journal
 
IRJET- Disease Prediction using Machine Learning
IRJET-  	  Disease Prediction using Machine LearningIRJET-  	  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine LearningIRJET Journal
 
IRJET- Heart Disease Prediction and Recommendation
IRJET-  	  Heart Disease Prediction and RecommendationIRJET-  	  Heart Disease Prediction and Recommendation
IRJET- Heart Disease Prediction and RecommendationIRJET Journal
 
Multiple Disease Prediction System: A Review
Multiple Disease Prediction System: A ReviewMultiple Disease Prediction System: A Review
Multiple Disease Prediction System: A ReviewIRJET Journal
 
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONMULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONIJDKP
 
ML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early StageML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early StageIRJET Journal
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAIRCC Publishing Corporation
 
Health Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningHealth Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningIRJET Journal
 

Similar to Disease prediction in big data healthcare using extended convolutional neural network techniques (20)

IRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease PredictionIRJET-Survey on Data Mining Techniques for Disease Prediction
IRJET-Survey on Data Mining Techniques for Disease Prediction
 
An efficient stacking based NSGA-II approach for predicting type 2 diabetes
An efficient stacking based NSGA-II approach for predicting  type 2 diabetesAn efficient stacking based NSGA-II approach for predicting  type 2 diabetes
An efficient stacking based NSGA-II approach for predicting type 2 diabetes
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
 
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
A CONCEPTUAL APPROACH TO ENHANCE PREDICTION OF DIABETES USING ALTERNATE FEATU...
 
IRJET- Disease Analysis and Giving Remedies through an Android Application
IRJET- Disease Analysis and Giving Remedies through an Android ApplicationIRJET- Disease Analysis and Giving Remedies through an Android Application
IRJET- Disease Analysis and Giving Remedies through an Android Application
 
Multi Disease Detection using Deep Learning
Multi Disease Detection using Deep LearningMulti Disease Detection using Deep Learning
Multi Disease Detection using Deep Learning
 
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop ClusterIRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
IRJET- Analyse Big Data Electronic Health Records Database using Hadoop Cluster
 
50120140506011
5012014050601150120140506011
50120140506011
 
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease Dataset
 
IRJET- Predicting Diabetes Disease using Effective Classification Techniques
IRJET-  	  Predicting Diabetes Disease using Effective Classification TechniquesIRJET-  	  Predicting Diabetes Disease using Effective Classification Techniques
IRJET- Predicting Diabetes Disease using Effective Classification Techniques
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
 
IRJET- Disease Prediction using Machine Learning
IRJET-  	  Disease Prediction using Machine LearningIRJET-  	  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine Learning
 
IRJET- Heart Disease Prediction and Recommendation
IRJET-  	  Heart Disease Prediction and RecommendationIRJET-  	  Heart Disease Prediction and Recommendation
IRJET- Heart Disease Prediction and Recommendation
 
Multiple Disease Prediction System: A Review
Multiple Disease Prediction System: A ReviewMultiple Disease Prediction System: A Review
Multiple Disease Prediction System: A Review
 
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONMULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
 
ML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early StageML In Predicting Diabetes In The Early Stage
ML In Predicting Diabetes In The Early Stage
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 
Health Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningHealth Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep Learning
 

More from IJAAS Team

Super Capacitor Electronic Circuit Design for Wireless Charging
Super Capacitor Electronic Circuit Design for Wireless ChargingSuper Capacitor Electronic Circuit Design for Wireless Charging
Super Capacitor Electronic Circuit Design for Wireless ChargingIJAAS Team
 
On the High Dimentional Information Processing in Quaternionic Domain and its...
On the High Dimentional Information Processing in Quaternionic Domain and its...On the High Dimentional Information Processing in Quaternionic Domain and its...
On the High Dimentional Information Processing in Quaternionic Domain and its...IJAAS Team
 
Using FPGA Design and HIL Algorithm Simulation to Control Visual Servoing
Using FPGA Design and HIL Algorithm Simulation to Control Visual ServoingUsing FPGA Design and HIL Algorithm Simulation to Control Visual Servoing
Using FPGA Design and HIL Algorithm Simulation to Control Visual ServoingIJAAS Team
 
Mitigation of Selfish Node Attacks In Autoconfiguration of MANETs
Mitigation of Selfish Node Attacks In Autoconfiguration of MANETsMitigation of Selfish Node Attacks In Autoconfiguration of MANETs
Mitigation of Selfish Node Attacks In Autoconfiguration of MANETsIJAAS Team
 
Vision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language RecognitionVision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language RecognitionIJAAS Team
 
Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...
Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...
Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...IJAAS Team
 
Review: Dual Band Microstrip Antennas for Wireless Applications
Review: Dual Band Microstrip Antennas for Wireless ApplicationsReview: Dual Band Microstrip Antennas for Wireless Applications
Review: Dual Band Microstrip Antennas for Wireless ApplicationsIJAAS Team
 
Building Fault Tolerance Within Wsn-A Topology Model
Building Fault Tolerance Within Wsn-A Topology ModelBuilding Fault Tolerance Within Wsn-A Topology Model
Building Fault Tolerance Within Wsn-A Topology ModelIJAAS Team
 
Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...
Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...
Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...IJAAS Team
 
An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...
An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...
An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...IJAAS Team
 
A Novel Handoff Necessity Estimation Approach Based on Travelling Distance
A Novel Handoff Necessity Estimation Approach Based on Travelling DistanceA Novel Handoff Necessity Estimation Approach Based on Travelling Distance
A Novel Handoff Necessity Estimation Approach Based on Travelling DistanceIJAAS Team
 
The Combination of Steganography and Cryptography for Medical Image Applications
The Combination of Steganography and Cryptography for Medical Image ApplicationsThe Combination of Steganography and Cryptography for Medical Image Applications
The Combination of Steganography and Cryptography for Medical Image ApplicationsIJAAS Team
 
Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...
Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...
Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...IJAAS Team
 
Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...
Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...
Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...IJAAS Team
 
SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...
SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...
SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...IJAAS Team
 
An Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc Network
An Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc NetworkAn Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc Network
An Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc NetworkIJAAS Team
 
A Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM System
A Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM SystemA Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM System
A Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM SystemIJAAS Team
 
Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...
Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...
Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...IJAAS Team
 
A Fusion Based Visibility Enhancement of Single Underwater Hazy Image
A Fusion Based Visibility Enhancement of Single Underwater Hazy ImageA Fusion Based Visibility Enhancement of Single Underwater Hazy Image
A Fusion Based Visibility Enhancement of Single Underwater Hazy ImageIJAAS Team
 
Graph Based Workload Driven Partitioning System by Using MongoDB
Graph Based Workload Driven Partitioning System by Using MongoDBGraph Based Workload Driven Partitioning System by Using MongoDB
Graph Based Workload Driven Partitioning System by Using MongoDBIJAAS Team
 

More from IJAAS Team (20)

Super Capacitor Electronic Circuit Design for Wireless Charging
Super Capacitor Electronic Circuit Design for Wireless ChargingSuper Capacitor Electronic Circuit Design for Wireless Charging
Super Capacitor Electronic Circuit Design for Wireless Charging
 
On the High Dimentional Information Processing in Quaternionic Domain and its...
On the High Dimentional Information Processing in Quaternionic Domain and its...On the High Dimentional Information Processing in Quaternionic Domain and its...
On the High Dimentional Information Processing in Quaternionic Domain and its...
 
Using FPGA Design and HIL Algorithm Simulation to Control Visual Servoing
Using FPGA Design and HIL Algorithm Simulation to Control Visual ServoingUsing FPGA Design and HIL Algorithm Simulation to Control Visual Servoing
Using FPGA Design and HIL Algorithm Simulation to Control Visual Servoing
 
Mitigation of Selfish Node Attacks In Autoconfiguration of MANETs
Mitigation of Selfish Node Attacks In Autoconfiguration of MANETsMitigation of Selfish Node Attacks In Autoconfiguration of MANETs
Mitigation of Selfish Node Attacks In Autoconfiguration of MANETs
 
Vision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language RecognitionVision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language Recognition
 
Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...
Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...
Design and Analysis of an Improved Nucleotide Sequences Compression Algorithm...
 
Review: Dual Band Microstrip Antennas for Wireless Applications
Review: Dual Band Microstrip Antennas for Wireless ApplicationsReview: Dual Band Microstrip Antennas for Wireless Applications
Review: Dual Band Microstrip Antennas for Wireless Applications
 
Building Fault Tolerance Within Wsn-A Topology Model
Building Fault Tolerance Within Wsn-A Topology ModelBuilding Fault Tolerance Within Wsn-A Topology Model
Building Fault Tolerance Within Wsn-A Topology Model
 
Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...
Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...
Simplified Space Vector Pulse Width Modulation Based on Switching Schemes wit...
 
An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...
An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...
An Examination of the Impact of Power Sector Reform on Manufacturing and Serv...
 
A Novel Handoff Necessity Estimation Approach Based on Travelling Distance
A Novel Handoff Necessity Estimation Approach Based on Travelling DistanceA Novel Handoff Necessity Estimation Approach Based on Travelling Distance
A Novel Handoff Necessity Estimation Approach Based on Travelling Distance
 
The Combination of Steganography and Cryptography for Medical Image Applications
The Combination of Steganography and Cryptography for Medical Image ApplicationsThe Combination of Steganography and Cryptography for Medical Image Applications
The Combination of Steganography and Cryptography for Medical Image Applications
 
Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...
Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...
Physical Properties and Compressive Strength of Zinc Oxide Nanopowder as a Po...
 
Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...
Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...
Experimental and Modeling Dynamic Study of the Indirect Solar Water Heater: A...
 
SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...
SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...
SLIC Superpixel Based Self Organizing Maps Algorithm for Segmentation of Micr...
 
An Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc Network
An Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc NetworkAn Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc Network
An Improved Greedy Parameter Stateless Routing in Vehicular Ad Hoc Network
 
A Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM System
A Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM SystemA Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM System
A Novel CAZAC Sequence Based Timing Synchronization Scheme for OFDM System
 
Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...
Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...
Workload Aware Incremental Repartitioning of NoSQL for Online Transactional P...
 
A Fusion Based Visibility Enhancement of Single Underwater Hazy Image
A Fusion Based Visibility Enhancement of Single Underwater Hazy ImageA Fusion Based Visibility Enhancement of Single Underwater Hazy Image
A Fusion Based Visibility Enhancement of Single Underwater Hazy Image
 
Graph Based Workload Driven Partitioning System by Using MongoDB
Graph Based Workload Driven Partitioning System by Using MongoDBGraph Based Workload Driven Partitioning System by Using MongoDB
Graph Based Workload Driven Partitioning System by Using MongoDB
 

Recently uploaded

Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 

Recently uploaded (20)

Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 

Disease prediction in big data healthcare using extended convolutional neural network techniques

  • 1. International Journal of Advances in Applied Sciences (IJAAS) Vol. 9, No. 2, June 2020, pp. 85~92 ISSN: 2252-8814, DOI: 10.11591/ijaas.v9.i2.pp85-92  85 Journal homepage: http://ijaas.iaescore.com Disease prediction in big data healthcare using extended convolutional neural network techniques Asadi Srinivasulu, Asadi Pushpa Data Analytics Research Laboratory, Sree Vidyanikethan Engineering College, India Article Info ABSTRACT Article history: Received May 3, 2019 Revised Feb 2, 2020 Accepted Mar 14, 2020 Diabetes Mellitus is one of the growing fatal diseases all over the world. It leads to complications that include heart disease, stroke, and nerve disease, kidney damage. So, Medical Professionals want a reliable prediction system to diagnose Diabetes. To predict the diabetes at earlier stage, different machine learning techniques are useful for examining the data from different sources and valuable knowledge is synopsized. So, mining the diabetes data in an efficient way is a crucial concern. In this project, a medical dataset has been accomplished to predict the diabetes. The R-Studio and Pypark software was employed as a statistical computing tool for diagnosing diabetes. The PIMA Indian database was acquired from UCI repository will be used for analysis. The dataset was studied and analyzed to build an effective model that predicts and diagnoses the diabetes disease earlier. Keywords: CNN, Diabetes, Neural networks, RNN SVM, This is an open access article under the CC BY-SA license. Corresponding Author: Asadi Srinivasulu, Data Analytics Research Laboratory, Sree Vidyanikethan Engineering College, Sree Sainath Nagar, Tirupati, Andhra Pradesh 517102, India. Email: srinu.asadi@gmail.com 1. INTRODUCTION As we know that the growth in technology helps the computers to produce huge amount of data. Additionally, such advancements and innovations in the medical database management systems generate large volumes of medical data. Healthcare industry contains very large and sensitive data. This data needs to be treated very careful to get benefitted from it. Diabetic Mellitus is a set of associated diseases in which the human body is unable to control the quantity of sugar in the blood. It results in high sugar levels in blood, may be as the body does not produce sufficient insulin, or may because cells do not react to the produced insulin. The focus is to develop the prediction models by using certain machine learning algorithms. The Machine Learning is an application of artificial intelligence as it helps the computer to learn on its own. The two classification of ML are supervised and unsupervised. The Supervised learning calculation utilizes the past experience to influence expectations on new or inconspicuous information while unsupervised calculations to can draw derivations from datasets. Machine learning algorithms are: Supervised learning techniques: Classification The procedure of finding the obscure information of the class name which is utilizing recent known information is called as class mark which is intern called as classification. The following are Popular Classification Algorithms:  Random forest
  • 2.  ISSN: 2252-8814 Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92 86  SVM  K-Nearest neighbors  Decision tree  Naïve Bayes Regression A supervised learning algorithm such as classification which finds the relationship between some independent variables with some dependent variables isn called Regression. The popular Regression algorithms are:  Simple Linear Regression  Multiple Linear Regression  Logistic Regression  Polynomial Regression  Linear Discriminant Analysis (LDA) Unsupervised Learning techniques: Clustering The process which classifies the similar objects into groups called as clustering mechanism. Some of the clustering techniques are:  K-means clustering  Hierarchical clustering R studio An Integrated Development Environment (IDE) for R programming language which was founded by Jjallaire is called as R Studio. The command line that R Studio uses is interpreter. R studio used for statistical computing and graphics. R Studio is having many built-in packages so it can manipulate huge dataset for analysis. 2. LITERATURE REVIEW The usage of big data for predicting diabetes has been conducted in many researches. Table 1 display researches in the field and also critique given for each research papers. Considering the critique and notes of each published research, this research will propose a new model for resolving problems from previous research. Tabel 1. Review of related research No. Paper Author(s) Name of the Journal Methods Findings Notes/Critique 1. Predicting Diabetes in Medical Datasets Using Machine Learning Techniques Uswa Ali Zia, Dr. Naeem Khan. International Journal of Scientific & Engineering Research (IJSER). Boot strapping resampling technique to enhance the accuracy and then applying i. Naive Bayes, ii.Decision Trees iii.k-Nearest Neighbors (k-NN) After Bootstrapping Accuracy: i.NaiveBayes- 74.89% ii.Decision Trees- 94.44% iii. k-NN(for k=1) 93.79% 4. k-NN(for k=3) - 76.79% i.Plan to use further more advanced classifiers such as Neural Networks. ii. It should consider some other important factors that are related to gestational diabetes, like metabolic syndrome, family history, habit of smoking, lazy routines, some dietary patterns etc. 2. Prediction of Diabetes Using Data Mining Techniques FikirteGirma, Woldemichael, Sumitra Menaria International Conference on Trends in Electronics and Informatics (ICOEI) i. Back Propagation Algorithm ii. J48 Algorithm iii. Naïve Bayes Classifier iv. Support Vector Machine. Back Propagation Algorithm has Accuracy-83.11% Sensitivity- 86.53% Specificity-76% i. Increment the accuracy of the algorithms.
  • 3. Int J Adv Appl Sci ISSN: 2252-8814  Disease prediction in big data healthcare using extended convolutional neural network… (Asadi Srinivasulu) 87 Table continued 3. Diabetes Disease Prediction Using Data Mining Deeraj Shetty, Kishor Rit, Sohail Shaikh, Nikita Patils International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS). i. Naïve Bayes ii. k-NN algorithms prediction of the disease will be done with the help of Bayesian algorithm and KNN algorithm and analyze them by taking various attributes of diabetes. i. Increment the accuracy of the algorithms. ii. So Working on some more attributes which is used to tackle the diabetes even more. 4. Classification of Diabetic Patients by using Efficient Prediction from Big Data using R Studio K. Sharmila, Dr. S.A. Vetha Manickam International Journal of Advanced Engineering Research and Science (IJAERS). Decision tree i. Using R, the dataset is analyzed and the correlation coefficient for two attributes is calculated. ii. Decision Tree is used to predict the type of Diabetes. Possibility of developing efficient predictive models using the information from the analysis which is already carried out. 5. Diagnosis of diabetes using Classification mining Techniques AiswaryaIyer, S. Jeyalatha and Ronak Sumbaly International Journal of Data Mining & Knowledge Management Process (IJDKP). Decision tree Naïve Bayes. J48 Cross validation-74.8698 % J48 Percentage Split-76.9565 % Naive Bayes- 79.5652 % i.In future the work, planned to be gathering the information from different locales over the world. ii.This work can be improved and extended for the automation of diabetes analysis. 6. An Disease Diagnosis using Data Mining Techniques And Empirical study. M. Deepika, Dr. K. Kalaiselvi The 2nd International Conference on Inventive Communication and Computational Technologies (ICICCT). i.Artificial Neural Network ii.Decision Tree iii. Logistic Regression iv. Naïve Bayes v. SVM Artificial Neural Network: 73.23% Logistic Regression :76.13% Decision Tree :77.87% Efficient and Accurate classifier can be developed. 3. PROPOSED SYSTEM We propose a classification model with boosted accuracy to predict the diabetic patient. In this model, we have employed different machine learning techniques are using like classification, regression and clustering. The major focus is to increase the accuracy by using resample technique on a benchmark well renowned PIMA diabetes dataset that was acquired from UCI machine learning repository, having eight attributes and one class label. The proposed framework is shown in Figure 1. The description of each phase is mentioned. 3.1. Data selection Data selection is a process in which the most relevant data is selected from a specific domain to derive values that are informative and facilitate learning. PIMA diabetes dataset having 8 attributes that are used to predict the diabetes at earlier stage. This dataset is obtained from UCI repository. 3.2. Data pre-processing Data pre-processing is a Machine Learning technique that includes changing crude information into reasonable configuration. It includes Data Cleaning, Data Integration, Data Transformation, and Data Discretization. 3.3. Feature extraction through principle component analysis Feature Extraction on the dataset to determine the most suitable set of attributes that can help achieve better classification. The set of attributes suggested by the PCA are termed as feature vector. Feature reduction or dimensionality reduction will be benefitted us by reducing the computation and space complexity.
  • 4.  ISSN: 2252-8814 Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92 88 3.4. Resampling Filter The supervised Resample filter is applied to the pre-processed dataset. Re-sampling is a series of methods used to reconstruct your sample data sets, including training sets and validation sets. In this study, Boot strapping resampling technique to enhance the accuracy. Figure 1. Proposed system for diabetes prediction system 4. MACHINE LEARNING TECHNIQUES 4.1. Classification 4.1.1. Random forest The outfit learning technique used for the classification and regression that operates by constructing the multitude of decision trees at training time and outputting the class i.e mode of the classes or the regression of the individual trees. Irregular choice woods right for choice trees propensity which is used for over fitting on to their preparation set.
  • 5. Int J Adv Appl Sci ISSN: 2252-8814  Disease prediction in big data healthcare using extended convolutional neural network… (Asadi Srinivasulu) 89 4.1.2. Support vector machine (SVM) SVM is a division of Supervised Learning Algorithm. The strategy used to perform regression, classification and outlier detection of data.SVM will be grouping the information dependent that on the hyper plane. The hyper plane is used to totally isolate the two classes in the best way and the most extreme edge hyper plane ought to be picked as a best separator. The two types SVM Classifiers that are been used are used are: Linear Classifier and Non-Linear Classifier. 4.1.3. Decision tree The algorithm which is mainly used to produce a classification on training data and regression model into a tree structure is called as Decision tree algorithm, it is based on previous data to classify/predict class or target variables of future/new data with the help of decision rules or decision trees. Decision tree can be useful for both numerical and categorical data. The tree in which the root node in each level is a starting point or the best splitting attribute in that position which helps to test on an attribute is called as complete decision tree. The yield of the test will create branches. Leaf hub will go about as a last class mark or target variable to characterize/foresee the new information. Arrangement rules are attracted from root to leaf. 4.1.4. Naïve bayes The algorithm performs classification tasks in the field of ML are called as Naïve Bayes. It can perform classification very well on the dataset even it has huge records with multi class and binary class classification problems. The application of Naive Bayes is mainly to text analysis and Natural Language Processing. It works based on conditional probability. It can be represented (1). 𝑃(𝑀|𝑁) = 𝑃( 𝑀| 𝑁)𝑃(𝑀) 𝑃(𝑁) (1) Here M and N are two events and, P(M|N) is the conditional probability of M given N.P(M) is the probability of M. P(N) is the probability of N. P (N|M) is the conditional probability of N given M. 4.1.5. K-nearest neighbors The supervised classifier which is a best choice for K-NN is called as k-Nearest Neighbor. It is a best choice for the classification of k-NN kind of problems. In order to predict the target label of a test data, KNN which finds distance between nearest training data class labels and new test data point in the presence of K value? KNN uses K variable value between 0 to 10 normally. 4.2. Regression 4.2.1. Simple linear regression The linear Regression algorithm which explains the relationship between independent and dependent variables to predict the values of the dependent variable is called as Simple Linear Regression algorithm. Simple regression uses one independent variable. The simple linear regression model is represented (2). y= (b0 +b1x) (2) Here, x(independent variable) and y (dependant variable) are two factors involved in simple linear regression analysis. Also, b0 is the Y-intercept and b1 is the Slope. 4.2.2. Multiple linear regressions It explains the relationship between two or more independent variables and a dependent variable to predict the values of the dependent variable. It uses two or more independent variables. Dependent variable has a continuous and independent variable has discrete or continuous values. The multiple linear regression model is represented as (3) y= (p0 +p1x1+p2x2+…+pnxn) (3) Here x1, x2... xn (independent variable) and y (dependant variable) are two factors involved in multiple linear regression analysis. Also b0 is the y-intercept and p1, p2… pn is the slope.
  • 6.  ISSN: 2252-8814 Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92 90 4.2.3. Logistic regression The predictive analysis which is used for the dependent variable is categorical called as Logistical Regression. Logistical Regression explains the relationship between one dependent variable and one or more independent variables. The various types of Logistic Regression are:  Multinomial Logistic Regression (many)  Binary Logistic Regression (two)  Ordinal Logistic Regression (1) The categorical response has only two possible outcomes. Multinomial Logistic Regression has three or more outcomes without ordering whereas Ordinal Logistic Regression has three or more outcomes with ordering. 4.2.4. Polynomial regression The form of regression analysis which explains the relationship between the independent variable and dependent variable as an nth degree polynomial is called as polynomial regression. It fits a non-linear relationship between the value of independent variable and conditional mean of dependent variable. It is represented as (4). x = a + b * y ^ n (4) Here p is Dependent Variable, q is Independent Variable and n is Degree. It is used to fit the data very well when the data is below and above the regression model. It minimizes the cost function and provides optimum result on the regression. 4.2.5. Linear discriminant analysis The process of using various data items and applying different functions to that set to analyze classes of objects or items separately is called Linear Discriminant Analysis. Image Recognition and Predictive analytics use this Linear Discriminant Analysis 4.3. Clustering 4.3.1. K-means clustering The unsupervised machine learning algorithm which is used to solve clustering problems by classifying the dataset into a number of clusters k (group of similar objects), which defines the number of clusters which is assumed before classifying the dataset. 4.3.2. Hierarchical clustering The type of clustering algorithm which is used to build a hierarchy of clusters is called hierarchical clustering. The two types of Hierarchical Clustering are: 4.3.3. Agglomerative clustering It is used to group objects into clusters based on their similarity. The result obtained at last is a tree representation of objects called Dendrogram. 4.3.4. Divisive analysis This is a best down methodology where all perceptions begin in one bunch, and parts are performed recursively as one moves down the pecking order. A hierarchical clustering is often represented as a dendrogram. Each cluster will be representing with centroids. Distance will be calculated by using linkage. 5. RESULTS AND ANALYSIS Indian diabetes dataset named PIMA were used for analysis for this study. It consists of eight independent attributes and one independent class attribute. The study was implemented by R programming language using R Studio. Machine learning algorithms like classification (Decision Tree, Naïve Bayes, k-NN and Random Forest), regression (linear, multiple, logistic, LDA) and clustering (k-means, hierarchical agglomerative) are used to predict the diabetics disease in early stages as shown in Table 1. Measure Performance model by using accuracy as shown in Figure 2.
  • 7. Int J Adv Appl Sci ISSN: 2252-8814  Disease prediction in big data healthcare using extended convolutional neural network… (Asadi Srinivasulu) 91 Table1. Predictive analysis of machine learning algorithms S. No Algorithm Accuracy 1 Random forest 83% 2. Decision tree 77% 3. SVM 92% 4. Naïve Bayes 86% 5. K-NN 91% 6. Simple linear regression 98% 7. Logistic regression 88% 8. LDA 88% 9. k-Means 81% 10. Hierarchical agglomerative 74% Figure 2. Comparison of accuracy of various algorithms 6. CONCLUSION AND FUTURE WORK Deep Learning and Data mining plays an important role in various fields such as Artificial Intelligence (AI) and Machine Learning (ML), Database Systems and more. The core objective is to enhance the accuracy of predictive model. This PIMA dataset will increase the accuracy of almost all algorithms but the SVM and linear regression leads over others. In future many advanced deep learning techniques will be used to increasing the accuracy of the algorithms. REFERENCES [1] Usma Niyaz et al, “Advances in Deep Learning Techniques for Medical Image Analysis” 5th IEEE International Conference on Parallel, Distributed and Grid Computing (PDGC-2018), 978-1-5386-6026-3/18/$31©2018 IEEE, 20-22 Dec, 2018, Solan, India. [2] Y. LeCun, L. Bottou, Y. Bengio and P. Haffner, “Gradient-Based Learning Applied to Document Recognition,” Proceedings of IEEE, vol. 86, pp. 2278-2324. [3] Leslie N. Smith, “Cyclical Learning Rates for Training Neural Networks,” 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 464–472, 2017. [4] J. Wang, L. Perez, “The Effectiveness of Data Augmentation in Image Classification using Deep Learning,” Computing Research Repository (CoRR) - arXiv, 2017. [5] N. Srivastava, G. Hinton, A. Krizhevsky and I. Sutskever and R. Salakhutdinov, “Dropout: A Simple Way to Prevent Neural Networks from Overfitting,” Journal of Machine Learning Research (JMLR), vol.15, pp. 1929-1958, 2014. [6] M. S. Al-tarawneh, “Lung Cancer Detection Using Image Processing Techniques,” Leonardo Electronic Journal of Practices and Technologies (LEJPT), no. 20, pp. 147–158, 2012. [7] Abhishek S. Sambyal, Asha T., “Knowledge Abstraction from Textural Features of Brain MRI Images for Diagnosing Brain Tumor using Statistical Techniques and Associative Classification,” 2016 International Conference on Systems in Medicine and Biology, IIT Kharagpur, 2016. [8] A. Krizhevsky, I. Sutskever and G. E. Hinton, “ImageNet Classification with Deep Convolutional Neural Networks,” NIPS’12 Proceedings of the 25th International Conference on Neural Information Processing Systems, vol. 1, pp. 1097–1105.
  • 8.  ISSN: 2252-8814 Int J Adv Appl Sci, Vol. 9, No. 2, June 2020: 85 – 92 92 [9] K. Simonyan and A. Zisserman, “Very Deep Convolutional Network for Large-Scale Image Recognition,” International Conference on Learning Representations (ICLR), 2015. [10] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, “Going deeper with convolutions,”2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1–9, 2015. [11] K. He, X. Zhang, S. Ren, J. Sun, “Deep Residual Learning for Image Recognition,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.770-778, 2016. [12] O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional Networks for Biomedical Image Segmentation,” MICCAI 2015, pp. 234?241, 2015. [13] F. Milletari, N. Navab, and S. A. Ahmadi, “V-Net: Fully Convolutional Networks for Volumetric Medical Image Segmentation,” 2016 Fourth International Conference on 3D Vision, pp.565-571, 2016. [14] E. Gibson, W. Li, C. Sudre, L. Fidon, D. I. Shakir, G. Wang Z. Eaton- Rosen, R. Gray, T. Doel, Y. Hu, T. Whyntie, P. Nachev, M. Modat D. C. Barratt, S. Ourselin, M. Jorge Cardoso and T. Vercauteren, “NiftyNet: a deep-learning platform for medical imaging,” Computer Methods and Programs in Biomedicine, vol. 158, pp. 113 - 122, 2018. [15] S. Jgou, M. Drozdzal, D. Vzquez, A. Romero, and Y. Bengio, “The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation,” arXiv:1611.09326v2 [cs.CV],2016. [16] W. Chen, Y. Zhang, J. He, Y. Qiao, Y. Chen, H. Shi, X. Tang, “W-net: Bridged U-net for 2D Medical Image Segmentation,” arXiv:1807.04459v1 [cs.CV] 12 Jul 2018. [17] G. Yang, H. Jing, “Multiple Convolutional Neural Network for Feature Extraction,” International Conference on Intelligent Computing (ICIC), 2015. [18] J. Ding, A. Li, Z. Hu, and L. Wang, “Accurate Pulmonary Nodule Detection in Computed Tomography Images Using Deep Convolutional Neural Networks,” Computing Research Repository (CoRR) - arXiv, 2017. [19] Q. Song, L. Zhao, X. Luo, and X. Dou, “Using Deep Learning for Classification of Lung Nodules on Computed Tomography Images,” Journal of Healthcare Engineering, vol. 2017, 2017. [20] H. Chougrad, H. Zouaki, O. Alheyane “Convolutional Neural Networks for Breast Cancer Screening: Transfer Learning with Exponential Decay,”- arXiv, 2017. [21] A. Esteva, B. Kuprel, R. A. Novoa, J. Ko, S. M. Swetter, H. M. Blau& S. Thrun, “Dermatologist-level classification of skin cancer with deep neural networks,” Nature, vol. 542, pp. 115–118, 2017. [22] E. Sert, S. Ertekin, U. Halici, “Ensemble of Convolutional Neural Networks for Classification of Breast Microcalcification from Mammograms,” 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 689–692, 2017. [23] N. C. F. Codella, Q. B. Nguyen, S. Pankanti, D. Gutman, B. Helba, A. Halpern, J. R. Smith, “Deep learning ensembles for melanoma recognition in dermoscopy images,” Computing Research Repository (CoRR), vol. abs/1610.04662, 2016. [24] K. J. Geras, S. Wolfson, Y. Shen, S. Gene Kim, L. Moy, K. Cho, “High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks,” Computing Research Repository (CoRR) - arXiv, 2017.