Transfer learning (TL) is a technique of reuse and modify a pre-trained network.
It reuses feature extraction layer at a pre-trained network. A target domain in TL
obtains the features knowledge from the source domain. TL modified
classification layer at a pre-trained network. The target domain can do new tasks
according to a purpose. In this article, the target domain is fundus image
classification includes normal and neovascularization. Data consist of
100 patches. The comparison of training and validation data was 70:30.
The selection of training and validation data is done randomly. Steps of TL i.e
load pre-trained networks, replace final layers, train the network, and assess
network accuracy. First, the pre-trained network is a layer configuration of
the convolutional neural network architecture. Pre-trained network used are
AlexNet, VGG16, VGG19, ResNet50, ResNet101, GoogLeNet, Inception-V3,
InceptionResNetV2, and squeezenet. Second, replace the final layer is to replace
the last three layers. They are fully connected layer, softmax, and output layer.
The layer is replaced with a fully connected layer that classifies according to
number of classes. Furthermore, it's followed by a softmax and output layer that
matches with the target domain. Third, we trained the network. Networks were
trained to produce optimal accuracy. In this section, we use gradient descent
algorithm optimization. Fourth, assess network accuracy. The experiment results
show a testing accuracy between 80% and 100%.
Classification and Segmentation of Glaucomatous Image Using Probabilistic Neu...ijsrd.com
The gradual visual field loss and there is a characteristic type of damage to the retinal nerve fiber layer associated with the progression of the disease glaucoma. Texture features within images are actively pursued for accurate and efficient glaucoma classification. Energy distribution over wavelet subband is applied to find these important texture features. In this paper, we investigate the discriminatory potential of wavelet features obtained from the Daubechies (db3), symlets (sym3), and biorthogonal (bio3.3, bio3.5, and bio3.7) wavelet filters. We propose a novel technique to extract energy signatures obtained using 2-D discrete wavelet transform, and subject these signatures to different feature ranking and feature selection strategies. Here my project aims at the use of Probabilistic Neural Network (PNN), Fuzzy C-means (FCM) and K-means helps for the detection of glaucoma disease. For this, fuzzy c-means clustering algorithm and k-means algorithm is used. Fuzzy c-means results faster and reliably good clustering when compare to k-means.
Evaluation of deep neural network architectures in the identification of bone...TELKOMNIKA JOURNAL
Automated medical image processing, particularly of radiological images, can reduce the number of diagnostic errors, increase patient care and reduce medical costs. This paper seeks to evaluate the performance of three recent convolutional neural networks in the autonomous identification of fissures over two-dimensional radiological images. These architectures have been proposed as deep neural network types specially designed for image classification, which allows their integration with traditional image processing strategies for automatic analysis of medical images. In particular, we use three convolutional networks: ResNet (residual neural network), DenseNet (dense convolutional network), and NASNet (neural architecture search network) to learn information from a set of 200 images labeled half as fissured bones and half as seamless bones. All three networks are trained and adjusted under the same conditions, and their performance was evaluated with the same metrics. The final results consider not only the model's ability to predict the characteristics of an unknown image but also its internal complexity. The three neural models were optimized to reduce classification errors without producing network over-adjustment. In all three cases, generalization of behavior was observed, and the ability of the models to identify the images with fissures, however the expected performance was only achieved with the NASNet model.
TRANSFER LEARNING BASED IMAGE VISUALIZATION USING CNNijaia
Image classification is a popular machine learning based applications of deep learning. Deep learning techniques are very popular because they can be effectively used in performing operations on image data in large-scale. In this paper CNN model was designed to better classify images. We make use of feature extraction part of inception v3 model for feature vector calculation and retrained the classification layer with these feature vector. By using the transfer learning mechanism the classification layer of the CNN model was trained with 20 classes of Caltech101 image dataset and 17 classes of Oxford 17 flower image dataset. After training, network was evaluated with testing dataset images from Oxford 17 flower dataset and Caltech101 image dataset. The mean testing precision of the neural network architecture with Caltech101 dataset was 98 % and with Oxford 17 Flower image dataset was 92.27 %.
INVESTIGATIONS OF THE INFLUENCES OF A CNN’S RECEPTIVE FIELD ON SEGMENTATION O...adeij1
Segmentation of objects with various sizes is relatively less explored in medical imaging, and has been very challenging in computer vision tasks in general. We hypothesize that the receptive field of a deep model corresponds closely to the size of object to be segmented, which could critically influence the segmentation accuracy of objects with varied sizes. In this study, we employed “AmygNet”, a dual-branch fully convolutional neural network (FCNN) with two different sizes of receptive fields, to investigate the effects of receptive field on segmenting four major subnuclei of bilateral amygdalae. The experiment was conducted on 14 subjects, which are all 3-dimensional MRI human brain images. Since the scale of different subnuclear groups are different, by investigating the accuracy of each subnuclear group while using receptive fields of various sizes, we may find which kind of receptive field is suitable for object of which scale respectively. In the given condition, AmygNet with multiple receptive fields presents great potential in segmenting objects of different sizes.
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Imageproc...sunda2011
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd
IEEE projects, final year projects, students project, be project, engineering projects, academic project, project center in madurai, trichy, chennai, kollam, coimbatore
Fully Automatic Method for 3D T1-Weighted Brain Magnetic Resonance Images Seg...CSCJournals
In the domain of medical imaging, accurate segmentation of brain MR images is of interest for many brain disorders. However, due to several factors such noise, imaging artefacts, intrinsic tissue variation and partial volume effects, tissue segmentation remains a challenging task. So, in this paper, a full automatic method for segmentation of brain MR images is presented. The method consists of four steps segmentation procedure. First, noise removing by median filtering is done; second segmentation of brain/non-brain tissue is performed by using a Threshold Morphologic Brain Extraction method (TMBE). Then initial centroids estimation by gray level histogram analysis based is executed. Finally, Fuzzy C-means Algorithm is used for MRI tissue segmentation. The efficiency of the proposed method is demonstrated by extensive segmentation experiments using simulated and real MR images.
UNet-VGG16 with transfer learning for MRI-based brain tumor segmentationTELKOMNIKA JOURNAL
A brain tumor is one of a deadly disease that needs high accuracy in its medical surgery. Brain tumor detection can be done through magnetic resonance imaging (MRI). Image segmentation for the MRI brain tumor aims to separate the tumor area (as the region of interest or ROI) with a healthy brain and provide a clear boundary of the tumor. This study classifies the ROI and non-ROI using fully convolutional network with new architecture, namely UNet-VGG16. This model or architecture is a hybrid of U-Net and VGG16 with transfer Learning to simplify the U-Net architecture. This method has a high accuracy of about 96.1% in the learning dataset. The validation is done by calculating the correct classification ratio (CCR) to comparing the segmentation result with the ground truth. The CCR value shows that this UNet-VGG16 could recognize the brain tumor area with a mean of CCR value is about 95.69%.
Classification and Segmentation of Glaucomatous Image Using Probabilistic Neu...ijsrd.com
The gradual visual field loss and there is a characteristic type of damage to the retinal nerve fiber layer associated with the progression of the disease glaucoma. Texture features within images are actively pursued for accurate and efficient glaucoma classification. Energy distribution over wavelet subband is applied to find these important texture features. In this paper, we investigate the discriminatory potential of wavelet features obtained from the Daubechies (db3), symlets (sym3), and biorthogonal (bio3.3, bio3.5, and bio3.7) wavelet filters. We propose a novel technique to extract energy signatures obtained using 2-D discrete wavelet transform, and subject these signatures to different feature ranking and feature selection strategies. Here my project aims at the use of Probabilistic Neural Network (PNN), Fuzzy C-means (FCM) and K-means helps for the detection of glaucoma disease. For this, fuzzy c-means clustering algorithm and k-means algorithm is used. Fuzzy c-means results faster and reliably good clustering when compare to k-means.
Evaluation of deep neural network architectures in the identification of bone...TELKOMNIKA JOURNAL
Automated medical image processing, particularly of radiological images, can reduce the number of diagnostic errors, increase patient care and reduce medical costs. This paper seeks to evaluate the performance of three recent convolutional neural networks in the autonomous identification of fissures over two-dimensional radiological images. These architectures have been proposed as deep neural network types specially designed for image classification, which allows their integration with traditional image processing strategies for automatic analysis of medical images. In particular, we use three convolutional networks: ResNet (residual neural network), DenseNet (dense convolutional network), and NASNet (neural architecture search network) to learn information from a set of 200 images labeled half as fissured bones and half as seamless bones. All three networks are trained and adjusted under the same conditions, and their performance was evaluated with the same metrics. The final results consider not only the model's ability to predict the characteristics of an unknown image but also its internal complexity. The three neural models were optimized to reduce classification errors without producing network over-adjustment. In all three cases, generalization of behavior was observed, and the ability of the models to identify the images with fissures, however the expected performance was only achieved with the NASNet model.
TRANSFER LEARNING BASED IMAGE VISUALIZATION USING CNNijaia
Image classification is a popular machine learning based applications of deep learning. Deep learning techniques are very popular because they can be effectively used in performing operations on image data in large-scale. In this paper CNN model was designed to better classify images. We make use of feature extraction part of inception v3 model for feature vector calculation and retrained the classification layer with these feature vector. By using the transfer learning mechanism the classification layer of the CNN model was trained with 20 classes of Caltech101 image dataset and 17 classes of Oxford 17 flower image dataset. After training, network was evaluated with testing dataset images from Oxford 17 flower dataset and Caltech101 image dataset. The mean testing precision of the neural network architecture with Caltech101 dataset was 98 % and with Oxford 17 Flower image dataset was 92.27 %.
INVESTIGATIONS OF THE INFLUENCES OF A CNN’S RECEPTIVE FIELD ON SEGMENTATION O...adeij1
Segmentation of objects with various sizes is relatively less explored in medical imaging, and has been very challenging in computer vision tasks in general. We hypothesize that the receptive field of a deep model corresponds closely to the size of object to be segmented, which could critically influence the segmentation accuracy of objects with varied sizes. In this study, we employed “AmygNet”, a dual-branch fully convolutional neural network (FCNN) with two different sizes of receptive fields, to investigate the effects of receptive field on segmenting four major subnuclei of bilateral amygdalae. The experiment was conducted on 14 subjects, which are all 3-dimensional MRI human brain images. Since the scale of different subnuclear groups are different, by investigating the accuracy of each subnuclear group while using receptive fields of various sizes, we may find which kind of receptive field is suitable for object of which scale respectively. In the given condition, AmygNet with multiple receptive fields presents great potential in segmenting objects of different sizes.
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd::Imageproc...sunda2011
IEEE Final Year Projects 2011-2012 :: Elysium Technologies Pvt Ltd
IEEE projects, final year projects, students project, be project, engineering projects, academic project, project center in madurai, trichy, chennai, kollam, coimbatore
Fully Automatic Method for 3D T1-Weighted Brain Magnetic Resonance Images Seg...CSCJournals
In the domain of medical imaging, accurate segmentation of brain MR images is of interest for many brain disorders. However, due to several factors such noise, imaging artefacts, intrinsic tissue variation and partial volume effects, tissue segmentation remains a challenging task. So, in this paper, a full automatic method for segmentation of brain MR images is presented. The method consists of four steps segmentation procedure. First, noise removing by median filtering is done; second segmentation of brain/non-brain tissue is performed by using a Threshold Morphologic Brain Extraction method (TMBE). Then initial centroids estimation by gray level histogram analysis based is executed. Finally, Fuzzy C-means Algorithm is used for MRI tissue segmentation. The efficiency of the proposed method is demonstrated by extensive segmentation experiments using simulated and real MR images.
UNet-VGG16 with transfer learning for MRI-based brain tumor segmentationTELKOMNIKA JOURNAL
A brain tumor is one of a deadly disease that needs high accuracy in its medical surgery. Brain tumor detection can be done through magnetic resonance imaging (MRI). Image segmentation for the MRI brain tumor aims to separate the tumor area (as the region of interest or ROI) with a healthy brain and provide a clear boundary of the tumor. This study classifies the ROI and non-ROI using fully convolutional network with new architecture, namely UNet-VGG16. This model or architecture is a hybrid of U-Net and VGG16 with transfer Learning to simplify the U-Net architecture. This method has a high accuracy of about 96.1% in the learning dataset. The validation is done by calculating the correct classification ratio (CCR) to comparing the segmentation result with the ground truth. The CCR value shows that this UNet-VGG16 could recognize the brain tumor area with a mean of CCR value is about 95.69%.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Comparison of Neural Network Training Functions for Hematoma Classification i...IOSR Journals
Classification is one of the most important task in application areas of artificial neural networks
(ANN).Training neural networks is a complex task in the supervised learning field of research. The main
difficulty in adopting ANN is to find the most appropriate combination of learning, transfer and training
function for the classification task. We compared the performances of three types of training algorithms in feed
forward neural network for brain hematoma classification. In this work we have selected Gradient Descent
based backpropagation, Gradient Descent with momentum, Resilence backpropogation algorithms. Under
conjugate based algorithms, Scaled Conjugate back propagation, Conjugate Gradient backpropagation with
Polak-Riebreupdates(CGP) and Conjugate Gradient backpropagation with Fletcher-Reeves updates (CGF).The
last category is Quasi Newton based algorithm, under this BFGS, Levenberg-Marquardt algorithms are
selected. Proposed work compared training algorithm on the basis of mean square error, accuracy, rate of
convergence and correctness of the classification. Our conclusion about the training functions is based on the
simulation results
A Parallel Framework For Multilayer Perceptron For Human Face RecognitionCSCJournals
Artificial neural networks have already shown their success in face recognition and similar complex pattern recognition tasks. However, a major disadvantage of the technique is that it is extremely slow during training for larger classes and hence not suitable for real-time complex problems such as pattern recognition. This is an attempt to develop a parallel framework for the training algorithm of a perceptron. In this paper, two general architectures for a Multilayer Perceptron (MLP) have been demonstrated. The first architecture is All-Class-in-One-Network (ACON) where all the classes are placed in a single network and the second one is One-Class-in-One-Network (OCON) where an individual single network is responsible for each and every class. Capabilities of these two architectures were compared and verified in solving human face recognition, which is a complex pattern recognition task where several factors affect the recognition performance like pose variations, facial expression changes, occlusions, and most importantly illumination changes. Experimental results show that the proposed OCON structure performs better than the conventional ACON in terms of network training convergence speed and which can be easily exercised in a parallel environment.
Hyper-parameter optimization of convolutional neural network based on particl...journalBEEI
Deep neural networks have accomplished enormous progress in tackling many problems. More specifically, convolutional neural network (CNN) is a category of deep networks that have been a dominant technique in computer vision tasks. Despite that these deep neural networks are highly effective; the ideal structure is still an issue that needs a lot of investigation. Deep Convolutional Neural Network model is usually designed manually by trials and repeated tests which enormously constrain its application. Many hyper-parameters of the CNN can affect the model performance. These parameters are depth of the network, numbers of convolutional layers, and numbers of kernels with their sizes. Therefore, it may be a huge challenge to design an appropriate CNN model that uses optimized hyper-parameters and reduces the reliance on manual involvement and domain expertise. In this paper, a design architecture method for CNNs is proposed by utilization of particle swarm optimization (PSO) algorithm to learn the optimal CNN hyper-parameters values. In the experiment, we used Modified National Institute of Standards and Technology (MNIST) database of handwritten digit recognition. The experiments showed that our proposed approach can find an architecture that is competitive to the state-of-the-art models with a testing error of 0.87%.
fMRI Segmentation Using Echo State Neural NetworkCSCJournals
This research work proposes a new intelligent segmentation technique for functional Magnetic Resonance Imaging (fMRI). It has been implemented using an Echostate Neural Network (ESN). Segmentation is an important process that helps in identifying objects of the image. Existing segmentation methods are not able to exactly segment the complicated profile of the fMRI accurately. Segmentation of every pixel in the fMRI correctly helps in proper location of tumor. The presence of noise and artifacts poses a challenging problem in proper segmentation. The proposed ESN is an estimation method with energy minimization. The estimation property helps in better segmentation of the complicated profile of the fMRI. The performance of the new segmentation method is found to be better with higher peak signal to noise ratio (PSNR) of 61 when compared to the PSNR of the existing back-propagation algorithm (BPA) segmentation method which is 57.
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...csandit
In this work, the TREPAN algorithm is enhanced and extended for extracting decision trees
from neural networks. We empirically evaluated the performance of the algorithm on a set of
databases from real world events. This benchmark enhancement was achieved by adapting
Single-test TREPAN and C4.5 decision tree induction algorithms to analyze the datasets. The
models are then compared with X-TREPAN for comprehensibility and classification accuracy.
Furthermore, we validate the experimentations by applying statistical methods. Finally, the
modified algorithm is extended to work with multi-class regression problems and the ability to
comprehend generalized feed forward networks is achieved.
Enhancing the stability of the deep neural network using a non-constant lear...IJECEIAES
The data stream is considered the backbone of many real-world applications. These applications are most effective when using modern techniques of machine learning like deep neural networks (DNNs). DNNs are very sensitive to set parameters, the most prominent one is the learning rate. Choosing an appropriate learning rate value is critical because it is able to control the overall network performance. This paper presents a new developing DNN model using a multi-layer perceptron (MLP) structure that includes network training based on the optimal learning rate. Thereupon, this model consists of three hidden layers and does not adopt the stability of the learning rate but has a non-constant value (varying over time) to obtain the optimal learning rate which is able to reduce the error in each iteration and increase the model accuracy. This is done by deriving a new parameter that is added to and subtracted from the learning rate. The proposed model is evaluated by three streaming datasets: electricity, network security layer-knowledge discovery in database (NSL-KDD), and human gait database (HuGaDB) datasets. The results proved that the proposed model achieves better results than the constant model and outperforms previous models in terms of accuracy, where it achieved 88.16%, 98.67%, and 97.63% respectively.
Reconfiguration layers of convolutional neural network for fundus patches cla...journalBEEI
Convolutional neural network (CNN) is a method of supervised deep learning. The architectures including AlexNet, VGG16, VGG19, ResNet 50, ResNet101, GoogleNet, Inception-V3, Inception ResNet-V2, and Squeezenet that have 25 to 825 layers. This study aims to simplify layers of CNN architectures and increased accuracy for fundus patches classification. Fundus patches classify two categories: normal and neovascularization. Data used for classification is MESSIDOR and Retina Image Bank that have 2,080 patches. Results show the best accuracy of 93.17% for original data and 99,33% for augmentation data using CNN 31 layers. It consists input layer, 7 convolutional layers, 7 batch normalization, 7 rectified linear unit, 6 max-pooling, fully connected layer, softmax, and output layer.
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...IAEME Publication
This paper presents an approach based on applying an aggregated predictor formed by multiple versions of a multilayer neural network with a back-propagation optimization algorithm for helping the engineer to get a list of the most appropriate well-test interpretation models for a given set of pressure/ production data. The proposed method consists of three stages: (1) data decorrelation through principal component analysis to reduce the covariance between the variables and the dimension of the input layer in the artificial neural network, (2) bootstrap replicates of the learning set where the data is repeatedly sampled with a random split of the data into train sets and using these as new learning sets, and (3) automatic reservoir model identification through aggregated predictor formed by a plurality vote when predicting a new class. This method is described in detail to ensure successful replication of results. The required training and test dataset were generated by using analytical solution models. In our case, there were used 600 samples: 300 for training, 100 for cross-validation, and 200 for testing. Different network structures were tested during this study to arrive at optimum network design. We notice that the single net methodology always brings about confusion in selecting the correct model even though the training results for the constructed networks are close to 1. We notice also that the principal component analysis is an effective strategy in reducing the number of input features, simplifying the network structure, and lowering the training time of the ANN. The results obtained show that the proposed model provides better performance when predicting new data with a coefficient of correlation approximately equal to 95% Compared to a previous approach 80%, the combination of the PCA and ANN is more stable and determine the more accurate results with lesser computational complexity than was feasible previously. Clearly, the aggregated predictor is more stable and shows less bad classes compared to the previous approach.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Comparison of Neural Network Training Functions for Hematoma Classification i...IOSR Journals
Classification is one of the most important task in application areas of artificial neural networks
(ANN).Training neural networks is a complex task in the supervised learning field of research. The main
difficulty in adopting ANN is to find the most appropriate combination of learning, transfer and training
function for the classification task. We compared the performances of three types of training algorithms in feed
forward neural network for brain hematoma classification. In this work we have selected Gradient Descent
based backpropagation, Gradient Descent with momentum, Resilence backpropogation algorithms. Under
conjugate based algorithms, Scaled Conjugate back propagation, Conjugate Gradient backpropagation with
Polak-Riebreupdates(CGP) and Conjugate Gradient backpropagation with Fletcher-Reeves updates (CGF).The
last category is Quasi Newton based algorithm, under this BFGS, Levenberg-Marquardt algorithms are
selected. Proposed work compared training algorithm on the basis of mean square error, accuracy, rate of
convergence and correctness of the classification. Our conclusion about the training functions is based on the
simulation results
A Parallel Framework For Multilayer Perceptron For Human Face RecognitionCSCJournals
Artificial neural networks have already shown their success in face recognition and similar complex pattern recognition tasks. However, a major disadvantage of the technique is that it is extremely slow during training for larger classes and hence not suitable for real-time complex problems such as pattern recognition. This is an attempt to develop a parallel framework for the training algorithm of a perceptron. In this paper, two general architectures for a Multilayer Perceptron (MLP) have been demonstrated. The first architecture is All-Class-in-One-Network (ACON) where all the classes are placed in a single network and the second one is One-Class-in-One-Network (OCON) where an individual single network is responsible for each and every class. Capabilities of these two architectures were compared and verified in solving human face recognition, which is a complex pattern recognition task where several factors affect the recognition performance like pose variations, facial expression changes, occlusions, and most importantly illumination changes. Experimental results show that the proposed OCON structure performs better than the conventional ACON in terms of network training convergence speed and which can be easily exercised in a parallel environment.
Hyper-parameter optimization of convolutional neural network based on particl...journalBEEI
Deep neural networks have accomplished enormous progress in tackling many problems. More specifically, convolutional neural network (CNN) is a category of deep networks that have been a dominant technique in computer vision tasks. Despite that these deep neural networks are highly effective; the ideal structure is still an issue that needs a lot of investigation. Deep Convolutional Neural Network model is usually designed manually by trials and repeated tests which enormously constrain its application. Many hyper-parameters of the CNN can affect the model performance. These parameters are depth of the network, numbers of convolutional layers, and numbers of kernels with their sizes. Therefore, it may be a huge challenge to design an appropriate CNN model that uses optimized hyper-parameters and reduces the reliance on manual involvement and domain expertise. In this paper, a design architecture method for CNNs is proposed by utilization of particle swarm optimization (PSO) algorithm to learn the optimal CNN hyper-parameters values. In the experiment, we used Modified National Institute of Standards and Technology (MNIST) database of handwritten digit recognition. The experiments showed that our proposed approach can find an architecture that is competitive to the state-of-the-art models with a testing error of 0.87%.
fMRI Segmentation Using Echo State Neural NetworkCSCJournals
This research work proposes a new intelligent segmentation technique for functional Magnetic Resonance Imaging (fMRI). It has been implemented using an Echostate Neural Network (ESN). Segmentation is an important process that helps in identifying objects of the image. Existing segmentation methods are not able to exactly segment the complicated profile of the fMRI accurately. Segmentation of every pixel in the fMRI correctly helps in proper location of tumor. The presence of noise and artifacts poses a challenging problem in proper segmentation. The proposed ESN is an estimation method with energy minimization. The estimation property helps in better segmentation of the complicated profile of the fMRI. The performance of the new segmentation method is found to be better with higher peak signal to noise ratio (PSNR) of 61 when compared to the PSNR of the existing back-propagation algorithm (BPA) segmentation method which is 57.
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...csandit
In this work, the TREPAN algorithm is enhanced and extended for extracting decision trees
from neural networks. We empirically evaluated the performance of the algorithm on a set of
databases from real world events. This benchmark enhancement was achieved by adapting
Single-test TREPAN and C4.5 decision tree induction algorithms to analyze the datasets. The
models are then compared with X-TREPAN for comprehensibility and classification accuracy.
Furthermore, we validate the experimentations by applying statistical methods. Finally, the
modified algorithm is extended to work with multi-class regression problems and the ability to
comprehend generalized feed forward networks is achieved.
Enhancing the stability of the deep neural network using a non-constant lear...IJECEIAES
The data stream is considered the backbone of many real-world applications. These applications are most effective when using modern techniques of machine learning like deep neural networks (DNNs). DNNs are very sensitive to set parameters, the most prominent one is the learning rate. Choosing an appropriate learning rate value is critical because it is able to control the overall network performance. This paper presents a new developing DNN model using a multi-layer perceptron (MLP) structure that includes network training based on the optimal learning rate. Thereupon, this model consists of three hidden layers and does not adopt the stability of the learning rate but has a non-constant value (varying over time) to obtain the optimal learning rate which is able to reduce the error in each iteration and increase the model accuracy. This is done by deriving a new parameter that is added to and subtracted from the learning rate. The proposed model is evaluated by three streaming datasets: electricity, network security layer-knowledge discovery in database (NSL-KDD), and human gait database (HuGaDB) datasets. The results proved that the proposed model achieves better results than the constant model and outperforms previous models in terms of accuracy, where it achieved 88.16%, 98.67%, and 97.63% respectively.
Reconfiguration layers of convolutional neural network for fundus patches cla...journalBEEI
Convolutional neural network (CNN) is a method of supervised deep learning. The architectures including AlexNet, VGG16, VGG19, ResNet 50, ResNet101, GoogleNet, Inception-V3, Inception ResNet-V2, and Squeezenet that have 25 to 825 layers. This study aims to simplify layers of CNN architectures and increased accuracy for fundus patches classification. Fundus patches classify two categories: normal and neovascularization. Data used for classification is MESSIDOR and Retina Image Bank that have 2,080 patches. Results show the best accuracy of 93.17% for original data and 99,33% for augmentation data using CNN 31 layers. It consists input layer, 7 convolutional layers, 7 batch normalization, 7 rectified linear unit, 6 max-pooling, fully connected layer, softmax, and output layer.
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...IAEME Publication
This paper presents an approach based on applying an aggregated predictor formed by multiple versions of a multilayer neural network with a back-propagation optimization algorithm for helping the engineer to get a list of the most appropriate well-test interpretation models for a given set of pressure/ production data. The proposed method consists of three stages: (1) data decorrelation through principal component analysis to reduce the covariance between the variables and the dimension of the input layer in the artificial neural network, (2) bootstrap replicates of the learning set where the data is repeatedly sampled with a random split of the data into train sets and using these as new learning sets, and (3) automatic reservoir model identification through aggregated predictor formed by a plurality vote when predicting a new class. This method is described in detail to ensure successful replication of results. The required training and test dataset were generated by using analytical solution models. In our case, there were used 600 samples: 300 for training, 100 for cross-validation, and 200 for testing. Different network structures were tested during this study to arrive at optimum network design. We notice that the single net methodology always brings about confusion in selecting the correct model even though the training results for the constructed networks are close to 1. We notice also that the principal component analysis is an effective strategy in reducing the number of input features, simplifying the network structure, and lowering the training time of the ANN. The results obtained show that the proposed model provides better performance when predicting new data with a coefficient of correlation approximately equal to 95% Compared to a previous approach 80%, the combination of the PCA and ANN is more stable and determine the more accurate results with lesser computational complexity than was feasible previously. Clearly, the aggregated predictor is more stable and shows less bad classes compared to the previous approach.
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEMijcsa
Automatically fruit recognition by using machine vision is considered as challenging task due to similarities between various types of fruits and external environmental changes e-g lighting. In this paper, fruit recognition algorithm based on Deep Convolution Neural Network(DCNN) is proposed. Most of the
previous techniques have some limitations because they were examined and evaluated under limited dataset, furthermore they have not considered external environmental changes. Another major contribution in this paper is that we established fruit images database having 15 different categories comprising of
44406 images which were collected within a period of 6 months by keeping in view the limitations of existing dataset under different real-world conditions. Images were directly used as input to DCNN for training and recognition without extracting features, besides this DCNN learn optimal features from
images through adaptation process. The final decision was totally based on a fusion of all regional classification using probability mechanism. Experimental results exhibit that the proposed approach have efficient capability of automatically recognizing the fruit with a high accuracy of 99% and it can also
effectively meet real world application requirements.
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEMijcsa
Automatically fruit recognition by using machine vision is considered as challenging task due to similarities between various types of fruits and external environmental changes e-g lighting. In this paper, fruit recognition algorithm based on Deep Convolution Neural Network(DCNN) is proposed. Most of the previous techniques have some limitations because they were examined and evaluated under limited dataset, furthermore they have not considered external environmental changes. Another major contribution in this paper is that we established fruit images database having 15 different categories comprising of 44406 images which were collected within a period of 6 months by keeping in view the limitations of existing dataset under different real-world conditions. Images were directly used as input to DCNN for training and recognition without extracting features, besides this DCNN learn optimal features from images through adaptation process. The final decision was totally based on a fusion of all regional classification using probability mechanism. Experimental results exhibit that the proposed approach have efficient capability of automatically recognizing the fruit with a high accuracy of 99% and it can also effectively meet real world application requirements.
CONTRAST OF RESNET AND DENSENET BASED ON THE RECOGNITION OF SIMPLE FRUIT DATA...rinzindorjej
In this paper, a fruit image data set is used to compare the efficiency and accuracy of two widely used
Convolutional Neural Network, namely the ResNet and the DenseNet, for the recognition of 50 different
kinds of fruits. In the experiment, the structure of ResNet-34 and DenseNet_BC-121 (with bottleneck layer)
are used. The mathematic principle, experiment detail and the experiment result will be explained through
comparison.
In this paper, a fruit image data set is used to compare the efficiency and accuracy of two widely used Convolutional Neural Network, namely the ResNet and the DenseNet, for the recognition of 50 different kinds of fruits. In the experiment, the structure of ResNet-34 and DenseNet_BC-121 (with bottleneck layer) are used. The mathematic principle, experiment detail and the experiment result will be explained through comparison.
CONTRAST OF RESNET AND DENSENET BASED ON THE RECOGNITION OF SIMPLE FRUIT DATA...rinzindorjej
In this paper, a fruit image data set is used to compare the efficiency and accuracy of two widely used Convolutional Neural Network, namely the ResNet and the DenseNet, for the recognition of 50 different kinds of fruits. In the experiment, the structure of ResNet-34 and DenseNet_BC-121 (with bottleneck layer)
are used. The mathematic principle, experiment detail and the experiment result will be explained through comparison.
Image segmentation and classification tasks in computer vision have proven to be highly effective using neural networks, specifically Convolutional Neural Networks (CNNs). These tasks have numerous
practical applications, such as in medical imaging, autonomous driving, and surveillance. CNNs are capable
of learning complex features directly from images and achieving outstanding performance across several
datasets. In this work, we have utilized three different datasets to investigate the efficacy of various preprocessing and classification techniques in accurssedately segmenting and classifying different structures
within the MRI and natural images. We have utilized both sample gradient and Canny Edge Detection
methods for pre-processing, and K-means clustering have been applied to segment the images. Image
augmentation improves the size and diversity of datasets for training the models for image classification
Image segmentation and classification tasks in computer vision have proven to be highly effective using neural networks, specifically Convolutional Neural Networks (CNNs). These tasks have numerous
practical applications, such as in medical imaging, autonomous driving, and surveillance. CNNs are capable
of learning complex features directly from images and achieving outstanding performance across several
datasets. In this work, we have utilized three different datasets to investigate the efficacy of various preprocessing and classification techniques in accurssedately segmenting and classifying different structures
within the MRI and natural images. We have utilized both sample gradient and Canny Edge Detection
methods for pre-processing, and K-means clustering have been applied to segment the images. Image
augmentation improves the size and diversity of datasets for training the models for image classification.
This work highlights transfer learning’s effectiveness in image classification using CNNs and VGG 16 that
provides insights into the selection of pre-trained models and hyper parameters for optimal performance.
We have proposed a comprehensive approach for image segmentation and classification, incorporating preprocessing techniques, the K-means algorithm for segmentation, and employing deep learning models such
as CNN and VGG 16 for classification.
We propose an algorithm for training Multi Layer Preceptrons for classification problems, that we named Hidden Layer Learning Vector Quantization (H-LVQ). It consists of applying Learning Vector Quantization to the last hidden layer of a MLP and it gave very successful results on problems containing a large number of correlated inputs. It was applied with excellent results on classification of Rurtherford
backscattering spectra and on a benchmark problem of image recognition. It may also be used for efficient feature extraction.
Accuracy study of image classification for reverse vending machine waste segr...IJECEIAES
This study aims to create a sorting system with high accuracy that can classify various beverage containers based on types and separate them accordingly. This reverse vending machine (RVM) provides an image classification method and allows for recycling three types of beverage containers: drink carton boxes, polyethylene terephthalate (PET) bottles, and aluminium cans. The image classification method used in this project is transfer learning with convolutional neural networks (CNN). AlexNet, GoogLeNet, DenseNet201, InceptionResNetV2, InceptionV3, MobileNetV2, XceptionNet, ShuffleNet, ResNet 18, ResNet 50, and ResNet 101 are the neural networks that used in this project. This project will compare the F1- score and computational time among the eleven networks. The F1-score and computational time of image classification differs for each neural network. In this project, the AlexNet network gave the best F1-score, 97.50% with the shortest computational time, 2229.235 s among the eleven neural networks.
TEST-COST-SENSITIVE CONVOLUTIONAL NEURAL NETWORKS WITH EXPERT BRANCHESsipij
It has been proven that deeper convolutional neural networks (CNN) can result in better accuracy in many
problems, but this accuracy comes with a high computational cost. Also, input instances have not the same
difficulty. As a solution for accuracy vs. computational cost dilemma, we introduce a new test-cost-sensitive
method for convolutional neural networks. This method trains a CNN with a set of auxiliary outputs and
expert branches in some middle layers of the network. The expert branches decide to use a shallower part
of the network or going deeper to the end, based on the difficulty of input instance. The expert branches
learn to determine: is the current network prediction is wrong and if the given instance passed to deeper
layers of the network it will generate right output; If not, then the expert branches stop the computation
process. The experimental results on standard dataset CIFAR-10 show that the proposed method can train
models with lower test-cost and competitive accuracy in comparison with basic models.
Objective Evaluation of a Deep Neural Network Approach for Single-Channel Spe...csandit
Single-channel speech intelligibility enhancement is much more difficult than multi-channel
intelligibility enhancement. It has recently been reported that machine learning training-based
single-channel speech intelligibility enhancement algorithms perform better than traditional
algorithms. In this paper, the performance of a deep neural network method using a multiresolution
cochlea-gram feature set recently proposed to perform single-channel speech
intelligibility enhancement processing is evaluated. Various conditions such as different
speakers for training and testing as well as different noise conditions are tested. Simulations
and objective test results show that the method performs better than another deep neural
networks setup recently proposed for the same task, and leads to a more robust convergence
compared to a recently proposed Gaussian mixture model approach.
TRANSFER LEARNING BASED IMAGE VISUALIZATION USING CNNgerogepatton
Image classification is a popular machine learning based applications of deep learning. Deep learning techniques are very popular because they can be effectively used in performing operations on image data in large-scale. In this paper CNN model was designed to better classify images. We make use of feature extraction part of inception v3 model for feature vector calculation and retrained the classification layer with these feature vector. By using the transfer learning mechanism the classification layer of the CNN
model was trained with 20 classes of Caltech101 image dataset and 17 classes of Oxford 17 flower image dataset. After training, network was evaluated with testing dataset images from Oxford 17 flower dataset and Caltech101 image dataset. The mean testing precision of the neural network architecture with Caltech101 dataset was 98 % and with Oxford 17 Flower image dataset was 92.27 %.
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
In recent times, the trend of online shopping through e-commerce stores and websites has grown to a huge extent. Whenever a product is purchased on an e-commerce platform, people leave their reviews about the product. These reviews are very helpful for the store owners and the product’s manufacturers for the betterment of their work process as well as product quality. An automated system is proposed in this work that operates on two datasets D1 and D2 obtained from Amazon. After certain preprocessing steps, N-gram and word embedding-based features are extracted using term frequency-inverse document frequency (TF-IDF), bag of words (BoW) and global vectors (GloVe), and Word2vec, respectively. Four machine learning (ML) models support vector machines (SVM), logistic regression (RF), logistic regression (LR), multinomial Naïve Bayes (MNB), two deep learning (DL) models convolutional neural network (CNN), long-short term memory (LSTM), and standalone bidirectional encoder representations (BERT) are used to classify reviews as either positive or negative. The results obtained by the standard ML, DL models and BERT are evaluated using certain performance evaluation measures. BERT turns out to be the best-performing model in the case of D1 with an accuracy of 90% on features derived by word embedding models while the CNN provides the best accuracy of 97% upon word embedding features in the case of D2. The proposed model shows better overall performance on D2 as compared to D1.
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
In this study, a microstrip patch antenna that works at 3.6 GHz was built and tested to see how well it works. In this work, Rogers RT/Duroid 5880 has been used as the substrate material, with a dielectric permittivity of 2.2 and a thickness of 0.3451 mm; it serves as the base for the examined antenna. The computer simulation technology (CST) studio suite is utilized to show the recommended antenna design. The goal of this study was to get a more extensive transmission capacity, a lower voltage standing wave ratio (VSWR), and a lower return loss, but the main goal was to get a higher gain, directivity, and efficiency. After simulation, the return loss, gain, directivity, bandwidth, and efficiency of the supplied antenna are found to be -17.626 dB, 9.671 dBi, 9.924 dBi, 0.2 GHz, and 97.45%, respectively. Besides, the recreation uncovered that the transfer speed side-lobe level at phi was much better than those of the earlier works, at -28.8 dB, respectively. Thus, it makes a solid contender for remote innovation and more robust communication.
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
In this paper, snake optimization algorithm (SOA) is used to find the optimal gains of an enhanced controller for controlling congestion problem in computer networks. M-file and Simulink platform is adopted to evaluate the response of the active queue management (AQM) system, a comparison with two classical controllers is done, all tuned gains of controllers are obtained using SOA method and the fitness function chose to monitor the system performance is the integral time absolute error (ITAE). Transient analysis and robust analysis is used to show the proposed controller performance, two robustness tests are applied to the AQM system, one is done by varying the size of queue value in different period and the other test is done by changing the number of transmission control protocol (TCP) sessions with a value of ± 20% from its original value. The simulation results reflect a stable and robust behavior and best performance is appeared clearly to achieve the desired queue size without any noise or any transmission problems.
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
Vehicular ad-hoc networks (VANETs) are wireless-equipped vehicles that form networks along the road. The security of this network has been a major challenge. The identity-based cryptosystem (IBC) previously used to secure the networks suffers from membership authentication security features. This paper focuses on improving the detection of intruders in VANETs with a modified identity-based cryptosystem (MIBC). The MIBC is developed using a non-singular elliptic curve with Lagrange interpolation. The public key of vehicles and roadside units on the network are derived from number plates and location identification numbers, respectively. Pseudo-identities are used to mask the real identity of users to preserve their privacy. The membership authentication mechanism ensures that only valid and authenticated members of the network are allowed to join the network. The performance of the MIBC is evaluated using intrusion detection ratio (IDR) and computation time (CT) and then validated with the existing IBC. The result obtained shows that the MIBC recorded an IDR of 99.3% against 94.3% obtained for the existing identity-based cryptosystem (EIBC) for 140 unregistered vehicles attempting to intrude on the network. The MIBC shows lower CT values of 1.17 ms against 1.70 ms for EIBC. The MIBC can be used to improve the security of VANETs.
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
Understanding the primary factors of internet banking (IB) acceptance is critical for both banks and users; nevertheless, our knowledge of the role of users’ perceived risk and trust in IB adoption is limited. As a result, we develop a conceptual model by incorporating perceived risk and trust into the technology acceptance model (TAM) theory toward the IB. The proper research emphasized that the most essential component in explaining IB adoption behavior is behavioral intention to use IB adoption. TAM is helpful for figuring out how elements that affect IB adoption are connected to one another. According to previous literature on IB and the use of such technology in Iraq, one has to choose a theoretical foundation that may justify the acceptance of IB from the customer’s perspective. The conceptual model was therefore constructed using the TAM as a foundation. Furthermore, perceived risk and trust were added to the TAM dimensions as external factors. The key objective of this work was to extend the TAM to construct a conceptual model for IB adoption and to get sufficient theoretical support from the existing literature for the essential elements and their relationships in order to unearth new insights about factors responsible for IB adoption.
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
The smart antennas are broadly used in wireless communication. The least mean square (LMS) algorithm is a procedure that is concerned in controlling the smart antenna pattern to accommodate specified requirements such as steering the beam toward the desired signal, in addition to placing the deep nulls in the direction of unwanted signals. The conventional LMS (C-LMS) has some drawbacks like slow convergence speed besides high steady state fluctuation error. To overcome these shortcomings, the present paper adopts an adaptive fuzzy control step size least mean square (FC-LMS) algorithm to adjust its step size. Computer simulation outcomes illustrate that the given model has fast convergence rate as well as low mean square error steady state.
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
This paper presents the design and implementation of a forest fire monitoring and warning system based on long range (LoRa) technology, a novel ultra-low power consumption and long-range wireless communication technology for remote sensing applications. The proposed system includes a wireless sensor network that records environmental parameters such as temperature, humidity, wind speed, and carbon dioxide (CO2) concentration in the air, as well as taking infrared photos.The data collected at each sensor node will be transmitted to the gateway via LoRa wireless transmission. Data will be collected, processed, and uploaded to a cloud database at the gateway. An Android smartphone application that allows anyone to easily view the recorded data has been developed. When a fire is detected, the system will sound a siren and send a warning message to the responsible personnel, instructing them to take appropriate action. Experiments in Tram Chim Park, Vietnam, have been conducted to verify and evaluate the operation of the system.
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
Cognitive radio is a smart radio that can change its transmitter parameter based on interaction with the environment in which it operates. The demand for frequency spectrum is growing due to a big data issue as many Internet of Things (IoT) devices are in the network. Based on previous research, most frequency spectrum was used, but some spectrums were not used, called spectrum hole. Energy detection is one of the spectrum sensing methods that has been frequently used since it is easy to use and does not require license users to have any prior signal understanding. But this technique is incapable of detecting at low signal-to-noise ratio (SNR) levels. Therefore, the wavelet-based sensing is proposed to overcome this issue and detect spectrum holes. The main objective of this work is to evaluate the performance of wavelet-based sensing and compare it with the energy detection technique. The findings show that the percentage of detection in wavelet-based sensing is 83% higher than energy detection performance. This result indicates that the wavelet-based sensing has higher precision in detection and the interference towards primary user can be decreased.
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
In this paper, we present the design of a new wide dual-band bandstop filter (DBBSF) using nonuniform transmission lines. The method used to design this filter is to replace conventional uniform transmission lines with nonuniform lines governed by a truncated Fourier series. Based on how impedances are profiled in the proposed DBBSF structure, the fractional bandwidths of the two 10 dB-down rejection bands are widened to 39.72% and 52.63%, respectively, and the physical size has been reduced compared to that of the filter with the uniform transmission lines. The results of the electromagnetic (EM) simulation support the obtained analytical response and show an improved frequency behavior.
Deep learning approach to DDoS attack with imbalanced data at the application...TELKOMNIKA JOURNAL
A distributed denial of service (DDoS) attack is where one or more computers attack or target a server computer, by flooding internet traffic to the server. As a result, the server cannot be accessed by legitimate users. A result of this attack causes enormous losses for a company because it can reduce the level of user trust, and reduce the company’s reputation to lose customers due to downtime. One of the services at the application layer that can be accessed by users is a web-based lightweight directory access protocol (LDAP) service that can provide safe and easy services to access directory applications. We used a deep learning approach to detect DDoS attacks on the CICDDoS 2019 dataset on a complex computer network at the application layer to get fast and accurate results for dealing with unbalanced data. Based on the results obtained, it is observed that DDoS attack detection using a deep learning approach on imbalanced data performs better when implemented using synthetic minority oversampling technique (SMOTE) method for binary classes. On the other hand, the proposed deep learning approach performs better for detecting DDoS attacks in multiclass when implemented using the adaptive synthetic (ADASYN) method.
The appearance of uncertainties and disturbances often effects the characteristics of either linear or nonlinear systems. Plus, the stabilization process may be deteriorated thus incurring a catastrophic effect to the system performance. As such, this manuscript addresses the concept of matching condition for the systems that are suffering from miss-match uncertainties and exogeneous disturbances. The perturbation towards the system at hand is assumed to be known and unbounded. To reach this outcome, uncertainties and their classifications are reviewed thoroughly. The structural matching condition is proposed and tabulated in the proposition 1. Two types of mathematical expressions are presented to distinguish the system with matched uncertainty and the system with miss-matched uncertainty. Lastly, two-dimensional numerical expressions are provided to practice the proposed proposition. The outcome shows that matching condition has the ability to change the system to a design-friendly model for asymptotic stabilization.
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...TELKOMNIKA JOURNAL
Many systems, including digital signal processors, finite impulse response (FIR) filters, application-specific integrated circuits, and microprocessors, use multipliers. The demand for low power multipliers is gradually rising day by day in the current technological trend. In this study, we describe a 4×4 Wallace multiplier based on a carry select adder (CSA) that uses less power and has a better power delay product than existing multipliers. HSPICE tool at 16 nm technology is used to simulate the results. In comparison to the traditional CSA-based multiplier, which has a power consumption of 1.7 µW and power delay product (PDP) of 57.3 fJ, the results demonstrate that the Wallace multiplier design employing CSA with first zero finding logic (FZF) logic has the lowest power consumption of 1.4 µW and PDP of 27.5 fJ.
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemTELKOMNIKA JOURNAL
The flaw in 5G orthogonal frequency division multiplexing (OFDM) becomes apparent in high-speed situations. Because the doppler effect causes frequency shifts, the orthogonality of OFDM subcarriers is broken, lowering both their bit error rate (BER) and throughput output. As part of this research, we use a novel design that combines massive multiple input multiple output (MIMO) and weighted overlap and add (WOLA) to improve the performance of 5G systems. To determine which design is superior, throughput and BER are calculated for both the proposed design and OFDM. The results of the improved system show a massive improvement in performance ver the conventional system and significant improvements with massive MIMO, including the best throughput and BER. When compared to conventional systems, the improved system has a throughput that is around 22% higher and the best performance in terms of BER, but it still has around 25% less error than OFDM.
Reflector antenna design in different frequencies using frequency selective s...TELKOMNIKA JOURNAL
In this study, it is aimed to obtain two different asymmetric radiation patterns obtained from antennas in the shape of the cross-section of a parabolic reflector (fan blade type antennas) and antennas with cosecant-square radiation characteristics at two different frequencies from a single antenna. For this purpose, firstly, a fan blade type antenna design will be made, and then the reflective surface of this antenna will be completed to the shape of the reflective surface of the antenna with the cosecant-square radiation characteristic with the frequency selective surface designed to provide the characteristics suitable for the purpose. The frequency selective surface designed and it provides the perfect transmission as possible at 4 GHz operating frequency, while it will act as a band-quenching filter for electromagnetic waves at 5 GHz operating frequency and will be a reflective surface. Thanks to this frequency selective surface to be used as a reflective surface in the antenna, a fan blade type radiation characteristic at 4 GHz operating frequency will be obtained, while a cosecant-square radiation characteristic at 5 GHz operating frequency will be obtained.
Reagentless iron detection in water based on unclad fiber optical sensorTELKOMNIKA JOURNAL
A simple and low-cost fiber based optical sensor for iron detection is demonstrated in this paper. The sensor head consist of an unclad optical fiber with the unclad length of 1 cm and it has a straight structure. Results obtained shows a linear relationship between the output light intensity and iron concentration, illustrating the functionality of this iron optical sensor. Based on the experimental results, the sensitivity and linearity are achieved at 0.0328/ppm and 0.9824 respectively at the wavelength of 690 nm. With the same wavelength, other performance parameters are also studied. Resolution and limit of detection (LOD) are found to be 0.3049 ppm and 0.0755 ppm correspondingly. This iron sensor is advantageous in that it does not require any reagent for detection, enabling it to be simpler and cost-effective in the implementation of the iron sensing.
Impact of CuS counter electrode calcination temperature on quantum dot sensit...TELKOMNIKA JOURNAL
In place of the commercial Pt electrode used in quantum sensitized solar cells, the low-cost CuS cathode is created using electrophoresis. High resolution scanning electron microscopy and X-ray diffraction were used to analyze the structure and morphology of structural cubic samples with diameters ranging from 40 nm to 200 nm. The conversion efficiency of solar cells is significantly impacted by the calcination temperatures of cathodes at 100 °C, 120 °C, 150 °C, and 180 °C under vacuum. The fluorine doped tin oxide (FTO)/CuS cathode electrode reached a maximum efficiency of 3.89% when it was calcined at 120 °C. Compared to other temperature combinations, CuS nanoparticles crystallize at 120 °C, which lowers resistance while increasing electron lifetime.
In place of the commercial Pt electrode used in quantum sensitized solar cells, the low-cost CuS cathode is created using electrophoresis. High resolution scanning electron microscopy and X-ray diffraction were used to analyze the structure and morphology of structural cubic samples with diameters ranging from 40 nm to 200 nm. The conversion efficiency of solar cells is significantly impacted by the calcination temperatures of cathodes at 100 °C, 120 °C, 150 °C, and 180 °C under vacuum. The fluorine doped tin oxide (FTO)/CuS cathode electrode reached a maximum efficiency of 3.89% when it was calcined at 120 °C. Compared to other temperature combinations, CuS nanoparticles crystallize at 120 °C, which lowers resistance while increasing electron lifetime.
A progressive learning for structural tolerance online sequential extreme lea...TELKOMNIKA JOURNAL
This article discusses the progressive learning for structural tolerance online sequential extreme learning machine (PSTOS-ELM). PSTOS-ELM can save robust accuracy while updating the new data and the new class data on the online training situation. The robustness accuracy arises from using the householder block exact QR decomposition recursive least squares (HBQRD-RLS) of the PSTOS-ELM. This method is suitable for applications that have data streaming and often have new class data. Our experiment compares the PSTOS-ELM accuracy and accuracy robustness while data is updating with the batch-extreme learning machine (ELM) and structural tolerance online sequential extreme learning machine (STOS-ELM) that both must retrain the data in a new class data case. The experimental results show that PSTOS-ELM has accuracy and robustness comparable to ELM and STOS-ELM while also can update new class data immediately.
Electroencephalography-based brain-computer interface using neural networksTELKOMNIKA JOURNAL
This study aimed to develop a brain-computer interface that can control an electric wheelchair using electroencephalography (EEG) signals. First, we used the Mind Wave Mobile 2 device to capture raw EEG signals from the surface of the scalp. The signals were transformed into the frequency domain using fast Fourier transform (FFT) and filtered to monitor changes in attention and relaxation. Next, we performed time and frequency domain analyses to identify features for five eye gestures: opened, closed, blink per second, double blink, and lookup. The base state was the opened-eyes gesture, and we compared the features of the remaining four action gestures to the base state to identify potential gestures. We then built a multilayer neural network to classify these features into five signals that control the wheelchair’s movement. Finally, we designed an experimental wheelchair system to test the effectiveness of the proposed approach. The results demonstrate that the EEG classification was highly accurate and computationally efficient. Moreover, the average performance of the brain-controlled wheelchair system was over 75% across different individuals, which suggests the feasibility of this approach.
Adaptive segmentation algorithm based on level set model in medical imagingTELKOMNIKA JOURNAL
For image segmentation, level set models are frequently employed. It offer best solution to overcome the main limitations of deformable parametric models. However, the challenge when applying those models in medical images stills deal with removing blurs in image edges which directly affects the edge indicator function, leads to not adaptively segmenting images and causes a wrong analysis of pathologies wich prevents to conclude a correct diagnosis. To overcome such issues, an effective process is suggested by simultaneously modelling and solving systems’ two-dimensional partial differential equations (PDE). The first PDE equation allows restoration using Euler’s equation similar to an anisotropic smoothing based on a regularized Perona and Malik filter that eliminates noise while preserving edge information in accordance with detected contours in the second equation that segments the image based on the first equation solutions. This approach allows developing a new algorithm which overcome the studied model drawbacks. Results of the proposed method give clear segments that can be applied to any application. Experiments on many medical images in particular blurry images with high information losses, demonstrate that the developed approach produces superior segmentation results in terms of quantity and quality compared to other models already presented in previeous works.
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...TELKOMNIKA JOURNAL
Drug addiction is a complex neurobiological disorder that necessitates comprehensive treatment of both the body and mind. It is categorized as a brain disorder due to its impact on the brain. Various methods such as electroencephalography (EEG), functional magnetic resonance imaging (FMRI), and magnetoencephalography (MEG) can capture brain activities and structures. EEG signals provide valuable insights into neurological disorders, including drug addiction. Accurate classification of drug addiction from EEG signals relies on appropriate features and channel selection. Choosing the right EEG channels is essential to reduce computational costs and mitigate the risk of overfitting associated with using all available channels. To address the challenge of optimal channel selection in addiction detection from EEG signals, this work employs the shuffled frog leaping algorithm (SFLA). SFLA facilitates the selection of appropriate channels, leading to improved accuracy. Wavelet features extracted from the selected input channel signals are then analyzed using various machine learning classifiers to detect addiction. Experimental results indicate that after selecting features from the appropriate channels, classification accuracy significantly increased across all classifiers. Particularly, the multi-layer perceptron (MLP) classifier combined with SFLA demonstrated a remarkable accuracy improvement of 15.78% while reducing time complexity.
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxR&R Consult
CFD analysis is incredibly effective at solving mysteries and improving the performance of complex systems!
Here's a great example: At a large natural gas-fired power plant, where they use waste heat to generate steam and energy, they were puzzled that their boiler wasn't producing as much steam as expected.
R&R and Tetra Engineering Group Inc. were asked to solve the issue with reduced steam production.
An inspection had shown that a significant amount of hot flue gas was bypassing the boiler tubes, where the heat was supposed to be transferred.
R&R Consult conducted a CFD analysis, which revealed that 6.3% of the flue gas was bypassing the boiler tubes without transferring heat. The analysis also showed that the flue gas was instead being directed along the sides of the boiler and between the modules that were supposed to capture the heat. This was the cause of the reduced performance.
Based on our results, Tetra Engineering installed covering plates to reduce the bypass flow. This improved the boiler's performance and increased electricity production.
It is always satisfying when we can help solve complex challenges like this. Do your systems also need a check-up or optimization? Give us a call!
Work done in cooperation with James Malloy and David Moelling from Tetra Engineering.
More examples of our work https://www.r-r-consult.dk/en/cases-en/
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...Amil Baba Dawood bangali
Contact with Dawood Bhai Just call on +92322-6382012 and we'll help you. We'll solve all your problems within 12 to 24 hours and with 101% guarantee and with astrology systematic. If you want to take any personal or professional advice then also you can call us on +92322-6382012 , ONLINE LOVE PROBLEM & Other all types of Daily Life Problem's.Then CALL or WHATSAPP us on +92322-6382012 and Get all these problems solutions here by Amil Baba DAWOOD BANGALI
#vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore#blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #blackmagicforlove #blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #Amilbabainuk #amilbabainspain #amilbabaindubai #Amilbabainnorway #amilbabainkrachi #amilbabainlahore #amilbabaingujranwalan #amilbabainislamabad
Water scarcity is the lack of fresh water resources to meet the standard water demand. There are two type of water scarcity. One is physical. The other is economic water scarcity.
Overview of the fundamental roles in Hydropower generation and the components involved in wider Electrical Engineering.
This paper presents the design and construction of hydroelectric dams from the hydrologist’s survey of the valley before construction, all aspects and involved disciplines, fluid dynamics, structural engineering, generation and mains frequency regulation to the very transmission of power through the network in the United Kingdom.
Author: Robbie Edward Sayers
Collaborators and co editors: Charlie Sims and Connor Healey.
(C) 2024 Robbie E. Sayers
Hierarchical Digital Twin of a Naval Power SystemKerry Sado
A hierarchical digital twin of a Naval DC power system has been developed and experimentally verified. Similar to other state-of-the-art digital twins, this technology creates a digital replica of the physical system executed in real-time or faster, which can modify hardware controls. However, its advantage stems from distributing computational efforts by utilizing a hierarchical structure composed of lower-level digital twin blocks and a higher-level system digital twin. Each digital twin block is associated with a physical subsystem of the hardware and communicates with a singular system digital twin, which creates a system-level response. By extracting information from each level of the hierarchy, power system controls of the hardware were reconfigured autonomously. This hierarchical digital twin development offers several advantages over other digital twins, particularly in the field of naval power systems. The hierarchical structure allows for greater computational efficiency and scalability while the ability to autonomously reconfigure hardware controls offers increased flexibility and responsiveness. The hierarchical decomposition and models utilized were well aligned with the physical twin, as indicated by the maximum deviations between the developed digital twin hierarchy and the hardware.
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Dr.Costas Sachpazis
Terzaghi's soil bearing capacity theory, developed by Karl Terzaghi, is a fundamental principle in geotechnical engineering used to determine the bearing capacity of shallow foundations. This theory provides a method to calculate the ultimate bearing capacity of soil, which is the maximum load per unit area that the soil can support without undergoing shear failure. The Calculation HTML Code included.
Transfer learning with multiple pre-trained network for fundus classification
1. TELKOMNIKA Telecommunication, Computing, Electronics and Control
Vol. 18, No. 3, June 2020, pp. 1382~1388
ISSN: 1693-6930, accredited First Grade by Kemenristekdikti, Decree No: 21/E/KPT/2018
DOI: 10.12928/TELKOMNIKA.v18i3.14868 1382
Journal homepage: http://journal.uad.ac.id/index.php/TELKOMNIKA
Transfer learning with multiple pre-trained network
for fundus classification
Wahyudi Setiawan1
, Moh. Imam Utoyo2
, Riries Rulaningtyas3
1
Informatics Department, University of Trunojoyo Madura, Indonesia
2
Mathematics Department, Universitas Airlangga, Indonesia
3
Physics Department, Universitas Airlangga, Indonesia
Article Info ABSTRACT
Article history:
Received Sep 10, 2019
Revised Dec 14, 2019
Accepted Dec 22, 2019
Transfer learning (TL) is a technique of reuse and modify a pre-trained network.
It reuses feature extraction layer at a pre-trained network. A target domain in TL
obtains the features knowledge from the source domain. TL modified
classification layer at a pre-trained network. The target domain can do new tasks
according to a purpose. In this article, the target domain is fundus image
classification includes normal and neovascularization. Data consist of
100 patches. The comparison of training and validation data was 70:30.
The selection of training and validation data is done randomly. Steps of TL i.e
load pre-trained networks, replace final layers, train the network, and assess
network accuracy. First, the pre-trained network is a layer configuration of
the convolutional neural network architecture. Pre-trained network used are
AlexNet, VGG16, VGG19, ResNet50, ResNet101, GoogLeNet, Inception-V3,
InceptionResNetV2, and squeezenet. Second, replace the final layer is to replace
the last three layers. They are fully connected layer, softmax, and output layer.
The layer is replaced with a fully connected layer that classifies according to
number of classes. Furthermore, it's followed by a softmax and output layer that
matches with the target domain. Third, we trained the network. Networks were
trained to produce optimal accuracy. In this section, we use gradient descent
algorithm optimization. Fourth, assess network accuracy. The experiment results
show a testing accuracy between 80% and 100%.
Keywords:
Classification
Convolutional neural network
Multiple pre-trained network
Neovascularization
Transfer learning
This is an open access article under the CC BY-SA license.
Corresponding Author:
Riries Rulaningtyas,
Physics Department,
Universitas Airlangga, Surabaya, Indonesia.
Email: riries-r@fst.unair.ac.id
1. INTRODUCTION
A system requires learning process to perform certain tasks. The tasks include image enhancement,
classification, clustering, recognition, and detection. Data processing needs to do it. Data is divided into two
parts, training and testing data. In conventional systems, training data processed to get knowledge.
The problem arises when the amount of training data is limited, the learning process doesn’t well perform.
An alternative solution to the problem is transfer learning. It is a machine learning method that works by
utilizing existing models. Transfer learning modifies and updates parameters on the model. Transfer learning
makes modified models as learning with different tasks. The model used for transfer learning has learned
from other data, so learning is not needed from scratch. The model has recognized features such as textures,
shapes, and colors as a result of previous learning.
2. TELKOMNIKA Telecommun Comput El Control
Transfer learning with multiple pre-trained network for fundus classification (Wahyudi Setiawan)
1383
The benefit of transfer learning is well learning even though limited training data. Contrast to
traditional machine learning, every learning process always requires relatively large amounts of data [1].
The difference between traditional machine learning and transfer learning is found in Figure 1. Nowadays,
transfer learning has been applied to robotics [2, 3] image classification [4, 5], sentiment classification [6],
game technology [7, 8] and text classification [9]. Generally, the type of transfer learning used in deep
learning is a pre-trained network. The phase for conducting transfer learning as follows:
- Select a specific model. Pre-trained network models are taken from existing models.
- Reused model. Pre-trained models can be used as a starting point for carrying out a new task. A new task
can use the whole part of a pre-trained model or partly depends on system requirements.
(a) (b)
Figure 1. (a) Traditional machine learning, (b) Transfer learning, modified from
- Modification of the model. Modifications are made at the last fully connected layer
This paper discussed classification of fundus images. Classification for distinguishing normal and
neovascularization. Neovascularization is the appearance of new vessels in optic disk or other surfaces
of retina. Neovascularization features of the blood vessels are brittle, irregular in shape, and easily lost.
Neovascularization is a severe diabetic retinopathy (DR). Neovascularization consists of two categories:
neovascularization on the disc (NVD) and neovascularization elsewhere (NVE). NVD is a new vessel
in the optic disc while NVE is a new vessel in the entire surface of the retina except in optic disc [10].
neovascularization of the fundus images is shown in Figure 2.
There are previous studies that classified fundus images. Tennakoon et al. classified two categories:
gradable and ungradable based on image quality fundus. The model uses shallowNet and a modified AlexNet
model. A fully connected layer (FCL) fc7 is a layer for feature extraction. FCL fc8 is the fine-tuning layer for
classification. Classification using SVM, boosted tree, and k-NN methods. The data consists of 463 images.
The highest accuracy is 98.27% using shallowNet [11]. Li et al. classified fundus images using data from
DR1 and MESSIDOR. The amount of data for each dataset is 1,014 and 1,200 images. There are three steps
for transfer learning, fine-tuning all layers on pre-trained CNN models according to their functions,
fine-tuning pre-trained CNN models on additional layers, then feature extraction and classification
using SVM.
(a) (b)
Figure 2. Neovascularization in fundus patch; (a) NVE, (b) NVD [12]
The experiment uses several models, AlexNet, googleNet, and VGG. The parameters used were
maximum epoch 30, minibatch size 50, learning rates 0.1 to 0.0001, weights 0.0005 and momentum 0.95.
Optimization using stochastic gradient descent with momentum (SGDM) algorithm. The test results showed
the best accuracy is the modified VGG-m model of 95.49% for the DR1 dataset and GoogleNet modification
of 79.37% for the MESSIDOR dataset [13].
3. ISSN: 1693-6930
TELKOMNIKA Telecommun Comput El Control, Vol. 18, No. 3, June 2020: 1382 - 1388
1384
Choi et al. [14] classified 10 classes of diabetic retinopathy (DR). Data consists of 10,000 images.
Each category has 1,000 images. The model used for transfer learning is VGG19 and AlexNet.
The optimization algorithm uses SGDM, momentum 0.9 learning rate 10-6, and max epoch 50. The test
scenario is varied, with 3 and 5 classes. The test results showed the best accuracy is VGG19 for classification
of three categories at 80.8%, while classification of 5 categories showed the highest accuracy of 59.1% [14].
Masood et al. classified 4 DR classed as mild, moderate, severe non-proliferative diabetic retinopathy
(NPDR), and PDR. The dataset is taken from eyePacs. The steps for training learning are preprocessing and
retraining Inception-V3. Result shows 48.2% for accuracy [15].
Oktalora et al. [16] classify for exudate. Exudate is a symptom in the form of a yellow spot, irregular
shape, arising from lipid infiltration in the retina. Exudate is a symptom of diabetic retinopathy. This study
uses a LeNet model with seven layers. Experiment data using Optha dataset. The size of data is 48x48 pixels.
The classification consists of two categories: normal and exudate [16]. Sadek et al. build transfer learning to
classify 3 categories include normal, exudates, and drusen. The dataset uses are STARE, HRF, DrisonDB,
Optha, HEIMED and MESSIDOR dataset. Transfer learning uses modified VGG, GoogleNet and ResNet
models. Result shows average accuracy from 91.23% to 92% [17].
The above studies have not reached the optimal accuracy. Characteristics of diabetic retinopathy
(DR) disease have not been fully classified. The characteristics of DR are microaneurysm, hemorrhages,
exudates, cotton wool spots, and neovascularization. The novelty of this study is classification of fundus
images to distinguish normal and neovascularization using transfer learning. Besides, novelty is also found
in CNN modification technique by utilizing the last three layers of each model. The results of measurement
accuracy from transfer learning are compared in the optimization of gradient descent such as stochastic
gradient descent with momentum (SGDM), root mean square propagation (RMSProp), and adaptive moment
optimization (Adam).
2. RESEARCH METHOD
The experiment data consists of 2 classes include normal and neovascularization. Each class has
50 patches, so the total data is 100 patches. It is taken from the MESSIDOR [18] and retina image
bank [12]. The pre-trained network is a CNN model. CNN is the same as the other neural networks,
consisting of weight, bias and activation functions. CNN has 2 big parts of the layer, layer for feature
extraction and layer for classification. The layer for feature extraction consists of a convolutional layer,
pooling layer, stride, and padding. While layer for classification consists of fully connected layer, softmax,
and output layer [19]. Pre-trained network becomes a part of transfer learning. Phase of transfer learning is
import pre-trained network, replace classification layer, train network on the fundus image data, and get
performance measure of accuracy.
2.1. Import pre-trained network
We use AlexNet [19], VGG16, VGG19 [20], ResNet50, ResNet101 [21], GoogleNet [22],
Inception-V3 [23], Inception_ResNetV2 [24], and Squeezenet [25] as pre-trained network. The pre-trained
network has trained in ImageNet competition. It has more than a million images and 20,000 classes [26].
Each pre-trained network has a different layer configuration. The layer at the beginning and middle called
a feature extraction layer. These layers produce simple features such as brightness and edges, to complex
unique features such as colors and shapes. The results of feature extraction layer at source domain can
be transferred for feature extraction layer at target domain. Feature extraction learning on target domains also
knows training fundus data images. Figure 3 shows a proposed framework of transfer learning for fundus
image classification.
Figure 3. The proposed framework of transfer learning for fundus image classification
4. TELKOMNIKA Telecommun Comput El Control
Transfer learning with multiple pre-trained network for fundus classification (Wahyudi Setiawan)
1385
2.2. Replace classification layer
The classification layer is known as 3 final layers i.e fully connected layer, softmax, and an output
layer. It replaced by a pre-trained network and substitutes with new classification layer that matched with
a new classification task. It includes new number of classes and a set learning rate in the new network.
There are exceptions for squeezenet, a layer that must be replaced consists of five layers. Table 1 shows
the classification layer that replaces the network.
Table 1. Classification layer of the pre-trained network
Pre-Trained Network Classification Layer
AlexNet fc8, prob, output
VGG16 fc8, prob, output
VGG19 fc8, prob, output
ResNet50 fc1000, fc1000_softmax, classificationlayer_fc1000
ResNet101 fc1000, prob, classificationlayer_predictions
GoogleNet loss3-classifier, prob, output
Inception-V3 predictions, predictions_softmax, classificationlayer_predictions
Inception-ResNetV2 predictions, predictions_softmax, classificationlayer_predictions
Squeezenet conv10, relu_conv10, pool10, prob, classificationlayer_predictions
2.3. Train network on fundus image
Data is processed with research method as shown in Figure 3. It is a phase of transfer learning with
a pre-trained network for classification of fundus images. At the top, source domains are pre-trained
networks that have classified data on ImageNet. The train network also needs an optimization algorithm. We
use an optimization gradient descent algorithm. Gradient descent (GD) obtain optimal parameter weights,
reduce prediction errors and improve predictions of accuracy. GD performs parameter optimization on
the network. Besides, GD has a linear complexity of data increment. GD can be computed in parallel by
utilizing a graphical processing unit (GPU). The application of GD on the CNN model proves that GD can do
training with millions of data [27].
2.4. Gradient descent with momentum
Momentum is a method for GD acceleration by utilizing gradient information in the previous
steps. Accumulation of gradients is useful for controlling oscillatory effects. Furthermore, it is expected that
the optimization path can be more stable [28].
Algorithm 1. (Gradient descent with momentum)
1. 𝑚0 = 0
2. 𝑔𝑡 ∶= ∇ 𝜃 𝑡−1
𝐿 (𝜃𝑡−1)
3. 𝑚 𝑡 ≔ 𝑔𝑡 + 𝛽𝑚 𝑡−1
4. 𝜃𝑡 ∶= 𝜃𝑡−1 − 𝛼 𝑚 𝑡
with 𝑔𝑡 = gradient loos function to 𝜃𝑡−1, 𝜃𝑡 = next parameter, 𝛼 = learning rate.
The constant controls the size of the contribution from the previous gradient. Generally, set to 0.9
is the best value of the experiment that has been carried out. If set to 0, then the GDM results are the same as
GD. Stochastic gradient descent with momentum (SGDM) is a variant of GDM. The difference is data
access. If in GDM the data is processed all the data at the same time. Data on SGDM will be processed
suitably with minibatch size [27].
2.4.1. AdaGrad and RMSProp
Adaptive subgradient descent (AdaGrad) [29] get GD improvements by providing different update
speeds for each vector dimension. The AdaGrad algorithm is found in algorithm 2 [28]. Constants provide
information about changing the value of an element in the gradient vector. If the value in a certain dimension
decreases, the update speed in certain dimensions increases and vice versa. This will balance the contribution
of each dimension of the gradient vector so that the optimization path becomes more stable
Algorithm 2. (AdaGrad)
1. 𝑛0 = 0
2. 𝑔𝑡 ∶= ∇ 𝜃 𝑡−1
𝐿 (𝜃𝑡−1)
3. 𝑛 𝑡 ≔ 𝑔𝑡
2
+ 𝑛 𝑡−1
4. 𝜃𝑡 ∶= 𝜃𝑡−1 − 𝛼
𝑔𝑡
√𝑛 𝑡+ 𝜀
with 𝑛 𝑡 = adaptive subgradient, 𝜀 =a constant 1e-6
The problem with AdaGrad is the value can be very large at certain time. It will slow down
the optimization process time. The solution to these problems is to modify by adding constants. These
5. ISSN: 1693-6930
TELKOMNIKA Telecommun Comput El Control, Vol. 18, No. 3, June 2020: 1382 - 1388
1386
constants are used to set variable quantities. The root means square propagation (RMSProp) algorithm is
found in algorithm 3 [28, 30].
Algorithm 3. (RMSProp)
1. 𝑛0 = 0
2. 𝑔𝑡 ∶= ∇ 𝜃 𝑡−1
𝐿 (𝜃𝑡−1)
3. 𝑛 𝑡 ≔ (1 − 𝛾)𝑔𝑡
2
+ 𝛾𝑛 𝑡−1
4. 𝜃𝑡 ∶= 𝜃𝑡−1 − 𝛼
𝑔𝑡
√𝑛 𝑡+ 𝜀
with 𝛾 = koefisien decay rate 0,95
2.4.2. Adaptive moment optimization (Adam)
Adam algorithm combines the two approaches to improve GD, momentum and adaptive
subgradient. This algorithm combines GDM with RMSProp. Adam's algorithm is shown in algorithm 4 [28].
Line 4 is an element of momentum, line 6 is an adaptive subgradient element. Adam has a correction bias
technique with a better approximation [31].
Algorithm 4. Adam
1. 𝑛0 = 0
2. 𝑔𝑡 ∶= ∇ 𝜃 𝑡−1
𝐿 (𝜃𝑡−1)
3. 𝑚 𝑡 ≔ (1 − 𝛽)𝑔𝑡 + 𝛽𝑚 𝑡−1
4. 𝑚̂ 𝑡 ∶=
𝑚 𝑡
1− 𝛽 𝑡
5. 𝑛 𝑡 ≔ (1 − 𝛾)𝑔𝑡
2
+ 𝛾𝑛 𝑡−1
6. 𝑛̂ 𝑡 ∶=
𝑛 𝑡
1− 𝛾 𝑡
7. 𝜃𝑡 ∶= 𝜃𝑡−1 − 𝛼
𝑚 𝑡̂
√𝑛 𝑡̂+ 𝜀
𝑚 𝑡 = momentum, 𝑛 𝑡 = adaptive subgradient, 𝑚̂ 𝑡 = momentum estimation with corrected bias at time t,
𝑛̂ 𝑡 = = adaptive subgradient estimation with corrected bias at time t.
3. RESULT AND DISCUSSION
The experiment includes the following scenarios:
- Data divide two parts, 70% for training and 30% for testing. Total data is 100 patches, 70 patches
for training and 30 patches for testing
- Training phase. Make sure the image size at the training and validation suitable with a pre-trained model.
If it is not yet sized, then resize the image. Another alternative is to get an augmentation image
to automatically suitable with the image input size.
- The training parameters are set as follows: learning rate 1e-4, minibatch size 4, max epoch 5,
validation frequency 3.
The results of an experiment are shown in Tables 2, 3, and 4. Table 3 shows validation using SGDM
algorithm produces the best accuracy up to 100% using VGG16 with a time of 16,572 seconds. In Table 4,
validation with RMSProp produces the best accuracy value of up to 93.3% with a time of 164.38 seconds.
Pre-trained Network used is Resnet50. Table 5 shows that validation using Adam algorithm produces best
accuracy of 96.7%. The experiment using Alexnet with a processing time of 36,274 seconds.
The initialization of learning rate, minibatch size, max epoch, validation frequency, and gradient
descent optimization algorithm are factors that influence results of validation accuracy and processing time.
Learning rate can be initialized starting from big value until it gradually shrinks. Learning rate is between
0 and 1. If learning rate too large, it will cause overfitting, while learning rate value is too small which will
cause a longer processing time.
Table 2. Accuracy and time processing
with SGDM
Pre-trained Network Accuracy (%) Time (Second)
Alex Net 93.3 8.9537
googLeNet 86.7 12.876
ResNet50 100 49.411
VGG16 100 16.572
VGG19 80 18.163
ResNet101 93.3 126.61
Inception-V3 96.7 95.715
InceptionResNetV2 70 352.05
Squeezenet 96.7 4.0363
Table 3. Accuracy and time processing
with RMSProp
Pre-trained Network Accuracy (%) Time (Second)
AlexNet 83.3 26.957
googLeNet 90 73.062
ResNet50 93.3 164.38
VGG16 53.3 246.06
VGG19 50 225.00
ResNet101 93.3 305.7
Inception-V3 90 230
InceptionResNetV2 90 530.06
Squeezenet 53.3 18.896
6. TELKOMNIKA Telecommun Comput El Control
Transfer learning with multiple pre-trained network for fundus classification (Wahyudi Setiawan)
1387
Table 4. Accuracy and time processing with adam
Pre-trained Network Accuracy (%) Time (Second)
AlexNet 96.7 36.274
googLeNet 93.3 64.961
ResNet50 90 149.93
VGG16 50 126.31
VGG19 50 388.24
ResNet101 86.7 196.91
Inception-V3 86.7 252.1
InceptionResNetV2 93.3 448.99
Squeezenet 96.7 44.925
Table 5. Result comparison with the previous study
Author Class Pre-trained Network Accuracy (%)
Tennakoon et al. [11] 2 AlexNet 98.27
Li et al. [13] 2 VGGm, GoogleNet
95.49
79.39
Choi et al. [14]
3 VGG19 80.8
5 VGG19 59.1
Masood et al. [15] 4 Inception V3 48.2
Sadek et al. [17] 3 VGG, GoogleNet, ResNet 91.23-92
Proposed Method 2
AlexNet 96.7
GoogLeNet 93.3
ResNet50 100
VGG16 100
VGG19 80
ResNet101 93.3
Inception-V3 96.7
InceptionResNetV2 93.3
Squeezenet 96.7
Minibatch size will affect memory usage during processing. Smaller minibatch size requires less
memory when processing. Generally, minibatch size is 2n
. Max epoch value is maximum value that can
be done to process one feedforward on CNN. Iteration stops when an error is constant or when max epoch
is reached. Validation frequency is value given for the number of validation frequency. These values can
be varied to obtain optimal accuracy and minimal processing time. Table 5 shows comparison between the
methods in this article with previous studies. In this article, transfer learning was done with multiple pre-
trained networks include multiple pre-trained networks. The results showed validation of up to 100%
accuracy using ResNet50 and VGG16.
4. CONCLUSION
Transfer learning using multiple pre-trained networks has been made to determine the category
of fundus images including normal and neovascularization. It is used 100 patches taken from MESSIDOR
and Retina Image Bank. Transfer learning can be used as an option to increase validation accuracy.
The experiment result shows the best classification is found in transfer learning using pre-trained network
VGG16 with validation accuracy up to 100% and time processing 16,572 seconds. For further research,
we can use own CNN model. The amount of data and number of classes need to be enlarged for validation
reliability of CNN model.
REFERENCES
[1] S. J. Pan and Q. Yang, “A survey on transfer learning,” IEEE Transactions on Knowledge and Data Engineering,
vol. 22, no. 10, pp. 1345-1359, October 2010.
[2] M. K. Helwa and A. P. Schoellig, “Multi-Robot Transfer Learning : A Dynamical System Perspective,”
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4702-4708, 2017.
[3] B. Botond and J. Peters, “Alignment-based Transfer Learning for Robot Models,” The 2013 International Joint
Conference Neural Networks (IJCNN), 2013.
[4] Y. Zhu, Y. Chen, and Z. Lu, “Heterogeneous Transfer Learning for Image Classification,” Twenty-Fifth AAAI
Conference on Artificial Intelligence Heterogeneous, pp. 1304-1309, 2008.
[5] B. Petrovska, I. Stojanovic, and T. Atanasova-pacemska, “Classification of Small Sets of Images with Pre-trained
Neural Networks,” Int. J. Eng. Manuf., vol. 4, pp. 40-55, 2018.
7. ISSN: 1693-6930
TELKOMNIKA Telecommun Comput El Control, Vol. 18, No. 3, June 2020: 1382 - 1388
1388
[6] Y. Yoshida, T. Hirao, T. Iwata, M. Nagata, and Y. Matsumoto, “Transfer Learning for Multiple-Domain Sentiment
Analysis-Identifying Domain Dependent/Independent Word Polarity,” AAAI Conference on Artificial Intelligence,
pp. 1286-1291, 2011.
[7] M. Sharma, M. Holmes, J. Santamaria, A. Irani, C. Isbell, and A. Ram, “Transfer Learning in Real-Time Strategy
Games Using Hybrid CBR/RL,” IJCAI, pp. 1041-1046, 2005.
[8] T. R. Hinrichs and K. D. Forbus, “Transfer Learning through Analogy in Games,” Ai Magazine, vol. 32, no. 1,
pp. 70-83, 2011.
[9] C. B. Do and A. Y. Ng, “Transfer learning for text classification,” Conference: Advances in Neural Information
Processing Systems 18 [Neural Information Processing Systems], 2005.
[10] W. Setiawan, M. Utoyo, and R. Rulaningtyas, “Classification of neovascularization using convolutional neural
network model,” TELKOMNIKA Telecommunication Computing Electronics and Control, vol. 17, no. 1,
pp. 463-473, 2019.
[11] R. Tennakoon and P. Roy, “Image Quality Classification for DR Screening Using Convolutional Neural
Networks,” Proceedings of the Ophthalmic Medical Image Analysis International Workshop, pp. 113-120, 2016.
[12] American Society of Retina Specialists, “Retina Image Bank,” [Online]. Available: https://imagebank.asrs.org
[13] X. Li, T. Pang, B. Xiong, W. Liu, P. Liang, and T. Wang, “Convolutional Neural Networks Based Transfer
Learning for Diabetic Retinopathy Fundus Image Classification,” 10th
International Congress on Image and Signal
Processing, BioMedical Engineering and Informatics (CISP-BMEI), no. 978, 2017.
[14] J. Y. Choi, T. K. Yoo, J. G. Seo, J. Kwak, T. T. Um, and T. H. Rim, “Multi-categorical deep learning neural
network to classify retinal images : A pilot study employing small database,” PLoS One, pp. 1-16, 2017.
[15] S. Masood and T. Luthra, “Identification of Diabetic Retinopathy in Eye Images Using Transfer Learning,”
International Conference on Computing, Communication and Automation (ICCCA2017), no. 2,
pp. 1183-1187, 2017.
[16] S. Oktalora, O. Perdomo, F. Gonzales, and H. Muller, “Training Deep Convolutional Neural Networks with Active
Learning for Exudate Classification in Eye Fundus Images,” CVII-STENT/LABELS 2017, LNCS 10552,
pp. 146-154, 2017.
[17] I. Sadek, M. Elawady, A. El, and R. Shabayek, “Automatic Classification of Bright Retinal Lesions via Deep
Network Features,” ArXiv, pp. 1-20, 2017.
[18] E. Decencière et al., “Feedback on a Publicly Distributed Image Database: the Messidor Database,” Image Anal.
Stereol., vol. 33, no. 3, pp. 231, 2014.
[19] A. Krizhevsky and G. E. Hinton, “ImageNet Classification with Deep Convolutional Neural Networks,” Advances
in neural information processing systems, vol. 25, no. 2, pp. 1-9, 2012.
[20] K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” ICLR,
pp. 1-14, 2015.
[21] K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," IEEE Conference on
Computer Vision and Pattern Recognition (CVPR), pp. 770-778, Las Vegas, NV, 2016.
[22] C. Szegedy et al., "Going deeper with convolutions," IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), pp. 1-9, 2015.
[23] Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens and Z. Wojna, "Rethinking the Inception Architecture for Computer
Vision," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV,
pp. 2818-2826, 2016.
[24] C. Szegedy, S. Ioffe, and V. Vanhoucke, “Inception-v4, Inception-ResNet and the Impact of Residual Connections
on Learning,” ArXiv, pp. 1-12, 2016.
[25] F. N. Iandola, S. Han, M. W. Moskewicz, K. Ashraf, W. J. Dally, and K. Keutzer, “SqueezeNet: AlexNet-level
accuracy with 50x fewer parameters and <0.5MB model size,” ICLR, pp. 1-13, 2017.
[26] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and F.-F. Li, “ImageNet: A large-scale hierarchical image database,”
IEEE Conf. Comput. Vis. pattern Recognit, pp. 248-255, 2009.
[27] S. Ruder, “An overview of gradient descent optimization,” ArXiv, pp. 1-14, 2017.
[28] M. Ghifary, “Deep Learning Optimization,” 2017. [Online]. Available: https://ghif.github.io/aiml/2017/04/11/
optimisasi-pada-deep-learning.html. [Accessed: 20-Aug-2018].
[29] J. Duchi, E. Hazan, and Y. Singer, “Adaptive subgradient methods for online learning and stochastic optimization,”
J. Mach. Learn. Res., vol. 12, pp. 2121-2159, 2011.
[30] G. E. Hinton, N. Srivastava, and K. Swersky, “Lecture 6a- overview of mini-batch gradient descent,” COURSERA
Neural Networks Mach. Learn., pp. 31, 2012.
[31] D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” ICLR, pp. 1-15, 2015.