This document summarizes a study that used pre-trained convolutional neural network (CNN) models to classify breast cancer histopathology images as benign or malignant. Five CNN architectures - ResNet-50, VGG-19, Inception-V3, AlexNet, and ResNet-50 as a feature extractor combined with random forest and k-nearest neighbors classifiers - were evaluated on the publicly available BreakHis dataset. The ResNet-50 network achieved the highest test accuracy of 97% for classifying the breast cancer images.
Modified fuzzy rough set technique with stacked autoencoder model for magneti...IJECEIAES
Breast cancer is the common cancer in women, where early detection reduces the mortality rate. The magnetic resonance imaging (MRI) images are efficient in analyzing breast cancer, but it is hard to identify the abnormalities. The manual breast cancer detection in MRI images is inefficient; therefore, a deep learning-based system is implemented in this manuscript. Initially, the visual quality improvement is done using region growing and adaptive histogram equalization (AHE), and then, the breast lesion is segmented by Otsu thresholding with morphological transform. Next, the features are extracted from the segmented lesion, and a modified fuzzy rough set technique is proposed to reduce the dimensions of the extracted features that decreases the system complexity and computational time. The active features are fed to the stacked autoencoder for classifying the benign and malignant classes. The results demonstrated that the proposed model attained 99% and 99.22% of classification accuracy on the benchmark datasets, which are higher related to the comparative classifiers: decision tree, naïve Bayes, random forest and k-nearest neighbor (KNN). The obtained results state that the proposed model superiorly screens and detects the breast lesions that assists clinicians in effective therapeutic intervention and timely treatment.
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...IRJET Journal
This document presents research on using the DenseNet169 deep learning model for cervical cancer detection. The researcher trained and tested the model on a large cervical cell image dataset from Kaggle. Through data preprocessing like augmentation and normalization, and transfer learning by fine-tuning a DenseNet pre-trained on ImageNet, the model achieved 95.27% accuracy in classifying five cervical cell types. Evaluation of the model showed high average precision, recall, and F1-score, demonstrating its ability to correctly classify different cervical cell images. The research highlights the potential of deep learning models for automating cervical cancer screening and improving early detection.
Breast cancer histological images nuclei segmentation and optimized classifi...IJECEIAES
This document summarizes a research paper that proposes a deep learning method for breast cancer histological image segmentation and classification. The method uses a feature pyramid network to extract features from histological images, which are then optimized using a grasshopper optimization algorithm. The optimized features are used with a segmenting objects by locations convolutional network for nucleus segmentation. For classification, the features are passed through fully connected layers. The method achieves 88.46% segmentation accuracy and 99.2% classification accuracy on a large breast cancer image dataset. It provides an effective computer-aided approach for breast cancer analysis in a medical setting.
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEYijdpsjournal
Breast cancer has become a common factor now-a-days. Despite the fact, not all general hospitals
have the facilities to diagnose breast cancer through mammograms. Waiting for diagnosing a breast
cancer for a long time may increase the possibility of the cancer spreading. Therefore a computerized
breast cancer diagnosis has been developed to reduce the time taken to diagnose the breast cancer and
reduce the death rate. This paper summarizes the survey on breast cancer diagnosis using various machine
learning algorithms and methods, which are used to improve the accuracy of predicting cancer. This survey
can also help us to know about number of papers that are implemented to diagnose the breast cancer.
The current big challenge facing radiologists in healthcare is the automatic detection and classification of masses in breast mammogram images. In the last few years, many researchers have proposed various solutions to this problem. These solutions are effectively dependent and work on annotated breast image data. But these solutions fail when applied to unlabeled and non-annotated breast image data. Therefore, this paper provides the solution to this problem with the help of a neural network that considers any kind of unlabeled data for its procedure. In this solution, the algorithm automatically extracts tumors in images using a segmentation approach, and after that, the features of the tumor are extracted for further processing. This approach used a double thresholding-based segmentation technique to obtain a perfect location of the tumor region, which was not possible in existing techniques in the literature. The experimental results also show that the proposed algorithm provides better accuracy compared to the accuracy of existing algorithms in the literature.
Breast cancer is the leading cause of death for women worldwide. Cancer can be discovered early, lowering the rate of death. Machine learning techniques are a hot field of research, and they have been shown to be helpful in cancer prediction and early detection. The primary purpose of this research is to identify which machine learning algorithms are the most successful in predicting and diagnosing breast cancer, according to five criteria: specificity, sensitivity, precision, accuracy, and F1 score. The project is finished in the Anaconda environment, which uses Python's NumPy and SciPy numerical and scientific libraries as well as matplotlib and Pandas. In this study, the Wisconsin diagnostic breast cancer dataset was used to evaluate eleven machine learning classifiers: decision tree, quadratic discriminant analysis, AdaBoost, Bagging meta estimator, Extra randomized trees, Gaussian process classifier, Ridge, Gaussian nave Bayes, k-Nearest neighbors, multilayer perceptron, and support vector classifier. During performance analysis, extremely randomized trees outperformed all other classifiers with an F1-score of 96.77% after data collection and data analysis.
Cervical cancer diagnosis based on cytology pap smear image classification us...TELKOMNIKA JOURNAL
This document presents a study on classifying cytology images from pap smear tests to diagnose cervical cancer. The study uses discrete cosine transform (DCT) and Haar transform coefficients as features extracted from the images. Five different sized feature vectors are formed using fractional coefficients. Seven machine learning classifiers are tested on the feature vectors, with random forest classifier achieving the highest accuracy of 81.11%. The study aims to assist pathologists in cervical cancer diagnosis by providing an automated second opinion based on pap smear image analysis and classification.
This document proposes using a DenseNet-II neural network model to classify mammogram images as benign or malignant. It first preprocesses mammogram images through normalization and data augmentation. It then improves the original DenseNet model by replacing the first convolutional layer with an Inception structure, creating a new DenseNet-II model. This model, along with other common models, are tested on mammogram data and the DenseNet-II model achieves the highest average accuracy of 94.55% for benign-malignant classification.
Modified fuzzy rough set technique with stacked autoencoder model for magneti...IJECEIAES
Breast cancer is the common cancer in women, where early detection reduces the mortality rate. The magnetic resonance imaging (MRI) images are efficient in analyzing breast cancer, but it is hard to identify the abnormalities. The manual breast cancer detection in MRI images is inefficient; therefore, a deep learning-based system is implemented in this manuscript. Initially, the visual quality improvement is done using region growing and adaptive histogram equalization (AHE), and then, the breast lesion is segmented by Otsu thresholding with morphological transform. Next, the features are extracted from the segmented lesion, and a modified fuzzy rough set technique is proposed to reduce the dimensions of the extracted features that decreases the system complexity and computational time. The active features are fed to the stacked autoencoder for classifying the benign and malignant classes. The results demonstrated that the proposed model attained 99% and 99.22% of classification accuracy on the benchmark datasets, which are higher related to the comparative classifiers: decision tree, naïve Bayes, random forest and k-nearest neighbor (KNN). The obtained results state that the proposed model superiorly screens and detects the breast lesions that assists clinicians in effective therapeutic intervention and timely treatment.
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...IRJET Journal
This document presents research on using the DenseNet169 deep learning model for cervical cancer detection. The researcher trained and tested the model on a large cervical cell image dataset from Kaggle. Through data preprocessing like augmentation and normalization, and transfer learning by fine-tuning a DenseNet pre-trained on ImageNet, the model achieved 95.27% accuracy in classifying five cervical cell types. Evaluation of the model showed high average precision, recall, and F1-score, demonstrating its ability to correctly classify different cervical cell images. The research highlights the potential of deep learning models for automating cervical cancer screening and improving early detection.
Breast cancer histological images nuclei segmentation and optimized classifi...IJECEIAES
This document summarizes a research paper that proposes a deep learning method for breast cancer histological image segmentation and classification. The method uses a feature pyramid network to extract features from histological images, which are then optimized using a grasshopper optimization algorithm. The optimized features are used with a segmenting objects by locations convolutional network for nucleus segmentation. For classification, the features are passed through fully connected layers. The method achieves 88.46% segmentation accuracy and 99.2% classification accuracy on a large breast cancer image dataset. It provides an effective computer-aided approach for breast cancer analysis in a medical setting.
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEYijdpsjournal
Breast cancer has become a common factor now-a-days. Despite the fact, not all general hospitals
have the facilities to diagnose breast cancer through mammograms. Waiting for diagnosing a breast
cancer for a long time may increase the possibility of the cancer spreading. Therefore a computerized
breast cancer diagnosis has been developed to reduce the time taken to diagnose the breast cancer and
reduce the death rate. This paper summarizes the survey on breast cancer diagnosis using various machine
learning algorithms and methods, which are used to improve the accuracy of predicting cancer. This survey
can also help us to know about number of papers that are implemented to diagnose the breast cancer.
The current big challenge facing radiologists in healthcare is the automatic detection and classification of masses in breast mammogram images. In the last few years, many researchers have proposed various solutions to this problem. These solutions are effectively dependent and work on annotated breast image data. But these solutions fail when applied to unlabeled and non-annotated breast image data. Therefore, this paper provides the solution to this problem with the help of a neural network that considers any kind of unlabeled data for its procedure. In this solution, the algorithm automatically extracts tumors in images using a segmentation approach, and after that, the features of the tumor are extracted for further processing. This approach used a double thresholding-based segmentation technique to obtain a perfect location of the tumor region, which was not possible in existing techniques in the literature. The experimental results also show that the proposed algorithm provides better accuracy compared to the accuracy of existing algorithms in the literature.
Breast cancer is the leading cause of death for women worldwide. Cancer can be discovered early, lowering the rate of death. Machine learning techniques are a hot field of research, and they have been shown to be helpful in cancer prediction and early detection. The primary purpose of this research is to identify which machine learning algorithms are the most successful in predicting and diagnosing breast cancer, according to five criteria: specificity, sensitivity, precision, accuracy, and F1 score. The project is finished in the Anaconda environment, which uses Python's NumPy and SciPy numerical and scientific libraries as well as matplotlib and Pandas. In this study, the Wisconsin diagnostic breast cancer dataset was used to evaluate eleven machine learning classifiers: decision tree, quadratic discriminant analysis, AdaBoost, Bagging meta estimator, Extra randomized trees, Gaussian process classifier, Ridge, Gaussian nave Bayes, k-Nearest neighbors, multilayer perceptron, and support vector classifier. During performance analysis, extremely randomized trees outperformed all other classifiers with an F1-score of 96.77% after data collection and data analysis.
Cervical cancer diagnosis based on cytology pap smear image classification us...TELKOMNIKA JOURNAL
This document presents a study on classifying cytology images from pap smear tests to diagnose cervical cancer. The study uses discrete cosine transform (DCT) and Haar transform coefficients as features extracted from the images. Five different sized feature vectors are formed using fractional coefficients. Seven machine learning classifiers are tested on the feature vectors, with random forest classifier achieving the highest accuracy of 81.11%. The study aims to assist pathologists in cervical cancer diagnosis by providing an automated second opinion based on pap smear image analysis and classification.
This document proposes using a DenseNet-II neural network model to classify mammogram images as benign or malignant. It first preprocesses mammogram images through normalization and data augmentation. It then improves the original DenseNet model by replacing the first convolutional layer with an Inception structure, creating a new DenseNet-II model. This model, along with other common models, are tested on mammogram data and the DenseNet-II model achieves the highest average accuracy of 94.55% for benign-malignant classification.
This document summarizes Mansi Chowkkar's MSc research project on detecting breast cancer from histopathological images using deep learning and transfer learning. The research aims to classify images as malignant or benign with high accuracy and efficiency. It implements CNN and DenseNet-121 models, with the latter using transfer learning with pre-trained ImageNet weights. The research achieved 90.9% test accuracy with CNN and 88.03% accuracy with transfer learning. Related work discusses deep learning applications in healthcare, image processing techniques, prior research on DenseNet, and the use of transfer learning for medical image classification with limited data.
IRJET- Breast Cancer Detection from Histopathology Images: A ReviewIRJET Journal
This document provides a review of techniques for detecting breast cancer from histopathology images. It discusses how histopathology examines tissue samples under a microscope to study diseases at a microscopic level. Detecting cell nuclei is an important first step, as is identifying mitosis (cell division) and metastasis (cancer spreading). The document reviews several techniques that use convolutional neural networks to automatically analyze histopathology images and detect breast cancer, including techniques for nuclei detection and segmentation. These automatic methods aim to assist pathologists by improving efficiency and reducing human error compared to manual analysis.
A deep learning framework for accurate diagnosis of colorectal cancer using h...IJECEIAES
Colorectal cancer (CRC) is one of the most prevalent malignancies worldwide, with high mortality and incidence rates. Early detection of the disease may increase the probability of survival, making it critical to develop effective procedures for precise treatment. In the past few years, there has been an increased use of deep learning techniques in image classification that aid in the detection of various types of cancer. In this study, convolutional neural network (CNN) models were used to classify colorectal cancer into benign and malignant. After applying various data preprocessing techniques to the image dataset, we evaluated our prototypes using three distinct subsets of testing data, representing 20%, 30%, and 40% of the total dataset. Additionally, four pre-trained CNN models (ResNet-18, ResNet-50, GoogLeNet, and MobileNetV2) were trained, and the network architectural techniques were compared by applying the Adam optimizer. Finally, we assessed the performance of algorithms in terms of accuracy, sensitivity, specificity, precision, F1-score, and area under the receiver operating characteristic (ROC) curve (AUC). In this research, deep learning approaches demonstrated high efficacy in accurately diagnosing colorectal cancer. This indicates that these techniques have an important and significant value for advancing medical research.
A Progressive Review on Early Stage Breast Cancer DetectionIRJET Journal
This document provides a progressive review of techniques for early stage breast cancer detection. It discusses various image segmentation methods used in previous research like threshold-based, region-based, edge-based, and clustering-based segmentation. Feature extraction techniques discussed include gray level co-occurrence matrix, principal component analysis, and linear discriminant analysis. Classifiers reviewed are convolutional neural networks, support vector machines, artificial neural networks, random forests, and decision trees. The paper analyzes the merits and limitations of these techniques and identifies convolutional neural networks and artificial neural networks as providing high accuracy for breast cancer classification from images.
A novel approach to jointly address localization and classification of breast...IJECEIAES
Localization of the cancerous region as well as classification of the type of the cancer is highly inter-linked with each other. However, investigation towards existing approaches depicts that these problems are always iindividually solved where there is still a big research gap for a generalized solution towards addressing both the problems. Therefore, the proposed manuscript presents a simple, novel, and less-iterative computational model that jointly address the localization-classification problems taking the case study of early diagnosis of breast cancer. The proposed study harnesses the potential of simple bio-inspired optimization technique in order to obtained better local and global best outcome to confirm the accuracy of the outcome. The study outcome of the proposed system exhibits that proposed system offers higher accuracy and lower response time in contrast with other existing classifiers that are freqently witnessed in existing approaches of classification in medical image process.
Breast Cancer Detection from Mammography Images Using Machine Learning Algorithms (U-Net Segmentation and Dense Net Classifier implementation are in progress)
Early detection of breast cancer using mammography images and software engine...TELKOMNIKA JOURNAL
The breast cancer has affected a wide region of women as a particular case. Therefore, different researchers have focused on the early detection of this disease to overcome it in efficient way. In this paper, an early breast cancer detection system has been proposed based on mammography images. The proposed system adopts deep-learning technique to increase the accuracy of detection. The convolutional neural network (CNN) model is considered for preparing the datasets of training and test. It is important to note that the software engineering process model has been adopted in constructing the proposed algorithm. This is to increase the reliably, flexibility and extendibility of the system. The user interfaces of the system are designed as a website used at country side general purpose (GP) health centers for early detection to the disease under lacking in specialist medical staff. The obtained results show the efficiency of the proposed system in terms of accuracy up to more than 90% and decrease the efforts of medical staff as well as helping the patients. As a conclusion, the proposed system can help patients by early detecting the breast cancer at far places from hospital and referring them to nearest specialist center.
Quantitative Comparison of Artificial Honey Bee Colony Clustering and Enhance...idescitation
This paper introduces a comparison of two popular
clustering algorithms for breast DCE-MRI segmentation
purpose. Magnetic resonance imaging (MRI) is an advanced
medical imaging technique providing rich information about
the human soft tissue anatomy. The goal of breast magnetic
resonance image segmentation is to accurately identify the
principal mass or lesion structures in these image volumes.
There are many methods that exist to segment the breast
DCE-MR images. One of these, K-means clustering procedure
provides effective solutions in many science and engineering
fields. They are especially popular in the pattern classification
and signal processing areas and can segment the breast DCE-
MRI with high precision. The artificial bee colony (ABC)
algorithm is a new, very simple and robust population based
optimization algorithm that is inspired by the intelligent
behavior of honey bee swarms. This paper compares the
performance of two image segmentation techniques in the
subject of breast DCE-MR image. In the experiments, the
real dynamic contrast enhanced magnetic resonance images
(DCE- MRI) are used. Results show that artificial bee colony
algorithm performs better in terms of segmentation accuracy,
robustness and speed of computation.
At the 35th AICC-RCOG Annual Conference in association with FOGSI and MOGS, Dr. Niranjan Chavan, President of MOGS, gave an address on Artificial Intelligence in Gynaecologic Oncology at Taj Lands' End, Bandra, Mumbai on the 6th November 2022
Predictive modeling for breast cancer based on machine learning algorithms an...IJECEIAES
Breast cancer is one of the leading causes of death among women worldwide. However, early prediction of breast cancer plays a crucial role. Therefore, strong needs exist for automatic accurate early prediction of breast cancer. In this paper, machine learning (ML) classifiers combined with features selection methods are used to build an intelligent tool for breast cancer prediction. The Wisconsin diagnostic breast cancer (WDBC) dataset is used to train and test the model. Classification algorithms, including support vector machine (SVM), light gradient boosting machine (LightGBM), random forest (RF), logistic regression (LR), k-nearest neighbors (k-NN), and naïve Bayes, were employed. Performance measures for each of them were obtained, namely: accuracy, precision, recall, F-score, Kappa, Matthews correlation coefficient (MCC), and time. The results indicate that without feature selection, LightGBM achieves the highest accuracy at 95%. With minimum redundancy maximum relevance (mRMR) feature selection (15 features), LightGBM outperforms other classifiers, achieving an accuracy of 98%. For Pearson correlation coefficient feature selection (15 features), LightGBM also excels with a 95% accuracy rate. Lasso feature selection (5 features) produces varied results across classifiers, with logistic regression achieving the highest accuracy at 96%. These findings underscore the importance of feature selection in refining model performance and in improving detection for breast cancer.
A deep convolutional structure-based approach for accurate recognition of ski...IJECEIAES
One-third of all cancer diagnoses worldwide are skin malignancies. One of the most common tumors, skin cancer can develop from a variety of dermatological conditions and is subdivided into different categories based on its textile, color, body, and other morphological characteristics. The most effective strategy to lower the mortality rate of melanoma is early identification because skin cancer incidence has been on the rise recently. In order to categorize dermoscopy images into the four diagnosis classifications of melanoma, benign, malignant, and human against machine (HAM) not melanoma, this research suggests a computer-aided diagnosis (CAD) system. Experimental results show that the suggested approach enabled 97.25% classification accuracy. In order to automate the identification of skin cancer and expedite the diagnosis process in order to save a life, the proposed technique offers a less complex and cutting-edge framework.
The document describes a student project that aims to classify and detect lung disorders from chest x-rays using generative adversarial networks (GANs). It provides an overview of the problem statement, literature review on existing methodologies, and the proposed system architecture. The problem is that early diagnosis of lung diseases is important but current methods are time-consuming. The project aims to develop a hybrid GAN model to overcome issues in existing models and accurately detect lung disorders from x-rays. It will classify diseases like pneumonia and lung cancer.
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...IJERA Editor
This document presents a methodology for classifying masses in digital mammograms using feature extraction and the Possibilistic Fuzzy C Means (PFCM) clustering algorithm and Support Vector Machine (SVM) classification. The methodology involves preprocessing mammogram images using adaptive median filtering and image enhancement to reduce noise. Features such as texture, entropy, and correlation are then extracted. PFCM clustering is used to classify pixels into abnormal and normal regions based on their features. Finally, SVM classification is applied to the extracted features to classify mammograms as normal, benign, or malignant. The methodology aims to help radiologists more accurately detect abnormalities in mammograms at an earlier stage than traditional analysis.
Class imbalance is a pervasive issue in the field of disease classification from
medical images. It is necessary to balance out the class distribution while training a model. However, in the case of rare medical diseases, images from affected
patients are much harder to come by compared to images from non-affected
patients, resulting in unwanted class imbalance. Various processes of tackling
class imbalance issues have been explored so far, each having its fair share of
drawbacks. In this research, we propose an outlier detection based image classification technique which can handle even the most extreme case of class imbalance. We have utilized a dataset of malaria parasitized and uninfected cells. An
autoencoder model titled AnoMalNet is trained with only the uninfected cell images at the beginning and then used to classify both the affected and non-affected
cell images by thresholding a loss value. We have achieved an accuracy, precision, recall, and F1 score of 98.49%, 97.07%, 100%, and 98.52% respectively,
performing better than large deep learning models and other published works.
As our proposed approach can provide competitive results without needing the
disease-positive samples during training, it should prove to be useful in binary
disease classification on imbalanced datasets.
Sigma Xi Student Showcase - Fast Pre-Diagnosis of Breast Cancer Using CNNsMichael Batavia
This document proposes using a convolutional neural network (CNN) to classify breast cancer tumors in lymph nodes and maximize neural network accuracy. The author developed a custom CNN with parallel processing across multiple GPUs that achieved 83% validation accuracy on a breast cancer dataset, outperforming pathologist classification by 10%. A separate experiment found that images with 40x magnification led to statistically better accuracy than 20x magnification images. The CNN architecture, datasets used, training methodology, and results are described to support using CNNs for early and individualized breast cancer diagnosis.
An approach of cervical cancer diagnosis using class weighting and oversampli...TELKOMNIKA JOURNAL
Globally, cervical cancer caused 604,127 new cases and 341,831 deaths in 2020, according to the global cancer observatory. In addition, the number of cervical cancer patients who have no symptoms has grown recently. Therefore, giving patients early notice of the possibility of cervical cancer is a useful task since it would enable them to have a clear understanding of their health state. The use of artificial intelligence (AI), particularly in machine learning, in this work is continually uncovering cervical cancer. With the help of a logit model and a new deep learning technique, we hope to identify cervical cancer using patient-provided data. For better outcomes, we employ Keras deep learning and its technique, which includes class weighting and oversampling. In comparison to the actual diagnostic result, the experimental result with model accuracy is 94.18%, and it also demonstrates a successful logit model cervical cancer prediction.
Titles with Abstracts_2023-2024_Digital Image processing.pdfinfo751436
Digital image processing (DIP) refers to the manipulation of digital images using various algorithms and techniques to enhance or extract information from the images. There are several advantages to using digital image processing:
Enhancement of Image Quality:
DIP allows for the improvement of image quality by reducing noise, correcting distortions, and enhancing details. This is particularly useful in medical imaging, satellite imagery, and surveillance.
Image Restoration:
It helps in restoring images that have been degraded due to factors such as noise, blurring, or compression. Restoration techniques can improve the visual quality of images.
Image Compression:
DIP plays a crucial role in image compression, allowing for the reduction of file sizes while maintaining an acceptable level of image quality. This is essential for efficient storage and transmission of images over networks.
Image Recognition and Computer Vision:
DIP is widely used in computer vision applications for tasks such as object recognition, face detection, and gesture recognition. It enables machines to interpret and understand visual information.
Medical Image Processing:
In medical imaging, DIP is used for tasks like tumor detection, organ segmentation, and image reconstruction. It assists healthcare professionals in diagnosis and treatment planning.
Remote Sensing:
In satellite imagery and remote sensing applications, DIP helps analyze and interpret data for various purposes, including environmental monitoring, agriculture, and disaster management.
Geographic Information Systems (GIS):
DIP is employed in GIS to process and analyze spatial data, enabling the extraction of meaningful information from satellite imagery and maps.
Video Processing:
DIP is used in video processing for tasks such as video compression, object tracking, and motion analysis. It is essential in surveillance systems and video editing.
Authentication and Security:
DIP is utilized in authentication systems, such as fingerprint recognition and iris scanning. It also plays a role in security applications, such as facial recognition for access control.
Automated Inspection and Quality Control:
In industrial settings, DIP is used for automated inspection and quality control. It helps identify defects and ensures the production of high-quality products.
Entertainment and Multimedia:
DIP is integral in various entertainment and multimedia applications, including image and video editing, special effects, and virtual reality.
Scientific Research:
Researchers use DIP in fields such as astronomy, biology, and physics for image analysis, data extraction, and visualization.
In summary, digital image processing offers a wide range of advantages across various fields, contributing to improvements in image quality, information extraction, and automated decision-making processes.
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...IRJET Journal
1) The document reviews various machine learning algorithms and imaging modalities for early-stage breast cancer detection.
2) It discusses studies that have used ultrasound imaging alone or in combination with mammography to detect breast cancers at early stages.
3) Machine learning algorithms like convolutional neural networks have shown promise in classifying ultrasound images to detect breast cancer when trained on large datasets.
PSO-SVM hybrid system for melanoma detection from histo-pathological imagesIJECEIAES
This paper introduces an automated system for skin cancer (melanoma) detection from Histo-pathological images sampled from microscopic slides of skin biopsy. The proposed system is a hybrid system based on Particle Swarm Optimization and Support Vector Machine (PSO-SVM). The features used are extracted from the grayscale image histogram, the co-occurrence matrix and the energy of the wavelet coefficients resulting from the wavelet packet decomposition. The PSO-SVM system selects the best feature set and the best values for the SVM parameters (C and γ) that optimize the performance of the SVM classifier. The system performance is tested on a real dataset obtained from the Southern Pathology Laboratory in Wollongong NSW, Australia. Evaluation results show a classification accuracy of 87.13%, a sensitivity of 94.1% and a specificity of 80.22%.The sensitivity and specificity results are comparable to those obtained by dermatologists.
AUTOMATIC BREAST CANCER DETECTION WITH OPTIMIZED ENSEMBLE OF CLASSIFIERSIAEME Publication
Many people are being affected by breast cancer. When numerous procedures are used to diagnose breast cancer, such as clump thickness, cell size uniformity, cell shape homogeneity, and so on, the end outcome might be challenging to get, even for medical professionals. Therefore, an automatic breast cancer detection model is developed in this research work. This research utilizes four key steps to construct an intelligent breast cancer detection approach: "(a) pre-processing, (b) segmentation, (c) feature extraction, and (d) classification". The provided input image is first pre-processed using the median filtering approach and “Contrast Limited Adaptive Histogram Equalization (CLAHE)”. Then, Chebyshev Distanced- Fuzzy C-Means Clustering (CD-FCM) is used to segment the pre-processed image for ROI recognition. The Augumented Local Vector Pattern (ALVP), Shape features, and “Gray-level Co-occurrence Matrix (GLCM)” are then extracted from the recognized ROI regions. The Improved information gain is used to choose the most optimum features from the retrieved features. Finally, an ensemble classification approach is used to complete the classification process. The “CNN-GRU [Gated Recurrent Units (GRU)-Convolutional Neural Networks (CNN)], Support Vector Machines (SVM), Random Forest (RF), and K-Nearest Neighbours (KNN)” are all included in this ensemble classification approach. With the specified relevant characteristics, SVM, RF, and KNN are trained. The last decision maker is the CNNGRU, which is trained using the results of SVM, RF, and KNN. The weight function of CNN-GRU is improved utilizing a newly created hybrid algorithm-Slimemould Updated Wildbeast Optimization (SUWO) formulated by integrating the principles of both Slime mould algorithm (SMA) and Wildebeest herd optimization (WHO), respectively, in order to improve the detection accuracy of CNN-GRU. Finally, a comparative evaluation is undergone to validate the efficiency of the projected model.
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...IJECEIAES
Medical image analysis has witnessed significant advancements with deep learning techniques. In the domain of brain tumor segmentation, the ability to
precisely delineate tumor boundaries from magnetic resonance imaging (MRI)
scans holds profound implications for diagnosis. This study presents an ensemble convolutional neural network (CNN) with transfer learning, integrating
the state-of-the-art Deeplabv3+ architecture with the ResNet18 backbone. The
model is rigorously trained and evaluated, exhibiting remarkable performance
metrics, including an impressive global accuracy of 99.286%, a high-class accuracy of 82.191%, a mean intersection over union (IoU) of 79.900%, a weighted
IoU of 98.620%, and a Boundary F1 (BF) score of 83.303%. Notably, a detailed comparative analysis with existing methods showcases the superiority of
our proposed model. These findings underscore the model’s competence in precise brain tumor localization, underscoring its potential to revolutionize medical
image analysis and enhance healthcare outcomes. This research paves the way
for future exploration and optimization of advanced CNN models in medical
imaging, emphasizing addressing false positives and resource efficiency.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
More Related Content
Similar to Breast cancer classification with histopathological image based on machine learning
This document summarizes Mansi Chowkkar's MSc research project on detecting breast cancer from histopathological images using deep learning and transfer learning. The research aims to classify images as malignant or benign with high accuracy and efficiency. It implements CNN and DenseNet-121 models, with the latter using transfer learning with pre-trained ImageNet weights. The research achieved 90.9% test accuracy with CNN and 88.03% accuracy with transfer learning. Related work discusses deep learning applications in healthcare, image processing techniques, prior research on DenseNet, and the use of transfer learning for medical image classification with limited data.
IRJET- Breast Cancer Detection from Histopathology Images: A ReviewIRJET Journal
This document provides a review of techniques for detecting breast cancer from histopathology images. It discusses how histopathology examines tissue samples under a microscope to study diseases at a microscopic level. Detecting cell nuclei is an important first step, as is identifying mitosis (cell division) and metastasis (cancer spreading). The document reviews several techniques that use convolutional neural networks to automatically analyze histopathology images and detect breast cancer, including techniques for nuclei detection and segmentation. These automatic methods aim to assist pathologists by improving efficiency and reducing human error compared to manual analysis.
A deep learning framework for accurate diagnosis of colorectal cancer using h...IJECEIAES
Colorectal cancer (CRC) is one of the most prevalent malignancies worldwide, with high mortality and incidence rates. Early detection of the disease may increase the probability of survival, making it critical to develop effective procedures for precise treatment. In the past few years, there has been an increased use of deep learning techniques in image classification that aid in the detection of various types of cancer. In this study, convolutional neural network (CNN) models were used to classify colorectal cancer into benign and malignant. After applying various data preprocessing techniques to the image dataset, we evaluated our prototypes using three distinct subsets of testing data, representing 20%, 30%, and 40% of the total dataset. Additionally, four pre-trained CNN models (ResNet-18, ResNet-50, GoogLeNet, and MobileNetV2) were trained, and the network architectural techniques were compared by applying the Adam optimizer. Finally, we assessed the performance of algorithms in terms of accuracy, sensitivity, specificity, precision, F1-score, and area under the receiver operating characteristic (ROC) curve (AUC). In this research, deep learning approaches demonstrated high efficacy in accurately diagnosing colorectal cancer. This indicates that these techniques have an important and significant value for advancing medical research.
A Progressive Review on Early Stage Breast Cancer DetectionIRJET Journal
This document provides a progressive review of techniques for early stage breast cancer detection. It discusses various image segmentation methods used in previous research like threshold-based, region-based, edge-based, and clustering-based segmentation. Feature extraction techniques discussed include gray level co-occurrence matrix, principal component analysis, and linear discriminant analysis. Classifiers reviewed are convolutional neural networks, support vector machines, artificial neural networks, random forests, and decision trees. The paper analyzes the merits and limitations of these techniques and identifies convolutional neural networks and artificial neural networks as providing high accuracy for breast cancer classification from images.
A novel approach to jointly address localization and classification of breast...IJECEIAES
Localization of the cancerous region as well as classification of the type of the cancer is highly inter-linked with each other. However, investigation towards existing approaches depicts that these problems are always iindividually solved where there is still a big research gap for a generalized solution towards addressing both the problems. Therefore, the proposed manuscript presents a simple, novel, and less-iterative computational model that jointly address the localization-classification problems taking the case study of early diagnosis of breast cancer. The proposed study harnesses the potential of simple bio-inspired optimization technique in order to obtained better local and global best outcome to confirm the accuracy of the outcome. The study outcome of the proposed system exhibits that proposed system offers higher accuracy and lower response time in contrast with other existing classifiers that are freqently witnessed in existing approaches of classification in medical image process.
Breast Cancer Detection from Mammography Images Using Machine Learning Algorithms (U-Net Segmentation and Dense Net Classifier implementation are in progress)
Early detection of breast cancer using mammography images and software engine...TELKOMNIKA JOURNAL
The breast cancer has affected a wide region of women as a particular case. Therefore, different researchers have focused on the early detection of this disease to overcome it in efficient way. In this paper, an early breast cancer detection system has been proposed based on mammography images. The proposed system adopts deep-learning technique to increase the accuracy of detection. The convolutional neural network (CNN) model is considered for preparing the datasets of training and test. It is important to note that the software engineering process model has been adopted in constructing the proposed algorithm. This is to increase the reliably, flexibility and extendibility of the system. The user interfaces of the system are designed as a website used at country side general purpose (GP) health centers for early detection to the disease under lacking in specialist medical staff. The obtained results show the efficiency of the proposed system in terms of accuracy up to more than 90% and decrease the efforts of medical staff as well as helping the patients. As a conclusion, the proposed system can help patients by early detecting the breast cancer at far places from hospital and referring them to nearest specialist center.
Quantitative Comparison of Artificial Honey Bee Colony Clustering and Enhance...idescitation
This paper introduces a comparison of two popular
clustering algorithms for breast DCE-MRI segmentation
purpose. Magnetic resonance imaging (MRI) is an advanced
medical imaging technique providing rich information about
the human soft tissue anatomy. The goal of breast magnetic
resonance image segmentation is to accurately identify the
principal mass or lesion structures in these image volumes.
There are many methods that exist to segment the breast
DCE-MR images. One of these, K-means clustering procedure
provides effective solutions in many science and engineering
fields. They are especially popular in the pattern classification
and signal processing areas and can segment the breast DCE-
MRI with high precision. The artificial bee colony (ABC)
algorithm is a new, very simple and robust population based
optimization algorithm that is inspired by the intelligent
behavior of honey bee swarms. This paper compares the
performance of two image segmentation techniques in the
subject of breast DCE-MR image. In the experiments, the
real dynamic contrast enhanced magnetic resonance images
(DCE- MRI) are used. Results show that artificial bee colony
algorithm performs better in terms of segmentation accuracy,
robustness and speed of computation.
At the 35th AICC-RCOG Annual Conference in association with FOGSI and MOGS, Dr. Niranjan Chavan, President of MOGS, gave an address on Artificial Intelligence in Gynaecologic Oncology at Taj Lands' End, Bandra, Mumbai on the 6th November 2022
Predictive modeling for breast cancer based on machine learning algorithms an...IJECEIAES
Breast cancer is one of the leading causes of death among women worldwide. However, early prediction of breast cancer plays a crucial role. Therefore, strong needs exist for automatic accurate early prediction of breast cancer. In this paper, machine learning (ML) classifiers combined with features selection methods are used to build an intelligent tool for breast cancer prediction. The Wisconsin diagnostic breast cancer (WDBC) dataset is used to train and test the model. Classification algorithms, including support vector machine (SVM), light gradient boosting machine (LightGBM), random forest (RF), logistic regression (LR), k-nearest neighbors (k-NN), and naïve Bayes, were employed. Performance measures for each of them were obtained, namely: accuracy, precision, recall, F-score, Kappa, Matthews correlation coefficient (MCC), and time. The results indicate that without feature selection, LightGBM achieves the highest accuracy at 95%. With minimum redundancy maximum relevance (mRMR) feature selection (15 features), LightGBM outperforms other classifiers, achieving an accuracy of 98%. For Pearson correlation coefficient feature selection (15 features), LightGBM also excels with a 95% accuracy rate. Lasso feature selection (5 features) produces varied results across classifiers, with logistic regression achieving the highest accuracy at 96%. These findings underscore the importance of feature selection in refining model performance and in improving detection for breast cancer.
A deep convolutional structure-based approach for accurate recognition of ski...IJECEIAES
One-third of all cancer diagnoses worldwide are skin malignancies. One of the most common tumors, skin cancer can develop from a variety of dermatological conditions and is subdivided into different categories based on its textile, color, body, and other morphological characteristics. The most effective strategy to lower the mortality rate of melanoma is early identification because skin cancer incidence has been on the rise recently. In order to categorize dermoscopy images into the four diagnosis classifications of melanoma, benign, malignant, and human against machine (HAM) not melanoma, this research suggests a computer-aided diagnosis (CAD) system. Experimental results show that the suggested approach enabled 97.25% classification accuracy. In order to automate the identification of skin cancer and expedite the diagnosis process in order to save a life, the proposed technique offers a less complex and cutting-edge framework.
The document describes a student project that aims to classify and detect lung disorders from chest x-rays using generative adversarial networks (GANs). It provides an overview of the problem statement, literature review on existing methodologies, and the proposed system architecture. The problem is that early diagnosis of lung diseases is important but current methods are time-consuming. The project aims to develop a hybrid GAN model to overcome issues in existing models and accurately detect lung disorders from x-rays. It will classify diseases like pneumonia and lung cancer.
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...IJERA Editor
This document presents a methodology for classifying masses in digital mammograms using feature extraction and the Possibilistic Fuzzy C Means (PFCM) clustering algorithm and Support Vector Machine (SVM) classification. The methodology involves preprocessing mammogram images using adaptive median filtering and image enhancement to reduce noise. Features such as texture, entropy, and correlation are then extracted. PFCM clustering is used to classify pixels into abnormal and normal regions based on their features. Finally, SVM classification is applied to the extracted features to classify mammograms as normal, benign, or malignant. The methodology aims to help radiologists more accurately detect abnormalities in mammograms at an earlier stage than traditional analysis.
Class imbalance is a pervasive issue in the field of disease classification from
medical images. It is necessary to balance out the class distribution while training a model. However, in the case of rare medical diseases, images from affected
patients are much harder to come by compared to images from non-affected
patients, resulting in unwanted class imbalance. Various processes of tackling
class imbalance issues have been explored so far, each having its fair share of
drawbacks. In this research, we propose an outlier detection based image classification technique which can handle even the most extreme case of class imbalance. We have utilized a dataset of malaria parasitized and uninfected cells. An
autoencoder model titled AnoMalNet is trained with only the uninfected cell images at the beginning and then used to classify both the affected and non-affected
cell images by thresholding a loss value. We have achieved an accuracy, precision, recall, and F1 score of 98.49%, 97.07%, 100%, and 98.52% respectively,
performing better than large deep learning models and other published works.
As our proposed approach can provide competitive results without needing the
disease-positive samples during training, it should prove to be useful in binary
disease classification on imbalanced datasets.
Sigma Xi Student Showcase - Fast Pre-Diagnosis of Breast Cancer Using CNNsMichael Batavia
This document proposes using a convolutional neural network (CNN) to classify breast cancer tumors in lymph nodes and maximize neural network accuracy. The author developed a custom CNN with parallel processing across multiple GPUs that achieved 83% validation accuracy on a breast cancer dataset, outperforming pathologist classification by 10%. A separate experiment found that images with 40x magnification led to statistically better accuracy than 20x magnification images. The CNN architecture, datasets used, training methodology, and results are described to support using CNNs for early and individualized breast cancer diagnosis.
An approach of cervical cancer diagnosis using class weighting and oversampli...TELKOMNIKA JOURNAL
Globally, cervical cancer caused 604,127 new cases and 341,831 deaths in 2020, according to the global cancer observatory. In addition, the number of cervical cancer patients who have no symptoms has grown recently. Therefore, giving patients early notice of the possibility of cervical cancer is a useful task since it would enable them to have a clear understanding of their health state. The use of artificial intelligence (AI), particularly in machine learning, in this work is continually uncovering cervical cancer. With the help of a logit model and a new deep learning technique, we hope to identify cervical cancer using patient-provided data. For better outcomes, we employ Keras deep learning and its technique, which includes class weighting and oversampling. In comparison to the actual diagnostic result, the experimental result with model accuracy is 94.18%, and it also demonstrates a successful logit model cervical cancer prediction.
Titles with Abstracts_2023-2024_Digital Image processing.pdfinfo751436
Digital image processing (DIP) refers to the manipulation of digital images using various algorithms and techniques to enhance or extract information from the images. There are several advantages to using digital image processing:
Enhancement of Image Quality:
DIP allows for the improvement of image quality by reducing noise, correcting distortions, and enhancing details. This is particularly useful in medical imaging, satellite imagery, and surveillance.
Image Restoration:
It helps in restoring images that have been degraded due to factors such as noise, blurring, or compression. Restoration techniques can improve the visual quality of images.
Image Compression:
DIP plays a crucial role in image compression, allowing for the reduction of file sizes while maintaining an acceptable level of image quality. This is essential for efficient storage and transmission of images over networks.
Image Recognition and Computer Vision:
DIP is widely used in computer vision applications for tasks such as object recognition, face detection, and gesture recognition. It enables machines to interpret and understand visual information.
Medical Image Processing:
In medical imaging, DIP is used for tasks like tumor detection, organ segmentation, and image reconstruction. It assists healthcare professionals in diagnosis and treatment planning.
Remote Sensing:
In satellite imagery and remote sensing applications, DIP helps analyze and interpret data for various purposes, including environmental monitoring, agriculture, and disaster management.
Geographic Information Systems (GIS):
DIP is employed in GIS to process and analyze spatial data, enabling the extraction of meaningful information from satellite imagery and maps.
Video Processing:
DIP is used in video processing for tasks such as video compression, object tracking, and motion analysis. It is essential in surveillance systems and video editing.
Authentication and Security:
DIP is utilized in authentication systems, such as fingerprint recognition and iris scanning. It also plays a role in security applications, such as facial recognition for access control.
Automated Inspection and Quality Control:
In industrial settings, DIP is used for automated inspection and quality control. It helps identify defects and ensures the production of high-quality products.
Entertainment and Multimedia:
DIP is integral in various entertainment and multimedia applications, including image and video editing, special effects, and virtual reality.
Scientific Research:
Researchers use DIP in fields such as astronomy, biology, and physics for image analysis, data extraction, and visualization.
In summary, digital image processing offers a wide range of advantages across various fields, contributing to improvements in image quality, information extraction, and automated decision-making processes.
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...IRJET Journal
1) The document reviews various machine learning algorithms and imaging modalities for early-stage breast cancer detection.
2) It discusses studies that have used ultrasound imaging alone or in combination with mammography to detect breast cancers at early stages.
3) Machine learning algorithms like convolutional neural networks have shown promise in classifying ultrasound images to detect breast cancer when trained on large datasets.
PSO-SVM hybrid system for melanoma detection from histo-pathological imagesIJECEIAES
This paper introduces an automated system for skin cancer (melanoma) detection from Histo-pathological images sampled from microscopic slides of skin biopsy. The proposed system is a hybrid system based on Particle Swarm Optimization and Support Vector Machine (PSO-SVM). The features used are extracted from the grayscale image histogram, the co-occurrence matrix and the energy of the wavelet coefficients resulting from the wavelet packet decomposition. The PSO-SVM system selects the best feature set and the best values for the SVM parameters (C and γ) that optimize the performance of the SVM classifier. The system performance is tested on a real dataset obtained from the Southern Pathology Laboratory in Wollongong NSW, Australia. Evaluation results show a classification accuracy of 87.13%, a sensitivity of 94.1% and a specificity of 80.22%.The sensitivity and specificity results are comparable to those obtained by dermatologists.
AUTOMATIC BREAST CANCER DETECTION WITH OPTIMIZED ENSEMBLE OF CLASSIFIERSIAEME Publication
Many people are being affected by breast cancer. When numerous procedures are used to diagnose breast cancer, such as clump thickness, cell size uniformity, cell shape homogeneity, and so on, the end outcome might be challenging to get, even for medical professionals. Therefore, an automatic breast cancer detection model is developed in this research work. This research utilizes four key steps to construct an intelligent breast cancer detection approach: "(a) pre-processing, (b) segmentation, (c) feature extraction, and (d) classification". The provided input image is first pre-processed using the median filtering approach and “Contrast Limited Adaptive Histogram Equalization (CLAHE)”. Then, Chebyshev Distanced- Fuzzy C-Means Clustering (CD-FCM) is used to segment the pre-processed image for ROI recognition. The Augumented Local Vector Pattern (ALVP), Shape features, and “Gray-level Co-occurrence Matrix (GLCM)” are then extracted from the recognized ROI regions. The Improved information gain is used to choose the most optimum features from the retrieved features. Finally, an ensemble classification approach is used to complete the classification process. The “CNN-GRU [Gated Recurrent Units (GRU)-Convolutional Neural Networks (CNN)], Support Vector Machines (SVM), Random Forest (RF), and K-Nearest Neighbours (KNN)” are all included in this ensemble classification approach. With the specified relevant characteristics, SVM, RF, and KNN are trained. The last decision maker is the CNNGRU, which is trained using the results of SVM, RF, and KNN. The weight function of CNN-GRU is improved utilizing a newly created hybrid algorithm-Slimemould Updated Wildbeast Optimization (SUWO) formulated by integrating the principles of both Slime mould algorithm (SMA) and Wildebeest herd optimization (WHO), respectively, in order to improve the detection accuracy of CNN-GRU. Finally, a comparative evaluation is undergone to validate the efficiency of the projected model.
Similar to Breast cancer classification with histopathological image based on machine learning (20)
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...IJECEIAES
Medical image analysis has witnessed significant advancements with deep learning techniques. In the domain of brain tumor segmentation, the ability to
precisely delineate tumor boundaries from magnetic resonance imaging (MRI)
scans holds profound implications for diagnosis. This study presents an ensemble convolutional neural network (CNN) with transfer learning, integrating
the state-of-the-art Deeplabv3+ architecture with the ResNet18 backbone. The
model is rigorously trained and evaluated, exhibiting remarkable performance
metrics, including an impressive global accuracy of 99.286%, a high-class accuracy of 82.191%, a mean intersection over union (IoU) of 79.900%, a weighted
IoU of 98.620%, and a Boundary F1 (BF) score of 83.303%. Notably, a detailed comparative analysis with existing methods showcases the superiority of
our proposed model. These findings underscore the model’s competence in precise brain tumor localization, underscoring its potential to revolutionize medical
image analysis and enhance healthcare outcomes. This research paves the way
for future exploration and optimization of advanced CNN models in medical
imaging, emphasizing addressing false positives and resource efficiency.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
Advanced control scheme of doubly fed induction generator for wind turbine us...IJECEIAES
This paper describes a speed control device for generating electrical energy on an electricity network based on the doubly fed induction generator (DFIG) used for wind power conversion systems. At first, a double-fed induction generator model was constructed. A control law is formulated to govern the flow of energy between the stator of a DFIG and the energy network using three types of controllers: proportional integral (PI), sliding mode controller (SMC) and second order sliding mode controller (SOSMC). Their different results in terms of power reference tracking, reaction to unexpected speed fluctuations, sensitivity to perturbations, and resilience against machine parameter alterations are compared. MATLAB/Simulink was used to conduct the simulations for the preceding study. Multiple simulations have shown very satisfying results, and the investigations demonstrate the efficacy and power-enhancing capabilities of the suggested control system.
Neural network optimizer of proportional-integral-differential controller par...IJECEIAES
Wide application of proportional-integral-differential (PID)-regulator in industry requires constant improvement of methods of its parameters adjustment. The paper deals with the issues of optimization of PID-regulator parameters with the use of neural network technology methods. A methodology for choosing the architecture (structure) of neural network optimizer is proposed, which consists in determining the number of layers, the number of neurons in each layer, as well as the form and type of activation function. Algorithms of neural network training based on the application of the method of minimizing the mismatch between the regulated value and the target value are developed. The method of back propagation of gradients is proposed to select the optimal training rate of neurons of the neural network. The neural network optimizer, which is a superstructure of the linear PID controller, allows increasing the regulation accuracy from 0.23 to 0.09, thus reducing the power consumption from 65% to 53%. The results of the conducted experiments allow us to conclude that the created neural superstructure may well become a prototype of an automatic voltage regulator (AVR)-type industrial controller for tuning the parameters of the PID controller.
An improved modulation technique suitable for a three level flying capacitor ...IJECEIAES
This research paper introduces an innovative modulation technique for controlling a 3-level flying capacitor multilevel inverter (FCMLI), aiming to streamline the modulation process in contrast to conventional methods. The proposed
simplified modulation technique paves the way for more straightforward and
efficient control of multilevel inverters, enabling their widespread adoption and
integration into modern power electronic systems. Through the amalgamation of
sinusoidal pulse width modulation (SPWM) with a high-frequency square wave
pulse, this controlling technique attains energy equilibrium across the coupling
capacitor. The modulation scheme incorporates a simplified switching pattern
and a decreased count of voltage references, thereby simplifying the control
algorithm.
A review on features and methods of potential fishing zoneIJECEIAES
This review focuses on the importance of identifying potential fishing zones in seawater for sustainable fishing practices. It explores features like sea surface temperature (SST) and sea surface height (SSH), along with classification methods such as classifiers. The features like SST, SSH, and different classifiers used to classify the data, have been figured out in this review study. This study underscores the importance of examining potential fishing zones using advanced analytical techniques. It thoroughly explores the methodologies employed by researchers, covering both past and current approaches. The examination centers on data characteristics and the application of classification algorithms for classification of potential fishing zones. Furthermore, the prediction of potential fishing zones relies significantly on the effectiveness of classification algorithms. Previous research has assessed the performance of models like support vector machines, naïve Bayes, and artificial neural networks (ANN). In the previous result, the results of support vector machine (SVM) were 97.6% more accurate than naive Bayes's 94.2% to classify test data for fisheries classification. By considering the recent works in this area, several recommendations for future works are presented to further improve the performance of the potential fishing zone models, which is important to the fisheries community.
Electrical signal interference minimization using appropriate core material f...IJECEIAES
As demand for smaller, quicker, and more powerful devices rises, Moore's law is strictly followed. The industry has worked hard to make little devices that boost productivity. The goal is to optimize device density. Scientists are reducing connection delays to improve circuit performance. This helped them understand three-dimensional integrated circuit (3D IC) concepts, which stack active devices and create vertical connections to diminish latency and lower interconnects. Electrical involvement is a big worry with 3D integrates circuits. Researchers have developed and tested through silicon via (TSV) and substrates to decrease electrical wave involvement. This study illustrates a novel noise coupling reduction method using several electrical involvement models. A 22% drop in electrical involvement from wave-carrying to victim TSVs introduces this new paradigm and improves system performance even at higher THz frequencies.
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...IJECEIAES
Climate change's impact on the planet forced the United Nations and governments to promote green energies and electric transportation. The deployments of photovoltaic (PV) and electric vehicle (EV) systems gained stronger momentum due to their numerous advantages over fossil fuel types. The advantages go beyond sustainability to reach financial support and stability. The work in this paper introduces the hybrid system between PV and EV to support industrial and commercial plants. This paper covers the theoretical framework of the proposed hybrid system including the required equation to complete the cost analysis when PV and EV are present. In addition, the proposed design diagram which sets the priorities and requirements of the system is presented. The proposed approach allows setup to advance their power stability, especially during power outages. The presented information supports researchers and plant owners to complete the necessary analysis while promoting the deployment of clean energy. The result of a case study that represents a dairy milk farmer supports the theoretical works and highlights its advanced benefits to existing plants. The short return on investment of the proposed approach supports the paper's novelty approach for the sustainable electrical system. In addition, the proposed system allows for an isolated power setup without the need for a transmission line which enhances the safety of the electrical network
Bibliometric analysis highlighting the role of women in addressing climate ch...IJECEIAES
Fossil fuel consumption increased quickly, contributing to climate change
that is evident in unusual flooding and draughts, and global warming. Over
the past ten years, women's involvement in society has grown dramatically,
and they succeeded in playing a noticeable role in reducing climate change.
A bibliometric analysis of data from the last ten years has been carried out to
examine the role of women in addressing the climate change. The analysis's
findings discussed the relevant to the sustainable development goals (SDGs),
particularly SDG 7 and SDG 13. The results considered contributions made
by women in the various sectors while taking geographic dispersion into
account. The bibliometric analysis delves into topics including women's
leadership in environmental groups, their involvement in policymaking, their
contributions to sustainable development projects, and the influence of
gender diversity on attempts to mitigate climate change. This study's results
highlight how women have influenced policies and actions related to climate
change, point out areas of research deficiency and recommendations on how
to increase role of the women in addressing the climate change and
achieving sustainability. To achieve more successful results, this initiative
aims to highlight the significance of gender equality and encourage
inclusivity in climate change decision-making processes.
Voltage and frequency control of microgrid in presence of micro-turbine inter...IJECEIAES
The active and reactive load changes have a significant impact on voltage
and frequency. In this paper, in order to stabilize the microgrid (MG) against
load variations in islanding mode, the active and reactive power of all
distributed generators (DGs), including energy storage (battery), diesel
generator, and micro-turbine, are controlled. The micro-turbine generator is
connected to MG through a three-phase to three-phase matrix converter, and
the droop control method is applied for controlling the voltage and
frequency of MG. In addition, a method is introduced for voltage and
frequency control of micro-turbines in the transition state from gridconnected mode to islanding mode. A novel switching strategy of the matrix
converter is used for converting the high-frequency output voltage of the
micro-turbine to the grid-side frequency of the utility system. Moreover,
using the switching strategy, the low-order harmonics in the output current
and voltage are not produced, and consequently, the size of the output filter
would be reduced. In fact, the suggested control strategy is load-independent
and has no frequency conversion restrictions. The proposed approach for
voltage and frequency regulation demonstrates exceptional performance and
favorable response across various load alteration scenarios. The suggested
strategy is examined in several scenarios in the MG test systems, and the
simulation results are addressed.
Enhancing battery system identification: nonlinear autoregressive modeling fo...IJECEIAES
Precisely characterizing Li-ion batteries is essential for optimizing their
performance, enhancing safety, and prolonging their lifespan across various
applications, such as electric vehicles and renewable energy systems. This
article introduces an innovative nonlinear methodology for system
identification of a Li-ion battery, employing a nonlinear autoregressive with
exogenous inputs (NARX) model. The proposed approach integrates the
benefits of nonlinear modeling with the adaptability of the NARX structure,
facilitating a more comprehensive representation of the intricate
electrochemical processes within the battery. Experimental data collected
from a Li-ion battery operating under diverse scenarios are employed to
validate the effectiveness of the proposed methodology. The identified
NARX model exhibits superior accuracy in predicting the battery's behavior
compared to traditional linear models. This study underscores the
importance of accounting for nonlinearities in battery modeling, providing
insights into the intricate relationships between state-of-charge, voltage, and
current under dynamic conditions.
Smart grid deployment: from a bibliometric analysis to a surveyIJECEIAES
Smart grids are one of the last decades' innovations in electrical energy.
They bring relevant advantages compared to the traditional grid and
significant interest from the research community. Assessing the field's
evolution is essential to propose guidelines for facing new and future smart
grid challenges. In addition, knowing the main technologies involved in the
deployment of smart grids (SGs) is important to highlight possible
shortcomings that can be mitigated by developing new tools. This paper
contributes to the research trends mentioned above by focusing on two
objectives. First, a bibliometric analysis is presented to give an overview of
the current research level about smart grid deployment. Second, a survey of
the main technological approaches used for smart grid implementation and
their contributions are highlighted. To that effect, we searched the Web of
Science (WoS), and the Scopus databases. We obtained 5,663 documents
from WoS and 7,215 from Scopus on smart grid implementation or
deployment. With the extraction limitation in the Scopus database, 5,872 of
the 7,215 documents were extracted using a multi-step process. These two
datasets have been analyzed using a bibliometric tool called bibliometrix.
The main outputs are presented with some recommendations for future
research.
Use of analytical hierarchy process for selecting and prioritizing islanding ...IJECEIAES
One of the problems that are associated to power systems is islanding
condition, which must be rapidly and properly detected to prevent any
negative consequences on the system's protection, stability, and security.
This paper offers a thorough overview of several islanding detection
strategies, which are divided into two categories: classic approaches,
including local and remote approaches, and modern techniques, including
techniques based on signal processing and computational intelligence.
Additionally, each approach is compared and assessed based on several
factors, including implementation costs, non-detected zones, declining
power quality, and response times using the analytical hierarchy process
(AHP). The multi-criteria decision-making analysis shows that the overall
weight of passive methods (24.7%), active methods (7.8%), hybrid methods
(5.6%), remote methods (14.5%), signal processing-based methods (26.6%),
and computational intelligent-based methods (20.8%) based on the
comparison of all criteria together. Thus, it can be seen from the total weight
that hybrid approaches are the least suitable to be chosen, while signal
processing-based methods are the most appropriate islanding detection
method to be selected and implemented in power system with respect to the
aforementioned factors. Using Expert Choice software, the proposed
hierarchy model is studied and examined.
Enhancing of single-stage grid-connected photovoltaic system using fuzzy logi...IJECEIAES
The power generated by photovoltaic (PV) systems is influenced by
environmental factors. This variability hampers the control and utilization of
solar cells' peak output. In this study, a single-stage grid-connected PV
system is designed to enhance power quality. Our approach employs fuzzy
logic in the direct power control (DPC) of a three-phase voltage source
inverter (VSI), enabling seamless integration of the PV connected to the
grid. Additionally, a fuzzy logic-based maximum power point tracking
(MPPT) controller is adopted, which outperforms traditional methods like
incremental conductance (INC) in enhancing solar cell efficiency and
minimizing the response time. Moreover, the inverter's real-time active and
reactive power is directly managed to achieve a unity power factor (UPF).
The system's performance is assessed through MATLAB/Simulink
implementation, showing marked improvement over conventional methods,
particularly in steady-state and varying weather conditions. For solar
irradiances of 500 and 1,000 W/m2
, the results show that the proposed
method reduces the total harmonic distortion (THD) of the injected current
to the grid by approximately 46% and 38% compared to conventional
methods, respectively. Furthermore, we compare the simulation results with
IEEE standards to evaluate the system's grid compatibility.
Enhancing photovoltaic system maximum power point tracking with fuzzy logic-b...IJECEIAES
Photovoltaic systems have emerged as a promising energy resource that
caters to the future needs of society, owing to their renewable, inexhaustible,
and cost-free nature. The power output of these systems relies on solar cell
radiation and temperature. In order to mitigate the dependence on
atmospheric conditions and enhance power tracking, a conventional
approach has been improved by integrating various methods. To optimize
the generation of electricity from solar systems, the maximum power point
tracking (MPPT) technique is employed. To overcome limitations such as
steady-state voltage oscillations and improve transient response, two
traditional MPPT methods, namely fuzzy logic controller (FLC) and perturb
and observe (P&O), have been modified. This research paper aims to
simulate and validate the step size of the proposed modified P&O and FLC
techniques within the MPPT algorithm using MATLAB/Simulink for
efficient power tracking in photovoltaic systems.
Adaptive synchronous sliding control for a robot manipulator based on neural ...IJECEIAES
Robot manipulators have become important equipment in production lines, medical fields, and transportation. Improving the quality of trajectory tracking for
robot hands is always an attractive topic in the research community. This is a
challenging problem because robot manipulators are complex nonlinear systems
and are often subject to fluctuations in loads and external disturbances. This
article proposes an adaptive synchronous sliding control scheme to improve trajectory tracking performance for a robot manipulator. The proposed controller
ensures that the positions of the joints track the desired trajectory, synchronize
the errors, and significantly reduces chattering. First, the synchronous tracking
errors and synchronous sliding surfaces are presented. Second, the synchronous
tracking error dynamics are determined. Third, a robust adaptive control law is
designed,the unknown components of the model are estimated online by the neural network, and the parameters of the switching elements are selected by fuzzy
logic. The built algorithm ensures that the tracking and approximation errors
are ultimately uniformly bounded (UUB). Finally, the effectiveness of the constructed algorithm is demonstrated through simulation and experimental results.
Simulation and experimental results show that the proposed controller is effective with small synchronous tracking errors, and the chattering phenomenon is
significantly reduced.
Remote field-programmable gate array laboratory for signal acquisition and de...IJECEIAES
A remote laboratory utilizing field-programmable gate array (FPGA) technologies enhances students’ learning experience anywhere and anytime in embedded system design. Existing remote laboratories prioritize hardware access and visual feedback for observing board behavior after programming, neglecting comprehensive debugging tools to resolve errors that require internal signal acquisition. This paper proposes a novel remote embeddedsystem design approach targeting FPGA technologies that are fully interactive via a web-based platform. Our solution provides FPGA board access and debugging capabilities beyond the visual feedback provided by existing remote laboratories. We implemented a lab module that allows users to seamlessly incorporate into their FPGA design. The module minimizes hardware resource utilization while enabling the acquisition of a large number of data samples from the signal during the experiments by adaptively compressing the signal prior to data transmission. The results demonstrate an average compression ratio of 2.90 across three benchmark signals, indicating efficient signal acquisition and effective debugging and analysis. This method allows users to acquire more data samples than conventional methods. The proposed lab allows students to remotely test and debug their designs, bridging the gap between theory and practice in embedded system design.
Detecting and resolving feature envy through automated machine learning and m...IJECEIAES
Efficiently identifying and resolving code smells enhances software project quality. This paper presents a novel solution, utilizing automated machine learning (AutoML) techniques, to detect code smells and apply move method refactoring. By evaluating code metrics before and after refactoring, we assessed its impact on coupling, complexity, and cohesion. Key contributions of this research include a unique dataset for code smell classification and the development of models using AutoGluon for optimal performance. Furthermore, the study identifies the top 20 influential features in classifying feature envy, a well-known code smell, stemming from excessive reliance on external classes. We also explored how move method refactoring addresses feature envy, revealing reduced coupling and complexity, and improved cohesion, ultimately enhancing code quality. In summary, this research offers an empirical, data-driven approach, integrating AutoML and move method refactoring to optimize software project quality. Insights gained shed light on the benefits of refactoring on code quality and the significance of specific features in detecting feature envy. Future research can expand to explore additional refactoring techniques and a broader range of code metrics, advancing software engineering practices and standards.
Smart monitoring technique for solar cell systems using internet of things ba...IJECEIAES
Rapidly and remotely monitoring and receiving the solar cell systems status parameters, solar irradiance, temperature, and humidity, are critical issues in enhancement their efficiency. Hence, in the present article an improved smart prototype of internet of things (IoT) technique based on embedded system through NodeMCU ESP8266 (ESP-12E) was carried out experimentally. Three different regions at Egypt; Luxor, Cairo, and El-Beheira cities were chosen to study their solar irradiance profile, temperature, and humidity by the proposed IoT system. The monitoring data of solar irradiance, temperature, and humidity were live visualized directly by Ubidots through hypertext transfer protocol (HTTP) protocol. The measured solar power radiation in Luxor, Cairo, and El-Beheira ranged between 216-1000, 245-958, and 187-692 W/m 2 respectively during the solar day. The accuracy and rapidity of obtaining monitoring results using the proposed IoT system made it a strong candidate for application in monitoring solar cell systems. On the other hand, the obtained solar power radiation results of the three considered regions strongly candidate Luxor and Cairo as suitable places to build up a solar cells system station rather than El-Beheira.
An efficient security framework for intrusion detection and prevention in int...IJECEIAES
Over the past few years, the internet of things (IoT) has advanced to connect billions of smart devices to improve quality of life. However, anomalies or malicious intrusions pose several security loopholes, leading to performance degradation and threat to data security in IoT operations. Thereby, IoT security systems must keep an eye on and restrict unwanted events from occurring in the IoT network. Recently, various technical solutions based on machine learning (ML) models have been derived towards identifying and restricting unwanted events in IoT. However, most ML-based approaches are prone to miss-classification due to inappropriate feature selection. Additionally, most ML approaches applied to intrusion detection and prevention consider supervised learning, which requires a large amount of labeled data to be trained. Consequently, such complex datasets are impossible to source in a large network like IoT. To address this problem, this proposed study introduces an efficient learning mechanism to strengthen the IoT security aspects. The proposed algorithm incorporates supervised and unsupervised approaches to improve the learning models for intrusion detection and mitigation. Compared with the related works, the experimental outcome shows that the model performs well in a benchmark dataset. It accomplishes an improved detection accuracy of approximately 99.21%.
Understanding Inductive Bias in Machine LearningSUTEJAS
This presentation explores the concept of inductive bias in machine learning. It explains how algorithms come with built-in assumptions and preferences that guide the learning process. You'll learn about the different types of inductive bias and how they can impact the performance and generalizability of machine learning models.
The presentation also covers the positive and negative aspects of inductive bias, along with strategies for mitigating potential drawbacks. We'll explore examples of how bias manifests in algorithms like neural networks and decision trees.
By understanding inductive bias, you can gain valuable insights into how machine learning models work and make informed decisions when building and deploying them.
Comparative analysis between traditional aquaponics and reconstructed aquapon...bijceesjournal
The aquaponic system of planting is a method that does not require soil usage. It is a method that only needs water, fish, lava rocks (a substitute for soil), and plants. Aquaponic systems are sustainable and environmentally friendly. Its use not only helps to plant in small spaces but also helps reduce artificial chemical use and minimizes excess water use, as aquaponics consumes 90% less water than soil-based gardening. The study applied a descriptive and experimental design to assess and compare conventional and reconstructed aquaponic methods for reproducing tomatoes. The researchers created an observation checklist to determine the significant factors of the study. The study aims to determine the significant difference between traditional aquaponics and reconstructed aquaponics systems propagating tomatoes in terms of height, weight, girth, and number of fruits. The reconstructed aquaponics system’s higher growth yield results in a much more nourished crop than the traditional aquaponics system. It is superior in its number of fruits, height, weight, and girth measurement. Moreover, the reconstructed aquaponics system is proven to eliminate all the hindrances present in the traditional aquaponics system, which are overcrowding of fish, algae growth, pest problems, contaminated water, and dead fish.
ACEP Magazine edition 4th launched on 05.06.2024Rahul
This document provides information about the third edition of the magazine "Sthapatya" published by the Association of Civil Engineers (Practicing) Aurangabad. It includes messages from current and past presidents of ACEP, memories and photos from past ACEP events, information on life time achievement awards given by ACEP, and a technical article on concrete maintenance, repairs and strengthening. The document highlights activities of ACEP and provides a technical educational article for members.
Literature Review Basics and Understanding Reference Management.pptxDr Ramhari Poudyal
Three-day training on academic research focuses on analytical tools at United Technical College, supported by the University Grant Commission, Nepal. 24-26 May 2024
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELgerogepatton
As digital technology becomes more deeply embedded in power systems, protecting the communication
networks of Smart Grids (SG) has emerged as a critical concern. Distributed Network Protocol 3 (DNP3)
represents a multi-tiered application layer protocol extensively utilized in Supervisory Control and Data
Acquisition (SCADA)-based smart grids to facilitate real-time data gathering and control functionalities.
Robust Intrusion Detection Systems (IDS) are necessary for early threat detection and mitigation because
of the interconnection of these networks, which makes them vulnerable to a variety of cyberattacks. To
solve this issue, this paper develops a hybrid Deep Learning (DL) model specifically designed for intrusion
detection in smart grids. The proposed approach is a combination of the Convolutional Neural Network
(CNN) and the Long-Short-Term Memory algorithms (LSTM). We employed a recent intrusion detection
dataset (DNP3), which focuses on unauthorized commands and Denial of Service (DoS) cyberattacks, to
train and test our model. The results of our experiments show that our CNN-LSTM method is much better
at finding smart grid intrusions than other deep learning algorithms used for classification. In addition,
our proposed approach improves accuracy, precision, recall, and F1 score, achieving a high detection
accuracy rate of 99.50%.
A review on techniques and modelling methodologies used for checking electrom...nooriasukmaningtyas
The proper function of the integrated circuit (IC) in an inhibiting electromagnetic environment has always been a serious concern throughout the decades of revolution in the world of electronics, from disjunct devices to today’s integrated circuit technology, where billions of transistors are combined on a single chip. The automotive industry and smart vehicles in particular, are confronting design issues such as being prone to electromagnetic interference (EMI). Electronic control devices calculate incorrect outputs because of EMI and sensors give misleading values which can prove fatal in case of automotives. In this paper, the authors have non exhaustively tried to review research work concerned with the investigation of EMI in ICs and prediction of this EMI using various modelling methodologies and measurement setups.
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMHODECEDSIET
Time Division Multiplexing (TDM) is a method of transmitting multiple signals over a single communication channel by dividing the signal into many segments, each having a very short duration of time. These time slots are then allocated to different data streams, allowing multiple signals to share the same transmission medium efficiently. TDM is widely used in telecommunications and data communication systems.
### How TDM Works
1. **Time Slots Allocation**: The core principle of TDM is to assign distinct time slots to each signal. During each time slot, the respective signal is transmitted, and then the process repeats cyclically. For example, if there are four signals to be transmitted, the TDM cycle will divide time into four slots, each assigned to one signal.
2. **Synchronization**: Synchronization is crucial in TDM systems to ensure that the signals are correctly aligned with their respective time slots. Both the transmitter and receiver must be synchronized to avoid any overlap or loss of data. This synchronization is typically maintained by a clock signal that ensures time slots are accurately aligned.
3. **Frame Structure**: TDM data is organized into frames, where each frame consists of a set of time slots. Each frame is repeated at regular intervals, ensuring continuous transmission of data streams. The frame structure helps in managing the data streams and maintaining the synchronization between the transmitter and receiver.
4. **Multiplexer and Demultiplexer**: At the transmitting end, a multiplexer combines multiple input signals into a single composite signal by assigning each signal to a specific time slot. At the receiving end, a demultiplexer separates the composite signal back into individual signals based on their respective time slots.
### Types of TDM
1. **Synchronous TDM**: In synchronous TDM, time slots are pre-assigned to each signal, regardless of whether the signal has data to transmit or not. This can lead to inefficiencies if some time slots remain empty due to the absence of data.
2. **Asynchronous TDM (or Statistical TDM)**: Asynchronous TDM addresses the inefficiencies of synchronous TDM by allocating time slots dynamically based on the presence of data. Time slots are assigned only when there is data to transmit, which optimizes the use of the communication channel.
### Applications of TDM
- **Telecommunications**: TDM is extensively used in telecommunication systems, such as in T1 and E1 lines, where multiple telephone calls are transmitted over a single line by assigning each call to a specific time slot.
- **Digital Audio and Video Broadcasting**: TDM is used in broadcasting systems to transmit multiple audio or video streams over a single channel, ensuring efficient use of bandwidth.
- **Computer Networks**: TDM is used in network protocols and systems to manage the transmission of data from multiple sources over a single network medium.
### Advantages of TDM
- **Efficient Use of Bandwidth**: TDM all
Breast cancer classification with histopathological image based on machine learning
1. International Journal of Electrical and Computer Engineering (IJECE)
Vol. 13, No. 5, October 2023, pp. 5885~5897
ISSN: 2088-8708, DOI: 10.11591/ijece.v13i5.pp5885-5897 5885
Journal homepage: http://ijece.iaescore.com
Breast cancer classification with histopathological image based
on machine learning
Jia Rong Leow, Wee How Khoh, Ying Han Pang, Hui Yen Yap
Faculty of Information Science and Technology, Multimedia University, Malacca, Malaysia
Article Info ABSTRACT
Article history:
Received Nov 15, 2022
Revised Mar 29, 2023
Accepted Apr 7, 2023
Breast cancer represents one of the most common reasons for death in the
worldwide. It has a substantially higher death rate than other types of cancer.
Early detection can enhance the chances of receiving proper treatment and
survival. In order to address this problem, this work has provided a
convolutional neural network (CNN) deep learning (DL) based model on the
classification that may be used to differentiate breast cancer histopathology
images as benign or malignant. Besides that, five different types of pre-trained
CNN architectures have been used to investigate the performance of the model
to solve this problem which are the residual neural network-50 (ResNet-50),
visual geometry group-19 (VGG-19), Inception-V3, and AlexNet while the
ResNet-50 is also functions as a feature extractor to retrieve information from
images and passed them to machine learning algorithms, in this case, a random
forest (RF) and k-nearest neighbors (KNN) are employed for classification. In
this paper, experiments are done using the BreakHis public dataset. As a
result, the ResNet-50 network has the highest test accuracy of 97% to classify
breast cancer images.
Keywords:
Breast cancer classification
Convolution neural network
Image processing
Machine learning
Transfer learning
This is an open access article under the CC BY-SA license.
Corresponding Author:
Wee How Khoh
Faculty of Information Science and Technology, Multimedia University
St. Ayer Keroh Lama, Bukit Beruang, 75450, Melaka, Malaysia
Email: whkhoh@mmu.edu.my
1. INTRODUCTION
Breast cancer has a high death rate [1]. However, nowadays the chances of cure are excellent with the
ongoing advancement of advanced treatment levels and the upgrading of equipment. Current breast-conserving
surgery has a therapeutic result comparable to major resection and modified radical surgery, which not only
protects the structural integrity of women’s breasts but also reduces their physical and psychological trauma.
[2]. Early identification of breast cancer has a greater survival percentage than medium and late stages. Due to
the fact that the cancer cells in the early stages of breast cancer have not disseminated, they are amenable to
treatment with local surgery, radiation, chemotherapy, hormone therapy, and other comprehensive treatments
with a very high cure rate.
Current medical technology and pharmaceuticals for treating breast cancer have advanced
significantly in comparison to the past, as it has the rate of early identification of breast cancer. As a result,
early diagnosis of breast cancer may boost the success rate of therapy and assist to decrease the death rate of
breast cancer patients [3]. This work proposes the use of pre-trained convolutional neural network (CNN)
algorithms for distinguishing breast cancer histological images of patients. BreakHis is the dataset that employed in
this study. This paper also demonstrates how the CNN performance could be mixed with machine learning methods
for classification. Finally, a state-of-art result of the models to categorize breast cancer as benign or malignant,
and compared the performances of all of the pre-trained models that are employed.
2. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 5, October 2023: 5885-5897
5886
2. RELATED WORK
Reza and Ma [4] employed CNN to classify imbalanced data from histopathological photos of
malignant and non-cancerous tissues. The dataset is heavily imbalanced since the malignant class image is
about 3 times as much as the benign class. The oversampling and under-sampling methods were used to address
this imbalance problem. Under sampling resamples the data by eliminating majority classes until the minority
class reaches a predetermined proportion of the majority class. However, it may result in the loss of important
information. Utilizing oversampling to replicate samples from the minority class. The results demonstrated that
oversampling provided the best recognition rate of 86.13% and the oversampling method beat the
under-sampling strategy in all circumstances.
Hirra et al. [5], employed a deep belief network (DBN) to create a patch-based deep learning system
for recognizing breast cancer. DBN is constructed by stacking the restricted Boltzmann machine (RBM) layers
and initializing the weights in the network with a greedy learning method. Using the two hidden layers,
histopathological image characteristics were retrieved. The first RBM was constructed by joining the input
layer and the first hidden layer, whereas the second RBM was built by merging the first and second hidden
layers. The PA-DBN-BC model produced an accuracy of 86%.
Zou et al. [6] presented a breast cancer histopathology image classification model based on
GoogLeNet’s Inception V1 CNN model. The global average pooling (GAP) technique was used to integrate
images of any size into CNN, while the spatial pyramid pooling (SPP) approach was employed to integrate
those images into the network and to a certain number of nodes by showing the feature map in the final
convolution layer. Overall, the findings indicated that GAP outperformed SPP because it consisted of fewer
parameters, trains quicker, and could reduce overfitting.
Spanhol et al. [7] improved the pre-trained networks of various CNN types by enriching the dataset
using transfer learning by splitting it into numerous patches. It discovered the optimal results of 85% of
accuracy. This strategy successfully handled the problems of computation time and a small training set. In the
instance of residual neural network (ResNet), this method yielded a maximum accuracy of 85%.
Alzubaidi et al. [8] designed a network with two branches to create a deep network with adequate
feature representation for feature extraction and the capacity to accomplish the job with high performance. It
was created by expanding the network’s breadth and depth without considerably increasing its computational
cost. In terms of accuracy, it outperformed earlier approaches and was proven to be more resistant to change
in cell volume, color, and structure, which achieved 83.6% for patch-wise classification accuracy and 91.3%
for image-wise classification accuracy.
Wang et al. [9] proposed a prototype transfer generative adversarial network (PTGAN) to bridge the
domain disparity at the style and feature levels. According to the experimental outcomes, the PTGAN achieved
around 90% accuracy in categorizing benign and malignant tissues. Xu et al. [10] proposed a stacked sparse
autoencoder architecture for detecting breast cancer nuclei. The authors compared the stacked sparse
autoencoder to other encoders in terms of encoding high-level information from image pixels. Stacked sparse
autoencoder with softmax classifier was also tested. In a cohort of 500 histopathology pictures and around
3,500 humanely created distinct nuclei, SSAE had an F-measure of 84.49% and an average area under
precision-recall curve (AveP) of 78.83%.
Ahmad et al. [11] adapted an existing CNN architecture AlexNet created to classify breast cancer.
The training algorithms were designed to extract patches obtained randomly and used a sliding window strategy
to allow them to handle high-resolution pictures without modifying the CNN design. The trials demonstrated
that CNN outperformed other models on the same dataset, and they also combined CNNs using fusion rules,
resulting in a modest increase in recognition rates. Besides that, Parvin et al. [12] classified breast cancer
histopathology pictures using different types of CNN architectures such as LeNet-5, AlexNet, VGG-16,
ResNet-50, and Inception-vl. The Inception-vl network scored the highest test accuracy.
Nguyen et al. [13] are scaling the original images in order to construct a CNN model and categorize
breast cancer classes. A building model achieves an accuracy of 73.68%, which is satisfactory in relation to
the image’s unique characteristics. On the other hand, deep learning can extract features from a dataset without
the need to construct feature extractors.
Besides that, Anupama et al. [14] provided several CNN classification designs, including AlexNet,
Inception-Net, and ResNet. Due to the limitations of standard CNN, capsule networks that capture spatial and
orientational information were selected as the architecture. This research reveals that histology image
preprocessing increases the capsule network model’s precision. It is evident from this study that the
performance of traditional designs may be enhanced by data preprocessing and parameter adjustment.
Moreover, Ahmed et al. [15] suggested using a single CNN model to concurrently accept input from
the same image at four different magnification levels and a CNN model with multiple inputs to classify breast
histopathology images as benign or malignant. The model presented here has fewer parameters than earlier
3. Int J Elec & Comp Eng ISSN: 2088-8708
Breast cancer classification with histopathological image based on machine learning (Jia Rong Leow)
5887
complex models. It allows the model to be trained quicker than other models. In addition, real-time picture
enhancement minimizes storage complexity.
Xu and Dong [16], have conduct transfer learning research in breast cancer image categorization.
Most feature-representation-transfer algorithms for inductive transfer learning strive to eliminate domain
differences. It also provided a method to extract characteristics from breast cancer pictures utilizing multiple
pre-trained neural networks, GoogLeNet, visual geometry group neural network (VGGNet) and residual neural
network (ResNet). The classifier was then trained using feature data. GoogLeNet, VGGNet, ResNet, and
proposed network obtained 93.5%, 94.15%, 94.35%, and 97.525% classification accuracy. DCNN was used
for feature extraction, while is method was used for feature selection. The algorithm’s reduced calculation cost
seems promising. In this survey, we observed fine-tuned pre-trained network achieves greater performance
than full-trained network. Because it is challenging to get substantial, well-annotated data for breast cancer
classification network training.
Naderan and Zaychenko [17], proposed a convolutional autoencoders were developed to minimize
model complexity, parameter number, and overfitting. This work trained autoencoder models using multiple
architectures and hyperparameters. The model could rebuild the original picture with 84.72% accuracy with
37% noise of input images. Convolutional autoencoder contains fewer parameters than DenseNet, preventing
overfitting. The autoencoder is unsupervised and does not require labelled data. In the experiment, input photos
included 37% noise. The model could recreate pictures from input data.
After that, Ukwuoma et al. [18] provided an attention mechanism based deep-learning model. The
suggested attention method uses pre-trained models as inputs and a multilinear perceptron and softmax as
outputs. The proposed model combined DenseNet-201 and VGG-16 for feature extractors into the attention
mechanism, yielding more accuracy than pre-trained and ensemble models. The accuracy of classification
increased by 1% to 6%.
3. METHOD
3.1. Convolutional neural network models
CNN’s [19] are deep learning algorithms in image recognition and processing. A CNN is similar to a
multilayer perceptron in that it is based on the same technology as a multilayer perceptron but is optimized for
minimal processing needs. A CNN consists of the input layer, output layer, and a hidden layer comprising
several convolutional, pooling, fully connected, and normalizing layers. Figure 1 illustrates a sample of
conventional CNN architecture. A brief explanation of each layer is included in the following sections.
Figure 1. Sample of CNN architecture [20]
3.1.1. Convolutional layer
CNN’s essential component is the convolutional layer. It bears the lion’s share of the network’s
computational cost. Besides that, this layer performs a dot product on two matrices, one of which includes the
kernel’s set of learnable parameters and the other of which contains the receptive field’s restricted area. Kernels
are smaller than pictures yet contain a higher amount of information. So, if the image is under three red green
blue (RGB) channels, the kernel height and width are minimized in terms of spatial dimensions, but the depth
includes all three channels.
3.1.2. Pooling layer
A pooling layer plays an important role in CNNs in reducing the complexity of the feature maps
produced by the convolutional layer. This is typically achieved through down sampling process which involves
summarizing the local features of the filter’s output. The pooling process operates on a small window of the input
data, selecting a pre-defined value (commonly average or maximum value) in the window to create a simplified
4. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 5, October 2023: 5885-5897
5888
and smaller output feature map. Figure 2 depicts examples of the two most common pooling used in CNN,
maximum pooling (left), and average pooling (right). The maximum pooling selects the maximum value in the
window (2×2 pool size), emphasizing the most salient feature in the windows while the average pooling
computes the mean value in each window to reduce the impact of small variations in the input data. Both
possess the effect of reducing the resolution of the output feature map by a factor of the pooling window size.
Figure 2. Max and average pooling [21]
3.1.3. Fully-connected layer (FC layer)
As with feedforward neural networks, neurons in the fully-connected layers are entirely dependent on
all activations in the previous layer. FC layers are always the last to be implemented in a network. This layer
usually receives input from the pooling layer that has been flattened after the image’s features have been
extracted. The output is then sent to the layer responsible for picture classification.
3.2. Proposed models
The pre-trained CNN models utilized for the BreakHis dataset in this study include the ResNet-50,
VGG-19, AlexNet, and Inception-v3. Besides, the ResNet-50 is also used as a feature extraction to obtain the
feature from the BreakHis images, then the extracted features are fed into the (RF) random forest and k-nearest
neighbors (k-NN) for classification. The architectures of the adopted pre-trained CNN models are described in
the subsequence sections.
3.2.1. ResNet-50
The ResNet-50 has 50 layers. The input components of the ResNet network are all made up of a huge
convolution kernel with a stride of 2 and a maximum pooling with a size of 3×3 and a stride of 2. In this phase,
the input image must be changed to 224×224×3 dimensions. It contains five convolutional layers for producing
feature maps, a rectified linear unit (ReLU) layer for dimensionality reduction, a FC layer, and a softmax
activation function for categorizing the images of the breast cancer dataset into two categories (benign or
malignant). The first convolution layer provides low-level properties like color, edges, and gradient operation
while the deeper convolution layer delivers high-level characteristics of the input data. When the input is
received, it processes by the first convolutional layer, which contains 7×7 with 64 filters and has a stride value
of 2. The input size is then reduced to 112×112 and the data is processed through the following convolutional
layers: 1×1 with 64 filters, 3×3 with 64 filters, and 1×1 with 256 filter stride values of 3. The image is then
scaled down to 56×56 pixels and sent through the next convolutional layer, which is 1×1 with 128 filters, 3×3
with 128 filters, and 1×1 with 512 filters with a stride value of 4. The dimension is then reduced to 28×28 and
the dimension is transmitted through the next convolutional layer, which is 1×1 with 256 filters, 3×3 with 256
filters, and 1×1 with 1,024 filters with a stride value of 6. The image is then reduced again to a size of 14×14
and transmitted through the next convolutional layer, which is 1×1 with 512 filters, 3×3 with 512 filters, and
then 1×1 with 2,048 filters with a stride value of 3.
In the implementation phase, the final layer’s block is made trainable alone, and the learning rate is
set as 0.00001 to compile the frozen and unfrozen top blocks. The epochs are repeated 15 times, and the data
batch size is set as 16. The ResNet-50 architecture is shown in Figure 3 and the configuration of the model is
tabled in Table 1.
5. Int J Elec & Comp Eng ISSN: 2088-8708
Breast cancer classification with histopathological image based on machine learning (Jia Rong Leow)
5889
Figure 3. ResNet-50 architecture
Table 1. Configuration of ResNet-50
ConvNet configuration of Resnet-50
50 layers
Input (224×224×3 RGB image)
Conv1
7×7, 64, stride 2
3×3 max pool, stride 2
Conv2
1×1, 64
3×3, 64×3
1×1, 256
Conv3
1×1, 128
3×3, 128×4
1×1, 512
Conv4
1×1, 256
3×3, 256×6
1×1, 1024
Conv5
1×1, 512
3×3, 512×3
1×1, 2048
Flatten (100352)
FC (64)
soft-max
3.2.2. ResNet-50 with machine learning classifier
The ResNet-50 is combined with another machine learning model to create a hybrid model comprising
two blocks each shown in Figure 4. Within the first block, the weight of the pre-trained ResNet-50 which was
trained by ImageNet is adopted to retrieve the properties from the input images. Then, the deep features
extracted are passed to the second block. The second block uses a machine learning algorithm, including the
random forest and k-NN as classifiers to classify deep features. It is utilized in the second segment to categorize
the deep feature maps of the CNNs.
3.2.3. VGG-19
The size of the image that the VGG-19 receives as input is 224×224×3. The filter size for the
convolutional layer is 3×3, followed by the stride and padding of the convolution layers and five max-pooling
layers. The max-pooling was performed using a 2×2 window, and the stride is set at 2. For the entirely
interconnected layers, the first layer included 256 neurons, followed by a layer containing 256 neurons.
Categorization is achieved using a softmax output layer.
During the implementation phase, a learning rate of 0.001 is used. Additionally, it refines the model
such that each layer is trainable. The epoch size of the experiment has been set to 20, and the call-backs function
has been used to control the learning rate and early stopping. Figure 5 depicts the VGG-19 design, while
Table 2 displays the VGG-19 model’s configuration.
6. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 5, October 2023: 5885-5897
5890
Figure 4. ResNet-50 with machine learning architecture
Figure 5. VGG-19 architecture
3.2.4. Inception V3
The required input image size for the Inception-V3 network is 224×224×3 resolution. This CNN
architecture consists of 42-layer convolutional layers. It creates a basic yet robust deeper network, allowing us
to reduce computational expenses to a minimum. This network design is summarized in Figure 6.
The factorization of the n×n convolutional is the inception module. By merging the outputs of all the
layers, the output vector is formed. The output of the inception layer is subsequently passed on to the next
layer. The inception layer aids the preceding layers in identifying the proper filter size.
7. Int J Elec & Comp Eng ISSN: 2088-8708
Breast cancer classification with histopathological image based on machine learning (Jia Rong Leow)
5891
The stochastic gradient descent (SGD) is utilized as the optimizer throughout implementation, and all
the layers are frozen in order to train only the extra layers. It only fine-tunes the model by retraining a few of
the end layers of the Inception-V3 model. The learning rate for the SGD is set to 0.001, decay to 0.00001, and
the Nesterov to true. The experiment’s epoch is 20 times, and the batch size is set to 20.
Table 2. Configuration of VGG-19
ConvNet configuration of VGG-19
19 weight layers
Input (224×224×3 RGB image)
Conv 3×3(64)
Conv 3×3(64)
Maxpool
Conv 3×3(128)
Conv 3×3(128)
Maxpool
Conv 3×3(256)
Conv 3×3(256)
Conv 3×3(256)
Conv 3×3(256)
Maxpool
Conv 3×3(512)
Conv 3×3(512)
Conv 3×3(512)
Conv 3×3(512)
Maxpool
Conv 3×3(512)
Conv 3×3(512)
Conv 3×3(512)
Conv 3×3(512)
Maxpool
FC-256
FC-256
soft-max
Figure 6. Concept of inception-V3 (5×inception) architecture [22]
3.2.5. AlexNet
The AlexNet is composed of eight convolutional layers. This model is provided with images with
dimensions of 224×224×3. The first layer of AlexNet is a convolutional layer which consists of 96 filters with
11×11×3-pixel widths and strides of 4 pixels followed by a pooling layer and a batch normalization layer. Then
a ReLU activation layer is used in the convolutional layer. The output feature map now is 27×27×96.
Convolutional layers make up the second layer, as do the first. In this layer, the convolutional contains 256
filters with 11×11 filter sizes and 1-pixel strides. The third layer is also a convolutional layer, with 384 filters
that are 3×3 pixels in size and the output feature map is 6×6×384. The fourth layer is a convolutional layer as
well, with 384 filters that are 3×3 filters in size and the output feature map is 4×4×384. Similarly, the fifth
convolutional layer consists of 256 filters that are 3×3 pixels in size. A layer of overlapping pooling and local
8. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 5, October 2023: 5885-5897
5892
response normalization follows, and the output features map is 1×1×256. In entirely connected layers, the sixth
and seventh levels have 4,096 neurons with dropout function and ReLU activation. The classifications are then
performed on an outer layer using a softmax activation function. After that, the top two complicated layers are
selected to train, while the first eight layers are frozen, and the remaining layers are unfreezing. SGD is
employed as the optimizer, using a learning rate of 0.001, a momentum of 0.9, and a decay of 0.05. Figure 7
shows the summary architecture of the AlexNet pre-trained model.
Figure 7. AlexNet architecture
4. EXPERIMENTAL SETTINGS
The dataset named as BreakHis [23] is used in this study. This dataset consists of a collection of
9,109 samples of breast tumor tissue acquired from 82 cases at a range of magnification which 40, 100, 200,
and 400. The database presently has 5,429 malignant as shown in Figure 8 and 2,480 benign as shown in Figure
9. The images from the dataset each have the characteristics of 700×460 pixels, three-channel RGB, and eight-
bit depth for each channel. Details of the samples in the dataset are listed in Table 3.
Figure 8. Malignant tumors slide Figure 9. Benign tumors slide
9. Int J Elec & Comp Eng ISSN: 2088-8708
Breast cancer classification with histopathological image based on machine learning (Jia Rong Leow)
5893
Table 3. Details of BreakHis
Magnification Benign Malignant Total
40× 625 1,370 1,995
100× 644 1,437 2,081
200× 623 1,390 2,013
400× 588 1,232 1,820
Total 2480 5,429 7,909
The BreakHis dataset has been constructed as shown in Table 3 for the training, validation, and testing
phase. First, each image from the benign and malignant image folder is loaded and resized from 700×460 pixels
to 224×224 pixels so that it is compatible with the input size of the pre-trained CNN models that are feeding
in the next phase. Figures 10 and 11 illustrated the resized images in RGB format. Both the benign and
malignant data are separately loaded and assigned for train and test variables. After that, the benign train and
test variable are labelled as zero class while the malignant train test is labelled as one class. Then, the test data
for benign and malignant are combined and mixed into one testing dataset which is used in the testing phase.
Following the labelling and merging of the data, the train test split function is used to divide the data into 5,931
for training, 4,745 for validating, and 3,164 for testing the models, as shown in Table 4. The splitting ratio is
set as 75:60:40 for training, validation and testing, respectively. After completing the dataset construction,
begin training and testing the suggested model.
Figure 10. Resized benign image (224×224×3)
Figure 11. Resized malignant image (224×224×3)
Table 4. Data splitting details
Class Number
Training 5931
Validating 4745
Testing 3164
10. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 5, October 2023: 5885-5897
5894
5. RESULT AND DISCUSSION
In this section, the experimental results and discussion in utilizing transfer learning are presented.
Transfer learning is a powerful technique that enables the leverage of knowledge (weights, biases and other
learned features) that was previously trained on CNN models to solve the new domain problems. To conduct
the experiments, the Dell G15 Special Edition 5521 laptop equipped with the NVIDIA GeForce RTX 3050-Ti
Laptop GPU were used. For implementation, the Python programming language version 3.9.12 and
TensorFlow-GPU version 2.5.0 were used which allow us to take advantage of the powerful computational
capabilities of the GPU to accelerate the experiments. A series of experiments and evaluation of the results
based on various metrics, including accuracy, precision, recall, and F-feature are reported in this section.
Figure 12 shows the accuracy of all the pre-trained models for binary classification of breast cancer
from the BreakHis database. Figure 12 demonstrates that the Resnet-50 network scored the highest test accuracy
with 97.60%, followed by the VGG-19 models with an accuracy of 95.4%. The accuracy rate of the lightweight
AlexNet model was the lowest of the pre-trained models, at 81.42%. This demonstrates that increasing the
number of convolutional layers in a model may dramatically enhance its classification performance.
Figure 12. Accuracy performance of each pre-trained model
Intelligent systems often require a substantial quantity of training data to achieve human-level
diagnostic abilities. For some illnesses, the quantity of data available for model training may be insufficient,
leading to limited generalization of frequently used deep learning models. CNNs are an innovative technique
for learning picture representation, with convolutional filters learning to recognize image information via
backpropagation. Frequently, these attributes are given to a classifier, such as a softmax layer [24].
However, in this research, instead of using traditional backpropagation, a softmax layer is trained for
classification. Deep forest, a decision trees that outperforms deep neural networks (DNN) in a broad variety of
applications. Therefore, ResNet50 was used to retrieve information from breast cancer images before
categorizing them using deep forest. Deep forest has been proved to be particularly efficient in circumstances
when only small-scale training data is available, hence this technique was chosen. Furthermore, the deep forest
network chooses its own complexity, it tackles the dataset imbalance problem that we observed in this
circumstance. Aside from that, the standard k-NN [25] is a straightforward alternative technique to small class
prediction was also employed.
The accuracy based on Figure 12 has shown that the ResNet-50 with RF is better than the ResNet-50
with k-NN since ResNet-50 with RF is achieved at the 89.22% and ResNet-50 with k-NN is 73.67%.
Figure 13 shows the actual and predicted results. Table 5 details the area under the curve (AUC), recall,
accuracy, and F1-score obtained from all models. It could be seen that the ResNet-50 network achieved the
highest performance among the AUC of 97%, precision of 97% and 98%, recall of 95% and 98%, and F1-score
of 96% and 98% for benign and malignant, respectively.
Figure 14 shows the confusion matrices for the ResNet-50 model, which achieved the best accuracy
performance among the adopted models. This confusion matrix was created utilizing a 75% training and 40%
testing technique. The results show that the true positive rate is 68.14%, the true negative rate is 29.27%, the
false positive rate is 1.55%, and the false negative rate is 1.04% during the testing stage.
11. Int J Elec & Comp Eng ISSN: 2088-8708
Breast cancer classification with histopathological image based on machine learning (Jia Rong Leow)
5895
Figure 13. Actual and predicted results
Table 5. Performance of each pre-trained model
Model Class Precision Recall F-features Accuracy
AlexNet Benign 0.82 0.55 0.66 0.81
Malignant 0.81 0.94 0.87
VGG 19 Benign 0.82 0.88 0.85 0.90
Malignant 0.94 0.91 0.93
Inception V3 Benign 0.96 0.92 0.94 0.96
Malignant 0.96 0.98 0.97
ResNet-50 Benign 0.97 0.95 0.96 0.97
Malignant 0.98 0.98 0.98
ResNet50+
random forest
Benign 0.84 0.82 0.83 0.89
Malignant 0.92 0.92 0.92
ResNet50+KNN Benign 0.62 0.46 0.53 0.74
Malignant 0.77 0.87 0.82
Figure 14. Confusion matrix of ResNet-50 model
12. ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 13, No. 5, October 2023: 5885-5897
5896
6. CONCLUSION
Overall, this research has offered five distinct types of pre-trained CNN models on the classification
that may be utilized to categorize benign and malignant breast cancer histopathology images. The models
utilized in the study include ResNet-50, VGG-19, Inception-V3, and AlexNet, with the ResNet-50 also working
as a feature extractor, extracting features from images and passing them to machine learning algorithms for
classification, in this case, a RF and k-NN. The experiments of this study were based on a publicly available
dataset named BreakHis.
In conclusion, the ResNet-50 has a higher accuracy for the classification of medical color images, but
accurate cancer diagnosis is critical for protecting a person's life, and this research can aim to improve its
accuracy as much as possible in the future. This study’s proposed architecture might be used in real-world
medical imaging in real time. More testing in real-world scenarios is required for clinical applications, and the
system should be strengthened. Because of their speed, such devices may allow clinicians to consult with more
patients. Furthermore, because the suggested approach is more accurate in cancer classification, the death rate
from breast cancer might be greatly lowered.
REFERENCES
[1] N. Azamjah, Y. Soltan-Zadeh, and F. Zayeri, “Global trend of breast cancer mortality rate: A 25-Year study,” Asian Pacific Journal
of Cancer Prevention, vol. 20, no. 7, pp. 2015–2020, Jul. 2019, doi: 10.31557/APJCP.2019.20.7.2015.
[2] H. Qiu, W.-H. Xu, J. Kong, X.-J. Ding, and D.-F. Chen, “Effect of breast-conserving surgery and modified radical mastectomy on
operation index, symptom checklist-90 score and prognosis in patients with early breast cancer,” Medicine, vol. 99, no. 11, Mar.
2020, doi: 10.1097/MD.0000000000019279.
[3] G. N. Sharma, R. Dave, J. Sanadya, P. Sharma, and K. K. Sharma, “Various types and management of breast cancer: an overview,”
Journal of Advanced Pharmaceutical Technology and Research, vol. 1, no. 2, pp. 109–126, 2010.
[4] M. S. Reza and J. Ma, “Imbalanced histopathological breast cancer image classification with convolutional neural network,”
in 2018 14th
IEEE International Conference on Signal Processing (ICSP), Aug. 2018, pp. 619–624, doi:
10.1109/ICSP.2018.8652304.
[5] I. Hirra et al., “Breast cancer classification from histopathological images using patch-based deep learning modeling,” IEEE Access,
vol. 9, pp. 24273–24287, 2021, doi: 10.1109/ACCESS.2021.3056516.
[6] W. Zou, H. Lu, K. Yan, and M. Ye, “Breast cancer histopathological image classification using deep learning,” in 2019 10th
International Conference on Information Technology in Medicine and Education (ITME), Aug. 2019, pp. 53–57, doi:
10.1109/ITME.2019.00023.
[7] F. A. Spanhol, L. S. Oliveira, C. Petitjean, and L. Heutte, “Breast cancer histopathological image classification using convolutional
neural networks,” in 2016 International Joint Conference on Neural Networks (IJCNN), Jul. 2016, pp. 2560–2567, doi:
10.1109/IJCNN.2016.7727519.
[8] L. Alzubaidi, R. I. Hasan, F. H. Awad, M. A. Fadhel, O. Alshamma, and J. Zhang, “Multi-class breast cancer classification by a
novel two-branch deep convolutional neural network architecture,” in 2019 12th
International Conference on Developments in
eSystems Engineering (DeSE), Oct. 2019, pp. 268–273, doi: 10.1109/DeSE.2019.00057.
[9] D. Wang, Z. Chen, and H. Zhao, “Prototype transfer generative adversarial network for unsupervised breast cancer histology image
classification,” Biomedical Signal Processing and Control, vol. 68, Jul. 2021, doi: 10.1016/j.bspc.2021.102713.
[10] J. Xu et al., “Stacked sparse autoencoder (SSAE) for nuclei detection on breast cancer histopathology images,” IEEE Transactions
on Medical Imaging, vol. 35, no. 1, pp. 119–130, Jan. 2016, doi: 10.1109/TMI.2015.2458702.
[11] H. M. Ahmad, S. Ghuffar, and K. Khurshid, “Classification of breast cancer histology images using transfer learning,” in 2019 16th
International Bhurban Conference on Applied Sciences and Technology (IBCAST), Jan. 2019, pp. 328–332, doi:
10.1109/IBCAST.2019.8667221.
[12] F. Parvin, F. Parvin, M. A. M. Hasan, and M. Al Mehedi Hasan, “A comparative study of different types of convolutional neural
networks for breast cancer histopathological image classification,” in 2020 IEEE Region 10 Symposium (TENSYMP), 2020,
pp. 945–948, doi: 10.1109/TENSYMP50017.2020.9230787.
[13] P. T. Nguyen, T. T. Nguyen, N. C. Nguyen, and T. T. Le, “Multiclass breast cancer classification using convolutional neural
network,” in 2019 International Symposium on Electrical and Electronics Engineering (ISEE), Oct. 2019, pp. 130–134, doi:
10.1109/ISEE2.2019.8920916.
[14] M. A. Anupama, V. Sowmya, and K. P. Soman, “Breast cancer classification using capsule network with preprocessed histology
images,” in 2019 International Conference on Communication and Signal Processing (ICCSP), Apr. 2019, pp. 0143–0147, doi:
10.1109/ICCSP.2019.8698043.
[15] M. Ahmed and M. R. Islam, “A multiple-input based convolutional neural network in breast cancer classification from
histopathological images,” in 2021 24th
International Conference on Computer and Information Technology (ICCIT), Dec. 2021,
pp. 1–5, doi: 10.1109/ICCIT54785.2021.9689856.
[16] J. Xu and X. Dong, “A survey of transfer learning in breast cancer image classification,” in 2020 IEEE 3rd
International Conference
of Safe Production and Informatization (IICSPI), Nov. 2020, pp. 220–223, doi: 10.1109/IICSPI51290.2020.9332405.
[17] M. Naderan and Y. Zaychenko, “Convolutional autoencoder application for breast cancer classification,” in 2020 IEEE 2nd
International Conference on System Analysis and Intelligent Computing (SAIC), Oct. 2020, pp. 1–4, doi:
10.1109/SAIC51296.2020.9239139.
[18] C. C. Ukwuoma et al., “Boosting breast cancer classification from microscopic images using attention mechanism,” in 2022
International Conference on Decision Aid Sciences and Applications (DASA), Mar. 2022, pp. 258–264, doi:
10.1109/DASA54658.2022.9765013.
[19] S. Indolia, A. K. Goswami, S. P. Mishra, and P. Asopa, “Conceptual understanding of convolutional neural network-a deep learning
approach,” Procedia Computer Science, vol. 132, pp. 679–688, 2018, doi: 10.1016/j.procs.2018.05.069.
[20] H. Gu, Y. Wang, S. Hong, and G. Gui, “Blind channel identification aided generalized automatic modulation recognition based on
deep learning,” IEEE Access, vol. 7, pp. 110722–110729, 2019, doi: 10.1109/ACCESS.2019.2934354.
13. Int J Elec & Comp Eng ISSN: 2088-8708
Breast cancer classification with histopathological image based on machine learning (Jia Rong Leow)
5897
[21] M. Yani, B. Irawan, and C. Setiningsih, “Application of transfer learning using convolutional meural metwork method for early
detection of terry’s nail,” Journal of Physics: Conference Series, vol. 1201, no. 1, May 2019, doi: 10.1088/1742-
6596/1201/1/012052.
[22] Jahandad, S. M. Sam, K. Kamardin, N. N. A. Sjarif, and N. Mohamed, “Offline signature verification using deep learning
convolutional neural network (CNN) architectures GoogLeNet inception-v1 and inception-v3,” Procedia Computer Science,
vol. 161, pp. 475–483, 2019, doi: 10.1016/j.procs.2019.11.147.
[23] F. A. Spanhol, L. S. Oliveira, C. Petitjean, and L. Heutte, “A dataset for breast cancer histopathological image classification,” IEEE
Transactions on Biomedical Engineering, vol. 63, no. 7, pp. 1455–1462, Jul. 2016, doi: 10.1109/TBME.2015.2496264.
[24] S. Ray, “Disease classification within dermascopic images using features extracted by ResNet50 and classification through deep
forest,” Prepr. arXiv.1807.05711, Jul. 2018.
[25] J. Zhuang, J. Cai, R. Wang, J. Zhang, and W.-S. Zheng, “Deep kNN for medical image classification,” in MICCAI 2020: Medical
Image Computing and Computer Assisted Intervention-MICCAI 2020, 2020, pp. 127–136, doi: 10.1007/978-3-030-59710-8_13.
BIOGRAPHIES OF AUTHORS
Jia Rong Leow has taken his major in Bachelor of Computer Science Artificial
Intelligence at the Multimedia University of Malaysia began in 2020. He is presently studying
at Multimedia University. He was do his research through the Centre for Ubiquitous Computing
and Communication Research Centre (CUCC) at Multimedia University. He conducted research
on categorization breast cancer using several types of convolutional neural networks, and he has
also developed a hybrid model that combines convolutional neural networks with other machine
learning algorithms. He can be contacted at email: leow1017@gmail.com.
Wee How Khoh received his Master of Science in Information technology and
Ph.D degree from Multimedia University. He is also working as a lecturer in the Faculty of
Information Science and Technology (FIST), Multimedia University. He is currently the deputy
chairperson of the Centre for Ubiquitous Computing and Communication Research Centre
(CUCC) at the university. His research interests are machine learning, biometrics authentication
and security, and signal and image processing. He can be contacted at email:
whkhoh@mmu.edu.my.
Ying Han Pang received her B.E. (Hons) degree in Electronic Engineering in year
2002, Master of Science degree in year 2005 and Ph.D. degree in year 2013 from Multimedia
University. She is currently a lecturer at the Faculty of Information Science and Technology,
Multimedia University. Her research interests include biometrics, human activity recognition,
machine learning, inertial data processing, image processing and pattern recognition. She can be
contacted at email: yhpang@mmu.edu.my.
Hui Yen Yap is currently a lecturer in Faculty of Information Science and
Technology (FIST), Multimedia University. She is a Ph.D. candidate in Computer Science at the
Technical University of Malaysia (UTEM), Malacca. She received her Master’s Degree in
Computer Science in the year 2013. Her current research focuses on user recognition
with brainwaves. She can be contacted at email: hyyap@mmu.edu.my.