SlideShare a Scribd company logo
1 of 3
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5324
A Feature Selection Framework for DNA Methylation Analysis in
Predicting Bladder Cancer
Sreeshma P.S1
1Department of Computer Sci. & Engg, Thejus Engineering College, Vellarakkad, Thrissur, Kerala, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract – DNA Methylation is a procedure by which methyl
groups are included to the DNA molecule. DNA methylation
characterize an important role for the origination and the
growth of human cancer. So for the earlypredictionofbladder
cancer, DNA methylation can be used. This paperpresentstwo
different feature selection methods namely, Correlationbased
Feature Weighting(CFW), and Differential Mean Feature
Selection(DMFS). CFW is a process by which each feature is
assigned a weight according to their correlation. And this
weight is compared with the threshold value, if the weight is
greater than the threshold that feature is selected. Otherwise
the feature will be rejected. DMFS uses the Differential mean
values of both normal and cancer samples.
Key Words: DNA Methylation, Feature Selection,
Correlation based Feature Weighting, DifferentialMean
Feature Selection.
1. INTRODUCTION
Throughout our lives, healthy cells in our bodies divide and
replace themselves is a controlled fashion. Cancer may be a
general term for large cluster of diseases, whose causes,
characteristics and prevalence vary greatly. It’s refers to an
abnormal growth of cell tissue. Genetic and epigenetic
changes contribute to the event of cancer. DNA methylation
is a procedure by that methyl teams superimposed to DNA
molecule. Methylation will modify the activity of a DNA
section while not changing the sequence. The inflated
methylation is named as hypermethylation and also the
massive loss of DNA methylation stated ashypomethylation.
complete hypomethylation and promoter region specific
hypermethylation are 2 forms of typical DNA methylation
found in human cancer. DNA methylation plays a awfully
vital role for the origination and growthofhumancancer. So,
it are often used as a biological marker for the primary
prediction of cancers [1].
The main problems with DNA methylation dataset is that
huge number of features in each sample and less number of
the available samples. One possible solution is to usefeature
selection methods to obtain relevant feature set and to
eliminate redundant and irrelevant features. So , this paper
introduces two methods for feature selection namely,
Correlation based Feature Extraction(CFW)andDifferential
Mean Feature Selection (DMFS). Sample size goes through
two reduction steps,correlationbasedfeature weightingand
redundant feature elimination and unsupervised feature
selection via vertical analysis of the feature.
The main contributions of this paper are:
 Building a framework for DNA methylation
classification.
 Uses correlation based feature weighting method
for the feature selection process.
 Differential Mean feature Selection by utilizing
vertical analysis of features across the sample set.
2. LITERATURE REVIEW
[3] A datamining approach for predicting a diseaseusingthe
most relevant genes associated withit.Afewexistingfeature
selection methods have been applied independently and
their results combined to form fused feature set and then
applying genetic algorithm which generate optimal feature
set. They have been applied to a classifier for identifying the
disease.
[4] A framework based on both feature selection methods
and feature extraction methods to predict cancer. This
framework is made foe detecting cancer based on the
methylated DNA probes.
[5] A method to select the small subset of informative gene
relevant to the classification. This method is based on a
binary migration model and binary mutation model.
[6] Supervised learning technique that have been employed
to classify cancers. A two step feature selection method is
done. First one is an attributeestimationmethod(eg.ReliefF)
and the second one is the genetic algorithm.
[7] Tried to find efficient feature selection methodstoselect
a small number of informative genes using mutual
information and rough sets .
[8] Sparse Compact Learning Incremental Machine(SCLIM)
for cancer classification on microarray gene expressiondata
that is robust against diverse noises and outliers.
3. METHODOLOGY
With the advancement of technology, real time monitoring
system to the early prediction of cancers have become
common. This paper proposes a framework for the early
prediction of bladder cancer. For the early prediction of
cancer DNA methylation analysis is used. One of the
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5325
important role is for classifying normal samplesfromcancer
ones. For that combining differentfeatureselectionmethods
for classification. The proposed framework consists ofthree
modules ; Data pre-processing means that the data in the
available format is loading and then pre-processing the data
in the required format, Feature selection module contains 2
methods correlation based feature weighting (CFW) and
Differential Mean Feature Selection (DMFS) by utilizing
vertical analysis of the methods for each sample, finally
classification method is applied to the selected features.
The fig 1 illustrates the workflow of the method.
Initially the data is pre-processed. For that, the data in the
available format is changes to the required format. The pre-
processed data consists of samples in rows and features in
columns. After pre-processing the data twofeatureselection
methods are applying CFW and DMFS. Then finally Naïve
Bayes classification is done.
Fig -1: Workflow of the method
3.1 Data Pre-Processing
Data pre-processing is a datamining technique that involves
transforming raw data into an understandable format. The
main aim of this module is to load the available data. The
dataset contains 24 samples, where 6 normal samples and
18 cancer samples. In each of the sample there is 27,578
features.
Fig -2 : Steps in data pre-processing
3.2 Feature Selection
Correlation based Feature Weighting(CFW) : The weight for
a feature is proportional to the difference between the
feature - class correlation (mutual relevance ) and the
average feature - feature intercorrelation (average mutual
redundancy ) [2]. In this method each feature is assigned a
different weight according to their correlation.
The equation for feature - class correlation and feature –
feature intercorrelation can be respectively defined as [2],
I(Ai;C) =∑ ai ∑ c P(ai,c) log (P(ai,c) /P(ai)P(c)) (1)
I(Ai;Aj) =∑ ai ∑ aj P(ai,aj) log ( P(ai,aj)/ P(ai)P(aj)) (2)
where C is the class variable, Ai and Aj are two different
feature variables, c, ai and aj represent the values that they
take, respectively.
Di = NI(Ai ;C) – 1/m-1 ∑ j=1 Λ j!=1 NI (Ai ; Aj ) (3)
where NI(Ai;C) is the normalizedI(Ai;C)representing mutual
relevance and NI(Ai;Aj) is the normalized I(Ai;Aj)
representing mutual redundancy.
NI ( Ai;C ) = I ( Ai;C ) / ( 1/m ∑i=1 to m I ( Ai;C ) ) (4)
NI(Ai;Aj)=I(Ai;Aj)/( 1/m(m-1) ∑i=1 to m ∑ j=1Λ j!=1I(Ai;Aj)
(5 )
The final weight of the feature Ai is
Wi = 1 / 1+e-D
i (6)
This weight is compared with the threshold value. If the
weight is greater than the threshold that feature will be
selected otherwise rejected.
Differential Mean Feature Selection (DMFS) : Analyzing each
features across all samples vertically. Finding the mean
values of both normal and cancer samples separately. And
then find the absolute difference between mean values of
both normal and cancer samples (weight vector), selecting
the features corresponding to the weight vector, and
comparing with the threshold, If the weight vector is greater
than the threshold that feature will be selected.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5326
Fig -3 : DMFS approach
3.3 Classification
There are many classification techniques and each of them
have its own advantages and disadvantages.TheNaïveBayes
classifier is the direct and simplest classifier that utilizes
Bayes theorem for conditional probabilities of random
variables given known observations to build the classifiers
[1]. In this classifier all features are assumed independent
from each other and it calculates independently the
probability of each feature for a particular class label
[9],[10].This classifier is simple and computationally fast to
reach a decision.
3. CONCLUSION
The main aim of the paper is to provide efficient feature
selection methods for the early prediction of bladdercancer.
So, building a framework for DNA methylationclassification.
Basically it is a classification problem which classifies
bladder cancer dataset into normal samples and cancer
samples. Mainly two feature selection methods are used
namely, Correlation based Feature Weighting (CFW) and
Differential mean Feature Selection (DMFS). CFW assigns
weights for each of the features according to their
importance. And then comparing the weight vector with the
threshold value and if the weight is greater than threshold,
then that feature is selected. DMFS uses differential mean
between the mean of normal and cancer samples. The
number of features can be reduced by applying the feature
selection methods.
REFERENCES
[1] Abdul Majid f. Al-Junaid and Talal s. Qaid,” Vertical and
Horizontal DNA Differential Methylation Analysis for
Predicting Breast Cancer,” IEEE Access.,vol. 6 , pp.
53533,Aug 2018.
[2] Liangxiao Jiang, Lungan Zhang, Chaoqun Li, Jia Wu,” A
Correlation-based Feature Weighting Filter for Naive
Bayes”, IEEE Transactions on Knowledge and Data
Engineering., pp. 2836440, 2018.
[3] Mohammed Siyad B, Visakh R,” Disease Predictionusing
Optimal Feature Selection from Epigenetic Data,“ IEEE
conf. on Innovations in Power and AdvancedComputing
Technologies, 2017.
[4] A. A. Raweh, M. Nassef. And A. Badr,” Feature selection
and extraction framework for DNA methylation in
cancer, Int. J. Adv. Comput. Sci. Appl., vol. 8, no. 7, pp.
3036,2017.
[5] X. Li and M. Yin, “Multi-objective binary biogeography
based optimization for feature selection using gene
expression data,” IEEE Trans. Nanobiosci., vol. 12, no.
4,pp. 343352,Dec. 2013.
[6] H. Hijaziand C. Chan ,” A classification framework
applied to cancer geneexpressionprofiles”, J.Healthcare
Eng., vol. 4, no. 2, pp. 255–283, 2013.
[7] W. Zhou, C. Zhou, G. Liu, and H. Zhu, ‘‘Feature selection
for microarray data analysis using mutual information
and rough set theory’’, in Proc. IFIP Int. Fed. Inf.
Process., Artif. Intell. Appl. Innov., vol. 204, I.
Maglogiannis, K. Karpouzis, and M. Bramer,Eds.Boston,
MA, USA: Springer, 2006, pp. 492–499.
[8] M. Nayyeri and H. S. Noghabi, ‘‘Cancer classification by
correntropy based sparsecompactincremental learning
machine,’’ Gene Rep., vol. 3, pp. 31–38, Jun. 2016.

More Related Content

What's hot

Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
Alexander Decker
 
Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square
IJECEIAES
 
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
Ehsan Lotfi
 

What's hot (17)

Comparison of Feature selection methods for diagnosis of cervical cancer usin...
Comparison of Feature selection methods for diagnosis of cervical cancer usin...Comparison of Feature selection methods for diagnosis of cervical cancer usin...
Comparison of Feature selection methods for diagnosis of cervical cancer usin...
 
Hybrid prediction model with missing value imputation for medical data 2015-g...
Hybrid prediction model with missing value imputation for medical data 2015-g...Hybrid prediction model with missing value imputation for medical data 2015-g...
Hybrid prediction model with missing value imputation for medical data 2015-g...
 
Breast cancer diagnosis and recurrence prediction using machine learning tech...
Breast cancer diagnosis and recurrence prediction using machine learning tech...Breast cancer diagnosis and recurrence prediction using machine learning tech...
Breast cancer diagnosis and recurrence prediction using machine learning tech...
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
 
Supervised machine learning based liver disease prediction approach with LASS...
Supervised machine learning based liver disease prediction approach with LASS...Supervised machine learning based liver disease prediction approach with LASS...
Supervised machine learning based liver disease prediction approach with LASS...
 
GRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATA
GRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATAGRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATA
GRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATA
 
An Ensemble of Filters and Wrappers for Microarray Data Classification
An Ensemble of Filters and Wrappers for Microarray Data Classification An Ensemble of Filters and Wrappers for Microarray Data Classification
An Ensemble of Filters and Wrappers for Microarray Data Classification
 
Survey of Feature Selection / Extraction Methods used in Biomedical Imaging
Survey of Feature Selection / Extraction Methods used in Biomedical ImagingSurvey of Feature Selection / Extraction Methods used in Biomedical Imaging
Survey of Feature Selection / Extraction Methods used in Biomedical Imaging
 
IRJET- Image Processing for Brain Tumor Segmentation and Classification
IRJET-  	  Image Processing for Brain Tumor Segmentation and ClassificationIRJET-  	  Image Processing for Brain Tumor Segmentation and Classification
IRJET- Image Processing for Brain Tumor Segmentation and Classification
 
Positive Impression of Low-Ranking Microrn as in Human Cancer Classification
Positive Impression of Low-Ranking Microrn as in Human Cancer ClassificationPositive Impression of Low-Ranking Microrn as in Human Cancer Classification
Positive Impression of Low-Ranking Microrn as in Human Cancer Classification
 
IRJET-Implementation of CAD system for Cancer Detection using SVM based Class...
IRJET-Implementation of CAD system for Cancer Detection using SVM based Class...IRJET-Implementation of CAD system for Cancer Detection using SVM based Class...
IRJET-Implementation of CAD system for Cancer Detection using SVM based Class...
 
40120130405013
4012013040501340120130405013
40120130405013
 
research paper
research paperresearch paper
research paper
 
Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square
 
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
 
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
 
Ol3425172524
Ol3425172524Ol3425172524
Ol3425172524
 

Similar to IRJET- A Feature Selection Framework for DNA Methylation Analysis in Predicting Bladder Cancer

A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
AIRCC Publishing Corporation
 
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
Kiogyf
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
ijscai
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
ijscai
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
ijscai
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
ijscai
 
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
BRNSSPublicationHubI
 
Cancer prognosis prediction using balanced stratified sampling
Cancer prognosis prediction using balanced stratified samplingCancer prognosis prediction using balanced stratified sampling
Cancer prognosis prediction using balanced stratified sampling
ijscai
 

Similar to IRJET- A Feature Selection Framework for DNA Methylation Analysis in Predicting Bladder Cancer (20)

IRJET - Prediction of Risk Factor of the Patient with Hepatocellular Carcinom...
IRJET - Prediction of Risk Factor of the Patient with Hepatocellular Carcinom...IRJET - Prediction of Risk Factor of the Patient with Hepatocellular Carcinom...
IRJET - Prediction of Risk Factor of the Patient with Hepatocellular Carcinom...
 
Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...
 
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
 
IRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
IRJET- Breast Cancer Prediction using Supervised Machine Learning AlgorithmsIRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
IRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
 
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
Classification of Breast Cancer Tissues using Decision Tree Algorithms
Classification of Breast Cancer Tissues using Decision Tree AlgorithmsClassification of Breast Cancer Tissues using Decision Tree Algorithms
Classification of Breast Cancer Tissues using Decision Tree Algorithms
 
IRJET- Detection of Breast Cancer using Machine Learning Techniques
IRJET-  	  Detection of Breast Cancer using Machine Learning TechniquesIRJET-  	  Detection of Breast Cancer using Machine Learning Techniques
IRJET- Detection of Breast Cancer using Machine Learning Techniques
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
 
Classification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataClassification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer Data
 
Cancer prognosis prediction using balanced stratified sampling
Cancer prognosis prediction using balanced stratified samplingCancer prognosis prediction using balanced stratified sampling
Cancer prognosis prediction using balanced stratified sampling
 
Skin Cancer Detection Application
Skin Cancer Detection ApplicationSkin Cancer Detection Application
Skin Cancer Detection Application
 
Review on Mesothelioma Diagnosis
Review on Mesothelioma DiagnosisReview on Mesothelioma Diagnosis
Review on Mesothelioma Diagnosis
 
Computer Aided System for Detection and Classification of Breast Cancer
Computer Aided System for Detection and Classification of Breast CancerComputer Aided System for Detection and Classification of Breast Cancer
Computer Aided System for Detection and Classification of Breast Cancer
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
 

More from IRJET Journal

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
pritamlangde
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 

Recently uploaded (20)

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .ppt
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 

IRJET- A Feature Selection Framework for DNA Methylation Analysis in Predicting Bladder Cancer

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5324 A Feature Selection Framework for DNA Methylation Analysis in Predicting Bladder Cancer Sreeshma P.S1 1Department of Computer Sci. & Engg, Thejus Engineering College, Vellarakkad, Thrissur, Kerala, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract – DNA Methylation is a procedure by which methyl groups are included to the DNA molecule. DNA methylation characterize an important role for the origination and the growth of human cancer. So for the earlypredictionofbladder cancer, DNA methylation can be used. This paperpresentstwo different feature selection methods namely, Correlationbased Feature Weighting(CFW), and Differential Mean Feature Selection(DMFS). CFW is a process by which each feature is assigned a weight according to their correlation. And this weight is compared with the threshold value, if the weight is greater than the threshold that feature is selected. Otherwise the feature will be rejected. DMFS uses the Differential mean values of both normal and cancer samples. Key Words: DNA Methylation, Feature Selection, Correlation based Feature Weighting, DifferentialMean Feature Selection. 1. INTRODUCTION Throughout our lives, healthy cells in our bodies divide and replace themselves is a controlled fashion. Cancer may be a general term for large cluster of diseases, whose causes, characteristics and prevalence vary greatly. It’s refers to an abnormal growth of cell tissue. Genetic and epigenetic changes contribute to the event of cancer. DNA methylation is a procedure by that methyl teams superimposed to DNA molecule. Methylation will modify the activity of a DNA section while not changing the sequence. The inflated methylation is named as hypermethylation and also the massive loss of DNA methylation stated ashypomethylation. complete hypomethylation and promoter region specific hypermethylation are 2 forms of typical DNA methylation found in human cancer. DNA methylation plays a awfully vital role for the origination and growthofhumancancer. So, it are often used as a biological marker for the primary prediction of cancers [1]. The main problems with DNA methylation dataset is that huge number of features in each sample and less number of the available samples. One possible solution is to usefeature selection methods to obtain relevant feature set and to eliminate redundant and irrelevant features. So , this paper introduces two methods for feature selection namely, Correlation based Feature Extraction(CFW)andDifferential Mean Feature Selection (DMFS). Sample size goes through two reduction steps,correlationbasedfeature weightingand redundant feature elimination and unsupervised feature selection via vertical analysis of the feature. The main contributions of this paper are:  Building a framework for DNA methylation classification.  Uses correlation based feature weighting method for the feature selection process.  Differential Mean feature Selection by utilizing vertical analysis of features across the sample set. 2. LITERATURE REVIEW [3] A datamining approach for predicting a diseaseusingthe most relevant genes associated withit.Afewexistingfeature selection methods have been applied independently and their results combined to form fused feature set and then applying genetic algorithm which generate optimal feature set. They have been applied to a classifier for identifying the disease. [4] A framework based on both feature selection methods and feature extraction methods to predict cancer. This framework is made foe detecting cancer based on the methylated DNA probes. [5] A method to select the small subset of informative gene relevant to the classification. This method is based on a binary migration model and binary mutation model. [6] Supervised learning technique that have been employed to classify cancers. A two step feature selection method is done. First one is an attributeestimationmethod(eg.ReliefF) and the second one is the genetic algorithm. [7] Tried to find efficient feature selection methodstoselect a small number of informative genes using mutual information and rough sets . [8] Sparse Compact Learning Incremental Machine(SCLIM) for cancer classification on microarray gene expressiondata that is robust against diverse noises and outliers. 3. METHODOLOGY With the advancement of technology, real time monitoring system to the early prediction of cancers have become common. This paper proposes a framework for the early prediction of bladder cancer. For the early prediction of cancer DNA methylation analysis is used. One of the
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5325 important role is for classifying normal samplesfromcancer ones. For that combining differentfeatureselectionmethods for classification. The proposed framework consists ofthree modules ; Data pre-processing means that the data in the available format is loading and then pre-processing the data in the required format, Feature selection module contains 2 methods correlation based feature weighting (CFW) and Differential Mean Feature Selection (DMFS) by utilizing vertical analysis of the methods for each sample, finally classification method is applied to the selected features. The fig 1 illustrates the workflow of the method. Initially the data is pre-processed. For that, the data in the available format is changes to the required format. The pre- processed data consists of samples in rows and features in columns. After pre-processing the data twofeatureselection methods are applying CFW and DMFS. Then finally Naïve Bayes classification is done. Fig -1: Workflow of the method 3.1 Data Pre-Processing Data pre-processing is a datamining technique that involves transforming raw data into an understandable format. The main aim of this module is to load the available data. The dataset contains 24 samples, where 6 normal samples and 18 cancer samples. In each of the sample there is 27,578 features. Fig -2 : Steps in data pre-processing 3.2 Feature Selection Correlation based Feature Weighting(CFW) : The weight for a feature is proportional to the difference between the feature - class correlation (mutual relevance ) and the average feature - feature intercorrelation (average mutual redundancy ) [2]. In this method each feature is assigned a different weight according to their correlation. The equation for feature - class correlation and feature – feature intercorrelation can be respectively defined as [2], I(Ai;C) =∑ ai ∑ c P(ai,c) log (P(ai,c) /P(ai)P(c)) (1) I(Ai;Aj) =∑ ai ∑ aj P(ai,aj) log ( P(ai,aj)/ P(ai)P(aj)) (2) where C is the class variable, Ai and Aj are two different feature variables, c, ai and aj represent the values that they take, respectively. Di = NI(Ai ;C) – 1/m-1 ∑ j=1 Λ j!=1 NI (Ai ; Aj ) (3) where NI(Ai;C) is the normalizedI(Ai;C)representing mutual relevance and NI(Ai;Aj) is the normalized I(Ai;Aj) representing mutual redundancy. NI ( Ai;C ) = I ( Ai;C ) / ( 1/m ∑i=1 to m I ( Ai;C ) ) (4) NI(Ai;Aj)=I(Ai;Aj)/( 1/m(m-1) ∑i=1 to m ∑ j=1Λ j!=1I(Ai;Aj) (5 ) The final weight of the feature Ai is Wi = 1 / 1+e-D i (6) This weight is compared with the threshold value. If the weight is greater than the threshold that feature will be selected otherwise rejected. Differential Mean Feature Selection (DMFS) : Analyzing each features across all samples vertically. Finding the mean values of both normal and cancer samples separately. And then find the absolute difference between mean values of both normal and cancer samples (weight vector), selecting the features corresponding to the weight vector, and comparing with the threshold, If the weight vector is greater than the threshold that feature will be selected.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5326 Fig -3 : DMFS approach 3.3 Classification There are many classification techniques and each of them have its own advantages and disadvantages.TheNaïveBayes classifier is the direct and simplest classifier that utilizes Bayes theorem for conditional probabilities of random variables given known observations to build the classifiers [1]. In this classifier all features are assumed independent from each other and it calculates independently the probability of each feature for a particular class label [9],[10].This classifier is simple and computationally fast to reach a decision. 3. CONCLUSION The main aim of the paper is to provide efficient feature selection methods for the early prediction of bladdercancer. So, building a framework for DNA methylationclassification. Basically it is a classification problem which classifies bladder cancer dataset into normal samples and cancer samples. Mainly two feature selection methods are used namely, Correlation based Feature Weighting (CFW) and Differential mean Feature Selection (DMFS). CFW assigns weights for each of the features according to their importance. And then comparing the weight vector with the threshold value and if the weight is greater than threshold, then that feature is selected. DMFS uses differential mean between the mean of normal and cancer samples. The number of features can be reduced by applying the feature selection methods. REFERENCES [1] Abdul Majid f. Al-Junaid and Talal s. Qaid,” Vertical and Horizontal DNA Differential Methylation Analysis for Predicting Breast Cancer,” IEEE Access.,vol. 6 , pp. 53533,Aug 2018. [2] Liangxiao Jiang, Lungan Zhang, Chaoqun Li, Jia Wu,” A Correlation-based Feature Weighting Filter for Naive Bayes”, IEEE Transactions on Knowledge and Data Engineering., pp. 2836440, 2018. [3] Mohammed Siyad B, Visakh R,” Disease Predictionusing Optimal Feature Selection from Epigenetic Data,“ IEEE conf. on Innovations in Power and AdvancedComputing Technologies, 2017. [4] A. A. Raweh, M. Nassef. And A. Badr,” Feature selection and extraction framework for DNA methylation in cancer, Int. J. Adv. Comput. Sci. Appl., vol. 8, no. 7, pp. 3036,2017. [5] X. Li and M. Yin, “Multi-objective binary biogeography based optimization for feature selection using gene expression data,” IEEE Trans. Nanobiosci., vol. 12, no. 4,pp. 343352,Dec. 2013. [6] H. Hijaziand C. Chan ,” A classification framework applied to cancer geneexpressionprofiles”, J.Healthcare Eng., vol. 4, no. 2, pp. 255–283, 2013. [7] W. Zhou, C. Zhou, G. Liu, and H. Zhu, ‘‘Feature selection for microarray data analysis using mutual information and rough set theory’’, in Proc. IFIP Int. Fed. Inf. Process., Artif. Intell. Appl. Innov., vol. 204, I. Maglogiannis, K. Karpouzis, and M. Bramer,Eds.Boston, MA, USA: Springer, 2006, pp. 492–499. [8] M. Nayyeri and H. S. Noghabi, ‘‘Cancer classification by correntropy based sparsecompactincremental learning machine,’’ Gene Rep., vol. 3, pp. 31–38, Jun. 2016.