SlideShare a Scribd company logo
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 5 (Nov. - Dec. 2013), PP 39-44
www.iosrjournals.org
www.iosrjournals.org 39 | Page
Performance and Evaluation of Data Mining Techniques in
Cancer Diagnosis
R.M. Chandrasekar Ph.D, V. Palaniammal M.C.A., M.Phil.
(Professor, Computer Science, Annamalai University, Chidambaram, Tamil Nadu, India.)
(Asst. Professor, Thiruvalluvar University College of Arts & Science, Kallakurichi, Tamil Nadu, India.)
Abstract: We analyze the breast Cancer data available from the WBC, WDBC from UCI machine learning with
the aim of developing accurate prediction models for breast cancer using data mining techniques. Data mining
has, for good reason, recently attracted a lot of attention, it is a new Technology, tackling new problem, with
great potential for valuable commercial and scientific discoveries. The experiments are conducted in WEKA.
Several data mining classification techniques were used on the proposed data. There are many classification
techniques in data mining such as Decision Tree, Rules NNge, Tree random forest, Random Tree, lazy IBK. The
aim of this paper is to investigate the performance of different classification techniques. The data breast cancer
data with a total 286 rows and 10 columns will be used to test and justify the different between the classification
methods and algorithm.
Keywords - Machine learning, data mining Weka, classification, breast cancer
I. Introduction
About 1 in 8 U.S women (just under 12%) will develop invasive breast cancer over the course of her
life. In 2011, an estimated 230,480 new cases of invasive breast cancer were expected to be diagnosed in women
in the U.S, along with 57,650 new cases of non-invasive (in situ) breast cancer. For women in the U.S breast
cancer death rates are higher than those for any other cancer besides lung cancer. In 2011 there were more than
2.6million breast cancer survivor in the U.S.A women’s risk of breast cancer approximately relative (mother,
sister, daughter) who has been diagnosed with breast cancer. About 15% of women who get breast cancer have a
family member diagnosed with it. About 85% of breast cancers occur in women who have no family history of
breast cancer. These occur due to genetic mutations that happen as a result of the aging process and life in
general.
Breast cancer (An overview):
Breast Cancer is the most common cancer diseases among women excluding no melanoma skin.
Cancers are divided into two types benign and Malignant. If the cancer is benign under the conditions of early
diagnosis. Malignancy status includes the three basic measurement 1.Age 2.Longer tumor length 3.ADC or
Apparent Diffusion coefficient (biopsy confirmed).Attributes 1.patient id 2.diagnosis m=malignant ,b=benign
3.Real valued features. Cell uncles’ 1.radius 2.texture 3.perimeter 4.Area 5.Smoothness.I Diagnosis Attributes
are 1.Menopausal status a. Premenopausal, b.post menopausal. II.Basic diagnosis :-1.clinical
examinations,2.mammography (y/n),3.MRI (y/n),4.Ultrasound (y/n),5.Fine Needle aspiration (y/n),6.Core
biopsy (y/n),7.Open biopsy (y/n) ,8.Other (y/n),9.Unknown (y/n).III Clinical Trial Enrollment:-y/n/Not
stated/inadequate. IV. Initial Representation:-Screening-mmaaography,Screening-MRI,Screening-
Others,Symptomatic.V.Total extent of Lesion (DCIS AND INVASIVE) 1.Lesion in mm.VI.Lymphovascular
Invasion(presence of tumor cells in endothelium-lined spaces) 1.present/absent/suspicious/not stated /unknown.
Number stages of Breast cancer:-
There are 4 number stages of breast cancer, staging takes into various factors, including 1.The size of
the tumor (tumor means either breast lymph’s or Area of cancer cells found on a Scan or mammogram)
2.Cancer cells have spread into nearby lymph glands (lymph Node) 3.The tumor cells has spread to any other
part of the body (Metastasized-TNM). Stage1 breast cancer is split into 2 Stages.Stage1A tumor is 2 cm or
smaller and has spread outside the breast.Stage1B: Small areas of breast Cancer cells are found in the lymph
node closed to the breast and either the tumor is 2 cm or smaller. Stage 2 breast cancer: stage 2A:Tumor 2cm or
smaller in the breast and cancer cells are found in 1 to 3 lymph Node in the armpit or in the lymph Node near
the breast bone. Stage 2B: The tumor is larger than 2 cm but not larger than 5 cm and small areas of cancer cells
are in the lymph Node.2 to 5 cm spread 1 to 3 lymph nodes in the armpit or near the breastbones or the tumor is
larger than 5cm and has not spread to the lymph node. Stage 3A breast cancer No tumor is seen in the breast or
the tumor may be any size and cancer is found in 4 to 9 lymph glands under the arm or in the lymph glands near
the breast bone. The tumor is more than 5 cm and has spread into up to 3 lymph nodes near the breast bone.
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
www.iosrjournals.org 40 | Page
Stage 3B: The tumor has spread to the skin of the breast or to the chest wall and made the kin break down or
cancer swelling. The cancer may have spread to the up 9 lymph Node in the armpit or to the lymph glands near
the breast bone. Stage 4 breast cancer: The tumor can be any size. The lymph Node may or may not contain
cancer cells. The cancer has spread to the other parts of the body such as bone, lungs, liver, and brain. The TNM
(Tumor, Node, and Metastasis) system is specifies for each type of cancer. Once a patient‘s T, N, and M
categories have been determined, this information is combined in a process called stage grouping to determine a
women disease stage.Stage0 (the least advantage stage) to Stage4 (the most advantage stage).
II. Headings
Existing System:
The data sets used are SEER data or Wisconsin data. Data pre-processing is applied before data mining
to improve the quality of the data. Data pre-processing includes data cleaning, data integration, data
transformation and data reduction techniques. The features used for classification purposes coincided with the
Breast Imaging Reporting and Data System (BI-RADS) as this is how radiologists classify breast cancer. The
BI-RADS features of density, mass shape, mass margin and abnormality assessment rank are used as they have
been proven to provide good classification accuracy. A Classification method, Decision tree algorithms are
widely used in medical field to classify the medical data for diagnosis. Feature Selection increases the accuracy
of the Classifier because it eliminates irrelevant attributes. Feature selection with decision tree classification
greatly enhances the quality of the data in medical diagnosis. CART algorithm with various feature selection
methods to find out whether the same feature selection method may lead to best accuracy on various datasets of
same domain. Artificial neural networks (ANNs) and support vector machines have been recently proposed as a
very effective method for pattern recognition, machine learning and data mining. The discrimination capability
of the features extracted from the sonograms was tested by using the SVM (support vector machine), ANN and
KNN (K-nearest neighbor) classifier. It was found that the SVM gave the greatest accuracy while the ANN had
the highest sensitivity. Back propagation neural network (BPNN) and radial basis function network (RBFN)
were used for the training and testing of data.
Algorithm:
1. CART Algorithm
2. Decision tree algorithms
3. ID3
4. C4.5 decision tree algorithms
5. Hunt’s algorithm
6. SVM
7. Naïve Bayes classifier.
8. The back-propagated neural network.
Proposed System:
Early diagnosis needs an accurate and reliable diagnosis procedure that can be used by physicians to
distinguish benign breast tumors from malignant ones without going for surgical biopsy. The objective of these
predictions is to assign patients to one of the two group either a “benign” that is noncancerous or a “malignant”
that is cancerous. The prognosis problem is the long-term care for the disease for patients whose cancer has been
surgically removed. In this paper, we propose a model-based data mining technique with a neural network
classification technique and the improvements possible using an ensemble approach. Ensembles have been
proposed as a mechanism for improving the classification accuracy of existing classifiers providing that
constituents are diverse. Contrasting the cluster analysis technique against a baseline neural network classifier,
and then considers the effects of applying an ensemble technique to improve the accuracies. In different cases
the state of the disease condition itself can be marked by stages where the diagnostic symptoms or signs can be
subtle or different to other stages of the disease. This means that there is often not a clean mapping between the
diagnostic features and the diagnosis. The usage of clustering has also been extended to classifiers and detection
systems in order to improve detection and provide greater classification accuracy. Use of a feature selection
technique with a decision tree classifier to classify breast cancer. Another advantage of the model-based
clustering approach is that no decisions have to be made about the scaling of the observed variables: for
instance, when working with normal distributions with unknown variances, the results will be the same
irrespective of whether the variables are normalized or not. In order to fasten the classification we are using
Artificial Neural Network (ANN).
Algorithm:
1. Decision tree algorithms – For feature selection
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
www.iosrjournals.org 41 | Page
2. C4.5 Algorithm – For classification
3. ANN (Artificial Neural Network)
Software Requirement:
 The WEKA is an ensemble of tools for data classification, regression, clustering, association rules, and
visualization. The toolkit is developed in Java and is an open source software issued under the GNU
General Public License.
 WBCD dataset.
III. Figures And Tables
Characteristics of database:
Wisconsin diagnostics Breast Cancer this database is created by William H.Wolberge at University of
Wisconsin. This database contains 286 observations among which 201 are benign cases and 85 are malignant
cases. These instances are described by 10 Attributes:-.class no-recurrence events, recurrence events, 2.age,
3.Menopause, 4.tumor_size, 5.inv_nodes, 6.node_cap, 7.deg_malig, 8.breast, 9.breast_quad, 10.irradiat
Table1:
Attributes Values
Age 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99
menopause lt40, ge40,
premeno
tumor-size 0-4, 5-9, 10-14, 15-19, 20-24, 25-29, 30-34, 35-39, 40- 44,45-49, 50-54,55-59
inv-nodes 0-2, 3-5, 6-8, 9-11, 12-14, 15-17, 18-20, 21-23, 24-26,27-29, 30-32, 33-35,36-39
node-caps yes, no
deg-malig 1, 2, 3
Breast left, right
breast-quad left-up, left-low, right-up, right-low, central
irradiat yes, no
Class no-recurrence-events, recurrence-events
Classification prediction process in data mining:
Data mining is collection of techniques for efficient automated discovery of previously unknown valid
novel, useful and understandable pattern in large databases. Classification is the process of minding a model that
describes and distinguishes data classes concepts. This derived model is based on the analysis of a set of training
data. Classification predicts categorial labels predictable numerical data values rather than class lables.
We used several classification methods resulting in identification of top 5 classification alogorithm.
This paper presents a brief description of the classifiers and meta_classifiers used in the experiments results.
Fig1: Classification prediction process in data mining
Rules NNge (Non_Nested Generalised Examplars):
NNge Classifier new examples by determining the nearest neighbour in the exemplar database using a
Euclidean distance function.
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
www.iosrjournals.org 42 | Page
The function is 𝐷 𝐸𝐻 = 𝑊
𝐻 𝑊
𝐼
𝐸 𝐼−𝐻 𝐼
𝑀𝐴𝑋 𝐼−𝑀𝐼𝑁 𝐼
𝑀
𝐼=1
2
Lazy K Star classifier
Lasy kstar algorithm most successful classifier. Implementation of an instance based classifier, which
uses the entropic distance measures. Test instance is based upon the class of those training instance.
Lazy IBK (K_Nearest Neighbours classifier):
IBK classifier is simple instance based learner that uses the class of nearest K training instances for the
class of the test instance. Predicts the class of the single nearest training instance for each test instance. Get
maximum no of instances allowed in the training pool.
Meta Decorate (Diverse Ensemble creation by oppositional relabeling of artificial Training Examples):
Meta learners such as Boosting Bagging and Random Forest provide diversity by sub sampling or
reweighting the existing training data. Decorate performs by adding randomly constructed to the training
set,when building new members.
𝑑𝑗 𝑥 =
0 𝑖𝑓𝑐𝑗 𝑥
1 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
= 𝑐∗
𝑥
𝐷 =
1
𝑁𝑀
𝑑𝑗 𝑥𝑖
𝑛
𝑖=1
𝑚
𝑗=1
𝑐∗
𝑥 =arg max 𝑝 𝑘 𝑥 ,kє{1…….Q)
𝑝 𝑘 𝑥 = 𝑝𝑐3,𝑘 𝑥𝑐 𝑗𝜖 𝑐∗ / 𝑐∗
Tree Random Forest:
Random Forest classifier consists of multiple decision trees. The final class of an instance in a Random
forest is assigned by outputting the class that is the mode of the outputs of individual trees. Which can produce
robust and accurate classification. Random classifiers is based on a combination of many decision Tree
predictors such that each tree depends on the values of a random vector sampled independently and with the
same distribution for all trees in the forest. RF has excellent accuracy among current classifier Algorithm.
Fig2:Block diagram of the survival detection system
Table2:
Simulation result of each Algorithm
Algorithm
Total instance 286
Correctly classify
Instance(%value)
Incorrectly classify
Instance(%value)
Time taken
(seconds)
Kuppa status
Rule NNge 249
97.2028
8
2.7972
0.16 0.933
Random Forest 280
97.9021
6
2.0979
0.11 0.9501
Meta Decorate 259
90.5594
27
9.4406
1.81 0.7584
Lazy IBK 280
97.9021
6
2.0979
0 0.9491
Lazy Kstar 280
97.9021
6
2.0979
0 0.9491
Random Tree 280
97.9021
6
2.0979
0.02 0.9491
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
www.iosrjournals.org 43 | Page
Classification
 Knm
 SVM
 Neural N/W
 Decision Tree Classifier
Output Begin or Malignant
 Pixel analysis
 Intensity Analysis
 Texture Analysis
Feature Extraction
 Fuzzy C-means
 K-means clustering
 Free hand clustering
 Active contour
clustering
Segmentation
1. Image filter
 Linear filter
 Median Filter
 Adaptive Filter
2. Images resize
3. Contrast enhancement
 Run time
 Cache enhancement
algorithm
Input Image
Pre-processing
 Read single image
 Runtime read image
 Read multiple images
Table3:
Training and Simulation errors:
Algorithm
Total instance
286
Mean Absolute
Error %
Root Mean
Squared Error
%
Relative
absolute
Error%
Root Relative
Squared
Error%
Rules NNge 0.028 0.1672 6.6868 36.5947
Random Forest 0.0978 01614 23.3712 35.3247
Meta Decorate 0.3655 0.3797 87.3643 83.083
Lazy-IBK 0.0253 0.1053 6.0487 23.0348
Lazy-Kstar 0.0747 0.1399 17.8681 30.6094
Random Tree 0.0221 0.1052 5.2937 23.0237
Architecture Diagram
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
www.iosrjournals.org 44 | Page
IV. Conclusion
This paper presents effective classification Techniques. After investigation of different classification
Algorithm we have chosen 6 classifier based on our simulation performance and we have used Tree Random
classifier achieved overall classification accuracy 98%, which is significanant. In future work we propose to
analyze Ensemble classifier for 100% accuracy.
Acknowledgements
The authors like to express our appreciations to prof. R.M. Chandrasekar for his help in correcting
earlier versions of this paper. We also would like to thank the anonymous reviewers for their valuable comments
and in sightful suggestions.
References
[1] Mitchell, T.M. 1997. Machine Learning. McGraw-Hill Science
[2] John, G., Cleary, E. & Leonard, E. 1995. K*: An Instance-based Learner Using an Entropic Distance Measure. In: 12th
International Conference on Machine Learning, 108-114.
[3] Aha, D. & Kibler, D. 1991. Instance-based learning algorithms. Machine Learning.
[4] American Cancer Societies. Breast Cancer Facts& Figures 2005-2006. Atlanta: American Cancer Society, Inc.
(http://www.cancer.org/).
[5] National Resource. Cancer Epidemiology Biomarkers & Prevention 1999; 8:1117-1121.
[6] Houston, Andrea L. and Chen, et. al.. Medical Data Mining on the Internet: Research on a Cancer Information System. Artificial
Intelligence Review 1999; 13:437-466.
[7] Lundin M, Lundin J, Burke HB, Toikkanen S Pylkkanen L, Joensuu H. Artificial neural Networks applied to survival prediction in
breast cancer. Oncology 1999; 57:281-6.
[8] Delen D, Walker G, Kadam A. Predicting breast cancer survivability: a comparison of three data Mining methods. Artificial
Intelligence in Medicine. 2005 Jun; 34(2):113-27.
[9] Ian H. Witten and Eibe Frank. Data Mining: Practical machine learning tools and techniques, 2nd Edition. San Francisco: Morgan
Kaufmann; 2005.
[10] Weka: Data Mining Software in Java,http://www.cs.waikato.ac.nz/ml/weka
[11] lando Anunciac¸ ˜ao and Bruno C. Gomes and Susana Vinga and Jorge Gaspar and Arlindo L. Oliveira and Jos´e Rueff , A Data
Mining Approach for the detection of High-Risk Breast Cancer Groups
[12] aria-Luiza Antonie, Osmar R. Za¨ıane, Alexandru Coma, .Application of Data Mining Techniques for Medical Image
Classification.Proceeding of second International worshop on Mutimedia data mining(MDM/KDD’2001),in conjuction with ACM
SIGKDD conference.SAN FRANCISCO,USA,AUG 26,2001.

More Related Content

What's hot

Breast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning pptBreast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning ppt
AnkitGupta1476
 
IRJET - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...
IRJET  - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...IRJET  - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...
IRJET - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...
IRJET Journal
 
Breast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian NetworksBreast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian Networks
Bayesia USA
 
IRJET- Breast Cancer Prediction using Support Vector Machine
IRJET-  	  Breast Cancer Prediction using Support Vector MachineIRJET-  	  Breast Cancer Prediction using Support Vector Machine
IRJET- Breast Cancer Prediction using Support Vector Machine
IRJET Journal
 
Breast Cancer Detection using Convolution Neural Network
Breast Cancer Detection using Convolution Neural NetworkBreast Cancer Detection using Convolution Neural Network
Breast Cancer Detection using Convolution Neural Network
IRJET Journal
 
Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...
Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...
Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...
PAPIs.io
 
Segmentation of thermograms breast cancer tarek-to-slid share
Segmentation of thermograms breast cancer tarek-to-slid shareSegmentation of thermograms breast cancer tarek-to-slid share
Segmentation of thermograms breast cancer tarek-to-slid shareTarek Gaber
 
Breast cancer Detection using MATLAB
Breast cancer Detection using MATLABBreast cancer Detection using MATLAB
Breast cancer Detection using MATLAB
NupurRathi7
 
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEY
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEYBREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEY
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEY
ijdpsjournal
 
A Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast CancerA Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast Cancer
rahulmonikasharma
 
IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...
IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...
IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...
IRJET Journal
 
Breast cancer diagnosis and recurrence prediction using machine learning tech...
Breast cancer diagnosis and recurrence prediction using machine learning tech...Breast cancer diagnosis and recurrence prediction using machine learning tech...
Breast cancer diagnosis and recurrence prediction using machine learning tech...
eSAT Journals
 
Predictive Analysis of Breast Cancer Detection using Classification Algorithm
Predictive Analysis of Breast Cancer Detection using Classification AlgorithmPredictive Analysis of Breast Cancer Detection using Classification Algorithm
Predictive Analysis of Breast Cancer Detection using Classification Algorithm
Sushanti Acharya
 
Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...
cseij
 
Digital pathology in developing country
Digital pathology in developing countryDigital pathology in developing country
Digital pathology in developing country
Dr. Ashish lakhey
 
P033079086
P033079086P033079086
P033079086
ijceronline
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
nehaSingh1543
 
Breast cancer classification
Breast cancer classificationBreast cancer classification
Breast cancer classification
Ashwan Abdulmunem
 
Thermal Imaging: Opportunities and Challenges for Breast Cancer Detection
Thermal Imaging: Opportunities and Challenges for Breast Cancer DetectionThermal Imaging: Opportunities and Challenges for Breast Cancer Detection
Thermal Imaging: Opportunities and Challenges for Breast Cancer Detection
Tarek Gaber
 
On Predicting and Analyzing Breast Cancer using Data Mining Approach
On Predicting and Analyzing Breast Cancer using Data Mining ApproachOn Predicting and Analyzing Breast Cancer using Data Mining Approach
On Predicting and Analyzing Breast Cancer using Data Mining Approach
Masud Rana Basunia
 

What's hot (20)

Breast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning pptBreast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning ppt
 
IRJET - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...
IRJET  - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...IRJET  - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...
IRJET - Classifying Breast Cancer Tumour Type using Convolution Neural Netwo...
 
Breast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian NetworksBreast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian Networks
 
IRJET- Breast Cancer Prediction using Support Vector Machine
IRJET-  	  Breast Cancer Prediction using Support Vector MachineIRJET-  	  Breast Cancer Prediction using Support Vector Machine
IRJET- Breast Cancer Prediction using Support Vector Machine
 
Breast Cancer Detection using Convolution Neural Network
Breast Cancer Detection using Convolution Neural NetworkBreast Cancer Detection using Convolution Neural Network
Breast Cancer Detection using Convolution Neural Network
 
Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...
Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...
Applying Machine Learning Techniques to Breast Cancer Research - by Benjamin ...
 
Segmentation of thermograms breast cancer tarek-to-slid share
Segmentation of thermograms breast cancer tarek-to-slid shareSegmentation of thermograms breast cancer tarek-to-slid share
Segmentation of thermograms breast cancer tarek-to-slid share
 
Breast cancer Detection using MATLAB
Breast cancer Detection using MATLABBreast cancer Detection using MATLAB
Breast cancer Detection using MATLAB
 
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEY
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEYBREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEY
BREAST CANCER DIAGNOSIS USING MACHINE LEARNING ALGORITHMS –A SURVEY
 
A Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast CancerA Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast Cancer
 
IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...
IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...
IRJET- Comparison of Breast Cancer Detection using Probabilistic Neural Netwo...
 
Breast cancer diagnosis and recurrence prediction using machine learning tech...
Breast cancer diagnosis and recurrence prediction using machine learning tech...Breast cancer diagnosis and recurrence prediction using machine learning tech...
Breast cancer diagnosis and recurrence prediction using machine learning tech...
 
Predictive Analysis of Breast Cancer Detection using Classification Algorithm
Predictive Analysis of Breast Cancer Detection using Classification AlgorithmPredictive Analysis of Breast Cancer Detection using Classification Algorithm
Predictive Analysis of Breast Cancer Detection using Classification Algorithm
 
Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...
 
Digital pathology in developing country
Digital pathology in developing countryDigital pathology in developing country
Digital pathology in developing country
 
P033079086
P033079086P033079086
P033079086
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
 
Breast cancer classification
Breast cancer classificationBreast cancer classification
Breast cancer classification
 
Thermal Imaging: Opportunities and Challenges for Breast Cancer Detection
Thermal Imaging: Opportunities and Challenges for Breast Cancer DetectionThermal Imaging: Opportunities and Challenges for Breast Cancer Detection
Thermal Imaging: Opportunities and Challenges for Breast Cancer Detection
 
On Predicting and Analyzing Breast Cancer using Data Mining Approach
On Predicting and Analyzing Breast Cancer using Data Mining ApproachOn Predicting and Analyzing Breast Cancer using Data Mining Approach
On Predicting and Analyzing Breast Cancer using Data Mining Approach
 

Viewers also liked

Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...
Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...
Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...
IOSR Journals
 
A Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc Network
A Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc NetworkA Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc Network
A Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc Network
IOSR Journals
 
Numerical solution of heat equation through double interpolation
Numerical solution of heat equation through double interpolationNumerical solution of heat equation through double interpolation
Numerical solution of heat equation through double interpolation
IOSR Journals
 
Structure and transport coefficients of liquid Argon and neon using molecular...
Structure and transport coefficients of liquid Argon and neon using molecular...Structure and transport coefficients of liquid Argon and neon using molecular...
Structure and transport coefficients of liquid Argon and neon using molecular...
IOSR Journals
 
Modeling & Testing Of Hybrid Composite Laminate
Modeling & Testing Of Hybrid Composite LaminateModeling & Testing Of Hybrid Composite Laminate
Modeling & Testing Of Hybrid Composite Laminate
IOSR Journals
 
Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...
Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...
Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...
IOSR Journals
 
K010416167
K010416167K010416167
K010416167
IOSR Journals
 
Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...
Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...
Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...
IOSR Journals
 
How effective are the interventions related to physical activities and nutrit...
How effective are the interventions related to physical activities and nutrit...How effective are the interventions related to physical activities and nutrit...
How effective are the interventions related to physical activities and nutrit...
IOSR Journals
 
Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...
Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...
Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...
IOSR Journals
 
The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...
The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...
The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...
IOSR Journals
 
Energy Policy in India
Energy Policy in IndiaEnergy Policy in India
Energy Policy in India
IOSR Journals
 
Performance Evaluation of IEEE STD 802.16d Transceiver
Performance Evaluation of IEEE STD 802.16d TransceiverPerformance Evaluation of IEEE STD 802.16d Transceiver
Performance Evaluation of IEEE STD 802.16d Transceiver
IOSR Journals
 
OSPCV: Off-line Signature Verification using Principal Component Variances
OSPCV: Off-line Signature Verification using Principal Component VariancesOSPCV: Off-line Signature Verification using Principal Component Variances
OSPCV: Off-line Signature Verification using Principal Component Variances
IOSR Journals
 
Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...
Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...
Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...
IOSR Journals
 
B010510613
B010510613B010510613
B010510613
IOSR Journals
 
Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...
Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...
Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...
IOSR Journals
 
Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...
Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...
Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...
IOSR Journals
 
Kinetic study of free and immobilized protease from Aspergillus sp.
Kinetic study of free and immobilized protease from Aspergillus sp.Kinetic study of free and immobilized protease from Aspergillus sp.
Kinetic study of free and immobilized protease from Aspergillus sp.
IOSR Journals
 

Viewers also liked (20)

Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...
Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...
Old wine in new wineskins: Revisiting counselling in traditional Ndebele and ...
 
A Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc Network
A Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc NetworkA Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc Network
A Survey Paper on Cluster Head Selection Techniques for Mobile Ad-Hoc Network
 
Numerical solution of heat equation through double interpolation
Numerical solution of heat equation through double interpolationNumerical solution of heat equation through double interpolation
Numerical solution of heat equation through double interpolation
 
I0744347
I0744347I0744347
I0744347
 
Structure and transport coefficients of liquid Argon and neon using molecular...
Structure and transport coefficients of liquid Argon and neon using molecular...Structure and transport coefficients of liquid Argon and neon using molecular...
Structure and transport coefficients of liquid Argon and neon using molecular...
 
Modeling & Testing Of Hybrid Composite Laminate
Modeling & Testing Of Hybrid Composite LaminateModeling & Testing Of Hybrid Composite Laminate
Modeling & Testing Of Hybrid Composite Laminate
 
Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...
Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...
Micellar Effect On Dephosphorylation Of Bis-4-Chloro-3,5-Dimethylphenylphosph...
 
K010416167
K010416167K010416167
K010416167
 
Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...
Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...
Industry- Academia Collaboration through Cloud Computing: A Pathway for Susta...
 
How effective are the interventions related to physical activities and nutrit...
How effective are the interventions related to physical activities and nutrit...How effective are the interventions related to physical activities and nutrit...
How effective are the interventions related to physical activities and nutrit...
 
Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...
Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...
Marketing Analysis of onion in Bade and Geidam Local Government Areas of Yobe...
 
The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...
The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...
The Reliability Study of 11-Kv Distribution Feeders: A Case Study of Idi-Arab...
 
Energy Policy in India
Energy Policy in IndiaEnergy Policy in India
Energy Policy in India
 
Performance Evaluation of IEEE STD 802.16d Transceiver
Performance Evaluation of IEEE STD 802.16d TransceiverPerformance Evaluation of IEEE STD 802.16d Transceiver
Performance Evaluation of IEEE STD 802.16d Transceiver
 
OSPCV: Off-line Signature Verification using Principal Component Variances
OSPCV: Off-line Signature Verification using Principal Component VariancesOSPCV: Off-line Signature Verification using Principal Component Variances
OSPCV: Off-line Signature Verification using Principal Component Variances
 
Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...
Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...
Chemical Investigations of Some Commercial Samples of Calcium Based Ayurvedic...
 
B010510613
B010510613B010510613
B010510613
 
Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...
Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...
Measurement of Efficiency Level in Nigerian Seaport after Reform Policy Imple...
 
Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...
Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...
Effect of Planar Solid Shear Wall - Frame Arrangement on the Deformation Beha...
 
Kinetic study of free and immobilized protease from Aspergillus sp.
Kinetic study of free and immobilized protease from Aspergillus sp.Kinetic study of free and immobilized protease from Aspergillus sp.
Kinetic study of free and immobilized protease from Aspergillus sp.
 

Similar to Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis

Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...
Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...
Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...
IRJET Journal
 
Comparison of breast cancer classification models on Wisconsin dataset
Comparison of breast cancer classification models on Wisconsin  datasetComparison of breast cancer classification models on Wisconsin  dataset
Comparison of breast cancer classification models on Wisconsin dataset
International Journal of Reconfigurable and Embedded Systems
 
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASEUSING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
IJCSEIT Journal
 
Breast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine LearningBreast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine Learning
IRJET Journal
 
Breast Cancer detection.pptx
Breast Cancer detection.pptxBreast Cancer detection.pptx
Breast Cancer detection.pptx
CHANDRAJEETJHA4
 
Classification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataClassification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer Data
IIRindia
 
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
IJTET Journal
 
Logistic Regression Model for Predicting the Malignancy of Breast Cancer
Logistic Regression Model for Predicting the Malignancy of Breast CancerLogistic Regression Model for Predicting the Malignancy of Breast Cancer
Logistic Regression Model for Predicting the Malignancy of Breast Cancer
IRJET Journal
 
A Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer DetectionA Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer Detection
IRJET Journal
 
A Review Of Breast Cancer Classification And Detection Techniques
A Review Of Breast Cancer Classification And Detection TechniquesA Review Of Breast Cancer Classification And Detection Techniques
A Review Of Breast Cancer Classification And Detection Techniques
Lori Moore
 
Deep Learning Techniques for Breast Cancer Risk Prediction.pptx
Deep Learning Techniques for Breast Cancer Risk Prediction.pptxDeep Learning Techniques for Breast Cancer Risk Prediction.pptx
Deep Learning Techniques for Breast Cancer Risk Prediction.pptx
Anuraag Moharana
 
Datamining in BreastCancer.pptx
Datamining in BreastCancer.pptxDatamining in BreastCancer.pptx
Datamining in BreastCancer.pptx
MaligireddyTanujaRed1
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
mlaij
 
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
mlaij
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
mlaij
 
Image processing and machine learning techniques used in computer-aided dete...
Image processing and machine learning techniques  used in computer-aided dete...Image processing and machine learning techniques  used in computer-aided dete...
Image processing and machine learning techniques used in computer-aided dete...
IJECEIAES
 
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
IRJET Journal
 
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUESPREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
IAEME Publication
 
The Evolution and Impact of Medical Science Journals in Advancing Healthcare
The Evolution and Impact of Medical Science Journals in Advancing HealthcareThe Evolution and Impact of Medical Science Journals in Advancing Healthcare
The Evolution and Impact of Medical Science Journals in Advancing Healthcare
sana473753
 
journals on medical
journals on medicaljournals on medical
journals on medical
sana473753
 

Similar to Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis (20)

Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...
Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...
Performance Evaluation using Supervised Learning Algorithms for Breast Cancer...
 
Comparison of breast cancer classification models on Wisconsin dataset
Comparison of breast cancer classification models on Wisconsin  datasetComparison of breast cancer classification models on Wisconsin  dataset
Comparison of breast cancer classification models on Wisconsin dataset
 
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASEUSING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
 
Breast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine LearningBreast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine Learning
 
Breast Cancer detection.pptx
Breast Cancer detection.pptxBreast Cancer detection.pptx
Breast Cancer detection.pptx
 
Classification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataClassification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer Data
 
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
 
Logistic Regression Model for Predicting the Malignancy of Breast Cancer
Logistic Regression Model for Predicting the Malignancy of Breast CancerLogistic Regression Model for Predicting the Malignancy of Breast Cancer
Logistic Regression Model for Predicting the Malignancy of Breast Cancer
 
A Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer DetectionA Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer Detection
 
A Review Of Breast Cancer Classification And Detection Techniques
A Review Of Breast Cancer Classification And Detection TechniquesA Review Of Breast Cancer Classification And Detection Techniques
A Review Of Breast Cancer Classification And Detection Techniques
 
Deep Learning Techniques for Breast Cancer Risk Prediction.pptx
Deep Learning Techniques for Breast Cancer Risk Prediction.pptxDeep Learning Techniques for Breast Cancer Risk Prediction.pptx
Deep Learning Techniques for Breast Cancer Risk Prediction.pptx
 
Datamining in BreastCancer.pptx
Datamining in BreastCancer.pptxDatamining in BreastCancer.pptx
Datamining in BreastCancer.pptx
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
 
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
 
Image processing and machine learning techniques used in computer-aided dete...
Image processing and machine learning techniques  used in computer-aided dete...Image processing and machine learning techniques  used in computer-aided dete...
Image processing and machine learning techniques used in computer-aided dete...
 
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
 
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUESPREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
 
The Evolution and Impact of Medical Science Journals in Advancing Healthcare
The Evolution and Impact of Medical Science Journals in Advancing HealthcareThe Evolution and Impact of Medical Science Journals in Advancing Healthcare
The Evolution and Impact of Medical Science Journals in Advancing Healthcare
 
journals on medical
journals on medicaljournals on medical
journals on medical
 

More from IOSR Journals

A011140104
A011140104A011140104
A011140104
IOSR Journals
 
M0111397100
M0111397100M0111397100
M0111397100
IOSR Journals
 
L011138596
L011138596L011138596
L011138596
IOSR Journals
 
K011138084
K011138084K011138084
K011138084
IOSR Journals
 
J011137479
J011137479J011137479
J011137479
IOSR Journals
 
I011136673
I011136673I011136673
I011136673
IOSR Journals
 
G011134454
G011134454G011134454
G011134454
IOSR Journals
 
H011135565
H011135565H011135565
H011135565
IOSR Journals
 
F011134043
F011134043F011134043
F011134043
IOSR Journals
 
E011133639
E011133639E011133639
E011133639
IOSR Journals
 
D011132635
D011132635D011132635
D011132635
IOSR Journals
 
C011131925
C011131925C011131925
C011131925
IOSR Journals
 
B011130918
B011130918B011130918
B011130918
IOSR Journals
 
A011130108
A011130108A011130108
A011130108
IOSR Journals
 
I011125160
I011125160I011125160
I011125160
IOSR Journals
 
H011124050
H011124050H011124050
H011124050
IOSR Journals
 
G011123539
G011123539G011123539
G011123539
IOSR Journals
 
F011123134
F011123134F011123134
F011123134
IOSR Journals
 
E011122530
E011122530E011122530
E011122530
IOSR Journals
 
D011121524
D011121524D011121524
D011121524
IOSR Journals
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
Robbie Edward Sayers
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
Divya Somashekar
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 

Recently uploaded (20)

Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 

Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 5 (Nov. - Dec. 2013), PP 39-44 www.iosrjournals.org www.iosrjournals.org 39 | Page Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis R.M. Chandrasekar Ph.D, V. Palaniammal M.C.A., M.Phil. (Professor, Computer Science, Annamalai University, Chidambaram, Tamil Nadu, India.) (Asst. Professor, Thiruvalluvar University College of Arts & Science, Kallakurichi, Tamil Nadu, India.) Abstract: We analyze the breast Cancer data available from the WBC, WDBC from UCI machine learning with the aim of developing accurate prediction models for breast cancer using data mining techniques. Data mining has, for good reason, recently attracted a lot of attention, it is a new Technology, tackling new problem, with great potential for valuable commercial and scientific discoveries. The experiments are conducted in WEKA. Several data mining classification techniques were used on the proposed data. There are many classification techniques in data mining such as Decision Tree, Rules NNge, Tree random forest, Random Tree, lazy IBK. The aim of this paper is to investigate the performance of different classification techniques. The data breast cancer data with a total 286 rows and 10 columns will be used to test and justify the different between the classification methods and algorithm. Keywords - Machine learning, data mining Weka, classification, breast cancer I. Introduction About 1 in 8 U.S women (just under 12%) will develop invasive breast cancer over the course of her life. In 2011, an estimated 230,480 new cases of invasive breast cancer were expected to be diagnosed in women in the U.S, along with 57,650 new cases of non-invasive (in situ) breast cancer. For women in the U.S breast cancer death rates are higher than those for any other cancer besides lung cancer. In 2011 there were more than 2.6million breast cancer survivor in the U.S.A women’s risk of breast cancer approximately relative (mother, sister, daughter) who has been diagnosed with breast cancer. About 15% of women who get breast cancer have a family member diagnosed with it. About 85% of breast cancers occur in women who have no family history of breast cancer. These occur due to genetic mutations that happen as a result of the aging process and life in general. Breast cancer (An overview): Breast Cancer is the most common cancer diseases among women excluding no melanoma skin. Cancers are divided into two types benign and Malignant. If the cancer is benign under the conditions of early diagnosis. Malignancy status includes the three basic measurement 1.Age 2.Longer tumor length 3.ADC or Apparent Diffusion coefficient (biopsy confirmed).Attributes 1.patient id 2.diagnosis m=malignant ,b=benign 3.Real valued features. Cell uncles’ 1.radius 2.texture 3.perimeter 4.Area 5.Smoothness.I Diagnosis Attributes are 1.Menopausal status a. Premenopausal, b.post menopausal. II.Basic diagnosis :-1.clinical examinations,2.mammography (y/n),3.MRI (y/n),4.Ultrasound (y/n),5.Fine Needle aspiration (y/n),6.Core biopsy (y/n),7.Open biopsy (y/n) ,8.Other (y/n),9.Unknown (y/n).III Clinical Trial Enrollment:-y/n/Not stated/inadequate. IV. Initial Representation:-Screening-mmaaography,Screening-MRI,Screening- Others,Symptomatic.V.Total extent of Lesion (DCIS AND INVASIVE) 1.Lesion in mm.VI.Lymphovascular Invasion(presence of tumor cells in endothelium-lined spaces) 1.present/absent/suspicious/not stated /unknown. Number stages of Breast cancer:- There are 4 number stages of breast cancer, staging takes into various factors, including 1.The size of the tumor (tumor means either breast lymph’s or Area of cancer cells found on a Scan or mammogram) 2.Cancer cells have spread into nearby lymph glands (lymph Node) 3.The tumor cells has spread to any other part of the body (Metastasized-TNM). Stage1 breast cancer is split into 2 Stages.Stage1A tumor is 2 cm or smaller and has spread outside the breast.Stage1B: Small areas of breast Cancer cells are found in the lymph node closed to the breast and either the tumor is 2 cm or smaller. Stage 2 breast cancer: stage 2A:Tumor 2cm or smaller in the breast and cancer cells are found in 1 to 3 lymph Node in the armpit or in the lymph Node near the breast bone. Stage 2B: The tumor is larger than 2 cm but not larger than 5 cm and small areas of cancer cells are in the lymph Node.2 to 5 cm spread 1 to 3 lymph nodes in the armpit or near the breastbones or the tumor is larger than 5cm and has not spread to the lymph node. Stage 3A breast cancer No tumor is seen in the breast or the tumor may be any size and cancer is found in 4 to 9 lymph glands under the arm or in the lymph glands near the breast bone. The tumor is more than 5 cm and has spread into up to 3 lymph nodes near the breast bone.
  • 2. Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis www.iosrjournals.org 40 | Page Stage 3B: The tumor has spread to the skin of the breast or to the chest wall and made the kin break down or cancer swelling. The cancer may have spread to the up 9 lymph Node in the armpit or to the lymph glands near the breast bone. Stage 4 breast cancer: The tumor can be any size. The lymph Node may or may not contain cancer cells. The cancer has spread to the other parts of the body such as bone, lungs, liver, and brain. The TNM (Tumor, Node, and Metastasis) system is specifies for each type of cancer. Once a patient‘s T, N, and M categories have been determined, this information is combined in a process called stage grouping to determine a women disease stage.Stage0 (the least advantage stage) to Stage4 (the most advantage stage). II. Headings Existing System: The data sets used are SEER data or Wisconsin data. Data pre-processing is applied before data mining to improve the quality of the data. Data pre-processing includes data cleaning, data integration, data transformation and data reduction techniques. The features used for classification purposes coincided with the Breast Imaging Reporting and Data System (BI-RADS) as this is how radiologists classify breast cancer. The BI-RADS features of density, mass shape, mass margin and abnormality assessment rank are used as they have been proven to provide good classification accuracy. A Classification method, Decision tree algorithms are widely used in medical field to classify the medical data for diagnosis. Feature Selection increases the accuracy of the Classifier because it eliminates irrelevant attributes. Feature selection with decision tree classification greatly enhances the quality of the data in medical diagnosis. CART algorithm with various feature selection methods to find out whether the same feature selection method may lead to best accuracy on various datasets of same domain. Artificial neural networks (ANNs) and support vector machines have been recently proposed as a very effective method for pattern recognition, machine learning and data mining. The discrimination capability of the features extracted from the sonograms was tested by using the SVM (support vector machine), ANN and KNN (K-nearest neighbor) classifier. It was found that the SVM gave the greatest accuracy while the ANN had the highest sensitivity. Back propagation neural network (BPNN) and radial basis function network (RBFN) were used for the training and testing of data. Algorithm: 1. CART Algorithm 2. Decision tree algorithms 3. ID3 4. C4.5 decision tree algorithms 5. Hunt’s algorithm 6. SVM 7. Naïve Bayes classifier. 8. The back-propagated neural network. Proposed System: Early diagnosis needs an accurate and reliable diagnosis procedure that can be used by physicians to distinguish benign breast tumors from malignant ones without going for surgical biopsy. The objective of these predictions is to assign patients to one of the two group either a “benign” that is noncancerous or a “malignant” that is cancerous. The prognosis problem is the long-term care for the disease for patients whose cancer has been surgically removed. In this paper, we propose a model-based data mining technique with a neural network classification technique and the improvements possible using an ensemble approach. Ensembles have been proposed as a mechanism for improving the classification accuracy of existing classifiers providing that constituents are diverse. Contrasting the cluster analysis technique against a baseline neural network classifier, and then considers the effects of applying an ensemble technique to improve the accuracies. In different cases the state of the disease condition itself can be marked by stages where the diagnostic symptoms or signs can be subtle or different to other stages of the disease. This means that there is often not a clean mapping between the diagnostic features and the diagnosis. The usage of clustering has also been extended to classifiers and detection systems in order to improve detection and provide greater classification accuracy. Use of a feature selection technique with a decision tree classifier to classify breast cancer. Another advantage of the model-based clustering approach is that no decisions have to be made about the scaling of the observed variables: for instance, when working with normal distributions with unknown variances, the results will be the same irrespective of whether the variables are normalized or not. In order to fasten the classification we are using Artificial Neural Network (ANN). Algorithm: 1. Decision tree algorithms – For feature selection
  • 3. Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis www.iosrjournals.org 41 | Page 2. C4.5 Algorithm – For classification 3. ANN (Artificial Neural Network) Software Requirement:  The WEKA is an ensemble of tools for data classification, regression, clustering, association rules, and visualization. The toolkit is developed in Java and is an open source software issued under the GNU General Public License.  WBCD dataset. III. Figures And Tables Characteristics of database: Wisconsin diagnostics Breast Cancer this database is created by William H.Wolberge at University of Wisconsin. This database contains 286 observations among which 201 are benign cases and 85 are malignant cases. These instances are described by 10 Attributes:-.class no-recurrence events, recurrence events, 2.age, 3.Menopause, 4.tumor_size, 5.inv_nodes, 6.node_cap, 7.deg_malig, 8.breast, 9.breast_quad, 10.irradiat Table1: Attributes Values Age 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99 menopause lt40, ge40, premeno tumor-size 0-4, 5-9, 10-14, 15-19, 20-24, 25-29, 30-34, 35-39, 40- 44,45-49, 50-54,55-59 inv-nodes 0-2, 3-5, 6-8, 9-11, 12-14, 15-17, 18-20, 21-23, 24-26,27-29, 30-32, 33-35,36-39 node-caps yes, no deg-malig 1, 2, 3 Breast left, right breast-quad left-up, left-low, right-up, right-low, central irradiat yes, no Class no-recurrence-events, recurrence-events Classification prediction process in data mining: Data mining is collection of techniques for efficient automated discovery of previously unknown valid novel, useful and understandable pattern in large databases. Classification is the process of minding a model that describes and distinguishes data classes concepts. This derived model is based on the analysis of a set of training data. Classification predicts categorial labels predictable numerical data values rather than class lables. We used several classification methods resulting in identification of top 5 classification alogorithm. This paper presents a brief description of the classifiers and meta_classifiers used in the experiments results. Fig1: Classification prediction process in data mining Rules NNge (Non_Nested Generalised Examplars): NNge Classifier new examples by determining the nearest neighbour in the exemplar database using a Euclidean distance function.
  • 4. Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis www.iosrjournals.org 42 | Page The function is 𝐷 𝐸𝐻 = 𝑊 𝐻 𝑊 𝐼 𝐸 𝐼−𝐻 𝐼 𝑀𝐴𝑋 𝐼−𝑀𝐼𝑁 𝐼 𝑀 𝐼=1 2 Lazy K Star classifier Lasy kstar algorithm most successful classifier. Implementation of an instance based classifier, which uses the entropic distance measures. Test instance is based upon the class of those training instance. Lazy IBK (K_Nearest Neighbours classifier): IBK classifier is simple instance based learner that uses the class of nearest K training instances for the class of the test instance. Predicts the class of the single nearest training instance for each test instance. Get maximum no of instances allowed in the training pool. Meta Decorate (Diverse Ensemble creation by oppositional relabeling of artificial Training Examples): Meta learners such as Boosting Bagging and Random Forest provide diversity by sub sampling or reweighting the existing training data. Decorate performs by adding randomly constructed to the training set,when building new members. 𝑑𝑗 𝑥 = 0 𝑖𝑓𝑐𝑗 𝑥 1 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 = 𝑐∗ 𝑥 𝐷 = 1 𝑁𝑀 𝑑𝑗 𝑥𝑖 𝑛 𝑖=1 𝑚 𝑗=1 𝑐∗ 𝑥 =arg max 𝑝 𝑘 𝑥 ,kє{1…….Q) 𝑝 𝑘 𝑥 = 𝑝𝑐3,𝑘 𝑥𝑐 𝑗𝜖 𝑐∗ / 𝑐∗ Tree Random Forest: Random Forest classifier consists of multiple decision trees. The final class of an instance in a Random forest is assigned by outputting the class that is the mode of the outputs of individual trees. Which can produce robust and accurate classification. Random classifiers is based on a combination of many decision Tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. RF has excellent accuracy among current classifier Algorithm. Fig2:Block diagram of the survival detection system Table2: Simulation result of each Algorithm Algorithm Total instance 286 Correctly classify Instance(%value) Incorrectly classify Instance(%value) Time taken (seconds) Kuppa status Rule NNge 249 97.2028 8 2.7972 0.16 0.933 Random Forest 280 97.9021 6 2.0979 0.11 0.9501 Meta Decorate 259 90.5594 27 9.4406 1.81 0.7584 Lazy IBK 280 97.9021 6 2.0979 0 0.9491 Lazy Kstar 280 97.9021 6 2.0979 0 0.9491 Random Tree 280 97.9021 6 2.0979 0.02 0.9491
  • 5. Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis www.iosrjournals.org 43 | Page Classification  Knm  SVM  Neural N/W  Decision Tree Classifier Output Begin or Malignant  Pixel analysis  Intensity Analysis  Texture Analysis Feature Extraction  Fuzzy C-means  K-means clustering  Free hand clustering  Active contour clustering Segmentation 1. Image filter  Linear filter  Median Filter  Adaptive Filter 2. Images resize 3. Contrast enhancement  Run time  Cache enhancement algorithm Input Image Pre-processing  Read single image  Runtime read image  Read multiple images Table3: Training and Simulation errors: Algorithm Total instance 286 Mean Absolute Error % Root Mean Squared Error % Relative absolute Error% Root Relative Squared Error% Rules NNge 0.028 0.1672 6.6868 36.5947 Random Forest 0.0978 01614 23.3712 35.3247 Meta Decorate 0.3655 0.3797 87.3643 83.083 Lazy-IBK 0.0253 0.1053 6.0487 23.0348 Lazy-Kstar 0.0747 0.1399 17.8681 30.6094 Random Tree 0.0221 0.1052 5.2937 23.0237 Architecture Diagram
  • 6. Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis www.iosrjournals.org 44 | Page IV. Conclusion This paper presents effective classification Techniques. After investigation of different classification Algorithm we have chosen 6 classifier based on our simulation performance and we have used Tree Random classifier achieved overall classification accuracy 98%, which is significanant. In future work we propose to analyze Ensemble classifier for 100% accuracy. Acknowledgements The authors like to express our appreciations to prof. R.M. Chandrasekar for his help in correcting earlier versions of this paper. We also would like to thank the anonymous reviewers for their valuable comments and in sightful suggestions. References [1] Mitchell, T.M. 1997. Machine Learning. McGraw-Hill Science [2] John, G., Cleary, E. & Leonard, E. 1995. K*: An Instance-based Learner Using an Entropic Distance Measure. In: 12th International Conference on Machine Learning, 108-114. [3] Aha, D. & Kibler, D. 1991. Instance-based learning algorithms. Machine Learning. [4] American Cancer Societies. Breast Cancer Facts& Figures 2005-2006. Atlanta: American Cancer Society, Inc. (http://www.cancer.org/). [5] National Resource. Cancer Epidemiology Biomarkers & Prevention 1999; 8:1117-1121. [6] Houston, Andrea L. and Chen, et. al.. Medical Data Mining on the Internet: Research on a Cancer Information System. Artificial Intelligence Review 1999; 13:437-466. [7] Lundin M, Lundin J, Burke HB, Toikkanen S Pylkkanen L, Joensuu H. Artificial neural Networks applied to survival prediction in breast cancer. Oncology 1999; 57:281-6. [8] Delen D, Walker G, Kadam A. Predicting breast cancer survivability: a comparison of three data Mining methods. Artificial Intelligence in Medicine. 2005 Jun; 34(2):113-27. [9] Ian H. Witten and Eibe Frank. Data Mining: Practical machine learning tools and techniques, 2nd Edition. San Francisco: Morgan Kaufmann; 2005. [10] Weka: Data Mining Software in Java,http://www.cs.waikato.ac.nz/ml/weka [11] lando Anunciac¸ ˜ao and Bruno C. Gomes and Susana Vinga and Jorge Gaspar and Arlindo L. Oliveira and Jos´e Rueff , A Data Mining Approach for the detection of High-Risk Breast Cancer Groups [12] aria-Luiza Antonie, Osmar R. Za¨ıane, Alexandru Coma, .Application of Data Mining Techniques for Medical Image Classification.Proceeding of second International worshop on Mutimedia data mining(MDM/KDD’2001),in conjuction with ACM SIGKDD conference.SAN FRANCISCO,USA,AUG 26,2001.