SlideShare a Scribd company logo
1 of 4
Download to read offline
Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35
www.ijera.com 32 | P a g e
Analysis On Classification Techniques In Mammographic Mass
Data Set
Mrs. K. K. Kavitha *, Dr. A Kangaiammal**, Mr. K. Satheesh***
*(Department of Computer Science, Selvamm Arts & Science College, Namakkal-03)
** (Department of Computer Science, Government Arts College, Salem-07)
***(Department of Computer Science, Selvamm Arts & Science College, Namakkal-03)
ABSTRACT
Data mining, the extraction of hidden information from large databases, is to predict future trends and behaviors,
allowing businesses to make proactive, knowledge-driven decisions. Data-Mining classification techniques deals
with determining to which group each data instances are associated with. It can deal with a wide variety of data
so that large amount of data can be involved in processing. This paper deals with analysis on various data mining
classification techniques such as Decision Tree Induction, Naïve Bayes , k-Nearest Neighbour (KNN) classifiers
in mammographic mass dataset.
Keywords - Classification , Data Mining, Decision Tree , Mammographic Dataset, Naïve Bayes Classifier.
I. INTRODUCTION
The field of data mining is an emerging
research area with important applications in
Engineering, Science, Medicine, Business and
Education[9]. The size of data base in educational
application is large where the number of records in a
data set can vary from some thousand to
thousand of millions. The size of data is
accumulated from different fields exponentially
increasing. Data mining has been used in different
methods at the intersection of Machine Learning,
Artificial Intelligence, Statistics and Database
Systems.
Cancer begins in cells, the building blocks that
make up tissues. Tissues make up the breasts and
other parts of the body. Normal cells grow and divide
to form new cells as the body needs them. When
normal cells grow old or get damaged, they die, and
new cells take their place. Sometimes, this process
leads to wrong results. New cells form when the
body does not need them, and old or damaged cells
do not die as they should. The built-up of extra cells
often forms a mass of tissue called a lump, growth, or
tumor. Cancer that forms in the tissues of breast,
usually in the ducts (tubes that carry milk to the
nipple) and in the lobules (glands that make milk) is
the breast cancer [1]. It occurs in both men and
women; although in male, breast cancer is very rare.
Breast cancer is one of the leading cancers in the
female population. About 25% of all cancers
diagnosed in women are breast cancers and about
20% of all lethal cancers are breast cancers.
Mammography has been proven to be the most
reliable method and it is the key screening tool for
the early detection of breast cancer. Mammography
is highly accurate. On average, mammography will
detect about 80–90% of the breast cancers in women
without any recognizable symptoms.
Classification technique is capable of processing a
wider variety of data and is growing in popularity.
This paper deals with analysis on various data mining
classification techniques in mammographic mass
datasets.
The next section presented is Decision Tree
Induction. Section II describes Naïve Bayesian
Classification whereas k-nearest neighbor (KNN)
classifier is described in section III. Finally, the last
section concludes this work.
II. Decision tree induction
Decision Tree Induction is the learning of
decision trees from class-labeled training tuples. A
decision tree is a flowchart-like tree structure, where
each internal node denotes a test on an attribute,
each branch represents an outcome of the test, and
each leaf node holds a class label[9],[4].
The feature that best divides the training data would
be the root node of the tree. There are numerous
methods for finding the feature that best divides the
training data such as information gain (Hunt et al.,
1966) and gini index (Breiman et al., 1984). While
myopic measures estimate each attribute
independently, Relief algorithm (Kononenko, 1994)
estimates them in the context of other attributes.
However, a majority of studies have concluded that
there is no single best method (Murthy,1998).
RESEARCH ARTICLE OPEN ACCESS
Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35
www.ijera.com 33 | P a g e
Figure 1: Decision Tree for Breast Cancer
The basic algorithm for decision tree induction
is a greedy algorithm that constructs decision trees in
a top-down recursive divide-and-conquer manner
[9]. Algorithm for decision tree is described in Fig.
2.
Figure 2: Decision Tree Algorithm
III. NAÏVE BAYESIAN
CLASSIFICATION
Naïve Bayes is a simple technique for
constructing classifier models that assign class labels
to problem instances, represented as vectors of
feature values, where the class labels are drawn from
some finite set. It is not a single algorithm for
training such classifiers, but a family of algorithms
based on a common principle: all naive Bayes
classifiers assume that the value of a particular
feature is independent of the value of any other
feature, given the class variable[13].
Abstractly, naive Bayes is a conditional
probability model: given a problem instance to be
classified, represented by a vector x= (x ,….,x )
representing some n features (dependent variables),
it assigns to this instance probabilities for each of K
possible outcomes or classes.
Using Bayes theorem, the conditional
probability can be decomposed as:
(1)
In general, using Bayesian probability
terminology, the above equation can be written as:
(2)
Using the chain rule for repeated applications of
the definition of conditional probability[11]:
(3)
One can easily estimate the probabilities from
the training tuples.
IV. KNN CLASSIFICATION
K Nearest Neighbors (KNN) is a simple
algorithm that stores all available cases and
classifies new cases based on a specific similarity
measure (e.g., distance functions). KNN has been
used in statistical estimation and pattern
recognition[10].
Nearest neighbor classifiers are based on
learning by analogy. The training samples are
described by n dimensional numeric attributes. Each
sample represents a point in an n-dimensional space.
Similarly, all of the training samples are stored in an
n-dimensional pattern space. When given an
unknown sample, a k-nearest neighbor classifier
searches the pattern space for the k- training samples
that are closest to the unknown sample.
A case is classified by a majority vote of its
neighbors, with the case being assigned to the class
most common amongst its K-nearest neighbors
measured by a distance function. If K = 1, then the
case is simply assigned to the class of its nearest
neighbor[6]:
(4)
It should also be noted that all three distance
measures are only valid for continuous variables. In
the instance of categorical variables the Hamming
distance must be used. It also brings up the issue of
Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35
www.ijera.com 34 | P a g e
standardization of the numerical variables between 0
and 1 when there is a mixture of numerical and
categorical variables in the dataset.
(5)
The k-nearest neighbors algorithm is amongst
the simplest of all machine learning algorithms. An
object is classified by a majority vote of its
neighbors, with the object being assigned to the class
most common amongst its k-nearest neighbors. k is a
positive integer, typically small. If k= 1, then the
object is simply assigned to the class of its nearest
neighbor. In binary classification problems (two
class), it is helpful to choose k to be an odd number
as this avoids tie in votes.
The same method can be used for regression, by
simply assigning the property value for the object to
be the average of the values of its k-nearest
neighbors. It can be useful to weigh the
contributions of the neighbors, so that the nearer
neighbors contribute more to the average than the
more distant ones.
The neighbors are taken from a set of objects for
which the correct classification is known. This can
be thought of as the training set for the algorithm,
though no explicit training step is required. In order
to identify neighbors, the objects are represented by
position vectors in a multidimensional feature space.
It is usual to use the Euclidian distance, though other
distance measures, such as the Manhanttan distance
could be used in principle instead. The k-nearest
neighbor algorithm is sensitive to the local structure
of the data.
V. EXPERIMENTAL RESULTS
1. DATASET ACQUISITION
This breast cancer domain was obtained from
the University Medical Centre, Institute of
Oncology, and Ljubljana, Yugoslavia. This data set
includes 201 instances of one class and 85 instances
of another class. The instances are described by 9
attributes, some of which are linear and some are
nominal.
2. TOOL
In this paper, we applied weka tool for finding
classification accuracy. Weka is a collection of
machine learning algorithms for data mining tasks.
The algorithms can either be applied directly to a
dataset or called from your own Java code. Weka
contains tools for data pre-processing, classification,
regression, clustering, association rules, and
visualization, etc.. It is also well-suited for
developing new machine learning schemes[14].
VI. PERFORMANCE EVALUATION
In this paper, we discussed about three major
data mining classification algorithms. Now, we had
given some results by applying these algorithms in
the following table with some classification errors
such as MAE, RMSE, RAE and so on.
Table 1 Comparison of Classification Algorithms
Now, we represent the comparison results in the
following fig. 3 through 5.
i) Accuracy:
Figure 3: Accuracy
ii) Mean Absolute Error (MAE) and
Root Mean Square Error (RMSE):
Figure 4: MAE & RMSE
72
73
74
75
76
77
78
79
80
Accuracy(%)
Decision Tree
Naïve Bayes
KNN
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
MAE
RMSE
Accuracy(%) MAE RMSE RAE(%)
Decision
Tree
75.8 0.3658 0.4269 87.44
Naïve
Bayes
75.1 0.3012 0.4278 72.00
KNN 80 0.2679 0.3671 64.04
Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com
ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35
www.ijera.com 35 | P a g e
iii) Relative Absolute Error:
Figure 5: RAE
VII. CONCLUSION
From the above analysis, it is concluded that
KNN Classifier provides high accuracy than
Decision Tree approach and the Naïve Bayes
classifier. In future, this work can extended for
implementing these algorithms for uncertain
datasets.
REFERENCES
Books:
[1] Aley Abraham,Blessings of My Breast
Cancer, Innova Publishing, 2012.
[2] Arun K Pujari, Data mining Technique,
Published by Universities Press (I) Pvt. Ltd,
Hyderabad, India, 2001.
[3] Jiawei Han and Micheline Kamber,Data
Mining Concepts and Techniques, Second
Edition, Elsevier Inc., 2006.
Journal Papers:
[4] Bharati M. Ramageri, Data Mining
Techniques and Application, Indian
Journal of Computer Science and
Engineering, Vol. 1 No. 4, pp. 301-305.
[5] Jiazhen He, Yang Zhang, Xue Li & Peng
Shi, Learning naive Bayes classifiers from
positive and unlabelled examples with
uncertainty, International Journal of
Systems Science, Volume 43, Issue 10,
2012.
[6] Kotsiantis. S. B.,Supervised Machine
Learning: A Review of Classification
Techniques, Informatica, pp.249-268, 2007.
[7] Narasimha Murty, M., Susheela Devi, V.
Pattern Recognition: An Algorithmic
Approach, Springer Science & Business
Media, 2011.
[8] Remco R. Bouckaert, Eibe Frank, Mark
Hall, Richard Kirkby, Peter Reutemann,
Alex Seewald, David Scuse, Weka Manual
for Version 3-6-11, University of Waikato,
Hamilton, New Zealand, 2014.
[9] Sumit Garg, Arvind K. Sharma,
Comparative Analysis of Data Mining
Techniques on Educational Dataset,
International Journal of Computer
Applications (0975 – 8887) Volume 74,
No.5, July 2013.
[10]Suresh Chandra Satapathy, Siba K Udgata,
Bhabendra Narayan Biswal,” Proceedings
of the International Conference on Frontiers
of Intelligent Computing: Theory and
Applications (FICTA)”, Springer, 2013.
Conference Proceedings:
[11]Radovic, M. R&D Center for Bioeng.,
BioIRC, Kragujevac, Serbia Djokovic, M.,
Peulic, A. , Filipovic, N., Application of
data mining algorithms for mammogram
classification, Bioinformatics and
Bioengineering (BIBE), IEEE 13th
International Conference, 2013.
[12]Thair Nu Phyu, Survey of Classification
Techniques in Data Mining, Proceedings
of the International Multi Conference of
Engineers and Computer Scientists , Vol I
IMECS 2009, March 18 - 20, 2009.
Web References;
[13]http://en.wikipedia.org/wiki/Naive_Bayes_
classifier.
[14]http://www.cs.waikato.ac.nz/ml/weka/.
0
20
40
60
80
100
RAE(%)
RAE(%)

More Related Content

What's hot

A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryA Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryIJERA Editor
 
Building a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy LogicBuilding a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy LogicIJDKP
 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningIOSR Journals
 
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...IRJET Journal
 
Analysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data MiningAnalysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data Miningijdmtaiir
 
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...ijscmcj
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CAREijistjournal
 
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAA BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAIJSCAI Journal
 
An Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data ClusteringAn Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data Clusteringijeei-iaes
 
Hybrid Approach for Brain Tumour Detection in Image Segmentation
Hybrid Approach for Brain Tumour Detection in Image SegmentationHybrid Approach for Brain Tumour Detection in Image Segmentation
Hybrid Approach for Brain Tumour Detection in Image Segmentationijtsrd
 
Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...IOSR Journals
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...cscpconf
 
Study of Different Multi-instance Learning kNN Algorithms
Study of Different Multi-instance Learning kNN AlgorithmsStudy of Different Multi-instance Learning kNN Algorithms
Study of Different Multi-instance Learning kNN AlgorithmsEditor IJCATR
 
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.Ehsan Lotfi
 
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMSMULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMSijcsit
 
IJCSI-10-6-1-288-292
IJCSI-10-6-1-288-292IJCSI-10-6-1-288-292
IJCSI-10-6-1-288-292HARDIK SINGH
 

What's hot (20)

A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryA Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
 
Building a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy LogicBuilding a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy Logic
 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data Mining
 
PERFORMANCE EVALUATION OF DIFFERENT CLASSIFIER ON BREAST CANCER
PERFORMANCE EVALUATION OF DIFFERENT CLASSIFIER ON BREAST CANCERPERFORMANCE EVALUATION OF DIFFERENT CLASSIFIER ON BREAST CANCER
PERFORMANCE EVALUATION OF DIFFERENT CLASSIFIER ON BREAST CANCER
 
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
 
Analysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data MiningAnalysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data Mining
 
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
 
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAA BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
 
Ijetcas14 327
Ijetcas14 327Ijetcas14 327
Ijetcas14 327
 
An Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data ClusteringAn Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data Clustering
 
Hybrid Approach for Brain Tumour Detection in Image Segmentation
Hybrid Approach for Brain Tumour Detection in Image SegmentationHybrid Approach for Brain Tumour Detection in Image Segmentation
Hybrid Approach for Brain Tumour Detection in Image Segmentation
 
Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
 
Study of Different Multi-instance Learning kNN Algorithms
Study of Different Multi-instance Learning kNN AlgorithmsStudy of Different Multi-instance Learning kNN Algorithms
Study of Different Multi-instance Learning kNN Algorithms
 
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
 
Ijetr021251
Ijetr021251Ijetr021251
Ijetr021251
 
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMSMULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
 
IJCSI-10-6-1-288-292
IJCSI-10-6-1-288-292IJCSI-10-6-1-288-292
IJCSI-10-6-1-288-292
 
Hx3115011506
Hx3115011506Hx3115011506
Hx3115011506
 

Viewers also liked

Optimization of a Shell and Tube Condenser using Numerical Method
Optimization of a Shell and Tube Condenser using Numerical MethodOptimization of a Shell and Tube Condenser using Numerical Method
Optimization of a Shell and Tube Condenser using Numerical MethodIJERA Editor
 
Importance of Power Quality
Importance of Power QualityImportance of Power Quality
Importance of Power QualityIJERA Editor
 
A Model For Integrating Environmental Considerations Into The Valuation Of In...
A Model For Integrating Environmental Considerations Into The Valuation Of In...A Model For Integrating Environmental Considerations Into The Valuation Of In...
A Model For Integrating Environmental Considerations Into The Valuation Of In...IJERA Editor
 
Determination of the most important General Failure Types based on Tripod-DELTA
Determination of the most important General Failure Types based on Tripod-DELTADetermination of the most important General Failure Types based on Tripod-DELTA
Determination of the most important General Failure Types based on Tripod-DELTAIJERA Editor
 
Simulation of FCC Riser Reactor Based on Ten Lump Model
Simulation of FCC Riser Reactor Based on Ten Lump ModelSimulation of FCC Riser Reactor Based on Ten Lump Model
Simulation of FCC Riser Reactor Based on Ten Lump ModelIJERA Editor
 
Modified design for Full Swing SERF and High Speed SERF
Modified design for Full Swing SERF and High Speed SERFModified design for Full Swing SERF and High Speed SERF
Modified design for Full Swing SERF and High Speed SERFIJERA Editor
 
0993002 0992984-0992987
0993002 0992984-09929870993002 0992984-0992987
0993002 0992984-0992987建豪 陳
 
Trabajo comun - Iphone 4
Trabajo comun - Iphone 4Trabajo comun - Iphone 4
Trabajo comun - Iphone 4valeriare27
 
Una mirada al mundo: Google earth
Una  mirada al mundo: Google earthUna  mirada al mundo: Google earth
Una mirada al mundo: Google earthsanbargra
 
Asp.net mvc 3 - eco 2010
Asp.net mvc 3 - eco 2010Asp.net mvc 3 - eco 2010
Asp.net mvc 3 - eco 2010joselinoneto
 
Normas t568 a y b
Normas t568 a y bNormas t568 a y b
Normas t568 a y bssk01
 

Viewers also liked (20)

Optimization of a Shell and Tube Condenser using Numerical Method
Optimization of a Shell and Tube Condenser using Numerical MethodOptimization of a Shell and Tube Condenser using Numerical Method
Optimization of a Shell and Tube Condenser using Numerical Method
 
Importance of Power Quality
Importance of Power QualityImportance of Power Quality
Importance of Power Quality
 
A Model For Integrating Environmental Considerations Into The Valuation Of In...
A Model For Integrating Environmental Considerations Into The Valuation Of In...A Model For Integrating Environmental Considerations Into The Valuation Of In...
A Model For Integrating Environmental Considerations Into The Valuation Of In...
 
Determination of the most important General Failure Types based on Tripod-DELTA
Determination of the most important General Failure Types based on Tripod-DELTADetermination of the most important General Failure Types based on Tripod-DELTA
Determination of the most important General Failure Types based on Tripod-DELTA
 
Simulation of FCC Riser Reactor Based on Ten Lump Model
Simulation of FCC Riser Reactor Based on Ten Lump ModelSimulation of FCC Riser Reactor Based on Ten Lump Model
Simulation of FCC Riser Reactor Based on Ten Lump Model
 
Modified design for Full Swing SERF and High Speed SERF
Modified design for Full Swing SERF and High Speed SERFModified design for Full Swing SERF and High Speed SERF
Modified design for Full Swing SERF and High Speed SERF
 
ISO 9001
ISO 9001ISO 9001
ISO 9001
 
0993002 0992984-0992987
0993002 0992984-09929870993002 0992984-0992987
0993002 0992984-0992987
 
Jklm
JklmJklm
Jklm
 
Over Analytics
Over AnalyticsOver Analytics
Over Analytics
 
Trabajo comun - Iphone 4
Trabajo comun - Iphone 4Trabajo comun - Iphone 4
Trabajo comun - Iphone 4
 
Una mirada al mundo: Google earth
Una  mirada al mundo: Google earthUna  mirada al mundo: Google earth
Una mirada al mundo: Google earth
 
Mus tau valencia
Mus tau valenciaMus tau valencia
Mus tau valencia
 
Asp.net mvc 3 - eco 2010
Asp.net mvc 3 - eco 2010Asp.net mvc 3 - eco 2010
Asp.net mvc 3 - eco 2010
 
Introducción
IntroducciónIntroducción
Introducción
 
La musica de nuesrtra vida
La musica de nuesrtra vidaLa musica de nuesrtra vida
La musica de nuesrtra vida
 
CBA Duero
CBA DueroCBA Duero
CBA Duero
 
Normas t568 a y b
Normas t568 a y bNormas t568 a y b
Normas t568 a y b
 
miileena
miileenamiileena
miileena
 
Folha 239
Folha 239Folha 239
Folha 239
 

Similar to Analysis On Classification Techniques In Mammographic Mass Data Set

IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESVikash Kumar
 
ANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINER
ANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINERANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINER
ANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINERIJCSEA Journal
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...ahmad abdelhafeez
 
The improved k means with particle swarm optimization
The improved k means with particle swarm optimizationThe improved k means with particle swarm optimization
The improved k means with particle swarm optimizationAlexander Decker
 
MACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXMACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXmlaij
 
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...ijcsa
 
Regularized Weighted Ensemble of Deep Classifiers
Regularized Weighted Ensemble of Deep Classifiers Regularized Weighted Ensemble of Deep Classifiers
Regularized Weighted Ensemble of Deep Classifiers ijcsa
 
Clustering and Classification of Cancer Data Using Soft Computing Technique
Clustering and Classification of Cancer Data Using Soft Computing Technique Clustering and Classification of Cancer Data Using Soft Computing Technique
Clustering and Classification of Cancer Data Using Soft Computing Technique IOSR Journals
 
Survey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data MiningSurvey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data Miningijsrd.com
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Alexander Decker
 
IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...
IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...
IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...cscpconf
 
Particle Swarm Optimization based K-Prototype Clustering Algorithm
Particle Swarm Optimization based K-Prototype Clustering Algorithm Particle Swarm Optimization based K-Prototype Clustering Algorithm
Particle Swarm Optimization based K-Prototype Clustering Algorithm iosrjce
 
Max stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemMax stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemnooriasukmaningtyas
 
Comprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction TechniquesComprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction Techniquesijsrd.com
 
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...Editor IJCATR
 
A Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningA Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningNatasha Grant
 
Classification Techniques: A Review
Classification Techniques: A ReviewClassification Techniques: A Review
Classification Techniques: A ReviewIOSRjournaljce
 

Similar to Analysis On Classification Techniques In Mammographic Mass Data Set (20)

Di35605610
Di35605610Di35605610
Di35605610
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
 
ANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINER
ANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINERANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINER
ANALYSIS AND COMPARISON STUDY OF DATA MINING ALGORITHMS USING RAPIDMINER
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
 
The improved k means with particle swarm optimization
The improved k means with particle swarm optimizationThe improved k means with particle swarm optimization
The improved k means with particle swarm optimization
 
MACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXMACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOX
 
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
 
Regularized Weighted Ensemble of Deep Classifiers
Regularized Weighted Ensemble of Deep Classifiers Regularized Weighted Ensemble of Deep Classifiers
Regularized Weighted Ensemble of Deep Classifiers
 
Clustering and Classification of Cancer Data Using Soft Computing Technique
Clustering and Classification of Cancer Data Using Soft Computing Technique Clustering and Classification of Cancer Data Using Soft Computing Technique
Clustering and Classification of Cancer Data Using Soft Computing Technique
 
Survey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data MiningSurvey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data Mining
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)
 
IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...
IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...
IMPROVING SUPERVISED CLASSIFICATION OF DAILY ACTIVITIES LIVING USING NEW COST...
 
Particle Swarm Optimization based K-Prototype Clustering Algorithm
Particle Swarm Optimization based K-Prototype Clustering Algorithm Particle Swarm Optimization based K-Prototype Clustering Algorithm
Particle Swarm Optimization based K-Prototype Clustering Algorithm
 
I017235662
I017235662I017235662
I017235662
 
Max stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemMax stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problem
 
Comprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction TechniquesComprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction Techniques
 
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...A Formal Machine Learning or Multi Objective Decision Making System for Deter...
A Formal Machine Learning or Multi Objective Decision Making System for Deter...
 
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
 
A Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningA Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data Mining
 
Classification Techniques: A Review
Classification Techniques: A ReviewClassification Techniques: A Review
Classification Techniques: A Review
 

Recently uploaded

Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 

Recently uploaded (20)

Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 

Analysis On Classification Techniques In Mammographic Mass Data Set

  • 1. Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35 www.ijera.com 32 | P a g e Analysis On Classification Techniques In Mammographic Mass Data Set Mrs. K. K. Kavitha *, Dr. A Kangaiammal**, Mr. K. Satheesh*** *(Department of Computer Science, Selvamm Arts & Science College, Namakkal-03) ** (Department of Computer Science, Government Arts College, Salem-07) ***(Department of Computer Science, Selvamm Arts & Science College, Namakkal-03) ABSTRACT Data mining, the extraction of hidden information from large databases, is to predict future trends and behaviors, allowing businesses to make proactive, knowledge-driven decisions. Data-Mining classification techniques deals with determining to which group each data instances are associated with. It can deal with a wide variety of data so that large amount of data can be involved in processing. This paper deals with analysis on various data mining classification techniques such as Decision Tree Induction, Naïve Bayes , k-Nearest Neighbour (KNN) classifiers in mammographic mass dataset. Keywords - Classification , Data Mining, Decision Tree , Mammographic Dataset, Naïve Bayes Classifier. I. INTRODUCTION The field of data mining is an emerging research area with important applications in Engineering, Science, Medicine, Business and Education[9]. The size of data base in educational application is large where the number of records in a data set can vary from some thousand to thousand of millions. The size of data is accumulated from different fields exponentially increasing. Data mining has been used in different methods at the intersection of Machine Learning, Artificial Intelligence, Statistics and Database Systems. Cancer begins in cells, the building blocks that make up tissues. Tissues make up the breasts and other parts of the body. Normal cells grow and divide to form new cells as the body needs them. When normal cells grow old or get damaged, they die, and new cells take their place. Sometimes, this process leads to wrong results. New cells form when the body does not need them, and old or damaged cells do not die as they should. The built-up of extra cells often forms a mass of tissue called a lump, growth, or tumor. Cancer that forms in the tissues of breast, usually in the ducts (tubes that carry milk to the nipple) and in the lobules (glands that make milk) is the breast cancer [1]. It occurs in both men and women; although in male, breast cancer is very rare. Breast cancer is one of the leading cancers in the female population. About 25% of all cancers diagnosed in women are breast cancers and about 20% of all lethal cancers are breast cancers. Mammography has been proven to be the most reliable method and it is the key screening tool for the early detection of breast cancer. Mammography is highly accurate. On average, mammography will detect about 80–90% of the breast cancers in women without any recognizable symptoms. Classification technique is capable of processing a wider variety of data and is growing in popularity. This paper deals with analysis on various data mining classification techniques in mammographic mass datasets. The next section presented is Decision Tree Induction. Section II describes Naïve Bayesian Classification whereas k-nearest neighbor (KNN) classifier is described in section III. Finally, the last section concludes this work. II. Decision tree induction Decision Tree Induction is the learning of decision trees from class-labeled training tuples. A decision tree is a flowchart-like tree structure, where each internal node denotes a test on an attribute, each branch represents an outcome of the test, and each leaf node holds a class label[9],[4]. The feature that best divides the training data would be the root node of the tree. There are numerous methods for finding the feature that best divides the training data such as information gain (Hunt et al., 1966) and gini index (Breiman et al., 1984). While myopic measures estimate each attribute independently, Relief algorithm (Kononenko, 1994) estimates them in the context of other attributes. However, a majority of studies have concluded that there is no single best method (Murthy,1998). RESEARCH ARTICLE OPEN ACCESS
  • 2. Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35 www.ijera.com 33 | P a g e Figure 1: Decision Tree for Breast Cancer The basic algorithm for decision tree induction is a greedy algorithm that constructs decision trees in a top-down recursive divide-and-conquer manner [9]. Algorithm for decision tree is described in Fig. 2. Figure 2: Decision Tree Algorithm III. NAÏVE BAYESIAN CLASSIFICATION Naïve Bayes is a simple technique for constructing classifier models that assign class labels to problem instances, represented as vectors of feature values, where the class labels are drawn from some finite set. It is not a single algorithm for training such classifiers, but a family of algorithms based on a common principle: all naive Bayes classifiers assume that the value of a particular feature is independent of the value of any other feature, given the class variable[13]. Abstractly, naive Bayes is a conditional probability model: given a problem instance to be classified, represented by a vector x= (x ,….,x ) representing some n features (dependent variables), it assigns to this instance probabilities for each of K possible outcomes or classes. Using Bayes theorem, the conditional probability can be decomposed as: (1) In general, using Bayesian probability terminology, the above equation can be written as: (2) Using the chain rule for repeated applications of the definition of conditional probability[11]: (3) One can easily estimate the probabilities from the training tuples. IV. KNN CLASSIFICATION K Nearest Neighbors (KNN) is a simple algorithm that stores all available cases and classifies new cases based on a specific similarity measure (e.g., distance functions). KNN has been used in statistical estimation and pattern recognition[10]. Nearest neighbor classifiers are based on learning by analogy. The training samples are described by n dimensional numeric attributes. Each sample represents a point in an n-dimensional space. Similarly, all of the training samples are stored in an n-dimensional pattern space. When given an unknown sample, a k-nearest neighbor classifier searches the pattern space for the k- training samples that are closest to the unknown sample. A case is classified by a majority vote of its neighbors, with the case being assigned to the class most common amongst its K-nearest neighbors measured by a distance function. If K = 1, then the case is simply assigned to the class of its nearest neighbor[6]: (4) It should also be noted that all three distance measures are only valid for continuous variables. In the instance of categorical variables the Hamming distance must be used. It also brings up the issue of
  • 3. Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35 www.ijera.com 34 | P a g e standardization of the numerical variables between 0 and 1 when there is a mixture of numerical and categorical variables in the dataset. (5) The k-nearest neighbors algorithm is amongst the simplest of all machine learning algorithms. An object is classified by a majority vote of its neighbors, with the object being assigned to the class most common amongst its k-nearest neighbors. k is a positive integer, typically small. If k= 1, then the object is simply assigned to the class of its nearest neighbor. In binary classification problems (two class), it is helpful to choose k to be an odd number as this avoids tie in votes. The same method can be used for regression, by simply assigning the property value for the object to be the average of the values of its k-nearest neighbors. It can be useful to weigh the contributions of the neighbors, so that the nearer neighbors contribute more to the average than the more distant ones. The neighbors are taken from a set of objects for which the correct classification is known. This can be thought of as the training set for the algorithm, though no explicit training step is required. In order to identify neighbors, the objects are represented by position vectors in a multidimensional feature space. It is usual to use the Euclidian distance, though other distance measures, such as the Manhanttan distance could be used in principle instead. The k-nearest neighbor algorithm is sensitive to the local structure of the data. V. EXPERIMENTAL RESULTS 1. DATASET ACQUISITION This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, and Ljubljana, Yugoslavia. This data set includes 201 instances of one class and 85 instances of another class. The instances are described by 9 attributes, some of which are linear and some are nominal. 2. TOOL In this paper, we applied weka tool for finding classification accuracy. Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization, etc.. It is also well-suited for developing new machine learning schemes[14]. VI. PERFORMANCE EVALUATION In this paper, we discussed about three major data mining classification algorithms. Now, we had given some results by applying these algorithms in the following table with some classification errors such as MAE, RMSE, RAE and so on. Table 1 Comparison of Classification Algorithms Now, we represent the comparison results in the following fig. 3 through 5. i) Accuracy: Figure 3: Accuracy ii) Mean Absolute Error (MAE) and Root Mean Square Error (RMSE): Figure 4: MAE & RMSE 72 73 74 75 76 77 78 79 80 Accuracy(%) Decision Tree Naïve Bayes KNN 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 MAE RMSE Accuracy(%) MAE RMSE RAE(%) Decision Tree 75.8 0.3658 0.4269 87.44 Naïve Bayes 75.1 0.3012 0.4278 72.00 KNN 80 0.2679 0.3671 64.04
  • 4. Vidya Sagar Ponnam Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 5, Issue 7, (Part - 3) July 2015, pp.32-35 www.ijera.com 35 | P a g e iii) Relative Absolute Error: Figure 5: RAE VII. CONCLUSION From the above analysis, it is concluded that KNN Classifier provides high accuracy than Decision Tree approach and the Naïve Bayes classifier. In future, this work can extended for implementing these algorithms for uncertain datasets. REFERENCES Books: [1] Aley Abraham,Blessings of My Breast Cancer, Innova Publishing, 2012. [2] Arun K Pujari, Data mining Technique, Published by Universities Press (I) Pvt. Ltd, Hyderabad, India, 2001. [3] Jiawei Han and Micheline Kamber,Data Mining Concepts and Techniques, Second Edition, Elsevier Inc., 2006. Journal Papers: [4] Bharati M. Ramageri, Data Mining Techniques and Application, Indian Journal of Computer Science and Engineering, Vol. 1 No. 4, pp. 301-305. [5] Jiazhen He, Yang Zhang, Xue Li & Peng Shi, Learning naive Bayes classifiers from positive and unlabelled examples with uncertainty, International Journal of Systems Science, Volume 43, Issue 10, 2012. [6] Kotsiantis. S. B.,Supervised Machine Learning: A Review of Classification Techniques, Informatica, pp.249-268, 2007. [7] Narasimha Murty, M., Susheela Devi, V. Pattern Recognition: An Algorithmic Approach, Springer Science & Business Media, 2011. [8] Remco R. Bouckaert, Eibe Frank, Mark Hall, Richard Kirkby, Peter Reutemann, Alex Seewald, David Scuse, Weka Manual for Version 3-6-11, University of Waikato, Hamilton, New Zealand, 2014. [9] Sumit Garg, Arvind K. Sharma, Comparative Analysis of Data Mining Techniques on Educational Dataset, International Journal of Computer Applications (0975 – 8887) Volume 74, No.5, July 2013. [10]Suresh Chandra Satapathy, Siba K Udgata, Bhabendra Narayan Biswal,” Proceedings of the International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA)”, Springer, 2013. Conference Proceedings: [11]Radovic, M. R&D Center for Bioeng., BioIRC, Kragujevac, Serbia Djokovic, M., Peulic, A. , Filipovic, N., Application of data mining algorithms for mammogram classification, Bioinformatics and Bioengineering (BIBE), IEEE 13th International Conference, 2013. [12]Thair Nu Phyu, Survey of Classification Techniques in Data Mining, Proceedings of the International Multi Conference of Engineers and Computer Scientists , Vol I IMECS 2009, March 18 - 20, 2009. Web References; [13]http://en.wikipedia.org/wiki/Naive_Bayes_ classifier. [14]http://www.cs.waikato.ac.nz/ml/weka/. 0 20 40 60 80 100 RAE(%) RAE(%)