SlideShare a Scribd company logo
1 of 5
Download to read offline
© 2013, IJARCSSE All Rights Reserved Page | 972
Volume 3, Issue 11, November 2013 ISSN: 2277 128X
International Journal of Advanced Research in
Computer Science and Software Engineering
Research Paper
Available online at: www.ijarcsse.com
Performance Evaluation: A Comparative Study of Various Classifiers
Amresh Kumar, Yashwant S. Ingle
Department of Computer Science & Engineering
G.H. Raisoni College of Engineering, Nagpur, INDIA
Abstract— Real-world knowledge discovery processes typically consist of complex data pre-processing, machine
learning, evaluation, and visualization steps. Hence a data mining platform should allow complex nested operator
chains or trees, provide the transparent data managing, comfortable constraint handling and optimization, be flexible,
extendable and easy-to-use. Modern machine learning techniques have encouraged interest in the development of
various systems that ensure protected, trustworthy and many more operations in the different fields and applications.
In an earlier study, many other approaches/methods were investigated to develop various applications using modern
machine learning techniques and more specific classification algorithms. The Weka machine learning workbench
provides a general-purpose environment for automatic classification, clustering and feature selection, and common
data mining problems in bioinformatics research. Here in this Project Report paper we have used various classifiers
with filters to perform classification and we have done analysis of data with different classifiers and then we have
done feature selection process and during all these activities we have observed and record the various performance
change and different graphs which are briefed inside this paper.
Keywords— Machine Learning, WEKA, Data mining, KDD, Classification, Filters, Feature Selection
I. INTRODUCTION
In an earlier study, many other approaches/methods were investigated to develop various applications using modern
machine learning techniques and more specific classification algorithms. Modern machine learning techniques have
encouraged interest in the development of various systems that ensure secure, reliable and many more operations in the
different fields and applications. The Weka machine learning workbench provides a general-purpose environment for
automatic classification, clustering and feature selection. Therefore Weka also contains an extensive data pre-processing
methods and the experimental comparison of different machine learning techniques on the same problem. Data
mining (the analysis step of the "Knowledge Discovery in Databases”) is a field at the intersection of computer
science and statistics is the process that attempts to discover patterns in huge data sets. It utilizes methods at the junction
of artificial intelligence, machine learning, statistics, and database systems. The term Knowledge Discovery in Databases
or KDD for short, refers to the broad process of finding knowledge in data, and emphasizes the "high-level" application
of particular data mining methods. The unifying goal of the KDD process is to extract knowledge from data in the
context of large databases. In feature selection operation we are going to find out which are the most important instances
to carry out the classification to get accurate result by improving their performance. Here, in this Project Report paper we
have used various classifiers with filters to perform classification and we have done analysis of data with different
classifiers and then we have done feature selection process and during all these activities we have observed and record
the various performance change and different graphs which are briefed inside this paper. To improve the performance we
have experimented with Artificial Intelligence based (AI Classifier), a rule-based learning method using statistical
analysis and also Decision tree and Support Vector Machine (SVM) based classification schemes were used to analysis
and inspection of data. This selected algorithm efficiency and overall performance for the given data set (Temp.csv) are
observed and calculated. This experiment/study has been conducted using six classifiers, namely SMO, REPTree, IBK,
Logistic and Multilayer perceptron, with Temp.csv datasets having 41 instances. The Waikato Environment for
Knowledge Analysis (WEKA) learning tool has been used in this Experiment/study.
II. METHODOLOGY
Fig 1 Flow of Methodology involved.
Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11),
November - 2013, pp. 972-976
© 2013, IJARCSSE All Rights Reserved Page | 973
Using Weka tool we have executed six various classifiers algorithm on our dataset and compared the various classifiers
based on the ROC Area (Weighted Average) value.
Fig 2 ROC (Weighted Average)
And also we found that out 6 classifiers, 4 Classifiers are showing 100% correctly classified instances and 2 Classifiers
REPTree and DMNBText are showing incorrectly classified instances, therefore next step we proceed with finding out
which instances was not correctly classified. For this we have to do a tuple wise analysis, so for the same I am taking
only one classifier now i.e. DMNBtext Classifier.
III. RESULTS
TABLE 1: CLASSIFIERS COMPARISON CHART
Sl. No. Classifiers ROC (Weighted Average)
1 SMO 1
2 REPTree 0.746
3 Ibk 1
4 Logistic 1
5 MultilayePerceptron 1
6 DMNBText 0.997
TABLE 2: CONSOLIDATED CLASSIFIERS SHEET USING TRAINING SET AS TEST OPTIONS
Relation=temp
Summary
Number ofCorrectly Classified Instances 41 28 41 41 41 35
Correctly Classified Instances % 100 68.2927 100 100 100 85.3659
Number ofIncorrectly Classified Instances 0 13 0 0 0 6
Incorrectly Classified Instances % 0 31.7073 0 0 0 14.6341
Kappa statistic 1 0.4336 1 1 1 0.7681
Mean absolute error 0.2222 0.2969 0.0303 0 0.0059 0.2135
Root mean squared error 0.2722 0.3853 0.0321 0 0.0091 0.2783
Relative absolute error 55.9901 74.8118 7.635 0.0001 1.4937 53.7995
Root relative squared error 61.3467 86.8492 7.2447 0.0001 2.0548 62.7209
TotalNumber ofInstances 41 41 41 41 41 41
Detailed Accuracy By Class
TP Rate (weighted avg) 1 0.683 1 1 1 0.854
FP Rate (weighted avg) 0 0.266 0 0 0 0.025
Precision (weighted avg) 1 0.585 1 1 1 0.927
Recall(weighted avg) 1 0.683 1 1 1 0.854
F-measure (weighted avg) 1 0.627 1 1 1 0.871
ROC Area (weighted avg) 1 0.746 1 1 1 0.997
Confusion Matrix
a=O,b=L,c=N
a-a 22 16 22 22 22 19
a-b 0 6 0 0 0 0
a-c 0 0 0 0 0 3
b-a 0 1 0 0 0 0
b-b 13 12 13 13 13 10
b-c 0 0 0 0 0 3
c-a 6 6 0 0 0 0
c-b 0 0 0 0 0 0
c-c 6 0 6 6 6 6
CLASSIFIER 6:
DMNBText
CLASSIFIER 3:
Ibk
CLASSIFIER 4:
Logistic
CLASSIFIER 5:
MultiLayer
Perceptron
DATA CLASSIFICATION STATISTICS
Data File Description
Attributes=62 Instances=41
CLASSIFIER 1:
AI Classifier
SMO
CLASSIFIER 2:
Tree Classifier
REPTree
Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11),
November - 2013, pp. 972-976
© 2013, IJARCSSE All Rights Reserved Page | 974
TABLE 3
ANALYSIS AND PERFORMANCE CHANGE FOR DMNBTEXT CLASSIFIERS USING CROSS-VALIDATION AS TEST OPTIONS
From the Figure given below we can see the ROC value for DMNBText Classifier without and With the Attributor
Evaluator. Thus by Calculating ROC change we can Find out the performance Change as given below.
Performance Change= (ROCWith Evaluator - ROCWithout Evaluator) / ROCWithout Evaluator
Without
Attribute
Selection
With Attribute
Selection:
Attribute
Evaluator: na
me
SearchMethod:
Ranker
List of Se le cte d Attribute s---------->> All
RL-54, L-18, L-14, L-
20, L-24,, L-16, L-
15, H-35, RL-60, RL-
62, P-49, H-32, L-17
Summary
Number of Correctly Classified Instances 33 31
Correctly Classified Instances % 80.487 75.6098
Number of Incorrectly Classified Instances 8 10
Incorrectly Classified Instances % 19.512 24.3902
Kappa statistic 0.6439 0.5514
Mean absolute error 0.2455 0.2827
Root mean squared error 0.3339 0.3463
Relative absolute error 61.4542 70.7685
Root relative squared error 74.8162 77.6059
Total Number of Instances 41 41
Detailed Accuracy By Class
TP Rate (weighted avg) 0.805 0.756
FP Rate (weighted avg) 0.175 0.215
Precision (weighted avg) 0.829 0.645
Recall (weighted avg) 0.805 0.756
F-measure (weighted avg) 0.77 0.696
ROC Area (weighted avg) 0.897 0.909
Confusion Matrix
a=O,b=L,c=N
a-a 21 20
a-b 1 2
a-c 0 0
b-a 2 2
b-b 11 11
b-c 0 0
c-a 3 4
c-b 2 2
c-c 1 0
Performance change(%) 1.34
CLASSIFIER NAME: DMNBText
Fig 3 Frequency
Fig 4 Attribute Frequency Pattern
Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11),
November - 2013, pp. 972-976
© 2013, IJARCSSE All Rights Reserved Page | 975
TABLE 4: MULTILLAYER PERCEPTRON (USING TRAINING SET)
Summary
With all
attributes
With only
selected 35
attributes
Number of Correctly Classified Instances 41 41
Correctly Classified Instances % 100 100
Number of Incorrectly Classified Instances 0 0
Incorrectly Classified Instances % 0 0
Kappa statistic 1 1
Mean absolute error 0.0059 0.0075
Root mean squared error 0.0091 0.0119
Relative absolute error 1.4937 1.8879
Root relative squared error 2.0548 2.6893
Total Number of Instances 41 41
Detailed Accuracy By Class
TP Rate (weighted avg) 1 1
FP Rate (weighted avg) 0 0
Precision (weighted avg) 1 1
Recall (weighted avg) 1 1
F-measure (weighted avg) 1 1
ROC Area (weighted avg) 1 1
Confusion Matrix
a=O,b=L,c=N
a-a 22 22
a-b 0 0
a-c 0 0
b-a 0 0
b-b 13 13
b-c 0 0
c-a 0 0
c-b 0 0
c-c 6 6
Performance change 0 0
Classifier Name: MultiLayerPerceptron (Using Traing Set)
TABLE 5: MULTILLAYER PERCEPTRON (USING CROSS VALIDATION)
Classifier Name: MultiLayerPerceptron (Cross-Validation)
Summary
With all
attributes
With only selected
35 attributes
Number of Correctly Classified Instances 30 32
Correctly Classified Instances % 73.1701 78.0488
Number of Incorrectly Classified Instances 11 9
Incorrectly Classified Instances % 26.8293 21.9512
Kappa statistic 0.5847 0.6506
Mean absolute error 0.1866 0.1636
Root mean squared error 0.3666 0.3474
Relative absolute error 46.7145 40.9598
Root relative squared error 82.148 77.8538
Total Number of Instances 41 41
Detailed Accuracy By Class
TP Rate (weighted avg) 0.732 0.78
FP Rate (weighted avg) 0.101 0.093
Precision (weighted avg) 0.818 0.828
Recall (weighted avg) 0.732 0.78
F-measure (weighted avg) 0.754 0.795
ROC Area (weighted avg) 0.908 0.911
Confusion Matrix
a=O,b=L,c=N
a-a 15 16
a-b 1 1
a-c 6 5
b-a 1 0
b-b 10 12
b-c 2 1
c-a 1 2
c-b 0 0
c-c 5 4
Performance change 0 0.330396476
Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11),
November - 2013, pp. 972-976
© 2013, IJARCSSE All Rights Reserved Page | 976
IV. DISCUSSION
For the experimental analysis SMO, REPTree, IBK, Logistic and Multilayer perceptron classifiers are considered in this
experiment/study. 41 instances in data sets were selected from the collected data. To cover this experimental I have taken
Temp.csv datasets, and the observation and Calculation is done by considering following:
 Attribute selection
 Frequency
 ROC
 Confusion metrics
TABLE 5
COMPARISONS WITH GRAPH WITHOUT FEATURE EXTRACTION AND WITH FEATURE EXTRACTION USING DMNBTEXT
CLASSIFIER
Classifier:
DMNBText Type ofSearchMethod ROCArea(Wt. Avg.)
AttributeEvaluator:
ChiSquaredAttributeEval WithoutFeature Extraction 0.897
Ranker 0.909
Fig 5 Performance Index for DBMNText
V. CONCLUSION
As a conclusion we can tell that classification algorithms play a key role to solve real world problems. Selection of an
application specific classifier is an emerging research area. In this paper, performance change is being evaluated and
calculated using various popular classifiers. Initially, the percentage of correct classifications has been measured with the
highest accuracy. Later, ranking performance has been estimated to select a suitable algorithm for this application. The
ranking performance has shown that DMNBText performs the best for the given datasets. This also reduces
computational complexity, and development and maintenance costs both in terms of hardware and human inspection.
Based on the results obtained in the various algorithms, we can conclude that the feature selection concept played an
important role and can be useful component for many classifications. This is possible due to the low computational cost
of this method, which is more efficient compared to the other ones. The main advantage of this method is that it makes
no assumptions and these methods, not only improved the classification speed significantly, but they also improved the
accuracy rate and the reliability in most of the cases. Thus using the concept of Data Mining techniques we examine and
calculate the performance using ROC Values.
REFERENCES
[1] Ioannis Charalampopoulos, Ioannis Anagnostopoulos, A Comparable Study employing WEKA
Clustering/Classification Algorithms for Web Page Classification, IEEE 2011, Page(s): 235 – 239.
[2] G M Shafiullah, A B M Shawkat Ali, Adam Thompson, Peter J Wolfs, Rule-Based Classification Approach for
Railway Wagon HealthMonitoring, IEEE 2010, Page(s): 1 – 7.
[3] Weka, Available: http://www.cs.waikato.ac.nz
[4] Data_mining, Available: http://en.wikipedia.org
[5] aimag-kdd-overview-1996-Fayyad.pdf , Available: http://www.kdnuggets.com
[6] IJCSE10-01-04-51.pdf, Available: http://www.ijcse.com

More Related Content

What's hot

Mine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means ClusteringMine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means Clusteringijcsity
 
Analysis and implementation of modified k medoids
Analysis and implementation of modified k medoidsAnalysis and implementation of modified k medoids
Analysis and implementation of modified k medoidseSAT Publishing House
 
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace DataMPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace DataIRJET Journal
 
Column store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute setColumn store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute setijma
 
Predicting performance of classification algorithms
Predicting performance of classification algorithmsPredicting performance of classification algorithms
Predicting performance of classification algorithmsIAEME Publication
 
PREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMS
PREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMSPREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMS
PREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMSSamsung Electronics
 
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...IRJET Journal
 
IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...
IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...
IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...IRJET Journal
 
Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...
Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...
Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...IJECEIAES
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsIJDKP
 
Top-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with PrioritiesTop-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with Prioritiesijtsrd
 
Data Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological DataData Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological DataIRJET Journal
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Alexander Decker
 
Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...
Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...
Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...CSCJournals
 
On comprehensive analysis of learning algorithms on pedestrian detection usin...
On comprehensive analysis of learning algorithms on pedestrian detection usin...On comprehensive analysis of learning algorithms on pedestrian detection usin...
On comprehensive analysis of learning algorithms on pedestrian detection usin...UniversitasGadjahMada
 
VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...
VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...
VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...ijcsit
 
Review of Existing Methods in K-means Clustering Algorithm
Review of Existing Methods in K-means Clustering AlgorithmReview of Existing Methods in K-means Clustering Algorithm
Review of Existing Methods in K-means Clustering AlgorithmIRJET Journal
 
Parametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithmParametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithmIAEME Publication
 
A HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETS
A HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETSA HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETS
A HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETSEditor IJCATR
 

What's hot (20)

Lx3520322036
Lx3520322036Lx3520322036
Lx3520322036
 
Mine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means ClusteringMine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means Clustering
 
Analysis and implementation of modified k medoids
Analysis and implementation of modified k medoidsAnalysis and implementation of modified k medoids
Analysis and implementation of modified k medoids
 
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace DataMPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
 
Column store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute setColumn store decision tree classification of unseen attribute set
Column store decision tree classification of unseen attribute set
 
Predicting performance of classification algorithms
Predicting performance of classification algorithmsPredicting performance of classification algorithms
Predicting performance of classification algorithms
 
PREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMS
PREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMSPREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMS
PREDICTING PERFORMANCE OF CLASSIFICATION ALGORITHMS
 
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...
 
IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...
IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...
IRJET - Hazardous Asteroid Classification through Various Machine Learning Te...
 
Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...
Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...
Fuzzy clustering and fuzzy c-means partition cluster analysis and validation ...
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
 
Top-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with PrioritiesTop-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with Priorities
 
Data Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological DataData Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological Data
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)
 
Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...
Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...
Multi-Dimensional Features Reduction of Consistency Subset Evaluator on Unsup...
 
On comprehensive analysis of learning algorithms on pedestrian detection usin...
On comprehensive analysis of learning algorithms on pedestrian detection usin...On comprehensive analysis of learning algorithms on pedestrian detection usin...
On comprehensive analysis of learning algorithms on pedestrian detection usin...
 
VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...
VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...
VARIATIONS IN OUTCOME FOR THE SAME MAP REDUCE TRANSITIVE CLOSURE ALGORITHM IM...
 
Review of Existing Methods in K-means Clustering Algorithm
Review of Existing Methods in K-means Clustering AlgorithmReview of Existing Methods in K-means Clustering Algorithm
Review of Existing Methods in K-means Clustering Algorithm
 
Parametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithmParametric comparison based on split criterion on classification algorithm
Parametric comparison based on split criterion on classification algorithm
 
A HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETS
A HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETSA HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETS
A HYBRID MODEL FOR MINING MULTI DIMENSIONAL DATA SETS
 

Viewers also liked

Case Study - Auditing remote access of employees at a leading financial insti...
Case Study - Auditing remote access of employees at a leading financial insti...Case Study - Auditing remote access of employees at a leading financial insti...
Case Study - Auditing remote access of employees at a leading financial insti...ObserveIT
 
ObserveIT Brochure - Like a Security Camera on your Servers
ObserveIT Brochure - Like a Security Camera on your ServersObserveIT Brochure - Like a Security Camera on your Servers
ObserveIT Brochure - Like a Security Camera on your ServersObserveIT
 
Data in the service of the Patient is Imporving Patient Outcomes and Patient ...
Data in the service of the Patient is Imporving Patient Outcomes and Patient ...Data in the service of the Patient is Imporving Patient Outcomes and Patient ...
Data in the service of the Patient is Imporving Patient Outcomes and Patient ...Apollo Hospitals Group and ATNF
 
Whitepaper: User Audit Options for Linux and Solaris
Whitepaper: User Audit Options for Linux and SolarisWhitepaper: User Audit Options for Linux and Solaris
Whitepaper: User Audit Options for Linux and SolarisObserveIT
 
Case Study - System Access Audit Compliance at The Center to Promote HealthCa...
Case Study - System Access Audit Compliance at The Center to Promote HealthCa...Case Study - System Access Audit Compliance at The Center to Promote HealthCa...
Case Study - System Access Audit Compliance at The Center to Promote HealthCa...ObserveIT
 
Type of auditing
Type of auditingType of auditing
Type of auditingDharmik
 

Viewers also liked (7)

Case Study - Auditing remote access of employees at a leading financial insti...
Case Study - Auditing remote access of employees at a leading financial insti...Case Study - Auditing remote access of employees at a leading financial insti...
Case Study - Auditing remote access of employees at a leading financial insti...
 
ObserveIT Brochure - Like a Security Camera on your Servers
ObserveIT Brochure - Like a Security Camera on your ServersObserveIT Brochure - Like a Security Camera on your Servers
ObserveIT Brochure - Like a Security Camera on your Servers
 
Data in the service of the Patient is Imporving Patient Outcomes and Patient ...
Data in the service of the Patient is Imporving Patient Outcomes and Patient ...Data in the service of the Patient is Imporving Patient Outcomes and Patient ...
Data in the service of the Patient is Imporving Patient Outcomes and Patient ...
 
Whitepaper: User Audit Options for Linux and Solaris
Whitepaper: User Audit Options for Linux and SolarisWhitepaper: User Audit Options for Linux and Solaris
Whitepaper: User Audit Options for Linux and Solaris
 
Case Study - System Access Audit Compliance at The Center to Promote HealthCa...
Case Study - System Access Audit Compliance at The Center to Promote HealthCa...Case Study - System Access Audit Compliance at The Center to Promote HealthCa...
Case Study - System Access Audit Compliance at The Center to Promote HealthCa...
 
Case study on forensic audit
Case study on forensic auditCase study on forensic audit
Case study on forensic audit
 
Type of auditing
Type of auditingType of auditing
Type of auditing
 

Similar to Performance Evaluation: A Comparative Study of Various Classifiers

Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionDr. Amarjeet Singh
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...IRJET Journal
 
Classification Techniques: A Review
Classification Techniques: A ReviewClassification Techniques: A Review
Classification Techniques: A ReviewIOSRjournaljce
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET Journal
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsDinusha Dilanka
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET Journal
 
Performance analysis of Data Mining algorithms in Weka
Performance analysis of Data Mining algorithms in WekaPerformance analysis of Data Mining algorithms in Weka
Performance analysis of Data Mining algorithms in WekaIOSR Journals
 
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...IOSR Journals
 
Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...IOSRjournaljce
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...theijes
 
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...csandit
 
N ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERS
N ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERSN ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERS
N ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERScsandit
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsIJERA Editor
 
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...ijistjournal
 
The pertinent single-attribute-based classifier for small datasets classific...
The pertinent single-attribute-based classifier  for small datasets classific...The pertinent single-attribute-based classifier  for small datasets classific...
The pertinent single-attribute-based classifier for small datasets classific...IJECEIAES
 
Performance Comparison of Dimensionality Reduction Methods using MCDR
Performance Comparison of Dimensionality Reduction Methods using MCDRPerformance Comparison of Dimensionality Reduction Methods using MCDR
Performance Comparison of Dimensionality Reduction Methods using MCDRAM Publications
 
Ijariie1117 volume 1-issue 1-page-25-27
Ijariie1117 volume 1-issue 1-page-25-27Ijariie1117 volume 1-issue 1-page-25-27
Ijariie1117 volume 1-issue 1-page-25-27IJARIIE JOURNAL
 

Similar to Performance Evaluation: A Comparative Study of Various Classifiers (20)

E05312426
E05312426E05312426
E05312426
 
Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern Recognition
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
 
Classification Techniques: A Review
Classification Techniques: A ReviewClassification Techniques: A Review
Classification Techniques: A Review
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
 
D-5436
D-5436D-5436
D-5436
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning Algorithms
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data Mining
 
T0 numtq0n tk=
T0 numtq0n tk=T0 numtq0n tk=
T0 numtq0n tk=
 
Performance analysis of Data Mining algorithms in Weka
Performance analysis of Data Mining algorithms in WekaPerformance analysis of Data Mining algorithms in Weka
Performance analysis of Data Mining algorithms in Weka
 
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
 
Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
 
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
 
N ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERS
N ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERSN ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERS
N ETWORK F AULT D IAGNOSIS U SING D ATA M INING C LASSIFIERS
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining Algorithms
 
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
 
The pertinent single-attribute-based classifier for small datasets classific...
The pertinent single-attribute-based classifier  for small datasets classific...The pertinent single-attribute-based classifier  for small datasets classific...
The pertinent single-attribute-based classifier for small datasets classific...
 
Performance Comparison of Dimensionality Reduction Methods using MCDR
Performance Comparison of Dimensionality Reduction Methods using MCDRPerformance Comparison of Dimensionality Reduction Methods using MCDR
Performance Comparison of Dimensionality Reduction Methods using MCDR
 
Ijariie1117 volume 1-issue 1-page-25-27
Ijariie1117 volume 1-issue 1-page-25-27Ijariie1117 volume 1-issue 1-page-25-27
Ijariie1117 volume 1-issue 1-page-25-27
 

Recently uploaded

chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacingjaychoudhary37
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 

Recently uploaded (20)

chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacing
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 

Performance Evaluation: A Comparative Study of Various Classifiers

  • 1. © 2013, IJARCSSE All Rights Reserved Page | 972 Volume 3, Issue 11, November 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Performance Evaluation: A Comparative Study of Various Classifiers Amresh Kumar, Yashwant S. Ingle Department of Computer Science & Engineering G.H. Raisoni College of Engineering, Nagpur, INDIA Abstract— Real-world knowledge discovery processes typically consist of complex data pre-processing, machine learning, evaluation, and visualization steps. Hence a data mining platform should allow complex nested operator chains or trees, provide the transparent data managing, comfortable constraint handling and optimization, be flexible, extendable and easy-to-use. Modern machine learning techniques have encouraged interest in the development of various systems that ensure protected, trustworthy and many more operations in the different fields and applications. In an earlier study, many other approaches/methods were investigated to develop various applications using modern machine learning techniques and more specific classification algorithms. The Weka machine learning workbench provides a general-purpose environment for automatic classification, clustering and feature selection, and common data mining problems in bioinformatics research. Here in this Project Report paper we have used various classifiers with filters to perform classification and we have done analysis of data with different classifiers and then we have done feature selection process and during all these activities we have observed and record the various performance change and different graphs which are briefed inside this paper. Keywords— Machine Learning, WEKA, Data mining, KDD, Classification, Filters, Feature Selection I. INTRODUCTION In an earlier study, many other approaches/methods were investigated to develop various applications using modern machine learning techniques and more specific classification algorithms. Modern machine learning techniques have encouraged interest in the development of various systems that ensure secure, reliable and many more operations in the different fields and applications. The Weka machine learning workbench provides a general-purpose environment for automatic classification, clustering and feature selection. Therefore Weka also contains an extensive data pre-processing methods and the experimental comparison of different machine learning techniques on the same problem. Data mining (the analysis step of the "Knowledge Discovery in Databases”) is a field at the intersection of computer science and statistics is the process that attempts to discover patterns in huge data sets. It utilizes methods at the junction of artificial intelligence, machine learning, statistics, and database systems. The term Knowledge Discovery in Databases or KDD for short, refers to the broad process of finding knowledge in data, and emphasizes the "high-level" application of particular data mining methods. The unifying goal of the KDD process is to extract knowledge from data in the context of large databases. In feature selection operation we are going to find out which are the most important instances to carry out the classification to get accurate result by improving their performance. Here, in this Project Report paper we have used various classifiers with filters to perform classification and we have done analysis of data with different classifiers and then we have done feature selection process and during all these activities we have observed and record the various performance change and different graphs which are briefed inside this paper. To improve the performance we have experimented with Artificial Intelligence based (AI Classifier), a rule-based learning method using statistical analysis and also Decision tree and Support Vector Machine (SVM) based classification schemes were used to analysis and inspection of data. This selected algorithm efficiency and overall performance for the given data set (Temp.csv) are observed and calculated. This experiment/study has been conducted using six classifiers, namely SMO, REPTree, IBK, Logistic and Multilayer perceptron, with Temp.csv datasets having 41 instances. The Waikato Environment for Knowledge Analysis (WEKA) learning tool has been used in this Experiment/study. II. METHODOLOGY Fig 1 Flow of Methodology involved.
  • 2. Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11), November - 2013, pp. 972-976 © 2013, IJARCSSE All Rights Reserved Page | 973 Using Weka tool we have executed six various classifiers algorithm on our dataset and compared the various classifiers based on the ROC Area (Weighted Average) value. Fig 2 ROC (Weighted Average) And also we found that out 6 classifiers, 4 Classifiers are showing 100% correctly classified instances and 2 Classifiers REPTree and DMNBText are showing incorrectly classified instances, therefore next step we proceed with finding out which instances was not correctly classified. For this we have to do a tuple wise analysis, so for the same I am taking only one classifier now i.e. DMNBtext Classifier. III. RESULTS TABLE 1: CLASSIFIERS COMPARISON CHART Sl. No. Classifiers ROC (Weighted Average) 1 SMO 1 2 REPTree 0.746 3 Ibk 1 4 Logistic 1 5 MultilayePerceptron 1 6 DMNBText 0.997 TABLE 2: CONSOLIDATED CLASSIFIERS SHEET USING TRAINING SET AS TEST OPTIONS Relation=temp Summary Number ofCorrectly Classified Instances 41 28 41 41 41 35 Correctly Classified Instances % 100 68.2927 100 100 100 85.3659 Number ofIncorrectly Classified Instances 0 13 0 0 0 6 Incorrectly Classified Instances % 0 31.7073 0 0 0 14.6341 Kappa statistic 1 0.4336 1 1 1 0.7681 Mean absolute error 0.2222 0.2969 0.0303 0 0.0059 0.2135 Root mean squared error 0.2722 0.3853 0.0321 0 0.0091 0.2783 Relative absolute error 55.9901 74.8118 7.635 0.0001 1.4937 53.7995 Root relative squared error 61.3467 86.8492 7.2447 0.0001 2.0548 62.7209 TotalNumber ofInstances 41 41 41 41 41 41 Detailed Accuracy By Class TP Rate (weighted avg) 1 0.683 1 1 1 0.854 FP Rate (weighted avg) 0 0.266 0 0 0 0.025 Precision (weighted avg) 1 0.585 1 1 1 0.927 Recall(weighted avg) 1 0.683 1 1 1 0.854 F-measure (weighted avg) 1 0.627 1 1 1 0.871 ROC Area (weighted avg) 1 0.746 1 1 1 0.997 Confusion Matrix a=O,b=L,c=N a-a 22 16 22 22 22 19 a-b 0 6 0 0 0 0 a-c 0 0 0 0 0 3 b-a 0 1 0 0 0 0 b-b 13 12 13 13 13 10 b-c 0 0 0 0 0 3 c-a 6 6 0 0 0 0 c-b 0 0 0 0 0 0 c-c 6 0 6 6 6 6 CLASSIFIER 6: DMNBText CLASSIFIER 3: Ibk CLASSIFIER 4: Logistic CLASSIFIER 5: MultiLayer Perceptron DATA CLASSIFICATION STATISTICS Data File Description Attributes=62 Instances=41 CLASSIFIER 1: AI Classifier SMO CLASSIFIER 2: Tree Classifier REPTree
  • 3. Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11), November - 2013, pp. 972-976 © 2013, IJARCSSE All Rights Reserved Page | 974 TABLE 3 ANALYSIS AND PERFORMANCE CHANGE FOR DMNBTEXT CLASSIFIERS USING CROSS-VALIDATION AS TEST OPTIONS From the Figure given below we can see the ROC value for DMNBText Classifier without and With the Attributor Evaluator. Thus by Calculating ROC change we can Find out the performance Change as given below. Performance Change= (ROCWith Evaluator - ROCWithout Evaluator) / ROCWithout Evaluator Without Attribute Selection With Attribute Selection: Attribute Evaluator: na me SearchMethod: Ranker List of Se le cte d Attribute s---------->> All RL-54, L-18, L-14, L- 20, L-24,, L-16, L- 15, H-35, RL-60, RL- 62, P-49, H-32, L-17 Summary Number of Correctly Classified Instances 33 31 Correctly Classified Instances % 80.487 75.6098 Number of Incorrectly Classified Instances 8 10 Incorrectly Classified Instances % 19.512 24.3902 Kappa statistic 0.6439 0.5514 Mean absolute error 0.2455 0.2827 Root mean squared error 0.3339 0.3463 Relative absolute error 61.4542 70.7685 Root relative squared error 74.8162 77.6059 Total Number of Instances 41 41 Detailed Accuracy By Class TP Rate (weighted avg) 0.805 0.756 FP Rate (weighted avg) 0.175 0.215 Precision (weighted avg) 0.829 0.645 Recall (weighted avg) 0.805 0.756 F-measure (weighted avg) 0.77 0.696 ROC Area (weighted avg) 0.897 0.909 Confusion Matrix a=O,b=L,c=N a-a 21 20 a-b 1 2 a-c 0 0 b-a 2 2 b-b 11 11 b-c 0 0 c-a 3 4 c-b 2 2 c-c 1 0 Performance change(%) 1.34 CLASSIFIER NAME: DMNBText Fig 3 Frequency Fig 4 Attribute Frequency Pattern
  • 4. Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11), November - 2013, pp. 972-976 © 2013, IJARCSSE All Rights Reserved Page | 975 TABLE 4: MULTILLAYER PERCEPTRON (USING TRAINING SET) Summary With all attributes With only selected 35 attributes Number of Correctly Classified Instances 41 41 Correctly Classified Instances % 100 100 Number of Incorrectly Classified Instances 0 0 Incorrectly Classified Instances % 0 0 Kappa statistic 1 1 Mean absolute error 0.0059 0.0075 Root mean squared error 0.0091 0.0119 Relative absolute error 1.4937 1.8879 Root relative squared error 2.0548 2.6893 Total Number of Instances 41 41 Detailed Accuracy By Class TP Rate (weighted avg) 1 1 FP Rate (weighted avg) 0 0 Precision (weighted avg) 1 1 Recall (weighted avg) 1 1 F-measure (weighted avg) 1 1 ROC Area (weighted avg) 1 1 Confusion Matrix a=O,b=L,c=N a-a 22 22 a-b 0 0 a-c 0 0 b-a 0 0 b-b 13 13 b-c 0 0 c-a 0 0 c-b 0 0 c-c 6 6 Performance change 0 0 Classifier Name: MultiLayerPerceptron (Using Traing Set) TABLE 5: MULTILLAYER PERCEPTRON (USING CROSS VALIDATION) Classifier Name: MultiLayerPerceptron (Cross-Validation) Summary With all attributes With only selected 35 attributes Number of Correctly Classified Instances 30 32 Correctly Classified Instances % 73.1701 78.0488 Number of Incorrectly Classified Instances 11 9 Incorrectly Classified Instances % 26.8293 21.9512 Kappa statistic 0.5847 0.6506 Mean absolute error 0.1866 0.1636 Root mean squared error 0.3666 0.3474 Relative absolute error 46.7145 40.9598 Root relative squared error 82.148 77.8538 Total Number of Instances 41 41 Detailed Accuracy By Class TP Rate (weighted avg) 0.732 0.78 FP Rate (weighted avg) 0.101 0.093 Precision (weighted avg) 0.818 0.828 Recall (weighted avg) 0.732 0.78 F-measure (weighted avg) 0.754 0.795 ROC Area (weighted avg) 0.908 0.911 Confusion Matrix a=O,b=L,c=N a-a 15 16 a-b 1 1 a-c 6 5 b-a 1 0 b-b 10 12 b-c 2 1 c-a 1 2 c-b 0 0 c-c 5 4 Performance change 0 0.330396476
  • 5. Amresh et al., International Journal of Advanced Research in Computer Science and Software Engineering 3(11), November - 2013, pp. 972-976 © 2013, IJARCSSE All Rights Reserved Page | 976 IV. DISCUSSION For the experimental analysis SMO, REPTree, IBK, Logistic and Multilayer perceptron classifiers are considered in this experiment/study. 41 instances in data sets were selected from the collected data. To cover this experimental I have taken Temp.csv datasets, and the observation and Calculation is done by considering following:  Attribute selection  Frequency  ROC  Confusion metrics TABLE 5 COMPARISONS WITH GRAPH WITHOUT FEATURE EXTRACTION AND WITH FEATURE EXTRACTION USING DMNBTEXT CLASSIFIER Classifier: DMNBText Type ofSearchMethod ROCArea(Wt. Avg.) AttributeEvaluator: ChiSquaredAttributeEval WithoutFeature Extraction 0.897 Ranker 0.909 Fig 5 Performance Index for DBMNText V. CONCLUSION As a conclusion we can tell that classification algorithms play a key role to solve real world problems. Selection of an application specific classifier is an emerging research area. In this paper, performance change is being evaluated and calculated using various popular classifiers. Initially, the percentage of correct classifications has been measured with the highest accuracy. Later, ranking performance has been estimated to select a suitable algorithm for this application. The ranking performance has shown that DMNBText performs the best for the given datasets. This also reduces computational complexity, and development and maintenance costs both in terms of hardware and human inspection. Based on the results obtained in the various algorithms, we can conclude that the feature selection concept played an important role and can be useful component for many classifications. This is possible due to the low computational cost of this method, which is more efficient compared to the other ones. The main advantage of this method is that it makes no assumptions and these methods, not only improved the classification speed significantly, but they also improved the accuracy rate and the reliability in most of the cases. Thus using the concept of Data Mining techniques we examine and calculate the performance using ROC Values. REFERENCES [1] Ioannis Charalampopoulos, Ioannis Anagnostopoulos, A Comparable Study employing WEKA Clustering/Classification Algorithms for Web Page Classification, IEEE 2011, Page(s): 235 – 239. [2] G M Shafiullah, A B M Shawkat Ali, Adam Thompson, Peter J Wolfs, Rule-Based Classification Approach for Railway Wagon HealthMonitoring, IEEE 2010, Page(s): 1 – 7. [3] Weka, Available: http://www.cs.waikato.ac.nz [4] Data_mining, Available: http://en.wikipedia.org [5] aimag-kdd-overview-1996-Fayyad.pdf , Available: http://www.kdnuggets.com [6] IJCSE10-01-04-51.pdf, Available: http://www.ijcse.com