International Association of Scientific Innovation and Research (IASIR)
(An Association Unifying the Sciences, Engineering, and Applied Research)
International Journal of Emerging Technologies in Computational
and Applied Sciences (IJETCAS)
www.iasir.net
IJETCAS 14-338; © 2014, IJETCAS All Rights Reserved Page 147
ISSN (Print): 2279-0047
ISSN (Online): 2279-0055
Classification of data using New Enhanced Decision Tree Algorithm
(NEDTA)
Hardeep Kaur 1
Harpreet Kaur 2
Department of Computer Science and Engineering
SBBSIET, Jalandhar, Punjab, India
__________________________________________________________________________________________
Abstract: Data mining is a method of finding hidden patterns in a large amount of data stored in a database.
The decision tree is a data mining technique that classifies data and produces valuable results. These results
are used in analysis and future prediction. The prime objective of this research work is to present an enhanced
decision tree algorithm that classifies data more efficiently and effectively than existing decision tree
classifiers. We apply the existing decision tree classifiers ID3, J48 and NBTree to a large amount of data. The
efficiency and performance of the existing algorithms are then examined and compared with the New Enhanced
Decision Tree Algorithm (NEDTA). Our enhanced decision tree algorithm produces better results than the other
decision tree algorithms.
Keywords: Data Mining; Decision Tree; ID3; J48; NBTree
__________________________________________________________________________________________
I. Introduction
Data mining is a knowledge discovery process in which data is analyzed. The analysis is based on historical
activities stored in very large repositories, and the results are used to obtain useful information. Data mining
is a method of finding hidden patterns in a large amount of data. It has applications in areas such as banking,
insurance, medicine and real estate. In the insurance and banking fields [1], data mining is applied for fraud
detection, identification of loyal customers, sales promotion and enhanced research.
The process of data mining is iterative and is also known as the Knowledge Discovery process. It consists of the
following phases:
1. Problem Definition: In this phase data mining experts, business experts and domain experts work together to
understand the problem and define the objectives.
2. Data Exploration: In this phase the data is explored and metadata is defined by domain experts.
3. Data Preparation: A data model is formed in this phase from the collected data.
4. Modeling: Data mining functions are selected from the data model and applied to the data.
5. Evaluation: The results obtained by modeling are evaluated. If the results do not meet expectations, the
model is rebuilt until the required results are obtained.
6. Deployment: In the deployment phase, the mining results are deployed into applications.
Various data mining techniques are available, such as clustering, classification and association.
Clustering is the process of grouping similar objects into one class, so that multiple classes are formed, each
comprising similar objects. Association is the process in which association rules are created. These association
rules analyze seemingly unrelated data and produce associations between them.
In this paper, Section I introduces data mining and the process of knowledge discovery. Section II describes
classification and its techniques. Section III describes the decision tree classification technique and explains
some decision tree algorithms. Section IV discusses the objectives of the research work. Section V briefly
discusses the Weka data mining tool, which is used in this research work. Section VI describes the data used in
the research work. The proposed work is implemented in Section VII. The results are evaluated and compared in
Section VIII. The comparison of algorithms is based on execution time, accuracy and error rate (mean absolute
error (MAE), root mean squared error (RMSE), relative absolute error (RAE) and root relative squared error
(RRSE)). In Section IX a new enhanced decision tree algorithm (NEDTA) is proposed. In Section X the results of
NEDTA are evaluated and compared with ID3, J48 and NBTree. Section XI contains the conclusion. Section XII
contains the references used in this research work.
II. Classification
Classification is a data mining technique that classifies data with the help of certain classification rules to
produce valuable results. There are two areas of classification: Decision Tree Induction and Neural Induction.
Decision tree induction is a classification method in which a set of rules is applied recursively to a dataset
and a tree is generated.
III. Decision Tree
A decision tree consists of nodes [2]. Each node represents some information. Decision tree learning starts from
the root node, and discrete values are produced at each node by testing the values of an attribute. These
discrete values act as the target function. The target function is then used to evaluate the value of the
attribute for the next node. This process is repeated for each new node. The learned tree is represented by
if-then rules.
Decision tree algorithms [3] such as ID3, C4.5, J48 [4] and NBTree can be applied to a large amount of data to
produce valuable predictions. These predictions evaluate the future behavior of the problem. Decision trees are
preferred because they can evaluate information more accurately than other methods.
In this research work the following decision tree algorithms are used:
(1) ID3: ID3 stands for Iterative Dichotomiser 3. It is a decision tree algorithm developed by Ross
Quinlan. The steps of the ID3 algorithm are as follows:
(a) ID3 is a greedy algorithm in which the tree is created from top to bottom.
(b) At each node, the attribute that best classifies the data is selected. The data is in the form of
training examples.
(c) The above process is repeated until the complete tree is generated or until all the attributes are used.
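Step (b) can be illustrated with a minimal sketch of attribute selection by information gain, the measure usually associated with ID3. The records, attribute names and helper functions below are hypothetical and are not taken from the banking dataset used in this work.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, attribute):
    """Entropy reduction obtained by splitting on one categorical attribute."""
    total = len(labels)
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attribute], []).append(label)
    remainder = sum(len(part) / total * entropy(part) for part in partitions.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels, attributes):
    """Greedy choice: the attribute with the largest information gain."""
    return max(attributes, key=lambda a: information_gain(rows, labels, a))


# Hypothetical toy records, used only to exercise the functions.
rows = [
    {"income": "high", "married": "yes"},
    {"income": "high", "married": "no"},
    {"income": "low", "married": "yes"},
    {"income": "low", "married": "no"},
]
labels = ["approve", "approve", "reject", "reject"]

print(best_attribute(rows, labels, ["income", "married"]))
```

In this toy example the attribute income separates the two classes completely, so it is selected ahead of married, which leaves the classes evenly mixed.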
(2) J48: J48 is the open source Java implementation [5] of the C4.5 decision tree algorithm in the Weka data
mining tool. The steps of the J48 algorithm are as follows:
(a) The algorithm creates trees using a recursive, top-down, divide-and-conquer approach.
(b) Initially, all the training examples are at the root node.
(c) A test attribute is selected based on measures such as information gain and entropy.
(d) The examples are divided repeatedly using the test attribute.
(e) The process continues until no samples are left.
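As an illustration of step (c), the following sketch computes the gain ratio, the selection measure generally associated with C4.5. The text above names only information gain and entropy, so the use of gain ratio here is an assumption added for illustration, and the column names and values are hypothetical.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a list of discrete values."""
    total = len(values)
    return -sum((n / total) * math.log2(n / total) for n in Counter(values).values())


def gain_ratio(column, labels):
    """Information gain of a categorical column divided by its split information."""
    total = len(labels)
    groups = {}
    for value, label in zip(column, labels):
        groups.setdefault(value, []).append(label)
    gain = entropy(labels) - sum(len(g) / total * entropy(g) for g in groups.values())
    split_info = entropy(column)  # entropy of the attribute's own value distribution
    return gain / split_info if split_info > 0 else 0.0


# Hypothetical columns: both separate the classes perfectly, but the identifier
# column does so only because every value is unique.
labels = ["yes", "yes", "no", "no"]
income = ["high", "high", "low", "low"]
account_id = ["a", "b", "c", "d"]

print(gain_ratio(income, labels), gain_ratio(account_id, labels))
```

Both columns have the same information gain, but dividing by the split information penalizes the many-valued identifier column, so income receives the higher score.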
(3) NBTree: NBTree (Naive Bayesian tree) combines [6] naive Bayesian classification and decision tree
learning. An NBTree classifier sorts an example to a leaf and then assigns a class label by applying a
naive Bayes classifier at that leaf. The steps of the NBTree algorithm are:
(a) At each leaf node of the tree, a naive Bayes classifier is applied.
(b) The instances are classified using the naive Bayes classifier of each leaf node.
(c) As the tree grows, a naive Bayes classifier is constructed for each leaf.
(d) This process is repeated until no example is left.
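The idea of routing an example to a leaf and then classifying it with a naive Bayes model can be sketched as follows. This is a simplified illustration, not the NBTree implementation in Weka: the tree has a single level, the split attribute is supplied by hand, add-one smoothing is assumed, and all names and records are hypothetical.

```python
from collections import Counter, defaultdict


class LeafNaiveBayes:
    """Categorical naive Bayes with add-one smoothing, trained on one leaf's examples."""

    def __init__(self, rows, labels):
        self.class_counts = Counter(labels)
        self.total = len(labels)
        self.value_counts = defaultdict(Counter)  # (attribute, class) -> value counts
        self.domains = defaultdict(set)           # attribute -> values seen at this leaf
        for row, label in zip(rows, labels):
            for attribute, value in row.items():
                self.value_counts[(attribute, label)][value] += 1
                self.domains[attribute].add(value)

    def predict(self, row):
        def score(label):
            # Prior times the smoothed conditional probability of each attribute value.
            p = self.class_counts[label] / self.total
            for attribute, value in row.items():
                seen = self.value_counts[(attribute, label)][value]
                p *= (seen + 1) / (self.class_counts[label] + len(self.domains[attribute]))
            return p

        return max(self.class_counts, key=score)


def build_leaves(rows, labels, split_attribute):
    """One tree level: route examples by split_attribute, fit naive Bayes per leaf."""
    buckets = defaultdict(lambda: ([], []))
    for row, label in zip(rows, labels):
        rest = {k: v for k, v in row.items() if k != split_attribute}
        buckets[row[split_attribute]][0].append(rest)
        buckets[row[split_attribute]][1].append(label)
    return {value: LeafNaiveBayes(r, l) for value, (r, l) in buckets.items()}


def classify(leaves, row, split_attribute):
    """Sort the example to its leaf, then let that leaf's naive Bayes assign the class."""
    rest = {k: v for k, v in row.items() if k != split_attribute}
    return leaves[row[split_attribute]].predict(rest)


# Hypothetical toy records.
rows = [
    {"region": "north", "income": "high", "owns_home": "yes"},
    {"region": "north", "income": "high", "owns_home": "no"},
    {"region": "north", "income": "low", "owns_home": "no"},
    {"region": "south", "income": "low", "owns_home": "yes"},
    {"region": "south", "income": "high", "owns_home": "no"},
    {"region": "south", "income": "high", "owns_home": "no"},
]
labels = ["yes", "yes", "no", "yes", "no", "no"]
leaves = build_leaves(rows, labels, "region")

print(classify(leaves, {"region": "north", "income": "high", "owns_home": "yes"}, "region"))
```

The same attribute values can receive different labels in different leaves, because each leaf estimates its probabilities only from the examples routed to it.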
IV. Objectives of Research Work
The objectives of this research work are as follows:
(1) To apply the decision tree algorithms ID3, J48 and NBTree to a banking dataset.
(2) To evaluate the results produced.
(3) To carry out a comparative analysis of the results for ID3, J48 and NBTree using the parameters accuracy,
execution time and error rate.
(4) To build a new enhanced method for the classification of data.
V. Tool Used
In this research work, an open source tool named Weka is used. Weka is free, open source data mining software
based on a Java data mining library. Weka provides various machine learning algorithms for different data mining
applications. The algorithms are applied directly to a dataset and, for decision tree classifiers, the results
are generated in the form of a tree. Weka contains tools for classification [7], clustering, association,
regression, pre-processing and visualization. Weka is also used for the development of new machine learning
schemes.
VI. Data Set Used
In our research work, we have used a banking dataset [8]. The main focus of this research is the performance
evaluation of decision tree algorithms. There are many decision tree algorithms in data mining, but we focus
mainly on ID3, J48 and NBTree. The dataset contains 5264 rows and 13 columns.
VII. Implementation of Work
To implement the objectives of the research work, we first applied the ID3, J48 and NBTree algorithms to the
banking [9] dataset using the data mining tool Weka 3.4. Figure 1 shows the implementation of the ID3 decision
tree algorithm in the Weka data mining tool. Figure 2 shows the implementation of J48 in the Weka data mining
tool, and Figure 3 shows the J48 tree visualization in Weka Explorer. Figure 4 shows the implementation of the
NBTree decision tree algorithm in the Weka data mining tool.
Figure 1: ID3 Algorithm
Figure 2: J48 Algorithm
Figure 3: J48 Algorithm Visualization Tree in Weka Explorer
Figure 4: NBTree Algorithm
VIII. Comparative Analysis of ID3, J48 and NBTree
Experiments were conducted in which the different decision tree algorithms were applied to the banking dataset
in the Weka framework. The results of the decision tree classifiers ID3, J48 and NBTree are compared. In our
experiment, the parameters error rate, execution time and accuracy are evaluated and compared. Table I shows the
accuracy of the decision tree classifiers ID3, J48 and NBTree. Table II shows the performance of the classifiers
in terms of execution time.
Table I: Classifier Accuracy
Algorithm   Correctly Classified Instances   Incorrectly Classified Instances
ID3         84.8784 %                        15.1216 %
J48         87.9179 %                        12.0821 %
NBTree      85.6003 %                        14.3997 %
Table II: Performance of Classifiers
Algorithm   Time Taken to Build Model (in seconds)
ID3         0.08
J48         0.1
NBTree      7.96
Table III shows the error rate of the decision tree classifiers. The error rate is shown as mean absolute error
(MAE), root mean squared error (RMSE), relative absolute error (RAE) and root relative squared error (RRSE).
Table III: Error rate of Classifiers
Algorithm   MAE      RMSE     RAE         RRSE
ID3         0.0991   0.2237   28.3881 %   53.4855 %
J48         0.0787   0.1983   22.4945 %   47.4297 %
NBTree      0.1065   0.2296   30.4526 %   54.9118 %
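The four error measures can be sketched with their textbook definitions for paired numeric values. The values in the tables are reported by Weka; how Weka applies these measures to a classifier's output is not described here, so the function below is an illustrative assumption, and the sample values are hypothetical. The relative measures compare the errors with those of a baseline that always predicts the mean of the actual values.

```python
import math


def error_measures(actual, predicted):
    """Return MAE, RMSE, RAE (%) and RRSE (%) for paired numeric values."""
    n = len(actual)
    mean_actual = sum(actual) / n
    abs_err = sum(abs(p - a) for a, p in zip(actual, predicted))
    sq_err = sum((p - a) ** 2 for a, p in zip(actual, predicted))
    # Errors of the baseline that always predicts the mean of the actual values.
    base_abs = sum(abs(a - mean_actual) for a in actual)
    base_sq = sum((a - mean_actual) ** 2 for a in actual)
    return {
        "MAE": abs_err / n,
        "RMSE": math.sqrt(sq_err / n),
        "RAE": 100.0 * abs_err / base_abs,
        "RRSE": 100.0 * math.sqrt(sq_err / base_sq),
    }


# Hypothetical values, used only to exercise the function.
actual = [1.0, 0.0, 1.0, 0.0]
predicted = [0.9, 0.2, 0.6, 0.1]
measures = error_measures(actual, predicted)
print(measures)
```

A relative error below 100 % means the predictions are closer to the actual values than the mean-only baseline would be.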
The following graphs show the accuracy, execution time and error rate comparison of the ID3, J48 and NBTree
algorithms.
Figure 5: This graph shows accuracy comparison of ID3, J48 and NBTree algorithms
Figure 6: This graph shows Execution Time of ID3, J48 and NBTree algorithms
Figure 7: This graph shows Error Rate (MAE, RMSE, RAE and RRSE) comparison of ID3, J48 and NBTree algorithms
IX. New Enhanced Decision Tree Algorithm
We propose a new decision tree algorithm that classifies a large amount of data. Existing decision tree
algorithms have some drawbacks, and our enhanced algorithm produces better results than ID3, J48 and
NBTree [10]. The steps of the proposed algorithm are as follows:
1. A decision tree DT is built from the training examples, with a collection S of m source leaf nodes and a
collection D of n destination leaf nodes.
2. A pre-specified constant k (k ≤ m) is given, where m is the total number of source leaf nodes.
3. Construct the branches according to the different values of attribute Pi, so that the samples are partitioned
accordingly.
4. If the samples for a certain value are all of the same class, generate a leaf node labeled with that
class.
5. Otherwise, repeat the same process recursively to form a decision tree for the samples at each
partition.
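Steps 3 to 5 describe a recursive partitioning of the samples, which can be sketched as follows. This is a sketch under stated assumptions rather than the NEDTA implementation: the way attribute Pi is chosen and the roles of S, D and k are not specified in the steps above, so the code simply takes attributes in the order given and falls back to the majority class when no attribute is left. The records and names are hypothetical.

```python
from collections import Counter


def build_tree(rows, labels, attributes):
    """Recursively partition examples; a leaf is a class label, a node is a dict."""
    if len(set(labels)) == 1:  # step 4: all samples share one class
        return labels[0]
    if not attributes:  # assumption: majority class when no attribute is left
        return Counter(labels).most_common(1)[0][0]
    attribute, remaining = attributes[0], attributes[1:]  # placeholder attribute choice
    branches = {}
    for row, label in zip(rows, labels):  # step 3: one branch per attribute value
        part = branches.setdefault(row[attribute], ([], []))
        part[0].append(row)
        part[1].append(label)
    return {
        "attribute": attribute,
        "branches": {  # step 5: repeat the process on each partition
            value: build_tree(r, l, remaining) for value, (r, l) in branches.items()
        },
    }


def predict(tree, row):
    """Follow the branches that match the row until a leaf label is reached."""
    while isinstance(tree, dict):
        tree = tree["branches"][row[tree["attribute"]]]
    return tree


# Hypothetical toy records.
rows = [
    {"employed": "yes", "balance": "high"},
    {"employed": "yes", "balance": "low"},
    {"employed": "no", "balance": "high"},
    {"employed": "no", "balance": "low"},
]
labels = ["approve", "reject", "reject", "reject"]
tree = build_tree(rows, labels, ["employed", "balance"])

print(predict(tree, {"employed": "yes", "balance": "high"}))
```

In this sketch an attribute value that never occurred in the training samples raises a KeyError in predict; handling such values is left open.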
X. Results
NEDTA is applied to the banking dataset and the results are compared with the ID3, J48 and NBTree algorithms.
Figure 8 shows the evaluation of NEDTA in the Weka data mining tool. NEDTA produces better results than ID3, J48
and NBTree in terms of execution time, accuracy and error rate.
Figure 8: Implementation of New Enhanced Decision Tree Algorithm (NEDTA) on Weka Explorer
Table IV shows the accuracy comparison of ID3, J48 and NBTree with NEDTA. The percentage of correctly classified
instances for our algorithm is higher than for the other algorithms. Table V shows the performance comparison of
ID3, J48 and NBTree with NEDTA. The execution time of our algorithm is lower than that of the other algorithms.
Table VI shows the error rate comparison of ID3, J48 and NBTree with NEDTA.
Table IV: Comparison of accuracy of NEDTA with ID3, J48 and NBTree
Algorithm   Correctly Classified Instances   Incorrectly Classified Instances
ID3         84.8784 %                        15.1216 %
J48         87.9179 %                        12.0821 %
NBTree      85.6003 %                        14.3997 %
NEDTA       88.3549 %                        11.6451 %
Table V: Performance comparison of NEDTA with ID3, J48 and NBTree
Algorithm   Time Taken to Build Model (in seconds)
ID3         0.08
J48         0.1
NBTree      7.96
NEDTA       0.06
Table VI: Comparison of error rate of NEDTA with ID3, J48 and NBTree
Algorithm   MAE      RMSE     RAE         RRSE
ID3         0.0991   0.2237   28.3881 %   53.4855 %
J48         0.0787   0.1983   22.4945 %   47.4297 %
NBTree      0.1065   0.2296   30.4526 %   54.9118 %
NEDTA       0.0675   0.1838   19.3113 %   43.9458 %
Figure 9: This graph shows accuracy comparison of ID3, J48 and NBTree algorithms with NEDTA
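The size of the differences reported in Tables IV and V can be worked out with a short calculation. The figures are copied from the tables; the function names are hypothetical, and because build times are reported to two decimal places the time ratios are only approximate.

```python
# Values copied from Tables IV and V.
accuracy = {"ID3": 84.8784, "J48": 87.9179, "NBTree": 85.6003, "NEDTA": 88.3549}
build_time = {"ID3": 0.08, "J48": 0.1, "NBTree": 7.96, "NEDTA": 0.06}


def accuracy_gain(baseline, candidate="NEDTA"):
    """Difference in correctly classified instances, in percentage points."""
    return round(accuracy[candidate] - accuracy[baseline], 4)


def time_ratio(baseline, candidate="NEDTA"):
    """How many times longer the baseline took to build its model."""
    return round(build_time[baseline] / build_time[candidate], 2)


for name in ("ID3", "J48", "NBTree"):
    print(name, accuracy_gain(name), time_ratio(name))
```

On these figures the accuracy gap between NEDTA and the closest existing classifier, J48, is 0.437 percentage points, while the largest difference in build time is with NBTree.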
Figure 10: This graph shows performance comparison of ID3, J48 and NBTree algorithms with NEDTA
Figure 11: This graph shows Error rate (MAE, RMSE, RAE and RRSE) comparison of ID3, J48 and NBTree algorithms
with NEDTA
XI. Conclusion
Data mining plays an important role in knowledge discovery. Various decision tree algorithms are used to
classify large amounts of data, and each algorithm performs differently on different datasets. When classifying
a large amount of data, the performance of some algorithms decreases. Our algorithm addresses this problem. On
the banking dataset, NEDTA shows better performance than ID3, J48 and NBTree in terms of execution time, error
rate and accuracy.
References
[1] Kazi Imran Moin and Dr. Qazi Baseer Ahmed, “Use of Data Mining in Banking”, International Journal of
Engineering Research and Applications (IJERA), Vol. 2, Issue 2, pp. 738-742, 2012.
[2] J. R. Quinlan, “Induction of Decision Trees”, Machine Learning.
[3] Mrs. Swati V. Kulkarni, “Mining knowledge using Decision Tree Algorithm”, International Journal of
Scientific & Engineering Research, Vol. 2, Issue 5.
[4] Youvrajsinh Chauhan and Jignesh Vania, “J48 Classifier Approach to Detect Characteristic of Bt Cotton base on
Soil Micro Nutrient”, International Journal of Computer Trends and Technology (IJCTT), Vol. 5, 2013.
[5] Bangsuk Jantawan and Cheng-Fa Tsai, “The Application of Data Mining to Build Classification Model for
Predicting Graduate Employment”, International Journal of Computer Science and Information Security, Vol. 11,
No. 10, October 2013.
[6] Yumin Zhao, Zhendong Niu and Xueping Peng, “Research on Data Mining Technologies for Complicated Attributes
Relationship in Digital Library Collections”, Applied Mathematics & Information Sciences, An International
Journal, Vol. 8, No. 3, pp. 1173-1178, 2014.
[7] Aman Kumar Sharma and Suruchi Sahni, “A Comparative Study of Classification Algorithms for Spam Email Data
Analysis”, International Journal on Computer Science and Engineering (IJCSE), Vol. 3, No. 5, pp. 1890-1895, 2011.
[8] Pardeep Kumar, Nitin, Vivek Kumar Sehgal and Durg Singh Chauhan, “A Benchmark to Select Data Mining Based
Classification Algorithms for Business Intelligence and Decision Support Systems”, International Journal of Data
Mining & Knowledge Management Process (IJDKP), Vol. 2, No. 5, pp. 25-42, 2012.
[9] Vivek Bhambri, “Role of Data Mining in Banking Sector”, International Indexed & Referred Research Journal,
Vol. III, Issue 33, pp. 70-71, 2012.
[10] Milija Suknovic, Boris Delibasic, Milos Jovanovic, Milan Vukicevic, Dragana Becejski-Vujaklija and Zoran
Obradovic, “Reusable components in decision tree induction algorithms”, Springer, 2011.
Acknowledgment
I express my sincere gratitude to Er. Harpreet Kaur, Assistant Professor in the Department of Computer Science
and Engineering at SBBSIET, Jalandhar, Punjab, for her stimulating guidance, continuous encouragement and
supervision.
0
1
2
3
4
5
6
7
8
Execution
time

More Related Content

What's hot

Data mining techniques a survey paper
Data mining techniques a survey paperData mining techniques a survey paper
Data mining techniques a survey papereSAT Publishing House
 
Data mining techniques
Data mining techniquesData mining techniques
Data mining techniqueseSAT Journals
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETEditor IJMTER
 
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET Journal
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetIRJET Journal
 
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINEA NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINEaciijournal
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESIJCSES Journal
 
Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...IOSR Journals
 
IRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET Journal
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CAREijistjournal
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for properIJDKP
 
Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...IOSRjournaljce
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceIJDKP
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYEditor IJMTER
 
Gene Selection Based on Rough Set Applications of Rough Set on Computational ...
Gene Selection Based on Rough Set Applications of Rough Set on Computational ...Gene Selection Based on Rough Set Applications of Rough Set on Computational ...
Gene Selection Based on Rough Set Applications of Rough Set on Computational ...ijcoa
 
IRJET- The Machine Learning: The method of Artificial Intelligence
IRJET- The Machine Learning: The method of Artificial IntelligenceIRJET- The Machine Learning: The method of Artificial Intelligence
IRJET- The Machine Learning: The method of Artificial IntelligenceIRJET Journal
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection methodIJSRD
 

What's hot (20)

Data mining techniques a survey paper
Data mining techniques a survey paperData mining techniques a survey paper
Data mining techniques a survey paper
 
Data mining techniques
Data mining techniquesData mining techniques
Data mining techniques
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
 
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease Dataset
 
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINEA NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
 
Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...Performance Evaluation of Different Data Mining Classification Algorithm and ...
Performance Evaluation of Different Data Mining Classification Algorithm and ...
 
IRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET- Medical Data Mining
IRJET- Medical Data Mining
 
SAM 40
SAM 40SAM 40
SAM 40
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for proper
 
Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduce
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
 
Gene Selection Based on Rough Set Applications of Rough Set on Computational ...
Gene Selection Based on Rough Set Applications of Rough Set on Computational ...Gene Selection Based on Rough Set Applications of Rough Set on Computational ...
Gene Selection Based on Rough Set Applications of Rough Set on Computational ...
 
IRJET- The Machine Learning: The method of Artificial Intelligence
IRJET- The Machine Learning: The method of Artificial IntelligenceIRJET- The Machine Learning: The method of Artificial Intelligence
IRJET- The Machine Learning: The method of Artificial Intelligence
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
 
61_Empirical
61_Empirical61_Empirical
61_Empirical
 
2-IJCSE-00536
2-IJCSE-005362-IJCSE-00536
2-IJCSE-00536
 

Viewers also liked

Guia para la_gestion_integral_de_los_residuos_solidos_urbanos
Guia para la_gestion_integral_de_los_residuos_solidos_urbanosGuia para la_gestion_integral_de_los_residuos_solidos_urbanos
Guia para la_gestion_integral_de_los_residuos_solidos_urbanosLissbeth Rodriguez
 
Engl 102 final exam 1
Engl 102 final exam 1Engl 102 final exam 1
Engl 102 final exam 1scorpions1232
 
Como calcular o valor da empresa
Como calcular o valor da empresaComo calcular o valor da empresa
Como calcular o valor da empresaEduardo Vilas Boas
 
E Tail East10 Tls Sn
E Tail East10 Tls SnE Tail East10 Tls Sn
E Tail East10 Tls Snshawnneveu
 
Project Portfolio Management
Project Portfolio ManagementProject Portfolio Management
Project Portfolio ManagementAnand Subramaniam
 
Greek and romans chapter 5
Greek and romans chapter 5Greek and romans chapter 5
Greek and romans chapter 5Karen Owens
 
Las-artes-y-su-ensenanza-en-la-educacion-basica
 Las-artes-y-su-ensenanza-en-la-educacion-basica Las-artes-y-su-ensenanza-en-la-educacion-basica
Las-artes-y-su-ensenanza-en-la-educacion-basicaYuly Andrea
 
20 F Portugues 2011
20 F Portugues 201120 F Portugues 2011
20 F Portugues 2011Embraer RI
 
Taller autoestima
Taller autoestimaTaller autoestima
Taller autoestimaMiroslava
 
Standard Operating Procedure (SOP) for Information Technology (IT) Operations
Standard Operating Procedure (SOP) for Information Technology (IT) OperationsStandard Operating Procedure (SOP) for Information Technology (IT) Operations
Standard Operating Procedure (SOP) for Information Technology (IT) OperationsRonald Bartels
 
Editorial santillana ciencias II
Editorial santillana ciencias IIEditorial santillana ciencias II
Editorial santillana ciencias IIEey Thesame
 
The Marketing Concept
The Marketing ConceptThe Marketing Concept
The Marketing ConceptWaseem Saeed
 
Greek and romans chapter 5
Greek and romans chapter 5Greek and romans chapter 5
Greek and romans chapter 5Karen Owens
 
La secuenciación del genoma humano revolucionó la biología
La secuenciación del genoma humano revolucionó la biologíaLa secuenciación del genoma humano revolucionó la biología
La secuenciación del genoma humano revolucionó la biologíaleo
 
Int A La Economia
Int A La EconomiaInt A La Economia
Int A La Economiaguest3886a2
 
Modulo herramientas telematicas
Modulo herramientas telematicasModulo herramientas telematicas
Modulo herramientas telematicascoycoy_28
 
Ciencias ComunicacióN
Ciencias ComunicacióNCiencias ComunicacióN
Ciencias ComunicacióNguest448765
 

Viewers also liked (20)

Guia para la_gestion_integral_de_los_residuos_solidos_urbanos
Guia para la_gestion_integral_de_los_residuos_solidos_urbanosGuia para la_gestion_integral_de_los_residuos_solidos_urbanos
Guia para la_gestion_integral_de_los_residuos_solidos_urbanos
 
Engl 102 final exam 1
Engl 102 final exam 1Engl 102 final exam 1
Engl 102 final exam 1
 
Como calcular o valor da empresa
Como calcular o valor da empresaComo calcular o valor da empresa
Como calcular o valor da empresa
 
E Tail East10 Tls Sn
E Tail East10 Tls SnE Tail East10 Tls Sn
E Tail East10 Tls Sn
 
Project Portfolio Management
Project Portfolio ManagementProject Portfolio Management
Project Portfolio Management
 
Grupo de pedro (4)
Grupo de pedro (4)Grupo de pedro (4)
Grupo de pedro (4)
 
Mexico frente a_la_crisis_
Mexico frente a_la_crisis_Mexico frente a_la_crisis_
Mexico frente a_la_crisis_
 
Greek and romans chapter 5
Greek and romans chapter 5Greek and romans chapter 5
Greek and romans chapter 5
 
Physics
PhysicsPhysics
Physics
 
Las-artes-y-su-ensenanza-en-la-educacion-basica
 Las-artes-y-su-ensenanza-en-la-educacion-basica Las-artes-y-su-ensenanza-en-la-educacion-basica
Las-artes-y-su-ensenanza-en-la-educacion-basica
 
20 F Portugues 2011
20 F Portugues 201120 F Portugues 2011
20 F Portugues 2011
 
Taller autoestima
Taller autoestimaTaller autoestima
Taller autoestima
 
Standard Operating Procedure (SOP) for Information Technology (IT) Operations
Standard Operating Procedure (SOP) for Information Technology (IT) OperationsStandard Operating Procedure (SOP) for Information Technology (IT) Operations
Standard Operating Procedure (SOP) for Information Technology (IT) Operations
 
Editorial santillana ciencias II
Editorial santillana ciencias IIEditorial santillana ciencias II
Editorial santillana ciencias II
 
The Marketing Concept
The Marketing ConceptThe Marketing Concept
The Marketing Concept
 
Greek and romans chapter 5
Greek and romans chapter 5Greek and romans chapter 5
Greek and romans chapter 5
 
La secuenciación del genoma humano revolucionó la biología
La secuenciación del genoma humano revolucionó la biologíaLa secuenciación del genoma humano revolucionó la biología
La secuenciación del genoma humano revolucionó la biología
 
Int A La Economia
Int A La EconomiaInt A La Economia
Int A La Economia
 
Modulo herramientas telematicas
Modulo herramientas telematicasModulo herramientas telematicas
Modulo herramientas telematicas
 
Ciencias ComunicacióN
Ciencias ComunicacióNCiencias ComunicacióN
Ciencias ComunicacióN
 

Ijetcas14 338

International Association of Scientific Innovation and Research (IASIR)
(An Association Unifying the Sciences, Engineering, and Applied Research)
International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)
www.iasir.net
IJETCAS 14-338; © 2014, IJETCAS All Rights Reserved
ISSN (Print): 2279-0047
ISSN (Online): 2279-0055

Classification of data using New Enhanced Decision Tree Algorithm (NEDTA)
Hardeep Kaur 1, Harpreet Kaur 2
Department of Computer Science and Engineering
SBBSIET, Jalandhar, Punjab, India

Abstract: Data mining is a method of maintaining a large amount of data stored in a database. A decision tree is a data mining technique that classifies data and produces valuable results, which are used for analysis and future prediction. The prime objective of this research work is to present an enhanced decision tree algorithm that classifies data more efficiently and effectively than existing decision tree classifiers. We apply the existing decision tree classifiers ID3, J48 and NBTree to a large amount of data. The efficiency and performance of the existing algorithms are then examined and compared with the new enhanced decision tree algorithm (NEDTA). Our enhanced decision tree algorithm produces better results than the other decision tree algorithms.

Keywords: Data Mining; Decision Tree; ID3; J48; NBTree

I. Introduction
Data mining is a knowledge discovery process in which data is analyzed. The analysis is based on historical activities stored in very large repositories, and the results are used to obtain useful information. Data mining is a method of finding hidden patterns in a large amount of data. It has applications in banking, insurance, medicine, real estate and other fields. In insurance and banking [1], data mining is applied to fraud detection, identification of loyal customers, sales promotion and enhanced research.

The process of data mining is iterative and is also known as the Knowledge Discovery process. It consists of the following phases:
1. Problem Definition: Data mining experts, business experts and domain experts work to understand the problem and define the objectives.
2. Data Exploration: The data is explored and metadata is defined by domain experts.
3. Data Preparation: A data model is formed from the collected data.
4. Modeling: Data mining functions are selected from the data model and applied to the data.
5. Evaluation: The results obtained by modeling are evaluated. If the results do not meet expectations, the model is rebuilt until the required results are obtained.
6. Deployment: The mining results are deployed into applications.

Various data mining techniques are available, such as Clustering, Classification and Association. Clustering is the process of grouping similar objects into one class, so that multiple classes are formed, each comprising similar objects. Association is the process in which association rules are created. These rules analyze apparently unrelated data and produce associations between data items.

In this paper, Section I introduces data mining and the process of knowledge discovery. Section II describes classification and its techniques. Section III describes the decision tree classification technique and explains some decision tree algorithms. Section IV discusses the objectives of the research work. Section V briefly discusses the Weka data mining tool, which is used in this research work. Section VI describes the data used. The proposed work is implemented in Section VII. The results are evaluated and compared in Section VIII. The comparison of algorithms is based on execution time, accuracy and error rate (Mean absolute error (MAE), Root mean squared error (RMSE), Relative absolute error (RAE), Root relative squared error (RRSE)). In Section IX a new enhanced decision tree algorithm (NEDTA) is proposed. In Section X the results of NEDTA are evaluated and compared with ID3, J48 and NBTree. Section XI contains the conclusion, followed by the references used in this research work.

II. Classification
Classification is a data mining technique that classifies data with the help of certain classification rules to produce valuable results. There are two areas of classification: Decision Tree Induction and Neural Induction.
Hardeep Kaur et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(2), March-May 2014, pp. 147-152

Decision tree induction is a classification method in which a set of rules is applied recursively to a dataset and a tree is generated.

III. Decision Tree
A decision tree consists of nodes [2], each of which represents some information. Decision tree learning starts from the root node, and discrete values are produced at each node by testing the values of an attribute. These discrete values act as the target function, which is then used to evaluate the attribute value for the next node. This process is repeated for each new node. The learned tree is represented by if-then rules.

Decision tree algorithms [3] such as ID3, C4.5, J48 [4] and NBTree can be applied to a large amount of data to produce valuable predictions. These predictions estimate the future behavior of the problem. Decision trees are preferred because they can evaluate information more accurately than other methods. In this research work the following decision tree algorithms are used:

(1) ID3: ID3 stands for Iterative Dichotomiser 3. It is a decision tree algorithm developed by Ross Quinlan. The steps of the ID3 algorithm are as follows:
(a) ID3 is a greedy algorithm in which the tree is created from top to bottom.
(b) At each node, the attribute that best classifies the data is selected. The data is in the form of training examples.
(c) The above process is repeated until the complete tree is generated or until all the attributes have been used.

(2) J48: J48 is the open source Java implementation [5] of the C4.5 decision tree algorithm in the Weka data mining tool. The steps of the J48 algorithm are as follows:
(a) The algorithm builds on a basic algorithm that creates trees using a recursive, top-down, divide-and-conquer approach.
(b) Initially, all the training examples are at the root node.
(c) The test attribute is selected based on measures such as information gain and entropy.
(d) The examples are divided repeatedly using the test attribute.
(e) The process continues until no sample is left.

(3) NBTree: NBTree (Naive Bayesian tree) combines [6] naive Bayesian classification and decision tree learning. An NBTree classifier sorts an example to a leaf and then assigns a class label by applying a naive Bayes classifier at that leaf. The steps of the NBTree algorithm are:
(a) At each leaf node of the tree, a naive Bayes classifier is applied.
(b) The instances at each leaf node are classified using naive Bayes.
(c) As the tree grows, a naive Bayes classifier is constructed for each leaf.
(d) This process is repeated until no example is left.

IV. Objectives of Research Work
The objectives of this research work are as follows:
(1) To apply the decision tree algorithms ID3, J48 and NBTree to a banking dataset.
(2) To evaluate the results produced.
(3) To compare the results of ID3, J48 and NBTree using the parameters accuracy, execution time and error rate.
(4) To build a new enhanced method for classification of data.

V. Tool Used
In this research work, an open source tool named Weka is used. Weka is free, open source data mining software based on a Java data mining library. Weka provides various machine learning algorithms for different data mining applications. The algorithms are applied directly to a dataset and the results are generated in the form of a tree. Weka contains tools for classification [7], clustering, association, regression, pre-processing and visualization. Weka is also used for the development of new machine learning schemes.

VI. Data Set Used
In our research work, we have used a banking dataset [8]. The main focus of this research is the performance and evaluation of decision tree algorithms. There are many decision tree algorithms in data mining, but we focus mainly on ID3, J48 and NBTree. The dataset contains 5264 rows and 13 columns.

VII. Implementation of Work
To implement the objectives of the research work, we first applied the ID3, J48 and NBTree algorithms to the banking [9] dataset using the data mining tool Weka 3.4. Figure 1 shows the implementation of the ID3 decision tree algorithm in the Weka data mining tool. Figure 2 shows the implementation of J48 in Weka. Figure 4 shows the implementation of the NBTree decision tree algorithm in Weka.

Figure 1: ID3 Algorithm
Figure 2: J48 Algorithm
Figure 3: J48 Algorithm Visualization Tree in Weka Explorer
Figure 4: NBTree Algorithm

VIII. Comparative Analysis of ID3, J48 and NBTree
The experiments were conducted by applying the different decision tree algorithms to the banking dataset in the Weka framework. The results of the decision tree classifiers ID3, J48 and NBTree are compared in terms of error rate, execution time and accuracy. Table I shows the accuracy of the classifiers. Table II shows their performance, measured as execution time. Table III shows their error rate, given as Mean absolute error (MAE), Root mean squared error (RMSE), Relative absolute error (RAE) and Root relative squared error (RRSE).

Table I: Classifier Accuracy
Algorithm   Correctly Classified Instances   Incorrectly Classified Instances
ID3         84.8784 %                        15.1216 %
J48         87.9179 %                        12.0821 %
NBTree      85.6003 %                        14.3997 %

Table II: Performance of Classifiers
Algorithm   Time Taken to Build Model (in seconds)
ID3         0.08
J48         0.1
NBTree      7.96

Table III: Error Rate of Classifiers
Algorithm   MAE      RMSE     RAE         RRSE
ID3         0.0991   0.2237   28.3881 %   53.4855 %
J48         0.0787   0.1983   22.4945 %   47.4297 %
NBTree      0.1065   0.2296   30.4526 %   54.9118 %
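As a rough illustration of the four error measures reported above (mean absolute error, root mean squared error, relative absolute error and root relative squared error), the following sketch computes them from paired actual and predicted values using their standard textbook definitions. It is not taken from the paper: the function name `error_metrics` and the sample values are hypothetical, and Weka's own evaluation of nominal classes works on class probability estimates, so its reported figures may be derived somewhat differently.

```python
import math


def error_metrics(actual, predicted):
    """Return MAE, RMSE, RAE (%) and RRSE (%) for paired numeric values.

    Hypothetical helper for illustration; assumes the actual values are
    not all identical (otherwise the baseline errors would be zero).
    """
    n = len(actual)
    mean_actual = sum(actual) / n

    # Total absolute and squared error of the predictions.
    abs_err = sum(abs(p - a) for a, p in zip(actual, predicted))
    sq_err = sum((p - a) ** 2 for a, p in zip(actual, predicted))

    # Baseline: a trivial predictor that always outputs the mean.
    base_abs = sum(abs(a - mean_actual) for a in actual)
    base_sq = sum((a - mean_actual) ** 2 for a in actual)

    return {
        "MAE": abs_err / n,
        "RMSE": math.sqrt(sq_err / n),
        "RAE": 100.0 * abs_err / base_abs,
        "RRSE": 100.0 * math.sqrt(sq_err / base_sq),
    }


# Made-up example: 1 = positive class, predictions are probabilities.
metrics = error_metrics([1, 0, 1, 0], [0.8, 0.2, 0.6, 0.4])
print(metrics)
```

The relative measures compare the classifier against the trivial mean predictor, which is why they are expressed as percentages: values below 100 % indicate that the classifier does better than that baseline.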
The following graphs show the accuracy, error rate and execution time comparison of the ID3, J48 and NBTree algorithms.

Figure 5: Accuracy comparison of ID3, J48 and NBTree algorithms
Figure 6: Execution time of ID3, J48 and NBTree algorithms
Figure 7: Error rate (MAE, RMSE, RAE and RRSE) comparison of ID3, J48 and NBTree algorithms

IX. New Enhanced Decision Tree Algorithm
We propose a new decision tree algorithm that classifies a large amount of data. Existing decision tree algorithms have some drawbacks, and our enhanced algorithm produces better results than ID3, J48 and NBTree [10]. The steps of the proposed algorithm are as follows:
1. A decision tree DT is built from the training examples, with a collection S of m source leaf nodes and a collection D of n destination leaf nodes.
2. A pre-specified constant k (k ≤ m) is given, where m is the total number of source leaf nodes.
3. Branches are constructed according to the different values of attribute Pi, so that the samples are partitioned accordingly.
4. If the samples for a certain value all belong to the same class, a leaf node is generated and labeled with that class.
5. Otherwise, the same process is repeated recursively to form a decision tree for the samples in each partition.

X. Results
NEDTA is applied to the banking dataset and the results are compared with those of the ID3, J48 and NBTree algorithms. Figure 8 shows the evaluation of NEDTA in the Weka data mining tool. NEDTA produces better results than ID3, J48 and NBTree in terms of execution time, accuracy and error rate.
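For reference, steps 3 to 5 of the proposed algorithm (partition on an attribute, emit a leaf when a partition is pure, otherwise recurse) follow the usual top-down induction pattern shared with ID3. The sketch below shows that generic pattern only; it is not the authors' NEDTA implementation. The paper does not state how the attribute Pi is chosen, so information gain is used here as an assumption, and the names `build_tree`, `classify` and the toy attributes `income` and `owns_home` are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def information_gain(rows, labels, attr):
    """Entropy reduction obtained by partitioning on attribute `attr`."""
    total = len(rows)
    remainder = 0.0
    for value in set(row[attr] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[attr] == value]
        remainder += len(subset) / total * entropy(subset)
    return entropy(labels) - remainder


def build_tree(rows, labels, attrs):
    """Recursively build a tree as nested dicts; leaves are class labels."""
    # Pure partition: generate a leaf labeled with that class.
    if len(set(labels)) == 1:
        return labels[0]
    # No attributes left: fall back to the majority class.
    if not attrs:
        return Counter(labels).most_common(1)[0][0]
    # Assumed selection criterion: highest information gain.
    best = max(attrs, key=lambda a: information_gain(rows, labels, a))
    remaining = [a for a in attrs if a != best]
    tree = {"attribute": best, "branches": {}}
    # One branch per observed value of the chosen attribute.
    for value in sorted(set(row[best] for row in rows)):
        idx = [i for i, row in enumerate(rows) if row[best] == value]
        tree["branches"][value] = build_tree(
            [rows[i] for i in idx], [labels[i] for i in idx], remaining)
    return tree


def classify(tree, row):
    """Follow branches until a leaf; raises KeyError for unseen values."""
    while isinstance(tree, dict):
        tree = tree["branches"][row[tree["attribute"]]]
    return tree


# Toy data, invented for illustration only.
rows = [
    {"income": "high", "owns_home": "yes"},
    {"income": "high", "owns_home": "no"},
    {"income": "low", "owns_home": "yes"},
    {"income": "low", "owns_home": "no"},
]
labels = ["yes", "yes", "no", "no"]
tree = build_tree(rows, labels, ["income", "owns_home"])
print(tree)
print(classify(tree, {"income": "low", "owns_home": "yes"}))
```

Representing the tree as nested dictionaries keeps the sketch short; a real classifier would also need handling for numeric attributes, missing values and attribute values not seen during training.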
Figure 8: Implementation of New Enhanced Decision Tree Algorithm (NEDTA) on Weka Explorer

Table IV shows the accuracy comparison of ID3, J48 and NBTree with NEDTA. The percentage of correctly classified instances for our algorithm is higher than for the other algorithms. Table V shows the performance comparison of ID3, J48 and NBTree with NEDTA. The execution time of our algorithm is lower than that of the other algorithms. Table VI shows the error rate comparison of ID3, J48 and NBTree with NEDTA.

Table IV: Comparison of accuracy of NEDTA with ID3, J48 and NBTree
Algorithm   Correctly Classified Instances   Incorrectly Classified Instances
ID3         84.8784 %                        15.1216 %
J48         87.9179 %                        12.0821 %
NBTree      85.6003 %                        14.3997 %
NEDTA       88.3549 %                        11.6451 %

Table V: Performance comparison of NEDTA with ID3, J48 and NBTree
Algorithm   Time Taken to Build Model (in seconds)
ID3         0.08
J48         0.1
NBTree      7.96
NEDTA       0.06

Table VI: Comparison of error rate of NEDTA with ID3, J48 and NBTree
Algorithm   MAE      RMSE     RAE         RRSE
ID3         0.0991   0.2237   28.3881 %   53.4855 %
J48         0.0787   0.1983   22.4945 %   47.4297 %
NBTree      0.1065   0.2296   30.4526 %   54.9118 %
NEDTA       0.0675   0.1838   19.3113 %   43.9458 %

Figure 9: Accuracy comparison of ID3, J48 and NBTree algorithms with NEDTA
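To make the size of the reported differences concrete, the short sketch below reads the accuracy, build time and mean absolute error figures from the tables above and works out which algorithm ranks best on each measure, along with NEDTA's margin over J48. The dictionary layout and the helper name `best_algorithm` are hypothetical; only the numbers come from the paper.

```python
# Figures copied from the comparison tables (accuracy in %, time in seconds).
results = {
    "ID3": {"accuracy": 84.8784, "build_time": 0.08, "mean_absolute_error": 0.0991},
    "J48": {"accuracy": 87.9179, "build_time": 0.1, "mean_absolute_error": 0.0787},
    "NBTree": {"accuracy": 85.6003, "build_time": 7.96, "mean_absolute_error": 0.1065},
    "NEDTA": {"accuracy": 88.3549, "build_time": 0.06, "mean_absolute_error": 0.0675},
}


def best_algorithm(metric, higher_is_better):
    """Return the name of the algorithm with the best value for `metric`."""
    pick = max if higher_is_better else min
    return pick(results, key=lambda name: results[name][metric])


# Margin of NEDTA over J48, the strongest of the existing classifiers.
accuracy_gain = results["NEDTA"]["accuracy"] - results["J48"]["accuracy"]
error_drop = (results["J48"]["mean_absolute_error"]
              - results["NEDTA"]["mean_absolute_error"])

print(best_algorithm("accuracy", higher_is_better=True))
print(best_algorithm("build_time", higher_is_better=False))
print(round(accuracy_gain, 4), round(error_drop, 4))
```

On these figures the accuracy margin over J48 is a little under half a percentage point, while the largest gap in build time is between NBTree and the other three algorithms.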
Figure 10: Performance comparison of ID3, J48 and NBTree algorithms with NEDTA
Figure 11: Error rate (MAE, RMSE, RAE and RRSE) comparison of ID3, J48 and NBTree algorithms with NEDTA

XI. Conclusion
Data mining plays an important role in knowledge discovery. Various decision tree algorithms are used to classify large amounts of data, and each algorithm performs differently on different datasets. When classifying a large amount of data, the performance of some algorithms decreases. Our algorithm addresses this problem. The results of NEDTA show better performance than the other algorithms in terms of execution time, error rate and accuracy.

References
[1] Kazi Imran Moin and Dr. Qazi Baseer Ahmed, "Use of Data Mining in Banking", International Journal of Engineering Research and Applications (IJERA), Vol. 2, Issue 2, pp. 738-742, 2012.
[2] J. R. Quinlan, "Introduction of decision tree", Journal of Machine learning.
[3] Mrs. Swati V. Kulkarni, "Mining knowledge using Decision Tree Algorithm", International Journal of Scientific & Engineering Research, Volume 2, Issue 5.
[4] Youvrajsinh Chauhan and Jignesh Vania, "J48 Classifier Approach to Detect Characteristic of Bt Cotton base on Soil Micro Nutrient", International Journal of Computer Trends and Technology (IJCTT), volume 5 number, 2013.
[5] Bangsuk Jantawan and Cheng-Fa Tsai, "The Application of Data Mining to Build Classification Model for Predicting Graduate Employment", International Journal of Computer Science and Information Security, Vol. 11, No. 10, October 2013.
[6] Yumin Zhao, Zhendong Niu and Xueping Peng, "Research on Data Mining Technologies for Complicated Attributes Relationship in Digital Library Collections", Applied Mathematics & Information Sciences, An International Journal, Appl. Math. Inf. Sci. 8, No. 3, 1173-1178 (2014).
[7] Aman Kumar Sharma and Suruchi Sahni, "A Comparative Study of Classification Algorithms for Spam Email Data Analysis", International Journal on Computer Science and Engineering (IJCSE), Vol. 3, No. 5, pp. 1890-1895, 2011.
[8] Pardeep Kumar, Nitin, Vivek Kumar Sehgal and Durg Singh Chauhan, "A Benchmark to Select Data Mining Based Classification Algorithms for Business Intelligence and Decision Support Systems", International Journal of Data Mining & Knowledge Management Process (IJDKP), Vol. 2, No. 5, pp. 25-42, 2012.
[9] Vivek Bhambri, "Role of Data Mining in Banking Sector", International Indexed & Referred Research Journal, Vol. III, Issue 33, pp. 70-71, 2012.
[10] Milija Suknovic, Boris Delibasic, Milos Jovanovic, Milan Vukicevic, Dragana Becejski-Vujaklija and Zoran Obradovic, "Reusable components in decision tree induction algorithms", Springer, 2011.

Acknowledgment
I express my sincere gratitude to Er. Harpreet Kaur, Assistant Professor in the Department of Computer Science and Engineering at SBBSIET, Jalandhar, Punjab, for her stimulating guidance, continuous encouragement and supervision.