SlideShare a Scribd company logo
1 of 5
Download to read offline
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 2, Ver. II (Mar-Apr. 2016), PP 37-41
www.iosrjournals.org
DOI: 10.9790/0661-1802023741 www.iosrjournals.org 37 | Page
Role of Fuzzy Set in Students’ Performance Prediction
Mrs. Jyoti Upadhyay1,
Dr. Pratima Gautam2
1
Ph.D. (Research Scholar) AISECT University, Bhopal, M.P.
2
Dean (IT Dept.) AISECT University, Bhopal, M.P
Abstract: We can use educational data mining to predict student’ performance on the basis of different
attribute. In this paper, the classification task is used to predict the result of students. Decision tree (DT)
learning is one of the most famous practical methods for analytical study. We use DT as prediction method in
this paper. In this paper we propose a model using fuzzy set to predict more accurate result.Fuzzy logic brings
in an improvement of analysis aspects due to the elasticity of fuzzy sets formalism. Therefore, we proposed a
decision tree on fuzzy set data, which combines ID3 with fuzzy theory.The results are compared to some other
popular classification algorithms.
Key words: Decision tree, Fuzzy set, Data mining
I. Introduction
1.1 Decision tree: In data mining Decision trees (DT) are among the most popular prediction technique.
Although DT’s are better known in their role as classifiers, they also have prominent applications in regression,
clustering and feature selection.A decision tree presents possible outcomes of a decision through graph. We use
specialized software to draw graph, which is rapid miner. DT is useful for focusing discussion when a group
must make a decision.
Ross Quinlan developed ID3 (Iterative Dichotomiser 3) in 1986. ID3 algorithm based on top-down
search approach on the given data sets. To select the attribute that is most useful for classifying a given sets, use
information gain. To find an optimal way to classify a learning set we need some function, which provides the
most balanced splitting [1]. A data set contains attributes, we have to select most suitable attribute for the root
node of DT. We are using Entropy and information gain. Entropy is the index used to measure degree of
impurity [2].
The Entropy is calculated as follows: Entropy= −𝑝 log2 𝑝 𝑗𝑗
Splitting criteria used for splitting of nodes of the tree is Information gain. To determine the best attribute for a
particular node in the tree we use the measure called Information Gain. The information gain, Gain (S, A) of an
attribute A, relative to a collection of examples S, is defined as:
Gain (S , A ) = Entropy (S) – sv𝑣𝜖𝑣𝑎𝑙𝑢𝑒 𝐴 /|𝑠|Entropy (sv)
For the best split point we have to calculate gain for each attribute and repeat the process until we didn’t get
label attribute.
1.2 fuzzy Set: The selection of the best classification algorithm for a given dataset is a very widespread
problem. In this sense it requires to make several methodological choices. Among them, in this research it
focuses on the decision tree algorithms from classification methods, which is used to assess the classification
performance and to find the best algorithm in obtaining qualitative student data. Fuzzy set theory is also known
as possibility theory, was proposed by Lotfi Zadeh in 1965. A researcher seems it as an alternative to traditional
theory. Most important, fuzzy set theory allows us to deal with inexact facts. For example, if we says that a
student who has 80% is eligible for any course but less % is not considerable then what happen for 79.8%? This
can be thought of as an extension of traditional crisp sets, in which each element must either be in or not in a
seti.e. Fuzzy set.Fuzzy sets are defined on a non-fuzzy universe of discourse, which is an ordinary sets.Fuzzy
logic set can model the normal attributes as linguistic variables (such as good, poor, avg, tall, high) into the
inductive generation of the tree structure. Fuzzy decision trees (FDTs) are better suitable to deal with the
uncertainty, fuzziness [3].A fuzzy sets F of a universe of discourse U is characterized by a membership function
μF (x) which assigns to every elementx ∈ U ,a membership degree μF (x)∈[0,1]. Anelement x∈U is said to be in
a fuzzy sets F if and only if μ (x)>0 and to be a full member if and A only if μF (x) =1[4]. Membership functions
can either be chosen by the user arbitrarily or based on the user’s experience.Typically, a fuzzy subset A can be
represented as,
A = {μA (x1 ) / x1},{μA (x2 ) / x2 },...,{μA (xn ) / xn }
Where the separating symbol / is used to associate the membership value with its coordinate on the horizontal
axis. Fuzzy sets and fuzzy logic allow providing a symbolic framework for knowledge clarity. Fuzzy decision
trees differ from traditional crisp decision trees in following manner, they use splitting criteria based on fuzzy
Role of fuzzy set in students’ performance prediction
DOI: 10.9790/0661-1802023741 www.iosrjournals.org 38 | Page
restrictions and the fuzzy sets representing the defined data.
Fuzzy decision tree required to develop the following things attribute value partitioning methods,
branching attribute selection method, branching test method to decide which degree data follows down branches
of a node, and leaf node labeling methods to determine classes for which leaf nodes stand[5]. A data set with
some condition attributes and one decision attributes can be presented in the form of knowledge representation
system J = (U , C ∪D ) , whereU ={u1,u2....,us}is the set of data samples, C = {c1 , c2 ...., cn } is the set of
condition attributes.D = {d } is the one-elemental set with the decision attribute or class label attribute. Suppose
this class label attribute has m distinct values and defining m distinct classes , di (for i=l, ..,m).
For a given subset Sj , information gain s expressed as

I(s1j,...smj)=−∑m pijlog2 pij
So information gain of attribute ci is given by
Gain(ci ) = I (s1 j ,...smj ) − E(ci )
The attribute withhighest information gain is the most informative attribute of the given data set.
II. Methodology
For our work we are using Rapid Miner tool. Which is the most powerful, easy to use graphical user
interface for the design of analytic processes[6]. In this paper, we propose a computational model based on
fuzzy-rough decision trees to learn the most significant factor for student performance.Our methodology uses
fuzzy rough sets to discard irrelevant features on the basis of their dependency. Our work methodology shows in
fig.1.We collect data from govt. schools of Chhattisgarh for the session 2014-15.Using Rapidminer we obtained
different confidence and support values. Model Evaluation shows accuracy of model
Figure 1: work methodology
III. Proposed work
To implement above methodology we will follow some steps, which are:
3.1 Data Set: For Hypothesis formation we select Educational Environment.For an educational environment it
is important to know which factor can affect result of students so that they can emphasis on particular factor
and improve successful result and get less drop out ratio. For mining select relevant data shows in table 1.
Attribute Values
Prev_year % Result of 12th
class
Cast SC,ST,OBC,GEN
Attendance % Attendance % in present class
Location RURAL,URBAN,SEMI-URBAN
Recent Result(label class) Pass,Fail
We convert data set into Fuzzy Set.
Figure 2:Data Set
Role of fuzzy set in students’ performance prediction
DOI: 10.9790/0661-1802023741 www.iosrjournals.org 39 | Page
Fuzzy Set:
Class of object can define Fuzzy set, there is no noisy margins for object [7].A fuzzy set formed by
combination of linguistic variable.
Figure 3: Fuzzy Data Set
3.2 Data Preparation done: The success of data mining techniques depends highly on an appropriate pre-
processing of the data. Pre-processing includes data selection, data normalization and transformation.
Selection Data selection is critical for the result of a data mining process. Although a relation between a
certain attribute and the desired result is not obvious, the attribute has to be considered as well because
some information may be hidden in the data.
3.3 Unsuitable Size: Remove inconsistent or remove noisy data and apply treatment about incomplete and
erroneous data. Apply Data transformation into Modified Data (after it data transforming into a new
format).We apply filter operator of Rapid miner, which takes an Example Set as input and returns a new
Example Set including only those examples that satisfy the specified condition. Several predefined
conditions are provided; users can select any of them. Users can also define their own conditions to filter
examples. This operator may reduce the number of examples in an Example Set but it has no effect on the
number of attributes. The select Attributes operator is used to select required attributes.
Figure 4: graphical representation of attributes
3.4 Experiment: We collected qualitative data for experiment and 10-fold cross validation applied. After
applying model we evaluate it and generate confusion matrix.
IV. Result and Discussion:
Data set consist 56 students.On the basis of decision tree algorithm fuzzy_attn has the highest
information gain, then the decision tree generated as shown in figure:5. We extract Classification rule with the
help of decision tree which shows location and cast also play an important role in students’ performance.
Role of fuzzy set in students’ performance prediction
DOI: 10.9790/0661-1802023741 www.iosrjournals.org 40 | Page
Figure 5:Decision tree
We use the condition attribute Fuzzy_attn, Fuzzy_prev_result, cast and location. Result as class label.
Fuzzy attn = AVG: fail {fail=17, pass=0, Pass=0}
Fuzzy attn = good
fuzzy prev result = first: pass {fail=0, pass=7, Pass=1}
fuzzy prev result = second
Cast = Gen.: pass {fail=0, pass=7, Pass=0}
Cast = OBC: pass {fail=0, pass=1, Pass=1}
Cast = SC: pass {fail=1, pass=2, Pass=0}
Cast = ST: fail {fail=1, pass=1, Pass=0}
fuzzy prev result = thierd
Location = Rural: pass {fail=1, pass=2, Pass=0}
Location = Semi-urban: fail {fail=2, pass=0, Pass=0}
Location = Urban: pass {fail=1, pass=2, Pass=0}
Fuzzy attn = poor: fail {fail=9, pass=0, Pass=0}
Figure 6: Predicted Result
We can interpret from the rule for confidence values are that the students’’ performance will be poor if
their attendance is poor. Also their location will have an impact on their result.
V. Model Evaluation
Rapid Miner is the most powerful, easy to use and intuitive graphical user interface for the design of
analytic processes [8].Classification performance is referred as the characteristics and how successful the
models formulated using DT and Fuzzy rough set decision tree learning algorithms are in accurately classifying
data points from the testing dataset and/or the independent one-class dataset.The confusion matrix shows the
accuracy of the DT for the given data sets. The proposed model was able to classify 82% of the input instances
correctly. The results show clearly that the proposed method performs well compared to other similar methods
in the literature.
Role of fuzzy set in students’ performance prediction
DOI: 10.9790/0661-1802023741 www.iosrjournals.org 41 | Page
PerformanceVector:
accuracy: 81.50% +/- 10.97% (mikro: 81.63%)
ConfusionMatrix:
True: fail pass Pass
fail: 24 2 1
pass: 5 16 1
kappa: 0.636 +/- 0.207 (mikro: 0.639)
ConfusionMatrix:
True: fail pass Pass
fail: 24 2 1
pass: 5 16 1
Performance Vector for non Fuzzy Set
accuracy: 76.00% +/- 14.97% (mikro: 75.51%)
ConfusionMatrix:
True: fail pass Pass
fail: 25 6 0
pass: 4 12 2
kappa: 0.502 +/- 0.327 (mikro: 0.501)
ConfusionMatrix:
True: fail pass Pass
fail: 25 6 0
pass: 4 12 2
VI. Conclusion
Our work is highly concerned with fuzzy set and decision tree .we propose that numeric data can be
represented by fuzzy values. When we apply decision tree model on fuzzy set then it produce more accurate
result as we we can see through confusion matrix. Our analysis shows that students’ performance can be
affected by many factors. In our next paper we will present more attributes with weight.
References
[1]. T.Miranda Lakshmi
 , A.Martin
 ,others,An Analysis on Performance of Decision Tree Algorithms using Student’s Qualitative
Data,I.J.Modern Education and Computer Science, 5, 18-27, 2013
[2]. Abeer Badr El Din Ahmed, Ibrahim Sayed Elaraby,Data Mining: A prediction for Student's Performance Using Classification
Method,World Journal of Computer Application and Technology,vol2,43-47, 2014
[3]. Y.-l. Chen, T. Wang, B.-s. Wang, and Z.-j. Li, “A survey of fuzzy decision tree classifier,” Fuzzy Information and Engineering, vol.
1, pp. 149–159, June 2009.
[4]. FuzzyLogical Toolbox: zmf. Matlab Help. The MathWorks, Inc. 1984-2004.
[5]. TIEN-CHIN WANG,HSIEN-DA LEE,Constructing a Fuzzy Decision Tree by Integrating Fuzzy Sets and Entropy
[6]. https://rapidminer.com/
[7]. Nasution, H., Design Methodology of Fuzzy Logic Control, UTM, 2002. 

[8]. Han J, Rodriguze J, Beheshti M. Diabetes data analysis and prediction model discovery using rapid miner. 2nd International
Conference on Future Generation Communication and Networking, IEEE; 2008

More Related Content

What's hot

Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsIJERA Editor
 
Chapter 4 Classification
Chapter 4 ClassificationChapter 4 Classification
Chapter 4 ClassificationKhalid Elshafie
 
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & KamberChapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kambererror007
 
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...IJECEIAES
 
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...ijistjournal
 
Efficient classification of big data using vfdt (very fast decision tree)
Efficient classification of big data using vfdt (very fast decision tree)Efficient classification of big data using vfdt (very fast decision tree)
Efficient classification of big data using vfdt (very fast decision tree)eSAT Journals
 
2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic concepts2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic conceptsKrish_ver2
 
Building a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy LogicBuilding a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy LogicIJDKP
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET Journal
 
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...ijistjournal
 
A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...Editor Jacotech
 
Unsupervised Learning Techniques to Diversifying and Pruning Random Forest
Unsupervised Learning Techniques to Diversifying and Pruning Random ForestUnsupervised Learning Techniques to Diversifying and Pruning Random Forest
Unsupervised Learning Techniques to Diversifying and Pruning Random ForestMohamed Medhat Gaber
 
2.8 accuracy and ensemble methods
2.8 accuracy and ensemble methods2.8 accuracy and ensemble methods
2.8 accuracy and ensemble methodsKrish_ver2
 
An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...
An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...
An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...Happiest Minds Technologies
 
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic ConceptsData Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic ConceptsSalah Amean
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETEditor IJMTER
 

What's hot (19)

Anirban part1
Anirban part1Anirban part1
Anirban part1
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining Algorithms
 
Chapter 4 Classification
Chapter 4 ClassificationChapter 4 Classification
Chapter 4 Classification
 
Tuition Reduction Determination Using Fuzzy Tsukamoto
Tuition Reduction Determination Using Fuzzy TsukamotoTuition Reduction Determination Using Fuzzy Tsukamoto
Tuition Reduction Determination Using Fuzzy Tsukamoto
 
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & KamberChapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
 
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
 
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
 
Efficient classification of big data using vfdt (very fast decision tree)
Efficient classification of big data using vfdt (very fast decision tree)Efficient classification of big data using vfdt (very fast decision tree)
Efficient classification of big data using vfdt (very fast decision tree)
 
2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic concepts2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic concepts
 
Building a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy LogicBuilding a Classifier Employing Prism Algorithm with Fuzzy Logic
Building a Classifier Employing Prism Algorithm with Fuzzy Logic
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data Mining
 
Data Mining
Data MiningData Mining
Data Mining
 
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
 
A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...
 
Unsupervised Learning Techniques to Diversifying and Pruning Random Forest
Unsupervised Learning Techniques to Diversifying and Pruning Random ForestUnsupervised Learning Techniques to Diversifying and Pruning Random Forest
Unsupervised Learning Techniques to Diversifying and Pruning Random Forest
 
2.8 accuracy and ensemble methods
2.8 accuracy and ensemble methods2.8 accuracy and ensemble methods
2.8 accuracy and ensemble methods
 
An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...
An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...
An Approach to Mixed Dataset Clustering and Validation with ART-2 Artificial ...
 
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic ConceptsData Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
 

Viewers also liked (20)

F013123542
F013123542F013123542
F013123542
 
Q180305116119
Q180305116119Q180305116119
Q180305116119
 
K017426872
K017426872K017426872
K017426872
 
G018214246
G018214246G018214246
G018214246
 
D1103032328
D1103032328D1103032328
D1103032328
 
J010346786
J010346786J010346786
J010346786
 
L010326978
L010326978L010326978
L010326978
 
L1303046572
L1303046572L1303046572
L1303046572
 
F1302022933
F1302022933F1302022933
F1302022933
 
B1304030609
B1304030609B1304030609
B1304030609
 
Q130403104114
Q130403104114Q130403104114
Q130403104114
 
C1303020912
C1303020912C1303020912
C1303020912
 
Q01262105114
Q01262105114Q01262105114
Q01262105114
 
A1303040108
A1303040108A1303040108
A1303040108
 
B013160914
B013160914B013160914
B013160914
 
K1303037582
K1303037582K1303037582
K1303037582
 
RAPD Analysis Of Rapidly Multiplied In Vitro Plantlets of Anthurium Andreanum...
RAPD Analysis Of Rapidly Multiplied In Vitro Plantlets of Anthurium Andreanum...RAPD Analysis Of Rapidly Multiplied In Vitro Plantlets of Anthurium Andreanum...
RAPD Analysis Of Rapidly Multiplied In Vitro Plantlets of Anthurium Andreanum...
 
H010634043
H010634043H010634043
H010634043
 
D1103042431
D1103042431D1103042431
D1103042431
 
Q01765102112
Q01765102112Q01765102112
Q01765102112
 

Similar to Role of Fuzzy Logic in Predicting Student Performance

Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...ijcnes
 
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILESAN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILEScscpconf
 
An efficient feature selection in
An efficient feature selection inAn efficient feature selection in
An efficient feature selection incsandit
 
An improved teaching learning
An improved teaching learningAn improved teaching learning
An improved teaching learningcsandit
 
A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...IJERA Editor
 
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODEL
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODELADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODEL
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODELijcsit
 
Higgs Boson Machine Learning Challenge - Kaggle
Higgs Boson Machine Learning Challenge - KaggleHiggs Boson Machine Learning Challenge - Kaggle
Higgs Boson Machine Learning Challenge - KaggleSajith Edirisinghe
 
Higgs bosob machine learning challange
Higgs bosob machine learning challangeHiggs bosob machine learning challange
Higgs bosob machine learning challangeTharindu Ranasinghe
 
final report (ppt)
final report (ppt)final report (ppt)
final report (ppt)butest
 
ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...
ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...
ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...cscpconf
 
Association rule discovery for student performance prediction using metaheuri...
Association rule discovery for student performance prediction using metaheuri...Association rule discovery for student performance prediction using metaheuri...
Association rule discovery for student performance prediction using metaheuri...csandit
 
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET Journal
 
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET Journal
 
Explanations in Data Systems
Explanations in Data SystemsExplanations in Data Systems
Explanations in Data SystemsFotis Savva
 
A Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionA Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionIOSR Journals
 
Fuzzy Association Rule Mining based Model to Predict Students’ Performance
Fuzzy Association Rule Mining based Model to Predict Students’ Performance Fuzzy Association Rule Mining based Model to Predict Students’ Performance
Fuzzy Association Rule Mining based Model to Predict Students’ Performance IJECEIAES
 
Scalable decision tree based on fuzzy partitioning and an incremental approach
Scalable decision tree based on fuzzy partitioning and an  incremental approachScalable decision tree based on fuzzy partitioning and an  incremental approach
Scalable decision tree based on fuzzy partitioning and an incremental approachIJECEIAES
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Predictionijtsrd
 

Similar to Role of Fuzzy Logic in Predicting Student Performance (20)

Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
 
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILESAN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
 
An efficient feature selection in
An efficient feature selection inAn efficient feature selection in
An efficient feature selection in
 
An improved teaching learning
An improved teaching learningAn improved teaching learning
An improved teaching learning
 
A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...A Combined Approach for Feature Subset Selection and Size Reduction for High ...
A Combined Approach for Feature Subset Selection and Size Reduction for High ...
 
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATIONRESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
 
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODEL
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODELADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODEL
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODEL
 
Higgs Boson Machine Learning Challenge - Kaggle
Higgs Boson Machine Learning Challenge - KaggleHiggs Boson Machine Learning Challenge - Kaggle
Higgs Boson Machine Learning Challenge - Kaggle
 
Higgs bosob machine learning challange
Higgs bosob machine learning challangeHiggs bosob machine learning challange
Higgs bosob machine learning challange
 
final report (ppt)
final report (ppt)final report (ppt)
final report (ppt)
 
ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...
ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...
ASSOCIATION RULE DISCOVERY FOR STUDENT PERFORMANCE PREDICTION USING METAHEURI...
 
Association rule discovery for student performance prediction using metaheuri...
Association rule discovery for student performance prediction using metaheuri...Association rule discovery for student performance prediction using metaheuri...
Association rule discovery for student performance prediction using metaheuri...
 
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification Algorithms
 
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification Algorithms
 
Explanations in Data Systems
Explanations in Data SystemsExplanations in Data Systems
Explanations in Data Systems
 
A Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionA Modified KS-test for Feature Selection
A Modified KS-test for Feature Selection
 
Hx3115011506
Hx3115011506Hx3115011506
Hx3115011506
 
Fuzzy Association Rule Mining based Model to Predict Students’ Performance
Fuzzy Association Rule Mining based Model to Predict Students’ Performance Fuzzy Association Rule Mining based Model to Predict Students’ Performance
Fuzzy Association Rule Mining based Model to Predict Students’ Performance
 
Scalable decision tree based on fuzzy partitioning and an incremental approach
Scalable decision tree based on fuzzy partitioning and an  incremental approachScalable decision tree based on fuzzy partitioning and an  incremental approach
Scalable decision tree based on fuzzy partitioning and an incremental approach
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Role of Fuzzy Logic in Predicting Student Performance

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 2, Ver. II (Mar-Apr. 2016), PP 37-41 www.iosrjournals.org DOI: 10.9790/0661-1802023741 www.iosrjournals.org 37 | Page Role of Fuzzy Set in Students’ Performance Prediction Mrs. Jyoti Upadhyay1, Dr. Pratima Gautam2 1 Ph.D. (Research Scholar) AISECT University, Bhopal, M.P. 2 Dean (IT Dept.) AISECT University, Bhopal, M.P Abstract: We can use educational data mining to predict student’ performance on the basis of different attribute. In this paper, the classification task is used to predict the result of students. Decision tree (DT) learning is one of the most famous practical methods for analytical study. We use DT as prediction method in this paper. In this paper we propose a model using fuzzy set to predict more accurate result.Fuzzy logic brings in an improvement of analysis aspects due to the elasticity of fuzzy sets formalism. Therefore, we proposed a decision tree on fuzzy set data, which combines ID3 with fuzzy theory.The results are compared to some other popular classification algorithms. Key words: Decision tree, Fuzzy set, Data mining I. Introduction 1.1 Decision tree: In data mining Decision trees (DT) are among the most popular prediction technique. Although DT’s are better known in their role as classifiers, they also have prominent applications in regression, clustering and feature selection.A decision tree presents possible outcomes of a decision through graph. We use specialized software to draw graph, which is rapid miner. DT is useful for focusing discussion when a group must make a decision. Ross Quinlan developed ID3 (Iterative Dichotomiser 3) in 1986. ID3 algorithm based on top-down search approach on the given data sets. To select the attribute that is most useful for classifying a given sets, use information gain. To find an optimal way to classify a learning set we need some function, which provides the most balanced splitting [1]. A data set contains attributes, we have to select most suitable attribute for the root node of DT. We are using Entropy and information gain. Entropy is the index used to measure degree of impurity [2]. The Entropy is calculated as follows: Entropy= −𝑝 log2 𝑝 𝑗𝑗 Splitting criteria used for splitting of nodes of the tree is Information gain. To determine the best attribute for a particular node in the tree we use the measure called Information Gain. The information gain, Gain (S, A) of an attribute A, relative to a collection of examples S, is defined as: Gain (S , A ) = Entropy (S) – sv𝑣𝜖𝑣𝑎𝑙𝑢𝑒 𝐴 /|𝑠|Entropy (sv) For the best split point we have to calculate gain for each attribute and repeat the process until we didn’t get label attribute. 1.2 fuzzy Set: The selection of the best classification algorithm for a given dataset is a very widespread problem. In this sense it requires to make several methodological choices. Among them, in this research it focuses on the decision tree algorithms from classification methods, which is used to assess the classification performance and to find the best algorithm in obtaining qualitative student data. Fuzzy set theory is also known as possibility theory, was proposed by Lotfi Zadeh in 1965. A researcher seems it as an alternative to traditional theory. Most important, fuzzy set theory allows us to deal with inexact facts. For example, if we says that a student who has 80% is eligible for any course but less % is not considerable then what happen for 79.8%? This can be thought of as an extension of traditional crisp sets, in which each element must either be in or not in a seti.e. Fuzzy set.Fuzzy sets are defined on a non-fuzzy universe of discourse, which is an ordinary sets.Fuzzy logic set can model the normal attributes as linguistic variables (such as good, poor, avg, tall, high) into the inductive generation of the tree structure. Fuzzy decision trees (FDTs) are better suitable to deal with the uncertainty, fuzziness [3].A fuzzy sets F of a universe of discourse U is characterized by a membership function μF (x) which assigns to every elementx ∈ U ,a membership degree μF (x)∈[0,1]. Anelement x∈U is said to be in a fuzzy sets F if and only if μ (x)>0 and to be a full member if and A only if μF (x) =1[4]. Membership functions can either be chosen by the user arbitrarily or based on the user’s experience.Typically, a fuzzy subset A can be represented as, A = {μA (x1 ) / x1},{μA (x2 ) / x2 },...,{μA (xn ) / xn } Where the separating symbol / is used to associate the membership value with its coordinate on the horizontal axis. Fuzzy sets and fuzzy logic allow providing a symbolic framework for knowledge clarity. Fuzzy decision trees differ from traditional crisp decision trees in following manner, they use splitting criteria based on fuzzy
  • 2. Role of fuzzy set in students’ performance prediction DOI: 10.9790/0661-1802023741 www.iosrjournals.org 38 | Page restrictions and the fuzzy sets representing the defined data. Fuzzy decision tree required to develop the following things attribute value partitioning methods, branching attribute selection method, branching test method to decide which degree data follows down branches of a node, and leaf node labeling methods to determine classes for which leaf nodes stand[5]. A data set with some condition attributes and one decision attributes can be presented in the form of knowledge representation system J = (U , C ∪D ) , whereU ={u1,u2....,us}is the set of data samples, C = {c1 , c2 ...., cn } is the set of condition attributes.D = {d } is the one-elemental set with the decision attribute or class label attribute. Suppose this class label attribute has m distinct values and defining m distinct classes , di (for i=l, ..,m). For a given subset Sj , information gain s expressed as
 I(s1j,...smj)=−∑m pijlog2 pij So information gain of attribute ci is given by Gain(ci ) = I (s1 j ,...smj ) − E(ci ) The attribute withhighest information gain is the most informative attribute of the given data set. II. Methodology For our work we are using Rapid Miner tool. Which is the most powerful, easy to use graphical user interface for the design of analytic processes[6]. In this paper, we propose a computational model based on fuzzy-rough decision trees to learn the most significant factor for student performance.Our methodology uses fuzzy rough sets to discard irrelevant features on the basis of their dependency. Our work methodology shows in fig.1.We collect data from govt. schools of Chhattisgarh for the session 2014-15.Using Rapidminer we obtained different confidence and support values. Model Evaluation shows accuracy of model Figure 1: work methodology III. Proposed work To implement above methodology we will follow some steps, which are: 3.1 Data Set: For Hypothesis formation we select Educational Environment.For an educational environment it is important to know which factor can affect result of students so that they can emphasis on particular factor and improve successful result and get less drop out ratio. For mining select relevant data shows in table 1. Attribute Values Prev_year % Result of 12th class Cast SC,ST,OBC,GEN Attendance % Attendance % in present class Location RURAL,URBAN,SEMI-URBAN Recent Result(label class) Pass,Fail We convert data set into Fuzzy Set. Figure 2:Data Set
  • 3. Role of fuzzy set in students’ performance prediction DOI: 10.9790/0661-1802023741 www.iosrjournals.org 39 | Page Fuzzy Set: Class of object can define Fuzzy set, there is no noisy margins for object [7].A fuzzy set formed by combination of linguistic variable. Figure 3: Fuzzy Data Set 3.2 Data Preparation done: The success of data mining techniques depends highly on an appropriate pre- processing of the data. Pre-processing includes data selection, data normalization and transformation. Selection Data selection is critical for the result of a data mining process. Although a relation between a certain attribute and the desired result is not obvious, the attribute has to be considered as well because some information may be hidden in the data. 3.3 Unsuitable Size: Remove inconsistent or remove noisy data and apply treatment about incomplete and erroneous data. Apply Data transformation into Modified Data (after it data transforming into a new format).We apply filter operator of Rapid miner, which takes an Example Set as input and returns a new Example Set including only those examples that satisfy the specified condition. Several predefined conditions are provided; users can select any of them. Users can also define their own conditions to filter examples. This operator may reduce the number of examples in an Example Set but it has no effect on the number of attributes. The select Attributes operator is used to select required attributes. Figure 4: graphical representation of attributes 3.4 Experiment: We collected qualitative data for experiment and 10-fold cross validation applied. After applying model we evaluate it and generate confusion matrix. IV. Result and Discussion: Data set consist 56 students.On the basis of decision tree algorithm fuzzy_attn has the highest information gain, then the decision tree generated as shown in figure:5. We extract Classification rule with the help of decision tree which shows location and cast also play an important role in students’ performance.
  • 4. Role of fuzzy set in students’ performance prediction DOI: 10.9790/0661-1802023741 www.iosrjournals.org 40 | Page Figure 5:Decision tree We use the condition attribute Fuzzy_attn, Fuzzy_prev_result, cast and location. Result as class label. Fuzzy attn = AVG: fail {fail=17, pass=0, Pass=0} Fuzzy attn = good fuzzy prev result = first: pass {fail=0, pass=7, Pass=1} fuzzy prev result = second Cast = Gen.: pass {fail=0, pass=7, Pass=0} Cast = OBC: pass {fail=0, pass=1, Pass=1} Cast = SC: pass {fail=1, pass=2, Pass=0} Cast = ST: fail {fail=1, pass=1, Pass=0} fuzzy prev result = thierd Location = Rural: pass {fail=1, pass=2, Pass=0} Location = Semi-urban: fail {fail=2, pass=0, Pass=0} Location = Urban: pass {fail=1, pass=2, Pass=0} Fuzzy attn = poor: fail {fail=9, pass=0, Pass=0} Figure 6: Predicted Result We can interpret from the rule for confidence values are that the students’’ performance will be poor if their attendance is poor. Also their location will have an impact on their result. V. Model Evaluation Rapid Miner is the most powerful, easy to use and intuitive graphical user interface for the design of analytic processes [8].Classification performance is referred as the characteristics and how successful the models formulated using DT and Fuzzy rough set decision tree learning algorithms are in accurately classifying data points from the testing dataset and/or the independent one-class dataset.The confusion matrix shows the accuracy of the DT for the given data sets. The proposed model was able to classify 82% of the input instances correctly. The results show clearly that the proposed method performs well compared to other similar methods in the literature.
  • 5. Role of fuzzy set in students’ performance prediction DOI: 10.9790/0661-1802023741 www.iosrjournals.org 41 | Page PerformanceVector: accuracy: 81.50% +/- 10.97% (mikro: 81.63%) ConfusionMatrix: True: fail pass Pass fail: 24 2 1 pass: 5 16 1 kappa: 0.636 +/- 0.207 (mikro: 0.639) ConfusionMatrix: True: fail pass Pass fail: 24 2 1 pass: 5 16 1 Performance Vector for non Fuzzy Set accuracy: 76.00% +/- 14.97% (mikro: 75.51%) ConfusionMatrix: True: fail pass Pass fail: 25 6 0 pass: 4 12 2 kappa: 0.502 +/- 0.327 (mikro: 0.501) ConfusionMatrix: True: fail pass Pass fail: 25 6 0 pass: 4 12 2 VI. Conclusion Our work is highly concerned with fuzzy set and decision tree .we propose that numeric data can be represented by fuzzy values. When we apply decision tree model on fuzzy set then it produce more accurate result as we we can see through confusion matrix. Our analysis shows that students’ performance can be affected by many factors. In our next paper we will present more attributes with weight. References [1]. T.Miranda Lakshmi
 , A.Martin
 ,others,An Analysis on Performance of Decision Tree Algorithms using Student’s Qualitative Data,I.J.Modern Education and Computer Science, 5, 18-27, 2013 [2]. Abeer Badr El Din Ahmed, Ibrahim Sayed Elaraby,Data Mining: A prediction for Student's Performance Using Classification Method,World Journal of Computer Application and Technology,vol2,43-47, 2014 [3]. Y.-l. Chen, T. Wang, B.-s. Wang, and Z.-j. Li, “A survey of fuzzy decision tree classifier,” Fuzzy Information and Engineering, vol. 1, pp. 149–159, June 2009. [4]. FuzzyLogical Toolbox: zmf. Matlab Help. The MathWorks, Inc. 1984-2004. [5]. TIEN-CHIN WANG,HSIEN-DA LEE,Constructing a Fuzzy Decision Tree by Integrating Fuzzy Sets and Entropy [6]. https://rapidminer.com/ [7]. Nasution, H., Design Methodology of Fuzzy Logic Control, UTM, 2002. 
 [8]. Han J, Rodriguze J, Beheshti M. Diabetes data analysis and prediction model discovery using rapid miner. 2nd International Conference on Future Generation Communication and Networking, IEEE; 2008