Students' Academic Performance
Knowledge Discovery from Data
Introduction
• The aim of our project is to analyse students' academic performance and to find out whether their marks follow any general pattern.
• To do so, we analyse both the internal and the external marks of each student.
• We followed the steps of the KDD (Knowledge Discovery from Data) process to prepare and mine our data.
Learning the application domain
• Learning the application domain is the first step in the KDD process.
• We need a clear understanding of the application domain and of our objectives.
• The data considered for mining comes from the MCA batch of Rajagiri College of Social Sciences.
• We collected the academic records of previous years from the Department of Computer Science.
Create a target data set: data selection
• We selected the marks of the 2007-2010 batch for pattern analysis.
• There were around 45 records (45 students).
• Both the internal and the external marks of each student were selected in order to find the performance pattern.
Internal & External Dataset
Data cleaning & preprocessing
• Data cleaning is the step in which noise and irrelevant data are removed from a large data set.
• This is a very important preprocessing step because the outcome depends on the quality of the selected data.
• The tasks are: remove duplicate records, enter logically correct values for missing records (absent students), remove unnecessary data fields, and standardize the data format.
• There was not much duplicate or unnecessary data in the collected records; the dataset was already partially clean.
• Each student's internal marks and external marks were stored in different records.
• By applying data integration, these records were combined into one record.
• The new dataset holds the internal and external mark details of each student in a single record.
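The integration and cleaning steps above might look like the following minimal sketch. The field names (`student_id`, `internal`, `external`), the sample marks, and the choice of 0 as the value entered for an absent student are all assumptions made for illustration; the slides do not state them.

```python
# Hypothetical records: field names and marks are invented for illustration.
internal_records = [
    {"student_id": "S01", "internal": 38},
    {"student_id": "S02", "internal": None},  # absent for the internal exam
    {"student_id": "S02", "internal": None},  # duplicate row
    {"student_id": "S03", "internal": 45},
]
external_records = [
    {"student_id": "S01", "external": 61},
    {"student_id": "S02", "external": 70},
    {"student_id": "S03", "external": 74},
]


def integrate(internal_rows, external_rows, absent_value=0):
    """Merge internal and external rows into one record per student.

    Keying on student_id collapses duplicate rows; missing marks are
    replaced by absent_value (0 here, which is an assumption).
    """
    merged = {}
    for row in internal_rows:
        merged.setdefault(row["student_id"], {})["internal"] = row["internal"]
    for row in external_rows:
        merged.setdefault(row["student_id"], {})["external"] = row["external"]

    result = []
    for student_id in sorted(merged):
        marks = merged[student_id]
        internal = marks.get("internal")
        external = marks.get("external")
        result.append({
            "student_id": student_id,
            "internal": absent_value if internal is None else internal,
            "external": absent_value if external is None else external,
        })
    return result


students = integrate(internal_records, external_records)
```

Each entry of `students` then carries both marks for one student, which is the single-record layout the later steps work on.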
Data reduction & transformation
• The data is transformed into an appropriate form to make it ready for the data mining step.
• The dataset contains the marks of 5 theory papers and 2 lab papers for each of the 5 semesters.
• These marks are reduced to the sum of internal marks and the sum of external marks of each student, which makes the pattern easier to analyse.
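As a rough illustration of this reduction, the sketch below sums one student's marks for one semester. The data layout (a list of `(internal, external)` pairs, one per paper) and the marks themselves are hypothetical.

```python
def semester_totals(papers):
    """Return (sum of internal marks, sum of external marks) for one semester.

    papers is a list of (internal, external) pairs, one pair per paper.
    """
    internal_total = sum(internal for internal, _ in papers)
    external_total = sum(external for _, external in papers)
    return internal_total, external_total


# Hypothetical marks for one student: 5 theory papers followed by 2 lab papers.
semester_1 = [(18, 52), (20, 60), (15, 41), (17, 48), (19, 55), (40, 45), (38, 47)]
totals = semester_totals(semester_1)
print(totals)  # (167, 348)
```

Seven papers per semester are reduced to a single two-value point per student, which is the form a clustering step can use directly.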
Cluster Analysis
• The data mining technique we used is clustering.
• A cluster is a collection of data objects that are similar to one another within the same cluster and dissimilar to objects in other clusters.
• We first partitioned the data into groups based on similarity and then assigned labels to the groups.
Choosing functions of data mining
K-MEANS Partitioning
• The K-means algorithm takes an input parameter k and partitions a set of n objects into k clusters.
• Here we set the number of clusters to 4.
• Each object is assigned to the cluster whose center is nearest to it.
• We computed the clusters separately for each semester and labeled them Excellent, Good, Fair and Poor.
Choosing mining algorithms
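The partitioning described above can be sketched in plain Python. This is not the implementation inside the Orange tool used in the project; it is a minimal version of the standard K-means (Lloyd's) iteration. The sample points, the deterministic choice of starting centers, and the rule of ranking clusters by the sum of their centroid coordinates to assign the four labels are assumptions made for illustration.

```python
import math


def kmeans(points, k, max_iter=100):
    """Lloyd's algorithm on 2-D points; returns (centroids, assignments)."""
    # Deterministic start (an assumption, chosen for reproducibility):
    # take k points spread evenly through the data sorted by total marks.
    ordered = sorted(points, key=sum)
    step = (len(ordered) - 1) / (k - 1)
    centroids = [ordered[round(i * step)] for i in range(k)]

    assignments = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest cluster center.
        new_assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the result is stable
        assignments = new_assignments
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old center if a cluster becomes empty
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return centroids, assignments


def label_clusters(centroids, names=("Poor", "Fair", "Good", "Excellent")):
    """Map cluster index to a label, ranking clusters by centroid total."""
    order = sorted(range(len(centroids)), key=lambda c: sum(centroids[c]))
    return {cluster: names[rank] for rank, cluster in enumerate(order)}


# Hypothetical (internal total, external total) pairs for one semester.
marks = [(60, 150), (65, 160), (100, 230), (105, 240),
         (140, 310), (145, 320), (180, 400), (185, 410)]
centroids, assignments = kmeans(marks, k=4)
labels = label_clusters(centroids)
for point, cluster in zip(marks, assignments):
    print(point, labels[cluster])
```

Running the same function once per semester would mirror the per-semester clustering described on the slide.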
The tool used for pattern evaluation is Orange.
Orange Cluster Analysis
Number of clusters selected: 4
Semester 1 to Semester 5 (cluster plots; clusters labeled Poor, Fair, Good and Excellent)
Centroid Analysis
Semester 1 to Semester 5 (centroid plots)
Combined Centroid Analysis
Data mining: search for patterns of interest
• From the mining process we found that “All 5 semester clusters followed the same pattern of performance”.
• A student with high internal marks tends to have high external marks, and a student with low internal marks tends to have low external marks.
• There is a direct relation between the internal and the external marks.
• In some cases this pattern does not hold, for example:
  • being absent for an internal exam and scoring high marks in the external exam (or vice versa).
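One way to put a number on the "direct relation" is the Pearson correlation coefficient. The slides do not report such a figure, so the sketch below is only an illustration on hypothetical marks. It also shows how a single exceptional case of the kind mentioned above (absent for the internal exam, high external marks) weakens the measured relation.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical internal and external totals for eight students.
internal = [60, 65, 100, 105, 140, 145, 180, 185]
external = [150, 160, 230, 240, 310, 320, 400, 410]
r = pearson(internal, external)

# The same data plus one student who missed the internal exam (0 marks)
# but scored high in the external exam.
r_with_exception = pearson(internal + [0], external + [400])

print(round(r, 3), round(r_with_exception, 3))
```

A value of `r` close to 1 corresponds to the pattern the clusters show; the drop in `r_with_exception` shows why such cases have to be treated separately.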
CONCLUSION
• A student's performance in the university exam can be predicted with the help of their internal marks, because there is a direct relation between the internal and the external marks.
• A student with low internal marks is likely to get low external marks too.
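The project itself stops at clustering, but if one wanted to turn the conclusion into an actual prediction, a simple option is a least-squares line fitted to past marks. This is a swapped-in technique, not something the slides describe, and the marks below are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical past marks that happen to lie exactly on y = 2x + 5.
internal = [10, 20, 30, 40]
external = [25, 45, 65, 85]
slope, intercept = fit_line(internal, external)

# Predicted external marks for a student with 35 internal marks.
predicted = slope * 35 + intercept
print(predicted)
```

Real marks would scatter around the line, so a prediction of this kind is only as reliable as the relation is strong.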
Use of discovered knowledge
representation
Thank You
