In this study, the purpose was to compare about 10 prediction methods and find out which one is most effective at predicting university students' grades. The authors used Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE) to compare the methods. They found that a Factorization Machine (FM)-Random Forest (RF) hybrid was the most effective method overall, since it overcomes the cold-start limitations that the FM alone could not handle.
Sweeney, M., Rangwala, H., Lester, J., & Johri, A. (2016). Next-Term Student Performance Prediction: A Recommender Systems Approach. Journal of Educational Data Mining, 8(1), 22-51.
2. Next-Term Student Performance Prediction:
A Recommender Systems Approach
Sweeney, M., Rangwala, H., Lester, J., Johri, A. (2016)
3. Context of the Study
The purpose of the study presented in this paper was to apply state-of-the-art recommender systems
techniques to the task of student performance prediction.
4. Research Questions
In the present study, the authors compare more models on a larger dataset and conduct a more detailed
analysis of feature importance, prediction errors, and implications for educational applications.
6. Data Collection - Data
Students from a public university
Instructor (classification, rank, tenure status)
Student info (transfer or not)
Courses
Disciplines
Letter grades
Demographic data, such as age, race, sex,
high school CEEB code and GPA, zip code, and
1600-scale SAT scores.
7. Data Collection - Methods
Simple baselines
MF-based methods
Common regression models
8. Data Collection - Methods
Uniform Random (UR): Randomly predict grades from a uniform distribution over the range [0, 4].
Global Mean (GM): Predict grades using the mean of all previously observed grades.
Mean of Means (MoM): Predict grades using an average of the global mean, the per student mean, and
the per-course mean.
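As a sketch, the Mean of Means baseline could look like the following (function and variable names are my own for illustration, not from the paper's code):

```python
import numpy as np

def mean_of_means(train, student, course):
    """Mean of Means (MoM) baseline: average the global mean,
    the per-student mean, and the per-course mean.
    `train` is a list of (student, course, grade) tuples."""
    grades = np.array([g for _, _, g in train])
    global_mean = grades.mean()  # this alone is the Global Mean (GM) baseline
    s_grades = [g for s, _, g in train if s == student]
    c_grades = [g for _, c, g in train if c == course]
    # Fall back to the global mean when a student/course has no history.
    s_mean = np.mean(s_grades) if s_grades else global_mean
    c_mean = np.mean(c_grades) if c_grades else global_mean
    return (global_mean + s_mean + c_mean) / 3.0
```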
9. Data Collection - Methods
Three methods based on Matrix Factorization (MF):
Singular Value Decomposition (SVD)
SVD-kNN: SVD post-processed with kNN
Factorization Machine (FM)
10. Data Collection - Methods
Four different regression models:
Random Forest (RF)
Stochastic Gradient Descent (SGD) Regression
k-Nearest Neighbors (kNN)
Personalized Linear Multiple Regression (PLMR)
11. Data Collection - Methods
Evaluations are performed in terms of two common regression metrics:
Root Mean Squared Error (RMSE)
Mean Absolute Error (MAE).
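Both metrics are straightforward to compute; a minimal implementation:

```python
import numpy as np

def rmse(y_true, y_pred):
    # Root Mean Squared Error: penalizes large errors more heavily.
    return np.sqrt(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2))

def mae(y_true, y_pred):
    # Mean Absolute Error: average size of the error in grade points.
    return np.mean(np.abs(np.asarray(y_true) - np.asarray(y_pred)))
```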
12. Data Collection - Tools
No information is given on which tools were used to analyze the data.
13. Results
The FM model outperforms all the others by a wide margin, indicating that 2-way feature interactions play an
important role in predicting performance.
The FM-RF hybrid is an effective way of overcoming the cold-start limitations of FMs and is the most
effective method overall.
When the test distribution differs from the training distribution, the FM model is liable to learn
overconfident 2-way interactions that reduce its ability to generalize.
16. Implications
For students, we can incorporate this information into a degree planning system.
For educators, knowledge of which students have the lowest expected grades could provide
opportunities to increase detection of at-risk students.
For advisors, any additional information that helps them personalize their advice to each student could
potentially help thousands of students.
Is it part of a project? If yes, what is the name of the project?
Is the study descriptive, diagnostic, predictive, or prescriptive?
Cold-start dyads involve a new student, a new course, or both: they appear in the prediction phase but not in the previous terms' training phase.
The baselines capture (1) random guessing (UR), (2) the overall central tendency (GM), and (3) row and column averages (MoM).
Cold start: if neither the student nor the course has been seen before, use GM; if either one has, use MoM.
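This cold-start fallback rule (use GM when neither the student nor the course has any history, otherwise MoM) can be sketched as follows; names and signatures are illustrative, not from the paper:

```python
def predict_cold_start(train, student, course, gm, mom):
    """Fallback for cold-start dyads: `gm` and `mom` are baseline
    predictors (Global Mean and Mean of Means). `train` is a list of
    (student, course, grade) tuples."""
    seen_students = {s for s, _, _ in train}
    seen_courses = {c for _, c, _ in train}
    if student not in seen_students and course not in seen_courses:
        return gm(train)            # completely new dyad: global mean
    return mom(train, student, course)  # partial history: mean of means
```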
Each grade is simply predicted as the dot product of the latent student and course feature vectors. We call this the factorized 2-way interaction of the student with the course.
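The dot-product prediction can be illustrated with made-up latent vectors (the values below are invented, not learned from data):

```python
import numpy as np

# Each student and course is represented by a latent feature vector;
# the grade estimate is their dot product.
student_vec = np.array([0.9, 1.2, 0.3])  # illustrative latent student features
course_vec = np.array([1.0, 0.8, 2.0])   # illustrative latent course features
predicted_grade = student_vec @ course_vec  # 0.9 + 0.96 + 0.6 = 2.46
```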
Post-processing SVD with k-nearest neighbors (kNN) yields improved predictive performance.
Pure collaborative filtering (CF) methods such as SVD and SVD-kNN are unable to make predictions for cold-start records.
The FM model is able to capture all of the information captured by the simple baselines as well as the information captured by SVD. In general, it captures the global central tendency, 1-way (linear) relationships between the predictors and the grade (bias terms), and factorized 2-way interactions between each pair of predictors.
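A sketch of the standard FM prediction with all three kinds of terms (global bias, 1-way linear terms, factorized 2-way interactions), using the usual O(kn) reformulation of the pairwise sum; parameter names are illustrative:

```python
import numpy as np

def fm_predict(x, w0, w, V):
    """Factorization Machine prediction for one feature vector x.
    w0: global bias (central tendency), w: 1-way linear weights,
    V: (n, k) latent factor matrix, so <v_i, v_j> x_i x_j are the
    factorized 2-way interactions. Uses the identity
    sum_{i<j} <v_i,v_j> x_i x_j
      = 0.5 * sum_f ((V^T x)_f^2 - sum_i V_if^2 x_i^2)."""
    linear = w0 + w @ x
    vx = V.T @ x  # shape (k,)
    pairwise = 0.5 * np.sum(vx ** 2 - (V ** 2).T @ (x ** 2))
    return linear + pairwise
```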
Once built, the tree can be used for regression of new data samples.
The Random Forest then combines many of these trees in a weighted averaging approach to make decisions regarding unseen data.
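A toy Random Forest regression along these lines, using scikit-learn; the features and data here are invented for illustration and are not the paper's:

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Toy data: predict a grade from two numeric inputs
# (e.g. prior GPA and a test score, both scaled to [0, 1]).
rng = np.random.default_rng(0)
X = rng.uniform(size=(200, 2))
y = 4.0 * X[:, 0]  # in this toy data the grade depends on feature 0 only

# Each tree is fit on a bootstrap sample; predictions for unseen data
# average the individual trees' regression outputs.
model = RandomForestRegressor(n_estimators=50, random_state=0)
model.fit(X, y)
pred = model.predict([[0.5, 0.5]])  # should be near 4.0 * 0.5 = 2.0
```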
SGD is a gradient-based optimization technique that updates the model parameters incrementally, rather than on the entire training set at once (which is what normal gradient descent would do). This
reduces overfitting and significantly improves training time.
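A single SGD update on one sample, for squared error on a linear model, might be sketched as follows (a generic illustration, not the paper's implementation):

```python
import numpy as np

def sgd_step(w, x, y, lr=0.01):
    """One stochastic gradient descent update for squared error on a
    single sample: w <- w - lr * grad, with grad = 2 * (w.x - y) * x.
    Plain gradient descent would instead average the gradient over the
    whole training set before each update."""
    err = w @ x - y
    return w - lr * 2.0 * err * x
```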
The k-Nearest Neighbors (kNN) algorithm is a classic method for classification and regression that predicts a sample's value from its most similar neighbors.
RMSE is the primary metric used to compare methods.
MAE indicates how far, on average, the predicted grades deviate from the actual grades.