SlideShare a Scribd company logo
1 of 8
Download to read offline
Predictive models of academic success
A case study with version control systems
´Angel Manuel Guerrero Higueras
Noem´ı de Castro Garc´ıa
Vicente Matell´an
Miguel ´Angel Conde
University of Le´on
Distributed under: Creative Commons Attribution-ShareAlike 4.0 International
Aim
Goal
Predict academic success in order to identify the students at-risk
situation.
How? Monitoring their interaction with a Version Control System
(VCS).
Practical assignment → ASSOOFS.
VCS → GitHub.
Data analysis → MoEv tool.
Research questions:
1 Are there features that we can extract from the students’ interactions
with this type of systems that are related with the academic success?
2 Can we build a model that allows predicting students’ success at a
practical assignment, by monitoring their use of a VCS?
Predictive models of academic success: a case study with VCSs 2 de 8
Methodolody: MoEv tool
1 Input data gathering.
features: raw data + synthethic data
target variable (class): AP or SS
2 Selection of most significant features.
Feature importance is computed as the
Gini coefficient (G).
3 Selection of models.
parametric and non-parametric models:
Adaptive Boosting (AB), Classification
And Regression Tree (CART),
K-Nearest Neighbors (KNN), Linear
Discriminant Analysis (LDA), Logistic
Regression (LR), Multi-Layer
Perceptron (MLP), Naive Bayes (NB),
and Random Forest (RF).
4 Decision.
Accuracy, Precision, Recall, and F1
score.
Predictive models of academic success: a case study with VCSs 3 de 8
Materials
Target Binary variable with two possible values: “AP”, for those
students who will finish a practical assignment
successfully; and “SS”, for those who not.
Input data:
Anonymized student identifier (id).
Number of commit operations (commits).
Number of days where there is at least one commit operation (days).
Average number of commit operations per date (commits/day).
Number of lines of code added (additions).
Number of lines deleted (deletions).
Number of issued opened (issues).
Number of issued closed (closed).
Authorship proof.
Datasets:
1 2016–2017 ASSOO course (46 students).
21 students labelled with “AP”, and 25 students with the label “SS”.
Used to train and test the prediction model.
2 2017–2018 ASSOO course (40 students who).
21 students labelled with “AP”, and 19 students with the label “SS”.
Used to validate the prediction model.
Predictive models of academic success: a case study with VCSs 4 de 8
Results I
Features importances
Predictive models of academic success: a case study with VCSs 5 de 8
Results II
Classifier Test score Validation score
NB 0.8 0.825
RF 0.8 0.8
LDA 0.8 0.725
MLP 0.5 0.65
CART 0.4 0.6
AB 0.4 0.5
LR 0.7 0.475
KNN 0.6 0.475
SVM 0.4 0.525
Accuracy classification score.
Predictive models of academic success: a case study with VCSs 6 de 8
Results III
Confusion matrix for the NB (left), RF (center), and LDA (right) classifiers
evaluated using the test dataset.
Confusion matrix for the NB (left), RF (center), and LDA (right) classifiers
evaluated using the validation dataset.
Predictive models of academic success: a case study with VCSs 7 de 8
Conclusions and future steps
Regarding question 1
Results show that some features related with students interaction
with the VCS are discriminant.
However, including more features, such as an authorship proof,
increases models accuracy.
Regarding question 2
The MoEv tools provide a prediction model by evaluating several
classifiers.
There are works to do in order to optimize the selected model by
tuning its hyper-parameters, but results are enough to assert that we
can predict students’ results at a practical assignment with a high
percentage of success.
Predictive models of academic success: a case study with VCSs 8 de 8

More Related Content

Similar to Predictive models of academic success: a case study with version control systems

An intelligent approach to demand forecasting
An intelligent approach to demand forecastingAn intelligent approach to demand forecasting
An intelligent approach to demand forecasting
Nimai Chand Das Adhikari
 
EDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINALEDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINAL
Ellen Wagner
 
Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...
Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...
Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...
Peter Brusilovsky
 

Similar to Predictive models of academic success: a case study with version control systems (20)

Whitmer, Fernandes, Kodai CSU Chico Learner Analytics
Whitmer, Fernandes, Kodai CSU Chico Learner AnalyticsWhitmer, Fernandes, Kodai CSU Chico Learner Analytics
Whitmer, Fernandes, Kodai CSU Chico Learner Analytics
 
Open academic early alert & risk assessment ap presentation
Open academic early alert & risk assessment ap presentationOpen academic early alert & risk assessment ap presentation
Open academic early alert & risk assessment ap presentation
 
Ecoop05 qaoos eb.ppt
Ecoop05 qaoos eb.pptEcoop05 qaoos eb.ppt
Ecoop05 qaoos eb.ppt
 
MULTI-LEARNING SPECIAL SESSION / EDUCON 2018 / EMADRID TEAM
MULTI-LEARNING SPECIAL SESSION /  EDUCON 2018 / EMADRID TEAMMULTI-LEARNING SPECIAL SESSION /  EDUCON 2018 / EMADRID TEAM
MULTI-LEARNING SPECIAL SESSION / EDUCON 2018 / EMADRID TEAM
 
ReComp for genomics
ReComp for genomicsReComp for genomics
ReComp for genomics
 
Machine Learning Regression Analysis of EDX 2012-13 Data for Identifying the ...
Machine Learning Regression Analysis of EDX 2012-13 Data for Identifying the ...Machine Learning Regression Analysis of EDX 2012-13 Data for Identifying the ...
Machine Learning Regression Analysis of EDX 2012-13 Data for Identifying the ...
 
Lecture 3.pptx
Lecture 3.pptxLecture 3.pptx
Lecture 3.pptx
 
6317ijite01
6317ijite016317ijite01
6317ijite01
 
An intelligent approach to demand forecasting
An intelligent approach to demand forecastingAn intelligent approach to demand forecasting
An intelligent approach to demand forecasting
 
SEM on MIDUS Dataset
SEM on MIDUS DatasetSEM on MIDUS Dataset
SEM on MIDUS Dataset
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction System
 
A cluster-based analysis to diagnose students’ learning achievements
A cluster-based analysis to diagnose students’ learning achievementsA cluster-based analysis to diagnose students’ learning achievements
A cluster-based analysis to diagnose students’ learning achievements
 
final
finalfinal
final
 
EDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINALEDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINAL
 
Qualitative Comparitive Analysis (QCA)and Complimentary Methods 29 April 2016
Qualitative Comparitive Analysis (QCA)and Complimentary Methods 29 April 2016Qualitative Comparitive Analysis (QCA)and Complimentary Methods 29 April 2016
Qualitative Comparitive Analysis (QCA)and Complimentary Methods 29 April 2016
 
Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...
Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...
Stereotype Modeling for Problem-Solving Performance Predictions in MOOCs and ...
 
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
 
Adaptive Multilevel Clustering Model for the Prediction of Academic Risk
Adaptive Multilevel Clustering Model for the Prediction of Academic RiskAdaptive Multilevel Clustering Model for the Prediction of Academic Risk
Adaptive Multilevel Clustering Model for the Prediction of Academic Risk
 
Learninig Analytics Special Track: A cluster-based analisys to diagnose stude...
Learninig Analytics Special Track: A cluster-based analisys to diagnose stude...Learninig Analytics Special Track: A cluster-based analisys to diagnose stude...
Learninig Analytics Special Track: A cluster-based analisys to diagnose stude...
 
Survey on Techniques for Predictive Analysis of Student Grades and Career
Survey on Techniques for Predictive Analysis of Student Grades and CareerSurvey on Techniques for Predictive Analysis of Student Grades and Career
Survey on Techniques for Predictive Analysis of Student Grades and Career
 

More from Technological Ecosystems for Enhancing Multiculturality

More from Technological Ecosystems for Enhancing Multiculturality (20)

A Preliminary Study of Proof of Concept Practices and their connection with I...
A Preliminary Study of Proof of Concept Practices and their connection with I...A Preliminary Study of Proof of Concept Practices and their connection with I...
A Preliminary Study of Proof of Concept Practices and their connection with I...
 
Social networks as a promotional space for Spanish radio content. The case st...
Social networks as a promotional space for Spanish radio content. The case st...Social networks as a promotional space for Spanish radio content. The case st...
Social networks as a promotional space for Spanish radio content. The case st...
 
Towards the study of sentiment in the public opinion of science in Spanish
Towards the study of sentiment in the public opinion of science in SpanishTowards the study of sentiment in the public opinion of science in Spanish
Towards the study of sentiment in the public opinion of science in Spanish
 
A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...
A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...
A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...
 
Specifics of multimedia texts in the context of social networks media aesthetics
Specifics of multimedia texts in the context of social networks media aestheticsSpecifics of multimedia texts in the context of social networks media aesthetics
Specifics of multimedia texts in the context of social networks media aesthetics
 
Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...
Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...
Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...
 
Direct online political communication effects on civil participation in spain...
Direct online political communication effects on civil participation in spain...Direct online political communication effects on civil participation in spain...
Direct online political communication effects on civil participation in spain...
 
University Media in Ecuador: Types, Functions and Self-determination
University Media in Ecuador: Types, Functions and Self-determinationUniversity Media in Ecuador: Types, Functions and Self-determination
University Media in Ecuador: Types, Functions and Self-determination
 
Like it or die: using social networks to improve collaborative learning in hi...
Like it or die: using social networks to improve collaborative learning in hi...Like it or die: using social networks to improve collaborative learning in hi...
Like it or die: using social networks to improve collaborative learning in hi...
 
Framing theory in studies of environmental information in press
Framing theory in studies of environmental information in pressFraming theory in studies of environmental information in press
Framing theory in studies of environmental information in press
 
Domain engineering for generating dashboards to analyze employment and employ...
Domain engineering for generating dashboards to analyze employment and employ...Domain engineering for generating dashboards to analyze employment and employ...
Domain engineering for generating dashboards to analyze employment and employ...
 
Mapping the systematic literature studies about software ecosystems
Mapping the systematic literature studies about software ecosystemsMapping the systematic literature studies about software ecosystems
Mapping the systematic literature studies about software ecosystems
 
Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...
Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...
Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...
 
A Multivocal Literature Review on the use of DevOps for e-learning systems
A Multivocal Literature Review on the use of DevOps for e-learning systemsA Multivocal Literature Review on the use of DevOps for e-learning systems
A Multivocal Literature Review on the use of DevOps for e-learning systems
 
Document Annotation Tools: Annotation Classification Mechanisms
Document Annotation Tools: Annotation Classification MechanismsDocument Annotation Tools: Annotation Classification Mechanisms
Document Annotation Tools: Annotation Classification Mechanisms
 
Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...
 
Managing Uncertainty in the Humanities: Digital and Analogue Approaches
Managing Uncertainty in the Humanities: Digital and Analogue ApproachesManaging Uncertainty in the Humanities: Digital and Analogue Approaches
Managing Uncertainty in the Humanities: Digital and Analogue Approaches
 
Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...
Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...
Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...
 
Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...
Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...
Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...
 
Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...
Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...
Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...
 

Recently uploaded

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Recently uploaded (20)

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Predictive models of academic success: a case study with version control systems

  • 1. Predictive models of academic success A case study with version control systems ´Angel Manuel Guerrero Higueras Noem´ı de Castro Garc´ıa Vicente Matell´an Miguel ´Angel Conde University of Le´on Distributed under: Creative Commons Attribution-ShareAlike 4.0 International
  • 2. Aim Goal Predict academic success in order to identify the students at-risk situation. How? Monitoring their interaction with a Version Control System (VCS). Practical assignment → ASSOOFS. VCS → GitHub. Data analysis → MoEv tool. Research questions: 1 Are there features that we can extract from the students’ interactions with this type of systems that are related with the academic success? 2 Can we build a model that allows predicting students’ success at a practical assignment, by monitoring their use of a VCS? Predictive models of academic success: a case study with VCSs 2 de 8
  • 3. Methodolody: MoEv tool 1 Input data gathering. features: raw data + synthethic data target variable (class): AP or SS 2 Selection of most significant features. Feature importance is computed as the Gini coefficient (G). 3 Selection of models. parametric and non-parametric models: Adaptive Boosting (AB), Classification And Regression Tree (CART), K-Nearest Neighbors (KNN), Linear Discriminant Analysis (LDA), Logistic Regression (LR), Multi-Layer Perceptron (MLP), Naive Bayes (NB), and Random Forest (RF). 4 Decision. Accuracy, Precision, Recall, and F1 score. Predictive models of academic success: a case study with VCSs 3 de 8
  • 4. Materials Target Binary variable with two possible values: “AP”, for those students who will finish a practical assignment successfully; and “SS”, for those who not. Input data: Anonymized student identifier (id). Number of commit operations (commits). Number of days where there is at least one commit operation (days). Average number of commit operations per date (commits/day). Number of lines of code added (additions). Number of lines deleted (deletions). Number of issued opened (issues). Number of issued closed (closed). Authorship proof. Datasets: 1 2016–2017 ASSOO course (46 students). 21 students labelled with “AP”, and 25 students with the label “SS”. Used to train and test the prediction model. 2 2017–2018 ASSOO course (40 students who). 21 students labelled with “AP”, and 19 students with the label “SS”. Used to validate the prediction model. Predictive models of academic success: a case study with VCSs 4 de 8
  • 5. Results I Features importances Predictive models of academic success: a case study with VCSs 5 de 8
  • 6. Results II Classifier Test score Validation score NB 0.8 0.825 RF 0.8 0.8 LDA 0.8 0.725 MLP 0.5 0.65 CART 0.4 0.6 AB 0.4 0.5 LR 0.7 0.475 KNN 0.6 0.475 SVM 0.4 0.525 Accuracy classification score. Predictive models of academic success: a case study with VCSs 6 de 8
  • 7. Results III Confusion matrix for the NB (left), RF (center), and LDA (right) classifiers evaluated using the test dataset. Confusion matrix for the NB (left), RF (center), and LDA (right) classifiers evaluated using the validation dataset. Predictive models of academic success: a case study with VCSs 7 de 8
  • 8. Conclusions and future steps Regarding question 1 Results show that some features related with students interaction with the VCS are discriminant. However, including more features, such as an authorship proof, increases models accuracy. Regarding question 2 The MoEv tools provide a prediction model by evaluating several classifiers. There are works to do in order to optimize the selected model by tuning its hyper-parameters, but results are enough to assert that we can predict students’ results at a practical assignment with a high percentage of success. Predictive models of academic success: a case study with VCSs 8 de 8