MOOC Dropout Prediction Using Machine Learning
Techniques: Review and Research Challenges
Fisnik Dalipi
Linnaeus University, Sweden & University College of Southeast Norway
Ali Shariq Imran, Zenun Kastrati
Norwegian University of Science and Technology
This is a presentation of the paper: “MOOC Dropout Prediction Using Machine
Learning Techniques: Review and Research Challenges”, which was accepted and
presented at the
EDUCON’2018 – IEEE Global Engineering Education Conference
The full paper can be downloaded from the IEEE Xplore website: https://ieeexplore.ieee.org/
Introduction
Massive Open Online Courses (MOOCs) are now the most recent topic within the field of e-learning
Some researchers even predict that MOOCs will represent the fourth stage in the evolution of online education, after the third stage of LMS
MOOCs are progressively becoming an integral part of the learning process in higher education settings
It has also been shown that the MOOC approach, when complemented with a flipped classroom pedagogy, can substantially enhance the learning experience of students
But …
The biggest threat to MOOCs
Reasons behind the dropout
Student-related
o Lack of motivation
o Lack of time
o Insufficient background
MOOC-related
o Course design
o Lack of interactivity
o Hidden costs
State-of-the-art: Frequency of occurrences of ML for MOOC dropout prediction
[Bar chart, frequency axis from 0 to 10, one bar per technique: Logistic Regression, Deep Neural Network, Support Vector Machine, Hidden Markov Models, Recurrent Neural Network, Natural Language Processing Technique, Decision Tree, Survival Analysis, Bayesian Network]
Challenges of dropout prediction using ML
Lack of enough sample data
Managing big masses of unstructured data
Data variance
High data imbalance
Availability of publicly accessible datasets
Lack of a standard for creating and representing clickstream data
Student schedule related challenges
Lack of enough sample data
o Unbiased classification results require enough sample data
o Lack of negative samples, e.g. HarvardX
  o Out of 641,138 registered students, only 17,687 obtained certification
o However, the effectiveness of ML relies on the availability of an enormous amount of both positive and negative samples
o A significant difference in observable sample data
  o affects the generalization of the deep neural network model during the training step, and also
  o compromises the classification accuracy
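The HarvardX figures quoted above can be turned into a quick check of how skewed the classes are. The sketch below is only an illustration: it assumes that every registered student without a certificate counts as a dropout, which the slides do not state, and the variable names are made up for this example.

```python
# Figures quoted on the slide (HarvardX).
registered = 641_138
certified = 17_687

# Assumption for illustration: every registered student who did not obtain
# a certificate is counted as a dropout.
dropouts = registered - certified

certified_share = certified / registered
imbalance_ratio = dropouts / certified

# A trivial classifier that always predicts "dropout" is right for every
# dropout and wrong for every certified student.
majority_baseline_accuracy = dropouts / registered

print(f"certified share:           {certified_share:.2%}")
print(f"dropouts per certificate:  {imbalance_ratio:.1f}")
print(f"always-'dropout' accuracy: {majority_baseline_accuracy:.2%}")
```

Under that assumption, a model that has learned nothing still scores a very high accuracy, which is one reason plain accuracy is a weak yardstick on data like this.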
Managing big masses of unstructured data
o MOOCs comprise big masses of unstructured data; missing data occurs and is hard to validate
o A multitude of ML techniques are therefore not applicable
  o E.g. Hidden Markov Models, since they
    o rely on a finite data set, and
    o cannot be applied if observations are missing
o Researchers' reaction: replacing missing observations with mean values
  o Not a realistic choice for many of the observed features
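To see why replacing missing observations with the mean can be unrealistic, consider a yes/no feature. The following is a minimal sketch; the feature `quiz_submitted` and its values are hypothetical and not taken from any dataset in the paper.

```python
from statistics import mean

# Hypothetical per-student feature: whether a quiz was submitted
# (1 = yes, 0 = no). None marks a missing observation.
quiz_submitted = [1, 0, None, 1, None, 0, 1, 1]


def impute_mean(values):
    """Replace missing entries (None) with the mean of the observed ones."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


imputed = impute_mean(quiz_submitted)
print(imputed)
```

The filled-in entries end up as a fraction between 0 and 1, a value that no real student could have for this feature.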
Data variance and high data imbalance
o In the self-paced learning pedagogy of MOOCs, students have the freedom to decide what, when, and how to study
o This might lead to considerable data variance, which may produce less accurate and less reliable ML models
  o Particularly for SVM and Naïve Bayes, since their performance can quickly turn poor when dealing with imbalanced classes
o Overall, high data imbalance may bias the classifier towards the majority class
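One common way to counter a bias towards the majority class is to weight each class inversely to its frequency during training. The slides do not prescribe this remedy; the sketch below only illustrates the widely used heuristic weight = n_samples / (n_classes * class_count), with made-up labels and a hypothetical helper name.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical, heavily imbalanced label set.
labels = ["dropout"] * 95 + ["completed"] * 5
weights = balanced_class_weights(labels)
print(weights)
```

With these weights, the 5 minority samples carry the same total weight as the 95 majority samples, so a learner that honors sample weights no longer gains from ignoring the minority class.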
Availability of publicly accessible datasets
o MOOC platforms are often reluctant to publish their data due to confidentiality and privacy concerns
o The de-identified information that is made public is restricted to system-generated events (clickstream data) only, while most of the user-provided data is omitted
o The lack of user-provided data can hinder successfully estimating the reasons behind dropouts
Lack of a standard for creating and representing clickstream data
o The user data that MOOC platforms gather and the clickstream data they generate do not necessarily conform to a standard format, because no standard comparable to, e.g., IEEE LOM for learning objects is available
o The lack of a standard for creating and representing MOOC clickstream data implies that
  o a network trained and validated for one MOOC on a given platform might not carry over to a different MOOC on another platform, or even on the same platform if that MOOC gathers and collects data differently
o The lack of standards gives rise to another challenge: how to create compelling ML models and predictive solutions that generalize across different MOOCs?
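What a shared clickstream representation could look like can be sketched with a small common record type and one adapter per platform. Everything here is hypothetical: the `ClickEvent` fields, the two raw log layouts ("platform A" and "platform B"), and the adapter names are invented for illustration and do not describe any real platform or an existing standard.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class ClickEvent:
    """Hypothetical common record for one clickstream event."""
    student_id: str
    course_id: str
    action: str
    timestamp: datetime


def from_platform_a(raw: dict) -> ClickEvent:
    # Hypothetical platform A: ISO 8601 timestamps and an "event_type" field.
    return ClickEvent(
        student_id=raw["user"],
        course_id=raw["course"],
        action=raw["event_type"].lower(),
        timestamp=datetime.fromisoformat(raw["time"]),
    )


def from_platform_b(raw: dict) -> ClickEvent:
    # Hypothetical platform B: Unix epoch seconds and a "verb" field.
    return ClickEvent(
        student_id=str(raw["uid"]),
        course_id=raw["cid"],
        action=raw["verb"].lower(),
        timestamp=datetime.fromtimestamp(raw["ts"], tz=timezone.utc),
    )


a = from_platform_a({"user": "s1", "course": "c1",
                     "event_type": "PLAY_VIDEO",
                     "time": "2018-04-17T10:00:00+00:00"})
b = from_platform_b({"uid": 1, "cid": "c9",
                     "verb": "play_video", "ts": 1523959200})
print(a)
print(b)
```

Once both logs are mapped to the same record, features such as "videos played per week" can be computed by the same code for either course, which is the kind of portability the lack of a standard currently prevents.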
Student schedule related challenges
o Students want to construct timetables based on their preferences
o Accommodating such preferences across different timetables is a complicated problem, which can be solved using
  o artificial intelligence (AI) optimization techniques
  o These AI techniques are specialized for scheduling, so students' time constraints can be handled with them
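The slides do not name a specific optimization technique for timetabling. As one classical example, the preference problem can be framed as constraint satisfaction and solved with backtracking search. The sketch below assumes a single student, hypothetical session names and time slots, and only two constraints: each session must fall in a slot the student accepts, and no slot is used twice.

```python
def schedule(sessions, available, assignment=None):
    """Assign each session a distinct slot from the student's accepted slots.

    sessions:  list of session names, assigned in this order
    available: dict mapping session name -> list of acceptable slots
    Returns a dict session -> slot, or None if no valid timetable exists.
    """
    assignment = {} if assignment is None else assignment
    if len(assignment) == len(sessions):
        return dict(assignment)
    session = sessions[len(assignment)]  # next session without a slot
    for slot in available[session]:
        if slot not in assignment.values():
            assignment[session] = slot
            result = schedule(sessions, available, assignment)
            if result is not None:
                return result
            del assignment[session]  # undo and try the next slot
    return None


# Hypothetical preferences of one student.
sessions = ["lecture", "quiz", "forum"]
available = {
    "lecture": ["Mon 18:00", "Tue 18:00"],
    "quiz": ["Mon 18:00"],
    "forum": ["Tue 18:00", "Wed 18:00"],
}
plan = schedule(sessions, available)
print(plan)
```

The search first tries the lecture on Monday, finds that the quiz then has no free slot, backtracks, and moves the lecture to Tuesday. Real timetabling systems would add soft preferences and many more constraints, for which heuristic or metaheuristic optimizers are typically used.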
Towards useful and effective predictive solutions
o Clickstream data standards: unification/standardization of clickstream data
o Student-provided data: predictive models can benefit from data provided by students that does not reveal the identity of the user
o Feature engineering techniques: to analyze posts, feedback, and comments shared by students on the discussion blogs and social forums of the MOOC
o Improving student social engagement: encouraging social interactions and stimulating peer-evaluation activities could also be an efficient way to mitigate dropout
o Evaluating models and predictors: to transfer and evaluate models from one MOOC to another, and to perform real-time prediction on ongoing courses
o Engagement system: to allow instructors to track and monitor student activity across different modules of the course
Conclusions
This paper provides a comprehensive review of the most recent and relevant research on applying machine learning to predict, explain, and solve the problem of student dropout in MOOCs
It also identifies some of the critical challenges associated with student dropout prediction and provides recommendations and proposals to assist researchers employing various machine learning techniques in solving it in a timely and efficient manner
Additionally, this paper floats the idea of unifying clickstream data and student-provided data under a standard, similar to the various learning object metadata standards
Future work will focus on data collection for the development and analysis of deep learning algorithms aimed at solving the MOOC student dropout problem
Thank You!

MOOC Dropout Prediction Using ML

  • 1. MOOC Dropout Prediction Using Machine Learning Techniques: Review and Research Challenges Fisnik Dalipi Linnaeus University, Sweden & University College of Southeast Norway Ali Shariq Imran, Zenun Kastrati Norwegian University of Science and Technology
  • 2. This is a presentation of the paper: “MOOC Dropout Prediction Using Machine Learning Techniques: Review and Research Challenges”, which was accepted and presented at the EDUCON’2018 – IEEE Global Engineering Education Conference The full paper can be downloaded from the IEEEXplore website: https://ieeexplore.ieee.org/
  • 3. Introduction Massive Open Online Courses (MOOCs) are now the most recent topic within the field of e- learning Some researchers even predict MOOC will represent the fourth stage of online education evolution, after the third stage of LMS MOOCs are progressively becoming an integral part of a learning process in higher education settings Also proven that the MOOC approach when complemented with a flipped classroom pedagogy would also bring advantages toward enhancing the learning experience of students substantially But …
  • 5. Reasons behind the dropoutStudentrelatedMOOCrelated Lack of motivation Lack of time Insufficient background Course design Lack of interactivity Hidden costs
  • 6. State-of-the-art: Frequency of occurrences of ML for MOOC dropout prediction 0 1 2 3 4 5 6 7 8 9 10 Logistic Regression Deep Neural Network Support Vector Machine Hidden Markov Models Recurrent Neural Network Natural Language Processing Technique Decision Tree Survival Analysis Bayesian Network
  • 7. Challenges of dropout prediction using ML Lack of enough sample data Managing big masses of unstructured data Data variance High data imbalance Availability of publicly accesible dataset Lack of standard for creating and representing clickstream data Student schedule related challenges
  • 8. Lack of enough sample data Managing big masses of unstructured data Data variance High data imbalance Availability of publicly accesible dataset Lack of standard for creating and representing clickstream data Student schedule related challenges oFor unbiased classification results oLack of negative samples, i.e. HarvardX o E.g. Out of 641138 registered students only 17687 obtained certification oHowever, the effectiveness of the ML relies on the availability of enormous amount of both positive and negative samples oA significant difference in observable sample data affects othe generalization of the deep neural network model during the training step, but also ocomprises the classification accuracy. Challenges of dropout prediction using ML
  • 9. Challenges of dropout prediction using ML: Managing big masses of unstructured data. MOOCs comprise big masses of unstructured data; missing data occurs and is hard to validate. As a result, a multitude of ML techniques cannot be applied, e.g. Hidden Markov Models, since they rely on a finite data set and cannot be applied if observations are missing. Researchers' reaction: replacing missing observations with mean values. This is not a realistic choice for many of the observed features.
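The mean-value workaround mentioned above can be sketched in a few lines. This is a minimal illustration, not the procedure of any particular study: the helper `impute_mean` and the `quiz_submitted` feature are hypothetical, and missing observations are assumed to be encoded as `None`.

```python
from statistics import mean


def impute_mean(values):
    """Replace None entries with the mean of the observed entries."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute a column with no observed values")
    fill = mean(observed)
    return [fill if v is None else v for v in values]


# Hypothetical weekly feature: 1 if the student submitted the quiz, 0 if not,
# None if the observation is missing from the log.
quiz_submitted = [1, 0, None, 1, None, 0, 1, 1]

imputed = impute_mean(quiz_submitted)
print(imputed)
```

The filled-in value is a fraction between 0 and 1, which no student could actually have produced for a yes/no feature. That is one way to see why the slide calls mean replacement unrealistic for many observed features.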
  • 10. Challenges of dropout prediction using ML: Data variance and high data imbalance. In the self-paced learning pedagogy of MOOCs, students have the freedom to decide what, when, and how to study. This might lead to considerable data variance, which may produce less accurate and less reliable ML models. This holds particularly for SVM and Naïve Bayes, since their performance can quickly turn poor when dealing with imbalanced classes. Overall, high data imbalance may bias the classifier towards the majority class.
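The bias towards the majority class can be illustrated with a degenerate "classifier" that always predicts the majority label. The data below is a hypothetical, scaled-down cohort of 1000 students (28 certified, 972 dropped), chosen only to resemble a heavily imbalanced MOOC; the function names are illustrative.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def recall(y_true, y_pred, label):
    """Fraction of the true `label` cases that were predicted as `label`."""
    relevant = [(t, p) for t, p in zip(y_true, y_pred) if t == label]
    hits = sum(1 for t, p in relevant if p == label)
    return hits / len(relevant)


# Hypothetical cohort: 972 students dropped out, 28 were certified.
y_true = ["dropped"] * 972 + ["certified"] * 28

# A degenerate model that ignores its input and predicts the majority class.
y_pred = ["dropped"] * len(y_true)

acc = accuracy(y_true, y_pred)
recall_dropped = recall(y_true, y_pred, "dropped")
recall_certified = recall(y_true, y_pred, "certified")

print(f"accuracy:           {acc:.3f}")
print(f"recall (dropped):   {recall_dropped:.3f}")
print(f"recall (certified): {recall_certified:.3f}")
```

Accuracy looks high even though the minority class is never recognized, which is why accuracy alone can hide the bias the slide warns about.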
  • 11. Challenges of dropout prediction using ML: Availability of publicly accessible datasets. MOOC platforms are often reluctant to publish their data due to confidentiality and privacy concerns. The de-identified information that is made public is also restricted to system-generated events (clickstream data) only, while most of the user-provided data is omitted. The lack of user-provided data can be a hindrance to successfully estimating the reasons behind the dropouts.
  • 12. Challenges of dropout prediction using ML: Lack of a standard for creating and representing clickstream data. The user data that MOOC platforms gather and the clickstream data they generate do not necessarily conform to a standard format, because no standard is available for them comparable to IEEE LOM for learning objects. The lack of a standard for creating and representing clickstream data for MOOCs implies that a network trained and validated for one MOOC on a given platform might not be interoperable with a different MOOC on another platform, or even on the same platform if that MOOC gathers and collects data differently. The lack of standards gives rise to another challenge: how to create compelling ML models and predictive solutions that can be generalized across different MOOCs?
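As a rough illustration of the interoperability problem, the sketch below maps clickstream records from two platforms onto one shared event type. Everything in it is an assumption made for illustration: the two platform formats, all field names, and the `Event` schema are hypothetical and do not describe any real MOOC platform or existing standard.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Event:
    """Hypothetical common representation of one clickstream event."""
    student_id: str
    course_id: str
    action: str
    timestamp: datetime


def from_platform_a(raw: dict) -> Event:
    # Hypothetical platform A: ISO 8601 timestamps, lower-case event types.
    return Event(
        student_id=raw["user"],
        course_id=raw["course"],
        action=raw["event_type"],
        timestamp=datetime.fromisoformat(raw["time"]),
    )


def from_platform_b(raw: dict) -> Event:
    # Hypothetical platform B: numeric user ids, Unix epoch seconds,
    # upper-case verbs.
    return Event(
        student_id=str(raw["uid"]),
        course_id=raw["cid"],
        action=raw["verb"].lower(),
        timestamp=datetime.fromtimestamp(raw["ts"], tz=timezone.utc),
    )


a = from_platform_a({
    "user": "s-001",
    "course": "ml-101",
    "event_type": "play_video",
    "time": "2018-04-17T10:00:00+00:00",
})
b = from_platform_b({
    "uid": 42,
    "cid": "ml-101",
    "verb": "PLAY_VIDEO",
    "ts": 1523959200,
})

print(a)
print(b)
```

Without an agreed format, each pair of platforms needs its own adapter like these, and features computed on one platform's logs are not guaranteed to mean the same thing on another.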
  • 13. Challenges of dropout prediction using ML: Student schedule related challenges. Students want to construct their timetables based on their own preferences. Accommodating such preferences across different timetables is a complicated problem, which can be addressed using artificial intelligence (AI) optimization techniques. These AI techniques are specialized for scheduling, so students' time constraints can be handled with them.
  • 14. Towards useful and effective predictive solutions. Clickstream data standards: unification/standardization of clickstream data. Student-provided data: prediction models can benefit from data provided by the students that does not reveal the identity of the user. Feature engineering techniques: to analyze posts, feedback, and comments shared by the candidate on the discussion blogs and social forums of the MOOC. Improving student social engagement: encouraging social interactions and simulating peer-evaluation activities could also potentially represent an efficient way to mitigate dropout. Evaluating models and predictors: to transfer and evaluate models from one MOOC to another, and to perform real-time prediction with ongoing courses. Engagement system: to allow instructors to track and monitor student activity across different modules of the course.
  • 15. Conclusions. This paper provides a comprehensive review of the most recent and relevant research endeavors on applying machine learning to predicting, explaining, and solving the problem of student dropout in MOOCs. It also identifies some of the critical challenges associated with student dropout prediction and provides recommendations and proposals to assist researchers employing various machine learning techniques in solving it in a timely and efficient manner. Additionally, this paper floats the idea of unifying clickstream data and student-provided data as a standard, similar to the various learning object metadata standards. Future work will follow on data collection for the development and analysis of deep learning algorithms towards solving the MOOC student dropout problem.