Analyzing navigation logs in
MOOC: a case study
Martín Alonso Mercado-Varela, Alicia García-Holgado,
Francisco J. García-Peñalvo, María Soledad Ramírez-Montoya
martin_mercado44@hotmail.com, aliciagh@usal.es, fgarcia@usal.es,
solramirez@itesm.mx
Index
• Introduction
• Methodology
• Data pre-processing
– Clickstream log
– Process the file
– Coursera Parser
• Analysis
• Conclusions
TEEM16 - Track 12. Educational Innovation
Introduction
• This work describes the analysis of participants’ navigation logs in a
massive learning environment hosted on the Coursera platform
• The course analyzed is “Educational Innovation with Open Resources”,
developed by the Tecnológico de Monterrey (Mexico) and offered in
September 2014
• Coursera reports that a total of 14,226 students enrolled in the
course, of whom 6,696 visited the course at some point, 5,106
committed to finishing it, and 1,197 submitted at least one exercise
• This work is part of a thesis on the development of skills to mobilize
open educational resources (OER) in a MOOC format
• The results presented in this paper are only part of a larger work
still in progress, which follows a mixed methodology in which the
quantitative results are exploratory and the qualitative results serve
to expand and deepen them
Methodology
• We have adapted the visual e-learning analytics process (VeLA)
• The VeLA model provides a framework to process the data provided by
Coursera and to prepare the information for extending the visual
component in future work
Data pre-processing:
clickstream log
• We have limited our analysis to data about participants’ navigation
behavior provided by the stream of click events they generated on
the course webpages
• Each click event is represented through a set of variables
• The clickstream log uses the JSON format
• Each click event is represented by an individual serialized JSON
array
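As a rough illustration of reading such a log, the sketch below decodes one event per line with Python's standard json module. It assumes that each line of the file holds exactly one serialized JSON value; the sample records are written as JSON objects, and the field names username, timestamp and page_url are hypothetical, since the slides do not list the actual layout.

```python
import json


def parse_clickstream(lines):
    """Yield one decoded event per non-empty line, skipping invalid JSON."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            yield json.loads(line)
        except json.JSONDecodeError:
            # A malformed line is ignored rather than aborting the whole run.
            continue


# Hypothetical sample lines; real field names may differ.
sample_log = [
    '{"username": "a1", "timestamp": 1409556000000, "page_url": "/lecture/1"}',
    '',
    'not json',
    '{"username": "b2", "timestamp": 1409556060000, "page_url": "/forum/list"}',
]

events = list(parse_clickstream(sample_log))
print(len(events), "events decoded")
```

Reading line by line keeps memory use flat, which matters when the log holds hundreds of thousands of click events.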
Data pre-processing:
process the file
• Three cycles:
• First, we analyzed the variables contained in the clickstream log in
order to delete those that did not provide relevant information for
further analysis: client, user_ip, user_agent, etc.
• Second, we dumped the entire contents of the file into a relational
MySQL database
• Third, we inferred the derived variables from the generic variables
obtained in the previous cycles
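The three cycles can be sketched in a few lines. This is only an illustration under stated assumptions: the standard-library sqlite3 module stands in for MySQL so that the example runs without a server, the table name click and the fields username, timestamp and page_url are hypothetical, and the only derived variable shown is the number of clicks per user.

```python
import sqlite3

# Cycle 1: variables named on the slide as irrelevant for the analysis.
DROPPED_FIELDS = {"client", "user_ip", "user_agent"}


def strip_fields(event):
    """Return a copy of the event without the dropped variables."""
    return {k: v for k, v in event.items() if k not in DROPPED_FIELDS}


# Hypothetical events; real records carry more variables.
events = [
    {"username": "a1", "timestamp": 1, "page_url": "/lecture/1", "user_ip": "x"},
    {"username": "a1", "timestamp": 2, "page_url": "/forum/list", "client": "web"},
    {"username": "b2", "timestamp": 3, "page_url": "/quiz/1", "user_agent": "y"},
]
cleaned = [strip_fields(e) for e in events]

# Cycle 2: dump the cleaned events into a relational table
# (SQLite here as a stand-in for MySQL).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE click (username TEXT, timestamp INTEGER, page_url TEXT)")
conn.executemany(
    "INSERT INTO click VALUES (:username, :timestamp, :page_url)", cleaned
)

# Cycle 3: infer a derived variable, here the total clicks per user.
clicks_per_user = dict(
    conn.execute("SELECT username, COUNT(*) FROM click GROUP BY username")
)
print(clicks_per_user)
```

Keeping the raw events in a relational table means further derived variables can be added later as extra queries, without re-reading the log file.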
Data pre-processing:
Coursera Parser (I)
• To facilitate this process and support dynamic queries about
different participants, we have developed a web tool,
https://github.com/aliciagh/courseraparser
• Given the anonymous identifiers of participants, the tool returns a
CSV file in which each line represents a user and each column a
derived variable
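The output layout described above (one line per user, one column per derived variable) can be illustrated as follows. This is not the code of the Coursera Parser tool; the function name, the column names and the values are hypothetical.

```python
import csv
import io


def derived_variables_csv(per_user, columns):
    """Serialize {user: {variable: value}} as CSV, one row per user."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["user"] + columns, lineterminator="\n"
    )
    writer.writeheader()
    for user in sorted(per_user):
        writer.writerow({"user": user, **per_user[user]})
    return buffer.getvalue()


# Hypothetical derived variables for two anonymous identifiers.
per_user = {
    "b2": {"clicks": 10, "connections": 3},
    "a1": {"clicks": 25, "connections": 4},
}
csv_text = derived_variables_csv(per_user, ["clicks", "connections"])
print(csv_text)
```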
Data pre-processing:
Coursera Parser (II)
Analysis:
What can we say about participation?
• In total, 14,797 connections were recorded (mean 51.74, standard
deviation 81.108) and 222,895 clicks (mean 779.35, standard deviation
454.361), which suggests large differences among participants. In
addition, connections were recorded beyond the end of the course, in
October and November.
• Description of participation:
                                  Total         Mean        Standard deviation  Min       Max
Clicks                            222,895       779.35      454.361             43        3052
Connections                       14,797        51.74       81.108              2         972
Connection time ([d:]hh:mm:ss)    360:05:32:37  1:06:26:31  1:16:17:31.9        00:27:51  18:06:17:40
Connections on different days     6,471         22.63       6.596               2         57
First and last connection:
– First connection: 86.8% of the first connections were registered
within the first three days of September
– Last connection: 80.3% of the last connections were registered in
October, even though the course ended on the last day of September;
additional connections were recorded until late November, although
at lower percentages
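The columns of the participation table (total, mean, standard deviation, minimum, maximum) are standard descriptive statistics over one value per participant. A minimal sketch with Python's statistics module follows; the input values are made up, and the use of the sample standard deviation is an assumption, since the slides do not say whether the sample or the population formula was used.

```python
import statistics


def describe(values):
    """Summarize one per-participant variable (e.g. clicks per participant)."""
    return {
        "total": sum(values),
        "mean": statistics.mean(values),
        # Sample standard deviation (n - 1 in the denominator); an assumption.
        "std": statistics.stdev(values),
        "min": min(values),
        "max": max(values),
    }


# Hypothetical click counts for eight participants.
summary = describe([2, 4, 4, 4, 5, 5, 7, 9])
print(summary)
```

A standard deviation that is large relative to the mean, as in the connections row of the table, is what signals the strong differences among participants.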
Analysis:
What can we say about participation per week?
• The largest number of interactions occurred in the first week of the
course, with a total of 62,921 clicks (Table 5). The number of
registered interactions decreases progressively throughout the
course.
• Description of participation per week:
      Clicks                               Connections     Connection time ([d:]hh:mm:ss)
Week  Total   Mean    Standard deviation   Total   Mean    Total          Mean
1     62,921  220.00  180.984              3,191   22.28   106:16:09:37   17:53:09
2     61,041  213.43  153.427              3,914   27.34   97:09:15:59    16:19:16
3     51,569  180.31  130.135              3,363   23.50   82:23:37:01    13:54:36
4     36,007  125.90  92.787               2,715   18.95   59:16:12:35    09:59:50
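A weekly breakdown like this one requires assigning each click to a course week. The sketch below shows one way to do it, assuming timestamps in milliseconds since the Unix epoch and a course start on 1 September 2014 (UTC); both are assumptions made for illustration, as the slides only state that the course ran in September 2014.

```python
from collections import Counter
from datetime import datetime, timezone

# Hypothetical course start; the exact date is not given in the slides.
COURSE_START = datetime(2014, 9, 1, tzinfo=timezone.utc)


def course_week(timestamp_ms):
    """Return the 1-based course week in which a click happened."""
    moment = datetime.fromtimestamp(timestamp_ms / 1000, tz=timezone.utc)
    return (moment - COURSE_START).days // 7 + 1


def clicks_per_week(timestamps_ms):
    """Count clicks per course week."""
    return Counter(course_week(ts) for ts in timestamps_ms)


def to_ms(year, month, day, hour=0):
    """Helper for the example: build a millisecond timestamp in UTC."""
    return int(datetime(year, month, day, hour, tzinfo=timezone.utc).timestamp() * 1000)


# Two clicks in week 1 and one click in week 2.
weekly = clicks_per_week([to_ms(2014, 9, 1, 12), to_ms(2014, 9, 7, 23), to_ms(2014, 9, 8)])
print(dict(weekly))
```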
Analysis:
Which techno-pedagogical components showed the most intensity?
• The most consulted module was the expert videos, with 91,640
clicks, followed by the contents page with 38,123 clicks. Notably,
some participants show a very low level of activity in certain
modules, for example the discussion forums (minimum of 0 clicks) or
peer evaluation (minimum of 1 click), which are socialization spaces
where activity would be expected from every participant
• Description of the variables associated with the different activity
modules (clicks):
                    Total    Mean     Standard deviation  Min  Max
Expert video        91,640   320.42   288.616             2    2405
Discussion forums   34,901   122.03   177.897             0    1393
Peer evaluation     36,692   128.29   64.655              1    361
Self-assessment     8,270    28.92    10.567              12   74
Homepage            12,754   44.59    40.322              3    442
Contents page       38,123   133.30   61.426              19   414
Search page         80       0.28     1.641               0    24
Analysis:
What are the participants' main navigation paths?
• Forums appear in three of the four identified paths, so they are
considered an important socialization space that accompanies
other activities, either as a first or as a subsequent activity
• (1) Discussion forums → Peer evaluation (weight = 1740)
• (2) Peer evaluation → Discussion forums (weight = 1114)
• (3) Self-assessment → Peer evaluation (weight = 863)
• (4) Self-assessment → Discussion forums (weight = 200)
• Navigation paths are the same for approved and unapproved
participants
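The slides do not define how the path weights were computed. One plausible reading, used here purely as an assumption, is that the weight of a path A → B is the number of times a visit to module A is immediately followed by a visit to a different module B in participants' click sequences. The module labels and sequences below are hypothetical.

```python
from collections import Counter


def transition_weights(sequences):
    """Count how often one module is directly followed by a different one."""
    weights = Counter()
    for modules in sequences:
        for source, target in zip(modules, modules[1:]):
            if source != target:  # repeated clicks inside a module are not a path
                weights[(source, target)] += 1
    return weights


# Hypothetical per-participant module sequences.
sequences = [
    ["forum", "peer", "forum"],
    ["self", "peer"],
    ["forum", "forum", "peer"],
]
weights = transition_weights(sequences)
for (source, target), weight in weights.most_common():
    print(f"{source} -> {target} (weight={weight})")
```

Under this reading the paths are directed, which matches the slide listing Discussion forums → Peer evaluation and Peer evaluation → Discussion forums as separate paths with different weights.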
Analysis:
Which techno-pedagogical components are associated with the level
of achievement?
• According to the Mann-Whitney U test, only the variables associated
with socialization were significant. The mean ranks show a tendency
that favors the approved participants, who had higher intensity in
these variables
• Description of the variables associated with the level of achievement:
                             Level of achievement  Mean rank  Rank sum   Significance
Working connections          Approved              146.68     31829.50   p=.02
                             Unapproved            124.21     8073.50
Learning connections         Approved              145.75     31336.00   p=.04
                             Unapproved            125.53     8285.00
Socialization of knowledge   Approved              151.46     32716.00   p=.00
                             Unapproved            106.23     6905.00
Post in forums               Approved              157.77     34394.50   p=.00
                             Unapproved            97.74      6646.50
Forums comments              Approved              155.34     33864.00   p=.00
                             Unapproved            105.54     7177.00
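The mean rank and rank sum columns come from ranking all participants together and then summing the ranks within each group, which is the first step of the Mann-Whitney U test. The pure-Python sketch below shows that step and the resulting U statistic for made-up data; it does not compute the p-value, and the function names are illustrative only.

```python
def rank_with_ties(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1  # mean of the positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def rank_summary(group_a, group_b):
    """Rank both groups jointly and summarize each group's ranks."""
    ranks = rank_with_ties(list(group_a) + list(group_b))
    n_a, n_b = len(group_a), len(group_b)
    sum_a, sum_b = sum(ranks[:n_a]), sum(ranks[n_a:])
    return {
        "rank_sum": (sum_a, sum_b),
        "mean_rank": (sum_a / n_a, sum_b / n_b),
        # U statistic for group A: rank sum minus its smallest possible value.
        "u_a": sum_a - n_a * (n_a + 1) / 2,
    }


# Hypothetical forum-post counts for approved and unapproved participants.
result = rank_summary([5, 7, 8, 9], [1, 2, 5])
print(result)
```

Dividing a rank sum by its mean rank gives the group size, which is a quick consistency check that can be applied to each row of the table.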
Conclusions
• Regarding participants' navigation paths, the forums were a recurrent
space in the identified paths; this implies that the forums were an
important resource for carrying out the learning activities. In
addition, participants' socialization is associated with being
approved
• Participants recorded connections until November, even though the
course had ended in September
• The lack of navigation logs for resources and spaces outside the
platform is a learning analytics problem when analyzing an online
learning environment. It is therefore necessary to explore different
ways to include those datasets
Thanks
Martín Alonso Mercado-Varela, Alicia García-Holgado,
Francisco J. García-Peñalvo, María Soledad Ramírez-Montoya
martin_mercado44@hotmail.com, aliciagh@usal.es, fgarcia@usal.es,
solramirez@itesm.mx

Analyzing navigation logs in MOOC: the Coursera case

  • 1. Analyzing navigation logs in MOOC: a case study Martín Alonso Mercado-Varela, Alicia García-Holgado, Francisco J. García-Peñalvo, María Soledad Ramírez-Montoya martin_mercado44@hotmail.com, aliciagh@usal.es, fgarcia@usal.es, solramirez@itesm.mx
  • 2. Index • Introduction • Methodology • Data pre-processing – Clickstream log – Process the file – Coursera Parser • Analysis • Conclusions TEEM16 - Track 12. Educational Innovation
  • 3. Introduction • This work describes the analysis of participants’ navigation logs in a massive learning environment hosted on the Coursera platform • The “Educational Innovation with Open Resources” course developed by the Tecnológico de Monterrey (Mexico) (September 2014) • Coursera indicates that a total of 14,226 students enrolled for the course, of which 6,696 visited the course all time, 5,106 students committed to finish, and 1,197 submitted at least one exercise • It is part of a thesis work about the development of skills to mobilize OER in MOOC format • The results presented in this paper are only part of a larger work still developing, which follows a mixed methodology, where quantitative results are exploratory and qualitative results have the function to expand and deepen TEEM16 - Track 12. Educational Innovation
  • 4. Methodology • We have adapted the visual e- learning analytics process (VeLA) • VeLA model provides a framework to process the data provided by Coursera in order to prepare the information for extending the visual component in future works TEEM16 - Track 12. Educational Innovation
  • 5. Data pre-processing: clickstream log • We have limited our analysis to data about participants’ navigation behavior provided by the stream of click events they generated on the course webpages • Each click event is represented through a set of variables • Clickstream log use the JSON format • Each click event is represented by an individual serialized JSON array TEEM16 - Track 12. Educational Innovation
  • 6. Data pre-processing: process the file • Three cycles: • First, we analyzed the variables contained in the clickstream log in order to delete those that did not provide relevant information for further analysis: client, user_ip, user_agent, etc. • Second, we dumped the entire contents of the file into a relational database in MySQL • Third, we inferred the derived variables from the generic variables kept in the previous cycles
  • 7. Data pre-processing: Coursera Parser (I) • To facilitate this process and support dynamic queries about different participants, we have developed a web tool, https://github.com/aliciagh/courseraparser • Given the anonymous identifiers of participants, the tool returns a CSV file in which each line represents a user and each column a derived variable
  • 8. Data pre-processing: Coursera Parser (II)
  • 9. Analysis: What can we say about participation? • The total number of connections over the course was 14,797 (mean 51.74, standard deviation 81.108) and the total number of clicks was 222,895 (mean 779.35, standard deviation 454.361), suggesting wide differences among participants. In addition, connections were recorded beyond the end of the course, in October and November. • Description of participation:

    | Variable | Total | Mean | Standard deviation | Min | Max |
    |---|---|---|---|---|---|
    | Clicks | 222,895 | 779.35 | 454.361 | 43 | 3,052 |
    | Connections | 14,797 | 51.74 | 81.108 | 2 | 972 |
    | Time connection (d:hh:mm:ss) | 360:05:32:37 | 1:06:26:31 | 1:16:17:31.9 | 00:27:51 | 18:06:17:40 |
    | Connection on different days | 6,471 | 22.63 | 6.596 | 2 | 57 |

    First and last connection: 86.8% of first connections were registered within the first three days of September. 80.3% of last connections were registered in October, even though the course ended on the last day of September; further connections were recorded until late November, although in lower percentages.
  • 10. Analysis: What can we say about participation per week? • The largest number of interactions occurred in the first week of the course, with a total of 62,921 clicks (Table 5). The number of registered interactions decreases progressively throughout the course. • Description of participation per week:

    | Week | Clicks (total) | Clicks (mean) | Clicks (standard deviation) | Connections (total) | Connections (mean) | Time connections (total, d:hh:mm:ss) | Time connections (mean, hh:mm:ss) |
    |---|---|---|---|---|---|---|---|
    | 1 | 62,921 | 220.00 | 180.984 | 3,191 | 22.28 | 106:16:09:37 | 17:53:09 |
    | 2 | 61,041 | 213.43 | 153.427 | 3,914 | 27.34 | 97:09:15:59 | 16:19:16 |
    | 3 | 51,569 | 180.31 | 130.135 | 3,363 | 23.50 | 82:23:37:01 | 13:54:36 |
    | 4 | 36,007 | 125.90 | 92.787 | 2,715 | 18.95 | 59:16:12:35 | 09:59:50 |
  • 11. Analysis: Which techno-pedagogical components showed the highest intensity? • The most consulted module was the expert video, with 91,640 clicks, followed by the contents page, with 38,123 clicks. Notably, some participants show a very low level of activity in certain modules, for example in discussion forums (minimum 0) or peer evaluation (minimum 1), which are socialization spaces where the most activity would be expected from any participant • Description of the variables associated with the different activity modules:

    | Module | Total | Mean | Standard deviation | Min | Max |
    |---|---|---|---|---|---|
    | Expert video | 91,640 | 320.42 | 288.616 | 2 | 2,405 |
    | Discussion forums | 34,901 | 122.03 | 177.897 | 0 | 1,393 |
    | Peer evaluation | 36,692 | 128.29 | 64.655 | 1 | 361 |
    | Self-assessment | 8,270 | 28.92 | 10.567 | 12 | 74 |
    | Homepage | 12,754 | 44.59 | 40.322 | 3 | 442 |
    | Contents page | 38,123 | 133.30 | 61.426 | 19 | 414 |
    | Search page | 80 | 0.28 | 1.641 | 0 | 24 |
  • 12. Analysis: What are the participants' main navigation paths? • Forums appear in three of the four identified paths, so they are considered an important socialization space that accompanies other activities, either as a first or as a subsequent activity • (1) Discussion forums → Peer evaluation (weight = 1740) • (2) Peer evaluation → Discussion forums (weight = 1114) • (3) Self-assessment → Peer evaluation (weight = 863) • (4) Self-assessment → Discussion forums (weight = 200) • Navigation paths are the same for approved and unapproved participants
  • 13. Analysis: Which techno-pedagogical components are associated with the level of achievement? • According to the Mann-Whitney U test, only variables associated with socialization were significant. The mean ranks show a tendency that favors the approved participants, who had higher intensity in these variables • Description of the variables associated with the level of achievement:

    | Variable | Level of achievement | Mean rank | Rank-sum | Significance |
    |---|---|---|---|---|
    | Working connections | Approved | 146.68 | 31829.50 | p=.02 |
    | Working connections | Unapproved | 124.21 | 8073.50 | |
    | Learning connections | Approved | 145.75 | 31336.00 | p=.04 |
    | Learning connections | Unapproved | 125.53 | 8285.00 | |
    | Socialization of knowledge | Approved | 151.46 | 32716.00 | p=.00 |
    | Socialization of knowledge | Unapproved | 106.23 | 6905.00 | |
    | Post in forums | Approved | 157.77 | 34394.50 | p=.00 |
    | Post in forums | Unapproved | 97.74 | 6646.50 | |
    | Forums comments | Approved | 155.34 | 33864.00 | p=.00 |
    | Forums comments | Unapproved | 105.54 | 7177.00 | |
  • 14. Conclusions • Regarding participants' navigation paths, the forums were a recurrent space in the identified paths; this implies that the forums were an important resource for carrying out the learning activities. In addition, participants' socialization is associated with the condition of being approved • Participants recorded connections until November, even though the course had ended in September • The lack of navigation logs for resources and spaces outside the platform is a learning analytics problem when an online learning environment is analyzed. Different ways of including those datasets therefore need to be explored
  • 15. Thanks Martín Alonso Mercado-Varela, Alicia García-Holgado, Francisco J. García-Peñalvo, María Soledad Ramírez-Montoya martin_mercado44@hotmail.com, aliciagh@usal.es, fgarcia@usal.es, solramirez@itesm.mx
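
One plausible reading of the weights attached to the navigation paths on slide 12 is that they count transitions between pairs of course resources. The slides do not say how the weights were computed, so the following Python sketch is only an illustration under that assumption: the function name `count_transitions`, the module labels, and the sample sequences are hypothetical and are not the authors' code or data. It assumes each participant's clicks have already been reduced to a time-ordered list of module names.

```python
from collections import Counter


def count_transitions(sequences):
    """Count directed transitions between different modules.

    `sequences` is an iterable of per-participant lists of module names,
    ordered by time. Consecutive clicks inside the same module are ignored.
    """
    weights = Counter()
    for modules in sequences:
        for source, target in zip(modules, modules[1:]):
            if source != target:
                weights[(source, target)] += 1
    return weights


# Hypothetical, hand-made sequences (not data from the course).
sample = [
    ["forum", "peer_evaluation", "forum"],
    ["self_assessment", "peer_evaluation"],
    ["forum", "forum", "peer_evaluation"],
]

weights = count_transitions(sample)
for (source, target), weight in weights.most_common():
    print(f"{source} -> {target} (weight={weight})")
```

Counting directed pairs keeps "Discussion forums → Peer evaluation" separate from the reverse path, which matches the way the slide reports the two directions with different weights.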

Editor's Notes

  1. First, we transformed the dataset in order to prepare it for analytics models, in particular statistical analysis. In parallel with the data analysis, we mapped a portion of the data to visualize navigation patterns between pairs of course resources. The results provided feedback for post-processing the dataset and repeating the process in order to discover knowledge. The following sections describe each phase after several iterations.
  2. Client was deleted because it always takes the same value, 'spark'. User_ip, user_agent and 12 were deleted because the data analysis does not consider geolocation or the devices used by the user.
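
The first and third pre-processing cycles (deleting unused variables, then deriving one row of variables per participant) might be sketched in Python as follows. This is a minimal illustration, not the Coursera Parser itself: it assumes each click event is serialized on its own line as a JSON object, and the `username` and `page_url` field names, the derived variables `clicks` and `forum_clicks`, and the sample log are all hypothetical. Only `client`, `user_ip`, `user_agent` and `12` come from the notes above. The MySQL dump of the second cycle is left out.

```python
import csv
import io
import json
from collections import defaultdict

# Variables deleted in the first cycle, as listed in the notes above.
DROPPED = {"client", "user_ip", "user_agent", "12"}


def parse_clickstream(lines):
    """Parse one JSON click event per line and delete the unused variables."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        event = json.loads(line)
        yield {key: value for key, value in event.items() if key not in DROPPED}


def derive_variables(events):
    """Aggregate events into hypothetical per-user derived variables."""
    per_user = defaultdict(lambda: {"clicks": 0, "forum_clicks": 0})
    for event in events:
        row = per_user[event["username"]]  # assumed anonymous identifier field
        row["clicks"] += 1
        if "/forum/" in event.get("page_url", ""):  # assumed URL pattern
            row["forum_clicks"] += 1
    return per_user


def to_csv(per_user):
    """Write one line per user and one column per derived variable."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["user", "clicks", "forum_clicks"])
    for user in sorted(per_user):
        row = per_user[user]
        writer.writerow([user, row["clicks"], row["forum_clicks"]])
    return buffer.getvalue()


# Hypothetical sample log (not data from the course).
sample_log = [
    '{"username": "u1", "page_url": "https://example.org/c/forum/list", "client": "spark", "user_ip": "0.0.0.0", "12": "x"}',
    '{"username": "u1", "page_url": "https://example.org/c/lecture/3", "client": "spark", "user_agent": "ua"}',
    '{"username": "u2", "page_url": "https://example.org/c/lecture/1", "client": "spark"}',
]

events = list(parse_clickstream(sample_log))
csv_text = to_csv(derive_variables(events))
print(csv_text)
```

The output has the shape described on slide 7: one CSV line per anonymous user and one column per derived variable.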