SlideShare a Scribd company logo
brooksch@umich.edu @cab938
Shape of Educational Data:
Predictive Modeling as an Enabler of
Personalized Learning
Christopher Brooks
Research Assistant Professor, School of Information
Director of Learning Analytics and Research
Digital Education and Innovation
University of Michigan
brooksch@umich.edu @cab938
Psychohistory
“…[it] combined history, psychology and
mathematical statistics to create a (nearly)
exact science of the behavior of very large
populations of people…Asimov used the analogy
of a gas: in a gas, the motion of a single
molecule is very difficult to predict, but the mass
action of the gas can be predicted to a high level
of accuracy. Asimov applied this concept to the
population of the fictional Galactic Empire, which
numbered in the quadrillions.”
http://asimov.wikia.com/wiki/Psychohistory
brooksch@umich.edu @cab938
• "Averaginarianism"
• Regression towards a mean that
doesn't actually naturally exist
• There is a gulf between the
predictive modeling perspectives,
and the explanatory modeling
ones
brooksch@umich.edu @cab938
Research Perspective
• Learners are individuals
• There is nuance in data that is
important and being missed by
studying populations vs.
individuals
• Computational modelling (esp.
predictive modelling) has
opportunity to help
brooksch@umich.edu @cab938
Traditional Higher
Education
Low Stakes
Lifelong Learning
brooksch@umich.edu @cab938
Lecture Capture
• How do students integrate educational technologies into their study habits?
– (and do those technologies have any effect?)
• A need for insight
– Studies largely show only student satisfaction benefits from lecture capture
– Several studies show no effect to the use of lecture capture on performance
• Data mining for usage patterns
– Apply unsupervised machine learning methods (k-means clustering) to viewership data by
week
– Then built general model from prototypes and apply to new datasets and determine fit
(replication)
brooksch@umich.edu @cab938
Results (Chemistry 2xx 2010)
• 5 groups found, each pedagogically labelled (by investigators!)
• Error and size of groups ranges considerably
• The final exam period is not indicative of activity throughout semester
brooksch@umich.edu @cab938
The general model
• How well does the model generalize?
brooksch@umich.edu @cab938
(Grades Pairwise Tukey HSD, * p<0.1 ** p<0.05)
(Grades)
brooksch@umich.edu @cab938
Results
• Not a predictive model,
but a more discriminate
descriptive model
– Showed an effect not for general use of lecture
capture, but for specific ways of using lecture capture
• Replication suggests there is merit to the model, but
that it is highly contextualized (theme of course)
• Data from more sources could add further detail to the
model as to causal effects
Brooks, C. A., Erickson, G., Greer, J. E.,
Gutwin, C. (2014) Modelling and Quantifying
the Behaviours of Students in Lecture Capture
Environments. In Computers & Education. Vol
75 June. pages 282-292.
brooksch@umich.edu @cab938
Bonus Calculus Slide
brooksch@umich.edu @cab938
Massive Open Online Courses
• As of the end of 2014, MOOCs at Michigan have attracted
1.9 million enrollees and nearly 1 million participants
• Of these participants, ~ 300K attempt some assessment
task, ~80K end up passing the course (certificate)
• Can we do better in understanding student success in this
environment?
• Could we predict who is at-risk for students who want to
obtain a certificate?
brooksch@umich.edu @cab938
• MOOCs lack the diversity of data we have about residential students
– Previous achievement (SAT/ACT, last years course)
– Socioeconomic status (distance from university, first in family,
wealth)
– Gender
– Ethnicity
– Motivation
• Building predictive models of student achievement in learning
analytics is largely done on these entry-level features
• Both frustrating and refreshing
– Want accurate models, but want actionable data
brooksch@umich.edu @cab938
• Built a novel feature selection algorithm inspired by
work in the text-mining community
• It looks at the pattern of engagement that a student
has with course resources
• Build of historical data (last years course) to create
day-by-day multilevel models (C4.5)
• Initial work is based on student certificate
achievement (pass/fail)
–(not the only valuable outcome variable to try and
predict!)
brooksch@umich.edu @cab938
Resour
ce
Day of Course
1 2 3 4 5 6 7 8 9
Video
Daily Accesses
Day 1: Yes
Day 2: No
Day 3: Yes
Day 4: No
Day 5: No
Day 6: No
Day 7: No
Day 8: Yes
Day 9: No
3-Day counts
Day 1-3: Yes
Day 4-6: No
Day 7-9: Yes
Weekly counts
Week 1: Yes
Week 2: Yes
Monthly counts
Month 1: Yes
For a 104 day long course,
with three resources
(videos, forums, quizzes)
this gives us 408 features
for the modelling activity.
brooksch@umich.edu @cab938
Text Mining Inspiration
• Text mining often uses n-grams as features in a document
– A bigram (cat, good) is the number of pairs of these two words in a document, a
trigam (cat, was not good), etc.
– We build engagement n-grams up to 5 gram
Daily Accesses
Day 1: Yes
Day 2: No
Day 3: Yes
Day 4: No
Day 5: No
Day 6: No
Day 7: No
Day 8: Yes
Day 9: No
Possible bigrams
[yes, yes]: 0
[no, no]: 3
[yes, no]: 3
[no, yes]: 2
Possible trigrams:
[yes, yes, yes]: 0
[yes, yes, no]: 0
[yes, no, yes]: 1
…
For a 104 day long course,
with three resources
(videos, forums, quizzes)
this gives us 717 more
features for the modelling
activity.
brooksch@umich.edu @cab938
In a nutshell
• We do not have diverse set of data, but
we do have a detailed set of data
• And there is a lot of it (200 million+
clickstream events)
• By pulling out patterns of resource
access, we can use supervised machine
learning (C4.5) techniques to build
predictive models
• But what if we did have entry data
from students?
– Gender & Ethnicity, certification
status, country of origin, etc.
brooksch@umich.edu @cab938
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 65 67 69 71 73 75
Fliess'κ
Day of Course Offering
Fliess' κ versus Time in Days
Activity Features Only
Demographics Features Only
Activity and Demographics Features
brooksch@umich.edu @cab938
Results
• It is possible to create predictive models on clickstream data for MOOCs
• 3 weeks into the MOOC seems to be an interesting point for some courses
• It is computationally intensive to create these models (daily!)
• MOOC entry/demographics information does not seem to add value
C. Brooks, C. Thompson, S. Teasley. (2015) A Time Series Interaction Analysis Method for Building Predictive
Models of Learners using Log Data. 5th International Conference on Learning Analytics and Knowledge 2015
(LAK'15)
C. Brooks, C. Thompson, S. Teasley. (2015) Who You Are or What You Do: Comparing the Predictive Power of
Demographics vs. Activity Patterns in Massive Open Online Courses (MOOCs). The second annual conference on
Learning At Scale 2015 (L@S2015), Works in Progress track. Vancouver BC, March 14-15, 2015. Vancouver, BC.
brooksch@umich.edu @cab938
brooksch@umich.edu @cab938
No Particular Night or Morning
“I looked at the page with my name under the
title…it was some other man…the story was
familiar – I knew I had written it – but that name
on the paper still was not me. It was a symbol, a
name.”
“I’ve always figured it that you die each day, and
each day is a box…but you never go back and lift
the lids...each is a different you, somebody you
do not know or understand or want to
understand.
”
brooksch@umich.edu @cab938
Questions? Comments?
Christopher Brooks
Research Assistant Professor, School of Information
Director of Learning Analytics and Research
in Digital Education and Innovation
University of Michigan
brooksch@umich.edu
@cab938

More Related Content

Viewers also liked

Genetics and inheritance
Genetics and inheritanceGenetics and inheritance
Genetics and inheritancejjcorrea121
 
Section b theory overview
Section b   theory overviewSection b   theory overview
Section b theory overview
Nick Crafts
 
windows vista
windows vistawindows vista
windows vistaannemarie
 
Potato Pals Introduction
Potato Pals IntroductionPotato Pals Introduction
Potato Pals Introduction
patjack67
 
Sistema nervoso 2011
Sistema nervoso 2011Sistema nervoso 2011
Sistema nervoso 2011BriefCase
 
Leishmaniose Visceral
Leishmaniose VisceralLeishmaniose Visceral
Leishmaniose Visceral
Mari Sousa
 
Neighbouring group participation
Neighbouring group participationNeighbouring group participation
Neighbouring group participation
JOYNA123
 
Torch
TorchTorch

Viewers also liked (9)

Genetics and inheritance
Genetics and inheritanceGenetics and inheritance
Genetics and inheritance
 
Section b theory overview
Section b   theory overviewSection b   theory overview
Section b theory overview
 
windows vista
windows vistawindows vista
windows vista
 
Een efficiënte blok
Een efficiënte blokEen efficiënte blok
Een efficiënte blok
 
Potato Pals Introduction
Potato Pals IntroductionPotato Pals Introduction
Potato Pals Introduction
 
Sistema nervoso 2011
Sistema nervoso 2011Sistema nervoso 2011
Sistema nervoso 2011
 
Leishmaniose Visceral
Leishmaniose VisceralLeishmaniose Visceral
Leishmaniose Visceral
 
Neighbouring group participation
Neighbouring group participationNeighbouring group participation
Neighbouring group participation
 
Torch
TorchTorch
Torch
 

Similar to Christopher Brooks SOED 2016

If MOOCs are the answer, did we ask the right questions? Implications for the...
If MOOCs are the answer, did we ask the right questions? Implications for the...If MOOCs are the answer, did we ask the right questions? Implications for the...
If MOOCs are the answer, did we ask the right questions? Implications for the...
Marco Kalz
 
MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...
MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...
MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...
Fisnik Dalipi
 
Websci 2018
Websci 2018Websci 2018
Websci 2018
Christian Bokhove
 
[DSC Europe 22] Machine learning algorithms as tools for student success pred...
[DSC Europe 22] Machine learning algorithms as tools for student success pred...[DSC Europe 22] Machine learning algorithms as tools for student success pred...
[DSC Europe 22] Machine learning algorithms as tools for student success pred...
DataScienceConferenc1
 
connections_12_1_taylor_et_al
connections_12_1_taylor_et_alconnections_12_1_taylor_et_al
connections_12_1_taylor_et_alOliver Bond
 
Social Network Analysis: applications for education research
Social Network Analysis: applications for education researchSocial Network Analysis: applications for education research
Social Network Analysis: applications for education research
Christian Bokhove
 
Application of Significance Tests to Massive Open Online Courses (MOOCs)
Application of Significance Tests to Massive Open Online Courses (MOOCs)Application of Significance Tests to Massive Open Online Courses (MOOCs)
Application of Significance Tests to Massive Open Online Courses (MOOCs)
FutureLearn FLAN
 
MIT Concept Catalog for Deeper Learning
MIT Concept Catalog for Deeper LearningMIT Concept Catalog for Deeper Learning
MIT Concept Catalog for Deeper Learningbirdland
 
Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts 017...
Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts  017...Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts  017...
Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts 017...
Ilju Rha
 
Methodological innovation for mathematics education research
Methodological innovation for mathematics education researchMethodological innovation for mathematics education research
Methodological innovation for mathematics education research
Christian Bokhove
 
A Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on TwitterA Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on Twitter
Jonathas Magalhães
 
Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...
Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...
Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...
Maha Al-Freih
 
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
eMadrid network
 
Tcc keynote Hawaii april 2013
Tcc keynote Hawaii april 2013Tcc keynote Hawaii april 2013
Tcc keynote Hawaii april 2013
Terry Anderson
 
From Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER ProjectFrom Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER Project
Beck Pitt
 
From Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER Project From Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER Project
OER Hub
 
Learning Analytics: Realizing their Promise in the California State University
Learning Analytics:  Realizing their Promise in the California State UniversityLearning Analytics:  Realizing their Promise in the California State University
Learning Analytics: Realizing their Promise in the California State UniversityJohn Whitmer, Ed.D.
 
Multimodal Course Design and Implementation using LEML and LMS for Instructio...
Multimodal Course Design and Implementation using LEML and LMS for Instructio...Multimodal Course Design and Implementation using LEML and LMS for Instructio...
Multimodal Course Design and Implementation using LEML and LMS for Instructio...
IJMIT JOURNAL
 
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
IJMIT JOURNAL
 
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
IJMIT JOURNAL
 

Similar to Christopher Brooks SOED 2016 (20)

If MOOCs are the answer, did we ask the right questions? Implications for the...
If MOOCs are the answer, did we ask the right questions? Implications for the...If MOOCs are the answer, did we ask the right questions? Implications for the...
If MOOCs are the answer, did we ask the right questions? Implications for the...
 
MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...
MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...
MOOC Dropout Prediction Using Machine Learning Techniques: Review and Researc...
 
Websci 2018
Websci 2018Websci 2018
Websci 2018
 
[DSC Europe 22] Machine learning algorithms as tools for student success pred...
[DSC Europe 22] Machine learning algorithms as tools for student success pred...[DSC Europe 22] Machine learning algorithms as tools for student success pred...
[DSC Europe 22] Machine learning algorithms as tools for student success pred...
 
connections_12_1_taylor_et_al
connections_12_1_taylor_et_alconnections_12_1_taylor_et_al
connections_12_1_taylor_et_al
 
Social Network Analysis: applications for education research
Social Network Analysis: applications for education researchSocial Network Analysis: applications for education research
Social Network Analysis: applications for education research
 
Application of Significance Tests to Massive Open Online Courses (MOOCs)
Application of Significance Tests to Massive Open Online Courses (MOOCs)Application of Significance Tests to Massive Open Online Courses (MOOCs)
Application of Significance Tests to Massive Open Online Courses (MOOCs)
 
MIT Concept Catalog for Deeper Learning
MIT Concept Catalog for Deeper LearningMIT Concept Catalog for Deeper Learning
MIT Concept Catalog for Deeper Learning
 
Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts 017...
Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts  017...Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts  017...
Insung Jung, Ilju Rha (icome 2017 international conference) Mooc impacts 017...
 
Methodological innovation for mathematics education research
Methodological innovation for mathematics education researchMethodological innovation for mathematics education research
Methodological innovation for mathematics education research
 
A Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on TwitterA Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on Twitter
 
Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...
Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...
Increasing Learners' Retention and Persistence in MOOCs: Design-Based Researc...
 
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
VII Jornadas eMadrid "Education in exponential times". Mesa redonda eMadrid L...
 
Tcc keynote Hawaii april 2013
Tcc keynote Hawaii april 2013Tcc keynote Hawaii april 2013
Tcc keynote Hawaii april 2013
 
From Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER ProjectFrom Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER Project
 
From Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER Project From Chrysalis to Butterfly: Lifecycles of an OER Project
From Chrysalis to Butterfly: Lifecycles of an OER Project
 
Learning Analytics: Realizing their Promise in the California State University
Learning Analytics:  Realizing their Promise in the California State UniversityLearning Analytics:  Realizing their Promise in the California State University
Learning Analytics: Realizing their Promise in the California State University
 
Multimodal Course Design and Implementation using LEML and LMS for Instructio...
Multimodal Course Design and Implementation using LEML and LMS for Instructio...Multimodal Course Design and Implementation using LEML and LMS for Instructio...
Multimodal Course Design and Implementation using LEML and LMS for Instructio...
 
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
 
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
MULTIMODAL COURSE DESIGN AND IMPLEMENTATION USING LEML AND LMS FOR INSTRUCTIO...
 

More from Colleen Ganley

Dragan Gasevic SOED 2016
Dragan Gasevic SOED 2016Dragan Gasevic SOED 2016
Dragan Gasevic SOED 2016
Colleen Ganley
 
Paul Wang SOED 2016
Paul Wang SOED 2016Paul Wang SOED 2016
Paul Wang SOED 2016
Colleen Ganley
 
Matti Pauna SOED 2016
Matti Pauna SOED 2016Matti Pauna SOED 2016
Matti Pauna SOED 2016
Colleen Ganley
 
Michael Gage SOED 2016
Michael Gage SOED 2016Michael Gage SOED 2016
Michael Gage SOED 2016
Colleen Ganley
 
Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Yahya Almalki SOED 2016
Yahya Almalki SOED 2016
Colleen Ganley
 
Hart & Ganley SOED 2016
Hart & Ganley SOED 2016Hart & Ganley SOED 2016
Hart & Ganley SOED 2016
Colleen Ganley
 

More from Colleen Ganley (6)

Dragan Gasevic SOED 2016
Dragan Gasevic SOED 2016Dragan Gasevic SOED 2016
Dragan Gasevic SOED 2016
 
Paul Wang SOED 2016
Paul Wang SOED 2016Paul Wang SOED 2016
Paul Wang SOED 2016
 
Matti Pauna SOED 2016
Matti Pauna SOED 2016Matti Pauna SOED 2016
Matti Pauna SOED 2016
 
Michael Gage SOED 2016
Michael Gage SOED 2016Michael Gage SOED 2016
Michael Gage SOED 2016
 
Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Yahya Almalki SOED 2016
Yahya Almalki SOED 2016
 
Hart & Ganley SOED 2016
Hart & Ganley SOED 2016Hart & Ganley SOED 2016
Hart & Ganley SOED 2016
 

Recently uploaded

How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
Celine George
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
AzmatAli747758
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
EduSkills OECD
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 

Recently uploaded (20)

How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 

Christopher Brooks SOED 2016

  • 1. brooksch@umich.edu @cab938 Shape of Educational Data: Predictive Modeling as an Enabler of Personalized Learning Christopher Brooks Research Assistant Professor, School of Information Director of Learning Analytics and Research Digital Education and Innovation University of Michigan
  • 2. brooksch@umich.edu @cab938 Psychohistory “…[it] combined history, psychology and mathematical statistics to create a (nearly) exact science of the behavior of very large populations of people…Asimov used the analogy of a gas: in a gas, the motion of a single molecule is very difficult to predict, but the mass action of the gas can be predicted to a high level of accuracy. Asimov applied this concept to the population of the fictional Galactic Empire, which numbered in the quadrillions.” http://asimov.wikia.com/wiki/Psychohistory
  • 3. brooksch@umich.edu @cab938 • "Averaginarianism" • Regression towards a mean that doesn't actually naturally exist • There is a gulf between the predictive modeling perspectives, and the explanatory modeling ones
  • 4. brooksch@umich.edu @cab938 Research Perspective • Learners are individuals • There is nuance in data that is important and being missed by studying populations vs. individuals • Computational modelling (esp. predictive modelling) has opportunity to help
  • 6. brooksch@umich.edu @cab938 Lecture Capture • How do students integrate educational technologies into their study habits? – (and do those technologies have any effect?) • A need for insight – Studies largely show only student satisfaction benefits from lecture capture – Several studies show no effect to the use of lecture capture on performance • Data mining for usage patterns – Apply unsupervised machine learning methods (k-means clustering) to viewership data by week – Then built general model from prototypes and apply to new datasets and determine fit (replication)
  • 7. brooksch@umich.edu @cab938 Results (Chemistry 2xx 2010) • 5 groups found, each pedagogically labelled (by investigators!) • Error and size of groups ranges considerably • The final exam period is not indicative of activity throughout semester
  • 8. brooksch@umich.edu @cab938 The general model • How well does the model generalize?
  • 9. brooksch@umich.edu @cab938 (Grades Pairwise Tukey HSD, * p<0.1 ** p<0.05) (Grades)
  • 10. brooksch@umich.edu @cab938 Results • Not a predictive model, but a more discriminate descriptive model – Showed an effect not for general use of lecture capture, but for specific ways of using lecture capture • Replication suggests there is merit to the model, but that it is highly contextualized (theme of course) • Data from more sources could add further detail to the model as to causal effects Brooks, C. A., Erickson, G., Greer, J. E., Gutwin, C. (2014) Modelling and Quantifying the Behaviours of Students in Lecture Capture Environments. In Computers & Education. Vol 75 June. pages 282-292.
  • 12. brooksch@umich.edu @cab938 Massive Open Online Courses • As of the end of 2014, MOOCs at Michigan have attracted 1.9 million enrollees and nearly 1 million participants • Of these participants, ~ 300K attempt some assessment task, ~80K end up passing the course (certificate) • Can we do better in understanding student success in this environment? • Could we predict who is at-risk for students who want to obtain a certificate?
  • 13. brooksch@umich.edu @cab938 • MOOCs lack the diversity of data we have about residential students – Previous achievement (SAT/ACT, last years course) – Socioeconomic status (distance from university, first in family, wealth) – Gender – Ethnicity – Motivation • Building predictive models of student achievement in learning analytics is largely done on these entry-level features • Both frustrating and refreshing – Want accurate models, but want actionable data
  • 14. brooksch@umich.edu @cab938 • Built a novel feature selection algorithm inspired by work in the text-mining community • It looks at the pattern of engagement that a student has with course resources • Build of historical data (last years course) to create day-by-day multilevel models (C4.5) • Initial work is based on student certificate achievement (pass/fail) –(not the only valuable outcome variable to try and predict!)
  • 15. brooksch@umich.edu @cab938 Resour ce Day of Course 1 2 3 4 5 6 7 8 9 Video Daily Accesses Day 1: Yes Day 2: No Day 3: Yes Day 4: No Day 5: No Day 6: No Day 7: No Day 8: Yes Day 9: No 3-Day counts Day 1-3: Yes Day 4-6: No Day 7-9: Yes Weekly counts Week 1: Yes Week 2: Yes Monthly counts Month 1: Yes For a 104 day long course, with three resources (videos, forums, quizzes) this gives us 408 features for the modelling activity.
  • 16. brooksch@umich.edu @cab938 Text Mining Inspiration • Text mining often uses n-grams as features in a document – A bigram (cat, good) is the number of pairs of these two words in a document, a trigam (cat, was not good), etc. – We build engagement n-grams up to 5 gram Daily Accesses Day 1: Yes Day 2: No Day 3: Yes Day 4: No Day 5: No Day 6: No Day 7: No Day 8: Yes Day 9: No Possible bigrams [yes, yes]: 0 [no, no]: 3 [yes, no]: 3 [no, yes]: 2 Possible trigrams: [yes, yes, yes]: 0 [yes, yes, no]: 0 [yes, no, yes]: 1 … For a 104 day long course, with three resources (videos, forums, quizzes) this gives us 717 more features for the modelling activity.
  • 17. brooksch@umich.edu @cab938 In a nutshell • We do not have diverse set of data, but we do have a detailed set of data • And there is a lot of it (200 million+ clickstream events) • By pulling out patterns of resource access, we can use supervised machine learning (C4.5) techniques to build predictive models • But what if we did have entry data from students? – Gender & Ethnicity, certification status, country of origin, etc.
  • 18. brooksch@umich.edu @cab938 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 65 67 69 71 73 75 Fliess'κ Day of Course Offering Fliess' κ versus Time in Days Activity Features Only Demographics Features Only Activity and Demographics Features
  • 19. brooksch@umich.edu @cab938 Results • It is possible to create predictive models on clickstream data for MOOCs • 3 weeks into the MOOC seems to be an interesting point for some courses • It is computationally intensive to create these models (daily!) • MOOC entry/demographics information does not seem to add value C. Brooks, C. Thompson, S. Teasley. (2015) A Time Series Interaction Analysis Method for Building Predictive Models of Learners using Log Data. 5th International Conference on Learning Analytics and Knowledge 2015 (LAK'15) C. Brooks, C. Thompson, S. Teasley. (2015) Who You Are or What You Do: Comparing the Predictive Power of Demographics vs. Activity Patterns in Massive Open Online Courses (MOOCs). The second annual conference on Learning At Scale 2015 (L@S2015), Works in Progress track. Vancouver BC, March 14-15, 2015. Vancouver, BC.
  • 21. brooksch@umich.edu @cab938 No Particular Night or Morning “I looked at the page with my name under the title…it was some other man…the story was familiar – I knew I had written it – but that name on the paper still was not me. It was a symbol, a name.” “I’ve always figured it that you die each day, and each day is a box…but you never go back and lift the lids...each is a different you, somebody you do not know or understand or want to understand. ”
  • 22. brooksch@umich.edu @cab938 Questions? Comments? Christopher Brooks Research Assistant Professor, School of Information Director of Learning Analytics and Research in Digital Education and Innovation University of Michigan brooksch@umich.edu @cab938

Editor's Notes

  1. Generalize the model to some theoretical constructs
  2. Some statistical significance in the chemistry courses Biomolecules class has less statistical rigor, but it seems like there might be an effect Reminder: BMSC course was half the size of the chemistry 2011 course, and students can use any other study aid, these are field studies
  3. Recently, predictive models were used by a University President to pressure students to unenroll with the aim of increasing student success measures. Predictive models will allow us to target individuals, but we need to remember that models are not necessarily causal, and tend to have huge accuracy challenges. There are ethical obligations we have as the creators of models, especially as models become more complex and difficult to interpret.