Slides from my thesis defense: "Learnersourcing: Improving Learning with Collective Learner Activity"
Millions of learners today are watching videos on online platforms, such as Khan Academy, YouTube, Coursera, and edX, to take courses and master new skills. But existing video interfaces are not designed to support learning: they offer limited interactivity and provide little information about learners' engagement or the video content. Addressing these limitations requires deep semantic information about video that even state-of-the-art AI techniques cannot fully extract. I take a data-driven approach to address this challenge, using large-scale learning interaction data to dynamically improve video content and interfaces. Specifically, this thesis introduces learnersourcing, a form of crowdsourcing in which learners collectively contribute novel content for future learners while engaging in a meaningful learning experience themselves. I present learnersourcing applications designed for massive open online course videos and how-to tutorial videos, where learners' collective activities 1) highlight points of confusion or importance in a video, 2) extract a solution structure from a tutorial, and 3) improve the navigation experience for future learners. This thesis demonstrates how learnersourcing can enable more interactive, collaborative, and data-driven learning.
8. Challenges in video learning at scale
• Understand learners' engagement
• Understand video content
• Support interactive learning
9. Data-Driven Approach
Use data from learner interaction to understand and improve learning:
• second-by-second process tracking
• data-driven content & UI updates
12. Learnersourcing
Crowdsourcing with learners as a crowd: learners' collective learning activities dynamically improve content & UI for future learners.
Learners are inherently motivated and naturally engaged.
13. Learners watch videos. UI provides social navigation & recommendation.
System analyzes interaction traces for hot spots:
[Learner3879, Video327, “play”, 35.6]
[Learner3879, Video327, “pause”, 47.2]
…
Video player adapts to collective learner engagement.
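Below is a minimal sketch (my illustration, not the thesis's pipeline) of how a player back end could turn raw traces like the tuples above into hot spots; the bin size and learner-count threshold are assumed values.

```python
# Raw traces shaped like the tuples on this slide:
# (learner_id, video_id, event, video_time_sec)
traces = [
    ("Learner3879", "Video327", "play", 35.6),
    ("Learner3879", "Video327", "pause", 47.2),
    ("Learner1022", "Video327", "pause", 46.1),
    ("Learner5550", "Video327", "play", 44.9),
]

def hot_spots(traces, video_id, bin_sec=5, min_learners=2):
    """Bin events by video time; flag bins where many distinct learners interact."""
    learners_per_bin = {}
    for learner, vid, _event, t in traces:
        if vid != video_id:
            continue
        b = int(t // bin_sec)
        learners_per_bin.setdefault(b, set()).add(learner)
    return [
        (b * bin_sec, len(ls))
        for b, ls in sorted(learners_per_bin.items())
        if len(ls) >= min_learners
    ]

print(hot_spots(traces, "Video327"))  # [(45, 2)]: a hot spot around 45-50 s
```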
14. Learners are prompted to summarize video sections. UI presents a video outline.
System coordinates learner tasks for a final summary:
“What’s the overall goal of the section you just watched?”
[Candidate summaries listed with votes; the winning summary is checked]
Video player coordinates learners to generate a video outline.
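A minimal sketch of the coordination step, assuming simple up/down voting over candidate summaries; the vote-log shape and the example labels are hypothetical, not from the thesis.

```python
from collections import defaultdict

# Hypothetical vote log: (section_id, candidate_summary, upvote?)
votes = [
    (1, "Set up the recurrence", True),
    (1, "Set up the recurrence", True),
    (1, "Instructor recaps admin info", False),
    (2, "Prove the base case", True),
]

def build_outline(votes):
    """Tally up/down votes per candidate; keep the best summary per section."""
    score = defaultdict(int)
    for section, label, up in votes:
        score[(section, label)] += 1 if up else -1
    outline = {}
    for (section, label), s in score.items():
        if section not in outline or s > outline[section][1]:
            outline[section] = (label, s)
    return {sec: label for sec, (label, s) in sorted(outline.items())}

print(build_outline(votes))
# {1: 'Set up the recurrence', 2: 'Prove the base case'}
```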
15. Two types of learnersourcing
• Passive: track what learners are doing
• Active: ask learners to engage in activities
17. Learnersourcing requires a multi-disciplinary approach
Crowdsourcing
– Quality control, Task design, Large-scale input mgmt
Social computing
– Incentive design, Sense of community among learners
UI design
– Data-driven & dynamic interaction techniques
Video content analysis
– Computer vision, Natural language processing
Learning science
– Pedagogically useful activity, Theoretical background
18. Thesis statement
“In large-scale video learning environments, interfaces powered by learnersourcing can enhance content navigation, create a sense of learning with others, and improve engagement and learning.”
19. I. Passive learnersourcing (MOOC videos)
– Video player clickstream analysis [L@S 2014a, L@S 2014b]
– Data-driven content navigation [UIST 2014a, UIST 2014b]
II. Active learnersourcing (how-to videos)
– Step-by-step information [CHI 2014]
– Summary of steps [CSCW 2015]
22. Classrooms: rich, natural interaction data
Image credits: armgov on Flickr (CC BY-NC-SA); Maria Fleischmann / World Bank on Flickr (CC BY-NC-ND); Love Krittaya (public domain); unknown author, from pc4all.co.kr
24. First MOOC-scale video interaction analysis
Data source: 4 edX courses (fall 2012)
Domains: computer science, statistics, chemistry
Video events: start, end, play, pause, jump
Learners: 127,839 | Videos: 862 | Mean video length: 7:46 | Processed video events: 39.3M
25. Factors affecting video engagement
Shorter videos
- significant drop after 6 mins
Informal shots over studio production
- more personal feel helps
Tablet drawing tutorials over slides
- continuous visual flow helps
How Video Production Affects Student Engagement: An Empirical Study of MOOC Videos.
Philip J. Guo, Juho Kim, Rob Rubin.
Learning at Scale 2014.
Metric: session length
26. How do learners navigate videos?
• Watch sequentially
• Pause
• Re-watch
• Skip / Skim
• Search
29. Interaction peaks
Temporal peaks in the number of interaction events, where a significant number of learners show similar interaction patterns.
[Chart: interaction event count over video time, with peaks marked “?” (confusion) and “!” (importance)]
Understanding In-Video Dropouts and Interaction Peaks in Online Lecture Videos.
Juho Kim, Philip J. Guo, Daniel T. Seaton, Piotr Mitros, Krzysztof Z. Gajos, Robert C. Miller.
Learning at Scale 2014.
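As a toy illustration of how such peaks could be detected (moving-average smoothing plus a global-mean threshold; the thesis's actual method may differ):

```python
def find_peaks(counts, window=1, factor=2.0):
    """Flag seconds where the smoothed event count spikes above the video's mean.

    counts: list of interaction-event counts, one per second of video.
    A second is a peak candidate if its moving average exceeds
    `factor` times the overall mean. Thresholds are illustrative.
    """
    n = len(counts)
    mean = sum(counts) / n
    smoothed = [
        sum(counts[max(0, i - window):i + window + 1])
        / len(counts[max(0, i - window):i + window + 1])
        for i in range(n)
    ]
    return [t for t, v in enumerate(smoothed) if v > factor * mean]

# Toy series: a burst of pauses/re-plays around t = 6-8 s
counts = [3, 2, 4, 3, 2, 5, 30, 42, 28, 4, 3, 2]
print(find_peaks(counts))  # [6, 7, 8]
```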
30. What causes an interaction peak?
Video interaction log data
Video content analysis
– Visual content (video frames)
– Verbal content (transcript)
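As a toy illustration (mine, not the thesis's method), a heuristic that attributes a peak to a visual transition or to the spoken content around it:

```python
def explain_peak(peak_t, frame_diff, transcript):
    """Attribute a peak to a visual transition or to content being discussed.

    frame_diff: per-second mean pixel difference between consecutive frames.
    transcript: list of (start_sec, end_sec, text) caption segments.
    The 2x-mean threshold is illustrative.
    """
    visual = frame_diff[peak_t] > 2 * (sum(frame_diff) / len(frame_diff))
    spoken = [txt for s, e, txt in transcript if s <= peak_t <= e]
    kind = "visual transition" if visual else "content-driven"
    return kind, spoken

frame_diff = [1, 1, 9, 1, 1, 1]  # big jump at t=2 (e.g., a slide change)
transcript = [(0, 3, "Let's define the singleton operation."),
              (3, 6, "Now an example.")]
print(explain_peak(2, frame_diff, transcript))
# ('visual transition', ["Let's define the singleton operation."])
```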
35. LectureScape: Lecture video player powered by collective watching data
Data-driven interaction techniques for improving navigation of educational videos.
Juho Kim, Philip J. Guo, Carrie J. Cai, Shang-Wen (Daniel) Li, Krzysztof Z. Gajos, Robert C. Miller.
UIST 2014.
36. “What did other learners find confusing or important?”
“I want a quick overview of this clip.”
“I want to see that previous slide.”
42. Lab study: 12 edX & on-campus students
• LectureScape vs. baseline interface
• Navigation & learning tasks
Visual search: “Find a slide where the instructor displays on-screen examples of the singleton operation.”
Problem search: “If the step size in an approximation method decreases, does the code run faster or slower?”
Summarization: “Write down the main points of a video in three minutes.”
43. Diverse navigation patterns
With LectureScape:
• more non-linear jumps in navigation
• more navigation options
- rollercoaster timeline
- phantom cursor
- highlight summary
- pinning
“[LectureScape] gives you more options.
It personalizes the strategy I can use in the task.”
44. Interaction data give a sense of “learning together”
Interaction peaks matched with participants’ points of “confusion” (8/12) and “importance” (6/12)
“It’s not like cold-watching.
It feels like watching with other students.”
“[interaction data] makes it seem more classroom-y,
as in you can compare yourself to how other students
are learning and what they need to repeat.”
46. Summary of passive learnersourcing
• Unobtrusive, adaptive use of interaction data
• Analysis of MOOC-scale video clickstream data
• LectureScape: video player powered by learners’ collective watching behavior
• Data-driven interaction techniques for social navigation
47. I. Passive learnersourcing (MOOC videos)
– Video player clickstream analysis [L@S 2014a, L@S 2014b]
– Data-driven content navigation [UIST 2014a, UIST 2014b]
II. Active learnersourcing (how-to videos)
– Step-by-step information [CHI 2014]
– Summary of steps [CSCW 2015]
51. Completeness & detail of instructions [Eiriksdottir and Catrambone, 2011]
Proactive & random access in instructional videos [Zhang et al., 2006]
Interactivity: stopping, starting and replaying [Tversky et al., 2002]
Subgoals: a group of steps representing task structures [Catrambone, 1994, 1998]
Seeing and interacting with solution structure helps learning.
55. Improving how-to video learning
Interacting with the solution: UI for solution structure navigation
Seeing the solution: extract steps + subgoals at scale
57. ToolScape: Step-aware video player
Crowdsourcing Step-by-Step Information Extraction to Enhance Existing How-to Videos.
Juho Kim, Phu Nguyen, Sarah Weir, Philip J. Guo, Robert C. Miller, & Krzysztof Z. Gajos.
CHI 2014. Best of CHI Honorable Mention.
61. Participants felt more confident about their design skills with ToolScape.
– Self-efficacy gain
– Four 7-point Likert-scale questions
– Mann-Whitney U test (Z=2.06, p<0.05); error bars: standard error
[Chart: self-efficacy before task ≈3.8 in both conditions; gain after task: ToolScape +1.4 vs. Baseline +0.1]
62. Participants believed they produced better designs with ToolScape.
– Self-rating on designs produced
– One 7-point Likert-scale question
– Mann-Whitney U test (Z=2.70, p<0.01); error bars: standard error
[Chart: self-rating of produced designs (7-point scale): ToolScape 5.3 vs. Baseline 3.5]
63. Participants actually produced better designs with ToolScape.
– External rating on designs
– Krippendorff’s alpha = 0.753
– Wilcoxon Signed-rank test (W=317, Z=-2.79, p<0.01, r=0.29)
– Error bar: standard error
[Chart: external ranking of designs (lower is better): ToolScape 5.7 vs. Baseline 7.3]
64. Improving how-to video learning
Interacting with the solution: UI for solution structure navigation
Seeing the solution: extract steps + subgoals at scale
70. Multi-stage annotation workflow
FIND: When & what are the steps?
VERIFY: Vote & improve
EXPAND: Before/after the steps?
Crowdsourcing Step-by-Step Information Extraction to Enhance Existing How-to Videos.
Juho Kim, Phu Nguyen, Sarah Weir, Philip J. Guo, Robert C. Miller, & Krzysztof Z. Gajos.
CHI 2014. Best of CHI Honorable Mention.
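A minimal sketch of how the FIND stage could merge worker-submitted step timestamps; the editor's notes mention DBSCAN, so this uses a greedy 1-D DBSCAN-style grouping, and the eps/min_pts values are illustrative rather than the paper's.

```python
def cluster_timestamps(times, eps=2.0, min_pts=3):
    """Greedy 1-D grouping of step timestamps (seconds).

    Workers in the FIND stage each mark when a step happens; submissions
    within `eps` seconds of each other are merged, and a cluster counts
    as a candidate step only if at least `min_pts` workers agree.
    """
    times = sorted(times)
    clusters, current = [], [times[0]]
    for t in times[1:]:
        if t - current[-1] <= eps:
            current.append(t)
        else:
            clusters.append(current)
            current = [t]
    clusters.append(current)
    # Keep well-supported clusters; represent each step by its median time.
    return [c[len(c) // 2] for c in clusters if len(c) >= min_pts]

# Three workers agree on steps near 12 s and 47 s; one stray click at 30 s
print(cluster_timestamps([11.8, 12.1, 12.6, 30.0, 46.9, 47.3, 47.8]))
# [12.1, 47.3]
```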
75. FIND → VERIFY → EXPAND
Input video → Output timeline
80. Generating subgoal labels is difficult
• Requires domain experts and knowledge extraction experts to work together. [Catrambone, 2011]
• Insight: the subgoal labeling process is a good exercise for learning!
– Reflect, explain, summarize
89. Crowdy evaluation
• Does participating in learnersourcing
improve learning?
• Does the learnersourcing workflow
produce good subgoal labels?
90. Study 1: Pedagogical benefits of learnersourcing
• 300 Turkers
• Intro stats video
• IV: 3 video interfaces (Between-subjects)
• DV:
– Learning
• Pretest + Posttest
• Retention test (3-5 days after video watching)
– Workload (NASA TLX Test)
91. Baseline: no prompting, no subgoals shown
Expert: no prompting, subgoals shown
Crowdy: prompting, subgoals shown
92. Retention test: Crowdy = Expert > Baseline
1-way ANOVA: F(2, 226)=3.6, p< 0.05, partial η2=0.03
Crowdy vs Baseline: p < 0.05, Cohen’s d = 0.38
Expert vs Baseline: p < 0.05, Cohen’s d = 0.35
Error bar: Standard error
1-way ANOVA: F(2, 226)=4.8, p< 0.01, partial η2=0.04
Crowdy vs Baseline: p < 0.05, Cohen’s d = 0.38
Expert vs Baseline: p < 0.01, Cohen’s d = 0.45
Error bar: Standard error
[Chart: retention test scores by condition, out of 100]
93. Pretest + posttest scores were not different across conditions.
One-way ANOVA: p > 0.05
Error bar: Standard error
94. Crowdy did not add workload.
Questions on
mental demand, physical demand, temporal demand,
performance, effort, and frustration
7-point Likert scale (1: low workload, 7: high workload)
One-way ANOVA: p > 0.05
Error bar: Standard error
95. Study 2: Subgoal labeling quality
• ~50 web programming + statistics videos
• Classroom + live website deployment
• ~1,000 participating users (out of ~2,500 visitors)
Stage 1: 922 subgoals created | Stage 2: 966 upvotes | Stage 3: 527 upvotes
96. Subgoal quality evaluation
Analyzed 4 most popular videos; 4 external raters compared expert vs. learner subgoals
97. The majority of learner-generated subgoals were rated as matching or better than expert-generated ones.
Analyzed 4 most popular videos; 4 external raters compared expert vs. learner subgoals
98. Interview with learners & creator
• Learners
• Creator
“I was more... attentive to watching, to trying to understand what exactly am I watching.”
“Having pop-up questions means the viewer has to be paying attention.”
The choices “...made me feel as though I was on the same wavelength still.”
99. Learnersourcing design principles
Crowdsourcing → Learnersourcing
• simple and concrete task → pedagogically meaningful task
• quality control → incentive design
• data collection → learning + data collection
• microscopic, focused task → overall contribution visible
• cost: money → cost: learners’ time & effort
100. Summary of active learnersourcing
• Techniques for extracting solution structure from existing videos
• Video UIs for learning with steps & subgoals
• Studies on learning benefits + label quality
• Learnersourcing activity design: engaging & pedagogically meaningful tasks whose byproducts yield useful information
110. Contributions
Learnersourcing: support video learning at scale
• UIs: novel video interfaces & data-driven interaction techniques powered by large-scale learning interaction data
• Workflows: techniques for inferring learner engagement from clickstream data, and extracting semantic information from educational videos
• Evaluation studies: studies measuring pedagogical benefits, resulting data quality, and learners’ qualitative experiences
111. Learnersourcing: Improving Learning with Collective Learner Activity
Juho Kim | MIT CSAIL | juhokim@mit.edu | juhokim.com
Interaction Peaks | LectureScape | ToolScape | Crowdy
Editor's Notes
Video has emerged as a primary medium for online learning, and millions of learners today are watching videos on web platforms such as Khan Academy, Coursera, edX, and YouTube.
Videos are great in that they can be accessed any time from anywhere, and learners can watch at their own pace.
Also, once a video is published, millions of learners can learn from the same material.
It’s an efficient and scalable way to deliver education.
[1 min]
But does it necessarily mean that video is a better medium for learning?
Unfortunately, I would argue that the answer is no, at least not at the moment.
Let’s compare video learning against some of its competitors.
In in-person learning, like 1:1 private tutoring or classroom lectures, the learner and the instructor interact directly.
This direct link enables many effective instructional strategies,
So that learners can stay engaged, while instructors can provide immediate feedback and adapt their instruction.
In video learning, however, there is no direct channel between the learner and the instructor.
The video interface is in the middle. While this asynchronous interaction makes scalable delivery possible,
There are trade-offs involved. Many of the good ingredients in 1:1 learning are missing.
// most video interfaces are not really designed to support learning.
As a result, many video learners resort to passive and isolated viewing.
And the video interface doesn’t really adapt to the learner,
Finding information and navigating to specific points inside a video is also difficult.
“video search has been a notoriously difficult problem.”
I think these limitations come from two major problems that most video interfaces have.
First, video interfaces don’t know who the learners are and how they are watching the video.
Second, video interfaces don’t know what the content is and how to make it more helpful for learners.
===
Human computation involves computational problems
- Replaces computers with humans
- Hard for computers, easy for humans. Or they work together to make it easy
Crowdsourcing
- Replaces human workers with members of the public
- ”Outsourcing it to an undefined, generally large group of people in the form of an open call" - Jeff Howe
Human computation is a different perspective on how humans and computers interact.
It is different in that the human participation is determined by the computational framework.
Humans are helping computers in the computation, and crowd intelligence is augmenting the computational power that the computer has.
For video learning to truly scale, I think it’s essential that the video interfaces understand the learners’ engagement and understand the content better.
Understanding learner engagement is hard because the interfaces cannot reach beyond the screen to really know what learners are doing.
“out in the wild”
Understanding content is hard because it requires understanding the domain, learning objective, and presentational approach,
Each of which is a challenging problem on its own.
“out in the wild”
===
Yet we hope to be able to analyze learners’ video usage patterns and watching behaviors.
(Interaction pattern & behavior analysis)
And we hope to be able to mine, index, and summarize the video.
(Video mining, indexing, summarization)
To address these challenges in video learning, I take a data-driven approach in my research.
I use data generated from learners’ interaction with video content
to understand and improve learning.
The fact that we have a large group of learners watching the same video provides some unique opportunities.
We now have a way to track the learning process at a fine-grained level: second by second and click by click.
And use this understanding to improve the content and video interfaces.
What becomes important here are the tools to collect, process, and present this large-scale learning interaction data.
too much focus on what learners DO.
I’m telling my trick already. There’s no “wow, this is a neat solution to the problem”, because I’m giving away the suspense too easily.
When I present my solution, people should be reacting, “oh, wow, that’s a neat solution"
===
And my secret recipe is learnersourcing.
As the name implies, learnersourcing is crowdsourcing with learners as a crowd.
As we know, crowdsourcing has offered a new solution to many computationally difficult problems by introducing human intelligence as a building block.
While crowdsourcing often issues an open call to an undefined crowd,
learnersourcing uses a specialized crowd, learners who are inherently motivated and naturally engaged in their learning.
This difference makes learnersourcing tackle unique problems that computers or general crowdsourcing cannot easily solve.
And the idea is that the byproduct of learners’ natural learning activities can be used to
Dynamically improve the future learner’s experience.
===
Natural interaction with content, Or design special interactions to collect useful input from learners
[5 min]
Let me give you some concrete examples of what I mean by learnersourcing. A 30-second tour.
What if the video understood how learners watch the video and adapt?
Here’s a video player that adapts to learners’ watching behavior.
As the learner watches a video, the clickstream log is analyzed by the system for a meaningful pattern.
The system finds hot spots in the video where many learners were feeling confused about.
Based on this information, the video interface dynamically improves various features for future learners.
I implemented this idea in a video player called LectureScape.
I’ll talk about it in more detail later in the talk.
Let’s look at another example.
This time, the video player adds a new learning activity for video viewers.
Learners are asked to summarize the section that they just watched.
The system collects summary labels from multiple learners and finalizes them into a video outline.
The UI then adds this outline next to the video player for future learners.
Some colleagues and I implemented this idea in a video player called Crowdy.
I’ll also talk about it in more detail later in the talk.
I categorize learnersourcing into two models.
In passive learnersourcing, I use existing data streams coming from learners’ video watching activity
and process and analyze them into meaningful information for future learners.
On the other hand, in active learnersourcing,
I design new learning activities that benefit learners AND contribute data that can be used to improve the learning experience for future learners.
In my research, I’ve designed, built, and studied various active and passive learnersourcing applications to support video learning at scale.
I’ll cover most of them at least briefly later.
[7 min]
Essentially, in learnersourcing, I strive to establish a feedback loop between the learner and the system.
As the learner watches the video naturally or answers some prompt that is pedagogically meaningful,
The byproducts of these activities are processed by the system to produce a meaningful outcome.
Then the system uses this outcome to dynamically improve content and UI for future learners.
Hopefully by the end of the talk, you’ll be convinced that learnersourcing can…
In large-scale video learning environments, interfaces powered by learnersourcing can enhance content navigation, create a sense of learning with others, and
ultimately improve learning.
There are more than 6000 MOOCs now, some of which are from this very school and some from faculty in this room.
A MOOC often includes tens to hundreds of video clips, and research shows that students taking a MOOC spend a majority of their time watching videos.
But MOOC instructors often don’t have a good sense of how learners are using the videos.
Traditional classrooms, on the other hand, provide natural interaction data.
Instructors in a classroom can visually check students’ engagement.
Students might be engaged and paying attention, confused with a question, or bored and falling asleep.
Or it might be the entire class that falls asleep.
=== relate to the current room and how I can adapt dynamically
While online videos provide access any time from anywhere, they disconnect the interaction channel between instructor and students.
One problem of videos is that interaction between student and instructor, and between student and student, is lost.
===
http://www.flickr.com/photos/10816734@N03/8249942979/in/photolist-dz27Cc-7FGT5v-fvhqZF-auPupW-8c6a4M-9uEhUU-dZUzSs-i54S8p-i55y7F-i54SgR-i54ZH3-i55xKi-i55179-i55yck-i54S3V-i55y6i-i54ZLE-i55xY4-i553f9-cRANHJ-98HXSU-apaQNA-aGNKak-cJ2nes-cJ2niN-cJ2nuh-cJ2nbq-cJ2npU-8aJrP5-dgF74D-bBLYjg-ddYkR6-9qqiYV-98HTZN-bEhdMJ-7WnzsU-88biBa-8qU4Cy-eEknDg-7Cpa5o-eEoW74-dmhvoT-9f7aPF-9irnJd-brfBxH-bpxmdx-87vrbw-8tXMNr-am1uZL-cXJdEo-bDpXMx
http://www.flickr.com/photos/armgov/4991079510/sizes/z/
https://www.pc4all.co.kr/new_co_upload_view.asp?num=448&category=comm
For instructors recording videos for MOOCs, it’s like they are talking to a wall.
Instructors have no way to see if students are engaged, confused, or bored.
Students also have no way to express their reaction, or see how other students are watching the video.
This seriously restricts our understanding of how students learn with the videos, and limits the video learning experience itself.
===
// But a MOOC video experience is not like a classroom experience.
// There’s a disconnect between instructors and students, in both time and location.
http://www.flickr.com/photos/liquidnight/2214264870/sizes/l/
We first looked at a few major factors that might affect learners’ engagement with video.
By performing a post-hoc analysis looking at average session length as a metric,
We found that …
These are useful guidelines for instructors and video editors, and when we interviewed a video editor at edX,
He enjoyed the fact that much of what he knew from experience and intuition had been confirmed.
We wanted to now look at how learners navigate inside a video.
Watch sequentially: common when you watch for the first time
Pause: you might be confused, pace is too fast
Return: revisit important concept
Skip: search for a specific part, or quickly review
We’ve seen many such examples in the data.
It might be far-fetched to make any conclusion based on one student’s data,
But what if you had thousands of students’ data?
===
If every student watches it differently, why would combining them all be helpful?
Clarify why having the pattern is useful.
===
Now that we looked at why data matters, and how data can be used,
let me tell you about the specific dataset we used for our analysis.
These hot spots in the video may indicate points of confusion or importance.
"give me more time" / "I'm not tracking/following" and "this is important"
===
Re-watching peak: focusing on nonsequential sessions
Play peak: points of interest
They often correlate, but not always.
Qualitative coding
----- Meeting Notes (2/24/14 16:16) -----
one presentation style to another.
code example, cut, talking head.
Remember that our motivation looking at this data was to find ways to use this data to improve students’ learning experience.
Let’s see how the data analysis I presented so far can be used to achieve this goal.
----- Meeting Notes (10/7/14 17:52) -----
analysis is useful for instructors and video editors,
but can we more directly impact students' learning experience?
what if we use this data to dynamically change the way video player works?
I’d like to focus on just a few of them.
It is an example of control-display ratio adaptation
[5, 16, 37], dynamically changing the ratio between
physical cursor movement and on-screen cursor movement.
The faster the dragging, the weaker the friction. We achieve this effect by temporarily
hiding the real cursor, and replacing it with a phantom cursor
that moves slower than the real cursor within peak ranges
(Figure 3). The idea of enlarging the motor space
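A minimal Python reconstruction of that control-display mapping (not the paper's code); the base ratio and the speed falloff are assumptions.

```python
def phantom_cursor_dx(physical_dx, t, peak_ranges, base_ratio=0.4):
    """Control-display ratio adaptation for timeline dragging.

    physical_dx: raw mouse movement in pixels for this frame.
    t: video time (seconds) currently under the cursor.
    peak_ranges: (start, end) second ranges around interaction peaks.
    Inside a peak the on-screen (phantom) cursor moves slower, adding
    'friction'; faster drags feel less friction, per the excerpt above.
    """
    if not any(start <= t <= end for start, end in peak_ranges):
        return physical_dx
    speed = abs(physical_dx)
    ratio = min(1.0, base_ratio + speed / 100.0)  # friction weakens with speed
    return physical_dx * ratio

print(phantom_cursor_dx(10, 125.0, [(120, 130)]))  # 5.0: slowed inside a peak
print(phantom_cursor_dx(10, 60.0, [(120, 130)]))   # 10:  normal elsewhere
```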
----- Meeting Notes (1/28/15 18:46) -----
- simulated navigation tasks? simulated tasks that people are likely to encounter.
- separate navigation vs learning.
- what is within subjects?
- say what UI evaluation is normally like: are you good at teaching UI concepts to the audience?
Removed “socially-ranked search”
Extending this preliminary lab evaluation, I’m currently collaborating with a team at edX to integrate the LectureScape player into the platform.
edX instructors will have an option to opt in to use LectureScape as the default video player in our live deployment.
Real class
Streaming data
Cold start problem, adaptive interfaces…
“we talk about millions of learners on MOOCs but it’s common for a single video to have millions of viewers.”
You can learn how to do almost anything online nowadays by watching how-to videos,
including cooking, applying Photoshop filters, assembling furniture, and applying makeup.
How do you go back to a step that you missed for the first time?
You have to use the timeline slider to make imprecise estimates.
===
Q: Why videos in the first place?
A: 1) Formative study at Adobe last year showed that it is a preference issue. There are certain types of people who learn better with videos, and they would turn to videos as much as possible, while others use static, step-by-step HTML tutorials.
2) Video captures physical procedures. Step-by-step sometimes skips important steps.
Limits in navigation affect the learning experience and turn people away from watching video.
They have a specific structure, which is that they contain step-by-step instructions.
What are some of the properties of how-to videos we can leverage in improving the learning experience?
Combining the lessons from the literature, we can conclude that seeing and interacting with the solution structure helps.
Ask a question: what are some steps you can think about?
May be hard to remember and understand what each step does.
ToolScape adds the interactive timeline to let learners click each tool or work-in-progress image to repeat, jump, or skip steps in the workflow.
Wadsworth constant
the before rating
The self-efficacy measure didn't change much after the task in the baseline condition,
whereas the ToolScape condition showed a significant increase.
In addition to feeling more confident, learners also believed that they performed better with ToolScape.
Finally, external raters rated designs produced using ToolScape higher.
Lower is better.
Considering that these are the same videos and we didn’t improve the content,
it is quite surprising that the way learners interacted with the material affected the outcome.
Summary: "navigational, self-efficacy, and performance benefits of having step-by-step information about the solution"
Mention automatic method
ToolScape adds the interactive timeline to let learners click each tool or work-in-progress image to repeat, jump, or skip steps in the workflow.
----- Meeting Notes (4/29/14 09:20) -----
So our solution to the annotation challenge is the multi-stage crowdsourcing workflow.
Mention DB-Scan
----- Meeting Notes (4/29/14 09:20) -----
We automatically create a multiple choice question
How do we pick the label after we break ties?
Make it clear why we name it “Expand”
Avoid saying “Label”. It’s choosing or identifying images to accompany a step.
----- Meeting Notes (4/29/14 09:20) -----
animate for after?
How do we pick the frame after we break ties?
Retake the screenshot?
----- Meeting Notes (4/29/14 09:20) -----
why not apply pixel diff and string diff in advance?
these are often not identical, so we first wanted to leave it to Turkers, and then apply automatic methods. we might unnecessarily filter things.
----- Meeting Notes (5/1/14 08:11) -----
1. domain-independent
2. 80% good? obviously it's not perfect. but looking into 20% revealed interesting thing.
Crowdy: video learning website with learnersourcing workflow
These reflective exercises can be beneficial to learning. Self-explanation, meta-cognition, paying attention.
Also useful activity where you get a chance to “compare” multiple alternatives.
“Evaluate” the scope of the description.
922 subgoals created, 966 upvotes, 527 upvotes
14 out of 17
learner labels matching or better than expert labels
the choices “...made me feel as though I was on the same wavelength still.”
Crowdy: learning activity design
this shows a pattern in my learnersourcing research
design a meaningful activity, find a way to create something useful
Now I’ll briefly mention some of my future research ideas, and wrap up.
Many learnersourcing applications that I presented today deal with clickstream data or simple prompts.
I think there are exciting opportunities in broadening the scope of learnersourcing to other learning contexts beyond videos.
- Programming IDE that learnersources multiple implementations of the same function?
- Graphical design tool that learnersources multiple visual assets?
- Writing tool that learnersources multiple expressions and phrases?
With these ideas, we can make the existing creativity support tools more social and interactive.
===
I’ve shown how we can add annotations to existing videos with learnersourcing workflows.
Now that we have the enabling technology, what if we had thousands of videos fully annotated and summarized?
What interesting applications can we build?
Now that videos are indexed at the step level,
we can look at 100 different ways to perform a step and see which approaches are more and less common.
Based on this data, we can also make search, browsing, and recommendation work at the step level.
===
Similar for more conceptual lecture videos where students’ alternative explanations can be indexed, browsed, etc.
We can also push forward the boundary of learnersourcing by expanding the role of learners.
What if the entire course materials are created, taught, and improved by learners?
With large-scale explanations, feedback, and improvements that are all learnersourced,
Learners take a more active role in learning, and future learners can choose the best set of resources that work for them.
Many social domains suffer from the same limitations that video learning does:
Members are passive and isolated, and there is a lack of channels for expressing individual input and contributing in a meaningful way.
Learnersourcing presented a model where micro contributions from members of a community can make a difference.
This conceptual idea can generalize to various social domains such as open government, nutrition, healthcare, and accessibility, to name a few.
And the goal is to support more community-driven planning, discussion, decision making, and creative processes.
===
a lot of community practice and civic issues matter to members, but the decision process and complicated structure are often not accessible to the members. The conceptual idea behind learnersourcing presents a technique for engaging community members in the process, while members engage in natural or voluntary or intrinsically motivated activities.
I started this talk by asking the question of “how can we make video learning really scale?”
I think unfortunately this is where we are. The learning experience becomes worse while delivery of content scales up.
===
As number increases, fully supporting the good ingredients of 1:1 tutoring becomes harder and harder.
Many researchers interested in learning at scale are working hard to enable the good components of in-person learning in online settings.
And learnersourcing is one such attempt at creating online learning environments that truly scale.
But with the unique opportunities we have because of the scale, because of the data, and because of the video,
I believe we can be more ambitious and provide an even better learning experience than in-person learning.
Learnersourcing is a step toward this vision, by supporting more interactive, collaborative, and data-driven learning.
===
"beyond being there": technology for distance work, take advantage of new medium. this kind of logic can be applied in my intro.
Methods for collecting and processing large-scale data from learners