Umap17 learner modelingforintegrationskills_yunhuang

Learner Modeling for Integration Skills
Yun Huang¹, Julio Guerra-Hollstein¹,², Jordan Barria-Pineda¹, Peter Brusilovsky¹
¹University of Pittsburgh, ²Universidad Austral de Chile
07/11/2017 @ UMAP

How do students develop mastery?
MASTERY
• ACQUIRE component skills
• PRACTICE integrating skills
• KNOW WHEN TO APPLY skills
Source: Ambrose, Susan A., et al. How learning works: Seven research-based principles for smart teaching. 2010.

Empirical evidence showing difficulty in integration?
Ø Algebra
  • Composition effect
    • Heffernan & Koedinger '97; Koedinger & McLaughlin '16
    • translate two matched one-step problems: 800-y and 40x
    • translate two-step story problems into expressions: 800-40x
  • Intervention study
    • Koedinger & McLaughlin '10

Empirical evidence showing difficulty in integration?
Ø Programming
Ø Patterns in programming expertise (Gilmore & Green '88; Soloway & Ehrlich '84)

    print("Enter temperature, -300 to stop")
    count = 0
    sum = 0.0
    temp = float(input("First:"))
    while temp > -300.0:
        sum += temp
        count += 1
        temp = float(input("Next: "))

The same program interleaves two patterns:
• Pattern of Sentinel Input Processing
• Pattern of Summing a Sequence

Empirical evidence showing difficulty in integration?
Ø Our recent studies demonstrate integration difficulty in program comprehension

    public static void main(String[] args) {
        int y = 1;
        for (int j = 5; j < 8; j++) {
            y += j;
        }
        System.out.print(y);
    }

What is the output of the program?
success rate: 64%
success rate: 39%

    public static void main(String[] args) {
        int z = 8;
        int j = 7;
        z += j;
        System.out.print(z);
        j = 1;
        z += j;
        j = 3;
        z += j;
        for (int k = 1; k < 4; k++) {
            System.out.print(k);
        }
    }

What is the output of the program?
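As a worked check of the two questions, the sketch below re-expresses both Java programs in Python and collects what each one would print. The Python translation and the function names are assumptions made for illustration; the study itself used Java.

```python
def loop_with_accumulator() -> str:
    """Program where `for` and `+=` are integrated in one loop."""
    y = 1
    for j in range(5, 8):      # j takes the values 5, 6, 7
        y += j                 # y becomes 6, then 12, then 19
    return str(y)


def separate_statements() -> str:
    """Program where `+=` and `for` appear side by side, not integrated."""
    printed = []
    z = 8
    j = 7
    z += j                     # z == 15
    printed.append(str(z))     # System.out.print(z)
    j = 1
    z += j                     # z == 16, never printed
    j = 3
    z += j                     # z == 19, never printed
    for k in range(1, 4):      # k takes the values 1, 2, 3
        printed.append(str(k)) # System.out.print(k), no newline
    return "".join(printed)


print(loop_with_accumulator())  # 19
print(separate_statements())    # 15123
```

Both programs use the same component skills; only the first requires tracing how the loop variable feeds the accumulator.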

Existing popular learner models for multiple-skill practice
Ø Model skills independently or individually [3, 4, 9, 10, 11, 12]
Ø Danger: shallow learner, ineffective remediation
Weakest Knowledge Tracing (WKT) [4, 13, 14]
Conjunctive Knowledge Modeling (CKM) [9, 10, 11, 12]
Ki: latent knowledge level
Oj: observed performance
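The two model families named on this slide differ in how they combine several latent skills Ki into one prediction of an observation Oj. The sketch below is one common reading, not the authors' implementation: it assumes WKT predicts from the weakest required skill, and CKM from the conjunction (product) of all required skills treated as independent. The function names and the guess/slip values are hypothetical.

```python
from math import prod


def predict_wkt(p_known: dict, guess: float = 0.2, slip: float = 0.1) -> float:
    """P(correct) driven only by the weakest required skill (assumed WKT reading)."""
    weakest = min(p_known.values())
    return weakest * (1 - slip) + (1 - weakest) * guess


def predict_ckm(p_known: dict, guess: float = 0.2, slip: float = 0.1) -> float:
    """P(correct) when all required skills must be known (assumed noisy-AND reading)."""
    all_known = prod(p_known.values())  # independence of skills is an assumption
    return all_known * (1 - slip) + (1 - all_known) * guess


# Hypothetical knowledge estimates for an item that requires two skills.
estimates = {"for": 0.9, "+=": 0.6}
print(round(predict_wkt(estimates), 3))  # 0.62
print(round(predict_ckm(estimates), 3))  # 0.578
```

Neither function has a term for how well the student combines `for` with `+=`, which is the gap the integration-level model targets.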

Limited evaluation by performance prediction
◦ Is it worthwhile to make such fine-grained refinements of learner models?
◦ Will traditional learner model evaluation metrics reveal the effect?
◦ Our recent work: performance prediction is not enough!
  ◦ Highly predictive models can be useless for adaptive tutoring [1, 2]
  ◦ Similarly predictive models can be very different for adaptive tutoring [1, 2]

Approach
We propose and demonstrate the effectiveness of:
Ø A new knowledge graph defining progressive integration skills
Ø A new learner model monitoring students' integration skills
Ø A multifaceted evaluation framework for complex latent variable models

Integration-level Learner Model
Conjunctive Knowledge Modeling with Hierarchical Integration skills (CKM-HI)
• Based on an integration graph (pairwise)
• Basic skills and integration skills are separately represented
• Latent skills organized in a hierarchical way
Figure labels: basic component skills (e.g., for, +=, a[]); integration skills (e.g., for&+=, for&a[]); latent; observed
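The pairwise integration graph can be pictured as a mapping from each integration skill to the two basic skills it combines. The sketch below is a simplification under stated assumptions: it enumerates every pair of basic skills, whereas the actual graph presumably keeps only selected pairs, and the function names are hypothetical.

```python
from itertools import combinations


def build_integration_graph(basic_skills):
    """Map each hypothetical pairwise integration skill to its two parent basic skills."""
    graph = {}
    for a, b in combinations(basic_skills, 2):
        graph[f"{a}&{b}"] = (a, b)
    return graph


def skills_for_item(item_basic_skills, graph):
    """Basic skills of an item plus every integration skill whose parents both appear."""
    present = set(item_basic_skills)
    integration = [
        name for name, (a, b) in graph.items() if a in present and b in present
    ]
    return list(item_basic_skills) + integration


graph = build_integration_graph(["for", "+=", "a[]"])
print(sorted(graph))                          # ['+=&a[]', 'for&+=', 'for&a[]']
print(skills_for_item(["for", "+="], graph))  # ['for', '+=', 'for&+=']
```

Keeping basic and integration skills as separate entries lets a model hold a student as proficient in `for` and `+=` while still uncertain about `for&+=`.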

Multifaceted Evaluation Framework
Ø Performance prediction
  Ø RMSE, AUC
Ø Parameter plausibility
  Ø Parameters for capturing noise (guess, slip) should be small
Ø Expected instructional effectiveness
  Ø How much effort does a student need to reach a specific score, assuming students are practicing under the guidance of a learner model?
Ø Real-world recommendation helpfulness (user study)
  Ø How do students rank recommendations from different learner models?
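For the performance prediction facet, RMSE and AUC can be computed directly from observed outcomes and predicted probabilities. The sketch below uses only the standard library and made-up example data; the function names are hypothetical, and the AUC uses the pairwise-ranking definition with ties counted as half.

```python
from math import sqrt


def rmse(y_true, y_prob):
    """Root mean squared error between 0/1 outcomes and predicted probabilities."""
    return sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true))


def auc(y_true, y_prob):
    """Share of (correct, incorrect) pairs where the correct attempt is ranked higher."""
    pos = [p for t, p in zip(y_true, y_prob) if t == 1]
    neg = [p for t, p in zip(y_true, y_prob) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up outcomes and predictions, for illustration only.
outcomes = [1, 0, 1, 0]
predictions = [0.9, 0.2, 0.6, 0.7]
print(round(rmse(outcomes, predictions), 4))  # 0.4183
print(auc(outcomes, predictions))             # 0.75
```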

Dataset and Experimental Setup
• QuizJET system
• 25,988 attempts, 347 students, 91 questions, 67% correct
• 72 basic individual, 43 integration skills
• 10-fold student-stratified cross-validation:
  • In each fold, train on 90% of students and test on the remaining 10% of new students.
• Sequential update by Bayesian rule
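The setup above can be sketched in two small pieces: splitting students (not attempts) into folds, and updating a knowledge estimate after each observed attempt. Both are illustrative assumptions rather than the authors' code: the split ignores any stratification beyond student identity, the update handles a single skill with hypothetical guess/slip values, and the function names are invented.

```python
import random


def student_folds(student_ids, k=10, seed=0):
    """Partition students into k folds so test students are unseen in training."""
    ids = sorted(set(student_ids))
    random.Random(seed).shuffle(ids)
    return [ids[i::k] for i in range(k)]


def bayes_update(p_known, correct, guess=0.2, slip=0.1):
    """Posterior P(skill known) after one observed attempt, via Bayes' rule."""
    if correct:
        numerator = p_known * (1 - slip)
        evidence = numerator + (1 - p_known) * guess
    else:
        numerator = p_known * slip
        evidence = numerator + (1 - p_known) * (1 - guess)
    return numerator / evidence


folds = student_folds(range(347))
print(len(folds), sorted(len(f) for f in folds)[0])  # 10 34

# Sequential update over a hypothetical attempt history.
p = 0.5
for outcome in (True, False, True):
    p = bayes_update(p, outcome)
```

Each fold then serves once as the test set while the other nine are used for training.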

Performance Prediction and Parameter Plausibility
CKM-HI significantly outperforms WKT and CKM in both aspects
* sig. at 0.05/3=0.017, ** sig. at 0.01/3=0.0033, *** sig. at 0.001/3=0.00033.
+ effect size ≥ 1 (large).
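The thresholds in the footnote divide each conventional significance level by 3, which is consistent with a Bonferroni-style correction over three comparisons. A minimal sketch of that arithmetic follows; the function name is hypothetical, and treating the thresholds as strict inequalities is an assumption.

```python
def significance_stars(p_value, n_comparisons=3):
    """Return the star label for a p-value under a divided significance threshold."""
    for stars, alpha in (("***", 0.001), ("**", 0.01), ("*", 0.05)):
        if p_value < alpha / n_comparisons:
            return stars
    return ""


print(significance_stars(0.01))    # *
print(significance_stars(0.002))   # **
print(significance_stars(0.0002))  # ***
```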

Expected Instructional Effectiveness
• Computed based on collected data, focusing on the higher mastery threshold region
• To reach the same score, students under CKM-HI need the least effort
• Using the same effort, students under CKM-HI get the highest score

Expected Instructional Effectiveness
Ø Extends our prior evaluation framework LEOPARD (EDM '14) [1]
Ø Metrics:
  § Score: computed as the mean performance on real data after a learner model asserts mastery for the set of required skills.
  § Effort: computed as the number of practices on real data needed to reach mastery as inferred by a learner model.
  § Consider a range of mastery thresholds
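The two metrics can be illustrated on one student's attempt sequence. The sketch below is a simplified reading of the definitions above, not the LEOPARD implementation: it assumes a single mastery estimate per attempt (in practice mastery is asserted over a set of required skills), and the function name and data are hypothetical.

```python
def effort_and_score(mastery_estimates, outcomes, threshold):
    """Effort: attempts until the model asserts mastery. Score: mean outcome afterwards.

    mastery_estimates[i] is the model's estimate after attempt i;
    outcomes[i] is 1 for a correct attempt and 0 otherwise.
    """
    for i, estimate in enumerate(mastery_estimates):
        if estimate >= threshold:
            remaining = outcomes[i + 1:]
            score = sum(remaining) / len(remaining) if remaining else None
            return i + 1, score
    # Mastery never asserted: all practice counts as effort, no score is observed.
    return len(outcomes), None


estimates = [0.3, 0.6, 0.85, 0.9, 0.95]
outcomes = [0, 1, 1, 1, 0]
for threshold in (0.5, 0.8, 0.99):
    print(threshold, effort_and_score(estimates, outcomes, threshold))
```

Sweeping the threshold, as in the loop, traces out the effort-score trade-off that the slide refers to.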

User Study Setup
Ø Solve 7 Java comprehension problems and rank recommended subproblems
Ø 20 participants pursuing undergraduate or master's degrees in information science at the University of Pittsburgh
Ø 1.5h session on average
Ø Compare 3 learner models (CKM-HI, CKM, WKT) + 1 distractor; each recommends 2 subproblems, mixed together
  Ø Identify the weakest skill, pick a subproblem addressing this skill
  Ø Identify the 2nd weakest skill, pick a subproblem addressing this skill
Ø Compare under two different recommendation strategies: MaxDiff, MinDiff
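The recommendation step described above (find the weakest and second weakest skills, then pick a subproblem for each) can be sketched as follows. This is an illustration under assumptions: the function name, the skill estimates, and the subproblem identifiers are hypothetical, and the MaxDiff and MinDiff strategies are not modeled because the slides do not define them.

```python
def recommend_subproblems(p_known, subproblems_by_skill, n=2):
    """Pick one subproblem for each of the n weakest skills that has a subproblem."""
    weakest_first = sorted(p_known, key=p_known.get)
    picks = []
    for skill in weakest_first:
        candidates = subproblems_by_skill.get(skill, [])
        if candidates:
            picks.append(candidates[0])  # taking the first candidate is arbitrary
        if len(picks) == n:
            break
    return picks


# Hypothetical estimates from one learner model after a student's attempt.
p_known = {"for": 0.9, "+=": 0.4, "for&+=": 0.2}
subproblems = {"for&+=": ["sub_A"], "+=": ["sub_B"], "for": ["sub_C"]}
print(recommend_subproblems(p_known, subproblems))  # ['sub_A', 'sub_B']
```

A model without integration skills would have no `for&+=` entry to rank, so its recommendations would come from the basic skills alone.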

Real-world recommendation helpfulness
Does CKM-HI receive the highest ranking?

• CKM-HI receives significantly higher ranking than others
• Two ways of analyzing the ranking, as continuous/ordinal variables
• Two recommendation strategies
• No sig. diff. between WKT and CKM
• All models sig. outperform Distractor

Future Work
§ Conduct a large-scale, long-span study to collect objective measurements.
§ Explore skill integration beyond the single context
§ Continue to contribute to best practices in evaluating adaptive educational systems
§ Automated methods for extracting integration skills that advance our preliminary approach [15]

Conclusion
• New knowledge graph: Integration Graph
• New integration-level learner model
  • CKM-HI, which significantly outperforms two popular multiple-skill learner models, WKT and CKM, on investigated dimensions
• New multifaceted evaluation framework
  • Performance prediction
  • Parameter plausibility
  • Expected instructional effectiveness
  • Real-world recommendation helpfulness (user study)

Details in the poster session.
Thank you very much for listening!

References
[1] José P. González-Brenes and Yun Huang. 2015. Your model is predictive – but is it useful? Theoretical and empirical considerations of a new paradigm for adaptive tutoring evaluation. In Proc. 8th Int. Conf. Educational Data Mining. 187–194.
[2] Yun Huang, José P. González-Brenes, Rohit Kumar, and Peter Brusilovsky. 2015. A framework for multifaceted evaluation of student models. In Proc. 8th Int. Conf. Educational Data Mining. 203–210.
[3] Albert T. Corbett and John R. Anderson. 1995. Knowledge tracing: Modeling the acquisition of procedural knowledge. User Modeling and User-Adapted Interaction 4, 4 (1995), 253–278.
[4] Yue Gong, Joseph E. Beck, and Neil T. Heffernan. 2010. Comparing knowledge tracing and performance factor analysis by using multiple model fitting procedures. In Intelligent Tutoring Systems. Springer, 35–44.
[5] D. J. Gilmore and T. R. G. Green. 1988. Programming plans and programming expertise. The Quarterly Journal of Experimental Psychology Section A 40, 3 (1 Aug. 1988), 423–442.
[6] Elliot Soloway and Kate Ehrlich. 1984. Empirical Studies of Programming Knowledge. IEEE Trans. Software Engineering SE-10, 5 (1984), 595–609.
[7] Neil T. Heffernan and Kenneth R. Koedinger. 1997. The composition effect in symbolizing: The role of symbol production vs. text comprehension. In Proceedings of the Nineteenth Annual Conference of the Cognitive Science Society.
[8] J. R. Anderson and C. Lebiere. 1998. The atomic components of thought. Erlbaum, Mahwah, NJ.

References
[9] Cristina Conati, Abigail Gertner, and Kurt Vanlehn. 2002. Using Bayesian Networks to Manage Uncertainty in Student Modeling. User Modeling and User-Adapted Interaction 12, 4 (2002), 371–417.
[10] Michael Mayo and Antonija Mitrovic. 2001. Optimising ITS behaviour with Bayesian networks and decision theory.
[11] Eva Millán and José Luis Pérez-De-La-Cruz. 2002. A Bayesian diagnostic algorithm for student modeling and its evaluation. User Modeling and User-Adapted Interaction 12, 2-3 (2002), 281–330.
[12] Cristina Carmona, Eva Millán, José-Luis Pérez-de-la-Cruz, Mónica Trella, and Ricardo Conejo. 2005. Introducing prerequisite relations in a multi-layered Bayesian student model. In International Conference on User Modeling. Springer, 347–356.
[13] José P. González-Brenes, Yun Huang, and Peter Brusilovsky. 2014. General features in knowledge tracing: Applications to multiple subskills, temporal item response theory, and expert knowledge. In Proc. 7th Int. Conf. Educational Data Mining. 84–91.
[14] Yanbo Xu and Jack Mostow. 2012. Comparison of methods to trace multiple subskills: Is LR-DBN best? In Proc. 5th Int. Conf. Educational Data Mining. Chania, Greece, 41–48.
[15] Yun Huang, Julio Guerra, and Peter Brusilovsky. 2016. Modeling skill combination patterns for deeper knowledge tracing. In Proceedings of the 6th Workshop on Personalization Approaches in Learning Environments (PALE 2016).
