Applications of Multivariate Techniques to Measure Content Structure with Multidimensional IRT

Applications of Multivariate
Techniques to Measure Content
Structure with Multidimensional IRT
Quinn N Lathrop
Advanced Computing & Data Science Lab
1

What do we do in the Advanced
Computing & Data Science Lab?
Prototyping
● Adaptive learning capabilities
● Authoring and tagging tools with machine
intelligence
● Exploration of new technology
Research and Development
● Direct support of a product or prototype
● Exploratory research into new capabilities
2

Data Context and IRT
Online learning systems provide students with instruction, homework, and summative
tests for an entire course.
The Book is organized hierarchically into Chapters and Sections. Organization is
called the Table of Contents (TOC). Items are within a single Section.
4
Traditional (unidimensional) IRT models can work well at both
● Local-level: Section or Chapter specific models
● Book-level: placing the entire course on a single latent trait
We need to turn to multidimensional models to help understand the relationships
between the structures in the book.

Explore the use of multivariate
and multidimensional techniques
to make inferences about the
structure of content in online
learning systems
5

Multivariate Section-Level IRT
1. IRT model specified at section level
○ Can be any specification, 1PL, 2PL, 3PL polytomous, etc
2. Section-level covariance matrix
○ Jointly estimate all section-level IRT models
○ Simple structure. All items within a section only load on the section-level
latent trait. All section-level latent traits can freely covary
3. Secondary analysis of covariance matrix
○ Plug covariance into any EFA, PCA, SEM
○ EFA to explore book structures analysis
○ SEM to verify TOC structures or “aggregate” covariance up the TOC
7

A Few Equations
8
1. IRT model specified at section level

1. Section-level IRT
All the usual unidimensional
psychometric results are available
• ability
• difficulty
• discrimination
• guessing
• ...
Now we have psychometric results but only inside in context of the item’s
section. Next, we look at the covariance between all objectives.

Section 1 Section 2 Section 3 Section 4
Item 1 X
Item 2 X
Item 3 X
Item 4 X
Item 5 X
Item 6 X
Item 7 X
Item 8 X
Item 9 X
Item 10 X
Item 11 X
Item 12 X
Item 13 X
Item 14 X
Item 15 X
Item 16 X

Multidimensional
Section 1 1.000 0.836 0.855 0.456
Section 2 0.836 1.000 0.919 0.684
Section 3 0.855 0.919 1.000 0.413
Section 4 0.456 0.684 0.413 1.000
Independent
Section 1 1 0 0 0
Section 2 0 1 0 0
Section 3 0 0 1 0
Section 4 0 0 0 1
Dependent
Section 1 1 1 1 1
Section 2 1 1 1 1
Section 3 1 1 1 1
Section 4 1 1 1 1

Book-level IRT model
Section Book-level diff % Correct
Section 1 -0.098 76%
Section 2 -0.543 83%
Section 3 0.296 68%
Section 4 0.146 71%
Section 1
Section 2
Section 3
F1
F2
0.8
0.8
1.0
0.9
Section 4
0.5
Objective-Level Multivariate IRT
Section 1 1.000 0.836 0.855 0.456
Section 2 0.836 1.000 0.919 0.684
Section 3 0.855 0.919 1.000 0.413
Section 4 0.456 0.684 0.413 1.000

What
13
Presentation Title Arial Bold 7 pt

Not your usual data...
15
● 5,000 items
● Item cloning, each item can have up to thousands of instantions
● Instructor controlled learning aids, scoring policies, and settings
● A semester can have 20,000 students and 6,000,000 responses
● As a person by item response matrix, that’s 95% missing data
● Missingness do to an ensemble of effects
○ Instructor customizations
○ Variety of courses and institutions using the same book

Table of Contents and Multidimensional Models
16
● Books have can have 10 to 30 Chapters
● Each Chapter has about 4 to 8 Sections
So if we want to do what we said...
…that implies a 40 to 250 dimensional model.
How do we do that?

How do we estimate high dimensional models?
Pairwise.
● Problem grows quadratically, not exponentially.
● Instead of fitting one 40-dimensional model with, for example, 10^40 latent evaluation points,
we fit (40^2 - 40)/2 = 780 2-dimensional models each with 10^2 latent evaluation points
● Pairwise models are easily parallelized, CPU-limited, and chunk the data, allowing the method to
scale with appropriate computational resources
18

The Obvious Criticism
● Secondary (post-hoc) analysis of covariance matrix does not correctly account
for standard errors.
● It would be better to jointly estimate the model on the covariance matrix
simultaneous with its estimation.
○ ...but the pairwise estimation is hard part, requiring significant
computational resources and time. Once we get that, the secondary
analysis is trivial.
○ ...and there are larger threats to the inference and standard errors
(non-ignorable missing data, student growth over time, etc).
○ ...even still, the value of the results justify its use.
19

Example 1: Comparing
TOC to Exploratory
Factor Analysis

Example 2: Impose the
TOC with Structural
Equation Modeling

Can the Chapters Explain the Covariance of Sections?
Intro =~ 1_1
Ch1 =~ 2_1 + 2_2 + 2_3 + 2_4 + 2_5 + 2_6 + 2_7 + 2_8
Ch2 =~ 3_1 + 3_2 + 3_3 + 3_4 + 3_5 + 3_6 + 3_7
Ch3 =~ 4_1 + 4_2 + 4_3 + 4_4 + 4_5 + 4_6
Ch4 =~ 5_1 + 5_2 + 5_3 + 5_4 + 5_5
25

Screen Book for Areas for
Expert Review
28
Ch A
Ch B
Ch C
● Flag areas where data do not match
expectations
● Can be thought of as taking the TOC as
the expert domain model, and then
validating that model with the data
● Target human and expert reviews to
areas most likely in need
Odd Section

Example 3: Where can
this take us?

Goals of Psychometric Models
Primary goal is measuring latent traits
Secondary goal involves inferences about content
33
Test-level inferences
● Dimensionality analysis
● Linking and equating
● Validity studies
Item-level inferences
● Item parameter filtering
● Differential item functioning
● Item fit

Applications of Multivariate Techniques to Measure Content Structure with Multidimensional IRT

Recommended

Recommended

More Related Content

Similar to Applications of Multivariate Techniques to Measure Content Structure with Multidimensional IRT

Similar to Applications of Multivariate Techniques to Measure Content Structure with Multidimensional IRT (20)

Recently uploaded

Recently uploaded (20)

Applications of Multivariate Techniques to Measure Content Structure with Multidimensional IRT