Kathleen Preston Jan 9, 2009 Presentation

Most general divide-by-total item response

theory model
NRM has received the least attention

Can be used to address important

psychometric questions
Useful in exploratory item response data

Currently unclear how researchers should

approach hypothesis testing of specific
parameters.

EXP  ix  cix 
Pix    m

 EXP    c 
ix ix
x 0

 ix =  cix = 0

1
Pix | x  x or x ' 
1  exp(   c )
* *

where, * = x - x’
and c* = (cx’ – cx)/ *

 is a category slope
There are four for a 4-Point Item
* is a category discrimination
There are three for a 4-Point Item
They represent the discrimination of
three dichotomies

1* = 2 - 1 1 vs. 2
2* = 3 - 2 2 vs. 3
3* = 4 - 3 3 vs. 4

 Rating scale model
◦ constrain all c* parameters to be
equal across items
Partial Credit Model


Generalized Partial Credit Model


Rating Scale model


 Partial Credit model
◦ * is constrained to be equal
within and between items
Generalized Partial Credit model


Rating Scale model


Partial Credit model


 Generalized Partial Credit model
(G-PCM):
◦ a* parameters are constrained
within an item, but not between
items

The NRM will be evaluated as a

method of hypothesis testing
◦ Evaluate the assumption of the G-PCM of
equal category discriminations within
items
◦ Using PROMIS data as an example of
testing the assumption
◦ Power to detect different category
discrimination parameters within an item

Part 1: Evaluation the assumption of the G-

PCM of equal category discriminations within
items
◦ Manipulated variables
Category discrimination parameter

Intersection parameters

Number of items

Sample size

Distribution of θ


Part 2: Using PROMIS data to test assumption

◦ PROMIS Depression Inventory
768 individuals

28 items

G-PCM was fit to data using PARSCALE

Data simulated using produced parameter estimates

 Distribution of θ
 Sample size

Part 3: Power to detect different category

discrimination parameters within an item
 Average category discrimination
 Category discrimination variance
 Different forms of too many response options
 One discrimination too many
 Multi-point item should be a dichotomy

Estimate the G-PCM for all simulated data

and identify the log-likelihood
Free up the category discriminations one item

at a time and identify the log-likelihood
Evaluate the change in log-likelihood

Difference in log-likelihood should be chi-

square distributed (M=df, σ2 = 2df)

For all conditions with normal θ distribution


�� = .05
�� = 2.00
�� 2 = 4.01
For all conditions with skewed θ distribution


�� = .31
�� = 5.18
�� 2 = 16.56

Normal θ Skewed θ
 

PROMIS data parameters


�� = 2.25 ��1 = 0.36 ��2 = 0.81 ��3 = 1.67
L-R test results


Sample Size
500 1,000 2,000
M(σ2) 2.28 (167.43) 2.28 (12.91) 2.32 (9.57)
Normal θ
.07 .07 .09
Type I error
M(σ2) 1.90 (247.09) 3.5 (14.21) 5.58 (21.83)
Skewed θ
.14 .19 .37
Type I error

Average category discrimination

◦ α* = 1.75  �� = .63
◦ α* = 1.25  �� = .67
◦ α* = 0.75  �� = .76
Category discrimination variance

◦ α* variance = 0.5  �� = .26
◦ α* variance = 2.0  �� = .63
Different forms of too many response options

◦ One discrimination too many
�� = 1.00
 For all conditions 
◦ Multi-point item should be a dichotomy
�� = .77
 For all conditions 

For all conditions under a normal θ

distribution, the LR-difference test appears to
be valid
The LR-difference test appears to have

adequate power to detect unequal
discrimination parameters
The LR-difference test has excellent power to

detect when an item has one too many
discrimination parameters (α4 = 0)
High category discriminations and skewed θ

distribution appears to present some
problems

Kathleen Preston Jan 9, 2009 Presentation

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Viewers also liked

Viewers also liked (20)

Similar to Kathleen Preston Jan 9, 2009 Presentation

Similar to Kathleen Preston Jan 9, 2009 Presentation (20)

Kathleen Preston Jan 9, 2009 Presentation