The Art and Science of Applied Test Development. This is the fourth in a series of PPT modules explicating the development of psychological tests in the domain of cognitive ability using contemporary methods (e.g., theory-driven test specification; IRT-Rasch scaling; etc.). The presentations are intended to be conceptual and not statistical in nature. Feedback is appreciated.
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
Β
Applied Psych Test Design: Part D--Develop norm (standardization) plan
1. The Art and Science of Test DevelopmentβPart D
Develop norm (standardization) sample plan
Kevin S. McGrew, PhD.
Educational Psychologist
Research Director
Woodcock-MuΓ±oz Foundation
The basic structure and content of this presentation is grounded extensively on the test
development procedures developed by Dr. Richard Woodcock
2. Develop norm (standardization) sampling plan
ο
οο
ο
οο
οο
οο ο
οο
οοο
οοο
οοοοο
οοοο
ο ο οο ο ο
ο οο ο ο
The goal:
A nationally representative sample
from which to develop test norms
3. Implementation of sampling plan in practical test
development framework
ο
οο
ο
οο
οο
οοο
οο
οοο
οοο
οοοοο
οοοο
ο ο οο ο ο
ο οο ο ο
Gather the norm (standardization) data and use
for Rasch scaling, norm development,
psychometric, and statistical analyses
4. Implementation of sampling plan in
Conceptual Psychometric Validity Framework
οο
ο
ο
Gather the norm (standardization) data and use οο
οο
οο
οο
ο
for Rasch scaling, norm development, οο
οο
οο
psychometric, and statistical analyses οοο
οοο
οοο
ο οο ο
οοοο
οοο
5. Develop norm (standardization) sample specifications for
country/nation where test is intended to be used
ο
οο
ο
οο
οο
οο ο
οο
οοο
οοο
οοοοο
οοοο
ο ο οο ο ο
ο οο ο ο
The goal:
A nationally representative sample
from which to develop test norms
6. Sampling plan should be based on best available source of
national statistics (select examples below)
7. Three-stage sampling plan strategy used for
WJ batteries in United States
ο
οο
ο
οο
οο
οοο
οο
Stage 1: Sampling of communities οοο
οοο
οοοοο
οοοο
ο ο οο ο ο
ο οο ο ο
The goal:
Stage 2: Sampling of schools A nationally representative sample
from which to develop test norms
Stage 3: Sampling of subjects
β’ School-age subjects β random sampling in grades
β’ Preschool, university and adult subjects - quotas selected as
per selected US Census variables
8. ο
οο
ο
οο
οο
Goal is a nationally ο οο
οο
representative sample upon οοο
οοο
which to base the norms for the οοοοο
οοοο
measures (tests, clusters) ο ο οο ο ο
ο οο ο ο
9. No sampling plan is perfect: βTweakβ final norm data via subject weighting
A few tips/cautions
β’Oversample small groups
and then down-weight
β’No amount of creative
weighting can βfixβ a poorly
executed sampling plan
β’Continuous sample analysis:
Make sure to frequently
monitor data as it is collected
to see how close the βfitβ
between the sampling plan
and actual subject
characteristics. Important so
you can make adjustments
during the data collection
(before it is too late)
10.
11.
12.
13. WJ III Three-stage sampling plan strategy: Stage 1
Sampling of communities β communities sampled according
to 10 different community characteristics
14. Important question to think about
$$
$$$$$$$$
Are students who are categorized as βlow SESβ (at parent/family level) from $$ Community
likely to be having the same community/life/family/educational experiences as βlow SESβ
students (at parent/family level) from Community $$$$$$$$ ?
15. Illustration of potential threat to representative sample when community
SES is ignored and only family/parent SES is used in sampling plan
Level of abilities of selected students
Cmnty C
High
Community SES classification
Cmnty B
Middle
Cmnty A
Low
Family/Parent SES classification
Low Middle High
16. Community SES is one of the most important, yet
most frequently overlooked variables in a test
standardization sampling plan
17. Sample research/norm
demographic and data
file variable coding
sheet: WJ III
Tips/Cautions
β’ Spend considerable time
designing this data
collection form (be OCD)
β’ Think of all future
analyses when deciding
what to include/code
β’ If in doubt about a
variableβ¦include it (you
typically canβt go back
later to get information)
β’ Create coding system
with quality data entry
procedures in mind
β’ You canβt analyze what
you havenβt gathered and
coded from the beginning
18. Development and implementation of sampling plan is probably the most critical
phase of test development
β’ GIGO β garbage inο garbage out
Retain as much control over all aspects of data collection and data entry as
possible
Quality of data is only as good as your pool of examiners
β’ Recruiting, training, supervising, and retaining good examiners requires major
attention and is very important
β’ Examiners need supervision
β’ Terminate poor examiners as soon as you can
β’ Central office should review every single piece of information on submitted test
records, especially at the beginning of an examiners testing (you donβt want them
practicing errors)
β’ Run special Rasch βperson fitβ reports to flag test records that look suspicious (and
see if they come from certain examiners)
β’ Best examiners are not necessarily psychologists or doctoral students.
(cont. next slide)
19. Spend considerable time developing the test record and variable coding
sheets
Run βcontinuous sample analysis reportsβ to monitor sampling plan
adherence or drift β so you can make changes quickly before it is too late
There is no single-purpose system, or collection of published software
programs, that can handle the detailed and fluid work of data entry, editing
and monitoring. Be prepared to develop and pay for custom software.
Dedicated professional data-entry software is a must. SPSS, Excel, etc.
wonβt cut it.
β’ Double data entry verification a must
β’ Only have a few well trained and diligent individuals enter the data
Donβt succumb to samples of convenience
Oversample small groups β and then down weight
Seriously consider three-stage sampling plan that controls for community
SES
21. Recent example: Australian WJ IIII standardization partial matrix sampling plan
Test 1 A B C D E X Total n
1. Verbal Comprehension C C C C C C 1346
2. Visual-Auditory Learning C C C C C C 1336
3. Spatial Relations C C C C C C 1375
4. Sound Blending C C C C C C 1382
Yellow column
(1) represents 5. Concept Formation C C C C C C 1300
initial βcoreβ 6. Visual Matching C C C C C C 1364
battery prior to
7. Numbers Reversed C C C C C C 1325
implementation
of βcore + 8. Incomplete Words C M 627
matrixβ battery 9. Auditory Working Memory C M 548
Mixed batteries
plan 11. General Information M M 389
12. Retrieval Fluency M M 388
13. Picture Recognition M M 390
14. Auditory Attention M M 388
15. Analysis-Synthesis M M 362
16. Decision Speed M M 376
17. Memory for Words M M 364
1. Letter-Word Identification C C C C C C 1323
2. Reading Fluency C M 588
5. Calculation C C C C C C 1210
6. Math Fluency C M 588
7. Spelling C M 557
8. Writing Fluency M M 352
9. Passage Comprehension C C C C C C 1203
10. Applied Problems C C C C C C 964
11. Writing Samples M M 342
Totals 450 102 117 127 108 120 372 1396
22. End of Part D
Additional steps in test development process will be
presented in subsequent modules as they are developed