Accuracy and errors

CHAPTER 17: ACCURACY AND ERROR
Reporter: SHELAMIE M. SANTILLAN, EDUC 243 student
2nd Sem. S.Y. 2016-

When is a test score inaccurate?
Almost always.
All tests and scores are imperfect and are subject to error.

Error – What is it?
• No test measures perfectly, and many tests fail to measure as well as we would like them to.
• Tests make “mistakes”. They are always associated with some degree of error.
• Think about the last test you took.
• Did you obtain exactly the score you thought or knew you deserved?

Examples of types of error that lower your obtained score
• When you couldn’t sleep the night before the test
• When you were sick but took the test anyway
• When the essay test you were taking was so poorly constructed it was hard to tell what was being tested
• When the test had a 45-minute time limit but you were allowed only 38 minutes
• When you took a test that had multiple defensible answers

Examples of types of error (or situations) that raised your obtained score
• The time you just happened to see the answers on your neighbor’s paper
• The time you got lucky guessing
• The time you had 52 minutes for a 45-minute test
• The time the test was so full of unintentional clues that you were able to answer several questions based on the information given in other questions

Then how does one go about discovering one’s true score?
Unfortunately, we don’t have an answer. The true score and the error score are both theoretical or hypothetical values.

Why bother with the true score or error score?
Because they allow us to illustrate some important points about test score reliability and test score accuracy.

Simply keep in mind!
Remember:
Obtained score = true score + error score

Table 17.1 The Relationship among Obtained Scores, Hypothetical True Scores, and Hypothetical Error Scores for a Ninth-Grade Math Test

Student   Obtained Score   True Score   Error Score
Donna     91               88           +3
Jack      72               79           -7
Phyllis   68               70           -2
Gary      85               80           +5
Marsha    90               86           +4
Milton    75               78           -3

(True scores and error scores are hypothetical values.)
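The relationship obtained score = true score + error score can be checked against the Table 17.1 values used throughout this chapter's worked steps. The short Python sketch below is only an illustration: the names `TABLE_17_1` and `error_score` are made up for this example, and the true scores are hypothetical values that could never be observed in practice.

```python
# Rows from Table 17.1: (student, obtained score, hypothetical true score).
# The true scores are hypothetical; real tests never reveal them.
TABLE_17_1 = [
    ("Donna", 91, 88),
    ("Jack", 72, 79),
    ("Phyllis", 68, 70),
    ("Gary", 85, 80),
    ("Marsha", 90, 86),
    ("Milton", 75, 78),
]


def error_score(obtained, true):
    """Error score implied by: obtained score = true score + error score."""
    return obtained - true


error_scores = {name: error_score(obt, true) for name, obt, true in TABLE_17_1}

for name, err in error_scores.items():
    # A positive error means the obtained score overstates the true score.
    print(f"{name}: {err:+d}")
```

A positive error score (Donna, Gary, Marsha) means the test overstated the student's true score; a negative one (Jack, Phyllis, Milton) means it understated it.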

The Standard Error of Measurement (abbreviated Sm)
Is the standard deviation of the error scores of a test.
We will use the error scores from Table 17.1 (3, -7, -2, 5, 4, -3).

Step 1: Determine the mean.
M = ∑X / N = 0 / 6 = 0

Student   Obtained Score   True Score   Error Score
Donna     91               88           +3
Jack      72               79           -7
Phyllis   68               70           -2
Gary      85               80           +5
Marsha    90               86           +4
Milton    75               78           -3

∑X = 0, N = 6

Step 2: Subtract the mean from each error score to arrive at the deviation scores. Square each deviation score and sum the squared deviations.

X – M = x        x²
+3 – 0 = 3       9
-7 – 0 = -7      49
-2 – 0 = -2      4
+5 – 0 = 5       25
+4 – 0 = 4       16
-3 – 0 = -3      9

∑x² = 112

Step 3: Plug the ∑x² sum into the formula and solve for the standard deviation.
Error Score SD = √(∑x² / N) = √(112 / 6) = √18.67 = 4.32
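The three steps can also be traced in a few lines of Python. This is a minimal sketch, assuming the six error scores from Table 17.1 and the population form of the standard deviation (dividing by N rather than N - 1), which is what reproduces the Sm of 4.32 quoted for this test. The function name `error_score_sd` is made up for illustration.

```python
import math

# Error scores from Table 17.1 (hypothetical values).
error_scores = [3, -7, -2, 5, 4, -3]


def error_score_sd(scores):
    """Return (mean, sum of squared deviations, standard deviation)."""
    n = len(scores)
    mean = sum(scores) / n                    # Step 1: the mean
    deviations = [x - mean for x in scores]   # Step 2: deviation scores
    sum_sq = sum(d ** 2 for d in deviations)  # Step 2: sum of squares
    sd = math.sqrt(sum_sq / n)                # Step 3: divide by N, take the root
    return mean, sum_sq, sd


mean, sum_of_squares, sd = error_score_sd(error_scores)
print(f"M = {mean:.0f}, sum of squares = {sum_of_squares:.0f}, SD = {sd:.2f}")
```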

Fortunately, a rather simple statistical formula can be used to estimate this standard deviation (Sm) without actually knowing the error scores:
Sm = SD × √(1 – r)
where r is the reliability of the test and SD is the test’s standard deviation.
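As a rough illustration of this estimate, the sketch below assumes the usual form Sm = SD × √(1 – r). The function name and the sample values (SD = 15, r = .84) are invented for the example and do not come from the chapter.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """Estimate Sm as SD * sqrt(1 - r), assuming 0 <= r <= 1."""
    if not 0 <= reliability <= 1:
        raise ValueError("reliability must be between 0 and 1")
    return sd * math.sqrt(1 - reliability)


# Hypothetical test: SD = 15 and reliability = 0.84 (invented for illustration).
sm = standard_error_of_measurement(15, 0.84)
print(f"Sm = {sm:.2f}")
```

Under this formula a perfectly reliable test (r = 1) has an Sm of zero, and a test with no reliability (r = 0) has an Sm equal to its standard deviation.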

USING THE STANDARD ERROR OF MEASUREMENT
In summary, then, we know that error scores:
1. are normally distributed
2. have a mean of zero
3. have a standard deviation called the standard error of measurement
Figure 17.1 The error score distribution (error scores from Table 17.1)

This figure tells us that the distribution of error scores is a normal distribution.
Figure 17.2 The error score distribution for the test depicted in Table 17.1 (error scores of the ninth-grade math test)

Fig. 17.3 The error score distribution for the test depicted in Table 17.1, with approximate normal curve percentages.
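The approximate normal curve percentages can be reproduced numerically. The sketch below assumes that error scores are exactly normally distributed with a mean of zero and uses the Sm of 4.32 from the worked example. The helper name `share_within` is made up for illustration.

```python
import math

SM = 4.32  # standard error of measurement from the worked example


def share_within(k):
    """Proportion of a normal distribution within +/- k standard deviations."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    # k * SM converts the band from Sm units into test-score points.
    print(f"within ±{k} Sm (±{k * SM:.2f} points): about {share_within(k):.1%}")
```

These proportions are where the familiar rule of thumb of about 68%, 95%, and 99% comes from.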

Let’s use the following number line to represent an individual’s obtained score, which we will simply call X:

Fig. 17.4 The error distribution around an obtained score of 90 for a test with Sm = 4.32

Fig. 17.5 The error distribution around an obtained score of 75 for a test with Sm = 4.32
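The error distributions around obtained scores of 90 and 75 can be turned into score bands by adding and subtracting Sm. The sketch below assumes Sm = 4.32 and normally distributed error, so a band of ±1 Sm has roughly a 68% chance of containing the true score and a band of ±2 Sm roughly a 95% chance. The function name `score_band` is made up for illustration.

```python
SM = 4.32  # standard error of measurement for this test


def score_band(obtained, sm, k=1):
    """Return (low, high) for a band of +/- k Sm around an obtained score."""
    return (round(obtained - k * sm, 2), round(obtained + k * sm, 2))


for obtained in (90, 75):
    low68, high68 = score_band(obtained, SM, k=1)
    low95, high95 = score_band(obtained, SM, k=2)
    print(f"obtained {obtained}: ~68% band {low68} to {high68}, "
          f"~95% band {low95} to {high95}")
```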

Standard Deviation or Standard Error of Measurement?

Standard Deviation (SD)
• Is the variability of raw scores.
• It tells us how spread out the scores are in a distribution of raw scores.
• Is based on a group of scores that actually exist.

Standard Error of Measurement (Sm)
• Is the variability of error scores.
• Is based on a group of scores that is hypothetical.

Why all the fuss about error?
For two reasons:
1. We want to make you aware of the fallibility of test scores.
2. We want to sensitize you

Classification of sources of error
1. Test Takers.
2. The test itself.
3. Test administration.
4. Test scoring.

Test Takers:
Factors that would likely result in an obtained score lower than a student’s true score:
• Fatigue and illness
• Accidentally seeing another

The test itself:
• Trick questions
• Reading level that is too high
• Ambiguous questions
• Items that are too difficult

Test Administration:
• Physical comfort
• Instructions and explanations
• Test administrator attitudes

Error in Scoring:
• When computer scoring is used, error can occur.
• When tests are hand scored, the likelihood of error increases greatly.

Sources of Error Influencing Various Reliability Coefficients
• Test-Retest
• Alternate Forms
• Internal Consistency

Test-Retest
• Short-interval test-retest coefficients are not likely to be affected greatly by within-student error.
• Any problems that do exist in the test are present in both the first and second administrations, affecting scores the same way each time the test is administered.

Alternate Forms
• Since alternate-forms reliability is determined by administering two different forms or versions of the same test to the same group close together in time, the effects of within-student error are negligible.
• Error within the test, however, has a significant effect on alternate-forms reliability.
• As with the test-retest method, alternate-forms score reliability is not greatly affected by error in administering or scoring the test, as long as similar

BAND INTERPRETATION
• Uses the standard error of measurement to more realistically interpret and report groups of test scores.
The formula to compute the reliability of the difference score is as follows:
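A commonly used form of the difference-score reliability formula, assumed here, is r_diff = ((r11 + r22) / 2 - r12) / (1 - r12), where r11 and r22 are the reliabilities of the two scores and r12 is the correlation between them. The Python sketch below applies that assumed formula. The function name is made up, and the correlation of .80 between subtests is a hypothetical value chosen only for illustration (the .91 reliabilities match the band interpretation example).

```python
def reliability_of_difference(r11, r22, r12):
    """Reliability of a difference score, assuming the common formula
    ((r11 + r22) / 2 - r12) / (1 - r12)."""
    if r12 >= 1:
        raise ValueError("r12 must be less than 1")
    return ((r11 + r22) / 2 - r12) / (1 - r12)


# Both subtests have reliability .91; the .80 correlation is hypothetical.
r_diff = reliability_of_difference(0.91, 0.91, 0.80)
print(f"reliability of the difference score = {r_diff:.2f}")
```

With these numbers the difference score is noticeably less reliable than either subtest alone, which is why small differences between subtest scores call for cautious interpretation.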

Step 1: List Data
Let’s assume M = 100, SD = 10, and score reliability = .91 for all subtests.
Here are the subtest scores for John:

Step 2: Determine Sm (standard error of measurement)
Since SD and r are the same for each subtest in this example, the standard error of measurement will be the same for each subtest.
Sm = SD × √(1 – r) = 10 × √(1 – .91) = 10 × .3 = 3

Step 3: Add and Subtract Sm

Step 4: Graph the Results
Shade in the bands to represent the range of scores that has a 68% chance of capturing John’s true score.

Step 5: Interpret the Bands
• Interpret the profile of bands by visually inspecting the bars to see which bands overlap and which do not.
• Those that overlap probably represent differences that likely occurred by chance.
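The band interpretation procedure can be sketched in Python. SD = 10 and r = .91 are the values assumed in the example, but the subtest names and scores below are hypothetical, invented only to show the mechanics; they are not John's actual scores. Treating bands that merely touch as overlapping is also an assumption of this sketch.

```python
import math
from itertools import combinations

SD = 10
RELIABILITY = 0.91

# Hypothetical subtest scores, invented for illustration only.
subtest_scores = {"Subtest A": 103, "Subtest B": 98, "Subtest C": 91}


def standard_error_of_measurement(sd, r):
    """Estimate Sm as SD * sqrt(1 - r)."""
    return sd * math.sqrt(1 - r)


def band(score, sm):
    """68% band: the obtained score plus and minus one Sm."""
    return (score - sm, score + sm)


def bands_overlap(a, b):
    """True when two (low, high) bands share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


sm = round(standard_error_of_measurement(SD, RELIABILITY), 2)
bands = {name: band(score, sm) for name, score in subtest_scores.items()}

for (name1, band1), (name2, band2) in combinations(bands.items(), 2):
    if bands_overlap(band1, band2):
        verdict = "overlap: the difference likely occurred by chance"
    else:
        verdict = "no overlap"
    print(f"{name1} {band1} vs {name2} {band2}: {verdict}")
```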

Final Word:
• Technically, there are more accurate statistical procedures for determining real differences between an individual’s test scores than the ones we have been able to present here. These procedures, however, are time-consuming, complex, and overly specific for the typical teacher.
• Within the classroom, band interpretation, properly used, makes for a practical alternative to those more advanced procedures.
