This document discusses different types of scoring tools used to evaluate performance-based assessments, including checklists, rating scales, and rubrics. It defines each tool, provides examples, and outlines their advantages and limitations. Guidelines are provided for constructing effective rubrics, including involving students and using clear, specific criteria aligned to learning outcomes. The key scoring tools - checklists, rating scales, and analytic and holistic rubrics - are compared.
2. Agenda
• In this unit, you will:
learn more about the two
performance-based tests.
identify the advantages
and limitations for each
Learn about the
construction of
Performance tasks
Learn about performance
Criteria
Learn about rubrics &
Scoring
4/3/2022
Types of Performance –based Assessment
2
4. Objectives
By the end of this topic, you
should be able to:
1. Define checklists, rating
scales and rubrics;
2. List the types of rating scales
and rubrics;
3. List the advantages and
disadvantages of rating scales
and rubrics;
4. Compare checklists, rating
scales and rubrics; and
5. Construct any of the two
instruments.
4/3/2022
Types of Performance –based Assessment
4
5. Group Activity1: 5 min
Check quickly pages 271-275 of the
booklet to find out the scoring tools
used to rate performance tasks.
4/3/2022 Types of Performance –based Assessment 5
6. Introduction
• Checklists, rating scales and rubrics are tools that state specific
criteria and allow teachers and students to gather information and to
make judgements about what students know and can do in relation
to the outcomes.
• The purpose of checklists, rating scales and rubrics is to:
• provide tools for systematic recording of observations
• provide tools for self-assessment
• provide samples of criteria for students prior to collecting and evaluating data
on their work
record the development of specific skills, strategies, attitudes and behaviours
necessary for demonstrating learning
• clarify students' instructional needs by presenting a record of current
accomplishments.
4/3/2022 Types of Performance –based Assessment 6
7. Group Activity 2: 5 min
Go to the scoring tools folder. You will
find a sample of a rubric, a checklist,
and a rating scale.
Determine which is which.
4/3/2022 Types of Performance –based Assessment 7
8. Checklists
Checklists usually offer a yes/no format in relation to
student demonstration of specific criteria. This is similar to
a light switch; the light is either on or off.
They may be used to record observations of an individual,
a group or a whole class.
4/3/2022 Types of Performance –based Assessment 8
9. Checklists
Advantages
(a) Easy to Use
(b) A Method for Assessing Individual Children
(c) Provide Visual Images
(d) Flexibility
(e) updated easily
Limitations
• Not Indicators of Child Performance
4/3/2022 Types of Performance –based Assessment 9
10. Rating Scales
Unlike checklists that document the existence or non-existence of traits or skills, rating scales are used to describe
the degree to which those behaviours or traits are believed to be present in the individual (Gullo, 2005)
Rating Scales allow teachers to indicate the degree or frequency of the behaviours, skills and strategies
displayed by the learner. A rating scale provides for a range of performance levels. Rating scales state the
criteria and provide three or four response selections to describe the quality or frequency of student work.
Teachers can use rating scales to record observations and students can use them as self-assessment tools.
Teaching students to use descriptive words, such as always, usually, sometimes and never helps them
pinpoint specific strengths and needs.
Rating scales also give students information for setting goals and improving performance. In a rating scale, the
descriptive word is more important than the related number. The more precise and descriptive the words for each
scale point, the more reliable the tool.
4/3/2022 Types of Performance –based Assessment 10
11. Rating Scales
Advantages
(a) Can Capture Behaviours other
Measurements Cannot Capture
(b) Quick to Complete
(d) Easy to Develop
(e) Useful in Assessing the Children’s
Progress
Limitations
• (a) Highly Subjective
• (b) Ambiguity of Descriptors
4/3/2022 Types of Performance –based Assessment 11
12. Rating Scales Types
4/3/2022 Types of Performance –based Assessment 12
Numerical Rating Scale Graphic Rating Scale
There are many types of rating scales but the ones that are used most frequently
are the numerical rating scale and the graphic rating scale.
15. Rubrics
Rubrics are similar to rating scales because they are also qualitative instruments. However, unlike other
classroom assessment strategies, rubrics can be used to assess students’ progress as well as score
students’work (Wortham, 2008).
Rubrics use a set of criteria to evaluate a student's performance. They consist of a fixed
measurement scale and detailed description of the characteristics for each level of performance.
These descriptions focus on the quality of the product or performance and not the quantity; e.g.,
not number of paragraphs, examples to support an idea, spelling errors.
Rubrics are commonly used to evaluate student performance with the intention of including the
result in a grade for reporting purposes. Rubrics can increase the consistency and reliability of
scoring.
Rubrics may be compared over time.
4/3/2022 Types of Performance –based Assessment 15
16. Rubrics
Advantages
(a) Can Lead to Objectivity
(b) Flexibility
(c) Adaptable
(d) Can be Used by Teacher and
Student
(e) Can be Translated into Grades
Limitations
(a) New Teachers May Have
Difficulty Developing the
Criteria
(b) Teachers may Focus on
Quantity
4/3/2022 Types of Performance –based Assessment 16
17. Tips for Constructing Rubrics
4/3/2022 Types of Performance –based Assessment 17
Rubrics should be constructed with input from
students whenever possible.
A good start is to define what quality work looks like
based on the learning outcomes.
Exemplars of achievement need to be used to
demonstrate to students what an excellent or
acceptable performance is. This provides a
collection of quality work for students to use as
reference points. Once the standard is established,
it is easy to define what exemplary levels and less-
than-satisfactory levels of performance look like.
The best rubrics have three to five descriptive
levels to allow for discrimination in the evaluation of
the product or task. Rubrics may be used for
summative purposes to gauge marks by assigning
a score to each of the various levels.
18. Group Activity 3: 5 min
Go to the rubric folder and decide
which is a holistic rubric and which is
an analytical rubric.
4/3/2022 Types of Performance –based Assessment 18
19. Analytical vs Holistic Rubrics
Analytic rubrics are more
specific than holistic rubrics and
more suitable for diagnostic
purposes as well as grading
purposes.
Holistic rubrics lack validity and
reliability because the
descriptors for the holistic rubric
can be too general and lack
specificity (Wortham, 2008).
Thus, it is important to use the
analytic rubric if the purpose for
the rubric is assessment and
scoring rather than the overly
general holistic rubric.
4/3/2022 Types of Performance –based Assessment 19
20. Holistic Rubrics
The holistic rubric is used when making general judgments about
performance.
According to Wortham (2008), „this type of rubrics assigns a single
score to the student’s overall performance. These rubrics usually have
competency labels that define the level of performance. A number of
indicators describe the quality of work or performance at each level.
The example of holistic rubric in Rubric 1 has four levels of competence
– from inexperienced writer to mature writer.
4/3/2022 Types of Performance –based Assessment 20
21. Analytical Rubrics
An analytic rubric is used when the task is broken down to various necessary parts.
The example of analytic rubric in Rubric 2 is about tackling the student work from three
dimensions, namely the students’ understanding of the problem, the procedures taken by
the students to solve the problem and the outcome that the students arrived at.
Each of the dimensions has descriptors and each of the descriptors have a numerical scale.
Following this rubric would lead the teacher to arrive at the score for each of the students
and can also help the teacher to give necessary feedback to the students.
4/3/2022 Types of Performance –based Assessment 21
22. Creating Rubrics with Students (1/2)
• Learning increases when students are actively involved in the assessment process.
Students do better when they know the goal, see models and know how their
performance compares to learning outcomes.
• Learning outcomes are clarified when students assist in describing the criteria used to
evaluate performance.
• Use brainstorming and discussion to help students analyze what each level looks like.
• Use student-friendly language and encourage students to identify descriptors that are
meaningful to them. For example, a Grade 3 class might describe levels of quality with
phrases such as the following.
• Super!
• Going beyond
• Meets the mark
• Needs more work.
4/3/2022 Types of Performance –based Assessment 22
23. Creating Rubrics with Students (2/2)
• Use work samples to help students practise and analyze specific
criteria for developing a critical elements list. They can also use
samples to practise assigning performance levels and compare criteria
from level to level.
• Involve students in the assessment process by having them participate
in the creation of a rubric. This process facilitates a deeper
understanding of the intended outcomes and the associated
assessment criteria.
• After a rubric has been created, students can use it to guide their
learning. Criteria described in a rubric serve to focus student reflection
on their work and facilitate the setting of learning goals for a particular
performance assessment. Through self-assessment or peer-
assessment, students can use a rubric to assess work completed to
date and use it to guide their planning for the "next steps" in learning.
4/3/2022 Types of Performance –based Assessment 23
24. Tips for Developing Checklists, Rating
Scales and Rubrics
1.Use checklists, rating scales and rubrics in relation to outcomes and standards.
2.Use simple formats that can be understood by students and that will communicate
information about student learning to parents.
3.Ensure that the characteristics and descriptors listed are clear, specific and observable.
4.Encourage students to assist with constructing appropriate criteria. For example, what
are the descriptors that demonstrate levels of performance in problem solving?
5.Ensure that checklists, rating scales and rubrics are dated to track progress over time.
6.Leave space to record anecdotal notes or comments.
7.Use generic templates that become familiar to students and to which various
descriptors can be added quickly, depending on the outcome(s) being assessed.
8.Provide guidance to students to use and create their own checklists, rating scales and
rubrics for self-assessment purposes and as guidelines for goal setting.
4/3/2022 Types of Performance –based Assessment 24
25. Summary
4/3/2022 Types of Performance –based Assessment 25
A checklist is an instrument for recording and examining sequenced
series of behaviour that are related to educational developmental goals.
Except for the modality of measurement, rating scales and checklists
are similar.
There are mainly two types of rating scales, namely numerical rating
scales and graphical rating scales.
The main two types of rubrics are holistic and analytic rubrics.
Editor's Notes
They offer systematic ways of collecting data about specific behaviours, knowledge and skills.
Advantages
(a) Easy to Use
Checklists are either checked or ticked, therefore using them is not difficult for any teacher or caregiver. Moreover, checklists are easy to use because they
require little instruction or training for teachers to quickly learn how to use them (Wortham, 2008).
(b) A Method for Assessing Individual Children
A separate checklist can be kept for each child in the classroom. In this manner, each child can be monitored and their progress can also be tracked.
(c) Provide Visual Images
Checklists can provide a visual image of a child’s progress. Through coding or dating the observations, it is easy for teachers to determine how each child
progresses in the specific areas described on the checklists (Gullo, 2005). By just casting a glance at the checklist, the teacher can have a visual image
representing the developmental or learning progress of each child in the class.
(d) Flexibility
The flexibility of checklists makes them usable with a variety of assessment strategies. For example, it can be used with observation; it can also be used
with classroom tests and assignments. The flexibility thus makes it a great assessment instrument. It also provides the opportunity to perform
evaluation using a convenient strategy and also a combination of assessment strategies (Wortham, 2008).
(e) Availability
Because the checklist is always available to the teacher, new behaviours or developmental traits can always be recorded as and when they appear. In
other words, checklists can be updated easily.
Limitations
Checklists can only indicate whether a child can perform adequately or otherwise. When it comes to the issue of using
assessments to give grades or levels of performance, checklists cannot be used (Irwin & Bushnell, 1980).
It could be used with rating scales for proper assessment.
Advantages
(a) Can Capture Behaviours other Measurements Cannot Capture
There are certain behaviours that are not easily measured by other forms of assessment strategies (checklists vs rating scales)
(b) Quick to Complete
Usually, rating scales come with the descriptors of the child’s behaviour. As a result of this, raters can easily complete the rating scale using the
descriptors as a guide. The presence of the descriptors on the rating scale also gives the rater the opportunity to complete the scale some time after the observation has been conducted (Jablon et al., 2007).
(c) Easy to Understand
The use of the scale’s indicators makes it easy for people to easily understand how to use the rating scale. Consequently, only minimal training is required
for a beginner to use the rating scale. Thus, professionals as well as students can complete the rating scale without stress (Wortham, 2008).
(d) Easy to Develop
The use of descriptors also makes it easy for anyone to design and develop a rating scale easily. According to Wortham (2008): ⁄ because descriptors remain consistent on some rating scales, teachers find them easy to design. When using rating indicators such as always, sometimes, rarely and never, the teacher can add the statements for rating without having to think of rating categories for each one.
(e) Useful in Assessing the Child’s Progress
It permits the teacher to describe the child’s progress towards understanding or mastery instead of whether the child has achieved a predetermined level, as in the case of checklists (Wortham, 2008).
Limitations
(a) Highly Subjective
Since the use of rating scales depends on the judgment of only one person, it is usually subjective in nature. Consequently, rater error and rater bias are
the most common problems. Sometimes, teachers rate the children on the basis of prior information, interactions or emotion rather than on an objective
basis. On the same vein, Linn and Miller (2005) opined that rating scales often reflect the attitude of the teacher towards the child.
(b) Ambiguity of Descriptors
If any ambiguous term is used as a descriptor, it is highly likely that raters will not agree on the precise meaning of the term. Consequently, the reliability of
the information derived would decrease. A good example is the use of terms such as „sometimes‰ or „rarely‰. In these two terms, what constitutes
sometimes for teacher A may not be at the same level for teacher B.
Wortham (2008) also asserted that „rubrics are related to performance assessments. They provide guidelines to distinguish performance
from one level to another.
Advantages
(a) Can Lead to Objectivity
As you would have observed from the examples given, rubrics usually have guidelines for the teacher to use when scoring and assessing the students’
work. If the guidelines are strictly adhered to, the scoring and assessment are expected to be objective. All the students will be scored and assessed using
the same descriptors and the same score parameters.
(b) Flexibility
Because of its flexibility, rubrics can be designed for many uses and ability levels.
(c) Adaptable
Wortham (2008) opined that „they are dynamic and subject to revision and refinement. Because they are easily modified and changed, they can meet changing classroom and student needs. Moreover, a teacher can continue to adapt his rubric to changing goals and objectives based on the curriculum.
(d) Can be Used by Teacher and Student
In most cases where the rubric is used, it serves as a guide to both the teacher and the student in guiding the student towards completing the task. For
example, the student can consistently look at the rubric as he progresses in his work to achieve the level he would be attaining while the teacher can also
use the rubric formatively as he makes suggestions to the student before the completion of the task.
(e) Can be Translated into Grades
Through the rubrics, especially the analytic rubrics, the scores of the students can be easily given and also translated into grades.
Limitations
New Teachers May Have Difficulty Developing the Criteria
Users may Focus on Quantity
It is usually common for teachers to mistakenly grade students on the volume of work done rather than on
quality of the work. When this happens, the teacher has focused on wrongcharacteristics of student work.
When developing a rubric, consider the following:
What are the specific outcomes in the task?
Do the students have some experience with this or a similar task?
What does an excellent performance look like? What are the qualities that distinguish an excellent response from other levels?
What do other responses along the performance quality continuum look like?
Is each description qualitatively different from the others? Are there an equal number of descriptors at each level of quality? Are the differences clear and understandable to students and others?
According to Weiner and Cohen (1997), „an analytic rubric describes and scores each of the task attributes separately, uses limited descriptors for each
attribute, uses a scale that can be both narrow and broad, and allows for specific diagnostic feedback.
Although rubrics are often used as assessment of learning tools, they can also be used as assessment for learning tools. Students can benefit from using rubrics as they become more competent at judging the quality of their work and examining their own progress.Example:
Involve students in the assessment process by having them participate in the creation of a rubric. This process facilitates a deeper understanding of the intended outcomes and the associated assessment criteria.
After a rubric has been created, students can use it to guide their learning. Criteria described in a rubric serve to focus student reflection on their work and facilitate the setting of learning goals for a particular performance assessment. Through self-assessment or peer-assessment, students can use a rubric to assess work completed to date and use it to guide their planning for the "next steps" in learning.