This document discusses various concepts related to standardized testing and assessment. It defines key terms like measurement, assessment, standardized tests, norm-referenced testing, criterion-referenced testing, descriptive statistics, frequency distributions, measures of central tendency, measures of variability, normal distributions, and ways to interpret standardized test results including raw scores, percentiles, grade equivalents, and standard scores. The document uses examples and visuals to explain these concepts.
2. Measurement and assessment
Measurement: an evaluation expressed
in quantitative (numerical) terms
Assessment: procedures used to obtain
information about student performance
3. Assessment
Assessment includes
many different
possible activities,
not just tests!
But, since tests are a large part of current assessment practices, this whole
chapter is dedicated to them.
4. Standardized tests
Assessment instruments given to large
samples of students under uniform
conditions and scored according to
uniform procedures
For example: ACT, SAT, GRE, Praxis, and many of the achievement tests you took in
school.
5. Norm-referenced testing: testing in which scores are compared to
guys named “Norm.” Just kidding. Testing in which scores are
compared with the average performance of others.
Norms
An individual’s score on a standardized test is
compared to some kind of “normal” group.
Norming group: the representative group of
individuals whose standardized test scores are
compiled for the purpose of constructing
national norms.
National norms: scores on standardized tests
earned by representative groups of students
from around the nation to which an individual’s
score is compared.
Your score on achievement tests such as SAT or ACT was
compared to the scores of the norming group.
6. Criterion-referenced test
interpretations
Testing in which scores are compared to
a set performance standard.
The test you took to get a driver’s license
was criterion-referenced.
Criterion-referenced tests measure
mastery of concepts or skills.
8. Frequency distribution
Scores on 50 point test:
48, 47, 46, 45, 45, 44, 44, 44, 44, 44, 43, 43,
43, 43, 42, 42, 42, 42, 41, 41, 41, 40, 40, 39,
39, 38, 38, 37, 36, 35
This arrangement doesn’t tell you much.
[Figure: dot plot of the same scores, with X’s stacked above each score value from 35 to 48]
This arrangement shows you more about what happened. Remember the general
shape that is defined by the X’s. It will be important.
A frequency distribution is a distribution of test scores that shows a
simple count of the number of people who obtained each score.
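As a rough illustration, the count described above can be sketched in a few lines of Python. The scores are the 30 values listed on this slide; the variable names are only illustrative.

```python
from collections import Counter

# The 30 scores from the 50 point test on this slide.
scores = [48, 47, 46, 45, 45, 44, 44, 44, 44, 44, 43, 43, 43, 43, 42,
          42, 42, 42, 41, 41, 41, 40, 40, 39, 39, 38, 38, 37, 36, 35]

# A frequency distribution is a count of how many people got each score.
frequency = Counter(scores)

# Print one row per score value, with one X per student who earned it.
for score in range(min(scores), max(scores) + 1):
    print(f"{score:2d} {'X' * frequency[score]}")
```

Each printed row is one score value, so the X's form the same general shape as the dot plot, turned on its side.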
9. Histogram
[Figure: histogram labeled SCORES; horizontal axis from 35 to 47, vertical axis from 0 to 5]
The same frequency distribution can be represented by a histogram: a bar graph of
a frequency distribution.
By the way, if you were a teacher, what would you think of this group of scores?
10. Thinking about the scores
Clearly no one had mastery of the material—out
of a 50 point test, no one answered all
questions correctly.
So, the next question is, does this have
something to do with the nature of the students,
the nature of the teaching that went on, or the
nature of the assessment? Why did so few
students do well on this test?
An assessment is not just information gathered
about students; it also represents something
about teaching and potentially something about
the assessment procedures themselves.
11. Measures of Central Tendency
Mean: average score
Median: middle score
Mode: most frequent score
Measures of central tendency are quantitative descriptions of how a
group performed as a whole.
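A minimal sketch of the three measures, using Python's standard statistics module on the 30 scores from the frequency distribution slide. The variable names are illustrative.

```python
from statistics import mean, median, mode

# The 30 scores from the frequency distribution slide.
scores = [48, 47, 46, 45, 45, 44, 44, 44, 44, 44, 43, 43, 43, 43, 42,
          42, 42, 42, 41, 41, 41, 40, 40, 39, 39, 38, 38, 37, 36, 35]

average_score = mean(scores)        # sum of scores divided by how many there are
middle_score = median(scores)       # middle value once the scores are sorted
most_frequent_score = mode(scores)  # the score that occurs most often

print(round(average_score, 2), middle_score, most_frequent_score)
```

For this group the three measures land close together (about 41.87, 42, and 44), which is what you would expect when there is only one "bump" in the scores.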
12. More about measures of
central tendency
So how come we can’t just use the average?
Imagine a classroom that has one genius and 20 ordinary people. The
genius’s score will artificially raise the average of the class.
Median and mode are not as affected by a single “outlying” score.
13. And another thing….
[Figure: dot plot of heights from 60 to 71 inches, with X’s stacked above each value and two separate peaks]
You might have a frequency distribution that looks like this. For example, if you
measured the heights of a group of men and women, you would have a “bump” for the
average height of men and another “bump” for the height of the women. This situation
would mean that you have two modes (63 and 68 inches in this case) and your
collection of numbers would be “bimodal.” Keep in mind that “average” for the whole
group in this case would be somewhere between the men and the women. The average
wouldn’t really represent what is going on here very well.
14. Measures of variability
Range: the distance between the top
and bottom score in a distribution of
scores.
Standard deviation: a statistical
measure of the spread of scores
Variability: degree of difference from
mean.
15. Standard deviation
Who is the better shot? Standard deviation could tell us because it is essentially the
typical deviation from the center. We would measure the distance between each
hole and the center of the target and then average those distances (strictly, standard
deviation squares the distances, averages them, and takes the square root). The archer
with the lowest standard deviation would be the better archer.
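To make the two measures of variability concrete, here is a small sketch that computes the range and the standard deviation for the 30 scores from the frequency distribution slide. It also computes the plain average distance from the mean, which is the simpler idea the archery example describes. It assumes the population form of the standard deviation (pstdev); the names are illustrative.

```python
from statistics import mean, pstdev

# The 30 scores from the frequency distribution slide.
scores = [48, 47, 46, 45, 45, 44, 44, 44, 44, 44, 43, 43, 43, 43, 42,
          42, 42, 42, 41, 41, 41, 40, 40, 39, 39, 38, 38, 37, 36, 35]

# Range: distance between the top and bottom score.
score_range = max(scores) - min(scores)

# Standard deviation: square root of the average squared distance from the mean.
standard_deviation = pstdev(scores)

# Average distance from the mean (the "average those distances" idea).
center = mean(scores)
average_distance = sum(abs(s - center) for s in scores) / len(scores)

print(score_range, round(standard_deviation, 2), round(average_distance, 2))
```

The two spread measures are close but not identical; the standard deviation is never smaller than the average distance because squaring gives extra weight to scores far from the mean.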
16. The Normal Distribution
Imagine the SAT scores for the millions of people who take it. Some people do
extremely well. Some do extremely poorly. Most do average. If you graphed the SAT
scores of everyone who takes it the way we graphed scores earlier in this presentation
(with the X’s), you would get a shape like this. This is called the “bell curve” or the
“normal distribution.”
Normal distribution is a distribution of scores in which the mean, median, and mode
are equal and the scores distribute themselves symmetrically in a bell-shaped curve.
17. Normal distribution
There is an interesting relationship between the percent of people with a score and
standard deviation (roughly, the typical deviation from the average score). About 68%
(blue on this graph) of the test takers fall within one standard deviation of the mean.
This is true for any measure that yields a normal distribution (height of women or
height of men, IQ, ACT, SAT, number of pushups people can do, etc.).
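The 68% figure can be checked with a short sketch using NormalDist from Python's standard library. It uses a normal distribution with mean 0 and standard deviation 1; any other mean and standard deviation would give the same proportion.

```python
from statistics import NormalDist

# A normal distribution with mean 0 and standard deviation 1.
standard_normal = NormalDist(mu=0, sigma=1)

# Proportion of scores between one standard deviation below
# and one standard deviation above the mean.
within_one_sd = standard_normal.cdf(1) - standard_normal.cdf(-1)

print(f"{within_one_sd:.1%}")  # prints 68.3%
```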
18. Normal distribution
Statisticians have taken advantage of this relationship between average and standard
deviation in the development of standardized test scoring.
20. Raw score
The number of items an individual answered
correctly on a standardized test or subtest.
This is a number that has not been interpreted.
The statistical procedures which follow allow us
to interpret a raw score in relation to other
people who have taken the test.
You cannot make any interpretation of a simple
raw score—you need to know the information
that follows in order to interpret the score.
21. Percentiles
Percentile or percentile rank: a ranking that compares an
individual’s score with the scores of all the others who
have taken the test.
Your percentile rank tells you how you did in relation to
others. A percentile rank of 80 means you did as well as or
better than 80% of those who took the test.
Percentile is not the same as percentage (the number of
correct out of the total number of items). Percentile is a
comparison of people.
Percentile bands are ranges of percentile scores on
standardized tests. They allow for the fact that test
scores are an estimation rather than a perfect measure.
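A minimal sketch of the difference between percentile and percentage, using the 30 scores from the frequency distribution slide. The function name percentile_rank is hypothetical, and it follows the "as well as or better than" wording above; published tests may compute percentile ranks slightly differently.

```python
# The 30 scores from the 50 point test on the frequency distribution slide.
scores = [48, 47, 46, 45, 45, 44, 44, 44, 44, 44, 43, 43, 43, 43, 42,
          42, 42, 42, 41, 41, 41, 40, 40, 39, 39, 38, 38, 37, 36, 35]


def percentile_rank(score, all_scores):
    """Percent of test takers who scored at or below the given score."""
    at_or_below = sum(1 for s in all_scores if s <= score)
    return 100 * at_or_below / len(all_scores)


raw_score = 44
percentage_correct = 100 * raw_score / 50    # compares the score to the test
rank = percentile_rank(raw_score, scores)    # compares the score to other people

print(f"percentage correct: {percentage_correct:.0f}%")
print(f"percentile rank: {rank:.0f}")
```

A student with 44 items correct has 88% correct, but a percentile rank of about 83, because 25 of the 30 students scored 44 or lower.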
22. Grade equivalents
A score that is determined by comparing
an individual’s score on a standardized
test to the scores of students in a
particular age group; the first digit
represents the grade and the second the
month of that school year.
A first grader who has the grade
equivalent of 2.4 on a reading test has
the same score as the AVERAGE 2nd
grader in the fourth month of school.
23. Grade equivalents
When a student scores above grade
level on a test, that does not mean the
student is ready for higher level work. It
simply means that the student has
mastered the work at his/her grade level.
24. Standard score
A description of performance on a
standardized test that uses standard
deviation as a basic unit.
Z-score: the number of standard
deviation units from the mean (average).
T-score: standard score that defines 50
as the mean and 10 as the standard
deviation.
Imagine various scores of this sort and then go back to the chart that shows you
the percentage of people in relation to standard deviation. That will show you the
utility of this concept.
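The two definitions translate directly into arithmetic. The sketch below is only an illustration: the function names are hypothetical, and the mean of 500 and standard deviation of 100 are made-up values, not figures for any real test.

```python
def z_score(raw_score, mean, standard_deviation):
    """Number of standard deviation units the score is from the mean."""
    return (raw_score - mean) / standard_deviation


def t_score(z):
    """Rescale a z-score so that the mean is 50 and the standard deviation is 10."""
    return 50 + 10 * z


# Hypothetical test with a mean of 500 and a standard deviation of 100.
z = z_score(600, mean=500, standard_deviation=100)
t = t_score(z)

print(z, t)  # prints 1.0 60.0
```

A score one standard deviation above the mean has a z-score of 1 and a T-score of 60, whatever the original scale of the test was.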
26. Stanines
A stanine (standard nine) is a description of an
individual’s standardized test performance that
uses a scale ranging from 1-9 points.
These 9 points are spread equitably across the
range of the bell curve and are standardized in
terms of how many standard deviations they
cover.
While stanines are a simple way to report
scores and they reflect the fact that students
who score closely in percentile ranks are really
not much different, they also may be overly
reductive—they reduce a whole test to a single
digit.
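One common convention, which the slide does not spell out and is assumed here, treats the middle stanines as bands half a standard deviation wide, centered on a stanine of 5 at the mean. Under that assumption a z-score can be converted to a stanine as follows; the function name is hypothetical.

```python
import math


def stanine_from_z(z):
    """Convert a z-score to a stanine, assuming bands half a standard deviation wide.

    Stanine 5 is centered on the mean; stanines 1 and 9 take in the tails.
    """
    band = math.floor(2 * z + 5.5)
    return max(1, min(9, band))


print(stanine_from_z(0.0))   # score at the mean
print(stanine_from_z(0.3))   # slightly above the mean
print(stanine_from_z(-2.5))  # far below the mean
```

This shows why stanines can be "overly reductive": z-scores of 0.3 and 0.7 both become a 6, even though the raw scores behind them differ.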
29. Consistency of test results
Reliability
How reliable is your bathroom scale? If it reads 100 pounds, then 75 pounds, then
105 pounds for the same object, then it’s not reliable.
30. Standard error of
measurement
True score: the hypothetical average of
an individual’s scores if repeated testing
under ideal conditions were possible.
Standard error of measurement: the
range of scores within which an
individual’s true score is likely to fall.
(also called confidence interval, score
band, or profile band).
This is how statisticians get around the idea that no test situation is
perfect.
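As a hedged illustration, a widely used formula (not given on the slide) estimates the standard error of measurement from the test's standard deviation and its reliability coefficient, and the score band is then the observed score plus or minus that amount. The numbers below are made up for the example, and the function names are hypothetical.

```python
import math


def standard_error_of_measurement(standard_deviation, reliability):
    """Estimate SEM as SD * sqrt(1 - reliability); an assumed textbook formula."""
    return standard_deviation * math.sqrt(1 - reliability)


def score_band(observed_score, sem):
    """Range within which the true score is likely to fall (one SEM either side)."""
    return observed_score - sem, observed_score + sem


# Hypothetical test: standard deviation of 10 and reliability of 0.91.
sem = standard_error_of_measurement(10, 0.91)
low, high = score_band(70, sem)

print(f"SEM is about {sem:.1f}; score band is about {low:.0f} to {high:.0f}")
```

The more reliable the test, the smaller the standard error and the narrower the band around the observed score.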
31. Validity
An evaluation of the adequacy and
appropriateness of the interpretations
and uses of assessment results.
Focuses on USE of test results, not the
test itself.
Three types: content, predictive, and
construct
32. Content validity
A test’s ability to representatively sample the
content that is taught and measure the extent to
which learners understand it.
In other words, how close is the test to what
was actually taught? That is its content validity.
If it is not close to the curriculum, then its
results do not measure how much students
understood the curriculum and it is not
appropriate to use it for that purpose.
33. Predictive validity
An indicator of a test’s ability to gauge
future performance.
You find this by correlating the test score
with grades. For example, the extent to
which the SAT predicts college grades is
.42 (which is not very high—a perfect
correlation is 1.0).
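The correlation mentioned above is usually a Pearson correlation coefficient. The sketch below computes one from scratch; the function name is hypothetical, and the test scores and grades are invented purely for illustration, not real SAT data.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of paired deviations from each mean.
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariation / (spread_x * spread_y)


# Invented example data: admission test scores and later course grades.
test_scores = [420, 480, 510, 560, 600, 650]
later_grades = [2.9, 2.4, 3.1, 2.8, 3.6, 3.2]

print(round(pearson_correlation(test_scores, later_grades), 2))
```

A value near 1.0 would mean the test predicts later grades almost perfectly; a value near 0 would mean it predicts them hardly at all.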
34. Construct validity
An indicator of the logical connection between a
test and what it is designed to measure.
A reading test that has students actually read a
passage has construct validity because of the
logical connection between the every day task
(reading) and the process on the test (reading
and responding to questions about a text).
A reading test does not typically assess
mathematical knowledge, and would not have
construct validity for that purpose.
36. Cultural minorities and high-
stakes tests
Tests can be harmful to people from
cultural minorities in the US since people
from many of these cultures tend to
score lower than non-minority students.
Assessment bias: qualities of an assessment instrument that offend or
unfairly penalize a group of students because of the students’ gender,
economic class, race, ethnicity, etc.
37. Content
Content tends to be biased toward white
middle class students.
For example, a 4th grade proficiency
test writing prompt asked students
to write about going camping. Students
without this kind of experience would not
be able to write well about this topic.
38. Procedures
Students’ cultures can affect their
response to the whole procedure of
testing.
Time limits can be unfair to students for
whom English is a foreign language or
for students with learning disabilities.
39. Test use
Test results can be used to discriminate
against groups of students.
40. Eliminating bias
Examine test content and analyze the
results. For example, if most students
get a certain answer wrong, they were
probably not taught that concept.
Adapt testing procedures if possible and
teach students about the test.
Use more than just standardized tests
for making decisions—use alternative
assessment data as well.
41. Creating bias-free tests
Culture-fair/culture-free test: a test
without cultural bias.
This is very difficult to create
42. Purposes of standardized
testing
Student assessment: how students in one
classroom compare to students across the
nation (or even around the world)
Diagnosis: a student’s specific strengths and
weaknesses
Selection and placement: standardized tests
may determine whether or not a student is
invited to take advanced classes
Program evaluation: how students in a
particular school or program compare with
students across the nation
Accountability: how students of a particular
teacher score on a test.
44. Achievement
What achievement tests have you taken?
Determining the extent to which students
have mastered a content area
Comparing the performance of students
with others across the country
Tracking student progress over time
Determining if students have the
background knowledge to begin
instruction in particular areas
Identifying learning problems
Achievement tests are designed to measure and communicate how much students
have learned in specified content areas.
45. Diagnostic tests
Usually given individually
Usually have more subtests and
measure knowledge of a particular area
in detail
Provide information that teachers can
use in order to instruct the child or
address weaknesses
Diagnostic tests are designed to provide a detailed description of learners’ strengths
and weaknesses in specified skill areas.
46. Aptitude tests
Standardized tests designed to predict
the potential for future learning and
measure general abilities developed
over long periods of time.
Different from intelligence tests because
aptitude is just one aspect of
intelligence.
ACT and SAT are aptitude tests
(measuring your aptitude for college-
level work).
47. Intelligence tests
A type of aptitude test
Standardized tests designed to measure
an individual’s ability to acquire
knowledge, capacity to think and reason
in the abstract, and ability to solve novel
problems.
Individually administered tests are
typically more accurate than group-
administered intelligence tests.
48. Teacher’s job
Make sure the test content matches
learning goals
Prepare students for the test
Administer tests according to instructions
Communicate results to students and
their caregivers so their results can be
used for educational decision-making
50. Accountability
Accountability: the process of requiring students to demonstrate
that they have met specified standards and holding teachers
responsible for students’ performance.
Adequate yearly progress: objectives for yearly improvement for all students
and for specific groups, such as students from major ethnic and racial groups,
students with disabilities, students from low-income families, and
students whose English is limited.
High stakes tests: standardized tests designed to measure the
extent to which standards are being met (minimum competency testing).
Standards-based education: the process of focusing curricula and instruction
on pre-determined goals.
51. Accountability
When you have high stakes testing
(where the results of the test determine
something major about a person’s life
such as whether or not he/she
graduates), teachers often “teach to the
test.” Instruction becomes boring and
students with a low tolerance for busy
work struggle.
Some say these tests help us to know
which schools are effective.
52. Standardized testing with
alternative formats
These are time-consuming to score but
they address critics’ concerns that
multiple choice formats are a limited way
to know what a student knows.
53. Implications for teachers
You will have to deal with standardized testing
—it’s here to stay.
You will need more content knowledge
as well as knowledge about how children learn.
You need to know about how to interpret
standardized tests.
It’s possible to find a creative way to deal with
standardized testing (see p. 544 in your book
for an example).
54. New directions in assessment
Authentic assessment: measurement of
important abilities using procedures that
simulate the application of these abilities
to real-life problems.
Constructed response formats:
assessment procedures that require the
student to create an answer instead of
selecting an answer from a set of
choices.
55. Praxis and new directions
Praxis II uses constructed responses
Praxis III is supposed to be authentic
assessment (you actually teach a lesson
while being observed).
These practices are not without their critics—you don’t have a lot of time to
do the constructed responses in Praxis II and the pass rate for Praxis III is
very high—perhaps the universities are doing a good job of preparing pre-
service teachers after all…
56. Vocabulary
Accountability, Confidence interval, Grade equivalent, Median, Percentile bands, Standardized tests, Validity
Achievement tests, Construct validity, High stakes test, Mode, Predictive validity, Standard score, Variability
Adequate yearly progress, Constructed responses, Histogram, National norms, Range, Standards-based education, Z-score
Aptitude tests, Culture-fair/culture-free test, Intelligence tests, Normal distribution, Reliability, Stanine
Assessment bias, Diagnostic tests, Mean, Norming group, Standard deviation, True score
Authentic assessment, Frequency distribution, Measures of central tendency, Percentile, Standard error of measurement, T-score