CHARACTERISTICS OF A GOOD TEST
Ann Meredith U. Garcia, MD
Reliability vs. validity
¤  A degree of test reliability is requisite to validity.
VALID ≠ RELIABLE
TEST RELIABILITY
Definition
¤  Consistency with which a test measures what it is
measuring
¤  Consistent, constant, and repeatable results?
¤  Over time? Across different versions of a test? Among scale
items?
¤  Goal: As close as possible to measuring the TRUE SCORE
TEST RELIABILITY
Sources of error
¤  Examinee: is a HUMAN BEING
¤  Examiner: is a HUMAN BEING
¤  Examination: is designed by & for HUMAN BEINGS
Sources of measurement error:
1. OBJECTIVITY OF SCORING
¤  Different scorers produce the same score if they apply
the same scoring key
¤  More objective scoring → more accurate score
TEST RELIABILITY
Score1? Score2? Score3?
Sources of measurement error:
2. SAMPLING OF CONTENT
¤  A teacher cannot really construct 2 forms of a test that are independent of each other.
¤  Another teacher’s test usually would differ even more.
TEST RELIABILITY
Sources of measurement error:
2. SAMPLING OF CONTENT
¤  If the test plan is fairly detailed and followed carefully → content sampling for an objective test with a large number of items should be reasonably adequate
TEST RELIABILITY
Sources of measurement error:
3. TEMPORAL INFLUENCES
¤  TEMPORAL STABILITY – scores should fluctuate very little
over a reasonably brief time interval
TEST RELIABILITY
[Figure: TEST A given on 2 occasions, with a score from each]
Methods of estimating reliability:
1. TEST-RETEST METHOD
¤  Estimates TEMPORAL RELIABILITY – correlation between scores on
the 2 trials
¤  COEFFICIENT OF STABILITY – measure of the correspondence of
scores obtained at 2 different times
TEST RELIABILITY
Methods of estimating reliability:
1. TEST-RETEST METHOD
¤  Assesses the external consistency of a test
¤  NO information about possible effects of inadequate
sampling of contents and processes
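As a rough illustration of the coefficient of stability, the sketch below correlates scores from 2 administrations of the same test. The function name `pearson_r` and the score lists are hypothetical, made up for this example; the correlation is the ordinary Pearson coefficient, and the function assumes the scores in each list are not all identical.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    # Raises ZeroDivisionError if either list has no spread at all.
    return sxy / sqrt(sxx * syy)


# Hypothetical scores of 5 examinees on TEST A, trial 1 and trial 2.
trial_1 = [10, 12, 15, 18, 20]
trial_2 = [11, 12, 14, 19, 20]

# Coefficient of stability: correlation between the 2 trials.
stability = pearson_r(trial_1, trial_2)
print(f"coefficient of stability = {stability:.2f}")
```

The same function could be applied to the scores on 2 alternate forms, or to the scores given by 2 raters, since each of those methods also correlates 2 sets of scores.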
TEST RELIABILITY
Methods of estimating reliability:
2. ALTERNATE-FORMS METHOD
¤  COEFFICIENT OF STABILITY AND EQUIVALENCE –
correlation of scores on the 2 forms would reveal not only
temporal influences (delayed testing) but also content
differences (immediate & delayed testing)
[Figure: TEST AX and TEST AY, 2 forms of the same test, with a score from each]
TEST RELIABILITY
Methods of estimating reliability:
3. INTER-RATER RELIABILITY
¤  Different and equally competent raters evaluate the
results of a single test → correlate the 2 sets of scores
¤  Assesses the consistency of how a measuring system is
implemented
TEST RELIABILITY
[Figure: Score1 and Score2 from 2 raters, and their AVERAGE]
Methods of estimating reliability:
4. SPLIT-HALF METHOD
¤  Also called ODD-EVEN RELIABILITY
¤  r = estimate of content reliability for half of the test
¤  R = estimate of content reliability for the whole test
[Figure: scores on the odd-numbered items (TEST A odd) and the even-numbered items (TEST A even) are correlated to give r]
TEST RELIABILITY
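A minimal sketch of the odd-even split follows, assuming each examinee's answers are stored as a list of 1 (correct) and 0 (incorrect). The half-test correlation r is stepped up to the whole-test estimate R with the Spearman-Brown correction, R = 2r / (1 + r). The function names and the response data are hypothetical.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def split_half(item_scores):
    """Return (r, R) for an odd-even split.

    item_scores holds one list of item scores per examinee.
    r is the correlation between the two half-test scores;
    R is the Spearman-Brown estimate for the whole test.
    """
    odd = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    r = pearson_r(odd, even)
    R = 2 * r / (1 + r)
    return r, R


# Hypothetical responses: 5 examinees, 6 items (1 = correct, 0 = incorrect).
responses = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 0, 0, 0, 0, 0],
]

r, R = split_half(responses)
print(f"half-test r = {r:.2f}, whole-test R = {R:.2f}")
```

With a positive r, R comes out larger than r because the whole test is twice as long as either half.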
Methods of estimating reliability:
5. KUDER-RICHARDSON APPROACH
¤  Extension of the split-half method performed on all combinations of questions → average of split-half estimates that would be expected from making all possible divisions of a test into halves
¤  Measure of internal consistency reliability for measures with dichotomous choices
¤  Kuder-Richardson formula 20: $\mathrm{KR20} = \frac{k}{k-1}\left(1 - \frac{\sum_{j=1}^{k} p_j q_j}{\sigma^2}\right)$
k = number of questions
pj = proportion of people in the sample who answered question j correctly
qj = proportion of people in the sample who didn’t answer question j correctly (qj = 1 − pj)
σ2 = variance of the total scores of all the people taking the test
TEST RELIABILITY
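The Kuder-Richardson calculation can be sketched as below, using formula 20: KR-20 = [k / (k − 1)] × [1 − (Σ pj·qj) / σ²], where pj is taken as the proportion of examinees answering item j correctly and qj = 1 − pj. The function name `kr20` and the response data are hypothetical. The sketch uses the population variance of the total scores, which is an assumption; some texts use the sample variance instead, and the result changes slightly.

```python
from statistics import pvariance


def kr20(item_scores):
    """Kuder-Richardson formula 20 for dichotomous (0/1) items.

    item_scores holds one list of 0/1 item scores per examinee.
    """
    n = len(item_scores)         # number of examinees
    k = len(item_scores[0])      # number of questions
    totals = [sum(row) for row in item_scores]
    # Assumption: population variance of total scores.
    # Raises ZeroDivisionError below if every examinee has the same total.
    var_total = pvariance(totals)
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in item_scores) / n  # proportion correct
        sum_pq += p * (1 - p)                       # q = 1 - p
    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Hypothetical responses: 5 examinees, 6 items (1 = correct, 0 = incorrect).
responses = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 0, 0, 0, 0, 0],
]

print(f"KR-20 = {kr20(responses):.2f}")
```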
Advantages & disadvantages
TEST RELIABILITY
Which method should be used?
• Stability of test scores over time
• Consistency of scores over different test forms
• Go-togetherness of test items
TEST RELIABILITY
Factors affecting reliability:
1. LENGTH OF TEST
TEST RELIABILITY
¤  Larger sampling of responses with equally good items or greater length of test → higher reliability
¤  Reliability does NOT increase in a straight line (SPEARMAN-BROWN FORMULA)
¤  Reliability of .50 increases to .67 when the length of a test is doubled
¤  Assumption: Subjects do not become exhausted and lose motivation
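The effect of test length can be checked with the Spearman-Brown formula, predicted reliability = n·r / (1 + (n − 1)·r), where r is the current reliability and n is the factor by which the test is lengthened with comparable items. The function name `spearman_brown` is hypothetical; this is a sketch of the formula, not a claim about how the original figures were produced.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability when test length is multiplied by length_factor.

    Assumes the added items are as good as the existing ones.
    """
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)


# Doubling a test whose reliability is .50 gives about .67.
print(round(spearman_brown(0.50, 2), 2))

# Each further doubling adds less: the gain is not a straight line.
for n in (1, 2, 4, 8):
    print(n, round(spearman_brown(0.50, n), 2))
```

Going from 1 to 2 times the length raises the estimate from .50 to .67, while going from 2 to 4 times raises it only from .67 to .80.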
Factors affecting reliability:
2. RANGE OF TALENT
TEST RELIABILITY
¤  Validity and reliability coefficients can be expected to increase as range of talent of the subjects increases
¤  Homogeneous group → lower reliability coefficient
¤  Wider spread of scores → higher reliability
¤  Sample of subjects should be representative of those for whom one wishes to draw conclusions about individual differences
Factors affecting reliability:
3. TIME LIMITS
TEST RELIABILITY
¤  Affects estimates from the SPLIT-HALF and KUDER-RICHARDSON approaches
¤  If some students do not have time to try some items → proportion of correct responses for those items will decrease and the score spread will increase → positive although spurious influence on the size of the reliability coefficient
Factors affecting reliability:
4. DIFFICULTY OF TEST ITEMS
TEST RELIABILITY
¤  Narrow score distributions → low reliability
[Figure: score distributions for a VERY DIFFICULT TEST and a VERY EASY TEST]
Other factors affecting reliability
TEST RELIABILITY
Best reliability
TEST RELIABILITY
Definition
¤  Usefulness or applicability of the testing procedure in
order to serve the needs of its users
PRACTICALITY
Economy of:
✓ Time
✓ Effort
✓ Money
1. Ease of CONSTRUCTION
¤  Demands adequate time and informed talent
PRACTICALITY
2. Ease of ADMINISTRATION
¤  Clarity and simplicity
¤  Ease of reading instructions
3. Ease of SCORING
¤  Subjective vs. objective?
4. Ease of INTERPRETATION and APPLICATION
¤  Meaningfulness of scores obtained from the test
¤  Misinterpreted or misapplied test results – of little value and
may be harmful to certain individuals or groups
PRACTICALITY
Definition
¤  RELIABILITY and VALIDITY – often discussed separately but
sometimes you will see them both referred to as aspects
of generalizability
¤  Extent to which one can generalize the results of a measure or a test used with a particular group to other tests or other groups
GENERALIZABILITY
Thank you!

 

Characteristics of a Good Test

  • 1. CHARACTERISTICS OF A GOOD TEST Ann Meredith U. Garcia, MD
  • 2. Reliability vs. validity ¤  A degree of test reliability is requisite to validity. VALID ≠ RELIABLE TEST RELIABILITY
  • 3. Definition ¤  Consistency with which a test measures what it is measuring ¤  Consistent, constant, and repeatable results? ¤  Over time? Across different versions of a test? Among scale items? TEST RELIABILITY
  • 4. Definition ¤  Consistency with which a test measures what it is measuring ¤  Consistent, constant, and repeatable results? ¤  Goal: As close as possible to measuring the TRUE SCORE TEST RELIABILITY
  • 5. Sources of error TEST RELIABILITY is a HUMAN BEING Examinee
  • 6. Sources of error TEST RELIABILITY is a HUMAN BEING Examinee Examiner
  • 7. Sources of error TEST RELIABILITY is designed by & for HUMAN BEINGS Examinee Examiner Examination
  • 8. Sources of measurement error: 1. OBJECTIVITY OF SCORING ¤  Different scorers produce the same score if they apply the same scoring key ¤  More objective scoring → more accurate score TEST RELIABILITY Score1? Score2? Score3?
  • 9. Sources of measurement error: 2. SAMPLING OF CONTENT ¤  A teacher cannot really construct 2 forms of a test that are independent of each other. ¤  Another teacher’s test usually would differ even more. TEST RELIABILITY
  • 10. Sources of measurement error: 2. SAMPLING OF CONTENT ¤  If the test plan is fairly detailed and followed carefully → content sampling for an objective test with a large number of items should be reasonably adequate TEST RELIABILITY
  • 11. Sources of measurement error: 3. TEMPORAL INFLUENCES ¤  TEMPORAL STABILITY – scores should fluctuate very little over a reasonably brief time interval TEST RELIABILITY TEST A Score? TEST A Score?
  • 12. Methods of estimating reliability: 1. TEST-RETEST METHOD ¤  Estimates TEMPORAL RELIABILITY – correlation between scores on the 2 trials ¤  COEFFICIENT OF STABILITY – measure of the correspondence of scores obtained at 2 different times TEST RELIABILITY TEST A Score? TEST A Score?
  • 13. Methods of estimating reliability: 1. TEST-RETEST METHOD ¤  Assesses the external consistency of a test ¤  NO information about possible effects of inadequate sampling of contents and processes TEST RELIABILITY TEST A Score? TEST A Score?
  • 14. Methods of estimating reliability: 2. ALTERNATE-FORMS METHOD ¤  COEFFICIENT OF STABILITY AND EQUIVALENCE – correlation of scores on the 2 forms would reveal not only temporal influences (delayed testing) but also content differences (immediate & delayed testing) TEST AX Score? TEST AY Score? TEST RELIABILITY
  • 15. Methods of estimating reliability: 3. INTER-RATER RELIABILITY ¤  Different and equally competent raters evaluate the results of a single test → correlate the 2 sets of scores ¤  Assesses the consistency of how a measuring system is implemented TEST RELIABILITY Score1? Score2? AVERAGE
  • 16. Methods of estimating reliability: 4. SPLIT-HALF METHOD ¤  Also called ODD-EVEN RELIABILITY ¤  r = estimate of content reliability for half of the test ¤  R = estimate of content reliability for the whole test TEST RELIABILITY TEST A (odd items) Score? TEST A (even items) Score? r
  • 17. Methods of estimating reliability: 4. SPLIT-HALF METHOD TEST RELIABILITY
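The split-half procedure on slides 16 and 17 can be sketched as follows. This is a minimal illustration, not the presenter's own computation: it assumes that r is the Pearson correlation between odd-item and even-item totals and that R is obtained with the standard Spearman-Brown step-up, R = 2r / (1 + r). The function names and the `scores` data are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def split_half_reliability(item_scores):
    """Return (r, R) for a list of per-examinee item-score lists.

    r is the half-test correlation; R is the whole-test estimate
    from the Spearman-Brown step-up (assumed here, see lead-in).
    """
    odd_totals = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    r = pearson_r(odd_totals, even_totals)
    R = 2 * r / (1 + r)
    return r, R


# Hypothetical data: 5 examinees, 6 items scored 1 (correct) or 0 (incorrect)
scores = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
r, R = split_half_reliability(scores)
```

Because each half contains only half of the items, r understates the reliability of the full test; the step-up to R corrects for that.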
  • 18. Methods of estimating reliability: 5. KUDER-RICHARDSON APPROACH ¤  Extension of the split-half method performed on all combinations of questions → average of split-half estimates that would be expected from making all possible divisions of a test into halves ¤  Measure of internal consistency reliability for measures with dichotomous choices TEST RELIABILITY TEST A (odd items) Score? TEST A (even items) Score? r
  • 19. Methods of estimating reliability: 5. KUDER-RICHARDSON APPROACH k = number of questions pj = proportion of people in the sample who answered question j correctly qj = proportion of people in the sample who didn’t answer question j correctly (qj = 1 − pj) σ² = variance of the total scores of all the people taking the test TEST RELIABILITY TEST A (odd items) Score? TEST A (even items) Score? r
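The symbols on slide 19 are those of the Kuder-Richardson formula 20 (KR-20), which is not reproduced in the transcript. A minimal sketch under stated assumptions: the standard form KR-20 = (k / (k − 1)) · (1 − Σ pj·qj / σ²) is used, pj is taken as the proportion of examinees who answered item j correctly, and σ² is computed as the population variance (dividing by the number of examinees); some texts use the sample variance instead. The function name and data are hypothetical.

```python
def kr20(item_scores):
    """KR-20 for dichotomous items (1 = correct, 0 = incorrect).

    item_scores: one list of item scores per examinee.
    Assumes the population variance of total scores (divide by n).
    """
    n = len(item_scores)     # number of examinees
    k = len(item_scores[0])  # number of questions

    # p_j: proportion of examinees answering item j correctly; q_j = 1 - p_j
    p = [sum(row[j] for row in item_scores) / n for j in range(k)]
    sum_pq = sum(pj * (1 - pj) for pj in p)

    # Variance of the examinees' total scores
    totals = [sum(row) for row in item_scores]
    mean_total = sum(totals) / n
    variance = sum((t - mean_total) ** 2 for t in totals) / n

    return (k / (k - 1)) * (1 - sum_pq / variance)


# Hypothetical data: 5 examinees, 6 dichotomous items
scores = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
reliability = kr20(scores)
```

The function does not guard against a zero variance (every examinee earning the same total), in which case the coefficient is undefined.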
  • 21. Which method should be used? • Stability of test scores over time • Consistency of scores over different test forms • Go-togetherness of test items TEST RELIABILITY
  • 22. Factors affecting reliability: 1. LENGTH OF TEST TEST RELIABILITY ¤  Larger sampling of responses with equally good items or greater length of test → higher reliability ¤  Reliability does NOT increase in a straight line (SPEARMAN-BROWN FORMULA) ¤  Reliability of .50 increases to .67 when the length of a test is doubled ¤  Assumption: Subjects do not become exhausted and lose motivation
  • 23. Factors affecting reliability: 2. RANGE OF TALENT TEST RELIABILITY ¤  Validity and reliability coefficients can be expected to increase as range of talent of the subjects increases ¤  Homogeneous group → lower reliability coefficient ¤  Wider spread of scores → higher reliability ¤  Sample of subjects should be representative of those for whom one wishes to draw conclusions about individual differences
  • 24. Factors affecting reliability: 3. TIME LIMITS TEST RELIABILITY ¤  Applies to the SPLIT-HALF and KUDER-RICHARDSON approaches ¤  If some students do not have time to try some items → proportion of correct responses for those items will decrease and the score spread will increase → positive although spurious influence on the size of the reliability coefficient
  • 25. Factors affecting reliability: 4. DIFFICULTY OF TEST ITEMS TEST RELIABILITY ¤  Narrow score distributions → low reliability VERY DIFFICULT TEST VERY EASY TEST
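The figure on slide 22 can be checked against the general Spearman-Brown formula. As a worked example (the symbols n, for the factor by which the test is lengthened, and R_n, for the predicted reliability, are notation introduced here, not taken from the slides):

```latex
R_n = \frac{n\,r}{1 + (n - 1)\,r}
\qquad\Longrightarrow\qquad
R_2 = \frac{2(0.50)}{1 + (2 - 1)(0.50)} = \frac{1.00}{1.50} \approx 0.67
```

Tripling the same test gives R_3 = 1.50 / 2.00 = 0.75, a smaller gain than the first doubling, which illustrates why reliability does not increase in a straight line.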
  • 26. Other factors affecting reliability TEST RELIABILITY
  • 28. Definition ¤  Usefulness or applicability of the testing procedure in order to serve the needs of its users PRACTICALITY Economy of: ✓ Time ✓ Effort ✓ Money
  • 29. 1. Ease of CONSTRUCTION ¤  Demands adequate time and informed talent PRACTICALITY 2. Ease of ADMINISTRATION ¤  Clarity and simplicity ¤  Ease of reading instructions 3. Ease of SCORING ¤  Subjective vs. objective?
  • 30. 4. Ease of INTERPRETATION and APPLICATION ¤  Meaningfulness of scores obtained from the test ¤  Misinterpreted or misapplied test results – of little value and may be harmful to certain individuals or groups PRACTICALITY
  • 31. Definition ¤  RELIABILITY and VALIDITY – often discussed separately, but both are sometimes treated as aspects of generalizability ¤  Extent to which one can generalize the results of a measure or a test used with a particular group to other tests or other groups GENERALIZABILITY