SlideShare a Scribd company logo
1
Topic 7: Factors affecting test scores &
simple test evaluation for class teacher
(Bachman, 1990, Chapters 6 &7)
(Madsen, 1983, Chapter 9)
Hoa Nguyen
2
Factors affecting test scores
(test bias)
Communicative language ability
Test scores
Test method facets
Random factors
Personal attributes
3
• Test method facets such as test format/ response format, input
format, length of test
– Same testing skills and sub-skills but different test formats or
response formats (may be in favour of one group but not the other)
• Test content
– Culture background
• Culture features embedded in test content may biased one group but not
the other
– Background knowledge
• Eg. an IELTS test listening section 3 about process of doing an
assignment in Western education or a mini-lecture about birds in
Tasmania, a TOEFL iBT lecturer or a lecture about Pluto.
• In ESP testing, it is necessary to distinguish between language proficiency
and background knowledge and the test designed should define specific
language ability as part of language ability to be tested. Score from an
ESP reading test should not be interpreted as general reading ability.
Part 1: Factors affecting test scores
4
• Cognitive characteristics
– Field independence
• Field independence is “the extent to which a person perceives part of a
field as discrete from the surrounding field as a whole, rather embedded,
or… the extent to which a person perceives analytically” (p.275).
• Test takers who are field independent are likely perform better than those
field dependence especially discrete point tests.
– Ambiguity tolerance
• Ambiguity tolerance is “a person’s ability to function rationally and calmly
in a situation in which interpretation of all stimuli is not clear” (p. 227).
• Individuals with high ambiguity tolerance might perform better than those
with low ambiguity tolerance.
– Eg. Cloze test (clearer till the end of test, dictation: some words could not be
recognized until the second or last item of reading).
• This evidence is not so clear with multiple-choice response format though
research show a significant (but low) correlation between scores on a
measure of ambiguity and multiple choice measures of E proficiency.
5
• Random factor: testing environment
• Native language background, ethnicity, sex,
and age
• These characteristics of test takers are not
facets of test methods  they cannot be
considered as possible sources of
measurement error.
• All these can only provide information about
how language learning varies with age,
ethnicity, sex, and other individual
characteristics.
6
Part 2: Practical approach of test
evaluation for class teacher
• Preparing an item analysis
– score all the test takers
– rank them from the highest to the lowest and
divide them equally into three groups high –
middle - low
– record students’ responses: from high group
to low group; if it is a multiple choice, circle
the correct option.
7
Item 1 High group Low group
A / ///
(B) //// //
C // /
D / //
X / //
8
1. Difficulty level
High correct + Low
correct
Hc + Lc
Total number in sample
(H +L)
or
N
Eg. In this case, (5+2):20 = 7/20 = 35%
Note. easy ≥90%, difficult ≤ 30%
2. Discrimination level
High correct - Low
correct
Hc - Lc
Total number in sample
(H +L)
or
N
Eg. In this case, (5-2):20 = 3/20 = 15%, ≥15% is acceptable;
from 10% to 15% is questionable.
Note. ≤ 10% discrimination is not acceptable
9
• Distractor evaluation
• Weak distractors: poor discrimination or not
chosen by test takers.
• Only one or two distractors attract attention.
Reasons
– the item has been revised well in the class and
everyone masters it: the answer is obvious
– no other choices seem likely to be the answers
– obviously impossible distractor(s)
• If many items are left blank near the end of the
test, the test should be shortened or more time
should be given to it.
10
1. Practice exercises
Item 1 (D) Item 2 (A) Item 3 (B) Item 4 (C)
High Low High Low High Low High Low
A // // // //// //
B / // //// / //
C //// // / // //// //// //// //
D //// / //// / / /// //
X / / / //
Questions:
1. Calculate the level of difficulty for each of the four items
Item 1 = Item 2 = Item 3 = Item 4 =
Which of these items are too difficult, and which are too easy?
2. Calculate the discrimination of each item.
Which item has the poorest discrimination? ………………………….
Which item has unsatisfactory discrimination? ………………………….
Which item(s) has borderline? ………………………….
3. Look at the distractors on the four items.
In which are they the most effective? ………………………….
In which are they the least effective? ………………………….
4. Is there any item with negative discrimination? If so, which one?
………………………………………………………………………
5. Which item did the fewest students leave blank? ………………………….
Which item did the most leave blank? ………………………….
11
Question 1
• 1 = 50% 2 = 39% 3 = 33% 4 = 89%
• none = too difficult none = too easy
Question 2
• 1 = 5%; 2 = negative (no discrimination); 3 = 22%; 4 = 11%
• 2 = poorest discrimination and it is unsatisfactory
• no other unsatisfactory or borderline items
Question 3
• 3 = most effective 4 = least effective
Question 4
• Yes; 2 = negative
Question 5
• Fewest leave blank = 4 Most leave blank = 2 and 3

More Related Content

What's hot

Models of Teacher Education
Models of Teacher EducationModels of Teacher Education
Models of Teacher Education
René Sánchez
 
Testing for Language Teachers Arthur Hughes
Testing for Language TeachersArthur HughesTesting for Language TeachersArthur Hughes
Testing for Language Teachers Arthur Hughes
Rajputt Ainee
 

What's hot (20)

Dynamic assessment
Dynamic assessmentDynamic assessment
Dynamic assessment
 
Plurilingualism
PlurilingualismPlurilingualism
Plurilingualism
 
OBJECTIVITY OF TESTS ppt.pptx
OBJECTIVITY OF TESTS ppt.pptxOBJECTIVITY OF TESTS ppt.pptx
OBJECTIVITY OF TESTS ppt.pptx
 
Qualities of a Good Test
Qualities of a Good TestQualities of a Good Test
Qualities of a Good Test
 
Meaning of Test, Testing and Evaluation
Meaning of Test, Testing and EvaluationMeaning of Test, Testing and Evaluation
Meaning of Test, Testing and Evaluation
 
Construction of Achievement Test
Construction of Achievement TestConstruction of Achievement Test
Construction of Achievement Test
 
Testing and evaluation
Testing and evaluationTesting and evaluation
Testing and evaluation
 
Oral test
Oral testOral test
Oral test
 
Stages of test development
Stages of test developmentStages of test development
Stages of test development
 
Educational Assessment and Evaluation (Constructing Objective Test Items)
Educational Assessment and Evaluation (Constructing Objective Test Items)Educational Assessment and Evaluation (Constructing Objective Test Items)
Educational Assessment and Evaluation (Constructing Objective Test Items)
 
Learning strategies
Learning strategiesLearning strategies
Learning strategies
 
Models of Teacher Education
Models of Teacher EducationModels of Teacher Education
Models of Teacher Education
 
Testing for Language Teachers Arthur Hughes
Testing for Language TeachersArthur HughesTesting for Language TeachersArthur Hughes
Testing for Language Teachers Arthur Hughes
 
Oral work
Oral workOral work
Oral work
 
Testing and evaluation
Testing and evaluationTesting and evaluation
Testing and evaluation
 
approaches and methods in English Language Teaching E.L.T
approaches and methods in English Language Teaching E.L.Tapproaches and methods in English Language Teaching E.L.T
approaches and methods in English Language Teaching E.L.T
 
Types of test items and principles for constructing test items
Types of test  items and principles for constructing test items Types of test  items and principles for constructing test items
Types of test items and principles for constructing test items
 
Principles of assessment
Principles  of assessmentPrinciples  of assessment
Principles of assessment
 
Preparing The Test Items
Preparing The Test ItemsPreparing The Test Items
Preparing The Test Items
 
Steps to design a test
Steps to design a testSteps to design a test
Steps to design a test
 

Similar to Factors affecting test scores and test evaluation in class

Ed103format3 complete summary.docx[1]
Ed103format3 complete summary.docx[1]Ed103format3 complete summary.docx[1]
Ed103format3 complete summary.docx[1]
mark maneb
 
Types of psychological tests and Assessments.pptx
Types of psychological tests and Assessments.pptxTypes of psychological tests and Assessments.pptx
Types of psychological tests and Assessments.pptx
sharmilA722422
 
Multiplechoiceitems
MultiplechoiceitemsMultiplechoiceitems
Multiplechoiceitems
KAthy Cea
 
Multiplechoiceitems
MultiplechoiceitemsMultiplechoiceitems
Multiplechoiceitems
KAthy Cea
 
Test construction 1
Test construction 1Test construction 1
Test construction 1
Arnel Rivera
 

Similar to Factors affecting test scores and test evaluation in class (20)

Ed103format3 complete summary.docx[1]
Ed103format3 complete summary.docx[1]Ed103format3 complete summary.docx[1]
Ed103format3 complete summary.docx[1]
 
Objective Types of test...
Objective Types of test...Objective Types of test...
Objective Types of test...
 
Designing classroom language test
Designing classroom language testDesigning classroom language test
Designing classroom language test
 
Langguage assessment( final version)
Langguage assessment( final version)Langguage assessment( final version)
Langguage assessment( final version)
 
Writing multiple choice questions 3
Writing multiple choice questions 3Writing multiple choice questions 3
Writing multiple choice questions 3
 
Types of psychological tests and Assessments.pptx
Types of psychological tests and Assessments.pptxTypes of psychological tests and Assessments.pptx
Types of psychological tests and Assessments.pptx
 
Designing classroom language tests
Designing classroom language testsDesigning classroom language tests
Designing classroom language tests
 
Objective type of test
Objective type of testObjective type of test
Objective type of test
 
Multiplechoiceitems
MultiplechoiceitemsMultiplechoiceitems
Multiplechoiceitems
 
Multiplechoiceitems
MultiplechoiceitemsMultiplechoiceitems
Multiplechoiceitems
 
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
 
Test construction 1
Test construction 1Test construction 1
Test construction 1
 
tryout test, item analysis (difficulty, discrimination)
tryout test, item analysis (difficulty, discrimination)tryout test, item analysis (difficulty, discrimination)
tryout test, item analysis (difficulty, discrimination)
 
Item analysis2
Item analysis2Item analysis2
Item analysis2
 
Questions
QuestionsQuestions
Questions
 
short answer Questions
short answer Questionsshort answer Questions
short answer Questions
 
Test production process - Approaches to language testing - Techniques of lang...
Test production process - Approaches to language testing - Techniques of lang...Test production process - Approaches to language testing - Techniques of lang...
Test production process - Approaches to language testing - Techniques of lang...
 
Multiple choice items in testing and evaluation
Multiple choice items in testing and evaluationMultiple choice items in testing and evaluation
Multiple choice items in testing and evaluation
 
Fundamental concepts and principles in Language Testing
Fundamental concepts and principles in Language TestingFundamental concepts and principles in Language Testing
Fundamental concepts and principles in Language Testing
 
Test item formats: definition, types, pros and cons
Test item formats: definition, types, pros and consTest item formats: definition, types, pros and cons
Test item formats: definition, types, pros and cons
 

More from steadyfalcon

More from steadyfalcon (20)

SHRM_Chapter 2
SHRM_Chapter 2SHRM_Chapter 2
SHRM_Chapter 2
 
Performance Appraisal
Performance AppraisalPerformance Appraisal
Performance Appraisal
 
Kỹ năng tuyển dụng
Kỹ năng tuyển dụngKỹ năng tuyển dụng
Kỹ năng tuyển dụng
 
Đánh giá công việc
Đánh giá công việcĐánh giá công việc
Đánh giá công việc
 
SHRM_Chapter 01.ppt
SHRM_Chapter 01.pptSHRM_Chapter 01.ppt
SHRM_Chapter 01.ppt
 
Hiểu con người trong công việc
Hiểu con người trong công việcHiểu con người trong công việc
Hiểu con người trong công việc
 
Đào tạo nguồn nhân lực
Đào tạo nguồn nhân lựcĐào tạo nguồn nhân lực
Đào tạo nguồn nhân lực
 
Đánh gia công việc
Đánh gia công việcĐánh gia công việc
Đánh gia công việc
 
Big Five Personality Traits.ppt
Big Five Personality Traits.pptBig Five Personality Traits.ppt
Big Five Personality Traits.ppt
 
MẪU HỆ THỐNG KPI KẾ HOẠCH NHÂNSỰ.pptx
MẪU HỆ THỐNG KPI KẾ HOẠCH NHÂNSỰ.pptxMẪU HỆ THỐNG KPI KẾ HOẠCH NHÂNSỰ.pptx
MẪU HỆ THỐNG KPI KẾ HOẠCH NHÂNSỰ.pptx
 
LỘ TRÌNH ĐÀO TẠO.pptx
LỘ TRÌNH ĐÀO TẠO.pptxLỘ TRÌNH ĐÀO TẠO.pptx
LỘ TRÌNH ĐÀO TẠO.pptx
 
Mẫu báo cáo giáo dục
Mẫu báo cáo giáo dụcMẫu báo cáo giáo dục
Mẫu báo cáo giáo dục
 
Ky nang quan ly theo muc tieu
Ky nang quan ly theo muc tieuKy nang quan ly theo muc tieu
Ky nang quan ly theo muc tieu
 
Customer_driven_marketing_strategy.pptx
Customer_driven_marketing_strategy.pptxCustomer_driven_marketing_strategy.pptx
Customer_driven_marketing_strategy.pptx
 
Customer-Driven-Marketing-Strategy.ppt
Customer-Driven-Marketing-Strategy.pptCustomer-Driven-Marketing-Strategy.ppt
Customer-Driven-Marketing-Strategy.ppt
 
Reference List edited 2016
Reference List edited 2016Reference List edited 2016
Reference List edited 2016
 
Washback
WashbackWashback
Washback
 
Measurement terms
Measurement termsMeasurement terms
Measurement terms
 
Purpose of a test
Purpose of a testPurpose of a test
Purpose of a test
 
THE ROLES OF ESP TEACHERS
THE ROLES OF ESP TEACHERSTHE ROLES OF ESP TEACHERS
THE ROLES OF ESP TEACHERS
 

Recently uploaded

plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated crops
parmarsneha2
 
Accounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdfAccounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdf
YibeltalNibretu
 
Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
Avinash Rai
 

Recently uploaded (20)

Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdfINU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptxMatatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
 
plant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated cropsplant breeding methods in asexually or clonally propagated crops
plant breeding methods in asexually or clonally propagated crops
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
Accounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdfAccounting and finance exit exam 2016 E.C.pdf
Accounting and finance exit exam 2016 E.C.pdf
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
 

Factors affecting test scores and test evaluation in class

  • 1. 1 Topic 7: Factors affecting test scores & simple test evaluation for class teacher (Bachman, 1990, Chapters 6 &7) (Madsen, 1983, Chapter 9) Hoa Nguyen
  • 2. 2 Factors affecting test scores (test bias) Communicative language ability Test scores Test method facets Random factors Personal attributes
  • 3. 3 • Test method facets such as test format/ response format, input format, length of test – Same testing skills and sub-skills but different test formats or response formats (may be in favour of one group but not the other) • Test content – Culture background • Culture features embedded in test content may biased one group but not the other – Background knowledge • Eg. an IELTS test listening section 3 about process of doing an assignment in Western education or a mini-lecture about birds in Tasmania, a TOEFL iBT lecturer or a lecture about Pluto. • In ESP testing, it is necessary to distinguish between language proficiency and background knowledge and the test designed should define specific language ability as part of language ability to be tested. Score from an ESP reading test should not be interpreted as general reading ability. Part 1: Factors affecting test scores
  • 4. 4 • Cognitive characteristics – Field independence • Field independence is “the extent to which a person perceives part of a field as discrete from the surrounding field as a whole, rather embedded, or… the extent to which a person perceives analytically” (p.275). • Test takers who are field independent are likely perform better than those field dependence especially discrete point tests. – Ambiguity tolerance • Ambiguity tolerance is “a person’s ability to function rationally and calmly in a situation in which interpretation of all stimuli is not clear” (p. 227). • Individuals with high ambiguity tolerance might perform better than those with low ambiguity tolerance. – Eg. Cloze test (clearer till the end of test, dictation: some words could not be recognized until the second or last item of reading). • This evidence is not so clear with multiple-choice response format though research show a significant (but low) correlation between scores on a measure of ambiguity and multiple choice measures of E proficiency.
  • 5. 5 • Random factor: testing environment • Native language background, ethnicity, sex, and age • These characteristics of test takers are not facets of test methods  they cannot be considered as possible sources of measurement error. • All these can only provide information about how language learning varies with age, ethnicity, sex, and other individual characteristics.
  • 6. 6 Part 2: Practical approach of test evaluation for class teacher • Preparing an item analysis – score all the test takers – rank them from the highest to the lowest and divide them equally into three groups high – middle - low – record students’ responses: from high group to low group; if it is a multiple choice, circle the correct option.
  • 7. 7 Item 1 High group Low group A / /// (B) //// // C // / D / // X / //
  • 8. 8 1. Difficulty level High correct + Low correct Hc + Lc Total number in sample (H +L) or N Eg. In this case, (5+2):20 = 7/20 = 35% Note. easy ≥90%, difficult ≤ 30% 2. Discrimination level High correct - Low correct Hc - Lc Total number in sample (H +L) or N Eg. In this case, (5-2):20 = 3/20 = 15%, ≥15% is acceptable; from 10% to 15% is questionable. Note. ≤ 10% discrimination is not acceptable
  • 9. 9 • Distractor evaluation • Weak distractors: poor discrimination or not chosen by test takers. • Only one or two distractors attract attention. Reasons – the item has been revised well in the class and everyone masters it: the answer is obvious – no other choices seem likely to be the answers – obviously impossible distractor(s) • If many items are left blank near the end of the test, the test should be shortened or more time should be given to it.
  • 10. 10 1. Practice exercises Item 1 (D) Item 2 (A) Item 3 (B) Item 4 (C) High Low High Low High Low High Low A // // // //// // B / // //// / // C //// // / // //// //// //// // D //// / //// / / /// // X / / / // Questions: 1. Calculate the level of difficulty for each of the four items Item 1 = Item 2 = Item 3 = Item 4 = Which of these items are too difficult, and which are too easy? 2. Calculate the discrimination of each item. Which item has the poorest discrimination? …………………………. Which item has unsatisfactory discrimination? …………………………. Which item(s) has borderline? …………………………. 3. Look at the distractors on the four items. In which are they the most effective? …………………………. In which are they the least effective? …………………………. 4. Is there any item with negative discrimination? If so, which one? ……………………………………………………………………… 5. Which item did the fewest students leave blank? …………………………. Which item did the most leave blank? ………………………….
  • 11. 11 Question 1 • 1 = 50% 2 = 39% 3 = 33% 4 = 89% • none = too difficult none = too easy Question 2 • 1 = 5%; 2 = negative (no discrimination); 3 = 22%; 4 = 11% • 2 = poorest discrimination and it is unsatisfactory • no other unsatisfactory or borderline items Question 3 • 3 = most effective 4 = least effective Question 4 • Yes; 2 = negative Question 5 • Fewest leave blank = 4 Most leave blank = 2 and 3