SlideShare a Scribd company logo
LOGO
Justifying the Use of an English
Language Placement Test
with an Assessment Use Argument
Presented by:
Parisa Mehran
Alzahra University
Tehran, Iran
Placement Tests
Placement test is considered as a fairly high-stakes test (Bachman & Palmer, 1996, 2010),
and the social consequences of placement decisions are of great significance and need to
be investigated, since such decisions can affect the lives of students (Murray, 2001;
Schmitz & delMas, 1991).
Thus, as Brown (1989) emphasizes, it is important to make valid placement decisions to
avoid mismatches that can occur due to inappropriate placement testing.
Validity in Language Assessment
Validity has been regarded as the most significant and complex concept in language
assessment, and it has always been under investigation by language testing experts
and researchers (e.g., Bachman, 1990, 2004, 2005; Bachman & Palmer, 2010; Chapelle,
1999; Cronbach & Meehl, 1955; Kane, 2001, 2012, 2013; Lado, 1961; Messick, 1989). As a
result, the conception of validity has undergone a series of reinterpretations
throughout the history of language assessment.
Argument-based Approaches to Validity
Argument-based approaches to validity are based on the concept of a validity argument which has
been used in the process of validation for more than twenty years (e.g., Bachman, 2005; Cronbach,
1988, 1989; House, 1980, Kane, 1992; Mislevy, 2003; Mislevy, Steinberg, & Almond, 2002, 2003).
The process is comparable to building a legal case to persuade a judge or a jury. The process of
validation thus becomes ongoing: As long as a test is alive, the collection of relevant evidence is
going to be continued (Bachman & Palmer, 2010). Hence, any kind of relevant evidence is gathered
to show the plausibility of the intended interpretations and uses (Bachman, 2004; Kane, 2012).
Purpose of the Study
Using Bachman and Palmer's (2010) AUA as a framework, this study examined the
validity of an English language placement test, which is composed of the Oxford Quick
Placement Test (OQPT) and a follow-up oral examination. The following research
question was addressed:
To what extent are the OQPT and the oral examination justifiable in placing students
appropriately according to Bachman and Palmer's (2010) AUA?
Methodology: Participants and Setting
This study was conducted at one of the English language institutes in Tehran, Iran.
Three hundred and thirty-two (332) newcomers to the institute who had to take the
placement test participated in this study, and 15 of them were interviewed. The head of
the institute, three examiners of the placement test, ten teachers, and four experts also
attended the current study.
Methodology: Instrumentation
1. OQPT
2. Oral Examination
3. TOEFL
4. Interview
5. Observation
The AUA for Justifying the Placement Test
As Vongpumivitch (2010) and Wang et al. (2012) remark, Bachman and Palmer's (2010) framework
has a top-down approach. That is, the four claims are discussed from the perspective of test
development rather than from that of test use. Therefore, in this study, where the aim is to evaluate
the overall usefulness of an English language placement test, the four claims are presented in the
reverse order from that in Bachman and Palmer (2010).
It should also be mentioned that as Bachman and Palmer (2010) emphasize not all of the warrants
and rebuttals listed by them will necessarily be needed in the AUA for any given test. Moreover,
due to practical research limitations, not all of the warrants and rebuttals have been investigated in
the present study.
Claim 4: The assessment records of the OQPT and the oral examination are consistent across
different assessment tasks, different aspects of the assessment procedure, and across different
groups of test takers.
Claim 4: Assessment Records
Consistency
The first warrant for this claim is that the procedures for administrating the OQPT and the oral
examination are followed consistently across different occasions and for all test takers:
 The observation of how the OQPT and the oral examination were administered as well as the
interviews with the examiners and the head of the institute revealed that there are a set of
administrative procedures which are strictly followed by the test administrators; hence, the
administrative procedures are consistent across different occasions and for all test taker groups.
Consistency (cont.)
Another warrant to support the consistency claim involves the scoring criteria and procedures:
 The criteria and procedures for rating test takers' performance on the OQPT are well specified and
are adhered to. Since the OQPT is in multiple-choice format, its rating criteria and procedures are
quite objective, and scoring is done based on an answer key.
 However, the criteria and procedures for the oral examination are not well specified and are quite
subjective. A set of questions have been devised based on the coursebook. In this sense, the
administration of the oral examination is consistent, yet its scoring process does not follow any
specific procedures. This lack of evidence could be a rebuttal here.
Consistency (cont.)
With respect to the warrant of rater training:
Raters undergo training before administrating the placement test.
However, one of the examiners was not satisfied with the training process, and she
believed that what matters is just the examiner's marketing skill to "grab more
customers" for the institute.
Consistency (cont.)
To check internal consistency of the items, as another warrant:
 Kuder-Richardson formula 20 (KR-20) was used.
 The reliability coefficient (KR-20) obtained for the OQPT Version 1 was .93 and for the OQPT Version 2 was
.88 showing that the OQPT has reasonable internal consistency reliability.
 Two main test item indices (item difficulty and item discrimination) were used in the test item analysis for the
OQPT.
 In terms of difficulty, the items have been ordered from the easiest to the most difficult, and this is in line
with the view of experts, examiners, and test takers. The analysis of item difficulty showed that, by and
large, most of the items were difficult (56% in the Version 1 and 63% in Version 2), and both versions of the
OQPT did not have a fairly acceptable distribution of difficulty.
 In terms of item discrimination, the analysis of items demonstrated that the items in the OQPT Version 1
had good amount of discrimination (75%). However, the OQPT Version 2 contained less items of fair
discrimination (46%) in comparison to the first version.
Consistency (cont.)
 In regard to inter-rater and intra-rater reliability, inconsistencies between and within human
raters are not a source of measurement error because the scoring of the OQPT is done through
an answer key.
 In the case of the oral examination, Cronbach's alpha was computed. The alpha was .93 for
inter-rater reliability and .96 for intra-rater reliability indicating that despite the lack of
consistent criteria and procedures for the oral examination, the oral examination has
reasonable internal consistency reliability.
 The analysis of the two versions of the OQPT brought a serious rebuttal to the consistency claim.
Cronbach's alpha was calculated and the alpha was .000 indicating that the two versions are not
equivalent. Two experts also remarked that the second version is much more difficult, and that
it cannot be considered as equivalent to the first version.
Claim 3: The OQPT scores and the oral examination results can be interpreted as test takers' level
of English proficiency and place them in their appropriate level. Such interpretations are
meaningful, impartial, generalizable, relevant, and sufficient.
Claim 3: Interpretations
Meaningfulness
 Interpretations about the construct to be assessed are meaningful if they are based on a frame of reference like a
course syllabus, a needs analysis, or a general theory of language ability. The head of the institute and the
examiners believed that the OQPT is a suitable placement test because Oxford University Press is the publisher of
both this test and the coursebook taught in the institute (i.e., English Result). The teaching method followed in the
institute is Communicative Language Teaching (CLT), and speaking is thus the primary focus; therefore, the oral
examination has a significant role in placement testing. However, lack of a listening section in the OQPT can be a
rebuttal to the meaningfulness of the interpretations.
Impartiality
To support the impartiality warrant, the assessment items/tasks should be checked for response
formats or content that may either favor or disfavor some test takers, and test takers should be
treated impartially in terms of all aspects of test administration.
 As mentioned earlier, interviews with test takers and examiners revealed that due to the lack of
a specific rubric for the oral examination, it was believed that interpretations about the ability to
be assessed were not made without bias against any groups of test takers.
 No complaint was made in regard to the appropriateness of the content. Bias and item
sensitivity studies need to be done for deeper analysis.
Generalizability
According to the generalizability warrant, the characteristics of the assessment items/tasks (e.g., input,
expected response, type of interaction) as well as the scoring criteria and procedures of the test tasks
should correspond closely to those of the target language use (TLU) domain.
 It might be that the items/tasks in the OQPT and the oral examination do not exactly correspond to all
of the TLU tasks; however, the content of the OQPT and the oral examination corresponds to the
content of the textbook taught in the institute. Moreover, in the oral examination, test takers' real world
language performance is examined. Thus, this can to some extent support the generalizability warrant.
 Here, it is worth mentioning that some of the teachers, examiners, and experts asserted that a TOEFL or
an IELTS test was a better test, indeed an ideal one, due to having writing and specially listening parts,
for placement testing, but because of time limitations, they could not be used as placement tests.
Consequently, a TOEFL test was given to those who had taken the OQPT, and the results manifested
that the correlation between the OQPT and TOEFL scores was not high (r=.66).
Relevance
The forth warrant is relevance according to which the assessment-based interpretations should
provide information that is relevant and helpful for the decision makers to make decisions.
 Based on the interviews conducted with examiners, it was revealed that the OQPT scores were
not sufficiently helpful in placement testing. This could have been a rebuttal to the relevance
warrant; yet, the oral examination is in support of this warrant since it is quite helpful for the
examiners to make their placement decisions. As said before, lack of a rubric is a serious
rebuttal.
Sufficiency
The fifth warrant demands that the assessment-based interpretations should provide
information that is sufficient for the decision makers to make decisions.
Again, because the process of placement testing includes both a written and an oral
test, sufficient information is obtained to make placement decisions.
Claim 2: The placement decisions that are made on the basis of the OQPT scores and the oral
examination results are sensitive to local values and equitable to all stakeholders.
Claim 2: Decisions
Values Sensitivity
According to values sensitivity warrant, the existing community values and relevant legal
requirements should be carefully and critically considered in the admission decisions that are to be
made and in determining the relative seriousness of false positive and false negative classification
errors.
 The interviews and observations revealed that the process of placement testing does not
guarantee test fairness considerations, and in this phase just absorbing more clients is
important. Hence, it is possible to have potential false positives (i.e., individuals are placed in a
level higher than their actual level) and false negatives (i.e., individuals are placed in a level
lower than their actual level). Usually the latter happens, because it is less risky; nevertheless, if
at the time of placement testing the institute does not have the level appropriate for the test
taker (due to some limitations, such as space, time, lack of students, all the levels are not always
covered), the test taker will be placed in a higher level.
Equitability
 Due to the subjectivity of the oral examination, it cannot be claimed that the same cut scores
and decision rules are used to classify all students who have applied for the same program, and
no other considerations are used. The economic and practical considerations always exist.
Consequently, test takers and other stakeholders are not fully informed about how decisions will
be made and whether decisions are actually made in the way described to them.
Claim 1: The consequences of the placement decisions based on the OQPT scores and the oral
examination results are beneficial to all stakeholders that use the test, including the test takers, the
institution, the teachers, and the supervisor.
Claim 1: Consequences
Beneficence
The first warrant is that the consequences of using the assessment that are specific to each
stakeholder will be beneficial.
Some of the test takers were interviewed after their placement test and their attendance in
their classes. On the whole, they were satisfied, although some of them believed that their
level was higher, and that they were placed in a lower level. To them, the reason is basically
cost-effectiveness. Most of the teachers believed that their students were homogenous in the
class; yet, two teachers strongly disagreed and they believed that their classes were not at all
homogenous especially at higher levels.
Conclusion
 Based on the evidence gathered, this study found that the assessment records of the OQPT and the oral
examination were consistent across different assessment tasks, different aspects of the assessment
procedure, and across different groups of test takers. However, the oral examination required a set of
clear criteria.
 Moreover, the analysis of the two versions of the OQPT manifested that their parallelism was under
question which could threaten the consistency of the assessment records.
 The findings also indicated that the OQPT scores and the oral examination results could be interpreted
somewhat as test takers' level of English proficiency and could place them in their appropriate levels.
Such interpretations were meaningful, impartial, relevant, and sufficient, although lack of a listening
section in the OQPT and lack of a rubric for the oral examination could be threatening, and
generalizability of the results was to some extent under question.
 In addition, the placement decisions were not sensitive to local values and equitable to all stakeholders
due to the subjectivity of the oral examination and the economic considerations of the institute.
 Lastly, by and large, the consequences of the placement decisions were beneficial to all stakeholders that
use the test, which is composed of the test takers, the institution, the teachers, and the supervisor.
Local Implications
To support the intended test use, it would be helpful to examine the negative evidence that has
been identified in this study and resolve the identified issues or mitigate the potential negative
impact of unresolved issues. For instance, in the case of the current placement test, the oral
examination can be given based on a rubric, a listening section can be added to the written test,
and economic considerations can become less important for the institute, and therefore the
intended uses of the test would become much more justifiable with stronger evidence.
The Merits/Demerits of Using an AUA
Finally, this study serves as an illustration of the merits/demerits of using an AUA.
 On the whole, the AUA provides a sound framework in which the validity of a test and its use
can be justified and the test developers/users can be accountable for their test.
 With the help of the AUA framework, the process of assessment justification becomes more
comprehensive, systematic, and coherent. In fact, one of the merits of an AUA is its clear
articulation about which types of evidence should be collected for which claims or warrants.
 However, in the process of assessment justification, an AUA demands that the evaluation of the
test be done at many levels and this needs different types of data and analyses. Thus, in practice,
the complexity of the justification study may be a big challenge for a single researcher.
Thank you for your attention!

More Related Content

What's hot

Material evaluation (Chapter 1)
Material evaluation (Chapter 1)Material evaluation (Chapter 1)
Material evaluation (Chapter 1)
H. R. Marasabessy
 
RELIABILITY IN LANGUAGE TESTING-TITIN'S GROUP
RELIABILITY IN LANGUAGE TESTING-TITIN'S GROUPRELIABILITY IN LANGUAGE TESTING-TITIN'S GROUP
RELIABILITY IN LANGUAGE TESTING-TITIN'S GROUPTitin Rohayati
 
Types of Tests,
Types of Tests, Types of Tests,
Types of Tests,
Wardah Azhar
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroomCidher89
 
Test Techniques
Test TechniquesTest Techniques
Test Techniques
Ariane Mitschek
 
Achieving beneficial blackwash
Achieving beneficial blackwashAchieving beneficial blackwash
Achieving beneficial blackwash
Maury Martinez
 
Materials Evaluation Intro
Materials Evaluation IntroMaterials Evaluation Intro
Materials Evaluation Intro
PEDRO MENDOZA DE LEON
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
Cidher89
 
material evaluation IN ESP
material evaluation IN ESP material evaluation IN ESP
material evaluation IN ESP
ZaryabQureshi3
 
Testing reading
Testing readingTesting reading
Testing reading
Maury Martinez
 
Types of syllabus design
Types of syllabus designTypes of syllabus design
Types of syllabus design
رسول المنهي
 
The framework of materials and method & Current approaches to materials and m...
The framework of materials and method & Current approaches to materials and m...The framework of materials and method & Current approaches to materials and m...
The framework of materials and method & Current approaches to materials and m...
RBLmadev Class 2018
 
Validity, reliablility, washback
Validity, reliablility, washbackValidity, reliablility, washback
Validity, reliablility, washback
Maury Martinez
 
Approaches to language Assessment
Approaches to language Assessment Approaches to language Assessment
Approaches to language Assessment
AliAlZurfi
 
Materials development in language teaching
Materials development in language teachingMaterials development in language teaching
Materials development in language teachingFadi Sukkari
 
Teaching Language Skill: Speaking and Writing
Teaching Language Skill: Speaking and WritingTeaching Language Skill: Speaking and Writing
Teaching Language Skill: Speaking and Writing
UNY Pasca PBI-B
 
Chapter 2(principles of language assessment)
Chapter 2(principles of language assessment)Chapter 2(principles of language assessment)
Chapter 2(principles of language assessment)Kheang Sokheng
 
Principles of language assessment
Principles of language assessmentPrinciples of language assessment
Principles of language assessmentAstrid Caballero
 
Lecture 3 Materials Development and Adaptation
Lecture 3 Materials Development and AdaptationLecture 3 Materials Development and Adaptation
Lecture 3 Materials Development and Adaptation
Salina Saharudin
 

What's hot (20)

Material evaluation (Chapter 1)
Material evaluation (Chapter 1)Material evaluation (Chapter 1)
Material evaluation (Chapter 1)
 
RELIABILITY IN LANGUAGE TESTING-TITIN'S GROUP
RELIABILITY IN LANGUAGE TESTING-TITIN'S GROUPRELIABILITY IN LANGUAGE TESTING-TITIN'S GROUP
RELIABILITY IN LANGUAGE TESTING-TITIN'S GROUP
 
Types of Tests,
Types of Tests, Types of Tests,
Types of Tests,
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
Test Techniques
Test TechniquesTest Techniques
Test Techniques
 
Achieving beneficial blackwash
Achieving beneficial blackwashAchieving beneficial blackwash
Achieving beneficial blackwash
 
Materials Evaluation Intro
Materials Evaluation IntroMaterials Evaluation Intro
Materials Evaluation Intro
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
material evaluation IN ESP
material evaluation IN ESP material evaluation IN ESP
material evaluation IN ESP
 
Testing reading
Testing readingTesting reading
Testing reading
 
Types of syllabus design
Types of syllabus designTypes of syllabus design
Types of syllabus design
 
The framework of materials and method & Current approaches to materials and m...
The framework of materials and method & Current approaches to materials and m...The framework of materials and method & Current approaches to materials and m...
The framework of materials and method & Current approaches to materials and m...
 
Assessing language skills
Assessing language skillsAssessing language skills
Assessing language skills
 
Validity, reliablility, washback
Validity, reliablility, washbackValidity, reliablility, washback
Validity, reliablility, washback
 
Approaches to language Assessment
Approaches to language Assessment Approaches to language Assessment
Approaches to language Assessment
 
Materials development in language teaching
Materials development in language teachingMaterials development in language teaching
Materials development in language teaching
 
Teaching Language Skill: Speaking and Writing
Teaching Language Skill: Speaking and WritingTeaching Language Skill: Speaking and Writing
Teaching Language Skill: Speaking and Writing
 
Chapter 2(principles of language assessment)
Chapter 2(principles of language assessment)Chapter 2(principles of language assessment)
Chapter 2(principles of language assessment)
 
Principles of language assessment
Principles of language assessmentPrinciples of language assessment
Principles of language assessment
 
Lecture 3 Materials Development and Adaptation
Lecture 3 Materials Development and AdaptationLecture 3 Materials Development and Adaptation
Lecture 3 Materials Development and Adaptation
 

Viewers also liked

Cr comp placement_test
Cr comp placement_testCr comp placement_test
Cr comp placement_test
Indah Widyaningrum
 
Survival Skills for College
Survival Skills for CollegeSurvival Skills for College
Survival Skills for College
TTC Webmaster
 
CUADERNILLO DE ESCRITURA
CUADERNILLO DE ESCRITURACUADERNILLO DE ESCRITURA
CUADERNILLO DE ESCRITURA
Alma Rosa Herrera Ruiz
 
Citywide Safety Solutions
Citywide Safety SolutionsCitywide Safety Solutions
Citywide Safety Solutions
Erin Planting
 
Placement test
Placement testPlacement test
Placement testThao Le
 
TEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkTEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of Work
Volker Hirsch
 

Viewers also liked (6)

Cr comp placement_test
Cr comp placement_testCr comp placement_test
Cr comp placement_test
 
Survival Skills for College
Survival Skills for CollegeSurvival Skills for College
Survival Skills for College
 
CUADERNILLO DE ESCRITURA
CUADERNILLO DE ESCRITURACUADERNILLO DE ESCRITURA
CUADERNILLO DE ESCRITURA
 
Citywide Safety Solutions
Citywide Safety SolutionsCitywide Safety Solutions
Citywide Safety Solutions
 
Placement test
Placement testPlacement test
Placement test
 
TEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkTEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of Work
 

Similar to Justifying the Use of an English Language Placement Test with an Assessment Use Argument

Assessment on SLA Theoretical Framework
Assessment on SLA Theoretical FrameworkAssessment on SLA Theoretical Framework
Assessment on SLA Theoretical FrameworkGerardo Zavalla
 
CHARACTERISTICS OF A GOOD INSTRUMENT
CHARACTERISTICS OF A GOOD INSTRUMENTCHARACTERISTICS OF A GOOD INSTRUMENT
CHARACTERISTICS OF A GOOD INSTRUMENT
Musfera Nara Vadia
 
Principles of language assessment
Principles of language assessmentPrinciples of language assessment
Principles of language assessment
Ameer Al-Labban
 
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
Videoconferencias UTPL
 
CRITERIA OF A GOOD TEST.pptx
CRITERIA OF A GOOD TEST.pptxCRITERIA OF A GOOD TEST.pptx
CRITERIA OF A GOOD TEST.pptx
105351123916DwiHanda
 
The nittygritty of language testing
The nittygritty of language testingThe nittygritty of language testing
The nittygritty of language testing
Marzs
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language Assessment
A Faiz
 
ELTLAE Group 2.pptx
ELTLAE Group 2.pptxELTLAE Group 2.pptx
ELTLAE Group 2.pptx
AhzaPutro
 
Presentation seminar on
Presentation  seminar on Presentation  seminar on
Presentation seminar on Tahsin Hussein
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language Assessment
SubramanianMuthusamy3
 
ASSESSMENT.pptx
ASSESSMENT.pptxASSESSMENT.pptx
ASSESSMENT.pptx
PatrickJamesPelicano2
 
Characteristics of Assessment
Characteristics of Assessment Characteristics of Assessment
Characteristics of Assessment
AliAlZurfi
 
Validity and Reliability
Validity and ReliabilityValidity and Reliability
Validity and Reliability
Maury Martinez
 
Discussion question for meeting two language assessment
Discussion question for meeting two language assessmentDiscussion question for meeting two language assessment
Discussion question for meeting two language assessment
ManasApintamon
 
CLASSROOM ACTIVITIES
CLASSROOM  ACTIVITIESCLASSROOM  ACTIVITIES
CLASSROOM ACTIVITIES
Alfredo Carrion
 
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
Nhận Viết Đề Tài Trọn Gói ZALO 0932091562
 

Similar to Justifying the Use of an English Language Placement Test with an Assessment Use Argument (20)

Theoretical framefinal (autoguardado)
Theoretical framefinal (autoguardado)Theoretical framefinal (autoguardado)
Theoretical framefinal (autoguardado)
 
Assessment on SLA Theoretical Framework
Assessment on SLA Theoretical FrameworkAssessment on SLA Theoretical Framework
Assessment on SLA Theoretical Framework
 
CHARACTERISTICS OF A GOOD INSTRUMENT
CHARACTERISTICS OF A GOOD INSTRUMENTCHARACTERISTICS OF A GOOD INSTRUMENT
CHARACTERISTICS OF A GOOD INSTRUMENT
 
Principles of language assessment
Principles of language assessmentPrinciples of language assessment
Principles of language assessment
 
Week 8 & 9 - Validity and Reliability
Week 8 & 9 - Validity and ReliabilityWeek 8 & 9 - Validity and Reliability
Week 8 & 9 - Validity and Reliability
 
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
 
CRITERIA OF A GOOD TEST.pptx
CRITERIA OF A GOOD TEST.pptxCRITERIA OF A GOOD TEST.pptx
CRITERIA OF A GOOD TEST.pptx
 
The nittygritty of language testing
The nittygritty of language testingThe nittygritty of language testing
The nittygritty of language testing
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language Assessment
 
ELTLAE Group 2.pptx
ELTLAE Group 2.pptxELTLAE Group 2.pptx
ELTLAE Group 2.pptx
 
Presentation seminar on
Presentation  seminar on Presentation  seminar on
Presentation seminar on
 
Language admin
Language  adminLanguage  admin
Language admin
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language Assessment
 
The Components of Test Specifications
The Components of Test SpecificationsThe Components of Test Specifications
The Components of Test Specifications
 
ASSESSMENT.pptx
ASSESSMENT.pptxASSESSMENT.pptx
ASSESSMENT.pptx
 
Characteristics of Assessment
Characteristics of Assessment Characteristics of Assessment
Characteristics of Assessment
 
Validity and Reliability
Validity and ReliabilityValidity and Reliability
Validity and Reliability
 
Discussion question for meeting two language assessment
Discussion question for meeting two language assessmentDiscussion question for meeting two language assessment
Discussion question for meeting two language assessment
 
CLASSROOM ACTIVITIES
CLASSROOM  ACTIVITIESCLASSROOM  ACTIVITIES
CLASSROOM ACTIVITIES
 
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
An Investigation Into The Characteristics Of Multiple-Choice On The Exam Resu...
 

More from Parisa Mehran

How to Transform Your Classroom with AR and VR
How to Transform Your Classroom with AR and VRHow to Transform Your Classroom with AR and VR
How to Transform Your Classroom with AR and VR
Parisa Mehran
 
Design, Implementation, and Evaluation of an English Blended Course
Design, Implementation, and Evaluation of an English Blended CourseDesign, Implementation, and Evaluation of an English Blended Course
Design, Implementation, and Evaluation of an English Blended Course
Parisa Mehran
 
“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging
“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging
“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging
Parisa Mehran
 
Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...
Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...
Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...
Parisa Mehran
 
Debunking Stereotypes about Middle Eastern Women in the EFL Classroom
Debunking Stereotypes about Middle Eastern Women in the EFL Classroom Debunking Stereotypes about Middle Eastern Women in the EFL Classroom
Debunking Stereotypes about Middle Eastern Women in the EFL Classroom
Parisa Mehran
 
Building Global Awareness and Responsible World Citizenship through Augmented...
Building Global Awareness and Responsible World Citizenship through Augmented...Building Global Awareness and Responsible World Citizenship through Augmented...
Building Global Awareness and Responsible World Citizenship through Augmented...
Parisa Mehran
 
Connecting to Puerto Rico through Augmented and Virtual Realities
Connecting to Puerto Rico through Augmented and Virtual RealitiesConnecting to Puerto Rico through Augmented and Virtual Realities
Connecting to Puerto Rico through Augmented and Virtual Realities
Parisa Mehran
 
Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)
Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)
Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)
Parisa Mehran
 
Being an Iranian Woman Today イラン人女性として現代に生きるということ
Being an Iranian Woman Today イラン人女性として現代に生きるということBeing an Iranian Woman Today イラン人女性として現代に生きるということ
Being an Iranian Woman Today イラン人女性として現代に生きるということ
Parisa Mehran
 
A Virtual Trip to the Real Iran
A Virtual Trip to the Real IranA Virtual Trip to the Real Iran
A Virtual Trip to the Real Iran
Parisa Mehran
 
Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)
Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)
Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)
Parisa Mehran
 
How Can I Change the World: Postcards for Puerto Rico
How Can I Change the World: Postcards for Puerto RicoHow Can I Change the World: Postcards for Puerto Rico
How Can I Change the World: Postcards for Puerto Rico
Parisa Mehran
 
Developing a Blended Course: Why Quality Matters
Developing a Blended Course: Why Quality MattersDeveloping a Blended Course: Why Quality Matters
Developing a Blended Course: Why Quality Matters
Parisa Mehran
 
Inspiring Women
Inspiring WomenInspiring Women
Inspiring Women
Parisa Mehran
 
Multimodal e-Feedback in an Online English Course
Multimodal e-Feedback in an Online English CourseMultimodal e-Feedback in an Online English Course
Multimodal e-Feedback in an Online English Course
Parisa Mehran
 
A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017
A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017
A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017
Parisa Mehran
 
Students' Comments on Their Trip to the HeART of the Unseen Iran via the VR ...
Students' Comments on Their Trip to the HeART of  the Unseen Iran via the VR ...Students' Comments on Their Trip to the HeART of  the Unseen Iran via the VR ...
Students' Comments on Their Trip to the HeART of the Unseen Iran via the VR ...
Parisa Mehran
 
Group Number Signs
Group Number SignsGroup Number Signs
Group Number Signs
Parisa Mehran
 
Iran 360
Iran 360Iran 360
Iran 360
Parisa Mehran
 
I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...
I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...
I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...
Parisa Mehran
 

More from Parisa Mehran (20)

How to Transform Your Classroom with AR and VR
How to Transform Your Classroom with AR and VRHow to Transform Your Classroom with AR and VR
How to Transform Your Classroom with AR and VR
 
Design, Implementation, and Evaluation of an English Blended Course
Design, Implementation, and Evaluation of an English Blended CourseDesign, Implementation, and Evaluation of an English Blended Course
Design, Implementation, and Evaluation of an English Blended Course
 
“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging
“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging
“Write 4 Change”: Cultivating Autonomous, Global EFL Learners through Blogging
 
Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...
Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...
Physically Banned yet Virtually Connected at EUROCALL2017: How Technology Ove...
 
Debunking Stereotypes about Middle Eastern Women in the EFL Classroom
Debunking Stereotypes about Middle Eastern Women in the EFL Classroom Debunking Stereotypes about Middle Eastern Women in the EFL Classroom
Debunking Stereotypes about Middle Eastern Women in the EFL Classroom
 
Building Global Awareness and Responsible World Citizenship through Augmented...
Building Global Awareness and Responsible World Citizenship through Augmented...Building Global Awareness and Responsible World Citizenship through Augmented...
Building Global Awareness and Responsible World Citizenship through Augmented...
 
Connecting to Puerto Rico through Augmented and Virtual Realities
Connecting to Puerto Rico through Augmented and Virtual RealitiesConnecting to Puerto Rico through Augmented and Virtual Realities
Connecting to Puerto Rico through Augmented and Virtual Realities
 
Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)
Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)
Are You Listening? Responding to the Challenges of Diversity (Tottori JALT)
 
Being an Iranian Woman Today イラン人女性として現代に生きるということ
Being an Iranian Woman Today イラン人女性として現代に生きるということBeing an Iranian Woman Today イラン人女性として現代に生きるということ
Being an Iranian Woman Today イラン人女性として現代に生きるということ
 
A Virtual Trip to the Real Iran
A Virtual Trip to the Real IranA Virtual Trip to the Real Iran
A Virtual Trip to the Real Iran
 
Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)
Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)
Are You Listening? Responding to the Challenges of Diversity (Kyoto JALT)
 
How Can I Change the World: Postcards for Puerto Rico
How Can I Change the World: Postcards for Puerto RicoHow Can I Change the World: Postcards for Puerto Rico
How Can I Change the World: Postcards for Puerto Rico
 
Developing a Blended Course: Why Quality Matters
Developing a Blended Course: Why Quality MattersDeveloping a Blended Course: Why Quality Matters
Developing a Blended Course: Why Quality Matters
 
Inspiring Women
Inspiring WomenInspiring Women
Inspiring Women
 
Multimodal e-Feedback in an Online English Course
Multimodal e-Feedback in an Online English CourseMultimodal e-Feedback in an Online English Course
Multimodal e-Feedback in an Online English Course
 
A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017
A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017
A Virtual Trip to the Unseen Iran, MAVR SIG Forum, JALT2017
 
Students' Comments on Their Trip to the HeART of the Unseen Iran via the VR ...
Students' Comments on Their Trip to the HeART of  the Unseen Iran via the VR ...Students' Comments on Their Trip to the HeART of  the Unseen Iran via the VR ...
Students' Comments on Their Trip to the HeART of the Unseen Iran via the VR ...
 
Group Number Signs
Group Number SignsGroup Number Signs
Group Number Signs
 
Iran 360
Iran 360Iran 360
Iran 360
 
I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...
I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...
I Am More Than A Stereotype: Actions and Stories for Diversity Awareness and ...
 

Recently uploaded

The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Po-Chuan Chen
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 

Recently uploaded (20)

The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 

Justifying the Use of an English Language Placement Test with an Assessment Use Argument

  • 1. LOGO Justifying the Use of an English Language Placement Test with an Assessment Use Argument Presented by: Parisa Mehran Alzahra University Tehran, Iran
  • 2. Placement Tests Placement test is considered as a fairly high-stakes test (Bachman & Palmer, 1996, 2010), and the social consequences of placement decisions are of great significance and need to be investigated, since such decisions can affect the lives of students (Murray, 2001; Schmitz & delMas, 1991). Thus, as Brown (1989) emphasizes, it is important to make valid placement decisions to avoid mismatches that can occur due to inappropriate placement testing.
  • 3. Validity in Language Assessment Validity has been regarded as the most significant and complex concept in language assessment, and it has always been under investigation by language testing experts and researchers (e.g., Bachman, 1990, 2004, 2005; Bachman & Palmer, 2010; Chapelle, 1999; Cronbach & Meehl, 1955; Kane, 2001, 2012, 2013; Lado, 1961; Messick, 1989). As a result, the conception of validity has undergone a series of reinterpretations throughout the history of language assessment.
  • 4. Argument-based Approaches to Validity Argument-based approaches to validity are based on the concept of a validity argument which has been used in the process of validation for more than twenty years (e.g., Bachman, 2005; Cronbach, 1988, 1989; House, 1980, Kane, 1992; Mislevy, 2003; Mislevy, Steinberg, & Almond, 2002, 2003). The process is comparable to building a legal case to persuade a judge or a jury. The process of validation thus becomes ongoing: As long as a test is alive, the collection of relevant evidence is going to be continued (Bachman & Palmer, 2010). Hence, any kind of relevant evidence is gathered to show the plausibility of the intended interpretations and uses (Bachman, 2004; Kane, 2012).
  • 5. Purpose of the Study Using Bachman and Palmer's (2010) AUA as a framework, this study examined the validity of an English language placement test, which is composed of the Oxford Quick Placement Test (OQPT) and a follow-up oral examination. The following research question was addressed: To what extent are the OQPT and the oral examination justifiable in placing students appropriately according to Bachman and Palmer's (2010) AUA?
  • 6. Methodology: Participants and Setting This study was conducted at one of the English language institutes in Tehran, Iran. Three hundred and thirty-two (332) newcomers to the institute who had to take the placement test participated in this study, and 15 of them were interviewed. The head of the institute, three examiners of the placement test, ten teachers, and four experts also attended the current study.
  • 7. Methodology: Instrumentation 1. OQPT 2. Oral Examination 3. TOEFL 4. Interview 5. Observation
  • 8. The AUA for Justifying the Placement Test As Vongpumivitch (2010) and Wang et al. (2012) remark, Bachman and Palmer's (2010) framework has a top-down approach. That is, the four claims are discussed from the perspective of test development rather than from that of test use. Therefore, in this study, where the aim is to evaluate the overall usefulness of an English language placement test, the four claims are presented in the reverse order from that in Bachman and Palmer (2010). It should also be mentioned that as Bachman and Palmer (2010) emphasize not all of the warrants and rebuttals listed by them will necessarily be needed in the AUA for any given test. Moreover, due to practical research limitations, not all of the warrants and rebuttals have been investigated in the present study.
  • 9. Claim 4: The assessment records of the OQPT and the oral examination are consistent across different assessment tasks, different aspects of the assessment procedure, and across different groups of test takers. Claim 4: Assessment Records
  • 10. Consistency The first warrant for this claim is that the procedures for administrating the OQPT and the oral examination are followed consistently across different occasions and for all test takers:  The observation of how the OQPT and the oral examination were administered as well as the interviews with the examiners and the head of the institute revealed that there are a set of administrative procedures which are strictly followed by the test administrators; hence, the administrative procedures are consistent across different occasions and for all test taker groups.
  • 11. Consistency (cont.) Another warrant to support the consistency claim involves the scoring criteria and procedures:  The criteria and procedures for rating test takers' performance on the OQPT are well specified and are adhered to. Since the OQPT is in multiple-choice format, its rating criteria and procedures are quite objective, and scoring is done based on an answer key.  However, the criteria and procedures for the oral examination are not well specified and are quite subjective. A set of questions have been devised based on the coursebook. In this sense, the administration of the oral examination is consistent, yet its scoring process does not follow any specific procedures. This lack of evidence could be a rebuttal here.
  • 12. Consistency (cont.) With respect to the warrant of rater training: Raters undergo training before administrating the placement test. However, one of the examiners was not satisfied with the training process, and she believed that what matters is just the examiner's marketing skill to "grab more customers" for the institute.
  • 13. Consistency (cont.) To check internal consistency of the items, as another warrant:  Kuder-Richardson formula 20 (KR-20) was used.  The reliability coefficient (KR-20) obtained for the OQPT Version 1 was .93 and for the OQPT Version 2 was .88 showing that the OQPT has reasonable internal consistency reliability.  Two main test item indices (item difficulty and item discrimination) were used in the test item analysis for the OQPT.  In terms of difficulty, the items have been ordered from the easiest to the most difficult, and this is in line with the view of experts, examiners, and test takers. The analysis of item difficulty showed that, by and large, most of the items were difficult (56% in the Version 1 and 63% in Version 2), and both versions of the OQPT did not have a fairly acceptable distribution of difficulty.  In terms of item discrimination, the analysis of items demonstrated that the items in the OQPT Version 1 had good amount of discrimination (75%). However, the OQPT Version 2 contained less items of fair discrimination (46%) in comparison to the first version.
  • 14. Consistency (cont.)  In regard to inter-rater and intra-rater reliability, inconsistencies between and within human raters are not a source of measurement error because the scoring of the OQPT is done through an answer key.  In the case of the oral examination, Cronbach's alpha was computed. The alpha was .93 for inter-rater reliability and .96 for intra-rater reliability indicating that despite the lack of consistent criteria and procedures for the oral examination, the oral examination has reasonable internal consistency reliability.  The analysis of the two versions of the OQPT brought a serious rebuttal to the consistency claim. Cronbach's alpha was calculated and the alpha was .000 indicating that the two versions are not equivalent. Two experts also remarked that the second version is much more difficult, and that it cannot be considered as equivalent to the first version.
  • 15. Claim 3: The OQPT scores and the oral examination results can be interpreted as test takers' level of English proficiency and place them in their appropriate level. Such interpretations are meaningful, impartial, generalizable, relevant, and sufficient. Claim 3: Interpretations
  • 16. Meaningfulness  Interpretations about the construct to be assessed are meaningful if they are based on a frame of reference like a course syllabus, a needs analysis, or a general theory of language ability. The head of the institute and the examiners believed that the OQPT is a suitable placement test because Oxford University Press is the publisher of both this test and the coursebook taught in the institute (i.e., English Result). The teaching method followed in the institute is Communicative Language Teaching (CLT), and speaking is thus the primary focus; therefore, the oral examination has a significant role in placement testing. However, lack of a listening section in the OQPT can be a rebuttal to the meaningfulness of the interpretations.
  • 17. Impartiality To support the impartiality warrant, the assessment items/tasks should be checked for response formats or content that may either favor or disfavor some test takers, and test takers should be treated impartially in terms of all aspects of test administration.  As mentioned earlier, interviews with test takers and examiners revealed that due to the lack of a specific rubric for the oral examination, it was believed that interpretations about the ability to be assessed were not made without bias against any groups of test takers.  No complaint was made in regard to the appropriateness of the content. Bias and item sensitivity studies need to be done for deeper analysis.
  • 18. Generalizability According to the generalizability warrant, the characteristics of the assessment items/tasks (e.g., input, expected response, type of interaction) as well as the scoring criteria and procedures of the test tasks should correspond closely to those of the target language use (TLU) domain.  It might be that the items/tasks in the OQPT and the oral examination do not exactly correspond to all of the TLU tasks; however, the content of the OQPT and the oral examination corresponds to the content of the textbook taught in the institute. Moreover, in the oral examination, test takers' real world language performance is examined. Thus, this can to some extent support the generalizability warrant.  Here, it is worth mentioning that some of the teachers, examiners, and experts asserted that a TOEFL or an IELTS test was a better test, indeed an ideal one, due to having writing and specially listening parts, for placement testing, but because of time limitations, they could not be used as placement tests. Consequently, a TOEFL test was given to those who had taken the OQPT, and the results manifested that the correlation between the OQPT and TOEFL scores was not high (r=.66).
  • 19. Relevance The forth warrant is relevance according to which the assessment-based interpretations should provide information that is relevant and helpful for the decision makers to make decisions.  Based on the interviews conducted with examiners, it was revealed that the OQPT scores were not sufficiently helpful in placement testing. This could have been a rebuttal to the relevance warrant; yet, the oral examination is in support of this warrant since it is quite helpful for the examiners to make their placement decisions. As said before, lack of a rubric is a serious rebuttal.
  • 20. Sufficiency The fifth warrant demands that the assessment-based interpretations should provide information that is sufficient for the decision makers to make decisions. Again, because the process of placement testing includes both a written and an oral test, sufficient information is obtained to make placement decisions.
  • 21. Claim 2: The placement decisions that are made on the basis of the OQPT scores and the oral examination results are sensitive to local values and equitable to all stakeholders. Claim 2: Decisions
  • 22. Values Sensitivity According to values sensitivity warrant, the existing community values and relevant legal requirements should be carefully and critically considered in the admission decisions that are to be made and in determining the relative seriousness of false positive and false negative classification errors.  The interviews and observations revealed that the process of placement testing does not guarantee test fairness considerations, and in this phase just absorbing more clients is important. Hence, it is possible to have potential false positives (i.e., individuals are placed in a level higher than their actual level) and false negatives (i.e., individuals are placed in a level lower than their actual level). Usually the latter happens, because it is less risky; nevertheless, if at the time of placement testing the institute does not have the level appropriate for the test taker (due to some limitations, such as space, time, lack of students, all the levels are not always covered), the test taker will be placed in a higher level.
  • 23. Equitability  Due to the subjectivity of the oral examination, it cannot be claimed that the same cut scores and decision rules are used to classify all students who have applied for the same program, and no other considerations are used. The economic and practical considerations always exist. Consequently, test takers and other stakeholders are not fully informed about how decisions will be made and whether decisions are actually made in the way described to them.
  • 24. Claim 1: The consequences of the placement decisions based on the OQPT scores and the oral examination results are beneficial to all stakeholders that use the test, including the test takers, the institution, the teachers, and the supervisor. Claim 1: Consequences
  • 25. Beneficence The first warrant is that the consequences of using the assessment that are specific to each stakeholder will be beneficial. Some of the test takers were interviewed after their placement test and their attendance in their classes. On the whole, they were satisfied, although some of them believed that their level was higher, and that they were placed in a lower level. To them, the reason is basically cost-effectiveness. Most of the teachers believed that their students were homogenous in the class; yet, two teachers strongly disagreed and they believed that their classes were not at all homogenous especially at higher levels.
  • 26. Conclusion  Based on the evidence gathered, this study found that the assessment records of the OQPT and the oral examination were consistent across different assessment tasks, different aspects of the assessment procedure, and across different groups of test takers. However, the oral examination required a set of clear criteria.  Moreover, the analysis of the two versions of the OQPT manifested that their parallelism was under question which could threaten the consistency of the assessment records.  The findings also indicated that the OQPT scores and the oral examination results could be interpreted somewhat as test takers' level of English proficiency and could place them in their appropriate levels. Such interpretations were meaningful, impartial, relevant, and sufficient, although lack of a listening section in the OQPT and lack of a rubric for the oral examination could be threatening, and generalizability of the results was to some extent under question.  In addition, the placement decisions were not sensitive to local values and equitable to all stakeholders due to the subjectivity of the oral examination and the economic considerations of the institute.  Lastly, by and large, the consequences of the placement decisions were beneficial to all stakeholders that use the test, which is composed of the test takers, the institution, the teachers, and the supervisor.
  • 27. Local Implications To support the intended test use, it would be helpful to examine the negative evidence that has been identified in this study and resolve the identified issues or mitigate the potential negative impact of unresolved issues. For instance, in the case of the current placement test, the oral examination can be given based on a rubric, a listening section can be added to the written test, and economic considerations can become less important for the institute, and therefore the intended uses of the test would become much more justifiable with stronger evidence.
  • 28. The Merits/Demerits of Using an AUA Finally, this study serves as an illustration of the merits/demerits of using an AUA.  On the whole, the AUA provides a sound framework in which the validity of a test and its use can be justified and the test developers/users can be accountable for their test.  With the help of the AUA framework, the process of assessment justification becomes more comprehensive, systematic, and coherent. In fact, one of the merits of an AUA is its clear articulation about which types of evidence should be collected for which claims or warrants.  However, in the process of assessment justification, an AUA demands that the evaluation of the test be done at many levels and this needs different types of data and analyses. Thus, in practice, the complexity of the justification study may be a big challenge for a single researcher.
  • 29. Thank you for your attention!

Editor's Notes

  1. According to Bachman (2004), items within a fairly narrow range of item difficulty, around .50, are desirable. Oller (1979) asserts that, for item discrimination, correlations of less than .35 are considered as not being useful for discriminating between participants.
  2. There are seven warrants for meaningfulness.