Test Construction1


Published on

Some guidelines for language test construction

Published in: Education, Business, Technology
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Test Construction1

  1. 3. What is testing? It’s an activity whose purpose is to determine what learners can do or know about something. What is a test? It’s a formal instrument to measure what learners can do or know about something.  
  2. 4. <ul><li>What are tests for? </li></ul><ul><li>To inform learners and teachers of the strengths and weaknesses of the process. </li></ul><ul><li>To motivate learners to review or consolidate specific material. </li></ul><ul><li>To create a sense of accomplishment/success. </li></ul><ul><li>To guide the planning/development of the ongoing teaching process. </li></ul><ul><li>To determine if (and to what extent) the objectives have been achieved. </li></ul><ul><li>To encourage improvement. </li></ul><ul><li>  </li></ul>
  3. 6. <ul><li>Specific guidelines : The way the test is designed and organized. </li></ul><ul><li>Moderation of mark scheme : The way in which teachers set the score of the test. </li></ul><ul><li>Standardization of examiners : The way in which examiners guarantee a common criteria for correction. </li></ul>
  4. 7. <ul><li>Specific Guidelines </li></ul><ul><li>Moderation of tasks : Searching for feed-back. Revision made by other teachers. </li></ul><ul><li>Level of difficulty : The presentation of tasks in a test should be arranged from easy to difficult. Starting with the most difficult task will lead the weakest learners to soon give up. An item is easy if 75% of students answer it correctly, it’s average if 50% of the students answer it correctly, and if 25% of students can’t answer the item, then it is considered difficult (pilot test). </li></ul><ul><li>Discrimination : A test should allow candidates at different levels to perform according to their abilities. A variety of tasks ranging from easy to difficult should point out the difference(s) between learners (good and weak). The number of difficult tasks should be limited and go at the end of the test. </li></ul>
  5. 8. <ul><li>Appropriate sample : The test should present a representative sample of the objectives, activities and tasks taught or used in the classroom. </li></ul><ul><li>Overlap : It occurs when content is assessed more than once. It should be avoided as reassessment of content will present an inappropriate sample, but also to prevent visual and mental overload from students. </li></ul><ul><li>Clarity of tasks : Instructions should be simple and unambiguous, providing a clear indication of what the task demands from the student. Instructions should never be more difficult than the task. </li></ul><ul><li>Questions and texts: The selection of questions and texts will depend on the purpose and the formats chosen by the designer of the test. Again, the difficulty should not lie in the question but in the task. Conversely, questions should not be too simple, obvious or answerable from world knowledge. </li></ul>
  6. 9. <ul><li>Timing : Testers should give students a reasonable time to complete the test, since too little time will evidence unreliable results. Students should be aware of the time set to complete each part of the test. The time of the test should reflect the importance and difficulty of what is being assessed. Teachers can pilot the test with a group of a similar level or he/she can even relate to similar evaluative experiences in the classroom, to determine the appropriate time agreed to complete the test. </li></ul><ul><li>Layout : Presentation, printing, spacing, font size, style, formats (a,b,c… I,II,III,IV… 1,2,3…) The layout should be consistent. Single parts should be arranged on the same page. </li></ul><ul><li>Bias : Bias can result from experiential, cultural or knowledge-based factors. Teachers should avoid items or topics inclined to give an unfair advantage to a particular group of students. Conversely, teachers should also avoid tasks or issues so obscure that candidates might have no frame of reference into which process and comprehend what is being asked. </li></ul>
  7. 10. <ul><li>Moderation of Mark Scheme </li></ul><ul><li>Acceptable response/variations . </li></ul><ul><li>Subjectivity in productive tasks . </li></ul><ul><li>Weighting (balance between items/tasks and scores). </li></ul><ul><li>Computation : The data and results should be easy to compute. The manipulation of numbers must be convenient. Simple for students and teacher (to conceive and process). </li></ul><ul><li>Avoidance of muddied measurement : The use of a skill should not interfere with the measurement of another. </li></ul><ul><li>Accessibility/intelligibility of mark scheme : Easy and convenient to access, use and understand. </li></ul>
  8. 11. <ul><li>Standardization of examiners </li></ul><ul><li>Agreement on criteria : by teachers and students. </li></ul><ul><li>Trial assessment : to assess difficulty and potential problems. </li></ul><ul><li>Review of procedures : related to the test. </li></ul><ul><li>Follow up checks : Notes or reports on the results of the tests (to improve or consolidate it) </li></ul>