Creating an in-house computerized adaptive testing (CAT) program with Concerto

1. Creating an in-house computerized adaptive testing (CAT) program with Concerto Atsushi, MIZUMOTO (Kansai University) 2013/09/20 JLTA at Waseda University

2. Computerized Adaptive Testing

3. CAT needs Item Response Theory

4. CTT vs. IRT Aspect CTT IRT Test score Ordinal scale Interval scale Ability estimate Test-dependent Test-independent Test result Person-dependent Person-independent Measurement target (Precision) All test-takers Individuals Equating/CAT Difﬁcult Easy Ohtomo (2009)

5. CAT Needs IRT CAT IRT IRT IRT

6. History of CAT Research 40 years (Thomson & Weiss, 2011)) 30 in LT (Koyama, 2010))

7. Example of CAT

8. Example of CAT

9. CBT ≠ CAT

10. How CAT Works http://www.j-cat.org/page/interpret

11. Advantages of CAT •Tailored for individual test-takers •Shorter test time •More precision (= SE smaller) •No need for random sampling www.geocities.jp/kosugitti/labo/irtnote.pdf

12. Purposes •Creating a CAT program •Evaluation

13. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Speciﬁcations & Feedback •Administering the CAT

15. Moodle Plugin http://moodle2x.info

19. 1. Free account（150 test takers/month） 2. Amazon Machine Images（Free for a year） 3. Installing it on your own server

20. •Open-source •Running R on a server (catR, RMySQL) •HTML-based

21. Installation on a server https://code.google.com/p/concerto-platform/wiki/installation4

22. Wiki (Resources) https://code.google.com/p/concerto-platform/wiki/Resources?tm=6

25. Constructing an Item Bank (Pretest) •Vocabulary Test (Mizumoto, 2006) http://www.mizumot.com/ﬁles/VocSizeMeasure.pdf •Based on SVL 12,000 (Up to 8,000 level; 30 items for each level) •716 university EFL learners

26. Sample Question (1) 心の, 精神の A. essential B. creative C. loose D. mental

29. Calibrating the Item Bank •240 items analyzed (Rasch model) •150 items left for the item bank •Calibrated with two parameter logistic model (item difﬁculty & discrimination) •Update the csv ﬁle to Concerto

32. Speciﬁcations of CAT •Starting point (parameters, initial ability, randmized/ﬁxed） •Ability estimation method (empirical Bayes and others) •Stopping rule (Number of items/Standard error） •Final ability estimation

33. Magis and Raîche (2012, p. 7)

34. How many items for what SE? •Simulation with catR package Magis, D., & Raîche, G. (2012). http://www.jstatsoft.org/v48/i08

35. True Theta = 1, SE = 0.3 Stopping rule = 30 items

36. Concerto

37. http://langtest.jp/concerto/?tid=20

39. Feedback Page

42. 268 test takers (university ﬁrst year) (1) CAT (2) Paper-pencil version (68 items) common person linking (3) Questionnaire “What did you think of the CAT result?”

43. Evaluation CAT vs. Paper-pencil

44. CAT Theta 0 1 2 3 4 -10123 0.92 -1 0 1 2 3 01234 Paper-pencil Theta n = 268 Random 30Qs Fixed 68Qs

45. -1 0 1 2 3 01234 Pape n = 268 CAT (30Qs) M = 1.71 SD = 1.13 P-P (68Qs) M = 1.72 SD = 0.95

46. -1 0 1 2 3 01234 Pape n = 268 CAT (30Qs) M = 1.71 SD = 1.13 P-P (68Qs) M = 1.72 SD = 0.95 Mean diff. = -0.02 95% CI [-0.07, 0.04] d = 0.01 Power = .06

47. -1 0 1 2 3 01234 Pape n = 268 CAT SE (30Qs) M = 0.39 SD = 0.11 P-P SE (68Qs) M = 1.71 SD = 1.13

48. -1 0 1 2 3 01234 Pape n = 268 CAT SE (30Qs) M = 0.39 SD = 0.11 P-P SE (68Qs) M = 1.71 SD = 1.13 Mean diff. of SE = -1.32 95% CI [-1.44, -1.19] d = 1.65 Power = 0.99

49. Evaluation CAT vs. Paper-pencil Means: CAT = Paper-pencil SEs: CAT < Paper-pencil CAT measures the same ability with much more precision (with fewer items).

50. Evaluation Questionnaire

51. Result of the Questionnaire Frequency Response 150 100 50 0 50 100 150 Very inaccurate Inaccurate Rather Inaccurate Rather accurate Accurate Very accurate

52. Feedback Page

53. Future Research •More items in the item bank •Better formula for predicting other test scores •Improved feedback •Collaboration

54. Summary •Created a CAT program •Evaluation (1) CAT better than Paper-pencil (2) Feedback needs improvement.

Creating an in-house computerized adaptive testing (CAT) program with Concerto

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Creating an in-house computerized adaptive testing (CAT) program with Concerto

Similar to Creating an in-house computerized adaptive testing (CAT) program with Concerto (20)

More from Mizumoto Atsushi

More from Mizumoto Atsushi (12)

Recently uploaded

Recently uploaded (20)

Creating an in-house computerized adaptive testing (CAT) program with Concerto