SlideShare a Scribd company logo
Creating an in-house
computerized adaptive testing
(CAT) program with Concerto
Atsushi, MIZUMOTO
(Kansai University)
2013/09/20
JLTA at Waseda University
Computerized Adaptive Testing
CAT needs
Item Response Theory
CTT vs. IRT
Aspect CTT IRT
Test score Ordinal scale Interval scale
Ability estimate Test-dependent Test-independent
Test result Person-dependent Person-independent
Measurement
target (Precision)
All test-takers Individuals
Equating/CAT Difficult Easy
Ohtomo (2009)
CAT Needs IRT
CAT
IRT
IRT
IRT
History of CAT Research
40 years
(Thomson & Weiss, 2011))
30 in LT
(Koyama, 2010))
Example of CAT
Example of CAT
CBT ≠ CAT
How CAT Works
http://www.j-cat.org/page/interpret
Advantages of CAT
•Tailored for individual test-takers
•Shorter test time
•More precision (= SE smaller)
•No need for random sampling
www.geocities.jp/kosugitti/labo/irtnote.pdf
Purposes
•Creating a CAT program
•Evaluation
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Moodle Plugin
http://moodle2x.info
1. Free account(150 test takers/month)
2. Amazon Machine Images(Free for a year)
3. Installing it on your own server
•Open-source
•Running R on a server (catR, RMySQL)
•HTML-based
Installation on a server
https://code.google.com/p/concerto-platform/wiki/installation4
Wiki (Resources)
https://code.google.com/p/concerto-platform/wiki/Resources?tm=6
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Constructing an Item Bank
(Pretest)
•Vocabulary Test (Mizumoto, 2006)
http://www.mizumot.com/files/VocSizeMeasure.pdf
•Based on SVL 12,000
(Up to 8,000 level; 30 items for each level)
•716 university EFL learners
Sample Question
(1) 心の, 精神の
	

 A.	

essential
	

 B.	

creative
	

 C.	

loose
	

 D.	

mental
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Calibrating the Item Bank
•240 items analyzed (Rasch model)
•150 items left for the item bank
•Calibrated with two parameter
logistic model (item difficulty & discrimination)
•Update the csv file to Concerto
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Specifications of CAT
•Starting point
(parameters, initial ability, randmized/fixed)
•Ability estimation method
(empirical Bayes and others)
•Stopping rule
(Number of items/Standard error)
•Final ability estimation
Magis and Raîche (2012, p. 7)
How many items for what SE?
•Simulation with catR package
Magis, D., & Raîche, G. (2012).
http://www.jstatsoft.org/v48/i08
True Theta = 1, SE = 0.3
Stopping rule = 30 items
Concerto
http://langtest.jp/concerto/?tid=20
Feedback Page
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
268 test takers
(university first year)
(1) CAT
(2) Paper-pencil version
(68 items) common person linking
(3) Questionnaire
“What did you think of
the CAT result?”
Evaluation
CAT vs. Paper-pencil
CAT Theta
0 1 2 3 4
-10123
0.92
-1 0 1 2 3
01234
Paper-pencil Theta
n = 268
Random
30Qs
Fixed
68Qs
-1 0 1 2 3
01234
Pape
n = 268
CAT
(30Qs)
M = 1.71
SD = 1.13
P-P
(68Qs)
M = 1.72
SD = 0.95
-1 0 1 2 3
01234
Pape
n = 268
CAT
(30Qs)
M = 1.71
SD = 1.13
P-P
(68Qs)
M = 1.72
SD = 0.95
Mean diff. = -0.02
95% CI [-0.07, 0.04]
d = 0.01
Power = .06
-1 0 1 2 3
01234
Pape
n = 268
CAT SE
(30Qs)
M = 0.39
SD = 0.11
P-P SE
(68Qs)
M = 1.71
SD = 1.13
-1 0 1 2 3
01234
Pape
n = 268
CAT SE
(30Qs)
M = 0.39
SD = 0.11
P-P SE
(68Qs)
M = 1.71
SD = 1.13
Mean diff. of SE
= -1.32
95% CI [-1.44, -1.19]
d = 1.65
Power = 0.99
Evaluation
CAT vs. Paper-pencil
Means: CAT = Paper-pencil
SEs: CAT < Paper-pencil
CAT measures the same ability
with much more precision
(with fewer items).
Evaluation
Questionnaire
Result of the Questionnaire
Frequency
Response
150 100 50 0 50 100 150
Very inaccurate Inaccurate Rather Inaccurate Rather accurate Accurate Very accurate
Feedback Page
Future Research
•More items in the item bank
•Better formula for predicting
other test scores
•Improved feedback
•Collaboration
Summary
•Created a CAT program
•Evaluation
(1) CAT better than Paper-pencil
(2) Feedback needs improvement.

More Related Content

What's hot

Computer based test designs (cbt)
Computer based test designs (cbt)Computer based test designs (cbt)
Computer based test designs (cbt)
munsif123
 
Test equating using irt. final
Test equating using irt. finalTest equating using irt. final
Test equating using irt. final
munsif123
 
Test construction
Test constructionTest construction
Test construction
munsif123
 
Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Thanh Tran
 
A/B testing from basic concepts to advanced techniques
A/B testing  from basic concepts to advanced techniquesA/B testing  from basic concepts to advanced techniques
A/B testing from basic concepts to advanced techniques
Anatoliy Vuets
 
Statistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerceStatistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerce
Anatoliy Vuets
 
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed SystemsA Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
Alan Said
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Alan Said
 
Programming with GUTs
Programming with GUTsProgramming with GUTs
Programming with GUTscatherinewall
 
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
csandit
 
Test design made easy (and fun) Rik Marselis EuroSTAR
Test design made easy (and fun) Rik Marselis EuroSTARTest design made easy (and fun) Rik Marselis EuroSTAR
Test design made easy (and fun) Rik Marselis EuroSTAR
Rik Marselis
 
Expanding our Testing Horizons
Expanding our Testing HorizonsExpanding our Testing Horizons
Expanding our Testing Horizons
Mark Micallef
 
No struggle with test design (presentation at TestExpo 2015 Denmark)
No struggle with test design (presentation at TestExpo 2015 Denmark)No struggle with test design (presentation at TestExpo 2015 Denmark)
No struggle with test design (presentation at TestExpo 2015 Denmark)
Rik Marselis
 
Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...
Caveon Test Security
 
Chaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management interChaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management inter
RAJU852744
 
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- EvaluationBridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Thomas Ploetz
 
Applied Psych Test Design: Part C - Use of Rasch scaling technology
Applied Psych Test Design: Part C - Use of Rasch scaling technologyApplied Psych Test Design: Part C - Use of Rasch scaling technology
Applied Psych Test Design: Part C - Use of Rasch scaling technology
Kevin McGrew
 
Bad Metric, Bad!
Bad Metric, Bad!Bad Metric, Bad!
Bad Metric, Bad!
Joseph Ours, MBA, PMP
 
Guidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability PredictionGuidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability Prediction
ijsrd.com
 

What's hot (20)

Computer based test designs (cbt)
Computer based test designs (cbt)Computer based test designs (cbt)
Computer based test designs (cbt)
 
Test equating using irt. final
Test equating using irt. finalTest equating using irt. final
Test equating using irt. final
 
Test construction
Test constructionTest construction
Test construction
 
Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...
 
A/B testing from basic concepts to advanced techniques
A/B testing  from basic concepts to advanced techniquesA/B testing  from basic concepts to advanced techniques
A/B testing from basic concepts to advanced techniques
 
Statistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerceStatistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerce
 
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed SystemsA Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
A Top-N Recommender System Evaluation Protocol Inspired by Deployed Systems
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
 
Programming with GUTs
Programming with GUTsProgramming with GUTs
Programming with GUTs
 
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
 
Test design made easy (and fun) Rik Marselis EuroSTAR
Test design made easy (and fun) Rik Marselis EuroSTARTest design made easy (and fun) Rik Marselis EuroSTAR
Test design made easy (and fun) Rik Marselis EuroSTAR
 
Expanding our Testing Horizons
Expanding our Testing HorizonsExpanding our Testing Horizons
Expanding our Testing Horizons
 
No struggle with test design (presentation at TestExpo 2015 Denmark)
No struggle with test design (presentation at TestExpo 2015 Denmark)No struggle with test design (presentation at TestExpo 2015 Denmark)
No struggle with test design (presentation at TestExpo 2015 Denmark)
 
Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...
 
Chaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management interChaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management inter
 
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- EvaluationBridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
 
Applied Psych Test Design: Part C - Use of Rasch scaling technology
Applied Psych Test Design: Part C - Use of Rasch scaling technologyApplied Psych Test Design: Part C - Use of Rasch scaling technology
Applied Psych Test Design: Part C - Use of Rasch scaling technology
 
Bad Metric, Bad!
Bad Metric, Bad!Bad Metric, Bad!
Bad Metric, Bad!
 
Guidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability PredictionGuidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability Prediction
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 

Similar to Creating an in-house computerized adaptive testing (CAT) program with Concerto

A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...
Tetsuo Kimura
 
Automated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design ProblemsAutomated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design Problems
Anubhav Jain
 
ictir2016
ictir2016ictir2016
ictir2016
Tetsuya Sakai
 
AIRS2016
AIRS2016AIRS2016
AIRS2016
Tetsuya Sakai
 
MSPresentation_Spring2011
MSPresentation_Spring2011MSPresentation_Spring2011
MSPresentation_Spring2011Shaun Smith
 
⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention
⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention
⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention
Victor Asanza
 
Topic Set Size Design with Variance Estimates from Two-Way ANOVA
Topic Set Size Design with Variance Estimates from Two-Way ANOVATopic Set Size Design with Variance Estimates from Two-Way ANOVA
Topic Set Size Design with Variance Estimates from Two-Way ANOVA
Tetsuya Sakai
 
Sbst2018 contest2018
Sbst2018 contest2018Sbst2018 contest2018
Sbst2018 contest2018
Annibale Panichella
 
Pathogen phylogenetics using BEAST
Pathogen phylogenetics using BEASTPathogen phylogenetics using BEAST
Pathogen phylogenetics using BEAST
Bioinformatics and Computational Biosciences Branch
 
RS in the context of Big Data-v4
RS in the context of Big Data-v4RS in the context of Big Data-v4
RS in the context of Big Data-v4Khadija Atiya
 
Attribute MSA
Attribute MSA Attribute MSA
Attribute MSA
dishashah4993
 
Attribute MSA
Attribute MSAAttribute MSA
Attribute MSA
dishashah4993
 
Introduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEASTIntroduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEAST
Bioinformatics and Computational Biosciences Branch
 
Can AI Tell Emerging Technologies
Can AI Tell Emerging TechnologiesCan AI Tell Emerging Technologies
Can AI Tell Emerging Technologies
Seonho Kim
 
Fundamentals of Engineering Design
Fundamentals of Engineering DesignFundamentals of Engineering Design
Fundamentals of Engineering Design
asuarea48
 
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdfES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
Minh Nguyen
 
Model-based programming and AI-assisted software development
Model-based programming and AI-assisted software developmentModel-based programming and AI-assisted software development
Model-based programming and AI-assisted software development
Eficode
 
When Should I Use Simulation?
When Should I Use Simulation?When Should I Use Simulation?
When Should I Use Simulation?
SIMUL8 Corporation
 

Similar to Creating an in-house computerized adaptive testing (CAT) program with Concerto (20)

A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...
 
Automated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design ProblemsAutomated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design Problems
 
ictir2016
ictir2016ictir2016
ictir2016
 
AIRS2016
AIRS2016AIRS2016
AIRS2016
 
MSPresentation_Spring2011
MSPresentation_Spring2011MSPresentation_Spring2011
MSPresentation_Spring2011
 
⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention
⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention
⭐⭐⭐⭐⭐ Finding a Dynamical Model of a Social Norm Physical Activity Intervention
 
Topic Set Size Design with Variance Estimates from Two-Way ANOVA
Topic Set Size Design with Variance Estimates from Two-Way ANOVATopic Set Size Design with Variance Estimates from Two-Way ANOVA
Topic Set Size Design with Variance Estimates from Two-Way ANOVA
 
Sbst2018 contest2018
Sbst2018 contest2018Sbst2018 contest2018
Sbst2018 contest2018
 
Pathogen phylogenetics using BEAST
Pathogen phylogenetics using BEASTPathogen phylogenetics using BEAST
Pathogen phylogenetics using BEAST
 
RS in the context of Big Data-v4
RS in the context of Big Data-v4RS in the context of Big Data-v4
RS in the context of Big Data-v4
 
Attribute MSA
Attribute MSA Attribute MSA
Attribute MSA
 
Attribute MSA
Attribute MSAAttribute MSA
Attribute MSA
 
Introduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEASTIntroduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEAST
 
Qualificacao acd
Qualificacao acdQualificacao acd
Qualificacao acd
 
Can AI Tell Emerging Technologies
Can AI Tell Emerging TechnologiesCan AI Tell Emerging Technologies
Can AI Tell Emerging Technologies
 
Fundamentals of Engineering Design
Fundamentals of Engineering DesignFundamentals of Engineering Design
Fundamentals of Engineering Design
 
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdfES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
 
Model-based programming and AI-assisted software development
Model-based programming and AI-assisted software developmentModel-based programming and AI-assisted software development
Model-based programming and AI-assisted software development
 
BIRTE-13-Kawashima
BIRTE-13-KawashimaBIRTE-13-Kawashima
BIRTE-13-Kawashima
 
When Should I Use Simulation?
When Should I Use Simulation?When Should I Use Simulation?
When Should I Use Simulation?
 

More from Mizumoto Atsushi

2021-0509_JAECS2021_Spring
2021-0509_JAECS2021_Spring2021-0509_JAECS2021_Spring
2021-0509_JAECS2021_Spring
Mizumoto Atsushi
 
2015-1003 英語コーパス学会ワークショップ使用スライド
2015-1003 英語コーパス学会ワークショップ使用スライド2015-1003 英語コーパス学会ワークショップ使用スライド
2015-1003 英語コーパス学会ワークショップ使用スライド
Mizumoto Atsushi
 
LET2015 National Conference Seminar
LET2015 National Conference SeminarLET2015 National Conference Seminar
LET2015 National Conference Seminar
Mizumoto Atsushi
 
JSSS2014 Symposium (Atsushi Mizumoto)
JSSS2014 Symposium (Atsushi Mizumoto)JSSS2014 Symposium (Atsushi Mizumoto)
JSSS2014 Symposium (Atsushi Mizumoto)
Mizumoto Atsushi
 
SappoRo.R #3 LT: Shiny by RStudio
SappoRo.R #3 LT: Shiny by RStudioSappoRo.R #3 LT: Shiny by RStudio
SappoRo.R #3 LT: Shiny by RStudio
Mizumoto Atsushi
 
量的データの分析・報告で気をつけたいこと
量的データの分析・報告で気をつけたいこと量的データの分析・報告で気をつけたいこと
量的データの分析・報告で気をつけたいこと
Mizumoto Atsushi
 
2013全国英語教育学会WS公開用スライド
2013全国英語教育学会WS公開用スライド2013全国英語教育学会WS公開用スライド
2013全国英語教育学会WS公開用スライド
Mizumoto Atsushi
 
Rを使ったコンピュータ適応型テスト構築の試み
Rを使ったコンピュータ適応型テスト構築の試みRを使ったコンピュータ適応型テスト構築の試み
Rを使ったコンピュータ適応型テスト構築の試み
Mizumoto Atsushi
 
Let中部2012シンポスライド
Let中部2012シンポスライドLet中部2012シンポスライド
Let中部2012シンポスライド
Mizumoto Atsushi
 
Excelを使った統計解析とグラフ化入門
Excelを使った統計解析とグラフ化入門Excelを使った統計解析とグラフ化入門
Excelを使った統計解析とグラフ化入門
Mizumoto Atsushi
 
2012-1110「マルチレベルモデルのはなし」(censored)
2012-1110「マルチレベルモデルのはなし」(censored)2012-1110「マルチレベルモデルのはなし」(censored)
2012-1110「マルチレベルモデルのはなし」(censored)
Mizumoto Atsushi
 

More from Mizumoto Atsushi (12)

2021-0509_JAECS2021_Spring
2021-0509_JAECS2021_Spring2021-0509_JAECS2021_Spring
2021-0509_JAECS2021_Spring
 
2015-1003 英語コーパス学会ワークショップ使用スライド
2015-1003 英語コーパス学会ワークショップ使用スライド2015-1003 英語コーパス学会ワークショップ使用スライド
2015-1003 英語コーパス学会ワークショップ使用スライド
 
LET2015 National Conference Seminar
LET2015 National Conference SeminarLET2015 National Conference Seminar
LET2015 National Conference Seminar
 
JSSS2014 Symposium (Atsushi Mizumoto)
JSSS2014 Symposium (Atsushi Mizumoto)JSSS2014 Symposium (Atsushi Mizumoto)
JSSS2014 Symposium (Atsushi Mizumoto)
 
SappoRo.R #3 LT: Shiny by RStudio
SappoRo.R #3 LT: Shiny by RStudioSappoRo.R #3 LT: Shiny by RStudio
SappoRo.R #3 LT: Shiny by RStudio
 
量的データの分析・報告で気をつけたいこと
量的データの分析・報告で気をつけたいこと量的データの分析・報告で気をつけたいこと
量的データの分析・報告で気をつけたいこと
 
2013 11 jacet-kansai-ws
2013 11 jacet-kansai-ws2013 11 jacet-kansai-ws
2013 11 jacet-kansai-ws
 
2013全国英語教育学会WS公開用スライド
2013全国英語教育学会WS公開用スライド2013全国英語教育学会WS公開用スライド
2013全国英語教育学会WS公開用スライド
 
Rを使ったコンピュータ適応型テスト構築の試み
Rを使ったコンピュータ適応型テスト構築の試みRを使ったコンピュータ適応型テスト構築の試み
Rを使ったコンピュータ適応型テスト構築の試み
 
Let中部2012シンポスライド
Let中部2012シンポスライドLet中部2012シンポスライド
Let中部2012シンポスライド
 
Excelを使った統計解析とグラフ化入門
Excelを使った統計解析とグラフ化入門Excelを使った統計解析とグラフ化入門
Excelを使った統計解析とグラフ化入門
 
2012-1110「マルチレベルモデルのはなし」(censored)
2012-1110「マルチレベルモデルのはなし」(censored)2012-1110「マルチレベルモデルのはなし」(censored)
2012-1110「マルチレベルモデルのはなし」(censored)
 

Recently uploaded

The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdfMASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
goswamiyash170123
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Akanksha trivedi rama nursing college kanpur.
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
Krisztián Száraz
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
David Douglas School District
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 

Recently uploaded (20)

The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdfMASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
MASS MEDIA STUDIES-835-CLASS XI Resource Material.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 

Creating an in-house computerized adaptive testing (CAT) program with Concerto

  • 1. Creating an in-house computerized adaptive testing (CAT) program with Concerto Atsushi, MIZUMOTO (Kansai University) 2013/09/20 JLTA at Waseda University
  • 4. CTT vs. IRT Aspect CTT IRT Test score Ordinal scale Interval scale Ability estimate Test-dependent Test-independent Test result Person-dependent Person-independent Measurement target (Precision) All test-takers Individuals Equating/CAT Difficult Easy Ohtomo (2009)
  • 6. History of CAT Research 40 years (Thomson & Weiss, 2011)) 30 in LT (Koyama, 2010))
  • 11. Advantages of CAT •Tailored for individual test-takers •Shorter test time •More precision (= SE smaller) •No need for random sampling www.geocities.jp/kosugitti/labo/irtnote.pdf
  • 12. Purposes •Creating a CAT program •Evaluation
  • 13. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 14. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 16.
  • 17.
  • 18.
  • 19. 1. Free account(150 test takers/month) 2. Amazon Machine Images(Free for a year) 3. Installing it on your own server
  • 20. •Open-source •Running R on a server (catR, RMySQL) •HTML-based
  • 21. Installation on a server https://code.google.com/p/concerto-platform/wiki/installation4
  • 23. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 24. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 25. Constructing an Item Bank (Pretest) •Vocabulary Test (Mizumoto, 2006) http://www.mizumot.com/files/VocSizeMeasure.pdf •Based on SVL 12,000 (Up to 8,000 level; 30 items for each level) •716 university EFL learners
  • 26. Sample Question (1) 心の, 精神の A. essential B. creative C. loose D. mental
  • 27. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 28. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 29. Calibrating the Item Bank •240 items analyzed (Rasch model) •150 items left for the item bank •Calibrated with two parameter logistic model (item difficulty & discrimination) •Update the csv file to Concerto
  • 30. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 31. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 32. Specifications of CAT •Starting point (parameters, initial ability, randmized/fixed) •Ability estimation method (empirical Bayes and others) •Stopping rule (Number of items/Standard error) •Final ability estimation
  • 33. Magis and Raîche (2012, p. 7)
  • 34. How many items for what SE? •Simulation with catR package Magis, D., & Raîche, G. (2012). http://www.jstatsoft.org/v48/i08
  • 35. True Theta = 1, SE = 0.3 Stopping rule = 30 items
  • 38.
  • 40. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 41. Creating a CAT Program •Choosing the CAT System •Constructing an Item Bank (Pretest) •Calibrating the Item Bank •Determine Specifications & Feedback •Administering the CAT
  • 42. 268 test takers (university first year) (1) CAT (2) Paper-pencil version (68 items) common person linking (3) Questionnaire “What did you think of the CAT result?”
  • 44. CAT Theta 0 1 2 3 4 -10123 0.92 -1 0 1 2 3 01234 Paper-pencil Theta n = 268 Random 30Qs Fixed 68Qs
  • 45. -1 0 1 2 3 01234 Pape n = 268 CAT (30Qs) M = 1.71 SD = 1.13 P-P (68Qs) M = 1.72 SD = 0.95
  • 46. -1 0 1 2 3 01234 Pape n = 268 CAT (30Qs) M = 1.71 SD = 1.13 P-P (68Qs) M = 1.72 SD = 0.95 Mean diff. = -0.02 95% CI [-0.07, 0.04] d = 0.01 Power = .06
  • 47. -1 0 1 2 3 01234 Pape n = 268 CAT SE (30Qs) M = 0.39 SD = 0.11 P-P SE (68Qs) M = 1.71 SD = 1.13
  • 48. -1 0 1 2 3 01234 Pape n = 268 CAT SE (30Qs) M = 0.39 SD = 0.11 P-P SE (68Qs) M = 1.71 SD = 1.13 Mean diff. of SE = -1.32 95% CI [-1.44, -1.19] d = 1.65 Power = 0.99
  • 49. Evaluation CAT vs. Paper-pencil Means: CAT = Paper-pencil SEs: CAT < Paper-pencil CAT measures the same ability with much more precision (with fewer items).
  • 51. Result of the Questionnaire Frequency Response 150 100 50 0 50 100 150 Very inaccurate Inaccurate Rather Inaccurate Rather accurate Accurate Very accurate
  • 53. Future Research •More items in the item bank •Better formula for predicting other test scores •Improved feedback •Collaboration
  • 54. Summary •Created a CAT program •Evaluation (1) CAT better than Paper-pencil (2) Feedback needs improvement.