SlideShare a Scribd company logo
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
www.twosigmas.com
What makes a good adaptive testing program?
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
Advantage of IRT based CAT
2
On a 40 question exam with dichotomous
scoring (wrong or right), the total number of
questions you might need to develop is
240
= 1.01 × 1012
.
On a well designed IRT based CAT, the total
number of questions you might need to
develop is
≈ 400.
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
Who is your market?
3
Students – Computer adaptive teaching/
learning? Blended learning with MOOCs?
Schools – Adaptive homework? Computer
based in school exams? Formative
assessment?
Exam Board – Professionalize organizations?
Corporations? Government organizations?
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
Background
4
What are you trying to
measure?
How is it manifest?
What questions tests
this?
Is this question valid?
Is this question reliable?
Is this question fair?
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
General Characteristics of any IRT CAT
5
Item bank
Development
Pre-Test Items
Prioritizing
Publishing
CAT
Maintaining
CAT
Qualification
Specification
Reliability &
Validity
Differential Item
Analysis (DIF)
Content Balancing Item Selection
Percentile
Ranking
Item
Calibration
Communicate Results
Standard Exam
Conditions
Exam Security
(Exposure)
Validity Testing
Termination Criteria
Better than
existing solution?
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
Differential Item Functioning (DIF): Guessing (𝑥)
6
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
-4 -3 -2 -1 0 1 2 3 4
LikelihoodofGettingaQuestionCorrect
Ability of Test Taker
(z-score)
Item Response Curves
Standard Normal Item 1 Item 2
𝑃𝑖𝑗 = 𝑃 𝑢 = 1 𝑥, 𝛼, 𝜃, 𝛿 = 𝑥 + 1 − 𝑥
𝑒1.702𝛼 𝜃−𝛿
1 + 𝑒1.702𝛼 𝜃−𝛿
,
𝛼 𝑥 𝛿
Normal 1 0 0
Item 1 1 0 0.2
Item 2 1 0 0.07
where i represents the ith item and j represents the jth test taker.
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
Differential Item Functioning (DIF): Difficulty (𝛿)
7
𝑃𝑖𝑗 = 𝑃 𝑢 = 1 𝑥, 𝛼, 𝜃, 𝛿 = 𝑥 + (1 − 𝑥)
𝑒1.702𝛼(𝜃−𝛿)
1 + 𝑒1.702𝛼(𝜃−𝛿)
𝛼 𝑥 𝛿
Item 1 1 0 0
Item 2 1 0 -1
Item 3 1 0 1
where i represents the ith item and j represents the jth test taker.
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
-4 -3 -2 -1 0 1 2 3 4
LikelihoodofGettingaQuestionCorrect(percent)
Ability of Test Taker (z-score)
Item Response Curves
Item 1 Item 2 Item 3
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
Differential Item Functioning (DIF): Discrimination (𝛼)
8
𝑃𝑖𝑗 = 𝑃 𝑢 = 1 𝑥, 𝛼, 𝜃, 𝛿 = 𝑥 + (1 − 𝑥)
𝑒1.702𝛼(𝜃−𝛿)
1 + 𝑒1.702𝛼(𝜃−𝛿)
𝛼 𝑥 𝛿
Item 1 0.4 0 0
Item 2 0.8 0 -1
Item 3 1.2 0 1
where i represents the ith item and j represents the jth test taker.
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
-4 -3 -2 -1 0 1 2 3 4
LikelihoodofGettingaQuestionCorrect(percent)
Ability of Test Taker (z-score)
Item Response Curves
Item 1 Item 2 Item 3
www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage
References:
9
• Baker, F. B. and Kim, S. (2004). Item Response Theory Parameter Estimation
Techniques, 2nd Edition, Revised and Expanded. New York, NY: CRC Press,
Taylor and Francis Group.
• Bao, Han, Dayton, C. Mitchell, & Hendrickson, Amy B. (2009). Differential Item
Functioning Amplification and Cancellation in a Reading Test. Practical
Assessment, Research & Evaluation, 14(19). Available online:
http://pareonline.net/getvn.asp?v=14&n=19
• Bergstorm, B. A., Gershon, R.C., and Brown, W. L. (1993) Differential Item
Functioning vs. Differential Test Functioning. Paper Presented at the Annual
Meeting of the American Educational Research Association (Atlanta, GA. April 12 -
16) Retrieved on February 20, 2011 from
http://www.eric.ed.gov/PDFS/ED377227.pdf
• Birdsall, M (2011) Implementing Computer Adaptive Testing to Improve
Achievement Opportunities. Ofqual, Coventry. Online at:
http://webarchive.nationalarchives.gov.uk/+/http://www.ofqual.gov.uk/files/2011-06-
15-implementing-computer-adaptive-testing-to-improve-achievement-
opportunities.pdf
• Bowles, R. and Pommerich, M. (2001). An Examination of Item Review on a CAT
Using the Specific Information Item Selection Algorithm. Paper presented at the
Annual Meeting of the National Council on Measurement in Education, Seattle, WA.
• Childs, Ruth A. & Andrew P. Jaciw (2003). Matrix sampling of items in large-scale
assessments. Practical Assessment, Research & Evaluation, 8(16). Retrieved
February 22, 2011 from http://PAREonline.net/getvn.asp?v=8&n=16
• de Ayala, R. J. (2009). The Theory and Practice of Item Response Theory. New
York, NY: The Guildfor Press.
• He, Q. (2010) Maintaining Standards in on Demand Testing Using Item Response
Theory. Ofqual, Coventry. Retrieved on February 10, 2011, from http://e-
assessment.org.uk/images/uploads/s-docs/Ofqual-10-4724-Maintaining-
standards.pdf
• Newton, Paul E. (2007) 'Clarifying the purposes of educational assessment',
Assessment in Education: Principles, Policy & Practice, 14:2, 149 -170. Retrieved
February 20, 2011 from http://dx.doi.org/10.1080/09695940701478321
• Pommerich, M., Segall, D.O., & Moreno, K.E. (2009). The nine lives of CAT-ASVAB:
Innovations and revelations. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC
Conference on Computerized Adaptive Testing. Retrieved on February 15, 2011
from www.psych.umn.edu/psylabs/CATCentral/
• Rudner, L. M. (2007). Implementing the Graduate Management Admission Test®
computerized adaptive test. In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC
Conference on Computerized Adaptive Testing. Retrieved January 10, 2010 from
www.psych.umn.edu/psylabs/CATCentral/
• Segall, D. O. and Moreno, K. E. (1999) Development of the CAT-ASVAB. In F.
Drasgow & J. B. Olson-Buchanan (Eds.). Innovations in Computerized Assessment
(pp. 35—65). Hillsdale, NJ: Lawrence Erlbaum Associates. Retrieved on February
20, 2011 from http://www.danielsegall.com/catasvab.pdf
• van der Linden, W. J. and Glas, A. W. (eds.), (2010) Elements of Computer
Adaptive Testing: Statistics. Chapters 4, 10, 17, and page 349. London, UK:
Springer Science + Business Media LLC.
• Wise, L. L., Curran, L. T., & McBride, J. R. (1997). CAT-ASVAB Cost and Benefit
Analyses. In W. A. Sands, B. K. Waters, & J. R. McBride (Eds.), Computerized
adaptive testing: From inquiry to operation (pp. 227-236). Washington, DC:
American Psychological Association.
• Zumbo, B. D. (2007). Three Generations of DIF Analyses: Considering Where it Has
Been, Where it is Now, and Where it is Going. Language Assessment Quarterly,
4(2), 223-233, Lawrence Erlbaum Associates, Publishers. Retrieved February 20,
2011 from http://educ.ubc.ca/faculty/zumbo/papers/Zumbo_LAQ_reprint.pdf

More Related Content

What's hot

Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Thanh Tran
 
Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...
Caveon Test Security
 
Exploratory testing STEW 2016
Exploratory testing STEW 2016Exploratory testing STEW 2016
Exploratory testing STEW 2016
Per Runeson
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
Ting Yuan, Ed.D.
 
Teacher-Aware Active Robot Learning
Teacher-Aware Active Robot LearningTeacher-Aware Active Robot Learning
Teacher-Aware Active Robot Learning
Mattia Racca
 
A/B testing from basic concepts to advanced techniques
A/B testing  from basic concepts to advanced techniquesA/B testing  from basic concepts to advanced techniques
A/B testing from basic concepts to advanced techniques
Anatoliy Vuets
 
Practical Language Testing Glenn Fulcher
Practical Language Testing Glenn FulcherPractical Language Testing Glenn Fulcher
Practical Language Testing Glenn Fulcher
translatoran
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
Ting Yuan, Ed.D.
 
Measuring the impact of instant high quality feedback.
Measuring the impact of instant high quality feedback.Measuring the impact of instant high quality feedback.
Measuring the impact of instant high quality feedback.
Stephen Nutbrown
 
Programming with GUTs
Programming with GUTsProgramming with GUTs
Programming with GUTs
catherinewall
 
ICELW Conference Slides
ICELW Conference SlidesICELW Conference Slides
ICELW Conference Slides
toolboc
 
Statistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerceStatistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerce
Anatoliy Vuets
 
Best Practices for the Academic User: Maximizing the Impact of Your Instituti...
Best Practices for the Academic User: Maximizing the Impact of Your Instituti...Best Practices for the Academic User: Maximizing the Impact of Your Instituti...
Best Practices for the Academic User: Maximizing the Impact of Your Instituti...
Qualtrics
 
Chaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management interChaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management inter
RAJU852744
 
Evaluating Mobile Learning
Evaluating Mobile LearningEvaluating Mobile Learning
Evaluating Mobile Learning
giasemi
 
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
csandit
 
70 270 q & a
70 270 q & a70 270 q & a
70 270 q & a
Samaja
 
Perfector presentation 2016
Perfector  presentation 2016Perfector  presentation 2016
Perfector presentation 2016
Test Perfector
 
Emp Research
Emp ResearchEmp Research
Emp Research
Wendi Tatum
 
Fulcher and Davidson Unit a5
Fulcher and Davidson Unit a5Fulcher and Davidson Unit a5
Fulcher and Davidson Unit a5
Farzaneh Hamidi
 

What's hot (20)

Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...Towards a pattern recognition approach for transferring knowledge in acm v4 f...
Towards a pattern recognition approach for transferring knowledge in acm v4 f...
 
Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...Caveon webinar series - smart items- using innovative item design to make you...
Caveon webinar series - smart items- using innovative item design to make you...
 
Exploratory testing STEW 2016
Exploratory testing STEW 2016Exploratory testing STEW 2016
Exploratory testing STEW 2016
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 
Teacher-Aware Active Robot Learning
Teacher-Aware Active Robot LearningTeacher-Aware Active Robot Learning
Teacher-Aware Active Robot Learning
 
A/B testing from basic concepts to advanced techniques
A/B testing  from basic concepts to advanced techniquesA/B testing  from basic concepts to advanced techniques
A/B testing from basic concepts to advanced techniques
 
Practical Language Testing Glenn Fulcher
Practical Language Testing Glenn FulcherPractical Language Testing Glenn Fulcher
Practical Language Testing Glenn Fulcher
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 
Measuring the impact of instant high quality feedback.
Measuring the impact of instant high quality feedback.Measuring the impact of instant high quality feedback.
Measuring the impact of instant high quality feedback.
 
Programming with GUTs
Programming with GUTsProgramming with GUTs
Programming with GUTs
 
ICELW Conference Slides
ICELW Conference SlidesICELW Conference Slides
ICELW Conference Slides
 
Statistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerceStatistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerce
 
Best Practices for the Academic User: Maximizing the Impact of Your Instituti...
Best Practices for the Academic User: Maximizing the Impact of Your Instituti...Best Practices for the Academic User: Maximizing the Impact of Your Instituti...
Best Practices for the Academic User: Maximizing the Impact of Your Instituti...
 
Chaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management interChaplin school of hospitality and tourism management inter
Chaplin school of hospitality and tourism management inter
 
Evaluating Mobile Learning
Evaluating Mobile LearningEvaluating Mobile Learning
Evaluating Mobile Learning
 
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
AUTOMATIC GENERATION AND OPTIMIZATION OF TEST DATA USING HARMONY SEARCH ALGOR...
 
70 270 q & a
70 270 q & a70 270 q & a
70 270 q & a
 
Perfector presentation 2016
Perfector  presentation 2016Perfector  presentation 2016
Perfector presentation 2016
 
Emp Research
Emp ResearchEmp Research
Emp Research
 
Fulcher and Davidson Unit a5
Fulcher and Davidson Unit a5Fulcher and Davidson Unit a5
Fulcher and Davidson Unit a5
 

Similar to What makes a good adaptive testing program

20151112PosterJanHellingsV2_definitief
20151112PosterJanHellingsV2_definitief20151112PosterJanHellingsV2_definitief
20151112PosterJanHellingsV2_definitief
Jan Hellings
 
An Adaptive Evaluation System to Test Student Caliber using Item Response Theory
An Adaptive Evaluation System to Test Student Caliber using Item Response TheoryAn Adaptive Evaluation System to Test Student Caliber using Item Response Theory
An Adaptive Evaluation System to Test Student Caliber using Item Response Theory
Editor IJMTER
 
16497 mgt 252
16497 mgt 25216497 mgt 252
Heidelberg presentation
Heidelberg presentationHeidelberg presentation
Heidelberg presentation
npz
 
Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...
Bart Rienties
 
313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx
sameernsn1
 
Topic_6
Topic_6Topic_6
Topic_6
butest
 
Technology-based assessments-special educationNew technologies r.docx
Technology-based assessments-special educationNew technologies r.docxTechnology-based assessments-special educationNew technologies r.docx
Technology-based assessments-special educationNew technologies r.docx
ssuserf9c51d
 
Explanations in Data Systems
Explanations in Data SystemsExplanations in Data Systems
Explanations in Data Systems
Fotis Savva
 
The Anatomy of a 21st Century Educator
The Anatomy of a 21st Century Educator The Anatomy of a 21st Century Educator
The Anatomy of a 21st Century Educator
Simon Bates
 
Learning analytics are more than measurement
Learning analytics are more than measurementLearning analytics are more than measurement
Learning analytics are more than measurement
Dragan Gasevic
 
LACE Spring Briefing - Learning analytics are more than measurement
LACE Spring Briefing - Learning analytics are more than measurementLACE Spring Briefing - Learning analytics are more than measurement
LACE Spring Briefing - Learning analytics are more than measurement
LACE Project
 
ITEM Analysis BNHS
ITEM Analysis BNHSITEM Analysis BNHS
ITEM Analysis BNHS
AlJohnEspejo2
 
Biometry.docx
Biometry.docxBiometry.docx
Biometry.docx
zehiwot hone
 
The power of learning analytics for UCL: lessons learned from the Open Univer...
The power of learning analytics for UCL: lessons learned from the Open Univer...The power of learning analytics for UCL: lessons learned from the Open Univer...
The power of learning analytics for UCL: lessons learned from the Open Univer...
Bart Rienties
 
Dad (Data Analysis And Design)
Dad (Data Analysis And Design)Dad (Data Analysis And Design)
Dad (Data Analysis And Design)
Jill Lyons
 
IRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image ProcessingIRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image Processing
IRJET Journal
 
NCME_040916
NCME_040916NCME_040916
NCME_040916
Shauna Sweet
 
BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC EDUCATION AND H...
BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC  EDUCATION AND H...BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC  EDUCATION AND H...
BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC EDUCATION AND H...
Novita Ajeng Primantari
 
Data Clustering in Education for Students
Data Clustering in Education for StudentsData Clustering in Education for Students
Data Clustering in Education for Students
IRJET Journal
 

Similar to What makes a good adaptive testing program (20)

20151112PosterJanHellingsV2_definitief
20151112PosterJanHellingsV2_definitief20151112PosterJanHellingsV2_definitief
20151112PosterJanHellingsV2_definitief
 
An Adaptive Evaluation System to Test Student Caliber using Item Response Theory
An Adaptive Evaluation System to Test Student Caliber using Item Response TheoryAn Adaptive Evaluation System to Test Student Caliber using Item Response Theory
An Adaptive Evaluation System to Test Student Caliber using Item Response Theory
 
16497 mgt 252
16497 mgt 25216497 mgt 252
16497 mgt 252
 
Heidelberg presentation
Heidelberg presentationHeidelberg presentation
Heidelberg presentation
 
Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...
 
313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx
 
Topic_6
Topic_6Topic_6
Topic_6
 
Technology-based assessments-special educationNew technologies r.docx
Technology-based assessments-special educationNew technologies r.docxTechnology-based assessments-special educationNew technologies r.docx
Technology-based assessments-special educationNew technologies r.docx
 
Explanations in Data Systems
Explanations in Data SystemsExplanations in Data Systems
Explanations in Data Systems
 
The Anatomy of a 21st Century Educator
The Anatomy of a 21st Century Educator The Anatomy of a 21st Century Educator
The Anatomy of a 21st Century Educator
 
Learning analytics are more than measurement
Learning analytics are more than measurementLearning analytics are more than measurement
Learning analytics are more than measurement
 
LACE Spring Briefing - Learning analytics are more than measurement
LACE Spring Briefing - Learning analytics are more than measurementLACE Spring Briefing - Learning analytics are more than measurement
LACE Spring Briefing - Learning analytics are more than measurement
 
ITEM Analysis BNHS
ITEM Analysis BNHSITEM Analysis BNHS
ITEM Analysis BNHS
 
Biometry.docx
Biometry.docxBiometry.docx
Biometry.docx
 
The power of learning analytics for UCL: lessons learned from the Open Univer...
The power of learning analytics for UCL: lessons learned from the Open Univer...The power of learning analytics for UCL: lessons learned from the Open Univer...
The power of learning analytics for UCL: lessons learned from the Open Univer...
 
Dad (Data Analysis And Design)
Dad (Data Analysis And Design)Dad (Data Analysis And Design)
Dad (Data Analysis And Design)
 
IRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image ProcessingIRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image Processing
 
NCME_040916
NCME_040916NCME_040916
NCME_040916
 
BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC EDUCATION AND H...
BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC  EDUCATION AND H...BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC  EDUCATION AND H...
BANK INFORMATION SYSTEM DESIGN PROBLEM IN CENTRAL ELECTRONIC EDUCATION AND H...
 
Data Clustering in Education for Students
Data Clustering in Education for StudentsData Clustering in Education for Students
Data Clustering in Education for Students
 

Recently uploaded

HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
deepaannamalai16
 
How to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in useHow to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in use
Celine George
 
Accounting for Restricted Grants When and How To Record Properly
Accounting for Restricted Grants  When and How To Record ProperlyAccounting for Restricted Grants  When and How To Record Properly
Accounting for Restricted Grants When and How To Record Properly
TechSoup
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
indexPub
 
CIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdfCIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdf
blueshagoo1
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
Steve Thomason
 
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptxContiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Kalna College
 
Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
nitinpv4ai
 
BPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end examBPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end exam
sonukumargpnirsadhan
 
The basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptxThe basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptx
heathfieldcps1
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
Kalna College
 
Educational Technology in the Health Sciences
Educational Technology in the Health SciencesEducational Technology in the Health Sciences
Educational Technology in the Health Sciences
Iris Thiele Isip-Tan
 
INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION
INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION
INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION
ShwetaGawande8
 
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGHKHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
shreyassri1208
 
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
andagarcia212
 
Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)
nitinpv4ai
 
Ch-4 Forest Society and colonialism 2.pdf
Ch-4 Forest Society and colonialism 2.pdfCh-4 Forest Society and colonialism 2.pdf
Ch-4 Forest Society and colonialism 2.pdf
lakshayrojroj
 
220711130082 Srabanti Bag Internet Resources For Natural Science
220711130082 Srabanti Bag Internet Resources For Natural Science220711130082 Srabanti Bag Internet Resources For Natural Science
220711130082 Srabanti Bag Internet Resources For Natural Science
Kalna College
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
PsychoTech Services
 
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.pptLevel 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Henry Hollis
 

Recently uploaded (20)

HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
 
How to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in useHow to Fix [Errno 98] address already in use
How to Fix [Errno 98] address already in use
 
Accounting for Restricted Grants When and How To Record Properly
Accounting for Restricted Grants  When and How To Record ProperlyAccounting for Restricted Grants  When and How To Record Properly
Accounting for Restricted Grants When and How To Record Properly
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
 
CIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdfCIS 4200-02 Group 1 Final Project Report (1).pdf
CIS 4200-02 Group 1 Final Project Report (1).pdf
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
 
Contiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptxContiguity Of Various Message Forms - Rupam Chandra.pptx
Contiguity Of Various Message Forms - Rupam Chandra.pptx
 
Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
 
BPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end examBPSC-105 important questions for june term end exam
BPSC-105 important questions for june term end exam
 
The basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptxThe basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptx
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
 
Educational Technology in the Health Sciences
Educational Technology in the Health SciencesEducational Technology in the Health Sciences
Educational Technology in the Health Sciences
 
INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION
INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION
INTRODUCTION TO HOSPITALS & AND ITS ORGANIZATION
 
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGHKHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
KHUSWANT SINGH.pptx ALL YOU NEED TO KNOW ABOUT KHUSHWANT SINGH
 
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
欧洲杯下注-欧洲杯下注押注官网-欧洲杯下注押注网站|【​网址​🎉ac44.net🎉​】
 
Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)
 
Ch-4 Forest Society and colonialism 2.pdf
Ch-4 Forest Society and colonialism 2.pdfCh-4 Forest Society and colonialism 2.pdf
Ch-4 Forest Society and colonialism 2.pdf
 
220711130082 Srabanti Bag Internet Resources For Natural Science
220711130082 Srabanti Bag Internet Resources For Natural Science220711130082 Srabanti Bag Internet Resources For Natural Science
220711130082 Srabanti Bag Internet Resources For Natural Science
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
 
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.pptLevel 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
 

What makes a good adaptive testing program

  • 2. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage Advantage of IRT based CAT 2 On a 40 question exam with dichotomous scoring (wrong or right), the total number of questions you might need to develop is 240 = 1.01 × 1012 . On a well designed IRT based CAT, the total number of questions you might need to develop is ≈ 400.
  • 3. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage Who is your market? 3 Students – Computer adaptive teaching/ learning? Blended learning with MOOCs? Schools – Adaptive homework? Computer based in school exams? Formative assessment? Exam Board – Professionalize organizations? Corporations? Government organizations?
  • 4. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage Background 4 What are you trying to measure? How is it manifest? What questions tests this? Is this question valid? Is this question reliable? Is this question fair?
  • 5. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage General Characteristics of any IRT CAT 5 Item bank Development Pre-Test Items Prioritizing Publishing CAT Maintaining CAT Qualification Specification Reliability & Validity Differential Item Analysis (DIF) Content Balancing Item Selection Percentile Ranking Item Calibration Communicate Results Standard Exam Conditions Exam Security (Exposure) Validity Testing Termination Criteria Better than existing solution?
  • 6. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage Differential Item Functioning (DIF): Guessing (𝑥) 6 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% -4 -3 -2 -1 0 1 2 3 4 LikelihoodofGettingaQuestionCorrect Ability of Test Taker (z-score) Item Response Curves Standard Normal Item 1 Item 2 𝑃𝑖𝑗 = 𝑃 𝑢 = 1 𝑥, 𝛼, 𝜃, 𝛿 = 𝑥 + 1 − 𝑥 𝑒1.702𝛼 𝜃−𝛿 1 + 𝑒1.702𝛼 𝜃−𝛿 , 𝛼 𝑥 𝛿 Normal 1 0 0 Item 1 1 0 0.2 Item 2 1 0 0.07 where i represents the ith item and j represents the jth test taker.
  • 7. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage Differential Item Functioning (DIF): Difficulty (𝛿) 7 𝑃𝑖𝑗 = 𝑃 𝑢 = 1 𝑥, 𝛼, 𝜃, 𝛿 = 𝑥 + (1 − 𝑥) 𝑒1.702𝛼(𝜃−𝛿) 1 + 𝑒1.702𝛼(𝜃−𝛿) 𝛼 𝑥 𝛿 Item 1 1 0 0 Item 2 1 0 -1 Item 3 1 0 1 where i represents the ith item and j represents the jth test taker. 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% -4 -3 -2 -1 0 1 2 3 4 LikelihoodofGettingaQuestionCorrect(percent) Ability of Test Taker (z-score) Item Response Curves Item 1 Item 2 Item 3
  • 8. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage Differential Item Functioning (DIF): Discrimination (𝛼) 8 𝑃𝑖𝑗 = 𝑃 𝑢 = 1 𝑥, 𝛼, 𝜃, 𝛿 = 𝑥 + (1 − 𝑥) 𝑒1.702𝛼(𝜃−𝛿) 1 + 𝑒1.702𝛼(𝜃−𝛿) 𝛼 𝑥 𝛿 Item 1 0.4 0 0 Item 2 0.8 0 -1 Item 3 1.2 0 1 where i represents the ith item and j represents the jth test taker. 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% -4 -3 -2 -1 0 1 2 3 4 LikelihoodofGettingaQuestionCorrect(percent) Ability of Test Taker (z-score) Item Response Curves Item 1 Item 2 Item 3
  • 9. www.twosigmas.com twitter.com/twosigmas_ facebook/twosigmaspage References: 9 • Baker, F. B. and Kim, S. (2004). Item Response Theory Parameter Estimation Techniques, 2nd Edition, Revised and Expanded. New York, NY: CRC Press, Taylor and Francis Group. • Bao, Han, Dayton, C. Mitchell, & Hendrickson, Amy B. (2009). Differential Item Functioning Amplification and Cancellation in a Reading Test. Practical Assessment, Research & Evaluation, 14(19). Available online: http://pareonline.net/getvn.asp?v=14&n=19 • Bergstorm, B. A., Gershon, R.C., and Brown, W. L. (1993) Differential Item Functioning vs. Differential Test Functioning. Paper Presented at the Annual Meeting of the American Educational Research Association (Atlanta, GA. April 12 - 16) Retrieved on February 20, 2011 from http://www.eric.ed.gov/PDFS/ED377227.pdf • Birdsall, M (2011) Implementing Computer Adaptive Testing to Improve Achievement Opportunities. Ofqual, Coventry. Online at: http://webarchive.nationalarchives.gov.uk/+/http://www.ofqual.gov.uk/files/2011-06- 15-implementing-computer-adaptive-testing-to-improve-achievement- opportunities.pdf • Bowles, R. and Pommerich, M. (2001). An Examination of Item Review on a CAT Using the Specific Information Item Selection Algorithm. Paper presented at the Annual Meeting of the National Council on Measurement in Education, Seattle, WA. • Childs, Ruth A. & Andrew P. Jaciw (2003). Matrix sampling of items in large-scale assessments. Practical Assessment, Research & Evaluation, 8(16). Retrieved February 22, 2011 from http://PAREonline.net/getvn.asp?v=8&n=16 • de Ayala, R. J. (2009). The Theory and Practice of Item Response Theory. New York, NY: The Guildfor Press. • He, Q. (2010) Maintaining Standards in on Demand Testing Using Item Response Theory. Ofqual, Coventry. Retrieved on February 10, 2011, from http://e- assessment.org.uk/images/uploads/s-docs/Ofqual-10-4724-Maintaining- standards.pdf • Newton, Paul E. (2007) 'Clarifying the purposes of educational assessment', Assessment in Education: Principles, Policy & Practice, 14:2, 149 -170. Retrieved February 20, 2011 from http://dx.doi.org/10.1080/09695940701478321 • Pommerich, M., Segall, D.O., & Moreno, K.E. (2009). The nine lives of CAT-ASVAB: Innovations and revelations. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. Retrieved on February 15, 2011 from www.psych.umn.edu/psylabs/CATCentral/ • Rudner, L. M. (2007). Implementing the Graduate Management Admission Test® computerized adaptive test. In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing. Retrieved January 10, 2010 from www.psych.umn.edu/psylabs/CATCentral/ • Segall, D. O. and Moreno, K. E. (1999) Development of the CAT-ASVAB. In F. Drasgow & J. B. Olson-Buchanan (Eds.). Innovations in Computerized Assessment (pp. 35—65). Hillsdale, NJ: Lawrence Erlbaum Associates. Retrieved on February 20, 2011 from http://www.danielsegall.com/catasvab.pdf • van der Linden, W. J. and Glas, A. W. (eds.), (2010) Elements of Computer Adaptive Testing: Statistics. Chapters 4, 10, 17, and page 349. London, UK: Springer Science + Business Media LLC. • Wise, L. L., Curran, L. T., & McBride, J. R. (1997). CAT-ASVAB Cost and Benefit Analyses. In W. A. Sands, B. K. Waters, & J. R. McBride (Eds.), Computerized adaptive testing: From inquiry to operation (pp. 227-236). Washington, DC: American Psychological Association. • Zumbo, B. D. (2007). Three Generations of DIF Analyses: Considering Where it Has Been, Where it is Now, and Where it is Going. Language Assessment Quarterly, 4(2), 223-233, Lawrence Erlbaum Associates, Publishers. Retrieved February 20, 2011 from http://educ.ubc.ca/faculty/zumbo/papers/Zumbo_LAQ_reprint.pdf