CATBOOK Computerized Adaptive Testing: From Inquiry to Operation

Summary: Item parameters for several hundreds of items were estimated based on empirical data from several thousands of subjects. The logistic one-parameter (1PL) and two-parameter (2PL) model estimates were evaluated. However, model fit showed that only a subset of items complied sufficiently, so that the remaining ones were assembled in well-fitting item banks. In several simulation studies 5000 simulated responses were generated in accordance with a computerized adaptive test procedure along with person parameters. A general reliability of .80 or a standard error of measurement of .44 was used as a stopping rule to end CAT testing. We also recorded how often each item was used by all simulees. Person-parameter estimates based on CAT correlated higher than .90 with true values simulated. For all 1PL fitting item banks most simulees used more than 20 items but less than 30 items to reach the pre-set level of measurement error. However, testing based on item banks that complied to the 2PL revealed that, on average, only 10 items were sufficient to end testing at the same measurement error level. Both clearly demonstrate the precision and economy of computerized adaptive testing. Empirical evaluations from everyday uses will show whether these trends will hold up in practice. If so, CAT will become possible and reasonable with some 150 well-calibrated 2PL items.

Download Full-text

Methods for Restricting Maximum Exposure Rate in Computerized Adaptative Testing

Methodology ◽

10.1027/1614-2241.3.1.14 ◽

2007 ◽

Vol 3 (1) ◽

pp. 14-23 ◽

Cited By ~ 9

Author(s):

Juan Ramon Barrada ◽

Julio Olea ◽

Vicente Ponsoda

Keyword(s):

Measurement Accuracy ◽

Computerized Adaptive Testing ◽

Computation Time ◽

Adaptive Testing ◽

Exposure Rate ◽

Control Parameters ◽

The Impact ◽

Two Alternatives ◽

Selection Of ◽

Maximum Exposure

Abstract. The Sympson-Hetter (1985) method provides a means of controlling maximum exposure rate of items in Computerized Adaptive Testing. Through a series of simulations, control parameters are set that mark the probability of administration of an item on being selected. This method presents two main problems: it requires a long computation time for calculating the parameters and the maximum exposure rate is slightly above the fixed limit. Van der Linden (2003) presented two alternatives which appear to solve both of the problems. The impact of these methods in the measurement accuracy has not been tested yet. We show how these methods over-restrict the exposure of some highly discriminating items and, thus, the accuracy is decreased. It also shown that, when the desired maximum exposure rate is near the minimum possible value, these methods offer an empirical maximum exposure rate clearly above the goal. A new method, based on the initial estimation of the probability of administration and the probability of selection of the items with the restricted method ( Revuelta & Ponsoda, 1998 ), is presented in this paper. It can be used with the Sympson-Hetter method and with the two van der Linden's methods. This option, when used with Sympson-Hetter, speeds the convergence of the control parameters without decreasing the accuracy.

Download Full-text

Computerized adaptive testing: From inquiry to operation.

10.1037/10244-000 ◽

1997 ◽

Cited By ~ 48

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing

Download Full-text

A Comparison of Item Selection Methods for Controlling Exposure Rate in Cognitive Diagnostic Computerized Adaptive Testing

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2013.00694 ◽

2013 ◽

Vol 45 (6) ◽

pp. 694-703

Author(s):

Xiuzhen MAO ◽

Tao XIN

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Exposure Rate ◽

Selection Methods

Download Full-text

Dynamic and Comprehensive Item Selection Strategies for Computerized Adaptive Testing Based on Graded Response Model

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2012.00400 ◽

2013 ◽

Vol 44 (3) ◽

pp. 400-412 ◽

Cited By ~ 1

Author(s):

Fen LUO ◽

Shu-Liang DING ◽

Xiao-Qing WANG

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Response Model ◽

Graded Response Model ◽

Selection Strategies ◽

Graded Response

Download Full-text

Application of Online Calibration Technique in Computerized Adaptive Testing

Advances in Psychological Science ◽

10.3724/sp.j.1042.2013.01883 ◽

2013 ◽

Vol 21 (10) ◽

pp. 1883-1892

Author(s):

Ping CHEN ◽

Jiahui ZHANG ◽

Tao XIN

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Calibration Technique ◽

Online Calibration

Download Full-text

a-Stratified Methods Combining Item Exposure Control and General Test Overlap in Computerized Adaptive Testing

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2014.00702 ◽

2014 ◽

Vol 46 (5) ◽

pp. 702

Author(s):

Lei GUO ◽

Zhuoran WANG ◽

Feng WANG ◽

Yufang BIAN

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Exposure Control ◽

Item Exposure ◽

General Test ◽

Item Exposure Control ◽

Test Overlap

Download Full-text

Item Selection Strategies for Computerized Adaptive Testing with the Generalized Partial Credit Model

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2008.00618 ◽

2008 ◽

Vol 40 (5) ◽

pp. 618-625 ◽

Cited By ~ 2

Author(s):

Zhen LIU

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Partial Credit Model ◽

Partial Credit ◽

Generalized Partial Credit Model ◽

Selection Strategies ◽

Generalized Partial Credit

Download Full-text

On some Issues in the Accelerated CAT-ASVAB (Computerized Adaptive Testing-Armed Services Vocational Aptitude Battery) Project

10.21236/ada178558 ◽

1986 ◽

Author(s):

D. R. Divgi

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Armed Services

Download Full-text

Measurement Precision and Efficiency of Computerized Adaptive Testing for the Activities-specific Balance Confidence Scale in People With Stroke

Physical Therapy ◽

10.1093/ptj/pzab020 ◽

2021 ◽

Author(s):

Bryant A Seamon ◽

Steven A Kautz ◽

Craig A Velozo

Keyword(s):

Rasch Model ◽

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Measurement Precision ◽

Strongly Correlated ◽

Computerized Adaptive Test ◽

Balance Confidence ◽

Adaptive Test ◽

The Rasch Model ◽

Confidence Scale

Abstract Objective Administrative burden often prevents clinical assessment of balance confidence in people with stroke. A computerized adaptive test (CAT) version of the Activities-specific Balance Confidence Scale (ABC CAT) can dramatically reduce this burden. The objective of this study was to test balance confidence measurement precision and efficiency in people with stroke with an ABC CAT. Methods We conducted a retrospective cross-sectional simulation study with data from 406 adults approximately 2-months post-stroke in the Locomotor-Experience Applied Post-Stroke (LEAPS) trial. Item parameters for CAT calibration were estimated with the Rasch model using a random sample of participants (n = 203). Computer simulation was used with response data from remaining 203 participants to evaluate the ABC CAT algorithm under varying stopping criteria. We compared estimated levels of balance confidence from each simulation to actual levels predicted from the Rasch model (Pearson correlations and mean standard error (SE)). Results Results from simulations with number of items as a stopping criterion strongly correlated with actual ABC scores (full item, r = 1, 12-item, r = 0.994; 8-item, r = 0.98; 4-item, r = 0.929). Mean SE increased with decreasing number of items administered (full item, SE = 0.31; 12-item, SE = 0.33; 8-item, SE = 0.38; 4-item, SE = 0.49). A precision-based stopping rule (mean SE = 0.5) also strongly correlated with actual ABC scores (r = .941) and optimized the relationship between number of items administrated with precision (mean number of items 4.37, range [4–9]). Conclusions An ABC CAT can determine accurate and precise measures of balance confidence in people with stroke with as few as 4 items. Individuals with lower balance confidence may require a greater number of items (up to 9) and attributed to the LEAPS trial excluding more functionally impaired persons. Impact Statement Computerized adaptive testing can drastically reduce the ABC’s test administration time while maintaining accuracy and precision. This should greatly enhance clinical utility, facilitating adoption of clinical practice guidelines in stroke rehabilitation. Lay Summary If you have had a stroke, your physical therapist will likely test your balance confidence. A computerized adaptive test version of the ABC scale can accurately identify balance with as few as 4 questions, which takes much less time.

Download Full-text