Item Selection Rules in Computerized Adaptive Testing

The item selection rule (ISR) most commonly used in computerized adaptive testing (CAT) is to select the item with maximum Fisher information for the current trait estimation (PFI). Several alternative ISRs have been proposed. Among them, Fisher information considered in an interval (FI*I), Fisher information weighted with the likelihood function (FI*L), Kullback-Leibler information considered in an interval (KL*I) and Kullback-Leibler weighted with the likelihood function (KL*L) have shown a greater precision of trait estimation at the early stages of CAT. A new ISR is proposed, Fisher information by interval with geometric mean (FI*IG), which tries to rectify some detected problems in FI*I. We evaluate accuracy and item bank security for these six ISRs. FI*IG is the only ISR which simultaneously outperforms PFI in both variables. For the other ISRs, there seems to be a trade-off between accuracy and security, PFI being the one with worse accuracy and greater security, and the ISRs using the likelihood function the reverse.

Download Full-text

A Dynamic Stratification Method for Improving Trait Estimation in Computerized Adaptive Testing Under Item Exposure Control

Applied Psychological Measurement ◽

10.1177/0146621619843820 ◽

2019 ◽

Vol 44 (3) ◽

pp. 182-196

Author(s):

Jyun-Hong Chen ◽

Hsiu-Yi Chao ◽

Shu-Ying Chen

Keyword(s):

Computerized Adaptive Testing ◽

Item Difficulty ◽

Adaptive Testing ◽

Item Selection ◽

Exposure Control ◽

Item Exposure ◽

Stratification Method ◽

High Discrimination ◽

Item Exposure Control ◽

Trait Estimation

When computerized adaptive testing (CAT) is under stringent item exposure control, the precision of trait estimation will substantially decrease. A new item selection method, the dynamic Stratification method based on Dominance Curves (SDC), which is aimed at improving trait estimation, is proposed to mitigate this problem. The objective function of the SDC in item selection is to maximize the sum of test information for all examinees rather than maximizing item information for individual examinees at a single-item administration, as in conventional CAT. To achieve this objective, the SDC uses dominance curves to stratify an item pool into strata with the number being equal to the test length to precisely and accurately increase the quality of the administered items as the test progresses, reducing the likelihood that a high-discrimination item will be administered to an examinee whose ability is not close to the item difficulty. Furthermore, the SDC incorporates a dynamic process for on-the-fly item–stratum adjustment to optimize the use of quality items. Simulation studies were conducted to investigate the performance of the SDC in CAT under item exposure control at different levels of severity. According to the results, the SDC can efficiently improve trait estimation in CAT through greater precision and more accurate trait estimation than those generated by other methods (e.g., the maximum Fisher information method) in most conditions.

Download Full-text

Varying the Valuating Function and the Presentable Bank in Computerized Adaptive Testing

The Spanish Journal of Psychology ◽

10.5209/rev_sjop.2011.v14.n1.45 ◽

2011 ◽

Vol 14 (1) ◽

pp. 500-508 ◽

Cited By ~ 3

Author(s):

Juan Ramón Barrada ◽

Francisco José Abad ◽

Julio Olea

Keyword(s):

Fisher Information ◽

Computerized Adaptive Testing ◽

Adaptive Testing ◽

High Accuracy ◽

Item Bank ◽

Information Function ◽

Matching Criterion ◽

Opposite Pole ◽

Trait Level ◽

Function Constant

In computerized adaptive testing, the most commonly used valuating function is the Fisher information function. When the goal is to keep item bank security at a maximum, the valuating function that seems most convenient is the matching criterion, valuating the distance between the estimated trait level and the point where the maximum of the information function is located. Recently, it has been proposed not to keep the same valuating function constant for all the items in the test. In this study we expand the idea of combining the matching criterion with the Fisher information function. We also manipulate the number of strata into which the bank is divided. We find that the manipulation of the number of items administered with each function makes it possible to move from the pole of high accuracy and low security to the opposite pole. It is possible to greatly improve item bank security with much fewer losses in accuracy by selecting several items with the matching criterion. In general, it seems more appropriate not to stratify the bank.

Download Full-text

Optimizing the Use of Response Times for Item Selection in Computerized Adaptive Testing

Journal of Educational and Behavioral Statistics ◽

10.3102/1076998617723642 ◽

2017 ◽

Vol 43 (2) ◽

pp. 135-158 ◽

Cited By ~ 8

Author(s):

Edison M. Choe ◽

Justin L. Kern ◽

Hua-Hua Chang

Keyword(s):

Fisher Information ◽

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Alternative Methods ◽

Item Selection ◽

Estimation Accuracy ◽

Testing Time ◽

Exposure Control ◽

Simple Modification ◽

Measurement Efficiency

Despite common operationalization, measurement efficiency of computerized adaptive testing should not only be assessed in terms of the number of items administered but also the time it takes to complete the test. To this end, a recent study introduced a novel item selection criterion that maximizes Fisher information per unit of expected response time (RT), which was shown to effectively reduce the average completion time for a fixed-length test with minimal decrease in the accuracy of ability estimation. As this method also resulted in extremely unbalanced exposure of items, however, a-stratification with b-blocking was recommended as a means for counterbalancing. Although exceptionally effective in this regard, it comes at substantial costs of attenuating the reduction in average testing time, increasing the variance of testing times, and further decreasing estimation accuracy. Therefore, this article investigated several alternative methods for item exposure control, of which the most promising was a simple modification of maximizing Fisher information per unit of centered expected RT. The key advantage of the proposed method is the flexibility in choosing a centering value according to a desired distribution of testing times and level of exposure control. Moreover, the centered expected RT can be exponentially weighted to calibrate the degree of measurement precision. The results of extensive simulations, with item pools and examinees that are both simulated and real, demonstrate that optimally chosen centering and weighting values can markedly reduce the mean and variance of both testing times and test overlap, all without much compromise in estimation accuracy.

Download Full-text

A Comparison of Item Selection Methods for Controlling Exposure Rate in Cognitive Diagnostic Computerized Adaptive Testing

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2013.00694 ◽

2013 ◽

Vol 45 (6) ◽

pp. 694-703

Author(s):

Xiuzhen MAO ◽

Tao XIN

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Exposure Rate ◽

Selection Methods

Download Full-text

Dynamic and Comprehensive Item Selection Strategies for Computerized Adaptive Testing Based on Graded Response Model

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2012.00400 ◽

2013 ◽

Vol 44 (3) ◽

pp. 400-412 ◽

Cited By ~ 1

Author(s):

Fen LUO ◽

Shu-Liang DING ◽

Xiao-Qing WANG

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Response Model ◽

Graded Response Model ◽

Selection Strategies ◽

Graded Response

Download Full-text

Item Selection Strategies for Computerized Adaptive Testing with the Generalized Partial Credit Model

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2008.00618 ◽

2008 ◽

Vol 40 (5) ◽

pp. 618-625 ◽

Cited By ~ 2

Author(s):

Zhen LIU

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Partial Credit Model ◽

Partial Credit ◽

Generalized Partial Credit Model ◽

Selection Strategies ◽

Generalized Partial Credit

Download Full-text

Computerized Adaptive Testing for Sleep Disorders: Development of An Item Bank and Validation in A Simulated Study

10.21203/rs.3.rs-18576/v1 ◽

2020 ◽

Author(s):

Menghua She ◽

Yaling Li ◽

Dongbo Tu ◽

Yan Cai

Keyword(s):

Sleep Disorders ◽

Computerized Adaptive Testing ◽

Assessment Tool ◽

Adaptive Testing ◽

Item Bank ◽

Accurate Assessment ◽

Item Pool ◽

Psychometric Characteristics ◽

Predictive Utility ◽

Item Fit

Abstract Background: As more and more people suffer from sleep disorders, developing an efficient, cheap and accurate assessment tool for screening sleep disorders is becoming more urgent. This study developed a computerized adaptive testing for sleep disorders (CAT-SD). Methods: A large sample of 1,304 participants was recruited to construct the item pool of CAT-SD and to investigate the psychometric characteristics of CAT-SD. More specifically, firstly the analyses of unidimensionality, model fit, item fit, item discrimination parameter and differential item functioning (DIF) were conducted to construct a final item pool which meets the requirements of item response theory (IRT) measurement. In addition, a simulated CAT study with real response data of participants was performed to investigate the psychometric characteristics of CAT-SD, including reliability, validity and predictive utility (sensitivity and specificity). Results: The final unidimensional item bank of the CAT-SD not only had good item fit, high discrimination and no DIF; Moreover, it had acceptable reliability, validity and predictive utility. Conclusions: The CAT-SD could be used as an effective and accurate assessment tool for measuring individuals' severity of the sleep disorders and offers a bran-new perspective for screening of sleep disorders with psychological scales.

Download Full-text

Towards Association Rule-Based Item Selection Strategy in Computerized Adaptive Testing

Studies in Computational Intelligence - New Perspectives on Enterprise Decision-Making Applying Artificial Intelligence Techniques ◽

10.1007/978-3-030-71115-3_2 ◽

2021 ◽

pp. 27-54

Author(s):

Josué Pacheco-Ortiz ◽

Lisbeth Rodríguez-Mazahua ◽

Jezreel Mejía-Miranda ◽

Isaac Machorro-Cano ◽

Ulises Juárez-Martínez

Keyword(s):

Association Rule ◽

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Selection Strategy ◽

Rule Based

Download Full-text

Calibration of the food parenting practice (FPP) item bank: tools for improving the measurement of food parenting practices of parents of 5–12-year-old children

International Journal of Behavioral Nutrition and Physical Activity ◽

10.1186/s12966-020-01049-9 ◽

2020 ◽

Vol 17 (1) ◽

Author(s):

Louise C. Mâsse ◽

Teresia M. O’Connor ◽

Yingyi Lin ◽

Sheryl O. Hughes ◽

Claire N. Tugault-Lafleur ◽

...

Keyword(s):

Conceptual Framework ◽

Parenting Practices ◽

Computerized Adaptive Testing ◽

Short Form ◽

Adaptive Testing ◽

Item Bank ◽

Short Version ◽

Parenting Practice ◽

Practice Item ◽

Improve Measurement

Abstract Purpose There has been a call to improve measurement rigour and standardization of food parenting practices measures, as well as aligning the measurement of food parenting practices with the parenting literature. Drawing from an expert-informed conceptual framework assessing three key domains of food parenting practices (autonomy promotion, control, and structure), this study combined factor analytic methods with Item Response Modeling (IRM) methodology to psychometrically validate responses to the Food Parenting Practice item bank. Methods A sample of 799 Canadian parents of 5–12-year-old children completed the Food Parenting Practice item bank (129 items measuring 17 constructs). The factorial structure of the responses to the item bank was assessed with confirmatory factor analysis (CFA), confirmatory bi-factor item analysis, and IRM. Following these analyses, differential Item Functioning (DIF) and Differential Response Functioning (DRF) analyses were then used to test invariance properties by parents’ sex, income and ethnicity. Finally, the efficiency of the item bank was examined using computerized adaptive testing simulations to identify the items to include in a short form. Results Overall, the expert-informed conceptual framework was predominantly supported by the CFA as it retained the same 17 constructs included in the conceptual framework with the exception of the access/availability and permissive constructs which were respectively renamed covert control and accommodating the child to better reflect the content of the final solution. The bi-factor item analyses and IRM analyses revealed that the solution could be simplified to 11 unidimensional constructs and the full item bank included 86-items (empirical reliability from 0.78 to 0.96, except for 1 construct) and the short form had 48 items. Conclusion Overall the food parenting practice item bank has excellent psychometric properties. The item bank includes an expanded version and short version to meet various study needs. This study provides more efficient tools for assessing how food parenting practices influence child dietary behaviours. Next steps are to use the IRM calibrated item bank and draw on computerized adaptive testing methodology to administer the item bank and provide flexibility in item selection.

Download Full-text

An Efficiency Balanced Information Criterion for Item Selection in Computerized Adaptive Testing

Journal of Educational Measurement ◽

10.1111/j.1745-3984.2012.00173.x ◽

2012 ◽

Vol 49 (3) ◽

pp. 225-246 ◽

Cited By ~ 5

Author(s):

Kyung T. Han

Keyword(s):

Computerized Adaptive Testing ◽

Information Criterion ◽

Adaptive Testing ◽

Item Selection ◽

Balanced Information

Download Full-text