Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

The uncertainty arising from item parameter estimation is often not negligible and must be accounted for when calculating latent variable (LV) scores in item response theory (IRT). It is particularly so when the calibration sample size is limited and/or the calibration IRT model is complex. In the current work, we treat two-stage IRT scoring as a predictive inference problem: The target of prediction is a random variable that follows the true posterior of the LV conditional on the response pattern being scored. Various Bayesian, fiducial, and frequentist prediction intervals of LV scores, which can be obtained from a simple yet generic Monte Carlo recipe, are evaluated and contrasted via simulations based on several measures of prediction quality. An empirical data example is also presented to illustrate the use of candidate methods.

Download Full-text

Postexamination Analysis of Objective Tests Using the Three-Parameter Item Response Theory

Journal of Nursing Measurement ◽

10.1891/1061-3749.22.1.94 ◽

2014 ◽

Vol 22 (1) ◽

pp. 94-105

Author(s):

Mohsen Tavakol ◽

Mohammad Rahimi-Madiseh ◽

Reg Dennick

Keyword(s):

Item Response Theory ◽

Item Response ◽

Item Parameter ◽

Response Theory ◽

Characteristic Curves ◽

Test Characteristic ◽

Test Items ◽

Test Information ◽

Objective Tests

Background and Purpose: Although the importance of item response theory (IRT) has been emphasized in health and medical education, in practice, few psychometricians in nurse education have used these methods to create tests that discriminate well at any level of student ability. The purpose of this study is to evaluate the psychometric properties of a real objective test using three-parameter IRT. Methods: Three-parameter IRT was used to monitor and improve the quality of the test items. Results: Item parameter indices, item characteristic curves (ICCs), test information functions, and test characteristic curves reveal aberrant items which do not assess the construct being measured. Conclusions: The results of this study provide useful information for educators to improve the quality of assessment, teaching strategies, and curricula.

Download Full-text

IPLINK: Multidimensional and Unidimensional Item Parameter Linking in Item Response Theory

Applied Psychological Measurement ◽

10.1177/014662169602000304 ◽

1996 ◽

Vol 20 (3) ◽

pp. 230-230 ◽

Cited By ~ 9

Author(s):

Kevin Lee ◽

T.C. Oshima

Keyword(s):

Item Response Theory ◽

Item Response ◽

Item Parameter ◽

Response Theory

Download Full-text

Sample Size and Test Length for Item Parameter Estimate and Exam Parameter Estimate

Al-Khwarizmi Jurnal Pendidikan Matematika dan Ilmu Pengetahuan Alam ◽

10.24256/jpmipa.v9i1.2384 ◽

2021 ◽

Vol 9 (1) ◽

pp. 69-78

Author(s):

Riswan Riswan

Keyword(s):

Item Response Theory ◽

Sample Size ◽

Item Response ◽

Parameter Estimate ◽

Test Theory ◽

Item Parameter ◽

Parameter Estimates ◽

Test Length ◽

Response Theory ◽

The Stability

The Item Response Theory (IRT) model contains one or more parameters in the model. These parameters are unknown, so it is necessary to predict them. This paper aims (1) to determine the sample size (N) on the stability of the item parameter (2) to determine the length (n) test on the stability of the estimate parameter examinee (3) to determine the effect of the model on the stability of the item and the parameter to examine (4) to find out Effect of sample size and test length on item stability and examinee parameter estimates (5) Effect of sample size, test length, and model on item stability and examinee parameter estimates. This paper is a simulation study in which the latent trait (q) sample simulation is derived from a standard normal population of ~ N (0.1), with a specific Sample Size (N) and test length (n) with the 1PL, 2PL and 3PL models using Wingen. Item analysis was carried out using the classical theory test approach and modern test theory. Item Response Theory and data were analyzed through software R with the ltm package. The results showed that the larger the sample size (N), the more stable the estimated parameter. For the length test, which is the greater the test length (n), the more stable the estimated parameter (q).

Download Full-text

Applications of the Analytically Derived Asymptotic Standard Errors of Item Response Theory Item Parameter Estimates

Journal of Educational Measurement ◽

10.1111/j.1745-3984.2004.tb01109.x ◽

2004 ◽

Vol 41 (2) ◽

pp. 85-117 ◽

Cited By ~ 11

Author(s):

Yuan H. Li ◽

Robert W. Lissitz

Keyword(s):

Item Response Theory ◽

Item Response ◽

Item Parameter ◽

Standard Errors ◽

Parameter Estimates ◽

Response Theory ◽

Asymptotic Standard Errors ◽

Item Parameter Estimates

Download Full-text

A Comparison of Item Parameter Standard Error Estimation Procedures for Unidimensional and Multidimensional Item Response Theory Modeling

Educational and Psychological Measurement ◽

10.1177/0013164413500277 ◽

2013 ◽

Vol 74 (1) ◽

pp. 58-76 ◽

Cited By ~ 19

Author(s):

Insu Paek ◽

Li Cai

Keyword(s):

Item Response Theory ◽

Item Response ◽

Error Estimation ◽

Standard Error ◽

Multidimensional Item Response Theory ◽

Item Parameter ◽

Multidimensional Item Response ◽

Response Theory ◽

Estimation Procedures ◽

Standard Error Estimation

Download Full-text

Parameter Recovery in Multidimensional Item Response Theory Models Under Complexity and Nonnormality

Applied Psychological Measurement ◽

10.1177/0146621617707507 ◽

2017 ◽

Vol 41 (7) ◽

pp. 530-544 ◽

Cited By ~ 3

Author(s):

Dubravka Svetina ◽

Arturo Valdivia ◽

Stephanie Underhill ◽

Shenghai Dai ◽

Xiaolin Wang

Keyword(s):

Item Response Theory ◽

Item Response ◽

Computerized Adaptive Testing ◽

Latent Trait ◽

Item Parameter ◽

Multidimensional Item Response ◽

Response Theory ◽

Parameter Recovery ◽

Assessment Development ◽

Latent Traits

Information about the psychometric properties of items can be highly useful in assessment development, for example, in item response theory (IRT) applications and computerized adaptive testing. Although literature on parameter recovery in unidimensional IRT abounds, less is known about parameter recovery in multidimensional IRT (MIRT), notably when tests exhibit complex structures or when latent traits are nonnormal. The current simulation study focuses on investigation of the effects of complex item structures and the shape of examinees’ latent trait distributions on item parameter recovery in compensatory MIRT models for dichotomous items. Outcome variables included bias and root mean square error. Results indicated that when latent traits were skewed, item parameter recovery was generally adversely impacted. In addition, the presence of complexity contributed to decreases in the precision of parameter recovery, particularly for discrimination parameters along one dimension when at least one latent trait was generated as skewed.

Download Full-text

Use of Restricted Item Response Theory Models for Examining the Stability of Item Parameter Estimates Over Time

Applied Measurement in Education ◽

10.1207/s15324818ame0402_3 ◽

1991 ◽

Vol 4 (2) ◽

pp. 125-141 ◽

Cited By ~ 9

Author(s):

Clement A. Stone ◽

Suzanne Lane

Keyword(s):

Item Response Theory ◽

Item Response ◽

Item Parameter ◽

Parameter Estimates ◽

Response Theory ◽

Item Parameter Estimates ◽

The Stability ◽

Item Response Theory Models ◽

Over Time

Download Full-text

Parameter Estimation Accuracy of the Effort-Moderated Item Response Theory Model Under Multiple Assumption Violations

Educational and Psychological Measurement ◽

10.1177/0013164420949896 ◽

2020 ◽

pp. 001316442094989

Author(s):

Joseph A. Rios ◽

James Soland

Keyword(s):

Parameter Estimation ◽

Item Response Theory ◽

Item Response ◽

Item Parameter ◽

Estimation Accuracy ◽

Parameter Estimates ◽

Response Theory ◽

Irt Model ◽

Ability Estimates ◽

Ability Parameter

As low-stakes testing contexts increase, low test-taking effort may serve as a serious validity threat. One common solution to this problem is to identify noneffortful responses and treat them as missing during parameter estimation via the effort-moderated item response theory (EM-IRT) model. Although this model has been shown to outperform traditional IRT models (e.g., two-parameter logistic [2PL]) in parameter estimation under simulated conditions, prior research has failed to examine its performance under violations to the model’s assumptions. Therefore, the objective of this simulation study was to examine item and mean ability parameter recovery when violating the assumptions that noneffortful responding occurs randomly (Assumption 1) and is unrelated to the underlying ability of examinees (Assumption 2). Results demonstrated that, across conditions, the EM-IRT model provided robust item parameter estimates to violations of Assumption 1. However, bias values greater than 0.20 SDs were observed for the EM-IRT model when violating Assumption 2; nonetheless, these values were still lower than the 2PL model. In terms of mean ability estimates, model results indicated equal performance between the EM-IRT and 2PL models across conditions. Across both models, mean ability estimates were found to be biased by more than 0.25 SDs when violating Assumption 2. However, our accompanying empirical study suggested that this biasing occurred under extreme conditions that may not be present in some operational settings. Overall, these results suggest that the EM-IRT model provides superior item and equal mean ability parameter estimates in the presence of model violations under realistic conditions when compared with the 2PL model.

Download Full-text