Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions

Educational assessments tests are often constructed using testlets because of the flexibility to test various aspects of the cognitive activities and broad content sampling. However, the violation of the local item independence assumption is inevitable when tests are built using testlet items. In this study, simulations are conducted to evaluate the performance of item response theory models and testlet response theory models for both the dichotomous and polytomous items in the context of equating tests composed of testlets. We also examine the impact of testlet effect, length of testlet items, and sample size on estimating item and person parameters. The results show that more accurate performance of testlet response theory models over item response theory models was consistently observed across the studies, which supports the benefits of using the testlet response theory models in equating for tests composed of testlets. Further, results of the study indicate that when sample size is large, item response theory models performed similarly to testlet response theory models across all studies.

Download Full-text

Use of Restricted Item Response Theory Models for Examining the Stability of Item Parameter Estimates Over Time

Applied Measurement in Education ◽

10.1207/s15324818ame0402_3 ◽

1991 ◽

Vol 4 (2) ◽

pp. 125-141 ◽

Cited By ~ 9

Author(s):

Clement A. Stone ◽

Suzanne Lane

Keyword(s):

Item Response Theory ◽

Item Response ◽

Item Parameter ◽

Parameter Estimates ◽

Response Theory ◽

Item Parameter Estimates ◽

The Stability ◽

Item Response Theory Models ◽

Over Time

Download Full-text

Target Rotations and Assessing the Impact of Model Violations on the Parameters of Unidimensional Item Response Theory Models

Educational and Psychological Measurement ◽

10.1177/0013164410378690 ◽

2011 ◽

Vol 71 (4) ◽

pp. 684-711 ◽

Cited By ~ 39

Author(s):

Steven Reise ◽

Tyler Moore ◽

Alberto Maydeu-Olivares

Keyword(s):

Item Response Theory ◽

Item Response ◽

Response Theory ◽

The Impact ◽

Item Response Theory Models

Download Full-text

The Impact of Ignoring Multilevel Data Structure on the Estimation of Dichotomous Item Response Theory Models

International Journal of Assessment Tools in Education ◽

10.21449/ijate.523586 ◽

2019 ◽

Vol 6 (1 (pre-print issue)) ◽

pp. 92-108

Author(s):

Hyung Rock Lee ◽

Sunbok Lee ◽

Jaeyun Sung

Keyword(s):

Data Structure ◽

Item Response Theory ◽

Item Response ◽

Multilevel Data ◽

Response Theory ◽

The Impact ◽

Item Response Theory Models ◽

Dichotomous Item

Download Full-text

Different approaches to modeling response styles in divide-by-total item response theory models (part 1): A model integration.

Psychological Methods ◽

10.1037/met0000249 ◽

2020 ◽

Vol 25 (5) ◽

pp. 560-576

Author(s):

Mirka Henninger ◽

Thorsten Meiser

Keyword(s):

Item Response Theory ◽

Item Response ◽

Response Styles ◽

Model Integration ◽

Response Theory ◽

Item Response Theory Models ◽

Total Item

Download Full-text

Using Item Response Theory Models to Evaluate the Practice Environment Scale

Journal of Nursing Measurement ◽

10.1891/1061-3749.22.2.323 ◽

2014 ◽

Vol 22 (2) ◽

pp. 323-341 ◽

Cited By ~ 6

Author(s):

Dheeraj Raju ◽

Xiaogang Su ◽

Patricia A. Patrician

Keyword(s):

Item Response Theory ◽

Item Response ◽

Information Criterion ◽

Partial Credit Model ◽

Practice Environment ◽

Partial Credit ◽

Response Theory ◽

Environment Scale ◽

Graded Response ◽

Item Response Theory Models

Background and Purpose: The purpose of this article is to introduce different types of item response theory models and to demonstrate their usefulness by evaluating the Practice Environment Scale. Methods: Item response theory models such as constrained and unconstrained graded response model, partial credit model, Rasch model, and one-parameter logistic model are demonstrated. The Akaike information criterion (AIC) and Bayesian information criterion (BIC) indices are used as model selection criterion. Results: The unconstrained graded response and partial credit models indicated the best fit for the data. Almost all items in the instrument performed well. Conclusions: Although most of the items strongly measure the construct, there are a few items that could be eliminated without substantially altering the instrument. The analysis revealed that the instrument may function differently when administered to different unit types.

Download Full-text