Point Estimation Methods with Applications to Item Response Theory Models

Author(s):  
F. Bartolucci ◽  
L. Scrucca


1998 ◽
Vol 23 (3) ◽  
pp. 236-243 ◽  
Author(s):  
Eric T. Bradlow ◽  
Neal Thomas

Examinations that permit students to choose a subset of the items are popular despite the potential that students may, as a result of their choices, take examinations of varying difficulty. We provide a set of conditions for the validity of inference for Item Response Theory (IRT) models applied to data collected from choice-based examinations. Valid likelihood and Bayesian inference using standard estimation methods requires (except in extraordinary circumstances) that, after conditioning on the observed item responses, the examinees' choices are independent of both their (potential but unobserved) responses to the omitted items and their latent abilities. These independence assumptions are typical of those required in much more general settings. Common low-dimensional IRT models estimated by standard methods, though potentially useful tools for educational data, do not resolve the difficult problems posed by choice-based data.
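In rough schematic form (the notation here is ours, not the authors'), with $Y_{\mathrm{obs}}$ the responses to chosen items, $Y_{\mathrm{mis}}$ the potential responses to omitted items, $C$ the choice indicators, and $\theta$ the latent ability, the required condition reads

$$ P(C \mid Y_{\mathrm{obs}}, Y_{\mathrm{mis}}, \theta) = P(C \mid Y_{\mathrm{obs}}), $$

that is, once the observed responses are conditioned on, the choices carry no further information about the unobserved responses or the latent ability.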


2021 ◽  
Author(s):  
Kazuhiro Yamaguchi

This research reviewed recent developments in parameter estimation methods for item response theory models. Various new methods were introduced to manage the computational burden of item factor analysis and multidimensional item response models, which involve high-dimensional factors. Monte Carlo integration methods, approximation methods for the marginal likelihood, new optimization methods, and techniques from the machine learning field were employed in these estimation methods. Theoretically, a new type of asymptotic setting, in which both the sample size and the number of items tend to infinity, was considered. Several methods were classified as falling outside the traditional maximum likelihood and Bayesian frameworks. Theoretical developments in interval estimation methods for individual latent traits were also covered, and these methods provided highly accurate intervals.
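As an illustration of one ingredient common to several of the reviewed methods, the following Python sketch approximates the marginal log-likelihood of a two-parameter logistic (2PL) model by plain Monte Carlo integration over the latent trait; it is our own minimal example, not code from the review, and all names in it are ours.

import numpy as np

def mc_marginal_loglik(responses, a, b, n_draws=2000, seed=0):
    # Monte Carlo approximation of the 2PL marginal log-likelihood.
    # responses: (n_persons, n_items) array of 0/1 answers
    # a, b: (n_items,) discriminations and difficulties
    # The latent trait is integrated out against a standard normal
    # prior by averaging over random draws.
    rng = np.random.default_rng(seed)
    theta = rng.standard_normal(n_draws)           # theta ~ N(0, 1)
    logits = a * (theta[:, None] - b)              # (n_draws, n_items)
    p = 1.0 / (1.0 + np.exp(-logits))              # P(correct | theta)
    total = 0.0
    for y in responses:
        # likelihood of this person's response pattern at each draw
        pattern = np.prod(np.where(y == 1, p, 1.0 - p), axis=1)
        total += np.log(pattern.mean())            # average over draws
    return total

The reviewed methods refine this basic integral in various ways (importance sampling, stochastic approximation, variational bounds); the sketch only shows the quantity being approximated.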


2014 ◽  
Vol 22 (2) ◽  
pp. 323-341 ◽  
Author(s):  
Dheeraj Raju ◽  
Xiaogang Su ◽  
Patricia A. Patrician

Background and Purpose: The purpose of this article is to introduce different types of item response theory models and to demonstrate their usefulness by evaluating the Practice Environment Scale. Methods: Item response theory models such as the constrained and unconstrained graded response models, the partial credit model, the Rasch model, and the one-parameter logistic model are demonstrated. The Akaike information criterion (AIC) and Bayesian information criterion (BIC) indices are used as model selection criteria. Results: The unconstrained graded response and partial credit models indicated the best fit for the data. Almost all items in the instrument performed well. Conclusions: Although most of the items strongly measure the construct, a few items could be eliminated without substantially altering the instrument. The analysis revealed that the instrument may function differently when administered to different unit types.
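For readers unfamiliar with the selection step, AIC and BIC penalize the fitted log-likelihood by model complexity, and the model with the smaller value is preferred. A generic Python sketch of the comparison follows; the log-likelihoods and parameter counts are invented placeholders, not results from this article.

import math

def aic(loglik, n_params):
    # Akaike information criterion: smaller is better
    return -2.0 * loglik + 2.0 * n_params

def bic(loglik, n_params, n_obs):
    # Bayesian information criterion: penalizes extra parameters
    # more heavily than AIC whenever log(n_obs) > 2
    return -2.0 * loglik + n_params * math.log(n_obs)

# (name, fitted log-likelihood, free parameters) -- placeholder values
fits = [("Rasch", -5210.4, 32),
        ("graded response, constrained", -5170.2, 33),
        ("graded response, unconstrained", -5101.7, 63)]
n_obs = 500
for name, loglik, k in fits:
    print(f"{name:32s} AIC {aic(loglik, k):9.1f}  BIC {bic(loglik, k, n_obs):9.1f}")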


2017 ◽  
Vol 78 (3) ◽  
pp. 517-529 ◽  
Author(s):  
Yong Luo

Mplus is a powerful latent variable modeling software program that has become an increasingly popular choice for fitting complex item response theory models. In this short note, we demonstrate that the two-parameter logistic testlet model can be estimated as a constrained bifactor model in Mplus with three estimators encompassing limited- and full-information estimation methods.
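The equivalence rests on a loading constraint. In schematic notation of our own (not the note's), the 2PL testlet model for person $i$ answering item $j$ in testlet $d(j)$,

$$ \operatorname{logit} P(y_{ij}=1) = a_j\bigl(\theta_i - b_j + \gamma_{i,d(j)}\bigr), $$

is a bifactor model

$$ \operatorname{logit} P(y_{ij}=1) = a_j\theta_i + a_j^{(s)}\gamma_{i,d(j)} - a_j b_j $$

in which each item's loading on its testlet-specific factor is constrained to equal its general-factor loading, $a_j^{(s)} = a_j$, while the testlet factor variances remain free.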


2017 ◽  
Vol 6 (4) ◽  
pp. 113 ◽
Author(s):  
Esin Yilmaz Kogar ◽  
Hülya Kelecioglu

The purpose of this research is first to estimate the item and ability parameters, and the standard errors of those parameters, obtained from unidimensional Item Response Theory (UIRT), bifactor (BIF), and Testlet Response Theory (TRT) models in tests containing testlets, as the number of testlets, the number of independent items, and the sample size change, and then to compare the results. The mathematics test in PISA 2012 was employed as the data collection tool, and 36 items were used to constitute six data sets containing different numbers of testlets and independent items. From these data sets, three sample sizes of 250, 500, and 1,000 persons were then selected randomly. Examination of the findings showed that the lowest mean error values were generally those obtained from UIRT, and that TRT yielded a lower mean estimation error than BIF. It was found that, under all conditions, the models that take local dependence into consideration provided better model-data fit than UIRT; that there is generally no meaningful difference between BIF and TRT; and that both models can be used for these data sets. When there is a meaningful difference between the two models, BIF generally yields the better result. In addition, in each sample size and data set, the correlations among the models' item and ability parameter estimates, and among the errors of those parameters, were generally high.
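For orientation, the three models differ in how they treat the testlet effect $\gamma_{i,d(j)}$ for item $j$ in testlet $d(j)$; schematically (our notation, not the authors'):

$$ \text{UIRT:}\quad \operatorname{logit} P(y_{ij}=1) = a_j\theta_i - b_j \qquad \text{(local dependence ignored)} $$
$$ \text{BIF:}\quad \operatorname{logit} P(y_{ij}=1) = a_j\theta_i + s_j\gamma_{i,d(j)} - b_j \qquad \text{(specific loadings } s_j \text{ free)} $$
$$ \text{TRT:}\quad \operatorname{logit} P(y_{ij}=1) = a_j\bigl(\theta_i + \gamma_{i,d(j)}\bigr) - b_j \qquad \text{(i.e., } s_j = a_j\text{)} $$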

