Item Response Theory True Score Equatings and Their Standard Errors

2001 ◽  
Vol 26 (1) ◽  
pp. 31-50 ◽  
Author(s):  
Haruhiko Ogasawara

Asymptotic standard errors are provided for the estimates of equated scores obtained from several types of item response theory (IRT) true score equating. The first group of equatings does not use IRT equating coefficients. The second group uses the IRT equating coefficients given by the moment or characteristic curve methods. The equating designs considered in this article cover those with internal or external common items, as well as methods with separate or simultaneous estimation of the item parameters of the associated tests. To obtain the asymptotic standard errors of the equated true scores, item parameters are estimated by marginal maximum likelihood.
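For readers unfamiliar with the mechanics, IRT true score equating maps a true score on form X to form Y by inverting the test characteristic curve. Below is a minimal sketch assuming a 2PL model and made-up item parameters; the article's standard-error derivations (delta method with MML estimates, and the equating-coefficient variants) are not reproduced here.

```python
import numpy as np
from scipy.optimize import brentq

def p_2pl(theta, a, b):
    """2PL item response function."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def true_score(theta, a, b):
    """IRT true score: expected number-correct score at ability theta."""
    return p_2pl(theta, a, b).sum()

def equate_true_score(t_x, ax, bx, ay, by, lo=-8.0, hi=8.0):
    """Map a true score t_x on form X to the equated true score on form Y.

    Solves tau_X(theta) = t_x for theta, then evaluates tau_Y(theta).
    Under the 2PL, t_x must lie strictly between 0 and the number of items.
    """
    theta_star = brentq(lambda th: true_score(th, ax, bx) - t_x, lo, hi)
    return true_score(theta_star, ay, by)

# Illustrative (made-up) item parameters for two 3-item forms
ax, bx = np.array([1.0, 1.2, 0.8]), np.array([-0.5, 0.0, 0.7])
ay, by = np.array([0.9, 1.1, 1.0]), np.array([-0.3, 0.2, 0.5])
print(equate_true_score(1.8, ax, bx, ay, by))
```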

2017 ◽  
Vol 78 (5) ◽  
pp. 805-825 ◽  
Author(s):  
Dimiter M. Dimitrov

This article presents some new developments in the methodology of an approach to scoring and equating tests with binary items, referred to as delta scoring (D-scoring), which is being piloted with large-scale assessments at the National Center for Assessment in Saudi Arabia. This presentation builds on previous work on delta scoring and adds procedures for scaling and equating, an item response function, and estimation of the true values and standard errors of D scores. Also, unlike previous work on this topic, where D-scoring involved estimates of item and person parameters in the framework of item response theory, the approach presented here does not require item response theory calibration.
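As a rough illustration of the idea of difficulty-weighted scoring without IRT calibration, the sketch below computes a score in [0, 1] by weighting each correct response by a classical item difficulty. This is only one plausible reading of the weighting; the published D-scoring methodology includes scaling, equating, and standard-error procedures not shown here, and its exact formula may differ.

```python
import numpy as np

def d_score(responses, delta):
    """Difficulty-weighted proportion-correct score in [0, 1].

    responses : 0/1 vector of item scores for one examinee
    delta     : illustrative difficulty weights (here taken as the
                proportion of a reference group answering incorrectly)
    """
    responses = np.asarray(responses, dtype=float)
    delta = np.asarray(delta, dtype=float)
    return (delta * responses).sum() / delta.sum()

# Hypothetical 5-item test: harder items carry more weight
delta = np.array([0.2, 0.35, 0.5, 0.65, 0.8])
print(d_score([1, 1, 1, 0, 0], delta))  # easy items correct  -> 0.42
print(d_score([0, 0, 1, 1, 1], delta))  # hard items correct  -> 0.78
```

Note how the two examinees have the same number-correct score but different D scores, which is the point of difficulty weighting.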


Author(s):  
Brian Wesolowski

This chapter presents an introductory overview of concepts that underscore the general framework of item response theory. “Item response theory” is a broad umbrella term for a family of mathematical measurement models that treat observed test scores as a function of latent, unobservable constructs. Most musical constructs cannot be measured directly and are therefore unobservable; they can only be inferred from secondary, observable behaviors. Item response theory defines latent constructs by modeling the probability distribution of observed responses as a logistic function of person and item parameters. This chapter describes philosophical, theoretical, and applied perspectives on item response theory in the context of measuring musical behaviors.
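For concreteness, one standard member of this family (shown here as a representative example, not drawn from the chapter itself) is the two-parameter logistic (2PL) model:

```latex
P(X_{ij} = 1 \mid \theta_j) = \frac{1}{1 + e^{-a_i(\theta_j - b_i)}}
```

where \(\theta_j\) is the latent trait of person \(j\), \(a_i\) is the discrimination of item \(i\), and \(b_i\) is its difficulty: the probability of a correct (or endorsed) response rises logistically as the person's trait exceeds the item's difficulty.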


2019 ◽  
Vol 80 (1) ◽  
pp. 91-125
Author(s):  
Stella Y. Kim ◽  
Won-Chan Lee ◽  
Michael J. Kolen

A theoretical and conceptual framework for true-score equating using a simple-structure multidimensional item response theory (SS-MIRT) model is developed, along with a true-score equating method referred to as the SS-MIRT true-score equating (SMT) procedure. SS-MIRT has several advantages over more complex multidimensional item response theory models, including improved estimation efficiency and straightforward interpretability. The performance of the SMT procedure was examined and evaluated through four studies using different data types. In these studies, results from the SMT procedure were compared with results from four other equating methods to assess the relative benefits of SMT. In general, SMT produced more accurate equating results than traditional unidimensional IRT (UIRT) equating when the data were multidimensional. The more accurate performance of SMT over UIRT true-score equating was observed consistently across the studies, supporting the benefits of a multidimensional approach to equating multidimensional data. SMT also performed similarly to an SS-MIRT observed-score method across all studies.
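The defining feature of simple structure is that each item loads on exactly one dimension, so the true score decomposes cleanly by dimension. Below is a minimal sketch of an SS-MIRT true-score function under that assumption (2PL within each dimension, made-up parameters); the full SMT equating procedure involves additional steps not described in this abstract and not shown here.

```python
import numpy as np

def p_2pl(theta, a, b):
    """2PL item response function."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def ss_mirt_true_score(theta, a, b, dim):
    """True score under a simple-structure MIRT model.

    theta : ability vector, one value per dimension
    a, b  : item discrimination / difficulty arrays
    dim   : index of the single dimension each item loads on
    """
    theta = np.asarray(theta)
    return p_2pl(theta[np.asarray(dim)], np.asarray(a), np.asarray(b)).sum()

# Hypothetical 4-item test, two dimensions: items 0-1 load on dim 0, 2-3 on dim 1
a, b, dim = [1.0, 1.2, 0.9, 1.1], [-0.4, 0.3, 0.0, 0.6], [0, 0, 1, 1]
print(ss_mirt_true_score([0.5, -0.2], a, b, dim))
```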


2019 ◽  
Vol 80 (3) ◽  
pp. 461-475
Author(s):  
Lianne Ippel ◽  
David Magis

In the dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic for evaluating the precision of ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned, and new ASE formulas were derived from a general asymptotic theory framework. Furthermore, exact standard errors have been suggested to better evaluate the precision of ability estimators, especially with short tests, for which the asymptotic framework is invalid. Unfortunately, the accuracy of exact standard errors has so far been assessed only in a very limited setting. The purpose of this article is to perform a global comparison of exact versus asymptotic standard errors (in both classical and new formulations) for a wide range of usual IRT ability estimators and IRT models with short tests. Results indicate that exact standard errors globally outperform the ASE versions in terms of reduced bias and root mean square error, while the new ASE formulas are also globally less biased than their classical counterparts. Further discussion of the usefulness and practical computation of exact standard errors is provided.
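The contrast between the two kinds of standard error is easy to make concrete for a short test. The sketch below, assuming the 2PL model and the maximum likelihood (ML) estimator, computes the classical ASE from the test information and an exact SE by enumerating all 2^n response patterns; it simplifies by clamping the infinite ML estimates of all-correct and all-incorrect patterns to the search bounds, and it does not reproduce the article's new ASE formulas.

```python
import numpy as np
from itertools import product
from scipy.optimize import minimize_scalar

def p_2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def ml_estimate(x, a, b):
    """ML ability estimate for one response pattern.

    Perfect and zero patterns have infinite ML estimates; here they
    are clamped to the search bounds (a simplification).
    """
    def negll(theta):
        p = p_2pl(theta, a, b)
        return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))
    return minimize_scalar(negll, bounds=(-6, 6), method="bounded").x

def asymptotic_se(theta, a, b):
    """Classical ASE: inverse square root of the 2PL test information."""
    p = p_2pl(theta, a, b)
    return 1.0 / np.sqrt(np.sum(a**2 * p * (1 - p)))

def exact_se(theta, a, b):
    """Exact SE at true theta: SD of the estimator over all 2^n patterns."""
    patterns = np.array(list(product([0, 1], repeat=len(a))))
    p = p_2pl(theta, a, b)
    probs = np.prod(np.where(patterns == 1, p, 1 - p), axis=1)
    ests = np.array([ml_estimate(x, a, b) for x in patterns])
    mean = np.sum(probs * ests)
    return np.sqrt(np.sum(probs * (ests - mean) ** 2))

# Hypothetical 4-item test
a = np.array([1.0, 1.3, 0.8, 1.1])
b = np.array([-0.6, 0.0, 0.4, 0.9])
print(asymptotic_se(0.0, a, b), exact_se(0.0, a, b))
```

With only four items the two values differ noticeably, which is exactly the short-test regime the article targets.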


2019 ◽  
Vol 80 (3) ◽  
pp. 578-603
Author(s):  
HyeSun Lee ◽  
Weldon Z. Smith

Based on the framework of testlet models, the current study proposes the Bayesian random block item response theory (BRB IRT) model to fit forced-choice formats in which an item block is composed of three or more items. To account for local dependence among items within a block, the BRB IRT model incorporates a random block effect into the response function and uses a Markov chain Monte Carlo procedure for simultaneous estimation of item and trait parameters. Simulation results demonstrated that the BRB IRT model performed well in estimating item and trait parameters and in screening respondents with relatively low scores on target traits. As found in the literature, the composition of item blocks was crucial for model performance; negatively keyed items were required within item blocks. An empirical application showed that the performance of the BRB IRT model was equivalent to that of the Thurstonian IRT model. The potential of the BRB IRT model as a base for more complex measurement models was also demonstrated by incorporating gender as a covariate to explain response probabilities. Recommendations for adopting forced-choice formats are provided, along with a discussion of using negatively keyed items.
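The abstract does not give the BRB response function explicitly; as a hedged sketch, the testlet-style models it builds on typically augment a logistic item response function with a person-by-block random effect:

```latex
P(X_{ijk} = 1 \mid \theta_j, \gamma_{jk}) = \frac{1}{1 + e^{-a_i(\theta_j - b_i + \gamma_{jk})}},
\qquad \gamma_{jk} \sim N(0, \sigma_k^2)
```

where \(\theta_j\) is the trait of person \(j\), \(a_i\) and \(b_i\) are item parameters, and \(\gamma_{jk}\) is the random effect of person \(j\) in block \(k\); the variance \(\sigma_k^2\) absorbs the within-block local dependence. The BRB IRT model's exact parameterization for forced-choice blocks may differ from this generic form.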

