Methods of Identifying Individual Guessers From Item Response Data

2007 ◽  
Vol 67 (5) ◽  
pp. 745-764 ◽  
Author(s):  
Xiangdong Yang
2020 ◽  
Vol 44 (5) ◽  
pp. 362-375
Author(s):  
Tyler Strachan ◽  
Edward Ip ◽  
Yanyan Fu ◽  
Terry Ackerman ◽  
Shyh-Huei Chen ◽  
...  

As a method to derive a "purified" measure along a dimension of interest from response data that are potentially multidimensional in nature, the projective item response theory (PIRT) approach requires first fitting a multidimensional item response theory (MIRT) model to the data before projecting onto a dimension of interest. This study explores how accurate the PIRT results are when the estimated MIRT model is misspecified. Specifically, we focus on using a (potentially misspecified) two-dimensional (2D) MIRT for projection because of its advantages, including interpretability, identifiability, and computational stability, over higher dimensional models. Two large simulation studies (I and II) were conducted. Both examined whether fitting a 2D-MIRT is sufficient to recover the PIRT parameters when multiple nuisance dimensions exist in the test items; the data were generated under compensatory MIRT models (study I) and bifactor models (study II). Various factors were manipulated, including sample size, test length, latent factor correlation, and number of nuisance dimensions. The results from simulation studies I and II showed that the PIRT was overall robust to a misspecified 2D-MIRT. Smaller third and fourth simulation studies were conducted to evaluate recovery of the PIRT model parameters when the correctly specified higher dimensional MIRT or bifactor model was fitted to the response data. In addition, a real data set was used to illustrate the robustness of PIRT.
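The compensatory MIRT model used as the generating model in study I is conventionally written as a logistic function of a weighted sum of latent traits. A minimal sketch of that standard form (the function name and all parameter values below are illustrative, not taken from the study):

```python
import math

def mirt_prob(theta, a, d):
    """Compensatory multidimensional 2PL: P(correct) = logistic(a . theta + d).

    theta : list of latent trait values (target and nuisance dimensions)
    a     : discrimination (slope) parameters, one per dimension
    d     : scalar intercept (easiness) parameter
    """
    z = sum(ai * ti for ai, ti in zip(a, theta)) + d
    return 1.0 / (1.0 + math.exp(-z))

# A 2D item loading mostly on the target dimension (a[0]) and
# weakly on a nuisance dimension (a[1]); values are made up.
p = mirt_prob(theta=[1.0, 0.5], a=[1.2, 0.4], d=-0.3)
```

The "compensatory" label reflects that a high value on one dimension can offset a low value on another, since only the weighted sum enters the logistic.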


2021 ◽  
Author(s):  
Benjamin Domingue ◽  
Dimiter Dimitrov

A recently developed framework of measurement, referred to as the Delta-scoring (or D-scoring) method (DSM; e.g., Dimitrov 2016, 2018, 2020), is gaining attention in the field of educational measurement and is widely used in large-scale assessments at the National Center for Assessment in Saudi Arabia. The D-scores obtained under the DSM range from 0 to 1 and indicate how much (what proportion) of the ability measured by a test of binary items is demonstrated by the examinee. This study examines whether the D-scale is an interval scale and how D-scores compare to IRT ability scores (thetas) in terms of intervalness, via testing the axioms of additive conjoint measurement (ACM). The testing approach is ConjointChecks (Domingue, 2014), which implements a Bayesian method for evaluating whether the axioms are violated in a given empirical item response data set. The results indicate that D-scores, computed under the DSM, produce fewer violations of the ordering axioms of ACM than do the IRT "theta" scores. The conclusion is that the DSM produces a dependable D-scale in terms of the essential property of intervalness.
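ConjointChecks evaluates the ACM axioms with a Bayesian procedure that is not reproduced here; the deterministic core of an ordering (single-cancellation) check on a table of observed proportions correct can be sketched as follows (the function name and toy data are illustrative assumptions):

```python
def violates_row_ordering(table):
    """Flag violations of the ACM ordering axiom on a matrix of
    proportions correct (rows: items, columns: score groups).

    If row i exceeds row j in one column, it should do so in every
    column; a sign reversal across columns is an axiom violation.
    """
    n_rows, n_cols = len(table), len(table[0])
    for i in range(n_rows):
        for j in range(n_rows):
            signs = set()
            for c in range(n_cols):
                diff = table[i][c] - table[j][c]
                if diff > 0:
                    signs.add(1)
                elif diff < 0:
                    signs.add(-1)
            if signs == {1, -1}:  # rank reversal between rows i and j
                return True
    return False

# Toy data: two items, three score groups (all values invented).
consistent = [[0.2, 0.5, 0.8],
              [0.3, 0.6, 0.9]]  # row ordering preserved in every column
reversed_ = [[0.2, 0.7, 0.8],
             [0.3, 0.6, 0.9]]   # middle column reverses the ordering
```

A score scale closer to interval level should yield fewer such reversals in empirical tables, which is the sense in which the study compares D-scores with IRT thetas.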

