scholarly journals A Comparison of the Accuracy of Multidimensional IRT equating methods

2021 ◽  
Vol 42 issue 1 ◽  
2001 ◽  
Vol 26 (1) ◽  
pp. 31-50 ◽  
Author(s):  
Haruhiko Ogasawara

The asymptotic standard errors of the estimates of the equated scores by several types of item response theory (IRT) true score equatings are provided. The first group of equatings do not use IRT equating coefficients. The second group of equatings use the IRT equating coefficients given by the moment or characteristic curve methods. The equating designs considered in this article cover those with internal or external common items and the methods with separate or simultaneous estimation of item parameters of associated tests. For the estimates of the asymptotic standard errors of the equated true scores, the method of marginal maximum likelihood estimation is employed for estimation of item parameters.


1983 ◽  
Vol 8 (2) ◽  
pp. 137-156 ◽  
Author(s):  
Nancy S. Petersen ◽  
Linda L. Cook ◽  
Martha L. Stocking

Scale drift for the verbal and mathematical portions of the Scholastic Aptitude Test (SAT) was investigated using linear, equipercentile and item response theory (IRT) equating methods. The linear methods investigated were the Tucker, Levine Equally Reliable and Levine Unequally Reliable models. Three IRT calibration designs were employed. These designs are referred to as (1) concurrent, (2) fixed b’s method, and (3) characteristic curve transformation method. The results of the various equating methods were compared both graphically and analytically. These results indicated that for reasonably parallel tests, linear equating methods perform adequately. However, when tests differ somewhat in content and length, methods based on the three-parameter logistic IRT model lead to greater stability of equating results. Of the conventional equating methods investigated, the Levine Equally Reliable model appears to be the most robust for the type of equating situation used in this study. The IRT method that provided the most stable equating results overall was the concurrent calibration method.


2008 ◽  
Vol 25 (2) ◽  
pp. 187-210 ◽  
Author(s):  
Chisato Saida ◽  
Tamaki Hattori
Keyword(s):  
Post Hoc ◽  

Psychometrika ◽  
1994 ◽  
Vol 59 (2) ◽  
pp. 149-176 ◽  
Author(s):  
Henk Kelderman ◽  
Carl P. M. Rijkes

2018 ◽  
Vol 53 (6) ◽  
pp. 914-924
Author(s):  
Hendrika G. van Lier ◽  
Liseth Siemons ◽  
Mart A.F.J. van der Laar ◽  
Cees A.W. Glas

Sign in / Sign up

Export Citation Format

Share Document