Multilevel Reliability Measures of Latent Scores Within an Item Response Theory Framework

2019, Vol 54 (6), pp. 856-881
Author(s): Sun-Joo Cho, Jianhong Shen, Matthew Naveiras

2010, Vol 7 (2)
Author(s): Alenka Hauptman

In the Slovene General Matura, Mathematics is one of the compulsory subjects and can be taken at either the Basic or the Higher Level of Achievement. Achievement at the Basic Level is expressed on the classic five-grade scale from 1 to 5. Candidates at the Higher Level can receive grades on a scale from 1 to 8. The conversion of points into grades (i.e. summing the points earned on the tests and at the internal examination and then deriving the grade from that sum) is set independently for each Level, and we tried to find out whether the same grade at each Level of Achievement corresponds to the same knowledge. Once grades are assigned, they are used comparatively in selection procedures for admission to university. Both the Basic and the Higher Level in Mathematics include the same Part 1 of the exam. The second part of the exam (Part 2) is taken only by Higher Level candidates. Part 1 amounts to 80% of the total points at the Basic Level and 53.3% of the total points at the Higher Level. Higher Level candidates earn a further 26.7% of points in Part 2. The oral part of the exam accounts for 20% of the grade at both Levels. In this paper we show a discrepancy in knowledge within the same grades between candidates at the Basic and Higher Levels of Achievement, using the Mathematics exam from the General Matura 2008 as an example. The Rasch model, within the Item Response Theory framework, was used to place item difficulties on a common scale, and the comparability of the grade conversion at the Basic and Higher Levels of Achievement was explored. The results show differences in the knowledge of candidates with the same grade at the Basic and Higher Levels of Achievement.
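The idea of placing items on a common scale can be illustrated with a minimal sketch of the Rasch model in its standard logistic form, where the probability of a correct response depends only on the difference between a candidate's ability and the item's difficulty. The item difficulties and the function names below are hypothetical and chosen for illustration only; they are not estimates from the Matura 2008 data.

```python
import math

def rasch_probability(theta: float, b: float) -> float:
    """Probability of a correct response under the Rasch model,
    for ability theta and item difficulty b (both in logits)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def expected_score(theta: float, difficulties: list[float]) -> float:
    """Expected number of correct responses on a set of items."""
    return sum(rasch_probability(theta, b) for b in difficulties)

# Hypothetical difficulties on one common logit scale (not real data).
part1_items = [-1.0, 0.0, 0.5]   # items shared by both Levels
part2_items = [1.0, 1.5]         # items taken only at Higher Level

for theta in (-1.0, 0.0, 1.0):
    print(theta,
          round(expected_score(theta, part1_items), 3),
          round(expected_score(theta, part2_items), 3))
```

Because all items sit on one scale, the expected Part 1 score of a candidate with a given ability can be compared across Levels regardless of which grade conversion was applied.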


2019, Vol 80 (3), pp. 461-475
Author(s): Lianne Ippel, David Magis

In the dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic for evaluating the precision of various ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned, and new ASE formulas were derived from a general asymptotic theory framework. Furthermore, exact standard errors were suggested to better evaluate the precision of ability estimators, especially with short tests, for which the asymptotic framework is invalid. Unfortunately, the accuracy of exact standard errors has so far been assessed only in a very limited setting. The purpose of this article is to perform a global comparison of exact versus asymptotic standard errors (in both their classical and new formulations) for a wide range of common IRT ability estimators and IRT models, with short tests. Results indicate that exact standard errors globally outperform the ASE versions in terms of reduced bias and root mean square error, while the new ASE formulas are also globally less biased than their classical counterparts. The usefulness and practical computation of exact standard errors are discussed further.
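For orientation, the classical ASE of the maximum likelihood ability estimator can be sketched as the inverse square root of the test information. The sketch below assumes a two-parameter logistic (2PL) model; the item parameters and function names are hypothetical, and it covers neither the newer ASE formulas nor the exact standard errors compared in the article.

```python
import math

def p_2pl(theta: float, a: float, b: float) -> float:
    """Probability of a correct response under the 2PL model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def information(theta: float, items: list[tuple[float, float]]) -> float:
    """Test information at theta: sum of a^2 * P * (1 - P) over items."""
    total = 0.0
    for a, b in items:
        p = p_2pl(theta, a, b)
        total += a * a * p * (1.0 - p)
    return total

def classical_ase(theta: float, items: list[tuple[float, float]]) -> float:
    """Classical asymptotic standard error of the ML ability estimate."""
    return 1.0 / math.sqrt(information(theta, items))

# Hypothetical (discrimination, difficulty) pairs, not from the article.
short_test = [(1.0, 0.0)] * 4
long_test = [(1.0, 0.0)] * 40

print(classical_ase(0.0, short_test))
print(classical_ase(0.0, long_test))
```

The short test yields a much larger ASE than the long one, which is the setting where the asymptotic approximation is least reliable.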


2014, Vol 68 (1), pp. 43-64
Author(s): Elasma Milanzi, Geert Molenberghs, Ariel Alonso, Geert Verbeke, Paul De Boeck

2019
Author(s): Piotr Bereznowski, Roman Konarski

This study investigated the efficiency of the threshold used to classify symptoms as present, the efficiency of the cut-off point used to identify individuals potentially addicted to work, the magnitude of the problem of class overlap, and the effects of dichotomizing polytomous items on estimates of the latent trait level. The sample comprised 16,426 working Norwegians (M age = 37.31; SD = 11.36) who filled out the Bergen Work Addiction Scale (BWAS). The results showed that the difficulty/third threshold parameters corresponding to the threshold used to classify symptoms as present were lower than 1.5 for the items corresponding to tolerance and conflict, and higher than or equal to 1.5 for the items corresponding to salience, mood modification, relapse, withdrawal, and problems. The cut-off point used to identify individuals as potentially addicted to work flagged 411 individuals whose estimated latent trait level was lower than 1.5 (31.9% of all individuals classified by the polythetic approach as potentially addicted to work). The problem of class overlap (being classified by the polythetic approach into different classes despite almost the same level of the latent trait) affected 4,686 individuals (28.5% of the whole sample). The dichotomization of polytomous items had a substantial effect on the estimates of the latent trait level. The findings show that the polythetic approach is not efficient in identifying individuals potentially addicted to work and that the prevalence rates of work addiction based on the polythetic approach are not trustworthy.
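How dichotomization can produce class overlap may be easier to see in a small sketch of a polythetic rule. The abstract does not state the scoring values, so the sketch assumes seven items rated 1 to 5, a symptom counted as present at a rating of 4 or higher, and a cut-off of at least four present symptoms; these numbers, the function names, and the two response patterns are assumptions for illustration, and the raw sum score is used only as a crude stand-in for the latent trait level.

```python
SYMPTOM_THRESHOLD = 4   # assumed: rating >= 4 counts as "symptom present"
CUTOFF = 4              # assumed: >= 4 present symptoms means "classified"

def count_symptoms(responses: list[int]) -> int:
    """Dichotomize each polytomous rating and count present symptoms."""
    return sum(1 for r in responses if r >= SYMPTOM_THRESHOLD)

def classified(responses: list[int]) -> bool:
    """Polythetic rule: classified if enough symptoms are present."""
    return count_symptoms(responses) >= CUTOFF

# Two hypothetical respondents with similar overall response levels.
respondent_a = [4, 4, 4, 4, 1, 1, 1]   # sum 19, four symptoms present
respondent_b = [3, 3, 3, 3, 3, 3, 3]   # sum 21, no symptom present

for name, resp in (("A", respondent_a), ("B", respondent_b)):
    print(name, sum(resp), count_symptoms(resp), classified(resp))
```

Respondent B has the higher sum score yet is not classified, while respondent A is, which mirrors the kind of overlap between classes that the study quantifies with latent trait estimates.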

