Multilevel Reliability Measures of Latent Scores Within an Item Response Theory Framework

In the Slovene General Matura, Mathematics is a compulsory subject that can be taken at either the Basic or the Higher Level of Achievement. Grades at the Basic Level are expressed on the classic five-grade scale from 1 to 5, while candidates at the Higher Level receive grades on a scale from 1 to 8. The conversion of points into grades (i.e. summing the points from the tests and the internal examination and deriving the grade from that total) is set independently for each Level, and we examined whether the same grade at each Level of Achievement corresponds to the same knowledge. Once grades are assigned, they are used comparatively in selection procedures for university admission. Both the Basic and the Higher Level in Mathematics include the same Part 1 of the exam; the second part (Part 2) is taken only by Higher Level candidates. Part 1 accounts for 80% of the total points at the Basic Level and 53.3% of the total points at the Higher Level, where candidates earn a further 26.7% of the points in Part 2. The oral part of the exam represents 20% of the grade at both Levels. In this paper we show a discrepancy in knowledge between candidates with the same grade at the Basic and Higher Levels of Achievement, using the Mathematics exam from the General Matura 2008 as an example. The Rasch model within the Item Response Theory framework was used to place item difficulties on a common scale, and the comparability of the grade conversion at the Basic and Higher Levels of Achievement was explored. The results show interesting differences in knowledge between candidates with the same grade at the Basic and Higher Levels of Achievement.
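As a rough illustration of what placing candidates and items on a common scale means under the Rasch model, the sketch below computes the Rasch response probability and a maximum-likelihood ability estimate for a candidate who answered a set of shared items. The item difficulties and response patterns are invented for illustration and are not taken from the Matura data; the function names `rasch_prob` and `ml_ability` are hypothetical, and the Newton-Raphson routine is only one simple way to obtain the estimate.

```python
import math


def rasch_prob(theta, b):
    """Rasch model: probability of a correct response for ability theta and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def ml_ability(responses, difficulties, iters=50):
    """Maximum-likelihood ability estimate via Newton-Raphson (illustrative sketch).

    responses: list of 0/1 item scores; difficulties: known item difficulties on the common scale.
    """
    raw = sum(responses)
    if raw == 0 or raw == len(responses):
        # The ML estimate is not finite for all-wrong or all-correct patterns.
        raise ValueError("ML estimate is undefined for zero or perfect scores")
    theta = 0.0
    for _ in range(iters):
        probs = [rasch_prob(theta, b) for b in difficulties]
        gradient = sum(x - p for x, p in zip(responses, probs))
        information = sum(p * (1.0 - p) for p in probs)
        theta += gradient / information
    return theta


# Hypothetical difficulties of three shared (Part 1 style) items; values are made up.
difficulties = [-1.0, 0.0, 1.0]

# Two candidates with the same raw score but different response patterns.
theta_a = ml_ability([1, 1, 0], difficulties)
theta_b = ml_ability([0, 1, 1], difficulties)
print(theta_a, theta_b)
```

Under the Rasch model the raw score on a fixed item set is a sufficient statistic for ability, so both candidates receive the same estimate. That property is what allows abilities from different groups to be compared once the shared items have been calibrated on one scale.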


Efficient Standard Errors in Item Response Theory Models for Short Tests

Educational and Psychological Measurement ◽ 10.1177/0013164419882072 ◽ 2019 ◽ Vol 80 (3) ◽ pp. 461-475

Author(s): Lianne Ippel ◽ David Magis

Keyword(s): Item Response Theory ◽ Item Response ◽ Standard Errors ◽ Response Theory ◽ Asymptotic Standard Error ◽ Irt Models ◽ Global Comparison ◽ Wide Range ◽ Item Response Theory Models ◽ Theory Framework

In the dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic for evaluating the precision of various ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned, and new ASE formulas were derived from a general asymptotic theory framework. Furthermore, exact standard errors were suggested to better evaluate the precision of ability estimators, especially with short tests, for which the asymptotic framework is invalid. Unfortunately, the accuracy of exact standard errors has so far been assessed only in a very limited setting. The purpose of this article is to perform a global comparison of exact versus asymptotic standard errors (in both their classical and new formulations) for a wide range of usual IRT ability estimators and IRT models, and with short tests. Results indicate that exact standard errors globally outperform the ASE versions in terms of reduced bias and root mean square error, while the new ASE formulas are also globally less biased than their classical counterparts. Further discussion of the usefulness and practical computation of exact standard errors is outlined.


Reliability measures in item response theory: Manifest versus latent correlation functions

British Journal of Mathematical and Statistical Psychology ◽ 10.1111/bmsp.12033 ◽ 2014 ◽ Vol 68 (1) ◽ pp. 43-64 ◽ Cited By ~ 7

Author(s): Elasma Milanzi ◽ Geert Molenberghs ◽ Ariel Alonso ◽ Geert Verbeke ◽ Paul De Boeck

Keyword(s): Item Response Theory ◽ Item Response ◽ Correlation Functions ◽ Response Theory ◽ Reliability Measures


Psychometric Analysis of the Behavior Problems Inventory Using an Item-Response Theory Framework: A Sample of Individuals with Intellectual Disabilities

Journal of Psychopathology and Behavioral Assessment ◽ 10.1007/s10862-013-9356-3 ◽ 2013 ◽ Vol 35 (4) ◽ pp. 564-577 ◽ Cited By ~ 2

Author(s): Lucy Barnard-Brak ◽ Johannes Rojahn ◽ Tianlan Wei

Keyword(s): Item Response Theory ◽ Behavior Problems ◽ Intellectual Disabilities ◽ Item Response ◽ Psychometric Analysis ◽ Response Theory ◽ Theory Framework ◽ Individuals With Intellectual Disabilities


Comparison of Reliability Measures Under Factor Analysis and Item Response Theory

Educational and Psychological Measurement ◽ 10.1177/0013164411407315 ◽ 2011 ◽ Vol 72 (1) ◽ pp. 52-67 ◽ Cited By ~ 23

Author(s): Ying Cheng ◽ Ke-Hai Yuan ◽ Cheng Liu

Keyword(s): Factor Analysis ◽ Item Response Theory ◽ Item Response ◽ Response Theory ◽ Reliability Measures


Is the Polythetic Approach Efficient in Identifying Potentially Addicted to Work Individuals? Comparison of the Polythetic Approach With the Item Response Theory Framework

10.31234/osf.io/xvnuc ◽ 2019

Author(s): Piotr Bereznowski ◽ Roman Konarski

Keyword(s): Item Response Theory ◽ Item Response ◽ Latent Trait ◽ Substantial Effect ◽ Prevalence Rates ◽ Response Theory ◽ Polytomous Items ◽ Work Addiction ◽ Trait Level ◽ Theory Framework

This study investigated the efficiency of the threshold used to classify symptoms as present, the efficiency of the cut-off point used to identify individuals potentially addicted to work, the magnitude of the class-overlap problem, and the effects of dichotomizing polytomous items on estimates of the latent trait level. The sample comprised 16,426 working Norwegians (M age = 37.31; SD = 11.36) who filled out the Bergen Work Addiction Scale (BWAS). The results showed that the difficulty/third threshold parameters corresponding to the threshold used to classify symptoms as present were lower than 1.5 for the items corresponding to tolerance and conflict, and higher than or equal to 1.5 for the items corresponding to salience, mood modification, relapse, withdrawal, and problems. The cut-off point classified as potentially addicted to work 411 individuals whose estimated latent trait level was lower than 1.5 (31.9% of all individuals classified by the polythetic approach as potentially addicted to work). The problem of class overlap (being classified by the polythetic approach into different classes despite almost the same level of the latent trait) affected 4,686 individuals (28.5% of the whole sample). The dichotomization of polytomous items had a substantial effect on the estimates of the latent trait level. The findings show that the polythetic approach is not efficient in identifying individuals potentially addicted to work and that prevalence rates of work addiction based on the polythetic approach are not trustworthy.
