Item Response Theory in Affective Instrument Development: An Illustration

2001 ◽  
Vol 9 (1) ◽  
pp. 5-22 ◽  
Author(s):  
Cheryl T. Beck ◽  
Robert K. Gable

The benefits of item response theory (IRT) analysis in obtaining empirical support for construct validity make it an essential step in the instrument development process. IRT analysis can result in finer construct interpretations that lead to more thorough descriptions of low- and high-scoring respondents. A critical function of IRT is its ability to determine the adequacy with which the attitude continuum underlying each dimension is assessed by the respective items in an instrument. Many nurse researchers, however, are not reaping the benefits of IRT in the development of affective instruments. The purpose of this article is to familiarize nurse researchers with this valuable approach through a description of the Facets computer program. Facets uses a one-parameter (i.e., item difficulty) Rasch measurement model. Data from a survey of 525 new mothers that assessed the psychometric properties of the Postpartum Depression Screening Scale are used to illustrate the Facets program. It is hoped that IRT will gain increased prominence in affective instrument development as more nurse researchers become aware of computer programs such as Facets to assist in analysis.
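
As a rough illustration of the one-parameter idea mentioned above, the sketch below computes the endorsement probability under a dichotomous Rasch model. It is not Facets code and does not reproduce the article's analysis: the function name `rasch_probability` and the example values are assumptions chosen for illustration, and a rating-scale instrument would need a polytomous extension of this model.

```python
import math


def rasch_probability(theta: float, difficulty: float) -> float:
    """Probability that a person at trait level `theta` endorses an item
    with the given `difficulty`, under the dichotomous Rasch model.

    Both arguments are on the same logit scale. The only item parameter
    is difficulty, which is what "one-parameter" refers to.
    """
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))


if __name__ == "__main__":
    # Hypothetical item located at 0 logits; three hypothetical respondents.
    for theta in (-2.0, 0.0, 2.0):
        p = rasch_probability(theta, difficulty=0.0)
        print(f"theta={theta:+.1f}  P(endorse)={p:.3f}")
```

When a respondent's trait level equals the item difficulty, the probability is exactly 0.5, which is why item difficulties can be read as locations on the same continuum as the respondents.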

2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Yunsoo Lee ◽  
Ji Hoon Song ◽  
Soo Jung Kim

Purpose: This paper aims to validate the Korean version of the decent work scale and examine the relationship between decent work and work engagement. Design/methodology/approach: After completing translation and back translation, the authors surveyed 266 Korean employees from various organizations via network sampling. They assessed Rasch’s model based on item response theory. In addition, they used classical test theory to evaluate the decent work scale’s validity and reliability. Findings: The authors found that the current version of the decent work scale has good validity, reliability and item difficulty, and decent work has a positive relationship with work engagement. However, based on item response theory, the assessment showed that three of the items are extremely similar to another item within the same dimension, implying that the items are unable to discriminate among individual traits. Originality/value: This study validated the decent work scale in a Korean work environment using Rasch’s (1960) model from the perspective of item response theory.


Politics ◽  
2019 ◽  
Vol 40 (1) ◽  
pp. 3-21 ◽  
Author(s):  
Steven M Van Hauwaert ◽  
Christian H Schimpf ◽  
Flavio Azevedo

Recent research in the populism literature has devoted considerable effort to the conceptualisation and examination of populism on the individual level, that is, populist attitudes. Despite rapid progress in the field, studies of adequate measurement and empirical evaluation of measures of populist attitudes remain scarce. Seeking to remedy these shortcomings, we apply a cross-national measurement model, using item response theory, to six established and two new populist indicators. Drawing on a cross-national survey (nine European countries, n = 18,368), we engage in a four-fold analysis. First, we examine the commonly used 6-item populism scale. Second, we expand the measurement with two novel items. Third, we use the improved 8-item populism scale to further refine equally comprehensive but more concise and parsimonious populist measurements. Finally, we externally validate these sub-scales and find that some of the proposed sub-scales outperform the initial 6- and 8-item scales. We conclude that existing measures of populism capture moderate populist attitudes, but face difficulties measuring more extreme levels, while the individual information of some of the populist items remains limited. Altogether, this provides several interesting routes for future research, both within and between countries.


2015 ◽  
Vol 58 (3) ◽  
pp. 865-877 ◽  
Author(s):  
Gerasimos Fergadiotis ◽  
Stacey Kellough ◽  
William D. Hula

Purpose: In this study, we investigated the fit of the Philadelphia Naming Test (PNT; Roach, Schwartz, Martin, Grewal, & Brecher, 1996) to an item-response-theory measurement model, estimated the precision of the resulting scores and item parameters, and provided a theoretical rationale for the interpretation of PNT overall scores by relating explanatory variables to item difficulty. This article describes the statistical model underlying the computer adaptive PNT presented in a companion article (Hula, Kellough, & Fergadiotis, 2015). Method: Using archival data, we evaluated the fit of the PNT to 1- and 2-parameter logistic models and examined the precision of the resulting parameter estimates. We regressed the item difficulty estimates on three predictor variables: word length, age of acquisition, and contextual diversity. Results: The 2-parameter logistic model demonstrated marginally better fit, but the fit of the 1-parameter logistic model was adequate. Precision was excellent for both person ability and item difficulty estimates. Word length, age of acquisition, and contextual diversity all independently contributed to variance in item difficulty. Conclusions: Item-response-theory methods can be productively used to analyze and quantify anomia severity in aphasia. Regression of item difficulty on lexical variables supported the validity of the PNT and interpretation of anomia severity scores in the context of current word-finding models.
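
For orientation, the 1- and 2-parameter logistic models compared above are conventionally written as follows. The notation is an assumption of this sketch rather than the article's own: \theta_j is the ability of person j, b_i the difficulty of item i, a_i the item's discrimination, and a a single discrimination shared by all items in the 1-parameter case. The last line is a generic linear regression of item difficulty on the three named predictors, with hypothetical coefficient names; the article's exact specification may differ.

```latex
\begin{align}
\text{2PL:}\quad P(X_{ij}=1 \mid \theta_j)
  &= \frac{\exp\{a_i(\theta_j - b_i)\}}{1 + \exp\{a_i(\theta_j - b_i)\}} \\
\text{1PL:}\quad P(X_{ij}=1 \mid \theta_j)
  &= \frac{\exp\{a(\theta_j - b_i)\}}{1 + \exp\{a(\theta_j - b_i)\}} \\
b_i &= \beta_0 + \beta_1\,\mathrm{length}_i + \beta_2\,\mathrm{AoA}_i
      + \beta_3\,\mathrm{CD}_i + \varepsilon_i
\end{align}
```

The 1-parameter model is the special case of the 2-parameter model in which every item has the same discrimination, so the two can be compared as nested models.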


2016 ◽  
Vol 44 (2) ◽  
pp. 226-236
Author(s):  
Joseph R. Miles ◽  
Brent Mallinckrodt ◽  
Daniela A. Recabarren

2021 ◽  
Vol 9 ◽  
Author(s):  
Ron D. Hays ◽  
David Hubble ◽  
Frank Jenkins ◽  
Alexa Fraser ◽  
Beryl Carew

The National Children's Study (NCS) statistics and item response theory group was tasked with promoting the quality of study measures and analysis. This paper provides an overview of six measurement and statistical considerations for the NCS: (1) Conceptual and Measurement Model; (2) Reliability; (3) Validity; (4) Measurement Invariance; (5) Interpretability of Scores; and (6) Burden of Administration. The guidance was based primarily on recommendations of the International Society of Quality of Life Research.


2019 ◽  
Vol 9 (2) ◽  
pp. 133-146
Author(s):  
Yance Manoppo ◽  
Djemari Mardapi

This study aimed to reveal: (1) the characteristics of the items of the Chemistry Test in the National Examination, using classical test theory and item response theory; (2) the amount of cheating that occurred, using Angoff's B-index Method, Pair 1 Method, Pair 2 Method, Modified Error Similarity Analysis (MESA) Method, and G2 Method; and (3) the methods that detect more cheating in the implementation of the Chemistry Test in the National Examination for high schools in the year 2011/2012 in Maluku Province. The results of the analysis with the classical test theory approach show that 77.5% of items have well-functioning item difficulty, 55% of items have discrimination yet qualified, and 70% of items have distractors that work well, with a test reliability index of 0.772. The analysis using the item response theory approach shows that 14 (35%) items fit the model, the maximum of the information function is 11.4069 at θ = -1.6, and the magnitude of the error of measurement is 2.296. The number of pairs suspected of cheating is as follows: 13 pairs according to Angoff's B-index Method, 212 pairs according to Pair 1 Method, 444 pairs according to Pair 2 Method, 7 pairs according to MESA Method, and 102 pairs according to G2 Method. Ranked from most to fewest pairs detected, the methods are Pair 2, Pair 1, G2, Angoff's B-index, and MESA.

