An IRT Investigation of the Validity of Non-Patient Analogue Research Using the Beck Depression Inventory

1995 ◽  
Vol 11 (1) ◽  
pp. 14-20 ◽  
Author(s):  
Sean M. Hammond

This paper presents an IRT analysis of the Beck Depression Inventory which was carried out to assess the assumption of an underlying latent trait common to non-clinical and patient samples. A one parameter rating scale model was fitted to data drawn from a patient and non-patient sample. Findings suggest that while the BDI fits the model reasonably well for the two samples separately there is sufficient differential item functioning to raise serious duobts of the viability of using it analogously with patient and non-patient groups.

Author(s):  
Mahmoud AlQuraan ◽  
Ahmed AL Kuwaiti

This study explored academic discipline as a source of differential item functioning(DIF) in students’ rating of teaching quality and effectiveness at higher education institutions. Data utilized in this study was collected by Imam Abudalruman Bin Faisal University - KSA. The total number of surveys analyzed for the purpose of this study is 36459 from three colleges: Education, Health, and Engineering. Using Extended Rasch model (Rating Scale Model), the results show that the instrument contains four DIF items. The content of these four items confirm the possibility of considering discipline as a source of DIF items in students evaluation of teaching in higher education. Moreover, the results of the current study show that removing DIF items from the instrument increases construct validity.


2011 ◽  
Vol 27 (3) ◽  
pp. 164-170 ◽  
Author(s):  
Anna Sundström

This study evaluated the psychometric properties of a self-report scale for assessing perceived driver competence, labeled the Self-Efficacy Scale for Driver Competence (SSDC), using item response theory analyses. Two samples of Swedish driving-license examinees (n = 795; n = 714) completed two versions of the SSDC that were parallel in content. Prior work, using classical test theory analyses, has provided support for the validity and reliability of scores from the SSDC. This study investigated the measurement precision, item hierarchy, and differential functioning for males and females of the items in the SSDC as well as how the rating scale functions. The results confirmed the previous findings; that the SSDC demonstrates sound psychometric properties. In addition, the findings showed that measurement precision could be increased by adding items that tap higher self-efficacy levels. Moreover, the rating scale can be improved by reducing the number of categories or by providing each category with a label.


2021 ◽  
Vol 34 (1) ◽  
Author(s):  
Evandro Morais Peixoto ◽  
Daniela Sacramento Zanini ◽  
Josemberg Moura de Andrade

Abstract Background The Kessler Distress Scale (K10) is a self-report scale for the assessment of non-specific psychological distress in the general and clinical population. Because of its ease of application and good psychometric properties, the K10 has been adapted to several cultures. The present study seeks to adapt the K10 to Brazilian Portuguese and estimate its validity evidence and reliability. Methods A total of 1914 individuals from the general population participated in the study (age = 34.88, SD = 13.61, 77.7% female). The adjustment indices were compared among three different measurement models proposed for the K10 through confirmatory factor analysis (CFA). The items’ properties were analyzed by Andrich’s Rating Scale Model (RSM). Furthermore, evidence based on relations to other variables (depression, stress, anxiety, positive and negative affects, and satisfaction with life) was estimated. Results CFA indicated the adequacy of the bifactor model (CFI= 0.985; TLI= 0.973; SMR= 0.019; RMSEA= 0.050), composed of two specific factors (depression and anxiety) and one general factor (psychological distress), corresponding to the theoretical hypothesis. Additionally, it was observed multiple-group invariance by gender and age range. The RSM provided an understanding of the organization of the continuum represented by the psychological distress construct (items difficulty), which varied from −0.89 to 1.00; good adjustment indexes; infit between 0.67 and 1.32; outfit between 0.68 and 1.34; and desirable reliability, α= 0.87. Lastly, theoretically coherent associations with the external variables were observed. Conclusions It is concluded that the Brazilian version of the K10 is a suitable measure of psychological distress for the Brazilian population.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Mario Cantó-Cerdán ◽  
Pilar Cacho-Martínez ◽  
Francisco Lara-Lacárcel ◽  
Ángel García-Muñoz

AbstractTo develop the Symptom Questionnaire for Visual Dysfunctions (SQVD) and to perform a psychometric analysis using Rasch method to obtain an instrument which allows to detect the presence and frequency of visual symptoms related to any visual dysfunction. A pilot version of 33 items was carried out on a sample of 125 patients from an optometric clinic. Rasch model (using Andrich Rating Scale Model) was applied to investigate the category probability curves and Andrich thresholds, infit and outfit mean square, local dependency using Yen’s Q3 statistic, Differential item functioning (DIF) for gender and presbyopia, person and item reliability, unidimensionality, targeting and ordinal to interval conversion table. Category probability curves suggested to collapse a response category. Rasch analysis reduced the questionnaire from 33 to 14 items. The final SQVD showed that 14 items fit to the model without local dependency and no significant DIF for gender and presbyopia. Person reliability was satisfactory (0.81). The first contrast of the residual was 1.908 eigenvalue, showing unidimensionality and targeting was − 1.59 logits. In general, the SQVD is a well-structured tool which shows that data adequately fit the Rasch model, with adequate psychometric properties, making it a reliable and valid instrument to measure visual symptoms.


2020 ◽  
Vol 80 (4) ◽  
pp. 808-820
Author(s):  
Cindy M. Walker ◽  
Sakine Göçer Şahin

The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared with traditional interrater reliability measures. Three different procedures that can be used as measures of interrater reliability were compared: (1) intraclass correlation coefficient (ICC), (2) Cohen’s kappa statistic, and (3) DIF statistic obtained from Poly-SIBTEST. The results of this investigation indicated that DIF procedures appear to be a promising alternative to assess the interrater reliability of constructed response items, or other polytomous types of items, such as rating scales. Furthermore, using DIF to assess interrater reliability does not require a fully crossed design and allows one to determine if a rater is either more severe, or more lenient, in their scoring of each individual polytomous item on a test or rating scale.


2011 ◽  
Vol 48 (4) ◽  
pp. 441-456 ◽  
Author(s):  
Wen-Chung Wang ◽  
Shiu-Lien Wu

Sign in / Sign up

Export Citation Format

Share Document