An IRT Investigation of the Validity of Non-Patient Analogue Research Using the Beck Depression Inventory

This paper presents an IRT analysis of the Beck Depression Inventory which was carried out to assess the assumption of an underlying latent trait common to non-clinical and patient samples. A one parameter rating scale model was fitted to data drawn from a patient and non-patient sample. Findings suggest that while the BDI fits the model reasonably well for the two samples separately there is sufficient differential item functioning to raise serious duobts of the viability of using it analogously with patient and non-patient groups.

Download Full-text

Differential Item Functioning in Students Rating of Teaching Effectiveness Surveys in Higher Education According to Academic Disciplines: Data from a Saudi University

Journal of Educational and Psychological Studies [JEPS] ◽

10.24200/jeps.vol11iss4pp770-780 ◽

2017 ◽

Vol 11 (4) ◽

pp. 770

Author(s):

Mahmoud AlQuraan ◽

Ahmed AL Kuwaiti

Keyword(s):

Higher Education ◽

Differential Item Functioning ◽

Teaching Effectiveness ◽

Rating Scale ◽

Academic Discipline ◽

Teaching Quality ◽

Academic Disciplines ◽

Scale Model ◽

Item Functioning ◽

Teaching In Higher Education

This study explored academic discipline as a source of differential item functioning(DIF) in students’ rating of teaching quality and effectiveness at higher education institutions. Data utilized in this study was collected by Imam Abudalruman Bin Faisal University - KSA. The total number of surveys analyzed for the purpose of this study is 36459 from three colleges: Education, Health, and Engineering. Using Extended Rasch model (Rating Scale Model), the results show that the instrument contains four DIF items. The content of these four items confirm the possibility of considering discipline as a source of DIF items in students evaluation of teaching in higher education. Moreover, the results of the current study show that removing DIF items from the instrument increases construct validity.

Download Full-text

Using the Rating Scale Model to Examine the Psychometric Properties of the Self-Efficacy Scale for Driver Competence

European Journal of Psychological Assessment ◽

10.1027/1015-5759/a000063 ◽

2011 ◽

Vol 27 (3) ◽

pp. 164-170 ◽

Cited By ~ 1

Author(s):

Anna Sundström

Keyword(s):

Psychometric Properties ◽

Self Efficacy ◽

Rating Scale ◽

Measurement Precision ◽

The Self ◽

Test Theory ◽

Self Report ◽

Scale Model ◽

Validity And Reliability ◽

Two Samples

This study evaluated the psychometric properties of a self-report scale for assessing perceived driver competence, labeled the Self-Efficacy Scale for Driver Competence (SSDC), using item response theory analyses. Two samples of Swedish driving-license examinees (n = 795; n = 714) completed two versions of the SSDC that were parallel in content. Prior work, using classical test theory analyses, has provided support for the validity and reliability of scores from the SSDC. This study investigated the measurement precision, item hierarchy, and differential functioning for males and females of the items in the SSDC as well as how the rating scale functions. The results confirmed the previous findings; that the SSDC demonstrates sound psychometric properties. In addition, the findings showed that measurement precision could be increased by adding items that tap higher self-efficacy levels. Moreover, the rating scale can be improved by reducing the number of categories or by providing each category with a label.

Download Full-text

Construction of the Korean Very Short Form of the ATQ: An Application of Rasch Rating Scale Model

Korean Association For Learner-Centered Curriculum And Instruction ◽

10.22251/jlcci.2018.18.6.937 ◽

2018 ◽

Vol 18 (6) ◽

pp. 937-957

Author(s):

Mina Lee ◽

◽

Hyeweon Byun ◽

Minjeong Kim ◽

◽

...

Keyword(s):

Rating Scale ◽

Short Form ◽

Scale Model ◽

Rasch Rating Scale Model ◽

Rating Scale Model

Download Full-text

Cross-cultural adaptation and psychometric properties of the Kessler Distress Scale (K10): an application of the rating scale model

Psicologia: Reflexão e Crítica ◽

10.1186/s41155-021-00186-9 ◽

2021 ◽

Vol 34 (1) ◽

Author(s):

Evandro Morais Peixoto ◽

Daniela Sacramento Zanini ◽

Josemberg Moura de Andrade

Keyword(s):

Psychological Distress ◽

Psychometric Properties ◽

Rating Scale ◽

Self Report ◽

Clinical Population ◽

Scale Model ◽

General Factor ◽

Measurement Models ◽

Distress Scale ◽

Rating Scale Model

Abstract Background The Kessler Distress Scale (K10) is a self-report scale for the assessment of non-specific psychological distress in the general and clinical population. Because of its ease of application and good psychometric properties, the K10 has been adapted to several cultures. The present study seeks to adapt the K10 to Brazilian Portuguese and estimate its validity evidence and reliability. Methods A total of 1914 individuals from the general population participated in the study (age = 34.88, SD = 13.61, 77.7% female). The adjustment indices were compared among three different measurement models proposed for the K10 through confirmatory factor analysis (CFA). The items’ properties were analyzed by Andrich’s Rating Scale Model (RSM). Furthermore, evidence based on relations to other variables (depression, stress, anxiety, positive and negative affects, and satisfaction with life) was estimated. Results CFA indicated the adequacy of the bifactor model (CFI= 0.985; TLI= 0.973; SMR= 0.019; RMSEA= 0.050), composed of two specific factors (depression and anxiety) and one general factor (psychological distress), corresponding to the theoretical hypothesis. Additionally, it was observed multiple-group invariance by gender and age range. The RSM provided an understanding of the organization of the continuum represented by the psychological distress construct (items difficulty), which varied from −0.89 to 1.00; good adjustment indexes; infit between 0.67 and 1.32; outfit between 0.68 and 1.34; and desirable reliability, α= 0.87. Lastly, theoretically coherent associations with the external variables were observed. Conclusions It is concluded that the Brazilian version of the K10 is a suitable measure of psychological distress for the Brazilian population.

Download Full-text

Rasch analysis for development and reduction of Symptom Questionnaire for Visual Dysfunctions (SQVD)

Scientific Reports ◽

10.1038/s41598-021-94166-9 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Mario Cantó-Cerdán ◽

Pilar Cacho-Martínez ◽

Francisco Lara-Lacárcel ◽

Ángel García-Muñoz

Keyword(s):

Rasch Model ◽

Rasch Analysis ◽

Rating Scale ◽

Scale Model ◽

Local Dependency ◽

Visual Symptoms ◽

Symptom Questionnaire ◽

Item Functioning ◽

Conversion Table ◽

Valid Instrument

AbstractTo develop the Symptom Questionnaire for Visual Dysfunctions (SQVD) and to perform a psychometric analysis using Rasch method to obtain an instrument which allows to detect the presence and frequency of visual symptoms related to any visual dysfunction. A pilot version of 33 items was carried out on a sample of 125 patients from an optometric clinic. Rasch model (using Andrich Rating Scale Model) was applied to investigate the category probability curves and Andrich thresholds, infit and outfit mean square, local dependency using Yen’s Q3 statistic, Differential item functioning (DIF) for gender and presbyopia, person and item reliability, unidimensionality, targeting and ordinal to interval conversion table. Category probability curves suggested to collapse a response category. Rasch analysis reduced the questionnaire from 33 to 14 items. The final SQVD showed that 14 items fit to the model without local dependency and no significant DIF for gender and presbyopia. Person reliability was satisfactory (0.81). The first contrast of the residual was 1.908 eigenvalue, showing unidimensionality and targeting was − 1.59 logits. In general, the SQVD is a well-structured tool which shows that data adequately fit the Rasch model, with adequate psychometric properties, making it a reliable and valid instrument to measure visual symptoms.

Download Full-text

Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items

Educational and Psychological Measurement ◽

10.1177/0013164419899731 ◽

2020 ◽

Vol 80 (4) ◽

pp. 808-820

Author(s):

Cindy M. Walker ◽

Sakine Göçer Şahin

Keyword(s):

Differential Item Functioning ◽

Interrater Reliability ◽

Rating Scales ◽

Rating Scale ◽

Intraclass Correlation ◽

Kappa Statistic ◽

Promising Alternative ◽

Constructed Response ◽

Polytomous Item ◽

Item Functioning

The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared with traditional interrater reliability measures. Three different procedures that can be used as measures of interrater reliability were compared: (1) intraclass correlation coefficient (ICC), (2) Cohen’s kappa statistic, and (3) DIF statistic obtained from Poly-SIBTEST. The results of this investigation indicated that DIF procedures appear to be a promising alternative to assess the interrater reliability of constructed response items, or other polytomous types of items, such as rating scales. Furthermore, using DIF to assess interrater reliability does not require a fully crossed design and allows one to determine if a rater is either more severe, or more lenient, in their scoring of each individual polytomous item on a test or rating scale.

Download Full-text