Using Rasch Analysis to Inform Rating Scale Development

2017 ◽  
Vol 58 (8) ◽  
pp. 922-933 ◽  
Author(s):  
Carol Van Zile-Tamsen


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Mario Cantó-Cerdán ◽  
Pilar Cacho-Martínez ◽  
Francisco Lara-Lacárcel ◽  
Ángel García-Muñoz

To develop the Symptom Questionnaire for Visual Dysfunctions (SQVD) and to perform a psychometric analysis using the Rasch method, obtaining an instrument that detects the presence and frequency of visual symptoms related to any visual dysfunction. A pilot version of 33 items was administered to a sample of 125 patients from an optometric clinic. The Rasch model (Andrich Rating Scale Model) was applied to investigate category probability curves and Andrich thresholds, infit and outfit mean squares, local dependency using Yen’s Q3 statistic, differential item functioning (DIF) for gender and presbyopia, person and item reliability, unidimensionality, targeting, and an ordinal-to-interval conversion table. Category probability curves suggested collapsing one response category. Rasch analysis reduced the questionnaire from 33 to 14 items. The final SQVD showed that the 14 items fit the model, without local dependency and with no significant DIF for gender or presbyopia. Person reliability was satisfactory (0.81). The first contrast of the residuals had an eigenvalue of 1.908, supporting unidimensionality, and targeting was −1.59 logits. Overall, the SQVD is a well-structured tool whose data adequately fit the Rasch model, with adequate psychometric properties, making it a reliable and valid instrument for measuring visual symptoms.
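The category probability curves examined here come from the Andrich Rating Scale Model, in which each category's probability depends on the person measure, the item difficulty, and a shared set of thresholds. A minimal sketch of the computation (the threshold and difficulty values below are illustrative, not those estimated for the SQVD):

```python
import math

def rsm_category_probs(theta, delta, thresholds):
    """Andrich Rating Scale Model: probability of each response category
    for a person with measure theta (logits) on an item with difficulty
    delta, given Andrich thresholds tau_1..tau_m shared across items."""
    # Cumulative sums of (theta - delta - tau_j); category 0 has sum 0.
    sums = [0.0]
    for tau in thresholds:
        sums.append(sums[-1] + (theta - delta - tau))
    exps = [math.exp(s) for s in sums]
    total = sum(exps)
    return [e / total for e in exps]

# Illustrative 3-category item: a person located at the item's difficulty
# is equally likely to pick the lowest or highest category when the
# thresholds are symmetric about zero.
probs = rsm_category_probs(theta=0.0, delta=0.0, thresholds=[-1.2, 1.2])
```

Plotting these probabilities over a range of theta values yields the category probability curves; disordered Andrich thresholds show up as a category whose curve is never the most probable, which is the usual signal to collapse it.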


2021 ◽  
pp. 026553222199405
Author(s):  
Ute Knoch ◽  
Bart Deygers ◽  
Apichat Khamboonruang

Rating scale development in the field of language assessment is often considered in dichotomous ways: it is assumed to be guided either by expert intuition or by performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing research for over a decade. In this paper we refine the dominant model of rating scale development by drawing on a corpus of 36 studies identified in a systematic review. We present a model showing the different sources of the scale construct in the corpus. In the discussion, we argue that rating scale designers, just like test developers more broadly, need to start by determining the purpose of the test, the relevant policies that guide test development and score use, and the intended score use when considering the design choices available to them. These include considering the impact of such sources on the generalizability of the scores, the precision of the post-test predictions that can be made about test takers’ future performances, and scoring reliability. The model’s most important contribution is that it gives rating scale developers a framework to consider before starting scale development and validation activities.


2016 ◽  
Vol 33 (1) ◽  
pp. 74 ◽  
Author(s):  
Alejandro Veas ◽  
Juan Luis Castejón ◽  
Raquel Gilar ◽  
Pablo Miñano

The School Attitude Assessment Survey-Revised (SAAS-R) was developed by McCoach & Siegle (2003b) and validated in Spain by Author (2014) using Classical Test Theory. The objective of the current research is to validate the SAAS-R using multidimensional Rasch analysis. Data were collected from 1398 students attending different high schools. Principal Component Analysis supported the multidimensional structure of the SAAS-R. Item difficulty and person ability were calibrated along the same latent trait scale. Ten items were removed from the scale due to misfit with the Rasch model. Differential Item Functioning analysis revealed no significant differences across gender for the remaining 25 items. The 7-category rating scale structure did not function well, and the goal valuation subscale obtained low reliability values. The multidimensional Rasch model supported the 25-item SAAS-R as measuring five latent factors. The study thus demonstrates the advantages of multidimensional Rasch analysis.


2020 ◽  
pp. 135910532090476
Author(s):  
Natalie Papini ◽  
Minsoo Kang ◽  
Seungho Ryu ◽  
Emily Griese ◽  
Taylor Wingert ◽  
...  

Rasch modeling was used to examine the 25-item Connor-Davidson Resilience Scale in adults (n = 410) in a weight management program. Rasch analysis assessed model-data fit, item difficulty and persons’ resilience levels, an item-person map to evaluate the relative distribution of items and persons, and rating scale function. Four misfitting items were identified and removed. Item difficulty ranged from −1.25 to 1.19 logits (higher logit values indicate more difficult items). Persons’ resilience levels were widely distributed (resilience = 2.27 ± 1.56 logits). Item difficulty levels did not adequately assess higher resilience levels. An improved inventory that measures a wider range of resilient behaviors would improve measurement quality.
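Misfitting items such as the four removed here are usually flagged by infit and outfit mean-square statistics. A minimal sketch for the dichotomous Rasch case (the CD-RISC itself uses a polytomous rating scale, so this is a simplification; the data below are illustrative):

```python
import math

def rasch_prob(theta, b):
    # Probability of endorsing a dichotomous item under the Rasch model.
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def fit_mean_squares(responses, thetas, b):
    """Outfit (unweighted) and infit (information-weighted) mean squares
    for one item. responses: 0/1 per person; thetas: person measures in
    logits; b: item difficulty in logits. Values near 1.0 indicate fit."""
    z2_sum = 0.0        # sum of squared standardized residuals
    sq_resid_sum = 0.0  # sum of squared raw residuals
    var_sum = 0.0       # sum of model variances (information weights)
    for x, theta in zip(responses, thetas):
        p = rasch_prob(theta, b)
        v = p * (1.0 - p)
        z2_sum += (x - p) ** 2 / v
        sq_resid_sum += (x - p) ** 2
        var_sum += v
    outfit = z2_sum / len(responses)
    infit = sq_resid_sum / var_sum
    return outfit, infit

# Illustrative call: two persons at the item's difficulty, one endorsing.
outfit, infit = fit_mean_squares([1, 0], [0.0, 0.0], 0.0)
```

Because infit weights each residual by the model variance, it is dominated by persons located near the item; outfit is more sensitive to unexpected responses from persons far from the item, which is why the two statistics are reported together.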


2009 ◽  
Vol 6 (1) ◽  
pp. 205-205 ◽  
Author(s):  
K EVANS ◽  
K ANDERSON ◽  
B BOROWSKY ◽  
K DUFF ◽  
J GIULIANO ◽  
...  

2017 ◽  
Vol 38 (2) ◽  
pp. 68-76 ◽  
Author(s):  
Kristina Luhr ◽  
Ann Catrine Eldh ◽  
Ulrica Nilsson ◽  
Marie Holmefur

The Patient Preferences for Patient Participation tool (the 4Ps) was developed to aid clinical dialogue and to help patients 1) depict, 2) prioritise, and 3) evaluate patient participation, with 12 pre-set items reiterated in the three sections. An earlier qualitative evaluation of the 4Ps showed promising results. The present study is a psychometric evaluation of the 4Ps in patients with chronic heart or lung disease (n = 108) in primary and outpatient care. Internal scale validity was evaluated using Rasch analysis, and two-week test–retest reliability of the three sections was evaluated using kappa/weighted kappa and a prevalence- and bias-adjusted kappa. The 4Ps tool was found to be reasonably valid, with varied reliability. Proposed amendments are the rephrasing of two items and modifications of the rating scale in Section 2. The 4Ps is suggested for use to increase general knowledge of patient participation, but further studies of its implementation are needed.
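The prevalence- and bias-adjusted kappa (PABAK) used for the test–retest analysis replaces the chance-agreement term of Cohen's kappa with the value expected under uniform marginals, which makes it a simple function of observed agreement. A minimal sketch (the agreement values below are illustrative, not taken from the 4Ps data):

```python
def pabak(p_obs, k=2):
    """Prevalence- and bias-adjusted kappa for two raters (or two
    occasions) and k response categories: PABAK = (k * p_obs - 1) / (k - 1),
    where p_obs is the observed proportion of exact agreement."""
    return (k * p_obs - 1.0) / (k - 1.0)

# Illustrative values: perfect agreement gives 1.0; for a binary item,
# 50% agreement (chance level under uniform marginals) gives 0.0.
perfect = pabak(1.0)
chance = pabak(0.5)
```

PABAK is reported alongside plain kappa because kappa can be paradoxically low when one response category dominates, as often happens with skewed patient-reported items.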


2013 ◽  
Vol 93 (1) ◽  
pp. 60-68 ◽  
Author(s):  
David Walton ◽  
James M. Elliott

Background: Despite increasing clinical and research use of the 11-item version of the Tampa Scale for Kinesiophobia (TSK-11) in people with neck pain, little is known about its measurement properties in this population. Objective: The purpose of this study was to rigorously evaluate the measurement properties of the TSK-11 when used in people with mechanical neck pain. Design: This study was a secondary analysis of 2 independent databases (N=235) of people with mechanical neck pain of primarily traumatic origin. Methods: The TSK-11 was subjected to Rasch analysis and subsequent evaluation of concurrent associations with the Neck Disability Index and a numeric rating scale for pain intensity. Results: The TSK-11 conformed well to the Rasch model for interval-level measurement, but less so for acute or nontraumatic etiologies. A transformation matrix suggested that small changes at the extremes of the scale are more meaningful than those in the middle. Cross-sectional convergent validity testing suggested relationships of the expected magnitude and direction with pain intensity and neck-related disability. The use of the linearly transformed TSK-11 led to potentially important differences in the distribution of data compared with use of the raw scores. Limitations: The sample size was slightly smaller than desired for Rasch analysis. The 2 databases were similar in terms of symptom duration, but differed in pain intensity and age. Conclusions: The TSK-11 can be considered an interval-level measure when used in people with neck pain. It provides potentially important information regarding the nature of neck-related disability. The clinically important difference may not be consistent across the range of the scale.
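A "linearly transformed" score of the kind described here is the Rasch logit measure rescaled back onto the familiar raw-score range (11–44 for the TSK-11, whose 11 items are scored 1–4). A sketch of such a rescaling, with illustrative logit endpoints rather than the published calibration:

```python
def rescale_logits(measure, lo_logit, hi_logit, lo=11.0, hi=44.0):
    """Linearly map a Rasch measure (logits) onto the TSK-11 raw-score
    range 11-44. lo_logit/hi_logit are the measures corresponding to the
    minimum and maximum raw scores; the values used below are
    illustrative, not those from the published transformation matrix."""
    return lo + (measure - lo_logit) * (hi - lo) / (hi_logit - lo_logit)

# Illustrative endpoints of -3 and +3 logits:
low = rescale_logits(-3.0, -3.0, 3.0)   # maps to the scale minimum, 11
high = rescale_logits(3.0, -3.0, 3.0)   # maps to the scale maximum, 44
```

Because the raw-score-to-logit relationship is ogival, equal raw-score steps near the extremes correspond to larger logit intervals than steps in the middle, which is why the abstract notes that small changes at the extremes are more meaningful.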

