Revisiting rating scale development for rater-mediated language performance assessments: Modelling construct and contextual choices made by scale developers

Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing research for over a decade. In this paper we refine the dominant model of rating scale development by drawing on a corpus of 36 studies identified in a systematic review. We present a model showing the different sources of scale construct in the corpus. In the discussion, we argue that rating scale designers, just like test developers more broadly, need to start by determining the purpose of the test, the relevant policies that guide test development and score use, and the intended score use when considering the design choices available to them. These include considering the impact of such sources on the generalizability of the scores, the precision of the post-test predictions that can be made about test takers’ future performances, and scoring reliability. The most important contribution of the model is that it gives rating scale developers a framework to consider prior to starting scale development and validation activities.

Sensory ataxia rating scale: Development and validation of a functional scale for patients with sensory neuronopathies

Journal of the Peripheral Nervous System ◽

10.1111/jns.12330 ◽

2019 ◽

Vol 24 (3) ◽

pp. 242-246

Author(s):

Alberto R. M. Martinez ◽

Melina P. Martins ◽

Carlos R. Martins ◽

Ingrid Faber ◽

Thiago J. R. Rezende ◽

...

Keyword(s):

Scale Development ◽

Rating Scale ◽

Functional Scale ◽

Scale Development And Validation ◽

Sensory Ataxia ◽

Development And Validation

The behavior intervention rating scale: Development and validation of a pretreatment acceptability and effectiveness measure

Journal of School Psychology ◽

10.1016/0022-4405(91)90014-i ◽

1991 ◽

Vol 29 (1) ◽

pp. 43-51 ◽

Cited By ~ 149

Author(s):

Stephen N. Elliott ◽

Mary Von Brock Treuting

Keyword(s):

Scale Development ◽

Rating Scale ◽

Behavior Intervention ◽

Scale Development And Validation ◽

Effectiveness Measure ◽

Development And Validation

CARE Scale: development and validation of a measure assessing the impact of relationships on self-care in chronic pain

Journal of Pain ◽

10.1016/j.jpain.2012.01.083 ◽

2012 ◽

Vol 13 (4) ◽

pp. S19 ◽

Cited By ~ 1

Author(s):

B. Darnall ◽

A. Wilson ◽

D. Pierce

Keyword(s):

Chronic Pain ◽

Scale Development ◽

Self Care ◽

Scale Development And Validation ◽

Development And Validation ◽

The Impact

Development and validation of a rating scale for summarization as an integrated task

Asian-Pacific Journal of Second and Foreign Language Education ◽

10.1186/s40862-021-00113-6 ◽

2021 ◽

Vol 6 (1) ◽

Author(s):

Jiuliang Li ◽

Qian Wang

Keyword(s):

Academic Success ◽

Scale Development ◽

Large Scale ◽

Rating Scale ◽

Academic Research ◽

Summary Writing ◽

Scale Development And Validation ◽

Significant Relationships ◽

Text Features ◽

Development And Validation

Summary writing is essential for academic success and has attracted renewed interest in academic research and large-scale language tests. However, less attention has been paid to the development and evaluation of scoring scales for summary writing. This study reports on the validation of a summary rubric that represented an approach to scale development with limited resources, adopted out of consideration for practicality. Participants were 83 students and three raters. Diagnostic evaluation of the scale components and categories was based on raters’ perceptions of their use and on the scores of students’ summaries, which were analyzed using multifaceted Rasch measurement (MFRM). Correlation analysis revealed significant relationships among the scoring components, but the coefficients among some of the components were excessively high. MFRM analysis provided evidence in support of the usefulness of the scoring rubric, but also suggested the need for refinement of the components and categories. According to the raters, the rubric was ambiguous in addressing some crucial text features. This study has implications for summarization task design and, in particular, for scoring scale development and validation.

The SCAR (Scar Cosmesis Assessment and Rating) scale: development and validation of a new outcome measure for postoperative scar assessment

British Journal of Dermatology ◽

10.1111/bjd.14812 ◽

2016 ◽

Vol 175 (6) ◽

pp. 1394-1396 ◽

Cited By ~ 8

Author(s):

J. Kantor

Keyword(s):

Scale Development ◽

Rating Scale ◽

Outcome Measure ◽

Scale Development And Validation ◽

Scar Assessment ◽

Development And Validation ◽

Postoperative Scar

The Workplace Arrogance Scale: Development and Validation of a Measure

PsycEXTRA Dataset ◽

10.1037/e518572013-459 ◽

2006 ◽

Author(s):

Aarti Shyamsunder ◽

Stanley B. Silverman

Keyword(s):

Scale Development ◽

Scale Development And Validation ◽

Development And Validation

Diversity Seeking: Scale Development and Validation

PsycEXTRA Dataset ◽

10.1037/e692382011-001 ◽

2011 ◽

Author(s):

Anne M. Brumbaugh ◽

Sonya A. Grier

Keyword(s):

Scale Development ◽

Scale Development And Validation ◽

Development And Validation

Highly Educated Married Korean Women’s Career Persistence Motivation: Scale Development and Validation

THE KOREAN JOURNAL OF COUNSELING AND PSYCHOTHERAPY ◽

10.23844/kjcp.2016.02.28.1.63 ◽

2016 ◽

Vol 28 (1) ◽

pp. 63

Author(s):

Minsun Kim ◽

Young Seok Seo

Keyword(s):

Scale Development ◽

Scale Development And Validation ◽

Highly Educated ◽

Career Persistence ◽

Motivation Scale ◽

Development And Validation

Xenophobia: scale development and validation

Journal of Contemporary African Studies ◽

10.1080/02589001.2020.1853686 ◽

2021 ◽

pp. 1-13

Author(s):

Tosin Tunrayo Olonisakin ◽

Sulaiman Olanrewaju Adebayo

Keyword(s):

Scale Development ◽

Scale Development And Validation ◽

Development And Validation

Death-Related Status Consumption: Scale Development and Validation

OMEGA - Journal of Death and Dying ◽

10.1177/00302228211016223 ◽

2021 ◽

pp. 003022282110162

Author(s):

Hakan Cengiz ◽

Omer Torlak

Keyword(s):

Scale Development ◽

Internal Consistency ◽

Culturally Diverse ◽

Three Dimensions ◽

Status Consumption ◽

Scale Development And Validation ◽

Domain Specific ◽

Development And Validation ◽

The U.S

Although the consumption aspect of death has been widely discussed in the literature, no scale has yet been developed to measure it. This study aims to develop a domain-specific death-related status consumption (DRSC) scale to bridge this gap in the field. Results reveal the following three dimensions of the scale: conspicuousness, planning, and showing respect. In four studies, which collate the views of 1,302 participants, both students and adults, the DRSC demonstrates internal consistency and validity across cultures (Turkey, the U.S., and a culturally diverse sample). The importance of such a scale for the field is discussed.
