Using the Many-Faceted Rasch Model to Evaluate Standard Setting Judgments

2012 · Vol 73 (3) · pp. 386-411
Author(s): Pamela K. Kaliski, Stefanie A. Wind, George Engelhard, Deanna L. Morgan, Barbara S. Plake, ...

2010 · Vol 42 (4) · pp. 944-956
Author(s): Michelangelo Vianello, Egidio Robusto

2008 · Vol 17 (3) · pp. 47-68
Author(s): Jason E. Chapman, Ashli J. Sheidow, Scott W. Henggeler, Colleen A. Halliday-Boykins, Phillippe B. Cunningham

2021
Author(s): Nazdar E. Alkhateeb, Ali Al-Dabbagh, Yaseen Mohammed, Mohammed Ibrahim

Any high-stakes assessment that leads to an important decision requires careful consideration of whether a student passes or fails. Despite the many standard-setting methods implemented in clinical examinations, concerns remain about the reliability of pass/fail decisions in high-stakes assessment, especially clinical assessment. This observational study proposes a defensible pass/fail decision method based on the number of failed competencies. In the study, conducted in Erbil, Iraq, in June 2018, results were obtained for 150 medical students on their final objective structured clinical examination. Cutoff scores and pass/fail decisions were calculated using the modified Angoff, borderline, borderline-regression, and holistic methods. The results were compared with each other, and with a new competency method, using Cohen’s kappa. Rasch analysis was used to assess the consistency of the competency data with Rasch model estimates. The competency method failed 40 students (26.7%), compared with 76 (50.7%), 37 (24.7%), 35 (23.3%), and 13 (8.7%) for the modified Angoff, borderline, borderline-regression, and holistic methods, respectively. The competency method showed adequate fit to the Rasch model (mean outfit and infit statistics of 0.961 and 0.960, respectively). In conclusion, the competency method set a more stringent pass/fail standard than all of the other standard-setting methods except the modified Angoff method. The fit of the competency data to the Rasch model provides evidence for the validity and reliability of its pass/fail decisions.
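
A minimal sketch of the kappa comparison described above, using hypothetical scores, cut values, and variable names (none of these numbers are the study's):

```python
# Compare pass/fail decisions from two standard-setting methods with
# Cohen's kappa. All values below are hypothetical stand-ins.
import numpy as np
from sklearn.metrics import cohen_kappa_score

rng = np.random.default_rng(42)
scores = rng.normal(60, 10, size=150)            # hypothetical OSCE totals

angoff_cut, borderline_cut = 62.0, 55.0          # hypothetical cut scores
pass_angoff = (scores >= angoff_cut).astype(int)
pass_borderline = (scores >= borderline_cut).astype(int)

kappa = cohen_kappa_score(pass_angoff, pass_borderline)
print(f"Cohen's kappa between methods: {kappa:.2f}")
```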


PLoS ONE · 2021 · Vol 16 (11) · e0257871
Author(s): Tabea Feseker, Timo Gnambs, Cordula Artelt

To draw pertinent conclusions about persons with low reading skills, it is essential to use validated standard-setting procedures by which they can be assigned to their appropriate proficiency level. Because no standard-setting procedure is free of weaknesses, external validity studies are indispensable. Traditionally, studies have assessed validity by comparing different judgement-based standard-setting procedures; only a few have used model-based approaches to validate judgement-based ones. The present study addressed this shortcoming by comparing the agreement of cut score placements between a judgement-based approach (the Bookmark procedure) and a model-based one (a constrained mixture Rasch model). Individuals with low reading proficiency were distinguished from those with a functional level of reading proficiency in three independent samples of the German National Educational Panel Study: ninth-grade students (N = 13,897) and adults (Ns = 5,335 and 3,145). The analyses showed quite similar mean cut scores for the two standard-setting procedures in two of the samples, whereas the third sample showed more pronounced differences. Importantly, these findings demonstrate that model-based approaches provide a valid and resource-efficient alternative for external validation, although they can be sensitive to the ability distribution within a sample.
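
As an illustration of the judgement-based side, the Bookmark procedure places the cut at the ability where the bookmarked item is answered correctly with a chosen response probability (commonly RP = 0.67). A minimal sketch under the Rasch model, with a hypothetical item difficulty:

```python
# Invert the Rasch item characteristic curve P(theta) = 1 / (1 + exp(-(theta - b)))
# at P = rp to obtain the cut ability for a bookmarked item of difficulty b.
# The difficulty value below is hypothetical.
import math

def bookmark_cut(b: float, rp: float = 0.67) -> float:
    return b + math.log(rp / (1.0 - rp))

print(f"theta cut: {bookmark_cut(b=0.5):.2f}")   # 0.5 + ln(0.67/0.33) ~= 1.21
```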


2018 · Vol 122 (2) · pp. 748-772
Author(s): Wen-Ta Tseng, Tzi-Ying Su, John-Michael L. Nix

This study applied the many-facet Rasch model to assess learners’ translation ability in an English as a foreign language context. Few attempts have been made in extant research to detect and calibrate rater severity in the domain of translation testing. To fill this gap, the study documented the process of validating a test of Chinese-to-English sentence translation and modeled raters’ scoring propensity, defined by harshness or leniency, expert/novice effects on severity, and concomitant effects on item difficulty. Two hundred twenty-five third-year Taiwanese senior high school students and six educators from tertiary and secondary institutions served as participants. The students’ mean age was 17.80 years (SD = 1.20, range 17–19). The exam consisted of 10 translation items adapted from two entrance exams. The results showed that this subjectively scored performance assessment exhibited robust unidimensionality, reliably measuring translation ability free from unmodeled disturbances. Discrepancies in ratings between novice and expert raters were also identified and modeled by the many-facet Rasch model. Implications for applying the many-facet Rasch model in translation tests at the tertiary level are discussed.
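
For reference, a standard rating-scale formulation of the many-facet Rasch model (following Linacre) gives the log-odds of examinee n receiving category k rather than k-1 on item i from rater j as:

```latex
\log \frac{P_{nijk}}{P_{nij(k-1)}} = \theta_n - \delta_i - \alpha_j - \tau_k
```

where \theta_n is examinee ability, \delta_i is item difficulty, \alpha_j is rater severity, and \tau_k is the threshold for category k. The rater term is what allows harshness or leniency to be calibrated on the same scale as ability and difficulty.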


2017 · Vol 22 (3) · pp. 377-393
Author(s): D. Gregory Springer, Kelly D. Bradley

Prior research indicates mixed findings regarding the consistency of adjudicators’ ratings at large ensemble festivals, yet the results of these festivals strongly affect the perceived success of instrumental music programs and the perceived effectiveness of their directors. In this study, Rasch modeling was used to investigate the potential influence of adjudicators on performance ratings at a live large ensemble festival. Evaluation forms from a junior high school concert band festival adjudicated by a panel of three expert judges were analyzed using the many-facet Rasch model. The analyses revealed several trends. First, the practice of assigning “half points” between adjacent response options on the 5-point rating scale introduced redundancy and measurement noise. Second, adjudicators provided relatively similar ratings for conceptually distinct criteria, which could be evidence of a halo effect. Third, although all judges rated relatively leniently overall, one judge rated more severely than the others. Finally, an exploratory interaction analysis between the judge and band facets indicated the presence of rater-mediated bias. Implications for music researchers and ensemble adjudicators are discussed in the context of ensemble performance evaluations, and a measurement framework that can be applied to other aspects of music performance evaluation is introduced.
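
Rater diagnostics in analyses like this one typically rest on infit and outfit mean-square statistics computed from standardized residuals. A minimal sketch for the dichotomous Rasch case (the polytomous many-facet case follows the same residual logic), with simulated data:

```python
# Infit/outfit mean squares for a dichotomous Rasch model, from simulated data.
# Outfit is the plain mean of squared standardized residuals; infit weights
# each residual by its model variance, so it is less sensitive to outliers.
import numpy as np

def rasch_fit(x, theta, b):
    p = 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))  # expected scores
    w = p * (1.0 - p)                                         # model variances
    outfit = ((x - p) ** 2 / w).mean(axis=0)                  # unweighted, per item
    infit = ((x - p) ** 2).sum(axis=0) / w.sum(axis=0)        # variance-weighted
    return infit, outfit

rng = np.random.default_rng(7)
theta = rng.normal(size=200)                     # simulated person abilities
b = np.linspace(-1.0, 1.0, 5)                    # simulated item difficulties
p = 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))
x = (rng.random((200, 5)) < p).astype(float)     # simulated responses

infit, outfit = rasch_fit(x, theta, b)
print("infit:", np.round(infit, 2), "outfit:", np.round(outfit, 2))
```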


2015 · Vol 43 (2) · pp. 299-316
Author(s): Sonia Ferreira Lopes Toffoli, Dalton Francisco de Andrade, Antonio Cezar Bornia