Effects of Item Characteristics on Multiple-Choice Item Difficulty

1984 ◽  
Vol 44 (3) ◽  
pp. 551-561 ◽  
Author(s):  
Kathy Green


Author(s):  
Bettina Hagenmüller

Abstract. The multiple-choice item format is widely used in test construction and large-scale assessment. So far, there has been little research on the impact of the position of the solution among the response options, and the few existing results are inconsistent. Since reordering the response options would be an easy way to create parallel items for group settings, the influence of the solution's position on item difficulty should be examined. The Linear Logistic Test Model (Fischer, 1972) was used to analyze the data of 829 students aged 8–20 years who worked on general knowledge items. It was found that the position of the solution among the response options influences item difficulty: items are easiest when the solution is in the first position and more difficult when the solution is placed in a middle position or at the end of the set of response options.
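For readers unfamiliar with the model: the Linear Logistic Test Model extends the Rasch model by constraining each item difficulty to be a weighted sum of basic parameters (here, e.g., an effect of the solution's position). A sketch in standard notation; the design weights q_{ij} are generic, not this study's design matrix:

P(X_{vi} = 1 \mid \theta_v) = \frac{\exp(\theta_v - \beta_i)}{1 + \exp(\theta_v - \beta_i)},
\qquad
\beta_i = \sum_{j=1}^{m} q_{ij}\,\eta_j + c

where \theta_v is the ability of person v, \beta_i the difficulty of item i, \eta_j the basic parameters (such as position effects), and c a normalization constant.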


2017 ◽  
Vol 25 (4) ◽  
pp. 483-504
Author(s):  
Tsung-han Tsai ◽  
Chang-chih Lin

Due to the crucial role of political knowledge in democratic participation, the measurement of political knowledge has been a major concern in the discipline of political science. Common formats for political knowledge questions include multiple-choice items and open-ended identification questions. The conventional wisdom holds that multiple-choice items induce guessing behavior, which leads to underestimated item-difficulty parameters and biased estimates of political knowledge. This article examines guessing behavior in multiple-choice items and argues that a successful guess requires a certain level of knowledge conditional on the difficulty of the item. To address this issue, we propose a Bayesian IRT guessing model that accommodates the guessing components of item responses. The proposed model is applied to survey data from Taiwan, and the results show that it appropriately describes the guessing components based on respondents' levels of political knowledge and item characteristics. In general, partially informed respondents are the most likely to guess successfully, because well-informed respondents do not need to guess and barely informed respondents are easily drawn in by attractive distractors. We also examine the gender gap in political knowledge and find that, even when the guessing effect is accounted for, men are more knowledgeable than women about political affairs, which is consistent with the literature.
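For context, the baseline three-parameter logistic (3PL) IRT model treats guessing as a fixed lower asymptote c_j per item:

P(X_{ij} = 1 \mid \theta_i) = c_j + (1 - c_j)\,\frac{\exp[a_j(\theta_i - b_j)]}{1 + \exp[a_j(\theta_i - b_j)]}

where \theta_i is respondent i's knowledge and a_j, b_j, c_j are item j's discrimination, difficulty, and guessing parameters. The model proposed above departs from this baseline by letting the guessing component vary with the respondent's knowledge and the item's difficulty rather than fixing it at c_j; the abstract does not give the exact functional form, so only the baseline is shown here.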


2011 ◽  
Vol 35 (4) ◽  
pp. 396-401 ◽  
Author(s):  
Jonathan D. Kibble ◽  
Teresa Johnson

The purpose of this study was to evaluate whether multiple-choice item difficulty could be predicted either by a subjective judgment by the question author or by applying a learning taxonomy to the items. Eight physiology faculty members teaching an upper-level undergraduate human physiology course consented to participate in the study. The faculty members annotated questions before exams with the descriptors “easy,” “moderate,” or “hard” and classified them according to whether they tested knowledge, comprehension, or application. Overall analysis showed a statistically significant, but relatively low, correlation between the intended item difficulty and actual student scores (ρ = −0.19, P < 0.01), indicating that, as intended item difficulty increased, the resulting student scores on items tended to decrease. Although this expected inverse relationship was detected, faculty members were correct only 48% of the time when estimating difficulty. There was also significant individual variation among faculty members in the ability to predict item difficulty (χ2 = 16.84, P = 0.02). With regard to the cognitive level of items, no significant correlation was found between the item cognitive level and either actual student scores (ρ = −0.09, P = 0.14) or item discrimination (ρ = 0.05, P = 0.42). Despite the inability of faculty members to accurately predict item difficulty, the examinations were of high quality, as evidenced by reliability coefficients (Cronbach's α) of 0.70–0.92, the rejection of only 4 of 300 items in the postexamination review, and a mean item discrimination (point biserial) of 0.37. In conclusion, the effort of assigning annotations describing intended difficulty and cognitive levels to multiple-choice items is of doubtful value in terms of controlling examination difficulty. However, we also report that the process of annotating questions may enhance examination validity and can reveal aspects of the hidden curriculum.
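The quality indices reported above (proportion-correct difficulty, point-biserial discrimination, and Cronbach's α) can all be computed from a scored 0/1 response matrix. A minimal sketch in Python; names and layout are illustrative, not the study's code:

import numpy as np

def exam_stats(X):
    # X: (n_examinees, n_items) array of 0/1 scored responses
    n_items = X.shape[1]
    total = X.sum(axis=1)
    difficulty = X.mean(axis=0)  # proportion correct per item
    # point-biserial discrimination: each item score vs. the total score
    discrimination = np.array(
        [np.corrcoef(X[:, j], total)[0, 1] for j in range(n_items)]
    )
    # Cronbach's alpha from summed item variances and total-score variance
    alpha = n_items / (n_items - 1) * (
        1 - X.var(axis=0, ddof=1).sum() / total.var(ddof=1)
    )
    return difficulty, discrimination, alpha

Calling exam_stats on a 300-item exam matrix would reproduce the three kinds of indices the authors report, up to their choice of corrections.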


2002 ◽  
Vol 95 (3_suppl) ◽  
pp. 1281-1286
Author(s):  
David T. Morse ◽  
Linda V. Morse

The present study assessed whether selected item characteristics (difficulty for the group, corrected item-total correlation, cognitive level, and difficulty for the examinee) relate to judgments of item difficulty. Undergraduate students in two classes (ns = 76 and 43) identified what they believed were the five easiest and five most difficult items on a multiple-choice test. Statistically significant correlations were observed between difficulty for the group and the frequency with which items were chosen as easy, as well as the frequency with which items were chosen as difficult. Students performed significantly better on items they chose as easy than on those they chose as difficult. Items chosen as easy more often called for simple factual recall than did items chosen as difficult.
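The corrected item-total correlation mentioned here removes the item itself from the total score before correlating, so the item's own variance does not inflate the estimate. A minimal sketch, reusing the numpy import and the 0/1 response matrix X from the earlier sketch:

def corrected_item_total(X):
    # correlate each item with the total score of the remaining items only
    total = X.sum(axis=1)
    return np.array(
        [np.corrcoef(X[:, j], total - X[:, j])[0, 1] for j in range(X.shape[1])]
    )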


Author(s):  
Ahmad S. Audeh

The original Guilford formula for estimating multiple-choice item difficulty was based on a penalty for guessing. This penalty assumed completely random (blind) guessing, which means that it rests purely on mathematical estimation and on assumptions that are significantly violated in practice. Authentic and fair estimation, by contrast, should be based on a mixed scoring formula that adds a second correction factor, integrating measurement theory with decision theory to account for partial knowledge and risk-taking behavior. This paper presents a new formula with two correction factors related to guessing, partial knowledge, and risk-taking. Further studies are suggested to review the validity of the main assumptions of item response theory models.
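The classic correction-for-guessing score that the abstract builds on penalizes wrong answers under the blind-guessing assumption:

S = R - \frac{W}{k - 1}

where R is the number of right answers, W the number of wrong answers (omitted items are not penalized), and k the number of response options per item. The paper's two-factor extension for partial knowledge and risk-taking is not reproduced here, since the abstract does not state its form.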


2010 ◽  
Vol 65 (4) ◽  
pp. 257-282
Author(s):  
전유아 ◽  
신택수
