A comparison of the approaches of generalizability theory and item response theory in estimating the reliability of test scores for testlet-composed tests

This chapter presents an introductory overview of concepts that underscore the general framework of item response theory. “Item response theory” is a broad umbrella term used to describe a family of mathematical measurement models that consider observed test scores to be a function of latent, unobservable constructs. Most musical constructs cannot be directly measured and are therefore unobservable. Musical constructs can therefore only be inferred based on secondary, observable behaviors. Item response theory uses observable behaviors as probabilistic distributions of responses as a logistic function of person and item parameters in order to define latent constructs. This chapter describes philosophical, theoretical, and applied perspectives of item response theory in the context of measuring musical behaviors.

Download Full-text

Determining differential item functioning and its effect on the test scores of selected pib indexes, using item response theory techniques

SA Journal of Industrial Psychology ◽

10.4102/sajip.v27i2.783 ◽

2001 ◽

Vol 27 (2) ◽

Author(s):

Pieter Schaap

Keyword(s):

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Test Scores ◽

Response Theory ◽

South Africans ◽

Test Characteristics ◽

Potential Index ◽

Item Functioning

The objective of this article is to present the results of an investigation into the item and test characteristics of two tests of the Potential Index Batteries (PIB) in terms of differential item functioning (DIP) and the effect thereof on test scores of different race groups. The English Vocabulary (Index 12) and Spelling Tests (Index 22) of the PIB were analysed for white, black and coloured South Africans. Item response theory (IRT) methods were used to identify items which function differentially for white, black and coloured race groups. Opsomming Die doel van hierdie artikel is om die resultate van n ondersoek na die item- en toetseienskappe van twee PIB (Potential Index Batteries) toetse in terme van itemsydigheid en die invloed wat dit op die toetstellings van rassegroepe het, weer te gee. Die Potential Index Batteries (PIB) se Engelse Woordeskat (Index 12) en Spellingtoetse (Index 22) is ten opsigte van blanke, swart en gekleurde Suid-Afrikaners ontleed. Itemresponsteorie (IRT) is gebruik om items te identifiseer wat as sydig (DIP) vir die onderskeie rassegroepe beskou kan word.

Download Full-text

Has Item Response Theory Increased the Validity of Achievement Test Scores?

Applied Measurement in Education ◽

10.1207/s15324818ame0302_1 ◽

1990 ◽

Vol 3 (2) ◽

pp. 115-141 ◽

Cited By ~ 12

Author(s):

Robert L. Linn

Keyword(s):

Item Response Theory ◽

Item Response ◽

Test Scores ◽

Achievement Test ◽

Response Theory ◽

Achievement Test Scores

Download Full-text

Reliability of test scores in nonparametric item response theory

Psychometrika ◽

10.1007/bf02293957 ◽

1987 ◽

Vol 52 (1) ◽

pp. 79-97 ◽

Cited By ~ 84

Author(s):

Klaas Sijtsma ◽

Ivo W. Molenaar

Keyword(s):

Item Response Theory ◽

Item Response ◽

Test Scores ◽

Response Theory ◽

Nonparametric Item Response Theory ◽

Nonparametric Item Response

Download Full-text

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory

ETS Research Report Series ◽

10.1002/ets2.12151 ◽

2017 ◽

Vol 2017 (1) ◽

pp. 1-25

Author(s):

Feifei Li

Keyword(s):

Item Response Theory ◽

Item Response ◽

Generalizability Theory ◽

Correction Method ◽

Response Theory ◽

Test Analysis

Download Full-text

Beyond Dichotomies

Zeitschrift für Psychologie ◽

10.1027/2151-2604/a000194 ◽

2015 ◽

Vol 223 (1) ◽

pp. 3-13 ◽

Cited By ~ 266

Author(s):

Sigrid Blömeke ◽

Jan-Eric Gustafsson ◽

Richard J. Shavelson

Keyword(s):

Item Response Theory ◽

Item Response ◽

Real World ◽

Structural Model ◽

Sampling Error ◽

Generalizability Theory ◽

Latent Trait ◽

Error Variance ◽

Response Theory ◽

Statistical Approaches

In this paper, the state of research on the assessment of competencies in higher education is reviewed. Fundamental conceptual and methodological issues are clarified by showing that current controversies are built on misleading dichotomies. By systematically sketching conceptual controversies, competing competence definitions are unpacked (analytic/trait vs. holistic/real-world performance) and commonplaces are identified. Disagreements are also highlighted. Similarly, competing statistical approaches to assessing competencies, namely item-response theory (latent trait) versus generalizability theory (sampling error variance), are unpacked. The resulting framework moves beyond dichotomies and shows how the different approaches complement each other. Competence is viewed along a continuum from traits that underlie perception, interpretation, and decision-making skills, which in turn give rise to observed behavior in real-world situations. Statistical approaches are also viewed along a continuum from linear to nonlinear models that serve different purposes. Item response theory (IRT) models may be used for scaling item responses and modeling structural relations, and generalizability theory (GT) models pinpoint sources of measurement error variance, thereby enabling the design of reliable measurements. The proposed framework suggests multiple new research studies and may serve as a “grand” structural model.

Download Full-text