Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches

AbstractIn social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance (MI) holds across the groups. This study compared the performance of scale- and item-level approaches based on multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing MI with ordinal data. In general, the results of the simulation studies showed that MG-CCFA-based approaches outperformed MG-IRT-based approaches when testing MI at the scale level, whereas, at the item level, the best performing approach depends on the tested parameter (i.e., loadings or thresholds). That is, when testing loadings equivalence, the likelihood ratio test provided the best trade-off between true-positive rate and false-positive rate, whereas, when testing thresholds equivalence, the χ2 test outperformed the other testing strategies. In addition, the performance of MG-CCFA’s fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually.

Download Full-text

Scale length does matter: Recommendations for Measurement Invariance Testing with Categorical Factor Analysis and Item Response Theory Approaches

10.31234/osf.io/udbna ◽

2020 ◽

Author(s):

E. Damiano D'Urso ◽

Kim De Roover ◽

Jeroen K. Vermunt ◽

Jesper Tijmstra

Keyword(s):

Factor Analysis ◽

Item Response Theory ◽

Measurement Invariance ◽

Item Response ◽

Simulation Studies ◽

Response Theory ◽

Multiple Group ◽

Item Level ◽

Positive Rate ◽

Scale Length

In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance holds across the groups. This study compared the performance of multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing measurement invariance with ordinal data. A simulation study was conducted to compare the true positive rate (TPR) and false positive rate (FPR) both at the scale and at the item level for these two approaches under an invariance and a non-invariance scenario. The results of the simulation studies showed that the performance, in terms of the TPR, of MG-CCFA- and MG-IRT-based approaches mostly depends on the scale length. In fact, for long scales, the likelihood ratio test (LRT) approach, for MG-IRT, outperformed the other approaches, while, for short scales, MG-CCFA seemed to be generally preferable. In addition, the performance of MG-CCFA's fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually. A decision flowchart, based on the results of the simulation studies, is provided to help summarizing the results and providing indications on which approach performed best and in which setting.

Download Full-text

Is aggression the same for boys and girls? Assessing measurement invariance with confirmatory factor analysis and item response theory.

School Psychology Quarterly ◽

10.1037/a0018768 ◽

2010 ◽

Vol 25 (1) ◽

pp. 45-61 ◽

Cited By ~ 14

Author(s):

Sangwon Kim ◽

Seock-Ho Kim ◽

Randy W. Kamphaus

Keyword(s):

Factor Analysis ◽

Item Response Theory ◽

Confirmatory Factor Analysis ◽

Measurement Invariance ◽

Item Response ◽

Response Theory ◽

Confirmatory Factor

Download Full-text

Confirmatory factor analysis and item response theory: Two approaches for exploring measurement invariance.

Psychological Bulletin ◽

10.1037/0033-2909.114.3.552 ◽

1993 ◽

Vol 114 (3) ◽

pp. 552-566 ◽

Cited By ~ 656

Author(s):

Steven P. Reise ◽

Keith F. Widaman ◽

Robin H. Pugh

Keyword(s):

Factor Analysis ◽

Item Response Theory ◽

Confirmatory Factor Analysis ◽

Measurement Invariance ◽

Item Response ◽

Response Theory ◽

Confirmatory Factor

Download Full-text

The Value Of Item Response Theory In Invariance Testing

Journal of Applied Business Research (JABR) ◽

10.19030/jabr.v32i6.9836 ◽

2016 ◽

Vol 32 (6) ◽

pp. 1843

Author(s):

Nico Martins ◽

Hester Nienaber

Keyword(s):

Item Response Theory ◽

Measurement Invariance ◽

Item Response ◽

Employee Engagement ◽

Future Research ◽

Analytic Structure ◽

Response Theory ◽

Multiple Group ◽

The Individual ◽

Annual Turnover

The goal of the current study was to assess the Employee Engagement Instrument (EEI) from an item response theory (IRT) perspective, with a specific focus on measurement invariance for annual turnover. The sample comprised 4 099 respondents from all business sectors in South Africa. This article describes the logic and procedures used to test for factorial invariance across groups in the context of construct validation. The procedures included testing for configural and metric invariance in the framework of multiple-group confirmatory factor analysis (CFA).The results confirmed the factor analytic structure of the model fit for some of the individual scales of the EEI. The measurement invariance of the EEI as a function of annual turnover was confirmed. However, the results indicated that the EEI needs to be refined for future research.

Download Full-text

Structure of Intimate Partner Violence in Male Same Sex Relationships: An Item Response Theory and Exploratory Factor Analysis

PsycEXTRA Dataset ◽

10.1037/e571212013-360 ◽

2013 ◽

Author(s):

Chitra Raghavan ◽

Connie J. A. Beck ◽

J. Michael Menke

Keyword(s):

Intimate Partner Violence ◽

Factor Analysis ◽

Item Response Theory ◽

Item Response ◽

Exploratory Factor Analysis ◽

Partner Violence ◽

Intimate Partner ◽

Same Sex ◽

Response Theory

Download Full-text

The Revised Mood Rhythm Instrument: A Large Multicultural Psychometric Study

Journal of Clinical Medicine ◽

10.3390/jcm10030388 ◽

2021 ◽

Vol 10 (3) ◽

pp. 388

Author(s):

Melissa Alves Braga de Oliveira ◽

Euclides de Mendonça Filho ◽

Alicia Carissimi ◽

Luciene Lima dos Santos Garay ◽

Marina Scop ◽

...

Keyword(s):

Factor Analysis ◽

Item Response Theory ◽

Network Analysis ◽

Psychometric Properties ◽

Item Response ◽

Response Theory ◽

Item Response Theory Analysis ◽

Increased Risk ◽

Psychometric Analyses ◽

Self Reporting Questionnaire

Background: Recent studies with the mood rhythm instrument (MRhI) have shown that the presence of recurrent daily peaks in specific mood symptoms are significantly associated with increased risk of psychiatric disorders. Using a large sample collected in Brazil, Spain, and Canada, we aimed to analyze which MRhI items maintained good psychometric properties across cultures. As a secondary aim, we used network analysis to visualize the strength of the association between the MRhI items. Methods: Adults (n = 1275) between 18–60 years old from Spain (n = 458), Brazil (n = 415), and Canada (n = 401) completed the MRhI and the self-reporting questionnaire (SRQ-20). Psychometric analyses followed three steps: Factor analysis, item response theory, and network analysis. Results: The factor analysis indicated the retention of three factors that grouped the MRhI items into cognitive, somatic, and affective domains. The item response theory analysis suggested the exclusion of items that displayed a significant divergence in difficulty measures between countries. Finally, the network analysis revealed a structure where sleepiness plays a central role in connecting the three domains. These psychometric analyses enabled a psychometric-based refinement of the MRhI, where the 11 items with good properties across cultures were kept in a shorter, revised MRhI version (MRhI-r). Limitations: Participants were mainly university students and, as we did not conduct a formal clinical assessment, any potential correlations (beyond the validated SRQ) cannot be ascertained. Conclusions: The MRhI-r is a novel tool to investigate self-perceived rhythmicity of mood-related symptoms and behaviors, with good psychometric properties across multiple cultures.

Download Full-text

Applying Item Response Theory Analysis to the Montreal Cognitive Assessment in a Low-Education Older Population

Assessment ◽

10.1177/1073191118821733 ◽

2019 ◽

Vol 27 (7) ◽

pp. 1416-1428 ◽

Cited By ~ 3

Author(s):

Hao Luo ◽

Björn Andersson ◽

Jennifer Y. M. Tang ◽

Gloria H. Y. Wong

Keyword(s):

Item Response Theory ◽

Item Response ◽

Cognitive Assessment ◽

Formal Education ◽

Montreal Cognitive Assessment ◽

Response Theory ◽

Homogeneous Sample ◽

Educational Backgrounds ◽

Multiple Group ◽

Item Properties

The traditional application of the Montreal Cognitive Assessment uses total scores in defining cognitive impairment levels, without considering variations in item properties across populations. Item response theory (IRT) analysis provides a potential solution to minimize the effect of important confounding factors such as education. This research applies IRT to investigate the characteristics of Montreal Cognitive Assessment items in a randomly selected, culturally homogeneous sample of 1,873 older persons with diverse educational backgrounds. Any formal education was used as a grouping variable to estimate multiple-group IRT models. Results showed that item characteristics differed between people with and without formal education. Item functioning of the Cube, Clock Number, and Clock Hand items was superior in people without formal education. This analysis provided evidence that item properties vary with education, calling for more sophisticated modelling based on IRT to incorporate the effect of education.

Download Full-text