Taking an Item-Level Approach to Measuring Change With the Force and Motion Conceptual Evaluation: An Application of Item Response Theory

2013 ◽  
Vol 113 (7) ◽  
pp. 356-365 ◽  
Author(s):  
Robert M. Talbot


2020 ◽  
Vol 35 (7) ◽  
pp. 1094-1108
Author(s):  
Morgan E Nitta ◽  
Brooke E Magnus ◽  
Paul S Marshall ◽  
James B Hoelzle

Abstract There are many challenges associated with the assessment and diagnosis of ADHD in adulthood. Utilizing the graded response model (GRM) from item response theory (IRT), a comprehensive item-level analysis of adult ADHD rating scales in a clinical population was conducted with Barkley's Adult ADHD Rating Scale-IV, Self-Report of Current Symptoms (CSS), a self-report diagnostic checklist, and a similar self-report measure quantifying retrospective report of childhood symptoms, Barkley's Adult ADHD Rating Scale-IV, Self-Report of Childhood Symptoms (BAARS-C). Differences in item functioning were also considered after identifying and excluding individuals with suspect effort. Items associated with symptoms of inattention (IA) and hyperactivity/impulsivity (H/I) are endorsed differently across the lifespan, and these data suggest that they vary in their relationship to the theoretical constructs of IA and H/I. Screening for sufficient effort did not meaningfully change item-level functioning. The application of IRT to direct item-to-symptom measures allows for a unique psychometric assessment of how the current DSM-5 symptoms represent the latent traits of IA and H/I. Meeting a symptom threshold of five or more symptoms may be misleading. Closer attention to specific symptoms, in the context of the clinical interview and reported difficulties across domains, may lead to more informed diagnosis.
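As background on the model named in this abstract: under Samejima's graded response model, the probability of endorsing a given response category is the difference between adjacent cumulative "category k or higher" curves. A minimal sketch follows; the item parameters are illustrative placeholders, not values from the study:

```python
import numpy as np

def grm_probs(theta, a, b):
    """Category response probabilities under Samejima's graded response model.

    theta : latent trait value (scalar)
    a     : item discrimination
    b     : ordered category thresholds (length m, giving m + 1 categories)
    """
    b = np.asarray(b, dtype=float)
    # Cumulative curves P*(X >= k) for k = 1..m
    p_star = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    # Boundary conventions: P*(X >= 0) = 1 and P*(X >= m + 1) = 0
    upper = np.concatenate(([1.0], p_star))
    lower = np.concatenate((p_star, [0.0]))
    return upper - lower  # P(X = k) for k = 0..m

# Example: a hypothetical 4-category symptom item ("never" .. "very often")
probs = grm_probs(theta=0.5, a=1.8, b=[-1.0, 0.2, 1.4])
print(probs, probs.sum())  # four category probabilities summing to 1
```

The difference-of-cumulatives construction guarantees the category probabilities are nonnegative and sum to one whenever the thresholds are ordered.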


2020 ◽  
Author(s):  
E. Damiano D'Urso ◽  
Kim De Roover ◽  
Jeroen K. Vermunt ◽  
Jesper Tijmstra

In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance (MI) holds across the groups. This study compared the performance of multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing measurement invariance with ordinal data. A simulation study was conducted to compare the true positive rate (TPR) and false positive rate (FPR), both at the scale and at the item level, for these two approaches under an invariance and a non-invariance scenario. The results of the simulation studies showed that the performance, in terms of the TPR, of MG-CCFA- and MG-IRT-based approaches mostly depends on the scale length. In fact, for long scales, the likelihood ratio test (LRT) approach for MG-IRT outperformed the other approaches, while, for short scales, MG-CCFA seemed to be generally preferable. In addition, the performance of MG-CCFA's fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually. A decision flowchart, based on the results of the simulation studies, is provided to summarize the results and indicate which approach performed best in which setting.
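The likelihood ratio test referenced above is, in essence, a chi-square difference test between a constrained (invariant) model and a freely estimated one. A minimal sketch, using made-up log-likelihood values rather than anything from the simulation study:

```python
from scipy.stats import chi2

def likelihood_ratio_test(loglik_constrained, loglik_free, df_diff):
    """Chi-square difference (likelihood ratio) test comparing a constrained
    (invariance) model against a freely estimated model.

    LR = -2 * (logL_constrained - logL_free), referred to a chi-square
    distribution with df equal to the number of constrained parameters.
    """
    lr = -2.0 * (loglik_constrained - loglik_free)
    p_value = chi2.sf(lr, df_diff)
    return lr, p_value

# Hypothetical values: constraining three loadings to be equal across groups
# barely worsens fit, so invariance would not be rejected here.
lr, p = likelihood_ratio_test(-1052.4, -1050.1, df_diff=3)
print(round(lr, 2), round(p, 3))
```

In practice the two log-likelihoods would come from fitting the nested MG-IRT (or MG-CCFA) models with and without the equality constraints.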


2020 ◽  
Vol 18 (2) ◽  
pp. 2-43
Author(s):  
William R. Dardick ◽  
Brandi A. Weiss

New variants of entropy as measures of item fit in item response theory are investigated. Monte Carlo simulations examine aberrant conditions of item-level misfit to evaluate both relative performance (comparing EMRj, X2, G2, S-X2, and PV-Q1) and absolute performance (Type I error and empirical power). EMRj has utility in discovering misfit.


2018 ◽  
Vol 42 (8) ◽  
pp. 644-659
Author(s):  
Xue Zhang ◽  
Chun Wang ◽  
Jian Tao

Testing item-level fit is important in scale development to guide item revision/deletion. Many item-level fit indices have been proposed in the literature, yet none of them were directly applicable to an important family of models, namely, the higher order item response theory (HO-IRT) models. In this study, chi-square-based fit indices (i.e., Yen's Q1, McKinley and Mill's G2, Orlando and Thissen's S-X2, and S-G2) were extended to HO-IRT models. Their performances were evaluated via simulation studies in terms of false positive rates and correct detection rates. The manipulated factors include test structure (i.e., test length and number of dimensions), sample size, level of correlations among dimensions, and the proportion of misfitting items. For misfitting items, the sources of misfit, including misfitting item response functions and misspecified factor structures, were also manipulated. The results from the simulation studies demonstrate that S-G2 is promising for higher order items.
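As a sketch of the family of indices being extended: Yen's Q1 bins examinees by ability and compares observed and model-expected proportions correct within each bin. Below is a simplified dichotomous-item version with a hypothetical 2PL item (not the HO-IRT extension studied here):

```python
import numpy as np

def q1_item_fit(theta, responses, icc, n_groups=10):
    """Yen's Q1-style chi-square item fit for a dichotomous item.

    Examinees are sorted into ability groups; within each group, the observed
    proportion correct is compared with the model-expected proportion under
    the fitted item characteristic curve (icc).
    """
    order = np.argsort(theta)
    theta = np.asarray(theta)[order]
    responses = np.asarray(responses)[order]
    groups = np.array_split(np.arange(len(theta)), n_groups)
    q1 = 0.0
    for g in groups:
        n_g = len(g)
        obs = responses[g].mean()       # observed proportion correct
        exp = icc(theta[g]).mean()      # expected proportion under the model
        q1 += n_g * (obs - exp) ** 2 / (exp * (1 - exp))
    return q1  # compared with chi-square, df = n_groups - (item parameters)

# Hypothetical 2PL item; data are generated from the model itself,
# so Q1 should stay near its null chi-square expectation.
rng = np.random.default_rng(0)
theta = rng.normal(size=2000)
icc = lambda t: 1.0 / (1.0 + np.exp(-1.2 * (t - 0.3)))
responses = (rng.random(2000) < icc(theta)).astype(int)
print(q1_item_fit(theta, responses, icc))
```

S-X2 and S-G2 differ mainly in grouping examinees by observed summed score rather than by estimated ability, which is what makes them attractive when ability estimates are unstable.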


Rheumatology ◽  
2019 ◽  
Vol 59 (6) ◽  
pp. 1398-1406 ◽  
Author(s):  
Marie Corneloup ◽  
François Maurier ◽  
Denis Wahl ◽  
Geraldine Muller ◽  
Olivier Aumaitre ◽  
...  

Abstract Objective To explore, at an item level, the effect of disease activity (DA) on specific health-related quality of life (HRQoL) in SLE patients using an item response theory longitudinal model. Methods This prospective longitudinal multicentre French cohort, EQUAL, followed SLE patients over 2 years. Specific HRQoL according to LupusQoL and SLEQOL was collected every 3 months. DA according to the SELENA-SLEDAI flare index (SFI) and the revised SELENA-SLEDAI flare index (SFI-R) was evaluated every 6 months. For DA according to the SFI and each SFI-R type of flare, the specific HRQoL of remitting patients was compared with that of non-flaring patients by fitting a linear logistic model with relaxed assumptions for each domain of the questionnaires. Results Between December 2011 and July 2015, 336 patients were included (89.9% female). LupusQoL and SLEQOL items related to physical HRQoL (physical health, physical functioning, pain) were most affected by musculoskeletal and cutaneous flares. Cutaneous flares had a significant influence on self-image. Neurological or psychiatric flares had a more severe impact on specific HRQoL. Patient HRQoL was impacted up to 18 months after a flare. Conclusion Item response theory analysis is able to pinpoint the items that are influenced, for a given patient group, in terms of a latent trait change. Item-level analysis provides a new way of interpreting HRQoL variation in SLE patients, permitting a better understanding of the impact of DA on HRQoL. This kind of analysis could be easily implemented for the comparison of groups in a clinical trial. Trial registration ClinicalTrials.gov, http://clinicaltrials.gov, NCT01904812.


Author(s):  
E. Damiano D’Urso ◽  
Kim De Roover ◽  
Jeroen K. Vermunt ◽  
Jesper Tijmstra

Abstract In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance (MI) holds across the groups. This study compared the performance of scale- and item-level approaches based on multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing MI with ordinal data. In general, the results of the simulation studies showed that MG-CCFA-based approaches outperformed MG-IRT-based approaches when testing MI at the scale level, whereas, at the item level, the best performing approach depends on the tested parameter (i.e., loadings or thresholds). That is, when testing loadings equivalence, the likelihood ratio test provided the best trade-off between true-positive rate and false-positive rate, whereas, when testing thresholds equivalence, the χ2 test outperformed the other testing strategies. In addition, the performance of MG-CCFA's fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually.

