FRI0668 Item response theory to standardize patient reported physical function outcomes; linking 10 commonly used questionnaires to a common metric

AbstractThe reliable change index has been used to evaluate the significance of individual change in health-related quality of life. We estimate reliable change for two measures (physical function and emotional distress) in the Patient-Reported Outcomes Measurement Information System (PROMIS®) 29-item health-related quality of life measure (PROMIS-29 v2.1). Using two waves of data collected 3 months apart in a longitudinal observational study of chronic low back pain and chronic neck pain patients receiving chiropractic care, and simulations, we compare estimates of reliable change from classical test theory fixed standard errors with item response theory standard errors from the graded response model. We find that unless true change in the PROMIS physical function and emotional distress scales is substantial, classical test theory estimates of significant individual change are much more optimistic than estimates of change based on item response theory.

Download Full-text

Construction of the eight-item patient-reported outcomes measurement information system pediatric physical function scales: built using item response theory

Journal of Clinical Epidemiology ◽

10.1016/j.jclinepi.2010.10.012 ◽

2011 ◽

Vol 64 (7) ◽

pp. 794-804 ◽

Cited By ~ 105

Author(s):

Esi Morgan DeWitt ◽

Brian D. Stucky ◽

David Thissen ◽

Debra E. Irwin ◽

Michelle Langer ◽

...

Keyword(s):

Information System ◽

Item Response Theory ◽

Physical Function ◽

Item Response ◽

Patient Reported Outcomes ◽

Measurement Information ◽

Response Theory ◽

Outcomes Measurement ◽

Patient Reported ◽

Measurement Information System

Download Full-text

Item Response Theory, Computerized Adaptive Testing, and PROMIS: Assessment of Physical Function

The Journal of Rheumatology ◽

10.3899/jrheum.130813 ◽

2013 ◽

Vol 41 (1) ◽

pp. 153-158 ◽

Cited By ~ 86

Author(s):

James F. Fries ◽

James Witter ◽

Matthias Rose ◽

David Cella ◽

Dinesh Khanna ◽

...

Keyword(s):

Item Response Theory ◽

Physical Function ◽

Item Response ◽

Computerized Adaptive Testing ◽

Short Form ◽

Adaptive Testing ◽

Self Report ◽

Measurement Information ◽

Response Theory ◽

Patient Reported

Objective.Patient-reported outcome (PRO) questionnaires record health information directly from research participants because observers may not accurately represent the patient perspective. Patient-reported Outcomes Measurement Information System (PROMIS) is a US National Institutes of Health cooperative group charged with bringing PRO to a new level of precision and standardization across diseases by item development and use of item response theory (IRT).Methods.With IRT methods, improved items are calibrated on an underlying concept to form an item bank for a “domain” such as physical function (PF). The most informative items can be combined to construct efficient “instruments” such as 10-item or 20-item PF static forms. Each item is calibrated on the basis of the probability that a given person will respond at a given level, and the ability of the item to discriminate people from one another. Tailored forms may cover any desired level of the domain being measured. Computerized adaptive testing (CAT) selects the best items to sharpen the estimate of a person’s functional ability, based on prior responses to earlier questions. PROMIS item banks have been improved with experience from several thousand items, and are calibrated on over 21,000 respondents.Results.In areas tested to date, PROMIS PF instruments are superior or equal to Health Assessment Questionnaire and Medical Outcome Study Short Form-36 Survey legacy instruments in clarity, translatability, patient importance, reliability, and sensitivity to change.Conclusion.Precise measures, such as PROMIS, efficiently incorporate patient self-report of health into research, potentially reducing research cost by lowering sample size requirements. The advent of routine IRT applications has the potential to transform PRO measurement.

Download Full-text

Using item response theory improved responsiveness of patient-reported outcomes measures in carpal tunnel syndrome

Journal of Clinical Epidemiology ◽

10.1016/j.jclinepi.2011.08.009 ◽

2012 ◽

Vol 65 (3) ◽

pp. 325-334 ◽

Cited By ~ 9

Author(s):

Per-Erik Lyrén ◽

Isam Atroshi

Keyword(s):

Item Response Theory ◽

Carpal Tunnel Syndrome ◽

Item Response ◽

Carpal Tunnel ◽

Patient Reported Outcomes ◽

Response Theory ◽

Outcomes Measures ◽

Patient Reported ◽

Tunnel Syndrome

Download Full-text

Item Response Theory Methods can Improve the Measurement of Physical Function by Combining the Modified Health Assessment Questionnaire and the SF-36 Physical Function Scale

Quality of Life Research ◽

10.1007/s11136-007-9193-5 ◽

2007 ◽

Vol 16 (4) ◽

Cited By ~ 34

Author(s):

Marie Martin ◽

Mark Kosinski ◽

Jakob B. Bjorner ◽

John E. Ware ◽

Ross MacLean ◽

...

Keyword(s):

Item Response Theory ◽

Physical Function ◽

Item Response ◽

Health Assessment Questionnaire ◽

Health Assessment ◽

Response Theory ◽

Sf 36 ◽

Assessment Questionnaire

Download Full-text

Using item response theory to address vulnerabilities in FFQ

British Journal Of Nutrition ◽

10.1017/s0007114517002215 ◽

2017 ◽

Vol 118 (5) ◽

pp. 383-391 ◽

Cited By ~ 1

Author(s):

Josh B. Kazman ◽

Jonathan M. Scott ◽

Patricia A. Deuster

Keyword(s):

Item Response Theory ◽

Item Response ◽

Dietary Habits ◽

Assessment Tool ◽

Design Stage ◽

Measurement Information ◽

Response Theory ◽

Us Army ◽

Item Functioning ◽

Patient Reported

AbstractThe limitations for self-reporting of dietary patterns are widely recognised as a major vulnerability of FFQ and the dietary screeners/scales derived from FFQ. Such instruments can yield inconsistent results to produce questionable interpretations. The present article discusses the value of psychometric approaches and standards in addressing these drawbacks for instruments used to estimate dietary habits and nutrient intake. We argue that a FFQ or screener that treats diet as a ‘latent construct’ can be optimised for both internal consistency and the value of the research results. Latent constructs, a foundation for item response theory (IRT)-based scales (e.g. Patient Reported Outcomes Measurement Information System) are typically introduced in the design stage of an instrument to elicit critical factors that cannot be observed or measured directly. We propose an iterative approach that uses such modelling to refine FFQ and similar instruments. To that end, we illustrate the benefits of psychometric modelling by using items and data from a sample of 12 370 Soldiers who completed the 2012 US Army Global Assessment Tool (GAT). We used factor analysis to build the scale incorporating five out of eleven survey items. An IRT-driven assessment of response category properties indicates likely problems in the ordering or wording of several response categories. Group comparisons, examined with differential item functioning (DIF), provided evidence of scale validity across each Army sub-population (sex, service component and officer status). Such an approach holds promise for future FFQ.

Download Full-text

Item Response Theory and its Applications to Patient-Reported Outcomes Measurement

Evaluation & the Health Professions ◽

10.1177/0163278705278275 ◽

2005 ◽

Vol 28 (3) ◽

pp. 264-282 ◽

Cited By ~ 72

Author(s):

Chih-Hung Chang ◽

Bryce B. Reeve

Keyword(s):

Item Response Theory ◽

Item Response ◽

Patient Reported Outcomes ◽

Response Theory ◽

Outcomes Measurement ◽

Irt Models ◽

Patient Reported ◽

New Directions

This article provides an overview of item response theory (IRT) models and how they can be appropriately applied to patient-reported outcomes (PROs) measurement. Specifically, the following topics are discussed: (a) basics of IRT, (b) types of IRT models, (c) how IRT models have been applied to date, and (d) new directions in applying IRT to PRO measurements.

Download Full-text

Psychometric evaluation of an Italian custom 4-item short form of the PROMIS anxiety item bank in immune-mediated inflammatory diseases: an item response theory analysis

PeerJ ◽

10.7717/peerj.12100 ◽

2021 ◽

Vol 9 ◽

pp. e12100

Author(s):

Marco Tullio Liuzza ◽

Rocco Spagnuolo ◽

Gabriella Antonucci ◽

Rosa Daniela Grembiale ◽

Cristina Cosco ◽

...

Keyword(s):

Item Response Theory ◽

Psychometric Properties ◽

Item Response ◽

Inflammatory Diseases ◽

Short Form ◽

Latent Trait ◽

Test Reliability ◽

Control Group ◽

Response Theory ◽

Patient Reported

Background There has recently been growing interest in the roles of inflammation in contributing to the development of anxiety in people with immune-mediated inflammatory diseases (IMID). Patient-reported outcome measures can facilitate the assessment of physical and psychological functioning. The National Institutes of Health (NIH)’s Patient-Reported Outcomes Measurement Information System (PROMIS®) is a set of Patient-Reported Outcomes (PROs) that cover physical appearance, mental health, and social health. The PROMIS has been built through an Item Response Theory approach (IRT), a model-based measurement in which trait level estimates depend on both persons’ responses and on the properties of the items that were administered. The aim of this study is to test the psychometric properties of an Italian custom four-item Short Form of the PROMIS Anxiety item bank in a cohort of outpatients with IMIDs. Methods We selected four items from the Italian standard Short Form Anxiety 8a and administered them to consecutive outpatients affected by Inflammatory Bowel disease (n = 246), rheumatological (n = 100) and dermatological (n = 43) diseases, and healthy volunteers (n = 280). Data was analyzed through an Item Response Theory (IRT) analysis in order to evaluate the psychometric properties of the Italian adaptation of the PROMIS anxiety short form. Results Taken together, Confirmatory Factor Analysis and Exploratory Factor analysis suggest that the unidimensionality assumption of the instrument holds. The instrument has excellent reliability from a Classical Theory of Test (CTT) standpoint (Cronbach’s α = 0.93, McDonald’s ω = 0.92). The 2PL Graded Response Model (GRM) model provided showed a better goodness of fit as compared to the 1PL GRM model, and local independence assumption appears to be met overall. We did not find signs of differential item functioning (DIF) for age and gender, but evidence for uniform (but not non-uniform) DIF was found in three out of four items for the patient vs. control group. Analysis of the test reliability curve suggested that the instrument is most reliable for higher levels of the latent trait of anxiety. The groups of patients exhibited higher levels of anxiety as compared to the control group (ps < 0.001, Bonferroni-corrected). The groups of patients were not different between themselves (p = 1, Bonferroni-corrected). T-scores based on estimated latent trait and raw scores were highly correlated (Pearson’s r = 0.98) and led to similar results. Discussion The Italian custom four-item short form from the PROMIS anxiety form 8a shows acceptable psychometric properties both from a CTT and an IRT standpoint. The Test Reliability Curve shows that this instrument is mostly informative for people with higher levels of anxiety, making it particularly suitable for clinical populations such as IMID patients.

Download Full-text

Establishing Thresholds for Meaningful Within-individual Change Using Longitudinal Item Response Theory

10.21203/rs.3.rs-371137/v1 ◽

2021 ◽

Author(s):

Jakob Bue Bjorner ◽

Berend Terluin ◽

Andrew Trigg ◽

Jinxiang Hu ◽

Keri J.S. Brady ◽

...

Keyword(s):

Item Response Theory ◽

Item Response ◽

Individual Change ◽

Response Theory ◽

Traditional Methods ◽

Data Set ◽

Score Improvement ◽

Patient Reported ◽

Longitudinal Item Response

Abstract PURPOSE: Thresholds for meaningful within-individual change (MWIC) are useful for interpreting patient-reported outcome measures (PROM). Transition ratings (TR) have been recommended as anchors to establish MWIC. Traditional statistical methods for analyzing MWIC such as mean change analysis, receiver operating characteristic (ROC) analysis, and predictive modeling ignore problems of floor/ceiling effects and measurement error in the PROM scores and the TR item. We present a novel approach to MWIC estimation for multi-item scales using longitudinal item response theory (LIRT).METHODS: A Graded Response LIRT model for baseline and follow-up PROM data was expanded to include a TR item measuring latent change. The LIRT threshold parameter for the TR established the MWIC threshold on the latent metric, from which the observed PROM score MWIC threshold was estimated. We compared the LIRT approach and traditional methods using an example data set with baseline and three follow-up assessments differing by magnitude of score improvement, variance of score improvement, and baseline-follow-up score correlation.RESULTS: The LIRT model provided good fit to the data. LIRT estimates of observed PROM MWIC varied between 3 and 4 points score improvement. In contrast, results from traditional methods varied from 2 points to 10 points - strongly associated with proportion of self-rated improvement. Best agreement between methods was seen when approximately 50% rated their health as improved.CONCLUSION : Results from traditional analyses of anchor-based MWIC are impacted by study conditions. LIRT constitutes a promising and more robust analytic approach to identifying thresholds for MWIC.

Download Full-text