Impact of Missing Data on Person-Model Fit and Person Trait Estimation

2008 ◽  
Vol 32 (6) ◽  
pp. 466-479 ◽  
Author(s):  
Bo Zhang ◽  
Cindy M. Walker

2013 ◽
Vol 33 (1) ◽  
pp. 143-157 ◽  
Author(s):  
P. Wu ◽  
X.M. Tu ◽  
J. Kowalski

2019 ◽  
Vol 80 (1) ◽  
pp. 41-66 ◽  
Author(s):  
Dexin Shi ◽  
Taehun Lee ◽  
Amanda J. Fairchild ◽  
Alberto Maydeu-Olivares

This study compares two missing data procedures in the context of ordinal factor analysis models: pairwise deletion (PD; the default setting in Mplus) and multiple imputation (MI). We examine which procedure yields parameter estimates and model fit indices closer to those obtained from complete data. The performance of PD and MI is compared under a wide range of conditions, including number of response categories, sample size, percentage of missingness, and degree of model misfit. Results indicate that both PD and MI yield parameter estimates similar to those from the analysis of complete data when the data are missing completely at random (MCAR). When the data are missing at random (MAR), PD parameter estimates are severely biased across the parameter combinations in the study. When the percentage of missingness is less than 50%, MI yields parameter estimates similar to those from complete data; however, the fit indices (i.e., χ², RMSEA, and WRMR) suggest a worse fit than that observed with complete data. We recommend that applied researchers use MI when fitting ordinal factor models with missing data, and that model fit be interpreted based on the incremental fit indices TLI and CFI.
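
Because the abstract contrasts pairwise deletion with multiple imputation, a minimal sketch may help make the difference concrete. The simulation below is illustrative only: it uses continuous rather than ordinal indicators and scikit-learn's IterativeImputer in place of the study's Mplus-based MI; the variable names and the MAR mechanism are assumptions, not taken from the study.

```python
# Sketch: pairwise deletion (PD) vs. multiple imputation (MI) for a
# correlation matrix under a MAR missingness mechanism.
import numpy as np
import pandas as pd
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer

rng = np.random.default_rng(0)
n = 500

# Simulate three correlated continuous indicators (the study used
# ordinal indicators; continuous ones keep this sketch short).
cov = np.array([[1.0, 0.5, 0.4],
                [0.5, 1.0, 0.6],
                [0.4, 0.6, 1.0]])
data = pd.DataFrame(rng.multivariate_normal(np.zeros(3), cov, size=n),
                    columns=["x1", "x2", "x3"])

# Impose a MAR mechanism: x2 is missing more often when x1 is low.
p_miss = np.where(data["x1"] < 0, 0.5, 0.1)
data.loc[rng.random(n) < p_miss, "x2"] = np.nan

# Pairwise deletion: pandas computes each correlation from the cases
# that are complete on that particular pair of variables.
pd_corr = data.corr()

# Multiple imputation: create m completed datasets and pool (average)
# the resulting correlation matrices across imputations.
m = 20
imputed_corrs = []
for seed in range(m):
    imp = IterativeImputer(sample_posterior=True, random_state=seed)
    completed = pd.DataFrame(imp.fit_transform(data), columns=data.columns)
    imputed_corrs.append(completed.corr())
mi_corr = sum(imputed_corrs) / m

print("PD correlations:\n", pd_corr.round(3))
print("MI pooled correlations:\n", mi_corr.round(3))
```

Under MCAR the two approaches would give similar answers; under the MAR mechanism above, the PD correlations involving x2 tend to drift from the generating values while the pooled MI estimates stay closer, mirroring the pattern reported in the abstract.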


Author(s):  
Alexander Robitzsch

Missing item responses are prevalent in educational large-scale assessment studies like the Programme for International Student Assessment (PISA). Current operational practice scores missing item responses as wrong, but several psychometricians have advocated a model-based treatment based on the latent ignorability assumption. In this approach, item responses and response indicators are jointly modeled conditional on a latent ability and a latent response propensity variable. Alternatively, imputation-based approaches can be used. The latent ignorability assumption is weakened in the Mislevy-Wu model, which characterizes a nonignorable missingness mechanism and allows the missingness of an item to depend on the item response itself. Both the scoring of missing item responses as wrong and the latent ignorable model are submodels of the Mislevy-Wu model. This article uses the PISA 2018 mathematics dataset to investigate the consequences of different missing data treatments for country means, which can differ substantially across scaling models. In contrast to previous statements in the literature, scoring missing item responses as incorrect provided a better model fit than a latent ignorable model for most countries. Furthermore, the dependence of an item's missingness on the item response itself, after conditioning on the latent response propensity, was much more pronounced for constructed-response items than for multiple-choice items. As a consequence, scaling models that presuppose latent ignorability should be rejected on two grounds. First, the Mislevy-Wu model is preferred over the latent ignorable model for reasons of model fit. Second, we argue that model fit should play only a minor role in choosing psychometric models in large-scale assessment studies because validity considerations are most relevant: missing data treatments that countries (and, hence, their students) can easily manipulate result in unfair country comparisons.
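
For readers unfamiliar with the model, a sketch of one common parameterization of the Mislevy-Wu response mechanism may help; the notation below is a reconstruction from the verbal description above, not the article's own equations:

$$P(R_{pi} = 1 \mid \xi_p, X_{pi}) = \Psi\left(\xi_p + \beta_i + \delta_i X_{pi}\right)$$

Here $R_{pi} = 1$ indicates that person $p$ responds to item $i$, $\xi_p$ is the latent response propensity, $\beta_i$ an item intercept, $X_{pi}$ the (possibly unobserved) item response, and $\Psi$ the logistic function. Setting $\delta_i = 0$ removes the dependence of missingness on the item response itself and yields the latent ignorable model; in this parameterization, letting $\delta_i \to \infty$ forces everyone who would answer correctly to respond, so missing responses can only stem from incorrect answers, which corresponds to scoring missing responses as wrong.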


BMJ Open ◽  
2018 ◽  
Vol 8 (3) ◽  
pp. e018641 ◽  
Author(s):  
Erin L Merz ◽  
Linda Kwakkenbos ◽  
Marie-Eve Carrier ◽  
Shadi Gholizadeh ◽  
Sarah D Mills ◽  
...  

Objective: Valid measures of appearance concern are needed in systemic sclerosis (SSc), a rare, disfiguring autoimmune disease. The Derriford Appearance Scale-24 (DAS-24) assesses appearance-related distress related to visible differences. There is uncertainty regarding its factor structure, possibly due to its scoring method.
Design: Cross-sectional survey.
Setting: Participants with SSc were recruited from 27 centres in Canada, the USA and the UK. Participants who self-identified as having visible differences were recruited from community and clinical settings in the UK.
Participants: Two samples were analysed (n=950 participants with SSc; n=1265 participants with visible differences).
Primary and secondary outcome measures: The DAS-24 factor structure was evaluated using two scoring methods. Convergent validity was evaluated with measures of social interaction anxiety, depression, fear of negative evaluation, social discomfort and dissatisfaction with appearance.
Results: When items marked by respondents as ‘not applicable’ were scored as 0, per standard DAS-24 scoring, a one-factor model fit poorly; when treated as missing data, the one-factor model fit well. Convergent validity analyses revealed strong correlations that were similar across scoring methods.
Conclusions: Treating ‘not applicable’ responses as missing improved the measurement model but did not substantively influence the practical inferences that can be drawn from DAS-24 scores. Indications of item redundancy and poorly performing items suggest that the DAS-24 could be improved and potentially shortened.
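
As a concrete illustration of the two scoring methods under comparison, the sketch below recodes a hypothetical ‘not applicable’ response either as 0 (standard DAS-24 scoring) or as missing; the NA code, item names, response range, and the proration shown are assumptions, not the authors' implementation.

```python
# Sketch: two ways of handling 'not applicable' (NA) item responses.
import numpy as np
import pandas as pd

NOT_APPLICABLE = -9  # hypothetical code for 'not applicable'
responses = pd.DataFrame(
    {"item1": [1, 3, NOT_APPLICABLE],
     "item2": [4, NOT_APPLICABLE, 2],
     "item3": [0, 2, 3]}
)

# Standard scoring: 'not applicable' contributes 0 to the total score.
scored_zero = responses.replace(NOT_APPLICABLE, 0)
total_zero = scored_zero.sum(axis=1)

# Alternative scoring: treat 'not applicable' as missing, so a factor
# model (or a prorated total score) handles it as missing data instead.
scored_missing = responses.replace(NOT_APPLICABLE, np.nan)
total_prorated = scored_missing.mean(axis=1) * responses.shape[1]

print(total_zero)
print(total_prorated.round(2))
```

The study's factor-analytic finding corresponds to the second branch: passing the NA-as-missing matrix to the measurement model improved fit, while leaving summary scores, and hence practical inferences, largely unchanged.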


2020 ◽  
Vol 11 ◽  
Author(s):  
Karl Schweizer ◽  
Andreas Gold ◽  
Dorothea Krampen ◽  
Tengfei Wang

The paper investigates whether valid results can be achieved when analyzing the structure of datasets in which a large percentage of the data is missing without replacement. Two types of confirmatory factor analysis (CFA) models were employed for this purpose: the missing data CFA model, which includes an additional latent variable representing the missing data, and the semi-hierarchical CFA model, which also includes the additional latent variable and reflects the hierarchical structure assumed to underlie the data. Whereas the missing data CFA model assumes that the model is equally valid for all participants, the semi-hierarchical CFA model is implicitly specified differently for the subgroups of participants with and without omissions. Comparing these models with the regular one-factor model on simulated binary data revealed that modeling the missing data prevented its negative effects on model fit. The semi-hierarchical CFA model estimated the factor loadings most accurately: the average estimated factor loadings for items with and without omissions showed the expected equal sizes, although even this model tended to underestimate the expected values.
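
A schematic rendering of the augmented models may clarify the role of the additional latent variable; the equation below follows the verbal description only and is a reconstruction, not the authors' exact specification:

$$x_{pi} = \lambda_i \xi_p + \gamma_i \mu_p + \varepsilon_{pi}$$

Here $\xi_p$ is the content factor, $\mu_p$ the additional latent variable representing the missing data, $\lambda_i$ the regular factor loadings, and $\gamma_i$ the loadings on the missing-data variable (relevant only for items with omissions). The missing data CFA model applies this specification to all participants alike, whereas the semi-hierarchical CFA model implicitly restricts the $\mu_p$ part to the subgroup of participants with omissions, so that participants without omissions are described by the regular one-factor part $x_{pi} = \lambda_i \xi_p + \varepsilon_{pi}$.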

