scholarly journals Framework for the treatment and reporting of missing data in observational studies: The Treatment And Reporting of Missing data in Observational Studies framework

2021 ◽  
Vol 134 ◽  
pp. 79-88 ◽  
Author(s):  
Katherine J. Lee ◽  
Kate M. Tilling ◽  
Rosie P. Cornish ◽  
Roderick J.A. Little ◽  
Melanie L. Bell ◽  
...  
2021 ◽  
Vol 50 (Supplement_1) ◽  
Author(s):  
Jiaxin Zhang ◽  
S. Ghazaleh Dashti ◽  
John B. Carlin ◽  
Katherine J. Lee ◽  
Margarita Moreno-Betancur

Abstract Background Outcome regression remains widely applied for estimating causal effects in observational studies, in which causal inference is conceptualised as emulating a randomized controlled trial (RCT). Multiple imputation (MI) is a commonly used method for handling missing data, but while in RCTs it has been shown that MI should be conducted by treatment group to reduce bias, whether imputation should be conducted by exposure group in observational studies has not been studied. Methods We conducted a simulation study to evaluate the performance of seven methods for handling missing data: Complete-case analysis (CCA), MI of main effect, MI with interactions (between exposure and: outcome, a strong confounder, outcome and a strong confounder, all incomplete), and MI conducted by exposure group. We simulated data based on an example from the Victorian Adolescent Health Cohort Study. Three exposure prevalences and seven outcome generation models were considered, the latter ranging from no interaction to strong-positive or negative exposure-confounder interaction. Various missingness scenarios were examined: with incomplete outcome only or also incomplete confounders, and three levels of complexity regarding the missingness mechanism. Results For all scenarios, MI by exposure led to the least bias, followed by MI approaches that included exposure-confounder interactions. Conclusions If MI is adopted in outcome regression, we recommend conducting MI by exposure group and, when not feasible, including exposure-confounder interactions in the imputation model. Key messages Similar to RCTs, MI should be conducted by exposure group when estimating average causal effects using outcome regression in observational studies.


2010 ◽  
Vol 13 (7) ◽  
pp. A339
Author(s):  
EP Elkin ◽  
AK Exuzides ◽  
DJ Pasta ◽  
DP Miller

2021 ◽  
Vol 50 (Supplement_1) ◽  
Author(s):  
Rosie Cornish ◽  
Kate Tilling ◽  
Rosie Cornish ◽  
James Carpenter

Abstract Focus of presentation Missing data are ubiquitous in medical research. Although there is increasing guidance on how to handle missing data, practice is changing slowly and misapprehensions abound, particularly in observational research. We present a practical framework for handling and reporting the analysis of incomplete data in observational studies, which we illustrate using a case study from the Avon Longitudinal Study of Parents and Children. Findings The framework consists of three steps: 1) Develop an analysis plan specifying the analysis model and how missing data are going to be addressed. Important considerations are whether a complete records analysis is likely to be valid, whether multiple imputation or an alternative approach is likely to offer benefits, and whether a sensitivity analysis regarding the missingness mechanism is required. 2) Explore the data, checking the methods outlined in the analysis plan are appropriate, and conduct the pre-planned analysis. 3) Report the results, including a description of the missing data, details on how missing data were addressed, and the results from all analyses, interpreted in light of the missing data and clinical relevance. Conclusions/Implications This framework encourages researchers to think carefully about their missing data and be transparent about the potential effect on the study results. This will increase confidence in the reliability and reproducibility of results from published papers. Key messages Researchers need to develop a plan for missing data prior to conducting their analysis, and be transparent about how they handled the missing data and its potential effect when reporting their results.


2021 ◽  
Vol 50 (Supplement_1) ◽  
Author(s):  
Katherine Lee ◽  
Kate Tilling ◽  
Rosie Cornish ◽  
James Carpenter

Abstract Focus of presentation Missing data are ubiquitous in medical research. Although there is increasing guidance on how to handle missing data, practice is changing slowly and misapprehensions abound, particularly in observational research. We present a practical framework for handling and reporting the analysis of incomplete data in observational studies, which we illustrate using a case study from the Avon Longitudinal Study of Parents and Children. Findings The framework consists of three steps: 1) Develop an analysis plan specifying the analysis model and how missing data are going to be addressed. Important considerations are whether a complete records analysis is likely to be valid, whether multiple imputation or an alternative approach is likely to offer benefits, and whether a sensitivity analysis regarding the missingness mechanism is required. 2) Explore the data, checking the methods outlined in the analysis plan are appropriate, and conduct the pre-planned analysis. 3) Report the results, including a description of the missing data, details on how missing data were addressed, and the results from all analyses, interpreted in light of the missing data and clinical relevance. Conclusions/Implications This framework encourages researchers to think carefully about their missing data and be transparent about the potential effect on the study results. This will increase confidence in the reliability and reproducibility of results from published papers. Key messages Researchers need to develop a plan for missing data prior to conducting their analysis, and be transparent about how they handled the missing data and its potential effect when reporting their results.


2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Albee Ling ◽  
Maria Montez-Rath ◽  
Maya Mathur ◽  
Kris Kapphahn ◽  
Manisha Desai

Propensity score matching (PSM) has been widely used to mitigate confounding in observational studies, although complications arise when the covariates used to estimate the PS are only partially observed. Multiple imputation (MI) is a potential solution for handling missing covariates in the estimation of the PS. However, it is not clear how to best apply MI strategies in the context of PSM. We conducted a simulation study to compare the performances of popular non-MI missing data methods and various MI-based strategies under different missing data mechanisms. We found that commonly applied missing data methods resulted in biased and inefficient estimates, and we observed large variation in performance across MI-based strategies. Based on our findings, we recommend 1) estimating the PS after applying MI to impute missing confounders; 2) conducting PSM within each imputed dataset followed by averaging the treatment effects to arrive at one summarized finding; 3) a bootstrapped-based variance to account for uncertainty of PS estimation, matching, and imputation; and 4) inclusion of key auxiliary variables in the imputation model.


2021 ◽  
Vol 50 (Supplement_1) ◽  
Author(s):  
Katherine Lee ◽  
James Carpenter ◽  
Roderick Little ◽  
Cattram Nguyen ◽  
Rosie Cornish

Abstract Focus and outcomes for participants Missing data are ubiquitous in observational studies, and the simple solution of restricting the analyses to the subset with complete records will often result in bias and loss of power. The seriousness of these issues for resulting inferences depends on both the mechanism causing the missing data and the form of the substantive question and associated model. The methodological literature on methods for the analysis of partially observed data has grown substantially over the last twenty years, and although there is increasing guidance on how to handle missing data, practice is changing slowly and misapprehensions abound, particularly in observational research. Importantly, the lack of transparency around methodological decisions regarding the analysis is threatening the validity and reproducibility of modern research. In this symposium leading researchers in missing data methodology will present practical guidance on how to select an appropriate method to handle missing data, describe how to report the results from such an analysis and describe how to conduct sensitivity analyses in the multiple imputation framework. Rationale for the symposium, including for its inclusion in the Congress One of the sub-themes of WCE 2021 is “Translation from research to policy and practice”. Although there is a growing body of literature surrounding missing data methodology, evidence from systematic reviews suggests that missing data is still often not handled appropriately. If practice is to change, it is important to educate applied researchers regarding the available methodology and provide practical guidance on how to determine the best method for handling missing data. An important part of this is providing guidance on the reporting of results from analyses with missing data. This is particularly pertinent given the current emphasis on reproducibility of research findings. In this symposium we bring some of the latest research from the Missing Data Topic Group of the STRengthening Analytical Thinking for Observational Studies (STRATOS) initiative whose aim is to provide accessible and accurate guidance in the design and analysis of observational studies in order to increasie the reliability and validity of observational research. Presentation program Names of presenters


2019 ◽  
Author(s):  
Qi Long ◽  
Brent Johnson ◽  
Yize Zhao ◽  
Yi Deng ◽  
Changgee Chang

Sign in / Sign up

Export Citation Format

Share Document