Propensity score estimation with missing values using a multiple imputation missingness pattern (MIMP) approach

2009 ◽  
Vol 28 (9) ◽  
pp. 1402-1414 ◽  
Author(s):  
Yongming Qu ◽  
Ilya Lipkovich
2019 ◽  
Author(s):  
Donna Coffman ◽  
Jiangxiu Zhou ◽  
Xizhen Cai

Abstract Background Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates.Method Several approaches have been proposed for handling covariate missingness, including multiple imputation (MI), multiple imputation with missingness pattern (MIMP), and treatment mean imputation. However, there are other potentially useful approaches that have not been evaluated, including single imputation (SI) + prediction error (PE), SI+PE + parameter uncertainty (PU), and Generalized Boosted Modeling (GBM), which is a nonparametric approach for estimating propensity scores in which missing values are automatically handled in the estimation using a surrogate split method. To evaluate the performance of these approaches, a simulation study was conducted.Results Results suggested that SI+PE, SI+PE+PU, MI, and MIMP perform almost equally well and better than treatment mean imputation and GBM in terms of bias; however, MI and MIMP account for the additional uncertainty of imputing the missingness.Conclusions Applying GBM to the incomplete data and relying on the surrogate split approach resulted in substantial bias. Imputation prior to implementing GBM is recommended.


2020 ◽  
Author(s):  
Donna Coffman ◽  
Jiangxiu Zhou ◽  
Xizhen Cai

Abstract Background: Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates.Method: Several approaches have been proposed for handling covariate missingness, including multiple imputation (MI), multiple imputation with missingness pattern (MIMP), and treatment mean imputation. However, there are other potentially useful approaches that have not been evaluated, including single imputation (SI) + prediction error (PE), SI+PE + parameter uncertainty (PU), and Generalized Boosted Modeling (GBM), which is a nonparametric approach for estimating propensity scores in which missing values are automatically handled in the estimation using a surrogate split method. To evaluate the performance of these approaches, a simulation study was conducted. Results: Results suggested that SI+PE, SI+PE+PU, MI, and MIMP perform almost equally well and better than treatment mean imputation and GBM in terms of bias; however, MI and MIMP account for the additional uncertainty of imputing the missingness. Conclusions: Applying GBM to the incomplete data and relying on the surrogate split approach resulted in substantial bias. Imputation prior to implementing GBM is recommended.


Author(s):  
Byron C. Jaeger ◽  
Ryan Cantor ◽  
Venkata Sthanam ◽  
Rongbing Xie ◽  
James K. Kirklin ◽  
...  

Background: Risk prediction models play an important role in clinical decision making. When developing risk prediction models, practitioners often impute missing values to the mean. We evaluated the impact of applying other strategies to impute missing values on the prognostic accuracy of downstream risk prediction models, that is, models fitted to the imputed data. A secondary objective was to compare the accuracy of imputation methods based on artificially induced missing values. To complete these objectives, we used data from the Interagency Registry for Mechanically Assisted Circulatory Support. Methods: We applied 12 imputation strategies in combination with 2 different modeling strategies for mortality and transplant risk prediction following surgery to receive mechanical circulatory support. Model performance was evaluated using Monte-Carlo cross-validation and measured based on outcomes 6 months following surgery using the scaled Brier score, concordance index, and calibration error. We used Bayesian hierarchical models to compare model performance. Results: Multiple imputation with random forests emerged as a robust strategy to impute missing values, increasing model concordance by 0.0030 (25th–75th percentile: 0.0008–0.0052) compared with imputation to the mean for mortality risk prediction using a downstream proportional hazards model. The posterior probability that single and multiple imputation using random forests would improve concordance versus mean imputation was 0.464 and >0.999, respectively. Conclusions: Selecting an optimal strategy to impute missing values such as random forests and applying multiple imputation can improve the prognostic accuracy of downstream risk prediction models.


Author(s):  
Thelma Dede Baddoo ◽  
Zhijia Li ◽  
Samuel Nii Odai ◽  
Kenneth Rodolphe Chabi Boni ◽  
Isaac Kwesi Nooni ◽  
...  

Reconstructing missing streamflow data can be challenging when additional data are not available, and missing data imputation of real-world datasets to investigate how to ascertain the accuracy of imputation algorithms for these datasets are lacking. This study investigated the necessary complexity of missing data reconstruction schemes to obtain the relevant results for a real-world single station streamflow observation to facilitate its further use. This investigation was implemented by applying different missing data mechanisms spanning from univariate algorithms to multiple imputation methods accustomed to multivariate data taking time as an explicit variable. The performance accuracy of these schemes was assessed using the total error measurement (TEM) and a recommended localized error measurement (LEM) in this study. The results show that univariate missing value algorithms, which are specially developed to handle univariate time series, provide satisfactory results, but the ones which provide the best results are usually time and computationally intensive. Also, multiple imputation algorithms which consider the surrounding observed values and/or which can understand the characteristics of the data provide similar results to the univariate missing data algorithms and, in some cases, perform better without the added time and computational downsides when time is taken as an explicit variable. Furthermore, the LEM would be especially useful when the missing data are in specific portions of the dataset or where very large gaps of ‘missingness’ occur. Finally, proper handling of missing values of real-world hydroclimatic datasets depends on imputing and extensive study of the particular dataset to be imputed.


2015 ◽  
Vol 79 (3) ◽  
pp. 635-661 ◽  
Author(s):  
Ahu Alanya ◽  
Christof Wolf ◽  
Cristina Sotto

2018 ◽  
Vol 11 (16) ◽  
pp. 1-11
Author(s):  
Tlhalitshi Volition Montshiwa ◽  
Ntebo Moroke ◽  
Elias Munapo ◽  
◽  
◽  
...  

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Yusuke Sakimura ◽  
Noriyuki Inaki ◽  
Toshikatsu Tsuji ◽  
Shinichi Kadoya ◽  
Hiroyuki Bando

Abstract Omentectomy is conducted for advanced gastric cancer (AGC) patients as radical surgery without an adequate discussion of the effect. This study was conducted to reveal the impact of omentum-preserving gastrectomy on postoperative outcomes. AGC patients with cT3 and 4 disease who underwent total or distal gastrectomy with R0 resection were identified retrospectively. They were divided into the omentum-preserved group (OPG) and the omentum-resected group (ORG) and matched with propensity score matching with multiple imputation for missing values. Three-year overall survival (OS) and 3-year relapse-free survival (RFS) were compared, and the first recurrence site and complications were analysed. The numbers of eligible patients were 94 in the OPG and 144 in the ORG, and after matching, the number was 73 in each group. No significant difference was found in the 3-year OS rate (OPG: 78.9 vs. ORG: 78.9, P = 0.54) or the 3-year RFS rate (OPG: 77.8 vs. ORG: 68.2, P = 0.24). The proportions of peritoneal carcinomatosis and peritoneal dissemination as the first recurrence site and the rate and severity of complications were similar in the two groups. Omentectomy is not required for radical gastrectomy for AGC.


PLoS ONE ◽  
2015 ◽  
Vol 10 (12) ◽  
pp. e0144370 ◽  
Author(s):  
Ting Tian ◽  
Geoffrey J. McLachlan ◽  
Mark J. Dieters ◽  
Kaye E. Basford

Sign in / Sign up

Export Citation Format

Share Document