A comparison of doubly robust estimators of the mean with missing data

Missing data is almost inevitable in correlated-data studies. For non-Gaussian outcomes with moderate to large sequences, direct-likelihood methods can involve complex, hard-to-manipulate likelihoods. Popular alternative approaches, like generalized estimating equations, that are frequently used to circumvent the computational complexity of full likelihood, are less suitable when scientific interest, at least in part, is placed on the association structure; pseudo-likelihood (PL) methods are then a viable alternative. When the missing data are missing at random, Molenberghs et al. (2011, Statistica Sinica, 21,187–206) proposed a suite of corrections to the standard form of PL, taking the form of singly and doubly robust estimators. They provided the basis and exemplified it in insightful yet primarily illustrative examples. We here consider the important case of marginal models for hierarchical binary data, provide an effective implementation and illustrate it using data from an analgesic trial. Our doubly robust estimator is more convenient than the classical doubly robust estimators. The ideas are illustrated using a marginal model for a binary response, more specifically a Bahadur model.

Download Full-text

Doubly robust estimators of causal exposure effects with missing data in the outcome, exposure or a confounder

Statistics in Medicine ◽

10.1002/sim.5643 ◽

2012 ◽

Vol 31 (30) ◽

pp. 4382-4400 ◽

Cited By ~ 9

Author(s):

E.J. Williamson ◽

A. Forbes ◽

R. Wolfe

Keyword(s):

Missing Data ◽

Robust Estimators ◽

Doubly Robust

Download Full-text

Adjustment for Missing Data in Complex Surveys Using Doubly Robust Estimation

Epidemiology ◽

10.1097/ede.0b013e3181f57571 ◽

2010 ◽

Vol 21 (6) ◽

pp. 863-871 ◽

Cited By ~ 7

Author(s):

Kathleen E. Wirth ◽

Eric J. Tchetgen Tchetgen ◽

Megan Murray

Keyword(s):

Missing Data ◽

Robust Estimation ◽

Complex Surveys ◽

Doubly Robust Estimation ◽

Doubly Robust

Download Full-text

Causal inference for recurrent event data using pseudo-observations

Biostatistics ◽

10.1093/biostatistics/kxaa020 ◽

2020 ◽

Author(s):

Chien-Lin Su ◽

Robert W Platt ◽

Jean-François Plante

Keyword(s):

Goodness Of Fit ◽

Variance Estimation ◽

Recurrent Event ◽

Event Data ◽

Robust Estimators ◽

Recurrent Event Data ◽

Finite Sample ◽

Data Set ◽

Asymptotically Normal ◽

Doubly Robust

Summary Recurrent event data are commonly encountered in observational studies where each subject may experience a particular event repeatedly over time. In this article, we aim to compare cumulative rate functions (CRFs) of two groups when treatment assignment may depend on the unbalanced distribution of confounders. Several estimators based on pseudo-observations are proposed to adjust for the confounding effects, namely inverse probability of treatment weighting estimator, regression model-based estimators, and doubly robust estimators. The proposed marginal regression estimator and doubly robust estimators based on pseudo-observations are shown to be consistent and asymptotically normal. A bootstrap approach is proposed for the variance estimation of the proposed estimators. Model diagnostic plots of residuals are presented to assess the goodness-of-fit for the proposed regression models. A family of adjusted two-sample pseudo-score tests is proposed to compare two CRFs. Simulation studies are conducted to assess finite sample performance of the proposed method. The proposed technique is demonstrated through an application to a hospital readmission data set.

Download Full-text

“Robust-Squared” Imputation Models Using Bart

Journal of Survey Statistics and Methodology ◽

10.1093/jssam/smz002 ◽

2019 ◽

Vol 7 (4) ◽

pp. 465-497

Author(s):

Yaoyuan V Tan ◽

Carol A C Flannagan ◽

Michael R Elliott

Keyword(s):

Model Misspecification ◽

Blood Alcohol Concentration ◽

Alcohol Concentration ◽

Penalized Splines ◽

Robust Estimator ◽

Robust Estimators ◽

Sampling System ◽

Additive Regression ◽

Doubly Robust ◽

Augmented Inverse Probability Weighting

Abstract Examples of “doubly robust” estimators for missing data include augmented inverse probability weighting (AIPWT) and penalized splines of propensity prediction (PSPP). Doubly robust estimators have the property that, if either the response propensity or the mean is modeled correctly, a consistent estimator of the population mean is obtained. However, doubly robust estimators can perform poorly when modest misspecification is present in both models. Here we consider extensions of the AIPWT and PSPP that use Bayesian additive regression trees (BART) to provide highly robust propensity and mean model estimation. We term these “robust-squared” in the sense that the propensity score, the means, or both can be estimated with minimal model misspecification, and applied to the doubly robust estimator. We consider their behavior via simulations where propensities and/or mean models are misspecified. We apply our proposed method to impute missing instantaneous velocity (delta-v) values from the 2014 National Automotive Sampling System Crashworthiness Data System dataset and missing Blood Alcohol Concentration values from the 2015 Fatality Analysis Reporting System dataset. We found that BART, applied to PSPP and AIPWT, provides a more robust estimate compared with PSPP and AIPWT.

Download Full-text

A comparison of multiple imputation and doubly robust estimation for analyses with missing data

Journal of the Royal Statistical Society Series A (Statistics in Society) ◽

10.1111/j.1467-985x.2006.00407.x ◽

2006 ◽

Vol 169 (3) ◽

pp. 571-584 ◽

Cited By ~ 116

Author(s):

James R. Carpenter ◽

Michael G. Kenward ◽

Stijn Vansteelandt

Keyword(s):

Missing Data ◽

Multiple Imputation ◽

Robust Estimation ◽

Doubly Robust Estimation ◽

Doubly Robust

Download Full-text

Comment: Stabilizing the Doubly-Robust Estimators of the Average Treatment Effect under Positivity Violations

Statistical Science ◽

10.1214/20-sts774 ◽

2020 ◽

Vol 35 (3) ◽

pp. 503-510

Author(s):

Fan Li

Keyword(s):

Treatment Effect ◽

Average Treatment Effect ◽

Robust Estimators ◽

Average Treatment ◽

Doubly Robust

Download Full-text

A Predictive Estimator of the Mean with Missing Data

Quality & Quantity ◽

10.1007/s11135-007-9091-5 ◽

2007 ◽

Vol 41 (2) ◽

pp. 201-217 ◽

Cited By ~ 1

Author(s):

M. Rueda ◽

S. González ◽

A. Arcos

Keyword(s):

Missing Data ◽

The Mean

Download Full-text

Proximity Matrix Completion and Ranking Ant Colony Optimization technique in Semantic web

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c4021.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 797-802

Keyword(s):

Missing Data ◽

Semantic Web ◽

Query Processing ◽

Ant Colony Optimization ◽

Matrix Completion ◽

Optimization Technique ◽

Ant Colony ◽

Proximity Matrix ◽

The Mean ◽

Sports Data

The semantic web consists of a large number of data that is difficult to retrieve the answer for the user queries. An existing method in the query processing in the semantic web has three main limitations namely, query flexibility, query relevancy or lack of ranking method and high query cost. In this study, Proximity Matrix Completion technique (PMC) is applied to impute the missing data in the dataset that helps to increase the query flexibility and Ranking Ant Colony Optimization (RACO) technique is used to select the relevant features from the dataset and arrange them to increase relevancy. The result shows that the PMC-RACO method has a higher performance compared to the exiting method in semantic web. The mean precision value of the PMC-RACO method in sports data is 87%, while the existing method has the precision value of 83%

Download Full-text