Generalized empirical likelihood for nonsmooth estimating equations with missing data

Abstract:Missing covariate data occurs often in regression analysis, which frequently arises in the health and social sciences as well as in survey sampling. We study methods for the analysis of a nonignorable covariate-missing data problem in an assumed conditional mean function when some covariates are completely observed but other covariates are missing for some subjects. We adopt the semiparametric perspective of Bartlett et al. (Improving upon the efficiency of complete case analysis when covariates are MNAR. Biostatistics 2014;15:719–30) on regression analyses with nonignorable missing covariates, in which they have introduced the use of two working models, the working probability model of missingness and the working conditional score model. In this paper, we study an empirical likelihood approach to nonignorable covariate-missing data problems with the objective of effectively utilizing the two working models in the analysis of covariate-missing data. We propose a unified approach to constructing a system of unbiased estimating equations, where there are more equations than unknown parameters of interest. One useful feature of these unbiased estimating equations is that they naturally incorporate the incomplete data into the data analysis, making it possible to seek efficient estimation of the parameter of interest even when the working regression function is not specified to be the optimal regression function. We apply the general methodology of empirical likelihood to optimally combine these unbiased estimating equations. We propose three maximum empirical likelihood estimators of the underlying regression parameters and compare their efficiencies with other existing competitors. We present a simulation study to compare the finite-sample performance of various methods with respect to bias, efficiency, and robustness to model misspecification. The proposed empirical likelihood method is also illustrated by an analysis of a data set from the US National Health and Nutrition Examination Survey (NHANES).

Download Full-text

Empirical likelihood for estimating equations with nonignorably missing data

Statistica Sinica ◽

10.5705/ss.2012.254 ◽

2014 ◽

Cited By ~ 2

Author(s):

Niansheng Tang ◽

Puying Zhao ◽

Hongtu Zhu

Keyword(s):

Missing Data ◽

Empirical Likelihood ◽

Estimating Equations

Download Full-text

Generalized empirical likelihood inference in semiparametric regression model for longitudinal data

Acta Mathematica Sinica English Series ◽

10.1007/s10114-008-6434-7 ◽

2008 ◽

Vol 24 (12) ◽

pp. 2029-2040 ◽

Cited By ~ 20

Author(s):

Gao Rong Li ◽

Ping Tian ◽

Liu Gen Xue

Keyword(s):

Longitudinal Data ◽

Regression Model ◽

Empirical Likelihood ◽

Semiparametric Regression ◽

Likelihood Inference ◽

Semiparametric Regression Model ◽

Generalized Empirical Likelihood

Download Full-text

Estimating equations, empirical likelihood and constraints on parameters

Canadian Journal of Statistics ◽

10.2307/3315441 ◽

1995 ◽

Vol 23 (2) ◽

pp. 145-159 ◽

Cited By ~ 50

Author(s):

Jing Qin ◽

Jerry Lawless

Keyword(s):

Empirical Likelihood ◽

Estimating Equations

Download Full-text

Generalized empirical likelihood M testing for semiparametric models with time series data

Econometrics and Statistics ◽

10.1016/j.ecosta.2016.12.004 ◽

2017 ◽

Vol 4 ◽

pp. 18-30 ◽

Cited By ~ 1

Author(s):

Francesco Bravo ◽

Ba M. Chu ◽

David T. Jacho-Chávez

Keyword(s):

Time Series ◽

Empirical Likelihood ◽

Time Series Data ◽

Semiparametric Models ◽

Series Data ◽

Generalized Empirical Likelihood

Download Full-text

Semiparametric inverse propensity weighting for nonignorable missing data

Biometrika ◽

10.1093/biomet/asv071 ◽

2016 ◽

Vol 103 (1) ◽

pp. 175-187 ◽

Cited By ~ 31

Author(s):

Jun Shao ◽

Lei Wang

Keyword(s):

Missing Data ◽

Missing Values ◽

Generalized Method Of Moments ◽

Estimating Equations ◽

Real Data ◽

Population Parameters ◽

Finite Sample ◽

External Data ◽

Nonignorable Missing ◽

Inverse Propensity Weighting

Abstract To estimate unknown population parameters based on data having nonignorable missing values with a semiparametric exponential tilting propensity, Kim & Yu (2011) assumed that the tilting parameter is known or can be estimated from external data, in order to avoid the identifiability issue. To remove this serious limitation on the methodology, we use an instrument, i.e., a covariate related to the study variable but unrelated to the missing data propensity, to construct some estimating equations. Because these estimating equations are semiparametric, we profile the nonparametric component using a kernel-type estimator and then estimate the tilting parameter based on the profiled estimating equations and the generalized method of moments. Once the tilting parameter is estimated, so is the propensity, and then other population parameters can be estimated using the inverse propensity weighting approach. Consistency and asymptotic normality of the proposed estimators are established. The finite-sample performance of the estimators is studied through simulation, and a real-data example is also presented.

Download Full-text