Variable selection via penalized generalized estimating equations for a marginal survival model

2020 ◽  
Vol 29 (9) ◽  
pp. 2493-2506
Author(s):  
Yi Niu ◽  
Xiaoguang Wang ◽  
Hui Cao ◽  
Yingwei Peng

Clustered and multivariate survival times, such as times to recurrent events, commonly arise in biomedical and health research, and marginal survival models are often used to model such data. When a large number of predictors are available, variable selection is always an important issue when modeling such data with a survival model. We consider a Cox’s proportional hazards model for a marginal survival model. Under the sparsity assumption, we propose a penalized generalized estimating equation approach to select important variables and to estimate regression coefficients simultaneously in the marginal model. The proposed method explicitly models the correlation structure within clusters or correlated variables by using a prespecified working correlation matrix. The asymptotic properties of the estimators from the penalized generalized estimating equations are established and the number of candidate covariates is allowed to increase in the same order as the number of clusters does. We evaluate the performance of the proposed method through a simulation study and analyze two real datasets for the application.

2014 ◽  
Vol 15 (1) ◽  
pp. 29
Author(s):  
Rizka Dwi Hidayati ◽  
I Made Tirta ◽  
Yuliani Setia Dewi

The approach of GEE focuses on a linear model for the mean of the observations in the cluster without full specification  the distribution of full-on observation. GEE is a marginal model where is not based on the full likelihood of the response, but only based on the relationship between the mean (first moment) and variance (second moment) as well as the correlation matrix. The advantage of  GEE is that the mean of  parameter are estimated consistently regardless whether  the correlation structure is specified correctly or not, as long as the mean has the correct specifications. However, the efficiency may be reduced when the working correlation structure is wrong. GEE was designed to focus on the marginal mean and correlation structure as nuisiance treat. Implementation of GEE is usually limited to the number of working correlation structure (eg AR-1, exchangeable, independent, m-dependent and unstructured). To increase the efficiency of the GEE, has introduced a variation called the Generalized Estimating Equations order 2 (GEE2). GEE2 has been introduced to overcome the problem that considers correlation GEE as nuisiance, by applying the second equation to estimate covariance parameters and  solved simultaneously with the first equation. This study used simulation data which are designed based on the the AR-1 and Exchangeable correlation structure, then estimation are done  using theAR1 and exchangeable. For GEE2,  estimation done by adding model for correlation link. The result is a link affects the efficiency of the model correlation is shown with standard error values ​​generated by GEE2 method is smaller than the GEE method.


Sign in / Sign up

Export Citation Format

Share Document