scholarly journals Bayesian Variable Selection in Semiparametric Proportional Hazards Model for High Dimensional Survival Data

2011 ◽  
Vol 7 (1) ◽  
pp. 1-32 ◽  
Author(s):  
Kyu Ha Lee ◽  
Sounak Chakraborty ◽  
Jianguo Sun
2021 ◽  
pp. 096228022110092
Author(s):  
Mingyue Du ◽  
Hui Zhao ◽  
Jianguo Sun

Cox’s proportional hazards model is the most commonly used model for regression analysis of failure time data and some methods have been developed for its variable selection under different situations. In this paper, we consider a general type of failure time data, case K interval-censored data, that include all of other types discussed as special cases, and propose a unified penalized variable selection procedure. In addition to its generality, another significant feature of the proposed approach is that unlike all of the existing variable selection methods for failure time data, the proposed approach allows dependent censoring, which can occur quite often and could lead to biased or misleading conclusions if not taken into account. For the implementation, a coordinate descent algorithm is developed and the oracle property of the proposed method is established. The numerical studies indicate that the proposed approach works well for practical situations and it is applied to a set of real data arising from Alzheimer’s Disease Neuroimaging Initiative study that motivated this study.


1991 ◽  
Vol 28 (03) ◽  
pp. 695-701 ◽  
Author(s):  
Philip Hougaard

Ordinary survival models implicitly assume that all individuals in a group have the same risk of death. It may, however, be relevant to consider the group as heterogeneous, i.e. a mixture of individuals with different risks. For example, after an operation each individual may have constant hazard of death. If risk factors are not included, the group shows decreasing hazard. This offers two fundamentally different interpretations of the same data. For instance, Weibull distributions with shape parameter less than 1 can be generated as mixtures of constant individual hazards. In a proportional hazards model, neglect of a subset of the important covariates leads to biased estimates of the other regression coefficients. Different choices of distributions for the unobserved covariates are discussed, including binary, gamma, inverse Gaussian and positive stable distributions, which show both qualitative and quantitative differences. For instance, the heterogeneity distribution can be either identifiable or unidentifiable. Both mathematical and interpretational consequences of the choice of distribution are considered. Heterogeneity can be evaluated by the variance of the logarithm of the mixture distribution. Examples include occupational mortality, myocardial infarction and diabetes.


Author(s):  
George M. Lloyd ◽  
Timothy Hasselman ◽  
Thomas Paez

We present a proportional hazards model (PHM) that establishes a framework suitable for performing reliability estimates and risk prognostics on complex multi-component systems which are transferred at arbitrary times among a discrete set of non-stationary stochastic environments. Such a scenario is not at all uncommon for portable and mobile systems. It is assumed that survival data, possibly interval censored, is available at several “typical” environments. This collection of empirical survival data forms the foundation upon which the basic effects of selected covariates are incorporated via the proportional hazards model. Proportional hazards models are well known in medical statistics, and can provide a variety of data-driven risk models which effectively capture the effects of the covariates. The paper describes three modifications we have found most suitable for this class of systems: development of suitable survival estimators that function well under realistic censoring scenarios, our modifications to the PHM which accommodate time-varying stochastic covariates, and implementation of said model in a non-linear network context which is itself model-free. Our baseline hazard is a parameterized reliability model developed from the empirical reliability estimates. Development of the risk score for arbitrary covariates arising from movement among different random environments is through interaction of the non-linear network and training data obtained from a Markov chain simulation based on stochastic environmental responses generated from Karhunen-Loe`ve models.


Sign in / Sign up

Export Citation Format

Share Document