Bayesian Variable Selection in Semiparametric Proportional Hazards Model for High Dimensional Survival Data

Cox’s proportional hazards model is the most commonly used model for regression analysis of failure time data and some methods have been developed for its variable selection under different situations. In this paper, we consider a general type of failure time data, case K interval-censored data, that include all of other types discussed as special cases, and propose a unified penalized variable selection procedure. In addition to its generality, another significant feature of the proposed approach is that unlike all of the existing variable selection methods for failure time data, the proposed approach allows dependent censoring, which can occur quite often and could lead to biased or misleading conclusions if not taken into account. For the implementation, a coordinate descent algorithm is developed and the oracle property of the proposed method is established. The numerical studies indicate that the proposed approach works well for practical situations and it is applied to a set of real data arising from Alzheimer’s Disease Neuroimaging Initiative study that motivated this study.

Download Full-text

Variable Selection in Partially Linear Proportional Hazards Model with Grouped Covariates and a Diverging Number of Parameters

Statistical Modeling in Biomedical Research - Emerging Topics in Statistics and Biostatistics ◽

10.1007/978-3-030-33416-1_18 ◽

2020 ◽

pp. 411-448

Author(s):

Arfan Raheen Afzal ◽

Xuewen Lu

Keyword(s):

Variable Selection ◽

Proportional Hazards ◽

Proportional Hazards Model ◽

Partially Linear ◽

Hazards Model ◽

Diverging Number Of Parameters

Download Full-text

The variable selection by the Dantzig selector for Cox’s proportional hazards model

Annals of the Institute of Statistical Mathematics ◽

10.1007/s10463-021-00807-1 ◽

2021 ◽

Author(s):

Kou Fujimori

Keyword(s):

Variable Selection ◽

Proportional Hazards ◽

Proportional Hazards Model ◽

Dantzig Selector ◽

Hazards Model ◽

Cox's Proportional Hazards Model

Download Full-text

A novel Cox proportional hazards model for high - dimensional genomic data in cancer prognosis

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2019.2961667 ◽

2019 ◽

pp. 1-1

Author(s):

HaiHui Huang ◽

Yong Liang

Keyword(s):

Proportional Hazards ◽

Proportional Hazards Model ◽

Genomic Data ◽

Cox Proportional Hazards ◽

Cox Proportional Hazards Model ◽

Cancer Prognosis ◽

High Dimensional ◽

Hazards Model

Download Full-text

Modelling heterogeneity in survival data

Journal of Applied Probability ◽

10.1017/s0021900200042534 ◽

1991 ◽

Vol 28 (03) ◽

pp. 695-701 ◽

Cited By ~ 11

Author(s):

Philip Hougaard

Keyword(s):

Survival Data ◽

Proportional Hazards ◽

Proportional Hazards Model ◽

Mixture Distribution ◽

Survival Models ◽

Risk Of Death ◽

Hazards Model ◽

Occupational Mortality ◽

Heterogeneity Distribution ◽

Biased Estimates

Ordinary survival models implicitly assume that all individuals in a group have the same risk of death. It may, however, be relevant to consider the group as heterogeneous, i.e. a mixture of individuals with different risks. For example, after an operation each individual may have constant hazard of death. If risk factors are not included, the group shows decreasing hazard. This offers two fundamentally different interpretations of the same data. For instance, Weibull distributions with shape parameter less than 1 can be generated as mixtures of constant individual hazards. In a proportional hazards model, neglect of a subset of the important covariates leads to biased estimates of the other regression coefficients. Different choices of distributions for the unobserved covariates are discussed, including binary, gamma, inverse Gaussian and positive stable distributions, which show both qualitative and quantitative differences. For instance, the heterogeneity distribution can be either identifiable or unidentifiable. Both mathematical and interpretational consequences of the choice of distribution are considered. Heterogeneity can be evaluated by the variance of the logarithm of the mixture distribution. Examples include occupational mortality, myocardial infarction and diabetes.

Download Full-text

A Proportional Hazards Neural Network for Performing Reliability Estimates and Risk Prognostics for Mobile Systems Subject to Stochastic Covariates

Engineering/Technology Management ◽

10.1115/imece2005-82657 ◽

2005 ◽

Cited By ~ 3

Author(s):

George M. Lloyd ◽

Timothy Hasselman ◽

Thomas Paez

Keyword(s):

Survival Data ◽

Proportional Hazards ◽

Proportional Hazards Model ◽

Mobile Systems ◽

Linear Network ◽

Model Free ◽

Hazards Model ◽

Reliability Estimates ◽

Non Linear ◽

Stochastic Covariates

We present a proportional hazards model (PHM) that establishes a framework suitable for performing reliability estimates and risk prognostics on complex multi-component systems which are transferred at arbitrary times among a discrete set of non-stationary stochastic environments. Such a scenario is not at all uncommon for portable and mobile systems. It is assumed that survival data, possibly interval censored, is available at several “typical” environments. This collection of empirical survival data forms the foundation upon which the basic effects of selected covariates are incorporated via the proportional hazards model. Proportional hazards models are well known in medical statistics, and can provide a variety of data-driven risk models which effectively capture the effects of the covariates. The paper describes three modifications we have found most suitable for this class of systems: development of suitable survival estimators that function well under realistic censoring scenarios, our modifications to the PHM which accommodate time-varying stochastic covariates, and implementation of said model in a non-linear network context which is itself model-free. Our baseline hazard is a parameterized reliability model developed from the empirical reliability estimates. Development of the risk score for arbitrary covariates arising from movement among different random environments is through interaction of the non-linear network and training data obtained from a Markov chain simulation based on stochastic environmental responses generated from Karhunen-Loe`ve models.

Download Full-text