Penalized likelihood estimation of the proportional hazards model for survival data with interval censoring

2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Jun Ma ◽  
Dominique-Laurent Couturier ◽  
Stephane Heritier ◽  
Ian C. Marschner

Abstract This paper considers the problem of semi-parametric proportional hazards model fitting where observed survival times contain event times and also interval, left and right censoring times. Although this is not a new topic, many existing methods suffer from poor computational performance. In this paper, we adopt a more versatile penalized likelihood method to estimate the baseline hazard and the regression coefficients simultaneously. The baseline hazard is approximated using basis functions such as M-splines. A penalty is introduced to regularize the baseline hazard estimate and also to ease dependence of the estimates on the knots of the basis functions. We propose a Newton–MI (multiplicative iterative) algorithm to fit this model. We also present novel asymptotic properties of our estimates, allowing for the possibility that some parameters of the approximate baseline hazard may lie on the parameter space boundary. Comparisons of our method against other similar approaches are made through an intensive simulation study. Results demonstrate that our method is very stable and encounters virtually no numerical issues. A real data application involving melanoma recurrence is presented and an R package ‘survivalMPL’ implementing the method is available on R CRAN.

2019 ◽  
Vol 58 (1) ◽  
pp. 39-47
Author(s):  
Oksana Chernova ◽  
Alexander Kukush

We investigate linear and nonlinear hypotheses testing in a Cox proportional hazards model for right-censored survival data when the covariates are subject to measurement errors. In Kukush and Chernova (2018) [Theor. Probability and Math. Statist. 96, 101–110], a consistent simultaneous estimator is introduced for the baseline hazard rate and the vector of regression parameters. Therein the baseline hazard rate belongs to an unbounded set of nonnegative Lipschitz functions, with fixed constant, and the vector of regression parameters belongs to a compact parameter set. Based on the estimator, we develop two procedures to test nonlinear and linear hypotheses about the vector of regression parameters: Wald-type and score-type tests. The latter is based on an unbiased estimating equation. The consistency of the tests is shown.


2020 ◽  
Author(s):  
Daniel Shriner ◽  
Amy R. Bentley ◽  
Jie Zhou ◽  
Kenneth Ekoru ◽  
Ayo P. Doumatey ◽  
...  

Given a lifetime risk of ~90% by the ninth decade of life, it is unknown if there are true controls for hypertension in epidemiological and genetic studies. Here, we use the Bayesian framework to compare logistic and time-to-event approaches to modeling hypertension. Using a proportional hazards model, we explored nonparametric and parametric models of the baseline hazard function, accounting for interval censoring. In the Howard University Family Study (HUFS), a population-based study of African Americans from Washington, D.C., the median age at hypertension was 48 years, baseline hazard rates increased with age until 55 years, and the probability of being free of hypertension at 85 years of age was 12.2%. In the nationwide NHANES study, the median age at hypertension was 42 years in African Americans, in contrast to 57 years in European Americans and 56 years in Mexican Americans. Baseline hazard rates increased with age until 58 years in African Americans, comparable to 60 years in European Americans and 58 years in Mexican Americans. The probability of being free of hypertension at 85 years of age was 8.4% in African Americans, in contrast to 21.4% in European Americans and 20.6% in Mexican Americans. In all four groups, baseline hazard rates decreased but did not reach zero, consistent with the nonexistence of controls. Model fits were comparable for a proportional hazards model based on gamma-distributed hazard rates under a correlated prior process and a logistic model adjusted for age and age2. Last, using an agnostic model screening approach of 38 potential covariates, we identified and replicated in all groups a model that included chloride, low-density lipoprotein cholesterol, uric acid, and weight. With age and age2, the variance explained by these covariates was 40.9% in African Americans, 34.8% in European Americans, and 28.3% in Mexican Americans. Taken together, modeling of the baseline hazard function of hypertension suggests that there are no true controls and that controls in logistic regression are cases with a late age of onset. These findings shed considerable insights into the design of genetic and epidemiological studies of hypertension with implications for ethnic health disparities.


Sign in / Sign up

Export Citation Format

Share Document