Conditional distance correlation sure independence screening for ultra-high dimensional survival data

Author(s):  
Shuiyun Lu ◽  
Xiaolin Chen ◽  
Hong Wang
Biometrics ◽  
2005 ◽  
Vol 62 (1) ◽  
pp. 202-210 ◽  
Author(s):  
Shuangge Ma ◽  
Michael R. Kosorok ◽  
Jason P. Fine

2021 ◽  
Vol 12 ◽  
Author(s):  
Yidan Cui ◽  
Chengwen Luo ◽  
Linghao Luo ◽  
Zhangsheng Yu

Mediation analysis has been extensively used to identify potential pathways between exposure and outcome. However, the analytical methods of high-dimensional mediation analysis for survival data are still yet to be promoted, especially for non-Cox model approaches. We propose a procedure including “two-step” variable selection and indirect effect estimation for the additive hazards model with high-dimensional mediators. We first apply sure independence screening and smoothly clipped absolute deviation regularization to select mediators. Then we use the Sobel test and the BH method for indirect effect hypothesis testing. Simulation results demonstrate its good performance with a higher true-positive rate and accuracy, as well as a lower false-positive rate. We apply the proposed procedure to analyze DNA methylation markers mediating smoking and survival time of lung cancer patients in a TCGA (The Cancer Genome Atlas) cohort study. The real data application identifies four mediate CpGs, three of which are newly found.


2021 ◽  
Author(s):  
Mu Yue

In high-dimensional data, penalized regression is often used for variable selection and parameter estimation. However, these methods typically require time-consuming cross-validation methods to select tuning parameters and retain more false positives under high dimensionality. This chapter discusses sparse boosting based machine learning methods in the following high-dimensional problems. First, a sparse boosting method to select important biomarkers is studied for the right censored survival data with high-dimensional biomarkers. Then, a two-step sparse boosting method to carry out the variable selection and the model-based prediction is studied for the high-dimensional longitudinal observations measured repeatedly over time. Finally, a multi-step sparse boosting method to identify patient subgroups that exhibit different treatment effects is studied for the high-dimensional dense longitudinal observations. This chapter intends to solve the problem of how to improve the accuracy and calculation speed of variable selection and parameter estimation in high-dimensional data. It aims to expand the application scope of sparse boosting and develop new methods of high-dimensional survival analysis, longitudinal data analysis, and subgroup analysis, which has great application prospects.


2020 ◽  
Vol 206 ◽  
pp. 106356 ◽  
Author(s):  
Chongbo Fu ◽  
Peng Wang ◽  
Liang Zhao ◽  
Xinjing Wang

Sign in / Sign up

Export Citation Format

Share Document