Variable selection for high-dimensional regression models with time series and heteroscedastic errors

2020 ◽  
Vol 216 (1) ◽  
pp. 118-136
Author(s):  
Hai-Tang Chiou ◽  
Meihui Guo ◽  
Ching-Kang Ing
2017 ◽  
Vol 2017 ◽  
pp. 1-10 ◽  
Author(s):  
Qing-Yan Yin ◽  
Jun-Li Li ◽  
Chun-Xia Zhang

As a pivotal tool to build interpretive models, variable selection plays an increasingly important role in high-dimensional data analysis. In recent years, variable selection ensembles (VSEs) have gained much interest due to their many advantages. Stability selection (Meinshausen and Bühlmann, 2010), a VSE technique based on subsampling in combination with a base algorithm like lasso, is an effective method to control false discovery rate (FDR) and to improve selection accuracy in linear regression models. By adopting lasso as a base learner, we attempt to extend stability selection to handle variable selection problems in a Cox model. According to our experience, it is crucial to set the regularization region Λ in lasso and the parameter λmin properly so that stability selection can work well. To the best of our knowledge, however, there is no literature addressing this problem in an explicit way. Therefore, we first provide a detailed procedure to specify Λ and λmin. Then, some simulated and real-world data with various censoring rates are used to examine how well stability selection performs. It is also compared with several other variable selection approaches. Experimental results demonstrate that it achieves better or competitive performance in comparison with several other popular techniques.


Sign in / Sign up

Export Citation Format

Share Document