scholarly journals Variable Selection via SCAD-Penalized Quantile Regression for High-Dimensional Count Data

IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 153205-153216
Author(s):  
Dost Muhammad Khan ◽  
Anum Yaqoob ◽  
Nadeem Iqbal ◽  
Abdul Wahid ◽  
Umair Khalil ◽  
...  
2020 ◽  
pp. 096228022094153
Author(s):  
Yongxin Bai ◽  
Maozai Tian ◽  
Man-Lai Tang ◽  
Wing-Yan Lee

In this paper, we consider variable selection for ultra-high dimensional quantile regression model with missing data and measurement errors in covariates. Specifically, we correct the bias in the loss function caused by measurement error by applying the orthogonal quantile regression approach and remove the bias caused by missing data using the inverse probability weighting. A nonconvex Atan penalized estimation method is proposed for simultaneous variable selection and estimation. With the proper choice of the regularization parameter and under some relaxed conditions, we show that the proposed estimate enjoys the oracle properties. The choice of smoothing parameters is also discussed. The performance of the proposed variable selection procedure is assessed by Monte Carlo simulation studies. We further demonstrate the proposed procedure with a breast cancer data set.


2015 ◽  
Vol 69 (3) ◽  
pp. 212-235 ◽  
Author(s):  
Muhammad Amin ◽  
Lixin Song ◽  
Milton Abdul Thorlie ◽  
Xiaoguang Wang

Mathematics ◽  
2021 ◽  
Vol 9 (3) ◽  
pp. 222
Author(s):  
Juan C. Laria ◽  
M. Carmen Aguilera-Morillo ◽  
Enrique Álvarez ◽  
Rosa E. Lillo ◽  
Sara López-Taruella ◽  
...  

Over the last decade, regularized regression methods have offered alternatives for performing multi-marker analysis and feature selection in a whole genome context. The process of defining a list of genes that will characterize an expression profile remains unclear. It currently relies upon advanced statistics and can use an agnostic point of view or include some a priori knowledge, but overfitting remains a problem. This paper introduces a methodology to deal with the variable selection and model estimation problems in the high-dimensional set-up, which can be particularly useful in the whole genome context. Results are validated using simulated data and a real dataset from a triple-negative breast cancer study.


Sign in / Sign up

Export Citation Format

Share Document