A pseudo knockoff filter for correlated features

Abstract In Barber & Candès (2015, Ann. Statist., 43, 2055–2085), the authors introduced a new variable selection procedure called the knockoff filter to control the false discovery rate (FDR) and proved that this method achieves exact FDR control. Inspired by the work by Barber & Candès (2015, Ann. Statist., 43, 2055–2085), we propose a pseudo knockoff filter that inherits some advantages of the original knockoff filter and has more flexibility in constructing its knockoff matrix. Moreover, we perform a number of numerical experiments that seem to suggest that the pseudo knockoff filter with the half Lasso statistic has FDR control and offers more power than the original knockoff filter with the Lasso Path or the half Lasso statistic for the numerical examples that we consider in this paper. Although we cannot establish rigourous FDR control for the pseudo knockoff filter, we provide some partial analysis of the pseudo knockoff filter with the half Lasso statistic and establish a uniform false discovery proportion bound and an expectation inequality.

Download Full-text

A prototype knockoff filter for group selection with FDR control

Information and Inference A Journal of the IMA ◽

10.1093/imaiai/iaz012 ◽

2019 ◽

Vol 9 (2) ◽

pp. 271-288 ◽

Cited By ~ 1

Author(s):

Jiajie Chen ◽

Anthony Hou ◽

Thomas Y Hou

Keyword(s):

Machine Learning ◽

Group Selection ◽

Numerical Experiments ◽

Statistical Power ◽

Selection Procedure ◽

Principal Component ◽

Response Variable ◽

Explanatory Variables ◽

False Discovery ◽

Variable Selection Procedure

Abstract In many applications, we need to study a linear regression model that consists of a response variable and a large number of potential explanatory variables, and determine which variables are truly associated with the response. In Foygel Barber & Candès (2015, Ann. Statist., 43, 2055–2085), the authors introduced a new variable selection procedure called the knockoff filter to control the false discovery rate (FDR) and proved that this method achieves exact FDR control. In this paper, we propose a prototype knockoff filter for group selection by extending the Reid–Tibshirani (2016, Biostatistics, 17, 364–376) prototype method. Our prototype knockoff filter improves the computational efficiency and statistical power of the Reid–Tibshirani prototype method when it is applied for group selection. In some cases when the group features are spanned by one or a few hidden factors, we demonstrate that the Principal Component Analysis (PCA) prototype knockoff filter outperforms the Dai–Foygel Barber (2016, 33rd International Conference on Machine Learning (ICML 2016)) group knockoff filter. We present several numerical experiments to compare our prototype knockoff filter with the Reid–Tibshirani prototype method and the group knockoff filter. We have also conducted some analysis of the knockoff filter. Our analysis reveals that some knockoff path method statistics, including the Lasso path statistic, may lead to loss of power for certain design matrices and a specially designed response even if their signal strengths are still relatively strong.

Download Full-text

Variable Selection Procedure for Discrimination Between two Multinormal Populations with Common Dispersion Matrix Proportional to a Known Positive Definite Matrix

Calcutta Statistical Association Bulletin ◽

10.1177/0008068320100301 ◽

2010 ◽

Vol 62 (3-4) ◽

pp. 129-142

Author(s):

Sisir Kumar Samanta

Keyword(s):

Variable Selection ◽

Selection Procedure ◽

Positive Definite Matrix ◽

Positive Definite ◽

Dispersion Matrix ◽

Variable Selection Procedure

Download Full-text

Prediction of Placental Barrier Permeability: A Model Based on Partial Least Squares Variable Selection Procedure

Molecules ◽

10.3390/molecules20058270 ◽

2015 ◽

Vol 20 (5) ◽

pp. 8270-8286 ◽

Cited By ~ 9

Author(s):

Yong-Hong Zhang ◽

Zhi-Ning Xia ◽

Li Yan ◽

Shu-Shen Liu

Keyword(s):

Variable Selection ◽

Least Squares ◽

Partial Least Squares ◽

Selection Procedure ◽

Placental Barrier ◽

Barrier Permeability ◽

Model Based ◽

Variable Selection Procedure

Download Full-text

Improved variable selection procedure for multivariate linear regression

Analytica Chimica Acta ◽

10.1016/s0003-2670(97)00450-9 ◽

1997 ◽

Vol 354 (1-3) ◽

pp. 225-232 ◽

Cited By ~ 12

Author(s):

A.D Walmsley

Keyword(s):

Linear Regression ◽

Variable Selection ◽

Selection Procedure ◽

Multivariate Linear Regression ◽

Variable Selection Procedure

Download Full-text

A stepwise discrete variable selection procedure

Communication in Statistics- Theory and Methods ◽

10.1080/03610927708827585 ◽

1977 ◽

Vol 6 (14) ◽

pp. 1423-1436 ◽

Cited By ~ 7

Author(s):

Matthew Goldstein ◽

William R. Dillon

Keyword(s):

Variable Selection ◽

Selection Procedure ◽

Discrete Variable ◽

Variable Selection Procedure

Download Full-text

A Variable Selection Procedure for X-ray Diffraction Phase Analysis

Applied Spectroscopy ◽

10.1366/000370207783292127 ◽

2007 ◽

Vol 61 (12) ◽

pp. 1398-1403 ◽

Cited By ~ 7

Author(s):

Daewon Lee ◽

Hyeseon Lee ◽

Chi-Hyuck Jun ◽

Chang Hwan Chang

Keyword(s):

Variable Selection ◽

Phase Analysis ◽

Selection Procedure ◽

X Ray Diffraction ◽

X Ray ◽

Variable Selection Procedure

Download Full-text

Penalized variable selection procedure for Cox proportional hazards model via seamless-$\boldsymbol{L_0}$ penalty

Scientia Sinica Mathematica ◽

10.1360/scm-2016-0609 ◽

2018 ◽

Vol 48 (5) ◽

pp. 643 ◽

Cited By ~ 1

Author(s):

Cao Yongxiu ◽

Jiao Yuling ◽

Shi Yueyong ◽

Liu Yanyan

Keyword(s):

Variable Selection ◽

Proportional Hazards ◽

Proportional Hazards Model ◽

Selection Procedure ◽

Cox Proportional Hazards ◽

Cox Proportional Hazards Model ◽

Hazards Model ◽

Variable Selection Procedure

Download Full-text

Improving Practices for Selecting a Subset of Important Predictors in Psychology: An Application to Predicting Pain

Advances in Methods and Practices in Psychological Science ◽

10.1177/2515245919885617 ◽

2020 ◽

Vol 3 (1) ◽

pp. 66-80 ◽

Cited By ~ 1

Author(s):

Sierra A. Bainter ◽

Thomas G. McCauley ◽

Tor Wager ◽

Elizabeth A. Reynolds Losin

Keyword(s):

Variable Selection ◽

Multiple Testing ◽

Selection Procedure ◽

Experimental Pain ◽

Bayesian Variable Selection ◽

Large Set ◽

Web Based ◽

Stochastic Search Variable Selection ◽

Variable Selection Procedure ◽

Multivariate Relationships

Frequently, researchers in psychology are faced with the challenge of narrowing down a large set of predictors to a smaller subset. There are a variety of ways to do this, but commonly it is done by choosing predictors with the strongest bivariate correlations with the outcome. However, when predictors are correlated, bivariate relationships may not translate into multivariate relationships. Further, any attempts to control for multiple testing are likely to result in extremely low power. Here we introduce a Bayesian variable-selection procedure frequently used in other disciplines, stochastic search variable selection (SSVS). We apply this technique to choosing the best set of predictors of the perceived unpleasantness of an experimental pain stimulus from among a large group of sociocultural, psychological, and neurobiological (functional MRI) individual-difference measures. Using SSVS provides information about which variables predict the outcome, controlling for uncertainty in the other variables of the model. This approach yields new, useful information to guide the choice of relevant predictors. We have provided Web-based open-source software for performing SSVS and visualizing the results.

Download Full-text