Model selection bias and Freedman’s paradox

2009 ◽  
Vol 62 (1) ◽  
pp. 117-125 ◽  
Author(s):  
Paul M. Lukacs ◽  
Kenneth P. Burnham ◽  
David R. Anderson
2010 ◽  
Vol 20 (4) ◽  
pp. 768-786
Author(s):  
Kristien Wouters ◽  
José Cortiñas Abrahantes ◽  
Geert Molenberghs ◽  
Helena Geys ◽  
Abdellah Ahnaou ◽  
...  

2006 ◽  
Vol 22 (10) ◽  
pp. 1245-1250 ◽  
Author(s):  
D. Berrar ◽  
I. Bradbury ◽  
W. Dubitzky

Author(s):  
A. V. Rubanovich ◽  
V. A. Saenko

Marginal screening (MS) is the computationally simple and commonly used for the dimension reduction procedures. In it, a linear model is constructed for several top predictors, chosen according to the absolute value of marginal correlations with the dependent variable. Importantly, when kpredictors out of mprimary covariates are selected, the standard regression analysis may yield false-positive results if m>> k(Freedman's paradox). In this work, we provide analytical expressions describing null distribution of the test statistics for model selection via MS. Using the theory of order statistics, we show that under MS, the common F-statistic is distributed as a mean of ktop variables out of mindependent random variables having a 21χdistribution. Based on this finding, we estimated critical p-values for multiple regression models after MS, comparisons with which of those obtained in real studies will help researchers to avoid false-positive result. Analytical solutions obtained in the work are implemented in a free Excel spreadsheet program.


2020 ◽  
Vol 76 (5) ◽  
pp. 742-743 ◽  
Author(s):  
Min Zhan ◽  
Rebecca M. Doerfler ◽  
Jeffrey C. Fink

2019 ◽  
Vol 43 (2) ◽  
pp. 212-222
Author(s):  
DULAL K. BHAUMIK ◽  
◽  
SUBHASH ARYAL ◽  
SCOTT HUR ◽  
◽  
...  

Sign in / Sign up

Export Citation Format

Share Document