model selection consistency
Recently Published Documents

TOTAL DOCUMENTS: 14 (five years: 7)
H-INDEX: 3 (five years: 1)

Biometrika, 2021
Author(s): Emre Demirkaya, Yang Feng, Pallavi Basu, Jinchi Lv

Summary: Model selection is crucial both to high-dimensional learning and to inference for contemporary big data applications, as it pinpoints the best set of covariates among a sequence of candidate interpretable models. Most existing work implicitly assumes that the models are correctly specified or have fixed dimensionality, yet model misspecification and high dimensionality are both prevalent in practice. In this paper, we exploit the framework of model selection principles under misspecified generalized linear models presented in Lv and Liu (2014) and investigate the asymptotic expansion of the posterior model probability in the setting of high-dimensional misspecified models. With a natural choice of prior probabilities that encourages interpretability and incorporates the Kullback–Leibler divergence, we suggest the high-dimensional generalized Bayesian information criterion with prior probability for large-scale model selection with misspecification. Our new information criterion characterizes the impacts of both model misspecification and high dimensionality on model selection. We further establish the consistency of covariance contrast matrix estimation and the model selection consistency of the new information criterion in ultra-high dimensions under some mild regularity conditions. Numerical studies demonstrate that the new method enjoys improved model selection consistency compared with its main competitors.
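As a rough illustration of how a criterion of this kind is used, the sketch below scores candidate models with a hypothetical BIC-style penalty that grows with both the sample size n and the ambient dimension p. The penalty form and all numbers are illustrative, not the paper's actual criterion, which also carries a misspecification adjustment through the covariance contrast matrix:

```python
import math

def gbic_p(loglik, d, n, p):
    # Illustrative BIC-style score: -2 * max log-likelihood plus a penalty
    # with the classical (log n) term per parameter and an extra (log p)
    # term encoding a prior that downweights large models in high dimensions.
    return -2.0 * loglik + d * math.log(n) + 2.0 * d * math.log(p)

# Hypothetical candidates: model name -> (max log-likelihood, model size)
candidates = {"M1": (-120.5, 2), "M2": (-118.9, 5), "M3": (-118.7, 9)}
n, p = 200, 1000
scores = {m: gbic_p(ll, d, n, p) for m, (ll, d) in candidates.items()}
best = min(scores, key=scores.get)  # smallest score wins
```

Larger models must buy each extra parameter with a sufficient likelihood gain; here the small likelihood improvements of the 5- and 9-covariate models do not cover the penalty, so the 2-covariate model is selected.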


2020, pp. 096228022097899
Author(s): Xuan Cao, Kyoungjae Lee, Qingling Huang

Parkinson’s disease is a progressive, chronic, and neurodegenerative disorder that is primarily diagnosed by clinical examinations and magnetic resonance imaging (MRI). In this paper, we propose a Bayesian model to predict Parkinson’s disease employing a functional MRI (fMRI) based radiomics approach. We consider a spike and slab prior for variable selection in high-dimensional logistic regression models, and present an approximate Gibbs sampler by replacing a logistic distribution with a t-distribution. Under mild conditions, we establish model selection consistency of the induced posterior and show through simulation studies that the proposed method outperforms existing state-of-the-art methods. In the fMRI analysis, 6216 whole-brain functional connectivity features are extracted for 50 healthy controls and 70 Parkinson’s disease patients. We apply our method to the resulting dataset and further demonstrate its benefits through a higher average prediction accuracy of 0.83 compared with other contenders, based on 10 random splits. The model fitting procedure also reveals the most discriminative brain regions for Parkinson’s disease. These findings demonstrate that the proposed Bayesian variable selection method has the potential to support radiological diagnosis for patients with Parkinson’s disease.
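The paper's approximate Gibbs sampler is built on a t-distribution substitution; as a much simpler stand-in for tiny p, spike-and-slab posterior model probabilities can be sketched by enumerating all models and approximating each marginal likelihood with a BIC-type Laplace surrogate. Everything below (the simulated data, the gradient-ascent fitter, the surrogate) is an illustrative toy, not the authors' algorithm:

```python
import itertools
import math
import random

random.seed(0)

def fit_logistic(X, y, iters=300, lr=2.0):
    # Toy gradient-ascent MLE for logistic regression; returns the maximized
    # log-likelihood of the fitted model.
    d, n = len(X[0]), len(y)
    beta = [0.0] * d
    for _ in range(iters):
        grad = [0.0] * d
        for xi, yi in zip(X, y):
            prob = 1.0 / (1.0 + math.exp(-sum(b * v for b, v in zip(beta, xi))))
            for j in range(d):
                grad[j] += (yi - prob) * xi[j]
        beta = [b + lr * g / n for b, g in zip(beta, grad)]
    ll = 0.0
    for xi, yi in zip(X, y):
        prob = 1.0 / (1.0 + math.exp(-sum(b * v for b, v in zip(beta, xi))))
        ll += yi * math.log(prob) + (1 - yi) * math.log(1 - prob)
    return ll

# Simulated data: only the first of three covariates is truly active.
n, p, w = 200, 3, 0.5  # w = prior inclusion probability (the "slab" weight)
X = [[random.gauss(0, 1) for _ in range(p)] for _ in range(n)]
y = [1 if random.random() < 1.0 / (1.0 + math.exp(-2.0 * xi[0])) else 0
     for xi in X]

# Enumerate all 2^p inclusion patterns; score each by a BIC-style Laplace
# surrogate for the log marginal likelihood plus the spike-and-slab log-prior.
log_post = {}
for gamma in itertools.product([0, 1], repeat=p):
    cols = [j for j in range(p) if gamma[j]]
    Xs = [[xi[j] for j in cols] + [1.0] for xi in X]  # always keep intercept
    k = len(cols)
    log_post[gamma] = (fit_logistic(Xs, y)
                       - 0.5 * (k + 1) * math.log(n)
                       + k * math.log(w) + (p - k) * math.log(1 - w))
best = max(log_post, key=log_post.get)
```

With enough data the highest-scoring pattern includes the truly active covariate while the noise covariates fail to pay for their penalty, which is the model selection consistency phenomenon the abstract refers to.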


Entropy, 2020, Vol 22 (8), pp. 807
Author(s): Xuan Cao, Kyoungjae Lee

High-dimensional variable selection is an important research topic in modern statistics. While methods using nonlocal priors have been thoroughly studied for variable selection in linear regression, the crucial high-dimensional model selection properties of nonlocal priors in generalized linear models have not been investigated. In this paper, we consider a hierarchical generalized linear regression model with the product moment nonlocal prior over coefficients and examine its properties. Under standard regularity assumptions, we establish strong model selection consistency in a high-dimensional setting, where the number of covariates is allowed to increase at a sub-exponential rate with the sample size. The Laplace approximation is implemented for computing the posterior probabilities, and the shotgun stochastic search procedure is suggested for exploring the posterior space. The proposed method is validated through simulation studies and illustrated by a real data example on functional activity analysis in an fMRI study for predicting Parkinson’s disease.
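The defining feature of the product moment (pMOM) nonlocal prior is that its density vanishes at zero, unlike a local (Gaussian) prior. A minimal sketch of the first-order (r = 1) scalar pMOM density, with an ordinary normal density alongside for comparison:

```python
import math

def pmom_density(beta, tau=1.0):
    # First-order product moment (pMOM) non-local prior density for a scalar
    # coefficient: beta^2 / tau times a N(0, tau) density. The 1/tau factor
    # makes it integrate to one. It vanishes at beta = 0, which is what lets
    # the prior separate near-zero "noise" coefficients from genuine signals.
    normal = math.exp(-beta ** 2 / (2.0 * tau)) / math.sqrt(2.0 * math.pi * tau)
    return (beta ** 2 / tau) * normal

def normal_density(beta, tau=1.0):
    # Local (Gaussian) prior for comparison: strictly positive at beta = 0.
    return math.exp(-beta ** 2 / (2.0 * tau)) / math.sqrt(2.0 * math.pi * tau)
```

Here pmom_density(0.0) is exactly 0 while normal_density(0.0) is about 0.399; models containing near-zero coefficients are therefore heavily downweighted under the pMOM prior, which is the mechanism behind its strong model selection behavior.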


2019, Vol 36 (3), pp. 526-558
Author(s): Qingliang Fan, Xiao Han, Guangming Pan, Bibo Jiang

In this article, using a shrinkage estimator, we propose a penalized quasi-maximum likelihood estimator (PQMLE) to estimate a large system of equations in seemingly unrelated regression models, where the number of equations is large relative to the sample size. We develop the asymptotic properties of the PQMLE for both the error covariance matrix and model coefficients. In particular, we derive the asymptotic distribution of the coefficient estimator and the convergence rate of the estimated covariance matrix in terms of the Frobenius norm. The model selection consistency of the covariance matrix estimator is also established. Simulation results show that when the number of equations is large relative to the sample size and the error covariance matrix is sparse, the PQMLE outperforms other contemporary estimators.
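For a sparse error covariance matrix, model selection consistency means recovering the true zero pattern of its entries. The sketch below imitates the effect of an l1-type penalty with simple soft-thresholding of the off-diagonal entries of a sample covariance; this is an illustrative stand-in, not the PQMLE itself:

```python
import math

def soft_threshold_cov(S, lam):
    # Sparsify a sample covariance matrix by soft-thresholding its
    # off-diagonal entries: shrink each toward zero by lam, setting entries
    # with magnitude below lam exactly to zero. Diagonal entries are kept.
    p = len(S)
    return [[S[i][j] if i == j
             else math.copysign(max(abs(S[i][j]) - lam, 0.0), S[i][j])
             for j in range(p)] for i in range(p)]

# Hypothetical sample covariance with two small (noise) off-diagonal entries.
S = [[1.00, 0.05, 0.40],
     [0.05, 1.00, 0.02],
     [0.40, 0.02, 1.00]]
S_hat = soft_threshold_cov(S, lam=0.1)
```

Entries below the threshold (0.05 and 0.02) are zeroed out while the larger entry 0.40 survives (shrunk to 0.30), mirroring how a penalized covariance estimator recovers the sparsity pattern when small entries reflect sampling noise.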


2019, Vol 8 (3), pp. 621-653
Author(s): Minwoo Chae, Lizhen Lin, David B. Dunson

Abstract: We study Bayesian procedures for sparse linear regression when the unknown error distribution is endowed with a non-parametric prior. Specifically, we put a symmetrized Dirichlet process mixture of Gaussians prior on the error density, where the mixing distributions are compactly supported. For the prior on regression coefficients, a mixture of point masses at zero and continuous distributions is considered. Under the assumption that the model is well specified, we study the behavior of the posterior with a diverging number of predictors. The compatibility and restricted eigenvalue conditions yield the minimax convergence rate of the regression coefficients in $\ell_1$- and $\ell_2$-norms, respectively. In addition, strong model selection consistency and a semi-parametric Bernstein–von Mises theorem are proven under slightly stronger conditions.
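One way to get intuition for such an error prior is to simulate from it. The sketch below draws from a truncated stick-breaking (Dirichlet process) mixture of Gaussians with compactly supported atom locations, symmetrized by a random sign flip so the resulting density is symmetric about zero. The truncation level, support, and kernel scale are all illustrative choices, not the paper's:

```python
import math
import random

random.seed(1)

def sample_symmetrized_dp_error(alpha=1.0, n_atoms=50, sigma=0.5):
    # One draw from a truncated stick-breaking representation of a Dirichlet
    # process mixture of Gaussians. Atom locations are drawn from a compact
    # interval; a random sign flip symmetrizes the error density about zero.
    weights, locs, rest = [], [], 1.0
    for _ in range(n_atoms):
        v = random.betavariate(1.0, alpha)       # stick-breaking proportion
        weights.append(rest * v)
        rest *= 1.0 - v
        locs.append(random.uniform(-2.0, 2.0))   # compactly supported atoms
    # Sample a mixture component in proportion to its (truncated) weight.
    u, acc, comp = random.random() * sum(weights), 0.0, n_atoms - 1
    for k, w in enumerate(weights):
        acc += w
        if u <= acc:
            comp = k
            break
    sign = random.choice([-1.0, 1.0])
    return sign * (locs[comp] + random.gauss(0.0, sigma))

draws = [sample_symmetrized_dp_error() for _ in range(5000)]
mean = sum(draws) / len(draws)
```

Because of the sign-flip symmetrization, the simulated errors have a distribution symmetric about zero, so their sample mean is close to 0; this symmetry is what pins down the location of the error density in the semi-parametric model.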

