Improving Practices for Selecting a Subset of Important Predictors in Psychology: An Application to Predicting Pain

In this paper we address the problem of selecting important predictors from some larger set of candidate predictors. Standard techniques are limited by lack of power and high false positive rates. A Bayesian variable selection approach used widely in biostatistics, stochastic search variable selection, can be used instead to combat these issues by accounting for uncertainty in the other predictors of the model. In this paper we present Bayesian variable selection to aid researchers facing this common scenario, along with an online application (https://ssvsforpsych.shinyapps.io/ssvsforpsych/) to perform the analysis and visualize the results. Using an application to predict pain ratings, we demonstrate how this approach quickly identifies reliable predictors, even when the set of possible predictors is larger than the sample size. This technique is widely applicable to research questions that may be relatively data-rich, but with limited information or theory to guide variable selection.
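To make the idea concrete, here is a minimal sketch of a stochastic search variable selection Gibbs sampler for linear regression. It assumes the classic normal-mixture ("spike-and-slab") prior and is not the implementation behind the online application mentioned above. The function name `ssvs_gibbs` and the hyperparameters `tau`, `c`, `p_incl`, `a`, and `b` are illustrative assumptions, and the sketch further assumes standardized predictors and a centered outcome (no intercept).

```python
import numpy as np

def ssvs_gibbs(X, y, n_iter=2000, burn_in=500, tau=0.05, c=100.0,
               p_incl=0.5, a=1.0, b=1.0, seed=0):
    """Return estimated posterior inclusion probabilities for each column of X.

    Assumed prior (illustrative, not taken from the paper):
      beta_j | gamma_j ~ N(0, tau^2) if gamma_j = 0 (spike),
                         N(0, (c*tau)^2) if gamma_j = 1 (slab)
      gamma_j ~ Bernoulli(p_incl),  sigma^2 ~ Inverse-Gamma(a, b)
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    XtX, Xty = X.T @ X, X.T @ y
    gamma = np.ones(p, dtype=bool)   # start with every predictor in the slab
    sigma2 = 1.0
    incl_counts = np.zeros(p)

    for it in range(n_iter):
        # 1. Draw beta | gamma, sigma2, y from its multivariate normal conditional.
        v = np.where(gamma, (c * tau) ** 2, tau ** 2)   # prior variances
        precision = XtX / sigma2 + np.diag(1.0 / v)     # positive definite, even if p > n
        L = np.linalg.cholesky(precision)
        mean = np.linalg.solve(precision, Xty / sigma2)
        beta = mean + np.linalg.solve(L.T, rng.standard_normal(p))

        # 2. Draw each gamma_j | beta_j by comparing slab and spike densities.
        log_slab = np.log(p_incl) - np.log(c * tau) - 0.5 * (beta / (c * tau)) ** 2
        log_spike = np.log1p(-p_incl) - np.log(tau) - 0.5 * (beta / tau) ** 2
        prob_slab = 1.0 / (1.0 + np.exp(log_spike - log_slab))
        gamma = rng.random(p) < prob_slab

        # 3. Draw sigma2 | beta, y from its inverse-gamma conditional.
        resid = y - X @ beta
        sigma2 = 1.0 / rng.gamma(a + n / 2.0, 1.0 / (b + 0.5 * resid @ resid))

        if it >= burn_in:
            incl_counts += gamma

    return incl_counts / (n_iter - burn_in)
```

Calling `ssvs_gibbs(X, y)` returns one value per predictor between 0 and 1; predictors whose value is close to 1 were assigned to the slab in most retained iterations. Because the prior adds a positive diagonal to the precision matrix, the coefficient draw remains well defined when there are more predictors than observations.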


Genetic analysis of complex traits via Bayesian variable selection: the utility of a mixture of uniform priors

Genetics Research ◽ 10.1017/s0016672311000164 ◽ 2011 ◽ Vol 93 (4) ◽ pp. 303-318 ◽ Cited By ~ 12

Author(s): TIMO KNÜRR ◽ ESA LÄÄRÄ ◽ MIKKO J. SILLANPÄÄ

Keyword(s): Genetic Analysis ◽ Variable Selection ◽ Complex Traits ◽ Bayesian Variable Selection ◽ Stochastic Search Variable Selection ◽ Indicator Variables ◽ Supplementary Material ◽ Selection Of ◽ Made In ◽ Search Variable

Summary: A new estimation-based Bayesian variable selection approach is presented for genetic analysis of complex traits based on linear or logistic regression. By assigning a mixture of uniform priors (MU) to genetic effects, the approach provides an intuitive way of specifying hyperparameters controlling the selection of multiple influential loci. It aims to avoid the difficulty of interpreting assumptions made in the specification of priors. The method is compared on two real datasets with two other approaches, stochastic search variable selection (SSVS) and a re-formulation of Bayes B utilizing indicator variables and adaptive Student's t-distributions (IAt). The Markov chain Monte Carlo (MCMC) sampling performance of the three methods is evaluated using the publicly available software OpenBUGS (model scripts are provided in the Supplementary material). The sensitivity of MU to the specification of hyperparameters is assessed in one of the data examples.


Identifying Activity-sensitive Spectral Lines: A Bayesian Variable Selection Approach

The Astronomical Journal ◽ 10.3847/1538-3881/ab441c ◽ 2019 ◽ Vol 158 (5) ◽ pp. 210

Author(s): Bo Ning ◽ Alexander Wise ◽ Jessi Cisewski-Kehe ◽ Sarah Dodson-Robinson ◽ Debra Fischer

Keyword(s): Variable Selection ◽ Bayesian Variable Selection ◽ Spectral Lines ◽ Selection Approach


Stochastic search variable selection in vector error correction models with an application to a model of the UK macroeconomy

Journal of Applied Econometrics ◽ 10.1002/jae.1238 ◽ 2011 ◽ Vol 28 (1) ◽ pp. 62-81 ◽ Cited By ~ 11

Author(s): Markus Jochmann ◽ Gary Koop ◽ Roberto Leon-Gonzalez ◽ Rodney W. Strachan

Keyword(s): Variable Selection ◽ Error Correction ◽ Stochastic Search ◽ Error Correction Models ◽ Vector Error Correction ◽ Stochastic Search Variable Selection ◽ Vector Error ◽ The UK ◽ Vector Error Correction Models ◽ Search Variable


Gene selection: a Bayesian variable selection approach

Bioinformatics ◽ 10.1093/bioinformatics/19.1.90 ◽ 2003 ◽ Vol 19 (1) ◽ pp. 90-97 ◽ Cited By ~ 225

Author(s): K. E. Lee ◽ N. Sha ◽ E. R. Dougherty ◽ M. Vannucci ◽ B. K. Mallick

Keyword(s): Variable Selection ◽ Gene Selection ◽ Bayesian Variable Selection ◽ Selection Approach


Acceleration of the stochastic search variable selection via componentwise Gibbs sampling

Metrika ◽ 10.1007/s00184-016-0604-x ◽ 2016 ◽ Vol 80 (3) ◽ pp. 289-308 ◽ Cited By ~ 2

Author(s): Hengzhen Huang ◽ Shuangshuang Zhou ◽ Min-Qian Liu ◽ Zong-Feng Qi

Keyword(s): Variable Selection ◽ Gibbs Sampling ◽ Stochastic Search ◽ Stochastic Search Variable Selection ◽ Search Variable


Identification of the minimum effective dose for normally distributed data using a Bayesian variable selection approach

Journal of Biopharmaceutical Statistics ◽ 10.1080/10543406.2017.1295247 ◽ 2017 ◽ Vol 27 (6) ◽ pp. 1073-1088 ◽ Cited By ~ 1

Author(s): Martin Otava ◽ Ziv Shkedy ◽ Ludwig A. Hothorn ◽ Willem Talloen ◽ Daniel Gerhard ◽ ...

Keyword(s): Variable Selection ◽ Effective Dose ◽ Bayesian Variable Selection ◽ Distributed Data ◽ Minimum Effective Dose ◽ Selection Approach ◽ Normally Distributed


A split-and-merge Bayesian variable selection approach for ultrahigh dimensional regression

Journal of the Royal Statistical Society Series B (Statistical Methodology) ◽ 10.1111/rssb.12095 ◽ 2014 ◽ Vol 77 (5) ◽ pp. 947-972 ◽ Cited By ~ 18

Author(s): Qifan Song ◽ Faming Liang

Keyword(s): Variable Selection ◽ Bayesian Variable Selection ◽ Selection Approach ◽ Split And Merge


Stochastic search variable selection for log-linear models

Journal of Statistical Computation and Simulation ◽ 10.1080/00949650008812054 ◽ 2000 ◽ Vol 68 (1) ◽ pp. 23-37 ◽ Cited By ~ 15

Author(s): Ioannis Ntzoufras ◽ Jonathan J. Forster ◽ Petros Dellaportas

Keyword(s): Variable Selection ◽ Linear Models ◽ Stochastic Search ◽ Stochastic Search Variable Selection ◽ Selection For ◽ Log Linear ◽ Search Variable


Accuracy of genomic selection using stochastic search variable selection in Australian Holstein Friesian dairy cattle

Genetics Research ◽ 10.1017/s0016672309990243 ◽ 2009 ◽ Vol 91 (5) ◽ pp. 307-311 ◽ Cited By ~ 93

Author(s): KLARA L. VERBYLA ◽ BEN J. HAYES ◽ PHILIP J. BOWMAN ◽ MICHAEL E. GODDARD

Keyword(s): Variable Selection ◽ Genomic Selection ◽ Critical Issue ◽ Stochastic Search ◽ Selection Strategy ◽ Genomic Breeding ◽ Breeding Values ◽ Stochastic Search Variable Selection ◽ SNP Data ◽ Search Variable

Summary: Genomic selection describes a selection strategy based on genomic breeding values predicted from dense single nucleotide polymorphism (SNP) data. Multiple methods have been proposed, but the critical issue is how to decide whether an SNP should be included in the predictive set to estimate breeding values. One major disadvantage of the traditional Bayes B approach is its high computational demand, caused by the changing dimensionality of the models. The use of stochastic search variable selection (SSVS) retains the same assumptions about the distribution of SNP effects as Bayes B, while maintaining constant dimensionality. When Bayesian SSVS was used to predict genomic breeding values for real dairy data over a range of traits, it produced accuracies higher than or equivalent to those of other genomic selection methods, with significantly lower computational and time demands than Bayes B.
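The contrast between changing and constant dimensionality can be illustrated with two generic prior forms. This is a sketch under assumed notation, not the exact specification used in the paper: $\beta_j$ is the effect of SNP $j$, $\gamma_j$ its inclusion indicator, $\pi$ a prior inclusion probability, $\delta_0$ a point mass at zero, and $\tau$, $c$, $\sigma_j$ illustrative scale parameters.

```latex
% Point-mass (changing-dimension) prior: excluded effects are exactly zero,
% so the set of effects present in the model changes between MCMC iterations.
\beta_j \mid \gamma_j \sim (1-\gamma_j)\,\delta_0 + \gamma_j\,\mathcal{N}(0,\sigma_j^2),
\qquad \gamma_j \sim \mathrm{Bernoulli}(\pi)

% Generic SSVS (constant-dimension) prior: excluded effects are drawn from a
% narrow normal "spike" instead of being removed, so every \beta_j is sampled
% at every iteration and the parameter vector keeps a fixed length.
\beta_j \mid \gamma_j \sim (1-\gamma_j)\,\mathcal{N}(0,\tau^2) + \gamma_j\,\mathcal{N}(0,c^2\tau^2),
\qquad c \gg 1,\quad \gamma_j \sim \mathrm{Bernoulli}(\pi)
```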


BayICE: A hierarchical Bayesian deconvolution model with stochastic search variable selection

10.1101/732743 ◽ 2019

Author(s): An-Shun Tai ◽ George C. Tseng ◽ Wen-Ping Hsieh

Keyword(s): Gene Expression ◽ Variable Selection ◽ Immune Cell ◽ Expression Profiles ◽ Gene Expression Profiles ◽ R Package ◽ Stochastic Search ◽ Hierarchical Bayesian ◽ Stochastic Search Variable Selection ◽ Search Variable

Abstract: Gene expression deconvolution is a powerful tool for exploring the microenvironment of complex tissues composed of multiple cell groups using transcriptomic data. Characterizing cell activities for a particular condition has been regarded as a primary mission against diseases. For example, cancer immunology aims to clarify the role of the immune system in the progression and development of cancer through analyzing the immune cell components of tumors. To that end, many deconvolution methods have been proposed for inferring cell subpopulations within tissues. Nevertheless, two problems limit the practicality of current approaches. First, all approaches use external purified data to preselect cell type-specific genes that contribute to deconvolution. However, some types of cells cannot be found in purified profiles, and the genes specifically over- or under-expressed in them cannot be identified. This is particularly a problem in cancer studies. Hence, a preselection strategy that is independent of deconvolution is inappropriate. The second problem is that existing approaches do not recover the expression profiles of unknown cells present in bulk tissues, which results in biased estimation of unknown cell proportions. Furthermore, it causes the shift-invariant property of deconvolution to fail, which then affects the estimation performance. To address these two problems, we propose a novel deconvolution approach, BayICE, which employs hierarchical Bayesian modeling with stochastic search variable selection. We develop a comprehensive Markov chain Monte Carlo procedure through Gibbs sampling to estimate cell proportions, gene expression profiles, and signature genes. Simulation and validation studies illustrate that BayICE outperforms existing deconvolution approaches in estimating cell proportions. Subsequently, we demonstrate an application of BayICE in the RNA sequencing of patients with non-small cell lung cancer. The model is implemented in the R package “BayICE” and the algorithm is available for download.
