LiMM‐PCA: Combining ASCA
            +
            and linear mixed models to analyse high‐dimensional designed data

Manon Martin; Bernadette Govaerts

doi:10.1002/cem.3232

CyTOF workflow: differential discovery in high-throughput high-dimensional cytometry datasets

F1000Research ◽

10.12688/f1000research.11622.3 ◽

2019 ◽

Vol 6 ◽

pp. 748 ◽

Cited By ~ 11

Author(s):

Malgorzata Nowicka ◽

Carsten Krieg ◽

Helena L. Crowell ◽

Lukas M. Weber ◽

Felix J. Hartmann ◽

...

Keyword(s):

Cell Population ◽

High Throughput ◽

Mixed Models ◽

Exploratory Data Analysis ◽

Linear Mixed Models ◽

High Dimensional ◽

Cell Populations ◽

Dimensional Scaling ◽

Exploratory Data

High-dimensional mass and flow cytometry (HDCyto) experiments have become a method of choice for high-throughput interrogation and characterization of cell populations. Here, we present an updated R-based pipeline for differential analyses of HDCyto data, largely based on Bioconductor packages. We computationally define cell populations using FlowSOM clustering, and facilitate an optional but reproducible strategy for manual merging of algorithm-generated clusters. Our workflow offers different analysis paths, including association of cell type abundance with a phenotype or changes in signalling markers within specific subpopulations, or differential analyses of aggregated signals. Importantly, the differential analyses we show are based on regression frameworks where the HDCyto data is the response; thus, we are able to model arbitrary experimental designs, such as those with batch effects, paired designs and so on. In particular, we apply generalized linear mixed models or linear mixed models to analyses of cell population abundance or cell-population-specific analyses of signaling markers, allowing overdispersion in cell count or aggregated signals across samples to be appropriately modeled. To support the formal statistical analyses, we encourage exploratory data analysis at every step, including quality control (e.g., multi-dimensional scaling plots), reporting of clustering results (dimensionality reduction, heatmaps with dendrograms) and differential analyses (e.g., plots of aggregated signals).

Download Full-text

Bayesian adaptive lasso with variational Bayes for variable selection in high-dimensional generalized linear mixed models

Communications in Statistics - Simulation and Computation ◽

10.1080/03610918.2017.1387663 ◽

2018 ◽

Vol 48 (2) ◽

pp. 530-543 ◽

Cited By ~ 2

Author(s):

Dao Thanh Tung ◽

Minh-Ngoc Tran ◽

Tran Manh Cuong

Keyword(s):

Variable Selection ◽

Mixed Models ◽

Generalized Linear Mixed Models ◽

Linear Mixed Models ◽

Variational Bayes ◽

Adaptive Lasso ◽

High Dimensional

Download Full-text

Improving heritability estimation by a variable selection approach in sparse high dimensional linear mixed models

Journal of the Royal Statistical Society Series C (Applied Statistics) ◽

10.1111/rssc.12261 ◽

2018 ◽

Vol 67 (4) ◽

pp. 813-839 ◽

Cited By ~ 3

Author(s):

Anna Bonnet ◽

Céline Lévy‐Leduc ◽

Elisabeth Gassiat ◽

Roberto Toro ◽

Thomas Bourgeron

Keyword(s):

Variable Selection ◽

Mixed Models ◽

Linear Mixed Models ◽

High Dimensional ◽

Heritability Estimation ◽

Selection Approach

Download Full-text

Inference and Estimation for Random Effects in High-Dimensional Linear Mixed Models

Journal of the American Statistical Association ◽

10.1080/01621459.2021.2004896 ◽

2021 ◽

pp. 1-31

Author(s):

Michael Law ◽

Ya’acov Ritov

Keyword(s):

Random Effects ◽

Mixed Models ◽

Linear Mixed Models ◽

High Dimensional

Download Full-text

Fixed Effects Testing in High-Dimensional Linear Mixed Models

Journal of the American Statistical Association ◽

10.1080/01621459.2019.1660172 ◽

2020 ◽

Vol 115 (532) ◽

pp. 1835-1850

Author(s):

Jelena Bradic ◽

Gerda Claeskens ◽

Thomas Gueuning

Keyword(s):

Mixed Models ◽

Fixed Effects ◽

Linear Mixed Models ◽

High Dimensional

Download Full-text

GLMMLasso: An Algorithm for High-Dimensional Generalized Linear Mixed Models Using ℓ1-Penalization

Journal of Computational and Graphical Statistics ◽

10.1080/10618600.2013.773239 ◽

2014 ◽

Vol 23 (2) ◽

pp. 460-477 ◽

Cited By ~ 25

Author(s):

Jürg Schelldorfer ◽

Lukas Meier ◽

Peter Bühlmann

Keyword(s):

Mixed Models ◽

Generalized Linear Mixed Models ◽

Linear Mixed Models ◽

High Dimensional

Download Full-text

Consistent Fixed-Effects Selection in Ultra-high dimensional Linear Mixed Models with Error-Covariate Endogeneity

Statistica Sinica ◽

10.5705/ss.202019.0421 ◽

2021 ◽

Author(s):

Abhik Ghosh ◽

Magne Thoresen

Keyword(s):

Mixed Models ◽

Fixed Effects ◽

Linear Mixed Models ◽

High Dimensional

Download Full-text

Heritability estimation in high dimensional sparse linear mixed models

Electronic Journal of Statistics ◽

10.1214/15-ejs1069 ◽

2015 ◽

Vol 9 (2) ◽

pp. 2099-2129 ◽

Cited By ~ 5

Author(s):

Anna Bonnet ◽

Elisabeth Gassiat ◽

Céline Lévy-Leduc

Keyword(s):

Mixed Models ◽

Linear Mixed Models ◽

High Dimensional ◽

Heritability Estimation

Download Full-text

Statistical Significance in High-dimensional Linear Mixed Models

Proceedings of the 2020 ACM-IMS on Foundations of Data Science Conference ◽

10.1145/3412815.3416883 ◽

2020 ◽

Author(s):

Lina Lin ◽

Mathias Drton ◽

Ali Shojaie

Keyword(s):

Mixed Models ◽

Statistical Significance ◽

Linear Mixed Models ◽

High Dimensional

Download Full-text

LiMMBo: a simple, scalable approach for linear mixed models in high-dimensional genetic association studies

10.1101/255497 ◽

2018 ◽

Cited By ~ 4

Author(s):

Hannah Verena Meyer ◽

Francesco Paolo Casale ◽

Oliver Stegle ◽

Ewan Birney

Keyword(s):

Mixed Models ◽

Growth Traits ◽

Association Studies ◽

Linear Mixed Models ◽

Genetic Association Studies ◽

Pleiotropic Effects ◽

High Dimensional ◽

Model Parameters ◽

Moderate Number ◽

Genome Wide

AbstractGenome-wide association studies have helped to shed light on the genetic architecture of complex traits and diseases. Deep phenotyping of population cohorts is increasingly applied, where multi-to high-dimensional phenotypes are recorded in the individuals. Whilst these rich datasets provide important opportunities to analyse complex trait structures and pleiotropic effects at a genome-wide scale, existing statistical methods for joint genetic analyses are hampered by computational limitations posed by high-dimensional phenotypes. Consequently, such multivariate analyses are currently limited to a moderate number of traits. Here, we introduce a method that combines linear mixed models with bootstrapping (LiMMBo) to enable computationally efficient joint genetic analysis of high-dimensional phenotypes. Our method builds on linear mixed models, thereby providing robust control for population structure and other confounding factors, and the model scales to larger datasets with up to hundreds of phenotypes. We first validate LiMMBo using simulations, demonstrating consistent covariance estimates at greatly reduced computational cost compared to existing methods. We also find LiMMBo yields consistent power advantages compared to univariate modelling strategies, where the advantages of multivariate mapping increases substantially with the phenotype dimensionality. Finally, we applied LiMMBo to 41 yeast growth traits to map their genetic determinants, finding previously known and novel pleiotropic relationships in this high-dimensional phenotype space. LiMMBo is accessible as open source software (https://github.com/HannahVMeyer/limmbo).Author summaryIn multi-trait genetic association studies one is interested in detecting genetic variants that are associated with one or multiple traits. Genetic variants that influence two or more traits are referred to as pleiotropic. Multivariate linear mixed models have been successfully applied to detect pleiotropic effects, by jointly modelling association signals across traits. However, these models are currently limited to a moderate number of phenotypes as the number of model parameters grows steeply with the number of phenotypes, raising a computational burden. We developed LiMMBo, a new approach for the joint analysis of high-dimensional phenotypes. Our method reduces the number of effective model parameters by introducing an intermediate subsampling step. We validate this strategy using simulations, where we apply LiMMBo for the genetic analysis of hundreds of phenotypes, detecting pleiotropic effects for a wide range of simulated genetic architectures. Finally, to illustrate LiMMBo in practice, we apply the model to a study of growth traits in yeast, where we identify pleiotropic effects for traits with formerly known genetic effects as well as revealing previously unconnected traits.

Download Full-text

LiMM‐PCA: Combining ASCA + and linear mixed models to analyse high‐dimensional designed data

CyTOF workflow: differential discovery in high-throughput high-dimensional cytometry datasets

Bayesian adaptive lasso with variational Bayes for variable selection in high-dimensional generalized linear mixed models

Improving heritability estimation by a variable selection approach in sparse high dimensional linear mixed models

Inference and Estimation for Random Effects in High-Dimensional Linear Mixed Models

Fixed Effects Testing in High-Dimensional Linear Mixed Models

GLMMLasso: An Algorithm for High-Dimensional Generalized Linear Mixed Models Using ℓ1-Penalization

Consistent Fixed-Effects Selection in Ultra-high dimensional Linear Mixed Models with Error-Covariate Endogeneity

Heritability estimation in high dimensional sparse linear mixed models

Statistical Significance in High-dimensional Linear Mixed Models

LiMMBo: a simple, scalable approach for linear mixed models in high-dimensional genetic association studies