A re-formulation of generalized linear mixed models to fit family data in genetic association studies

AbstractLinear mixed models (LMMs) have become the standard approach for genetic association testing in the presence of sample structure. However, the performance of LMMs has primarily been evaluated in relatively homogeneous populations of European ancestry, despite many of the recent genetic association studies including samples from worldwide populations with diverse ancestries. In this paper, we demonstrate that existing LMM methods can have systematic miscalibration of association test statistics genome-wide in samples with heterogenous ancestry, resulting in both increased type-I error rates and a loss of power. Furthermore, we show that this miscalibration arises due to varying allele frequency differences across the genome among populations. To overcome this problem, we developed LMM-OPS, an LMM approach which orthogonally partitions diverse genetic structure into two components: distant population structure and recent genetic relatedness. In simulation studies with real and simulated genotype data, we demonstrate that LMM-OPS is appropriately calibrated in the presence of ancestry heterogeneity and outperforms existing LMM approaches, including EMMAX, GCTA, and GEMMA. We conduct a GWAS of white blood cell (WBC) count in an admixed sample of 3,551 Hispanic/Latino American women from the Women’s Health Initiative SNP Health Association Resource where LMM-OPS detects genome-wide significant associations with corresponding p-values that are one or more orders of magnitude smaller than those from competing LMM methods. We also identify a genome-wide significant association with regulatory variant rs2814778 in the DARC gene on chromosome 1, which generalizes to Hispanic/Latino Americans a previous association with reduced WBC count identified in African Americans.

Download Full-text

Linear Score Tests for Variance Components in Linear Mixed Models and Applications to Genetic Association Studies

Biometrics ◽

10.1111/biom.12095 ◽

2013 ◽

Vol 69 (4) ◽

pp. 883-892 ◽

Cited By ~ 20

Author(s):

Long Qu ◽

Tobias Guennel ◽

Scott L. Marshall

Keyword(s):

Genetic Association ◽

Mixed Models ◽

Variance Components ◽

Association Studies ◽

Linear Mixed Models ◽

Genetic Association Studies ◽

Score Tests

Download Full-text

An Efficient Test for Gene-Environment Interaction in Generalized Linear Mixed Models with Family Data

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph14101134 ◽

2017 ◽

Vol 14 (10) ◽

pp. 1134 ◽

Cited By ~ 3

Author(s):

Mauricio Mazo Lopera ◽

Brandon Coombes ◽

Mariza de Andrade

Keyword(s):

Mixed Models ◽

Generalized Linear Mixed Models ◽

Linear Mixed Models ◽

Family Data ◽

Environment Interaction ◽

Gene Environment Interaction ◽

Efficient Test ◽

Gene Environment

Download Full-text

Extended Bayesian Model Averaging in Generalized Linear Mixed Models Applied to Schizophrenia Family Data

Annals of Human Genetics ◽

10.1111/j.1469-1809.2010.00592.x ◽

2010 ◽

Vol 75 (1) ◽

pp. 62-77 ◽

Cited By ~ 3

Author(s):

Miao-Yu Tsai ◽

Chuhsing K. Hsiao ◽

Wei J. Chen

Keyword(s):

Mixed Models ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Generalized Linear Mixed Models ◽

Linear Mixed Models ◽

Model Averaging ◽

Family Data

Download Full-text

Using Family Data as a Verification Standard to Evaluate Copy Number Variation Calling Strategies for Genetic Association Studies

Genetic Epidemiology ◽

10.1002/gepi.21618 ◽

2012 ◽

Vol 36 (3) ◽

pp. 253-262 ◽

Cited By ~ 7

Author(s):

Xiaojing Zheng ◽

John R. Shaffer ◽

Caitlin P. McHugh ◽

Cathy C. Laurie ◽

Bjarke Feenstra ◽

...

Keyword(s):

Copy Number Variation ◽

Genetic Association ◽

Copy Number ◽

Association Studies ◽

Genetic Association Studies ◽

Family Data ◽

Number Variation ◽

Copy Number Variation Calling

Download Full-text

Variable selection in Bayesian generalized linear-mixed models: An illustration using candidate gene case-control association studies

Biometrical Journal ◽

10.1002/bimj.201300259 ◽

2014 ◽

Vol 57 (2) ◽

pp. 234-253 ◽

Cited By ~ 2

Author(s):

Miao-Yu Tsai

Keyword(s):

Variable Selection ◽

Candidate Gene ◽

Mixed Models ◽

Generalized Linear Mixed Models ◽

Association Studies ◽

Linear Mixed Models ◽

Case Control ◽

Control Association

Download Full-text

Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models

The American Journal of Human Genetics ◽

10.1016/j.ajhg.2016.02.012 ◽

2016 ◽

Vol 98 (4) ◽

pp. 653-666 ◽

Cited By ~ 169

Author(s):

Han Chen ◽

Chaolong Wang ◽

Matthew P. Conomos ◽

Adrienne M. Stilp ◽

Zilin Li ◽

...

Keyword(s):

Population Structure ◽

Genetic Association ◽

Mixed Models ◽

Association Studies ◽

Genetic Association Studies ◽

Binary Traits

Download Full-text

Specification of Generalized Linear Mixed Models for Family Data using Markov Chain Monte Carlo Methods

Journal of Biometrics & Biostatistics ◽

10.4172/2155-6180.s1-003 ◽

2013 ◽

Author(s):

Kris M Jamsen Sophie G Zaloumis

Keyword(s):

Monte Carlo ◽

Markov Chain ◽

Markov Chain Monte Carlo ◽

Monte Carlo Methods ◽

Mixed Models ◽

Generalized Linear Mixed Models ◽

Linear Mixed Models ◽

Family Data

Download Full-text

LiMMBo: a simple, scalable approach for linear mixed models in high-dimensional genetic association studies

10.1101/255497 ◽

2018 ◽

Cited By ~ 4

Author(s):

Hannah Verena Meyer ◽

Francesco Paolo Casale ◽

Oliver Stegle ◽

Ewan Birney

Keyword(s):

Mixed Models ◽

Growth Traits ◽

Association Studies ◽

Linear Mixed Models ◽

Genetic Association Studies ◽

Pleiotropic Effects ◽

High Dimensional ◽

Model Parameters ◽

Moderate Number ◽

Genome Wide

AbstractGenome-wide association studies have helped to shed light on the genetic architecture of complex traits and diseases. Deep phenotyping of population cohorts is increasingly applied, where multi-to high-dimensional phenotypes are recorded in the individuals. Whilst these rich datasets provide important opportunities to analyse complex trait structures and pleiotropic effects at a genome-wide scale, existing statistical methods for joint genetic analyses are hampered by computational limitations posed by high-dimensional phenotypes. Consequently, such multivariate analyses are currently limited to a moderate number of traits. Here, we introduce a method that combines linear mixed models with bootstrapping (LiMMBo) to enable computationally efficient joint genetic analysis of high-dimensional phenotypes. Our method builds on linear mixed models, thereby providing robust control for population structure and other confounding factors, and the model scales to larger datasets with up to hundreds of phenotypes. We first validate LiMMBo using simulations, demonstrating consistent covariance estimates at greatly reduced computational cost compared to existing methods. We also find LiMMBo yields consistent power advantages compared to univariate modelling strategies, where the advantages of multivariate mapping increases substantially with the phenotype dimensionality. Finally, we applied LiMMBo to 41 yeast growth traits to map their genetic determinants, finding previously known and novel pleiotropic relationships in this high-dimensional phenotype space. LiMMBo is accessible as open source software (https://github.com/HannahVMeyer/limmbo).Author summaryIn multi-trait genetic association studies one is interested in detecting genetic variants that are associated with one or multiple traits. Genetic variants that influence two or more traits are referred to as pleiotropic. Multivariate linear mixed models have been successfully applied to detect pleiotropic effects, by jointly modelling association signals across traits. However, these models are currently limited to a moderate number of phenotypes as the number of model parameters grows steeply with the number of phenotypes, raising a computational burden. We developed LiMMBo, a new approach for the joint analysis of high-dimensional phenotypes. Our method reduces the number of effective model parameters by introducing an intermediate subsampling step. We validate this strategy using simulations, where we apply LiMMBo for the genetic analysis of hundreds of phenotypes, detecting pleiotropic effects for a wide range of simulated genetic architectures. Finally, to illustrate LiMMBo in practice, we apply the model to a study of growth traits in yeast, where we identify pleiotropic effects for traits with formerly known genetic effects as well as revealing previously unconnected traits.

Download Full-text

Statistical methods for multi-marker testing in genetic association studies

10.37099/mtu.dc.etd-restricted/82 ◽

2012 ◽

Author(s):

Yilin Dai

Keyword(s):

Genetic Association ◽

Statistical Methods ◽

Association Studies ◽

Genetic Association Studies

Download Full-text