scholarly journals Leveraging phenotypic variability to identify genetic interactions in human phenotypes

Author(s):  
Andrew R. Marderstein ◽  
Emily Davenport ◽  
Scott Kulm ◽  
Cristopher V. Van Hout ◽  
Olivier Elemento ◽  
...  

AbstractWhile thousands of loci have been associated with human phenotypes, the role of gene-environment (GxE) interactions in determining individual risk of human diseases remains unclear. This is partly due to the severe erosion of statistical power resulting from the massive number of statistical tests required to detect such interactions. Here, we focus on improving the power of GxE tests by developing a statistical framework for assessing quantitative trait loci (QTLs) associated with the trait means and/or trait variances. When applying this framework to body mass index (BMI), we find that GxE discovery and replication rates are significantly higher when prioritizing genetic variants associated with the variance of the phenotype (vQTLs) compared to assessing all genetic variants. Moreover, we find that vQTLs are enriched for associations with other non-BMI phenotypes having strong environmental influences, such as diabetes or ulcerative colitis. We show that GxE effects first identified in quantitative traits such as BMI can be used for GxE discovery in disease phenotypes such as diabetes. A clear conclusion is that strong GxE interactions mediate the genetic contribution to body weight and diabetes risk.

2018 ◽  
Author(s):  
Alfred Pozarickij ◽  
Cathy Williams ◽  
Pirro Hysi ◽  
Jeremy A. Guggenheim ◽  

AbstractRefractive error is a complex ocular trait controlled by genetic and environmental factors. Genome-wide association studies (GWAS) have identified approximately 150 genetic variants associated with refractive error. Among the known environmental factors, education, near-work and time spent outdoors have been demonstrated to have the strongest associations. Currently, the extent of gene-environment or gene-gene interactions in myopia is unknown. Here we show that the majority of genetic variants associated with refractive error show evidence of effect size heterogeneity, which is a hallmark feature of genetic interactions. Using conditional quantile regression, we observed that 88% of genetic variants associated with refractive error have at least nominally-significant non-uniform, non-linear profiles across the refractive error distribution. SNP effects tend to be strongest at the phenotype extremes and have weaker effects in emmetropes. A parsimonious explanation for these findings is that gene-environment or gene-gene interactions in refractive error are pervasive.Author summaryThe prevalence of myopia (nearsightedness) in the United States and East Asia has almost doubled in the past 30 years. Such a rapid rise in prevalence cannot be explained by genetics, which implies that environmental (lifestyle) risk factors play a major role. Nevertheless, diverse approaches have suggested that genetics is also important, and indeed approximately 150 distinct genetic risk loci for myopia have been discovered to date. One attractive explanation for the evidence implicating both genes and environment in myopia is gene-environment (GxE) interaction (a difference in genetic effect in individuals exposed to a high vs. low level of an environmental risk factor). Past studies aiming to discover GxE interactions in myopia have met with limited success, perhaps because information on lifestyle exposures during childhood has rarely been available. Here we used an agnostic approach that does not require information about specific lifestyle exposures in order to detect ‘signatures’ of GxE interaction. We found compelling evidence for widespread genetic interactions in myopia, with 88% of 150 known myopia genetic susceptibility loci showing an interaction signature. These findings suggest that GxE interactions in myopia are pervasive.


Author(s):  
Chao Cheng ◽  
Donna Spiegelman ◽  
Zuoheng Wang ◽  
Molin Wang

Abstract Interest in investigating gene-environment (GxE) interactions has rapidly increased over the last decade. Although GxE interactions have been extremely investigated in large studies, few such effects have been identified and replicated, highlighting the need to develop statistical GxE tests with greater statistical power. The reverse test has been proposed for testing the interaction effect between a continuous exposure and genetic variants in relation to a binary disease outcome, which leverages the idea of linear discriminant analysis, significantly increasing statistical power comparing to the standard logistic regression approach. However, this reverse approach did not take into consideration adjustment for confounders. Since GxE interaction studies are inherently non-experimental, adjusting for potential confounding effects is critical for valid evaluation of GxE interactions. In this paper, we extend the reverse test to allow for confounders. The proposed reverse test also allows for exposure measurement errors as typically occurs. Extensive simulation experiments demonstrated that the proposed method not only provides greater statistical power under most simulation scenarios but also provides substantive computational efficiency, which achieves a computation time that is more than sevenfold less than that of the standard logistic regression test. In an illustrative example, we applied the proposed approach to the Veterans Aging Cohort Study (VACS) to search for genetic susceptibility loci modifying the smoking–HIV status association.


Author(s):  
Andrey Ziyatdinov ◽  
Jihye Kim ◽  
Dmitry Prokopenko ◽  
Florian Privé ◽  
Fabien Laporte ◽  
...  

Abstract The effective sample size (ESS) is a metric used to summarize in a single term the amount of correlation in a sample. It is of particular interest when predicting the statistical power of genome-wide association studies (GWAS) based on linear mixed models. Here, we introduce an analytical form of the ESS for mixed-model GWAS of quantitative traits and relate it to empirical estimators recently proposed. Using our framework, we derived approximations of the ESS for analyses of related and unrelated samples and for both marginal genetic and gene-environment interaction tests. We conducted simulations to validate our approximations and to provide a quantitative perspective on the statistical power of various scenarios, including power loss due to family relatedness and power gains due to conditioning on the polygenic signal. Our analyses also demonstrate that the power of gene-environment interaction GWAS in related individuals strongly depends on the family structure and exposure distribution. Finally, we performed a series of mixed-model GWAS on data from the UK Biobank and confirmed the simulation results. We notably found that the expected power drop due to family relatedness in the UK Biobank is negligible.


Author(s):  
Andrew A. Crawford ◽  
◽  
Sean Bankier ◽  
Elisabeth Altmaier ◽  
Catriona L. K. Barnes ◽  
...  

AbstractThe stress hormone cortisol modulates fuel metabolism, cardiovascular homoeostasis, mood, inflammation and cognition. The CORtisol NETwork (CORNET) consortium previously identified a single locus associated with morning plasma cortisol. Identifying additional genetic variants that explain more of the variance in cortisol could provide new insights into cortisol biology and provide statistical power to test the causative role of cortisol in common diseases. The CORNET consortium extended its genome-wide association meta-analysis for morning plasma cortisol from 12,597 to 25,314 subjects and from ~2.2 M to ~7 M SNPs, in 17 population-based cohorts of European ancestries. We confirmed the genetic association with SERPINA6/SERPINA1. This locus contains genes encoding corticosteroid binding globulin (CBG) and α1-antitrypsin. Expression quantitative trait loci (eQTL) analyses undertaken in the STARNET cohort of 600 individuals showed that specific genetic variants within the SERPINA6/SERPINA1 locus influence expression of SERPINA6 rather than SERPINA1 in the liver. Moreover, trans-eQTL analysis demonstrated effects on adipose tissue gene expression, suggesting that variations in CBG levels have an effect on delivery of cortisol to peripheral tissues. Two-sample Mendelian randomisation analyses provided evidence that each genetically-determined standard deviation (SD) increase in morning plasma cortisol was associated with increased odds of chronic ischaemic heart disease (0.32, 95% CI 0.06–0.59) and myocardial infarction (0.21, 95% CI 0.00–0.43) in UK Biobank and similarly in CARDIoGRAMplusC4D. These findings reveal a causative pathway for CBG in determining cortisol action in peripheral tissues and thereby contributing to the aetiology of cardiovascular disease.


2020 ◽  
Author(s):  
Brody Holohan ◽  
Raphael Laderman

AbstractGene-environment interactions are at the heart of why many complex traits are not fully heritable, and why prediction of disease incidence and individual response to environmental changes based on genetics has been underwhelming in utility. Understanding these interactions is the primary limiting factor for the application of personalized medicine, but current methods are not well suited for dealing with complex traits that pose both a dimensionality and sparse data problem to unsupervised analysis methods. Genteract has developed a proprietary analytical technique that allows for detection and interpretation of GxEs regarding specific pairs of a single phenotype with a single environmental factor; these methods allow us to develop a platform that can be used to predict how individuals will respond to changes in their environment based on their genetics. To validate the methods we performed two types of testing: cross-validation against a dataset of clinical study results, and application of the methods in a simulated dataset. These tests enable a greater understanding of the methods’ utility, statistical power and predictive capabilities.


2019 ◽  
Author(s):  
John A. Lees ◽  
T. Tien Mai ◽  
Marco Galardini ◽  
Nicole E. Wheeler ◽  
Jukka Corander

ABSTRACTDiscovery of influential genetic variants and prediction of phenotypes such as antibiotic resistance are becoming routine tasks in bacterial genomics. Genome-wide association study (GWAS) methods can be applied to study bacterial populations, with a particular emphasis on alignment-free approaches, which are necessitated by the more plastic nature of bacterial genomes. Here we advance bacterial GWAS by introducing a computationally scalable joint modeling framework, where genetic variants covering the entire pangenome are compactly represented by unitigs, and the model fitting is achieved using elastic net penalization. In contrast to current leading GWAS approaches, which test each genotype-phenotype association separately for each variant, our joint modelling approach is shown to lead to increased statistical power while maintaining control of the false positive rate. Our inference procedure also delivers an estimate of the narrow-sense heritability, which is gaining considerable interest in studies of bacteria. Using an extensive set of state-of-the-art bacterial population genomic datasets we demonstrate that our approach performs accurate phenotype prediction, comparable to popular machine learning methods, while retaining both interpretability and computational efficiency. We expect that these advances will pave the way for the next generation of high-powered association and prediction studies for an increasing number of bacterial species.


2018 ◽  
Vol 2018 ◽  
pp. 1-13 ◽  
Author(s):  
Andrea Maugeri ◽  
Martina Barchitta ◽  
Maria Grazia Mazzone ◽  
Francesco Giuliano ◽  
Antonella Agodi

Age-related macular degeneration (AMD) is the most common cause of visual loss in developed countries, with a significant economic and social burden on public health. Although genome-wide and gene-candidate studies have been enabled to identify genetic variants in the complement system associated with AMD pathogenesis, the effect of gene-environment interaction is still under debate. In this review we provide an overview of the role of complement system and its genetic variants in AMD, summarizing the consequences of the interaction between genetic and environmental risk factors on AMD onset, progression, and therapeutic response. Finally, we discuss the perspectives of current evidence in the field of genomics driven personalized medicine and public health.


Circulation ◽  
2021 ◽  
Vol 143 (Suppl_1) ◽  
Author(s):  
Kenneth E Westerman

Background: Gene-environment interaction (GEI) analysis enables us to understand how genetic variants modify the effects of environmental exposures on cardiometabolic risk factors, providing a foundation for genome-based precision medicine. Ideally, these interactions could be mapped comprehensively across all measured genetic variants, exposures, and outcomes, but this approach is computationally intensive and statistically underpowered. Recent studies have shown that variance-quantitative trait loci (vQTLs), or genetic variants that associate with differential variance of an outcome, are substantially enriched for underlying GEIs. Here, we sought to first identify vQTLs for cardiometabolic traits, then use this smaller genetic search space to uncover novel gene-environment interactions across thousands of environmental exposures. Methods: A two-stage, multi-ancestry analysis was conducted in 355,790 unrelated participants from the UK Biobank. First, we performed a genome-wide vQTL scan for each of 20 serum metabolic biomarkers, including but not limited to lipids, lipoproteins, and glycemic measures. This scan used Levene’s test to identify genetic markers whose genotypes are associated with the variance, rather than the mean, of the biomarker. Next, we collected over 2000 variables corresponding to socioeconomic, dietary, lifestyle, and clinical exposures, and conducted an interaction analysis for each combination of exposure and vQTL-biomarker pair. For each stage, the analysis was initially stratified by ancestry then meta-analyzed to generate the primary set of results. Results: vQTLs were identified at 514 independent regions in the genome, with most of these genetic variants already known to affect the mean biomarker level. In the subsequent gene-environment interaction analysis, we found 2,162 significant interactions passing a stringent significance threshold adjusted for multiple testing ( p < 0.05 / 578 vQTL-biomarker pairs / 2140 exposures = 4х10 -8 ). Some of these expanded on existing findings; for example, genetic marker rs2393775 in the HNF1A gene interacted with education level (as a proxy for socioeconomic status) to influence hsCRP ( p = 1.3х10 -10 ), building on a previous finding that HNF1A variants modify the effect of perceived stress on cardiovascular outcomes. Others highlighted novel biology, such as an interaction between variants near the fatty liver-associated gene TM6SF2 and oily fish intake on total and LDL-cholesterol levels ( p = 6.6х10 -9 ). Conclusions: Our systematic GEI discovery effort identified thousands of interactions that may impact cardiometabolic risk, both expanding on previous research and identifying novel biological mechanisms. This catalog of vQTLs and interactions can inform future mechanistic studies and provides a knowledge base for genome-centered precision approaches to cardiometabolic health.


2019 ◽  
Vol 21 (3) ◽  
pp. 753-761 ◽  
Author(s):  
Regina Brinster ◽  
Dominique Scherer ◽  
Justo Lorenzo Bermejo

Abstract Population stratification is usually corrected relying on principal component analysis (PCA) of genome-wide genotype data, even in populations considered genetically homogeneous, such as Europeans. The need to genotype only a small number of genetic variants that show large differences in allele frequency among subpopulations—so-called ancestry-informative markers (AIMs)—instead of the whole genome for stratification adjustment could represent an advantage for replication studies and candidate gene/pathway studies. Here we compare the correction performance of classical and robust principal components (PCs) with the use of AIMs selected according to four different methods: the informativeness for assignment measure ($IN$-AIMs), the combination of PCA and F-statistics, PCA-correlated measurement and the PCA weighted loadings for each genetic variant. We used real genotype data from the Population Reference Sample and The Cancer Genome Atlas to simulate European genetic association studies and to quantify type I error rate and statistical power in different case–control settings. In studies with the same numbers of cases and controls per country and control-to-case ratios reflecting actual rates of disease prevalence, no adjustment for population stratification was required. The unnecessary inclusion of the country of origin, PCs or AIMs as covariates in the regression models translated into increasing type I error rates. In studies with cases and controls from separate countries, no investigated method was able to adequately correct for population stratification. The first classical and the first two robust PCs achieved the lowest (although inflated) type I error, followed at some distance by the first eight $IN$-AIMs.


2019 ◽  
Vol 37 (1) ◽  
pp. 295-299 ◽  
Author(s):  
Sergei L Kosakovsky Pond ◽  
Art F Y Poon ◽  
Ryan Velazquez ◽  
Steven Weaver ◽  
N Lance Hepler ◽  
...  

Abstract HYpothesis testing using PHYlogenies (HyPhy) is a scriptable, open-source package for fitting a broad range of evolutionary models to multiple sequence alignments, and for conducting subsequent parameter estimation and hypothesis testing, primarily in the maximum likelihood statistical framework. It has become a popular choice for characterizing various aspects of the evolutionary process: natural selection, evolutionary rates, recombination, and coevolution. The 2.5 release (available from www.hyphy.org) includes a completely re-engineered computational core and analysis library that introduces new classes of evolutionary models and statistical tests, delivers substantial performance and stability enhancements, improves usability, streamlines end-to-end analysis workflows, makes it easier to develop custom analyses, and is mostly backward compatible with previous HyPhy releases.


Sign in / Sign up

Export Citation Format

Share Document