Population-specific causal disease effect sizes in functionally important regions impacted by selection

2019 ◽  
Author(s):  
Huwenbo Shi ◽  
Steven Gazal ◽  
Masahiro Kanai ◽  
Evan M. Koch ◽  
Armin P. Schoech ◽  
...  

Abstract Many diseases and complex traits exhibit population-specific causal effect sizes with trans-ethnic genetic correlations significantly less than 1, limiting trans-ethnic polygenic risk prediction. We developed a new method, S-LDXR, for stratifying squared trans-ethnic genetic correlation across genomic annotations, and applied S-LDXR to genome-wide association summary statistics for 31 diseases and complex traits in East Asians (EAS) and Europeans (EUR) (average N_EAS = 90K, N_EUR = 267K) with an average trans-ethnic genetic correlation of 0.85 (s.e. 0.01). We determined that squared trans-ethnic genetic correlation was 0.82× (s.e. 0.01) smaller than the genome-wide average at SNPs in the top quintile of background selection statistic, implying more population-specific causal effect sizes. Accordingly, causal effect sizes were more population-specific in functionally important regions, including conserved and regulatory regions. In analyses of regions surrounding specifically expressed genes, causal effect sizes were most population-specific for skin and immune genes and least population-specific for brain genes. Our results could potentially be explained by stronger gene-environment interaction at loci impacted by selection, particularly positive selection.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Huwenbo Shi ◽  
Steven Gazal ◽  
Masahiro Kanai ◽  
Evan M. Koch ◽  
Armin P. Schoech ◽  
...  

Abstract Many diseases exhibit population-specific causal effect sizes with trans-ethnic genetic correlations significantly less than 1, limiting trans-ethnic polygenic risk prediction. We develop a new method, S-LDXR, for stratifying squared trans-ethnic genetic correlation across genomic annotations, and apply S-LDXR to genome-wide summary statistics for 31 diseases and complex traits in East Asians (average N = 90K) and Europeans (average N = 267K) with an average trans-ethnic genetic correlation of 0.85. We determine that squared trans-ethnic genetic correlation is 0.82× (s.e. 0.01) depleted in the top quintile of background selection statistic, implying more population-specific causal effect sizes. Accordingly, causal effect sizes are more population-specific in functionally important regions, including conserved and regulatory regions. In regions surrounding specifically expressed genes, causal effect sizes are most population-specific for skin and immune genes, and least population-specific for brain genes. Our results could potentially be explained by stronger gene-environment interaction at loci impacted by selection, particularly positive selection.
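
To make the idea of a depleted squared correlation concrete, here is a minimal toy sketch in Python. It is not S-LDXR, which works from GWAS summary statistics and accounts for linkage disequilibrium. The sketch assumes that the true per-SNP causal effects in both populations are known, treats the squared correlation as an uncentered squared correlation of those effects, and compares its value inside one annotation with the genome-wide value. The annotation mask, the simulated correlations (0.75 inside, 0.90 outside), and all function names are hypothetical choices for illustration only.

```python
import numpy as np


def squared_correlation(beta_a, beta_b):
    """Squared uncentered correlation between two vectors of effect sizes."""
    cross = np.sum(beta_a * beta_b)
    return cross**2 / (np.sum(beta_a**2) * np.sum(beta_b**2))


def simulate_pair(rng, n, corr):
    """Draw two unit-variance effect vectors with the requested correlation."""
    first = rng.standard_normal(n)
    noise = rng.standard_normal(n)
    return first, corr * first + np.sqrt(1 - corr**2) * noise


rng = np.random.default_rng(0)
n_snps = 100_000

# Hypothetical annotation: the first fifth of SNPs stands in for a "top quintile".
in_annotation = np.arange(n_snps) < n_snps // 5
n_in = int(in_annotation.sum())

beta_eas = np.empty(n_snps)
beta_eur = np.empty(n_snps)
# Assumed correlations: weaker inside the annotation than outside it.
beta_eas[in_annotation], beta_eur[in_annotation] = simulate_pair(rng, n_in, 0.75)
beta_eas[~in_annotation], beta_eur[~in_annotation] = simulate_pair(rng, n_snps - n_in, 0.90)

genome_wide = squared_correlation(beta_eas, beta_eur)
within_annotation = squared_correlation(beta_eas[in_annotation], beta_eur[in_annotation])

# A ratio below 1 means effects are more population-specific inside the annotation.
ratio = within_annotation / genome_wide
print(f"genome-wide: {genome_wide:.3f}, annotation: {within_annotation:.3f}, ratio: {ratio:.3f}")
```

A ratio below 1 in this toy setting corresponds to the kind of depletion the abstract reports for the top quintile of the background selection statistic. The numbers printed here come from the simulation settings, not from the study.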



2015 ◽  
Author(s):  
Brendan Bulik-Sullivan ◽  
Hilary K Finucane ◽  
Verneri Anttila ◽  
Alexander Gusev ◽  
Felix R Day ◽  
...  

Identifying genetic correlations between complex traits and diseases can provide useful etiological insights and help prioritize likely causal relationships. The major challenges preventing estimation of genetic correlation from genome-wide association study (GWAS) data with current methods are the lack of availability of individual genotype data and widespread sample overlap among meta-analyses. We circumvent these difficulties by introducing a technique for estimating genetic correlation that requires only GWAS summary statistics and is not biased by sample overlap. We use our method to estimate 300 genetic correlations among 25 traits, totaling more than 1.5 million unique phenotype measurements. Our results include genetic correlations between anorexia nervosa and schizophrenia/body mass index and associations between educational attainment and several diseases. These results highlight the power of a polygenic modeling framework, since there currently are no genome-wide significant SNPs for anorexia nervosa and only three for educational attainment.



2021 ◽  
Author(s):  
Gui-Juan Feng ◽  
Qian Xu ◽  
Jing-Jing Ni ◽  
Shan-Shan Yang ◽  
Bai-Xue Han ◽  
...  

Abstract Age at menarche (AAM) is a marker of female puberty. It is a heritable trait associated with various adult diseases. However, the genetic mechanism that determines AAM and links it to disease risk is poorly understood. Aiming to uncover the genetic basis for AAM, we conducted a joint association study in up to 438,089 participants from 3 genome-wide association studies of European and East Asian ancestries. Twenty-one novel genomic loci were identified at the genome-wide significance level. In addition, we observed significant genetic correlations between AAM and 67 complex traits, the strongest being between AAM and body mass index (r_g=−0.19, P=6.11×10^−31). Latent causal variable analyses indicate a genetically causal effect of AAM on high blood pressure (GCP=0.47, P=0.02), forced vital capacity (GCP=0.63, P=0.02), age at first live birth (GCP=0.51, P=0.03), impedance of right arm (GCP=0.41, P<1×10^−7) and right leg fat percentage (GCP=−0.10, P=0.02), among others. Enrichment analysis identified 5 enriched tissues and 51 enriched gene sets. Four of the five enriched tissues were related to the nervous system, including the hypothalamus middle, hypothalamo hypophyseal system, neurosecretory systems and hypothalamus. The fifth tissue was the retina in the sensory organ. The most significant gene set was ‘decreased circulating luteinizing hormone level’ (P=2.45×10^−6). Our findings may provide useful insights into the mechanisms determining AAM and the genetic interplay between AAM and some traits of women.



2021 ◽  
Author(s):  
Abdel Abdellaoui ◽  
Karin Verweij ◽  
Michel G Nivard

Abstract Gene-environment correlations can bias associations between genetic variants and complex traits in genome-wide association studies (GWASs). Here, we control for geographic sources of gene-environment correlation in GWASs on 56 complex traits (N = 69,772–271,457). Controlling for geographic region significantly decreases heritability signals for SES-related traits, most strongly for educational attainment and income, indicating that socio-economic differences between regions induce gene-environment correlations that become part of the polygenic signal. For most other complex traits investigated, genetic correlations with educational attainment and income are significantly reduced, most significantly for traits related to BMI, sedentary behavior, and substance use. Controlling for current address has greater impact on the polygenic signal than birth place, suggesting both active and passive sources of gene-environment correlations. Our results show that societal sources of social stratification that extend beyond families introduce regional-level gene-environment correlations that affect GWAS results.



2021 ◽  
Author(s):  
Abdel Abdellaoui ◽  
Karin J.H. Verweij ◽  
Michel G. Nivard

Gene-environment correlations can bias associations between genetic variants and complex traits in genome-wide association studies (GWASs). Here, we control for geographic sources of gene-environment correlation in GWASs on 56 complex traits (N=69,772-271,457). Controlling for geographic region significantly decreases heritability signals for SES-related traits, most strongly for educational attainment and income, indicating that socio-economic differences between regions induce gene-environment correlations that become part of the polygenic signal. For most other complex traits investigated, genetic correlations with educational attainment and income are significantly reduced, most significantly for traits related to BMI, sedentary behavior, and substance use. Controlling for current address has greater impact on the polygenic signal than birth place, suggesting both active and passive sources of gene-environment correlations. Our results show that societal sources of social stratification that extend beyond families introduce regional-level gene-environment correlations that affect GWAS results.



2017 ◽  
Author(s):  
Luke J. O’Connor ◽  
Alkes L. Price

Abstract Mendelian randomization (MR) is widely used to identify causal relationships among heritable traits, but it can be confounded by genetic correlations reflecting shared etiology. We propose a model in which a latent causal variable mediates the genetic correlation between two traits. Under the latent causal variable (LCV) model, trait 1 is fully genetically causal for trait 2 if it is perfectly genetically correlated with the latent causal variable, implying that the entire genetic component of trait 1 is causal for trait 2; it is partially genetically causal for trait 2 if it has a high genetic correlation with the latent variable, implying that part of the genetic component of trait 1 is causal for trait 2. To quantify the degree of partial genetic causality, we define the genetic causality proportion (gcp). We fit this model using mixed fourth moments $E(\alpha_1^2 \alpha_1 \alpha_2)$ and $E(\alpha_2^2 \alpha_1 \alpha_2)$ of marginal effect sizes for each trait, exploiting the fact that if trait 1 is causal for trait 2 then SNPs affecting trait 1 (large $\alpha_1^2$) will have correlated effects on trait 2 (large $\alpha_1 \alpha_2$), but not vice versa. We performed simulations under a wide range of genetic architectures and determined that LCV, unlike state-of-the-art MR methods, produced well-calibrated false positive rates and reliable gcp estimates in the presence of genetic correlations and asymmetric genetic architectures; we also determined that LCV is well-powered to detect a causal effect. We applied LCV to GWAS summary statistics for 52 traits (average N=331k), identifying partially or fully genetically causal effects (1% FDR) for 59 pairs of traits, including 30 pairs of traits with high gcp estimates (gĉp > 0.6). Results consistent with the published literature included genetically causal effects on myocardial infarction (MI) for LDL, triglycerides and BMI. Novel findings included a genetically causal effect of LDL on bone mineral density, consistent with clinical trials of statins in osteoporosis. These results demonstrate that it is possible to distinguish between genetic correlation and causation using genetic data.
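
The asymmetry between the two mixed fourth moments can be illustrated with a small simulation. This is a toy sketch only, not the LCV estimator. It assumes a fully causal setting in which trait 2 effects equal a scaled copy of trait 1 effects plus independent effects. It works with true per-SNP effects rather than marginal GWAS estimates, so it ignores linkage disequilibrium and sampling noise. It also assumes sparse (heavy-tailed) effects, because for purely Gaussian effects the two moments coincide. The sparsity level, the correlation `rho`, and all names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
n_snps = 200_000
causal_fraction = 0.05  # assumed share of SNPs with nonzero effect
rho = 0.5               # assumed genetic correlation between the two traits


def sparse_effects(rng, n, fraction):
    """Unit-variance effects that are nonzero for roughly `fraction` of SNPs."""
    is_causal = rng.random(n) < fraction
    return is_causal * rng.standard_normal(n) / np.sqrt(fraction)


# Trait 1 is causal for trait 2: every trait 1 SNP also moves trait 2,
# while trait 2 additionally has its own independent SNP effects.
alpha1 = sparse_effects(rng, n_snps, causal_fraction)
alpha2 = rho * alpha1 + np.sqrt(1 - rho**2) * sparse_effects(rng, n_snps, causal_fraction)

# Mixed fourth moments E(a1^2 * a1*a2) and E(a2^2 * a1*a2).
moment_1 = np.mean(alpha1**2 * alpha1 * alpha2)
moment_2 = np.mean(alpha2**2 * alpha1 * alpha2)

print(f"E(a1^2 a1 a2) = {moment_1:.2f}")
print(f"E(a2^2 a1 a2) = {moment_2:.2f}")
```

In this setup the first moment should come out clearly larger than the second. SNPs with large trait 1 effects always carry a correlated trait 2 effect, while many SNPs with large trait 2 effects do not touch trait 1 at all.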



Author(s):  
Carol Kan ◽  
Ma-Li Wong

An association between type 2 diabetes mellitus (T2DM) and depression has been reported in epidemiological studies. Finding a genetic overlap between T2DM and depression will provide evidence to support a common biological pathway to both disorders. Genetic correlations observed from twin studies indicate that a small proportion of the variance in liability can be attributed to genetic factors. However, no genetic overlap has been observed between T2DM and depression in genome-wide association studies using both the polygenic score and the linkage disequilibrium score regression approaches. Clarifying the shared heritability between these two complex traits is an important next step towards better therapy and treatment. Another area that needs to be explored is gene–environment interaction, since genotypes can affect an individual’s responses to the environment and the environment can differentially affect genotype expression.



2015 ◽  
Vol 2015 ◽  
pp. 1-7 ◽  
Author(s):  
Haiming Xu ◽  
Beibei Jiang ◽  
Yujie Cao ◽  
Yingxin Zhang ◽  
Xiaodeng Zhan ◽  
...  

With the development of sequencing technology, dense single nucleotide polymorphisms (SNPs) have become available, making it possible to uncover the genetic architecture of complex traits by genome-wide association study (GWAS). However, the current GWAS strategy usually ignores epistatic and gene-environment interactions because of the absence of appropriate methodology and the heavy computational burden. This study proposed a new GWAS strategy that combines the graphics processing unit- (GPU-) based generalized multifactor dimensionality reduction (GMDR) algorithm with a mixed linear model approach. The reliability and efficiency of the analytical methods were verified through Monte Carlo simulations, suggesting that a population size of nearly 150 recombinant inbred lines (RILs) had a reasonable resolution for the scenarios considered. Further, a GWAS was conducted with the above two-step strategy to investigate the additive, epistatic, and gene-environment associations between 701,867 SNPs and three important quality traits, gelatinization temperature, amylose content, and gel consistency, in a RIL population with 138 individuals derived from super-hybrid rice Xieyou9308 in two environments. Four significant SNPs were identified with additive, epistatic, and gene-environment interaction effects. Our study showed that the mixed linear model approach combined with the GPU-based GMDR algorithm is a feasible strategy for implementing GWAS to uncover the genetic architecture of complex crop traits.



2016 ◽  
Author(s):  
Huwenbo Shi ◽  
Nicholas Mancuso ◽  
Sarah Spendlove ◽  
Bogdan Pasaniuc

Abstract Although genetic correlations between complex traits provide valuable insights into epidemiological and etiological studies, a precise quantification of which genomic regions contribute to the genome-wide genetic correlation is currently lacking. Here, we introduce ρ-HESS, a technique to quantify the correlation between pairs of traits due to genetic variation at a small region in the genome. Our approach only requires GWAS summary data and makes no distributional assumption on the causal variant effect sizes while accounting for linkage disequilibrium (LD) and overlapping GWAS samples. We analyzed large-scale GWAS summary data across 35 complex traits, and identified 27 genomic regions that contribute significantly to the genetic correlation among these traits. Notably, we find 7 genomic regions that contribute to the genetic correlation of 12 pairs of traits that show negligible genome-wide correlation, further showcasing the power of local genetic correlation analyses. Finally, we leverage the distribution of local genetic correlations across the genome to assign putative direction of causality for 15 pairs of traits.



2016 ◽  
Author(s):  
Jie Zheng ◽  
A. Mesut Erzurumluoglu ◽  
Benjamin L. Elsworth ◽  
Laurence Howe ◽  
Philip C. Haycock ◽  
...  

Abstract Motivation: LD score regression is a reliable and efficient method of using genome-wide association study (GWAS) summary-level results data to estimate the SNP heritability of complex traits and diseases, partition this heritability into functional categories, and estimate the genetic correlation between different phenotypes. Because the method relies on summary level results data, LD score regression is computationally tractable even for very large sample sizes. However, publicly available GWAS summary-level data are typically stored in different databases and have different formats, making it difficult to apply LD score regression to estimate genetic correlations across many different traits simultaneously. Results: In this manuscript, we describe LD Hub – a centralized database of summary-level GWAS results for 177 diseases/traits from different publicly available resources/consortia and a web interface that automates the LD score regression analysis pipeline. To demonstrate functionality and validate our software, we replicated previously reported LD score regression analyses of 49 traits/diseases using LD Hub; and estimated SNP heritability and the genetic correlation across the different phenotypes. We also present new results obtained by uploading a recent atopic dermatitis GWAS meta-analysis to examine the genetic correlation between the condition and other potentially related traits. In response to the growing availability of publicly accessible GWAS summary-level results data, our database and the accompanying web interface will ensure maximal uptake of the LD score regression methodology, provide a useful database for the public dissemination of GWAS results, and provide a method for easily screening hundreds of traits for overlapping genetic aetiologies. Availability and implementation: The web interface and instructions for using LD Hub are available at http://ldsc.broadinstitute.org/


