scholarly journals Genetic Nature or Genetic Nurture? Quantifying Bias in Analyses Using Polygenic Scores

2019 ◽  
Author(s):  
Sam Trejo ◽  
Benjamin W. Domingue

AbstractSummary statistics from a genome-wide association study (GWAS) can be used to generate a polygenic score (PGS). For complex, behavioral traits, the correlation between an individual’s PGS and their phenotype may contain bias alongside the causal effect of the individual’s genes (due to geographic, ancestral, and/or socioeconomic confounding). We formalize the recent introduction of a different source of bias in regression models using PGSs: the effects of parental genes on offspring outcomes, also known as genetic nurture. GWAS do not discriminate between the various pathways through which genes influence outcomes, meaning existing PGSs capture both direct genetic effects and genetic nurture effects. We construct a theoretical model for genetic effects and show that, unlike other sources of bias in PGSs, the presence of genetic nurture biases PGS coefficients from both naïve OLS (between-family) and family fixed effects (within-family) regressions. This bias is in opposite directions; while naïve OLS estimates are biased upwards, family fixed effects estimates are biased downwards. We quantify this bias for a given trait using two novel parameters that we identify and discuss: (1) the genetic correlation between the direct and nurture effects and (2) the ratio of the SNP heritabilities for the direct and nurture effects.

2018 ◽  
Vol 115 (22) ◽  
pp. E4970-E4979 ◽  
Author(s):  
Thomas A. DiPrete ◽  
Casper A. P. Burik ◽  
Philipp D. Koellinger

Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA.


2016 ◽  
Vol 47 (5) ◽  
pp. 971-980 ◽  
Author(s):  
S. H. Gage ◽  
H. J. Jones ◽  
S. Burgess ◽  
J. Bowden ◽  
G. Davey Smith ◽  
...  

BackgroundObservational associations between cannabis and schizophrenia are well documented, but ascertaining causation is more challenging. We used Mendelian randomization (MR), utilizing publicly available data as a method for ascertaining causation from observational data.MethodWe performed bi-directional two-sample MR using summary-level genome-wide data from the International Cannabis Consortium (ICC) and the Psychiatric Genomics Consortium (PGC2). Single nucleotide polymorphisms (SNPs) associated with cannabis initiation (p < 10−5) and schizophrenia (p < 5 × 10−8) were combined using an inverse-variance-weighted fixed-effects approach. We also used height and education genome-wide association study data, representing negative and positive control analyses.ResultsThere was some evidence consistent with a causal effect of cannabis initiation on risk of schizophrenia [odds ratio (OR) 1.04 per doubling odds of cannabis initiation, 95% confidence interval (CI) 1.01–1.07, p = 0.019]. There was strong evidence consistent with a causal effect of schizophrenia risk on likelihood of cannabis initiation (OR 1.10 per doubling of the odds of schizophrenia, 95% CI 1.05–1.14, p = 2.64 × 10−5). Findings were as predicted for the negative control (height: OR 1.00, 95% CI 0.99–1.01, p = 0.90) but weaker than predicted for the positive control (years in education: OR 0.99, 95% CI 0.97–1.00, p = 0.066) analyses.ConclusionsOur results provide some that cannabis initiation increases the risk of schizophrenia, although the size of the causal estimate is small. We find stronger evidence that schizophrenia risk predicts cannabis initiation, possibly as genetic instruments for schizophrenia are stronger than for cannabis initiation.


2017 ◽  
Author(s):  
Thomas A. DiPrete ◽  
Casper A.P. Burik ◽  
Philipp D. Koellinger

Identifying causal effects in non-experimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables (i.e. Mendelian Randomization - MR). However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in non-experimental data that would also undermine the ability of MR to correct for endogeneity bias from non-genetic sources. Here, we propose an alternative approach, GIV regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGS) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into non-overlapping subsamples, we obtain multiple indicators of the outcome PGS that can be used as instruments for each other, and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA.


2021 ◽  
Author(s):  
Atul Kumar ◽  
Maryam Shoai ◽  
Sebastian Palmqvist ◽  
Erik Stomrud ◽  
John Hardy ◽  
...  

Abstract Background Cognitive decline in early-stage Alzheimer’s disease (AD) may depend on genetic variability. Methods In the Swedish BioFINDER study, we used polygenic scores (PGS) (for AD, intelligence and educational attainment), and genetic variants (in a genome-wide association study [GWAS]) to predict longitudinal cognitive change (measured by MMSE) over a mean of 4.2 years. We included 555 β-amyloid (Aβ) negative cognitively unimpaired (CU) individuals, 206 Aβ-positive CU (preclinical AD), 110 Aβ-negative mild cognitive impairment (MCI) patients, and 146 Aβ-positive MCI patients (prodromal AD). Results Polygenic scores for AD (in Aβ-positive individuals) and intelligence (independent of Aβ-status) were associated with cognitive decline. Eight genes were associated with cognitive decline in GWAS (3 independent of Aβ-status). Conclusions AD risk genes may influence cognitive decline in early AD, while genes related to intelligence may modulate cognitive decline irrespective of disease. Therapies targeting the implicated biological pathways may modulate the clinical course of AD.


2019 ◽  
Vol 97 (Supplement_2) ◽  
pp. 40-40
Author(s):  
Garrett See ◽  
Melanie Trenhaile-Gannemann ◽  
Daniel Ciobanu ◽  
Matthew L Spangler ◽  
Benny Mote

Abstract The objective of the current study was to conduct a genome-wide association on gestation length (GL) in different parities in swine. Sows (n = 831) belonging to the University of Nebraska – Lincoln resource population (Landrace X Nebraska Index Line) were utilized. GL was defined as the number of days between the final insemination and farrowing. Four traits, GL at parity 1, 2, 3 and 4 (GL1, GL2, GL3 and GL4, respectively) were investigated. Animals which were induced 24h prior to a farrowing event were removed from the analysis. Sows were genotyped with the Illumina SNP60 BeadArray. A Bayes C model with π=0.995 was implemented with fixed effects of contemporary group, development pen, diet, linear and quadratic terms for age at puberty (GL1; P < 0.01), and linear and quadratic terms for farrowing age (GL2; P < 0.01). Results are posterior means of 55,000 samples.Single marker association analysis (SMA) was performed in R utilizing a linear model on SNP from 1-Mb windows (n = 10) which explained the largest proportion of genetic variation in GL1. Top 10 (0.5% of all windows) 1-Mb windows accounted for a limited proportion of genetic variance, 7.75, 4.66, 3.45 and 2.05% in GL1, GL2, GL3 and GL4, respectively. Posterior mean heritability estimates (posterior SD) for GL1, GL2, GL3 and GL4 were 0.33 (0.06), 0.34 (0.07), 0.32 (0.08) and 0.20 (0.08), respectively. The top SNP (ASGA0017859, SSC4, 7.8 Mb) located in one of the two top common genomic regions associated with GL1, GL2 and GL3 displayed a difference of 1.1d in GL1 between alternate homozygotes (P < 0.01). The top SNP from nine of the ten regions were significant (P < 0.05) in the SMA. Two of these regions were in common with GL2 and GL3 where SNP with potential functional effects were found in ZFAT, MAML2 and CCDC82. Results suggest GL is a largely polygenic trait.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Peitao Wu ◽  
Biqi Wang ◽  
Steven A. Lubitz ◽  
Emelia J. Benjamin ◽  
James B. Meigs ◽  
...  

AbstractBecause single genetic variants may have pleiotropic effects, one trait can be a confounder in a genome-wide association study (GWAS) that aims to identify loci associated with another trait. A typical approach to address this issue is to perform an additional analysis adjusting for the confounder. However, obtaining conditional results can be time-consuming. We propose an approximate conditional phenotype analysis based on GWAS summary statistics, the covariance between outcome and confounder, and the variant minor allele frequency (MAF). GWAS summary statistics and MAF are taken from GWAS meta-analysis results while the traits covariance may be estimated by two strategies: (i) estimates from a subset of the phenotypic data; or (ii) estimates from published studies. We compare our two strategies with estimates using individual level data from the full GWAS sample (gold standard). A simulation study for both binary and continuous traits demonstrates that our approximate approach is accurate. We apply our method to the Framingham Heart Study (FHS) GWAS and to large-scale cardiometabolic GWAS results. We observed a high consistency of genetic effect size estimates between our method and individual level data analysis. Our approach leads to an efficient way to perform approximate conditional analysis using large-scale GWAS summary statistics.


2017 ◽  
Author(s):  
Genevieve L Wojcik ◽  
Chelsea Marie ◽  
Mayuresh M Abhyankar ◽  
Nobuya Yoshida ◽  
Koji Watanabe ◽  
...  

AbstractDiarrhea is the second leading cause of death for children globally, causing 760,000 deaths each year in children under the age of 5. Amoebic dysentery contributes significantly to this burden, especially in developing countries. We hypothesize that genetic variation contributes to susceptibility to diarrhea-associated Entamoeba histolytica infection in Bangladeshi infants; thus, we conducted a genome-wide association study (GWAS) in two independent birth cohorts of diarrhea-associated E. histolytica infection. Cases were defined as children with at least one diarrheal episode positive for E. histolytica through either PCR or ELISA within the first year of life. Controls were children without any episodes positive for E. histolytica in the same time frame. Meta-analyses under a fixed-effects inverse variance weighting model identified variants in two neighboring genes on chromosome 10: CUL2 (cullin 2) and CREM (cAMP responsive element modulator) associated with E. histolytica infection, with SNP rs58000832 achieving genome-wide significance (Pmeta=4.2x10−10). Each additional risk allele (an intergenic insertion between CREM and CCNY) of rs58000832 conferred 2.5 increased odds of a diarrhea-associated E. histolytica infection. The most associated SNP within a gene was in an intron of CREM (rs58468685, Pmeta=2.3x10−9), which with CUL2, has been implicated as a susceptibility locus for Inflammatory Bowel Disease (IBD) and Crohn’s Disease. Gene expression resources suggest these loci are related to the higher expression of CREM, but not CUL2. Increased CREM expression is also observed in early E. histolytica infection. Further, CREM-/- mice were more susceptible to E. histolytica amebic colitis. These genetic associations reinforce the pathological similarities observed in gut inflammation between E. histolytica infection and IBD.


2021 ◽  
Author(s):  
Catherine Doust ◽  
Pierre Fontanillas ◽  
Else Eising ◽  
Scott D Gordon ◽  
Zhengjun Wang ◽  
...  

Reading and writing are crucial for many aspects of modern life but up to 1 in 10 children are affected by dyslexia, which can persist into adulthood. Family studies of dyslexia suggest heritability up to 70%, yet no convincing genetic markers have been found due to limited study power. Here, we present a genome-wide association study representing a 20-fold increase in sample size from prior work, with 51,800 adults self-reporting a dyslexia diagnosis and 1,087,070 controls. We identified 42 independent genome-wide significant loci: 17 are in genes linked to or pleiotropic with cognitive ability/educational attainment; 25 are novel and may be more specifically associated with dyslexia. Twenty-three loci (12 novel) were validated in independent cohorts of Chinese and European ancestry. We confirmed a similar genetic aetiology of dyslexia between sexes, and found genetic covariance with many traits, including ambidexterity, but not neuroanatomical measures of language-related circuitry. Causal analyses revealed a directional effect of dyslexia on attention deficit hyperactivity disorder and bidirectional effects on socio-educational traits but these relationships require further investigation. Dyslexia polygenic scores explained up to 6% of variance in reading traits in independent cohorts, and might in future enable earlier identification and remediation of dyslexia.


2020 ◽  
Author(s):  
Gunn-Helen Moen ◽  
Robin N Beaumont ◽  
Christine Sommer ◽  
Beverley M. Shields ◽  
Deborah A Lawlor ◽  
...  

AbstractLower maternal serum vitamin B12 (B12) and folate levels have been associated with lower offspring birthweight in observational studies. The aim of this study was to investigate whether this relationship is causal.We performed two-sample Mendelian Randomization (MR) using summary data on associations between genotype-B12 (10 genetic variants) or genotype-folate (4 genetic variants) levels from a genome-wide association study of 45,576 individuals (sample 1) and maternal-specific genetic effects on offspring birthweight from the latest EGG consortium meta-analysis with 297,356 individuals reporting their own birthweight and 210,248 women reporting their offspring’s birthweight (sample 2). To investigate the effect of offspring’s own B12 or folate levels on their own birthweight, we performed two-sample MR using the fetal-specific genetic effects from the latest EGG consortium meta-analysis. We used the inverse variance weighted method, and sensitivity analyses to account for pleiotropy, in addition to sensitivity analyses excluding a potentially pleiotropic variant in the FUT2 gene for B12.We did not find evidence for a causal effect of maternal B12 on offspring birthweight, nor evidence for an effect of offspring B12 on their own birthweight using the fetal-specific genetic effect. The results were consistent across the different methods and in sensitivity analyses excluding the FUT2 variant. We found a positive effect of maternal folate on offspring birthweight (0.146 [0.065, 0.227], which corresponds to an increase in birthweight of 71g per 1SD higher folate). We found some evidence for a small inverse effect of fetal folate on their own birthweight (−0.051 [−0.100, −0.003]).In conclusion, our results are consistent with evidence from randomized controlled trials that increased maternal folate levels increase offspring birthweight. We did not find evidence for a causal effect of B12 on offspring birthweight, suggesting previous observational studies may have been due to confounding.


2021 ◽  
Author(s):  
Atul Kumar ◽  
Maryam Shoai ◽  
Sebastian Palmqvist ◽  
Erik Stomrud ◽  
John Hardy ◽  
...  

Abstract Background Cognitive decline in early-stage Alzheimer’s disease (AD) may depend on genetic variability. Methods In the Swedish BioFINDER study, we used polygenic scores (PGS) (for AD, intelligence and educational attainment), and genetic variants (in a genome-wide association study [GWAS]) to predict longitudinal cognitive change (measured by MMSE) over a mean of 4.2 years. We included 555 β-amyloid (Aβ) negative cognitively unimpaired (CU) individuals, 206 Aβ-positive CU (preclinical AD), 110 Aβ-negative mild cognitive impairment (MCI) patients, and 146 Aβ-positive MCI patients (prodromal AD). Results Polygenic scores for AD (in Aβ-positive individuals) and intelligence (independent of Aβ-status) were associated with cognitive decline. Eight genes were associated with cognitive decline in GWAS (3 independent of Aβ-status). Conclusions AD risk genes may influence cognitive decline in early AD, while genes related to intelligence may modulate cognitive decline irrespective of disease. Therapies targeting the implicated biological pathways may modulate the clinical course of AD.


Sign in / Sign up

Export Citation Format

Share Document