scholarly journals Leveraging correlations between polygenic risk score predictors to detect heterogeneity in GWAS cohorts

2019 ◽  
Author(s):  
Jie Yuan ◽  
Henry Xing ◽  
Alexandre Lamy ◽  
Todd Lencz ◽  
Itsik Pe’er ◽  
...  

AbstractEvidence from both GWAS and clinical observation has suggested that certain psychiatric, metabolic, and autoimmune diseases are heterogeneous, comprising multiple subtypes with distinct genomic etiologies and Polygenic Risk Scores (PRS). However, the presence of subtypes within many phenotypes is frequently unknown. We present CLiP (Correlated Liability Predictors), a method to detect heterogeneity in single GWAS cohorts. CLiP calculates a weighted sum of correlations between SNPs contributing to a PRS on the case/control liability scale. We demonstrate mathematically and through simulation that among i.i.d. homogeneous cases, significant anti-correlations are expected between otherwise independent predictors due to ascertainment on the hidden liability score. In the presence of heterogeneity from distinct etiologies, confounding by covariates, or mislabeling, these correlation patterns are altered predictably. We further extend our method to two additional association study designs: CLiP-X for quantitative predictors in applications such as transcriptome-wide association, and CLiP-Y for quantitative phenotypes, where there is no clear distinction between cases and controls. Through simulations, we demonstrate that CLiP and its extensions reliably distinguish between homogeneous and heterogeneous cohorts when the PRS explains as low as 5% of variance on the liability scale and cohorts comprise 50, 000 − 100, 000 samples, an increasingly practical size for modern GWAS. We apply CLiP to heterogeneity detection in schizophrenia cohorts totaling > 50, 000 cases and controls collected by the Psychiatric Genomics Consortium. We observe significant heterogeneity in mega-analysis of the combined PGC data (p-value 8.54e-4), as well as in individual cohorts meta-analyzed using Fisher’s method (p-value 0.03), based on significantly associated variants.

2020 ◽  
pp. 1-11 ◽  
Author(s):  
Giada Tripoli ◽  
Diego Quattrone ◽  
Laura Ferraro ◽  
Charlotte Gayer-Anderson ◽  
Victoria Rodriguez ◽  
...  

Abstract Background The ‘jumping to conclusions’ (JTC) bias is associated with both psychosis and general cognition but their relationship is unclear. In this study, we set out to clarify the relationship between the JTC bias, IQ, psychosis and polygenic liability to schizophrenia and IQ. Methods A total of 817 first episode psychosis patients and 1294 population-based controls completed assessments of general intelligence (IQ), and JTC, and provided blood or saliva samples from which we extracted DNA and computed polygenic risk scores for IQ and schizophrenia. Results The estimated proportion of the total effect of case/control differences on JTC mediated by IQ was 79%. Schizophrenia polygenic risk score was non-significantly associated with a higher number of beads drawn (B = 0.47, 95% CI −0.21 to 1.16, p = 0.17); whereas IQ PRS (B = 0.51, 95% CI 0.25–0.76, p < 0.001) significantly predicted the number of beads drawn, and was thus associated with reduced JTC bias. The JTC was more strongly associated with the higher level of psychotic-like experiences (PLEs) in controls, including after controlling for IQ (B = −1.7, 95% CI −2.8 to −0.5, p = 0.006), but did not relate to delusions in patients. Conclusions Our findings suggest that the JTC reasoning bias in psychosis might not be a specific cognitive deficit but rather a manifestation or consequence, of general cognitive impairment. Whereas, in the general population, the JTC bias is related to PLEs, independent of IQ. The work has the potential to inform interventions targeting cognitive biases in early psychosis.


2021 ◽  
Author(s):  
Nuzulul Kurniansyah ◽  
Matthew O Goodman ◽  
Tanika Kelly ◽  
Tali Elfassi ◽  
Kerri Wiggins ◽  
...  

Background: We used summary statistics from previously-published GWAS of systolic and diastolic BP and of hypertension to construct Polygenic Risk Scores (PRS) to predict hypertension across diverse populations. Methods: We used 10,314 participants of diverse ancestry from BioMe to train trait-specific PRS. We implemented a novel approach to select one of multiple potential PRS based on the same GWAS, by optimizing the coefficient of variation across estimated PRS effect sizes in independent subsets of the training dataset. We combined the 3 selected trait-specific PRS as their unweighted sum, called "PRSsum". We evaluated PRS associations in an independent dataset of 39,035 individuals from eight cohort studies, to select the final, multi-ethnic, HTN-PRS. We estimated its association with prevalent and incident hypertension 4-6 years later. We studied hypertension development within HTN-PRS strata in a longitudinal, six-visit, longitudinal dataset of 3,087 self-identified Black and White participants from the CARDIA study. Finally, we evaluated the HTN-PRS association with clinical outcomes in 40,201 individuals from the MGB Biobank. Results: Compared to other race/ethnic backgrounds, African-Americans had higher average values of the HTN-PRS. The HTN-PRS was associated with prevalent hypertension (OR=2.10, 95% CI [1.99, 2.21], per one standard deviation (SD) of the PRS) across all participants, and in each race/ethnic background, with heterogeneity by background (p-value < 1.0x10-4). The lowest estimated effect size was in African Americans (OR=1.53, 95% CI [1.38, 1.69]). The HTN-PRS was associated with new onset hypertension among individuals with normal (respectively, elevated) BP at baseline: OR=1.71, 95% CI [1.55, 1.91] (OR=1.48, 95% CI [1.27, 1.71]). Association was further observed in age-stratified analysis. In CARDIA, Black participants with high HTN-PRS percentiles developed hypertension earlier than White participants with high HTN-PRS percentiles. The HTN-PRS was significantly associated with increased risk of coronary artery disease (OR=1.12), ischemic stroke (OR=1.15), type 2 diabetes (OR=1.19), and chronic kidney disease (OR=1.12), in the MGB Biobank. Conclusions: The multi-ethnic HTN-PRS is associated with both prevalent and incident hypertension at 4-6 years of follow up across adulthood and is associated with clinical outcomes.


2021 ◽  
Author(s):  
Sam Hodgson ◽  
Qin Qin Huang ◽  
Neneh Sallah ◽  
Chris J Griffiths ◽  
William Newman ◽  
...  

Background: Type 2 diabetes is a heterogeneous condition highly prevalent in British Pakistanis and Bangladeshis (BPB). The Genes & Health (G&H) cohort offers means to explore genetic determinants of disease in BPBs, combining genetic and lifelong health record data. Methods: We assessed whether common genetic loci associated with type 2 diabetes in European-ancestry individuals (EUR) replicate in G&H. We constructed a type 2 diabetes polygenic risk score (PRS) and combined it with a clinical risk instrument (QDiabetes) to build a novel, integrated risk tool (IRT). We compared IRT performance using net reclassification index (NRI) versus QDiabetes alone. We assessed the ability of the PRS to predict type 2 diabetes following gestational diabetes (GDM). We compared PRS distribution between type 2 diabetes subgroups identified by clinical features at diagnosis. Findings: Accounting for power, we replicated fewer loci associated with type 2 diabetes in G&H (n = 76/338, 22%) than would be expected if all EUR-ascertained loci were transferable (n = 95, 28%) (binomial p value = 0.01). In 13,648 patients free from type 2 diabetes followed up for 10 years, NRI was 3.2% for IRT versus QDiabetes (95% confidence interval 2.0 - 4.4%). IRT performance was best in reclassification of young adults deemed low risk by QDiabetes as high risk. PRS was independently associated with progression to type 2 diabetes after GDM (p = 0.028). Mean type 2 diabetes PRS differed between phenotypically-defined type 2 diabetes subgroups (p = 0.002). Interpretation: The type 2 diabetes PRS has broad potential clinical application in BPB, improving identification of type 2 diabetes risk (especially in the young), and characterisation of type 2 diabetes subgroups at diagnosis. Funding: Wellcome Trust, MRC, NIHR, and others. Full funding disclosed within.


2019 ◽  
Author(s):  
Giada Tripoli ◽  
Diego Quattrone ◽  
Laura Ferraro ◽  
Charlotte Gayer-Anderson ◽  
Victoria Rodriguez ◽  
...  

AbstractBackgroundThe “jumping to conclusions” (JTC) bias is associated with both psychosis and general cognition but their relationship is unclear. In this study, we set out to clarify the relationship between the JTC bias, IQ, psychosis and polygenic liability to schizophrenia and IQ.Methods817 FEP patients and 1294 population-based controls completed assessments of general intelligence (IQ), and JTC (assessed by the number of beads drawn on the probabilistic reasoning “beads” task) and provided blood or saliva samples from which we extracted DNA and computed polygenic risk scores for IQ and schizophrenia.ResultsThe estimated proportion of the total effect of case/control differences on JTC mediated by IQ was 79%. Schizophrenia Polygenic Risk Score (SZ PRS) was non-significantly associated with a higher number of beads drawn (B= 0.47, 95% CI −0.21 to 1.16, p=0.17); whereas IQ PRS (B=0.51, 95% CI 0.25 to 0.76, p<0.001) significantly predicted the number of beads drawn, and was thus associated with reduced JTC bias. The JTC was more strongly associated with higher level of psychotic-like experiences (PLE) in controls, including after controlling for IQ (B= −1.7, 95% CI −2.8 to −0.5, p=0.006), but did not relate to delusions in patients.Conclusionsthe JTC reasoning bias in psychosis is not a specific cognitive deficit but is rather a manifestation or consequence, of general cognitive impairment. Whereas, in the general population, the JTC bias is related to psychotic-like experiences, independent of IQ. The work has potential to inform interventions targeting cognitive biases in early psychosis.


2017 ◽  
Author(s):  
Lawrence M. Chen ◽  
Nelson Yao ◽  
Elika Garg ◽  
Yuecai Zhu ◽  
Thao T. T. Nguyen ◽  
...  

AbstractMotivationPolygenic risk scores describe the genomic contribution to complex phenotypes and consistently account for a larger proportion of the variance than single nucleotide polymorphisms alone. However, there is little consensus on the optimal data input for generating polygenic risk scores and existing approaches largely preclude the use of imputed posterior probabilities and strand-ambiguous SNPs.ResultsWe developed PRS-on-Spark (PRSoS) a polygenic risk score software implemented in Apache Spark and Python that accommodates a variety of data input (e.g., observed genotypes, imputed genotypes, or imputed posterior probabilities) and strand-ambiguous SNPs. We show that PRSoS is flexible and efficient and computes polygenic risk scores at a range of p-value thresholds more quickly than existing software (PRSice). We also show that the use of imputed posterior probabilities and the inclusion of strand-ambiguous SNPs increases the proportion of variance explained by polygenic risk scores for major depression.Availability and ImplementationPRSoS is written in Apache Spark and Python and is freely available (see https://github.com/MeaneyLab/PRSoS).


2020 ◽  
Vol 3 (Supplement_1) ◽  
pp. 95-96
Author(s):  
S Lee ◽  
K Shestopaloff ◽  
O Espin-Garcia ◽  
W Turpin ◽  
J Raygoza Garay ◽  
...  

Abstract Background Fecal calprotectin concentration (FC), a measure of gut inflammation is reported to be significantly higher in healthy first-degree relatives (FDR) of Crohn’s disease (CD) patients compared to healthy controls. In contrast, FC in spouses of CD patients was not significantly different from controls, suggesting that a genetic predisposition rather than a shared environmental factor affects FC. Aims We investigated the genetic association with FC in healthy FDRs of CD patients. Notably, these subjects are known to be enriched with CD risk alleles. Methods We investigated 1455 healthy Caucasian FDRs of CD patients from the GEM Project. Subjects were genotyped by HumanCoreEXOME chip and ImmunoChip platforms and then imputed by the Haplotype Reference Consortium v1.1 panel (Michigan Imputation Server). SNPs with a minor allele frequency&lt;5% were removed. FC was measured using BUHLMANN ELISA kit. Heritability was estimated using a pedigree based SOLAR program and a SNP-based GCTA software. Genome wide association of FC was tested using the GEE framework that accounts for family clusters, age, sex, first 3 genetic principal components and multiplex family status (≥2 FDRs diagnosed with CD). In addition, CD-polygenic risk scores were derived based on summary statistics and imputed SNPs from a recent GWAS by pruning and thresholding (P+T) and LDPred algorithm (PMID:31002795). Results Among 1455 subjects, 45.2% were male, median age was 19 years (IQR 13–26), 8.8% were from multiplex families, and median FC was 52 mg/kg (IQR 31–87; 20.8% had FC&gt;100). We estimated the heritability of FC to be 27% (27.1%, standard error=9%, p&lt;0.001 by pedigree approach; 27.9%, SE=12%, p&lt;0.001 by SNP approach). An untargeted GWAS failed to show any significant association with FC (i.e. p&lt;5x10-8). The lowest p value was obtained for rs224631 (p=5x10-7). Strikingly, an increase in CD polygenic risk scores was significantly associated with an increase of FC (p=5.2x10-5 with P+T method). Conclusions We demonstrate that FC concentration is a heritable trait in unaffected FDRs of CD patients. Although the association between genetic variants with FC did not reach GWAS significance, CD-polygenic risk score, which incorporates small effect size CD-associated SNPs, was significantly associated with FC concentrationin this cohort. Our results suggest that FC concentration is influenced genetically with contributions from CD-associated SNPs in unaffected FDRs of CD probands. It remains to be determined if the genetic influence to FC concentration is dependent/independent with the future development of CD. Submitted on behalf of The CCC-GEM Project research team Funding Agencies CCCHelmsley Charitable Trust/ Mount Sinai Hospital Fellowship Award


2021 ◽  
pp. 1-8
Author(s):  
Michael Wainberg ◽  
Peter Zhukovsky ◽  
Sean L. Hill ◽  
Daniel Felsky ◽  
Aristotle Voineskos ◽  
...  

Abstract Background Our understanding of major depression is complicated by substantial heterogeneity in disease presentation, which can be disentangled by data-driven analyses of depressive symptom dimensions. We aimed to determine the clinical portrait of such symptom dimensions among individuals in the community. Methods This cross-sectional study consisted of 25 261 self-reported White UK Biobank participants with major depression. Nine questions from the UK Biobank Mental Health Questionnaire encompassing depressive symptoms were decomposed into underlying factors or ‘symptom dimensions’ via factor analysis, which were then tested for association with psychiatric diagnoses and polygenic risk scores for major depressive disorder (MDD), bipolar disorder and schizophrenia. Replication was performed among 655 self-reported non-White participants, across sexes, and among 7190 individuals with an ICD-10 code for MDD from linked inpatient or primary care records. Results Four broad symptom dimensions were identified, encompassing negative cognition, functional impairment, insomnia and atypical symptoms. These dimensions replicated across ancestries, sexes and individuals with inpatient or primary care MDD diagnoses, and were also consistent among 43 090 self-reported White participants with undiagnosed self-reported depression. Every dimension was associated with increased risk of nearly every psychiatric diagnosis and polygenic risk score. However, while certain psychiatric diagnoses were disproportionately associated with specific symptom dimensions, the three polygenic risk scores did not show the same specificity of associations. Conclusions An analysis of questionnaire data from a large community-based cohort reveals four replicable symptom dimensions of depression with distinct clinical, but not genetic, correlates.


Author(s):  
Niccolo’ Tesi ◽  
Sven J van der Lee ◽  
Marc Hulsman ◽  
Iris E Jansen ◽  
Najada Stringa ◽  
...  

Abstract Studying the genome of centenarians may give insights into the molecular mechanisms underlying extreme human longevity and the escape of age-related diseases. Here, we set out to construct polygenic risk scores (PRSs) for longevity and to investigate the functions of longevity-associated variants. Using a cohort of centenarians with maintained cognitive health (N = 343), a population-matched cohort of older adults from 5 cohorts (N = 2905), and summary statistics data from genome-wide association studies on parental longevity, we constructed a PRS including 330 variants that significantly discriminated between centenarians and older adults. This PRS was also associated with longer survival in an independent sample of younger individuals (p = .02), leading up to a 4-year difference in survival based on common genetic factors only. We show that this PRS was, in part, able to compensate for the deleterious effect of the APOE-ε4 allele. Using an integrative framework, we annotated the 330 variants included in this PRS by the genes they associate with. We find that they are enriched with genes associated with cellular differentiation, developmental processes, and cellular response to stress. Together, our results indicate that an extended human life span is, in part, the result of a constellation of variants each exerting small advantageous effects on aging-related biological mechanisms that maintain overall health and decrease the risk of age-related diseases.


Hypertension ◽  
2021 ◽  
Vol 78 (Suppl_1) ◽  
Author(s):  
Joseph H Breeyear ◽  
Megan M Shuey ◽  
Todd L Edwards ◽  
Jacklyn Hellwege

Hypertension is estimated to affect more than 49.6% of US adults 20 years and older. Of those individuals with hypertension, more than ten million are classified as apparent treatment resistant hypertensive (aTRH). The attributable risk of uncontrolled hypertension was estimated to be 49% for cardiovascular disease and 62% for stroke. We developed a polygenic risk score (PRS) for systolic (SBP) and diastolic (DBP) blood pressure to examine the association between the genetic determinants of blood pressure and aTRH with the goal of identifying high risk individuals. The meta-analyzed transethnic results of Giri et al., Biobank Japan, and Liang et al. were used to generate a PRS with PRS-CS followed by p -value thresholding, and validation in the UK Biobank (n max =341,930). Associations were modeled with logistic regression adjusted for age, age-squared, BMI, sex, and ten principal components of ancestry in BioVU’s transethnic population (n max =37,978), as well as non-Hispanic Black (n max =5,026) and non-Hispanic White (n max =28,545) subsets. The SBP PRS was significantly associated with an increased aTRH risk in the non-Hispanic White subset (1.08 (1.04 - 1.12), p = 0.00037) and transethnic (1.08 (1.04 - 1.13), p = 0.00020) populations, but not the non-Hispanic Black subset. The DBP PRS was not associated with aTRH in any population. Our findings present evidence that individuals with a higher genetic predisposition towards hypertension are at higher risk of aTRH. By integrating polygenic risk scores and clinical covariates in prediction of aTRH, individuals’ therapeutic regimens may be tailored to help maintain stable blood pressures, therefore reducing their risk of comorbidities.


2017 ◽  
Author(s):  
David Curtis

SummaryPolygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants which may have major effects are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and to the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further.


Sign in / Sign up

Export Citation Format

Share Document