scholarly journals Polygenic Risk Scores for Cardio-renal-metabolic Diseases in the Penn Medicine Biobank

2019 ◽  
Author(s):  
R.L. Kember ◽  
A. Verma ◽  
S. Verma ◽  
A. Lucas ◽  
R. Judy ◽  
...  

AbstractCardio-renal-metabolic (CaReMe) conditions are common and the leading cause of mortality around the world. Genome-wide association studies have shown that these diseases are polygenic and share many genetic risk factors. Identifying individuals at high genetic risk will allow us to target prevention and treatment strategies. Polygenic risk scores (PRS) are aggregate weighted counts that can demonstrate an individual’s genetic liability for disease. However, current PRS are often based on European ancestry individuals, limiting the implementation of precision medicine efforts in diverse populations. In this study, we develop PRS for six diseases and traits related to cardio-renal-metabolic disease in the Penn Medicine Biobank. We investigate their performance in both European and African ancestry individuals, and identify genetic and phenotypic overlap within these conditions. We find that genetic risk is associated with the primary phenotype in both ancestries, but this does not translate into a model of predictive value in African ancestry individuals. We conclude that future research should prioritize genetic studies in diverse ancestries in order to address this disparity.

Neurology ◽  
2018 ◽  
Vol 90 (18) ◽  
pp. e1605-e1612 ◽  
Author(s):  
Tian Ge ◽  
Mert R. Sabuncu ◽  
Jordan W. Smoller ◽  
Reisa A. Sperling ◽  
Elizabeth C. Mormino ◽  
...  

ObjectiveTo investigate the effects of genetic risk of Alzheimer disease (AD) dementia in the context of β-amyloid (Aβ) accumulation.MethodsWe analyzed data from 702 participants (221 clinically normal, 367 with mild cognitive impairment, and 114 with AD dementia) with genetic data and florbetapir PET available. A subset of 669 participants additionally had longitudinal MRI scans to assess hippocampal volume. Polygenic risk scores (PRSs) were estimated with summary statistics from previous large-scale genome-wide association studies of AD dementia. We examined relationships between APOE ε4 status and PRS with longitudinal Aβ and cognitive and hippocampal volume measurements.ResultsAPOE ε4 was strongly related to baseline Aβ, whereas only weak associations between PRS and baseline Aβ were present. APOE ε4 was additionally related to greater memory decline and hippocampal atrophy in Aβ+ participants. When APOE ε4 was controlled for, PRS was related to cognitive decline in Aβ+ participants. Finally, PRSs were associated with hippocampal atrophy in Aβ− participants and weakly associated with baseline hippocampal volume in Aβ+ participants.ConclusionsGenetic risk factors of AD dementia demonstrate effects related to Aβ, as well as synergistic interactions with Aβ. The specific effect of faster cognitive decline in Aβ+ individuals with higher genetic risk may explain the large degree of heterogeneity in cognitive trajectories among Aβ+ individuals. Consideration of genetic variants in conjunction with baseline Aβ may improve enrichment strategies for clinical trials targeting Aβ+ individuals most at risk for imminent cognitive decline.


2020 ◽  
Author(s):  
Evan A. Winiger ◽  
Jarrod M. Ellingson ◽  
Claire L. Morrison ◽  
Robin P. Corley ◽  
Joëlle A. Pasman ◽  
...  

AbstractStudy ObjectivesEstimate the genetic relationship of cannabis use with sleep deficits and eveningness chronotype.MethodsWe used linkage disequilibrium score regression (LDSC) to analyze genetic correlations between sleep deficits and cannabis use behaviors. Secondly, we generated sleep deficit polygenic risk scores (PRSs) and estimated their ability to predict cannabis use behaviors using logistic regression. Summary statistics came from existing genome wide association studies (GWASs) of European ancestry that were focused on sleep duration, insomnia, chronotype, lifetime cannabis use, and cannabis use disorder (CUD). A target sample for PRS prediction consisted of high-risk participants and participants from twin/family community-based studies (n = 796, male = 66%; mean age = 26.81). Target data consisted of self-reported sleep (sleep duration, feeling tired, and taking naps) and cannabis use behaviors (lifetime use, number of lifetime uses, past 180-day use, age of first use, and lifetime CUD symptoms).ResultsSignificant genetic correlation between lifetime cannabis use and eveningness chronotype (rG = 0.24, p < 0.01), as well as between CUD and both short sleep duration (<7 h) (rG = 0.23, p = 0.02) and insomnia (rG = 0.20, p = 0.02). Insomnia PRS predicted earlier age of first cannabis use (β = −0.09, p = 0.02) and increased lifetime CUD symptom count use (β = 0.07, p = 0.03).ConclusionCannabis use is genetically associated with both sleep deficits and an eveningness chronotype, suggesting that there are genes that predispose individuals to both cannabis use and sleep deficits.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Yanyu Liang ◽  
Milton Pividori ◽  
Ani Manichaikul ◽  
Abraham A. Palmer ◽  
Nancy J. Cox ◽  
...  

Abstract Background Polygenic risk scores (PRS) are valuable to translate the results of genome-wide association studies (GWAS) into clinical practice. To date, most GWAS have been based on individuals of European-ancestry leading to poor performance in populations of non-European ancestry. Results We introduce the polygenic transcriptome risk score (PTRS), which is based on predicted transcript levels (rather than SNPs), and explore the portability of PTRS across populations using UK Biobank data. Conclusions We show that PTRS has a significantly higher portability (Wilcoxon p=0.013) in the African-descent samples where the loss of performance is most acute with better performance than PRS when used in combination.


2018 ◽  
Author(s):  
Florian Privé ◽  
Hugues Aschard ◽  
Michael G.B. Blum

AbstractPolygenic Risk Scores (PRS) consist in combining the information across many single-nucleotide polymorphisms (SNPs) in a score reflecting the genetic risk of developing a disease. PRS might have a major impact on public health, possibly allowing for screening campaigns to identify high-genetic risk individuals for a given disease. The “Clumping+Thresholding” (C+T) approach is the most common method to derive PRS. C+T uses only univariate genome-wide association studies (GWAS) summary statistics, which makes it fast and easy to use. However, previous work showed that jointly estimating SNP effects for computing PRS has the potential to significantly improve the predictive performance of PRS as compared to C+T.In this paper, we present an efficient method to jointly estimate SNP effects, allowing for practical application of penalized logistic regression (PLR) on modern datasets including hundreds of thousands of individuals. Moreover, our implementation of PLR directly includes automatic choices for hyper-parameters. The choice of hyper-parameters for a predictive model is very important since it can dramatically impact its predictive performance. As an example, AUC values range from less than 60% to 90% in a model with 30 causal SNPs, depending on the p-value threshold in C+T.We compare the performance of PLR, C+T and a derivation of random forests using both real and simulated data. PLR consistently achieves higher predictive performance than the two other methods while being as fast as C+T. We find that improvement in predictive performance is more pronounced when there are few effects located in nearby genomic regions with correlated SNPs; for instance, AUC values increase from 83% with the best prediction of C+T to 92.5% with PLR. We confirm these results in a data analysis of a case-control study for celiac disease where PLR and the standard C+T method achieve AUC of 89% and of 82.5%.In conclusion, our study demonstrates that penalized logistic regression can achieve more discriminative polygenic risk scores, while being applicable to large-scale individual-level data thanks to the implementation we provide in the R package bigstatsr.


2019 ◽  
Author(s):  
Yan Zhang ◽  
Amber N. Wilcox ◽  
Haoyu Zhang ◽  
Parichoy Pal Choudhury ◽  
Douglas F. Easton ◽  
...  

AbstractWe analyzed summary-level data from genome-wide association studies (GWAS) of European ancestry across fourteen cancer sites to estimate the number of common susceptibility variants (polygenicity) contributing to risk, as well as the distribution of their associated effect sizes. All cancers evaluated showed polygenicity, involving at a minimum thousands of independent susceptibility variants. For some malignancies, particularly chronic lymphoid leukemia (CLL) and testicular cancer, there are a larger proportion of variants with larger effect sizes than those for other cancers. In contrast, most variants for lung and breast cancers have very small associated effect sizes. For different cancer sites, we estimate a wide range of GWAS sample sizes, required to explain 80% of GWAS heritability, varying from 60,000 cases for CLL to over 1,000,000 cases for lung cancer. The maximum relative risk achievable for subjects at the 99th risk percentile of underlying polygenic risk scores, compared to average risk, ranges from 12 for testicular to 2.5 for ovarian cancer. We show that polygenic risk scores have substantial potential for risk stratification for relatively common cancers such as breast, prostate and colon, but limited potential for other cancer sites because of modest heritability and lower disease incidence.


Thorax ◽  
2021 ◽  
pp. thoraxjnl-2020-215624
Author(s):  
Sinjini Sikdar ◽  
Annah B Wyss ◽  
Mi Kyeong Lee ◽  
Thanh T Hoang ◽  
Marie Richards ◽  
...  

RationaleGenome-wide association studies (GWASs) have identified numerous loci associated with lower pulmonary function. Pulmonary function is strongly related to smoking and has also been associated with asthma and dust endotoxin. At the individual SNP level, genome-wide analyses of pulmonary function have not identified appreciable evidence for gene by environment interactions. Genetic Risk Scores (GRSs) may enhance power to identify gene–environment interactions, but studies are few.MethodsWe analysed 2844 individuals of European ancestry with 1000 Genomes imputed GWAS data from a case–control study of adult asthma nested within a US agricultural cohort. Pulmonary function traits were FEV1, FVC and FEV1/FVC. Using data from a recent large meta-analysis of GWAS, we constructed a weighted GRS for each trait by combining the top (p value<5×10−9) genetic variants, after clumping based on distance (±250 kb) and linkage disequilibrium (r2=0.5). We used linear regression, adjusting for relevant covariates, to estimate associations of each trait with its GRS and to assess interactions.ResultsEach trait was highly significantly associated with its GRS (all three p values<8.9×10−8). The inverse association of the GRS with FEV1/FVC was stronger for current smokers (pinteraction=0.017) or former smokers (pinteraction=0.064) when compared with never smokers and among asthmatics compared with non-asthmatics (pinteraction=0.053). No significant interactions were observed between any GRS and house dust endotoxin.ConclusionsEvaluation of interactions using GRSs supports a greater impact of increased genetic susceptibility on reduced pulmonary function in the presence of smoking or asthma.


Author(s):  
Taylor B. Cavazos ◽  
John S. Witte

ABSTRACTThe majority of polygenic risk scores (PRS) have been developed and optimized in individuals of European ancestry and may have limited generalizability across other ancestral populations. Understanding aspects of PRS that contribute to this issue and determining solutions is complicated by disease-specific genetic architecture and limited knowledge of sharing of causal variants and effect sizes across populations. Motivated by these challenges, we undertook a simulation study to assess the relationship between ancestry and the potential bias in PRS developed in European ancestry populations. Our simulations show that the magnitude of this bias increases with increasing divergence from European ancestry, and this is attributed to population differences in linkage disequilibrium and allele frequencies of European discovered variants, likely as a result of genetic drift. Importantly, we find that including into the PRS variants discovered in African ancestry individuals has the potential to achieve unbiased estimates of genetic risk across global populations and admixed individuals. We confirm our simulation findings in an analysis of HbA1c, asthma, and prostate cancer in the UK Biobank. Given the demonstrated improvement in PRS prediction accuracy, recruiting larger diverse cohorts will be crucial—and potentially even necessary—for enabling accurate and equitable genetic risk prediction across populations.


PLoS Genetics ◽  
2021 ◽  
Vol 17 (9) ◽  
pp. e1009670
Author(s):  
Lars G. Fritsche ◽  
Ying Ma ◽  
Daiwei Zhang ◽  
Maxwell Salvatore ◽  
Seunggeun Lee ◽  
...  

Polygenic risk scores (PRS) can provide useful information for personalized risk stratification and disease risk assessment, especially when combined with non-genetic risk factors. However, their construction depends on the availability of summary statistics from genome-wide association studies (GWAS) independent from the target sample. For best compatibility, it was reported that GWAS and the target sample should match in terms of ancestries. Yet, GWAS, especially in the field of cancer, often lack diversity and are predominated by European ancestry. This bias is a limiting factor in PRS research. By using electronic health records and genetic data from the UK Biobank, we contrast the utility of breast and prostate cancer PRS derived from external European-ancestry-based GWAS across African, East Asian, European, and South Asian ancestry groups. We highlight differences in the PRS distributions of these groups that are amplified when PRS methods condense hundreds of thousands of variants into a single score. While European-GWAS-derived PRS were not directly transferrable across ancestries on an absolute scale, we establish their predictive potential when considering them separately within each group. For example, the top 10% of the breast cancer PRS distributions within each ancestry group each revealed significant enrichments of breast cancer cases compared to the bottom 90% (odds ratio of 2.81 [95%CI: 2.69,2.93] in European, 2.88 [1.85, 4.48] in African, 2.60 [1.25, 5.40] in East Asian, and 2.33 [1.55, 3.51] in South Asian individuals). Our findings highlight a compromise solution for PRS research to compensate for the lack of diversity in well-powered European GWAS efforts while recruitment of diverse participants in the field catches up.


2017 ◽  
Vol 35 (6_suppl) ◽  
pp. 1-1
Author(s):  
Rosalind Eeles ◽  
Ali Amin Al Olama ◽  
Sonja Berndt ◽  
Fredrik Wiklund ◽  
David V Conti ◽  
...  

1 Background: Currently genome-wide association studies (GWAS) have identified over 100 prostate cancer (PrCa) susceptibility loci, capturing 33% of the PrCa familial relative risk (FRR) in Europeans. To identify further susceptibility variants, we conducted a PrCa GWAS, larger than previous studies, comprising ~49,000 cases and ~29,000 controls among individuals of European and Asian descent using the OncoArray, a platform consisting of a 260K GWAS backbone and 310K custom content selected from previous GWAS and fine-mapping studies of multiple cancers ( http://epi.grants.cancer.gov/oncoarray/ ). Methods: Genotypes from the OncoArray were used to impute genotypes from ~70M variants using the October 2014 release of the 1000 genomes project as a reference, and then combined with several previous PrCa GWAS of European ancestry: UK stage 1 (1,906 cases/1,934 controls) and stage 2 (3,888 cases/3,956 controls); CaPS 1 (498 cases/502 controls) and CaPS 2 (1,483 cases/519 controls); BPC3 (2,137 cases/3,101 controls); NCI PEGASUS (4,622 cases/2,954 controls); and iCOGS (21,209 cases/ 20,440 controls). Risk analyses for overall PrCa risk, aggressive PrCa (several definitions defined by PrCa clinical characteristics), and Gleason score were performed. Logistic and linear regression summary statistics were meta-analysed using an inverse variance fixed effect approach. Results: We identified novel loci significantly associated ( P < 5.0x10-8) with overall PrCa (N = 65). Our novel findings are comprised of several missense variants, including a SNP in the ATM gene - a key member of the DNA repair pathway. When combined multiplicatively, the 65 novel PrCa loci identified here increases the captured heritability of PrCa, explaining 38.5% of the FRR when combining novel and previously identified PrCa loci. Conclusions: In risk stratification, men in the top 1% of the genetic risk score group have a relative risk of 5.6 fold for developing PrCa compared with the median risk group. These results will improve the utility of genetic risk scores for targeted screening and prevention for prostate cancer.


2019 ◽  
Author(s):  
Thomas M. Piasecki ◽  
Ian R. Gizer ◽  
Wendy S. Slutske

Background and Aims: Twin studies indicate that disordered gambling (DG) is heritable but are silent with respect to specific genes or pathways involved. Genome-wide association studies of other psychiatric disorders permit calculation of polygenic risk scores (PRS) that reflect the aggregated effects of common genetic variants contributing to risk for the target condition. We investigated whether gambling and DG are associated with PRSs for four psychiatric conditions found to be comorbid with DG in epidemiologic surveys. Design and Setting: Data were drawn from the Wave IV assessment of the National Longitudinal Study of Adolescent to Adult Health, a representative sample of adolescents recruited in 1994-5 and followed into young adulthood. Participants: Analyses were limited to unrelated individuals classified as having European ancestry based on analysis of genetic principal components (N = 5,215). Measurements: Participants were surveyed about lifetime gambling and DG. Genotyping data were used to construct PRSs quantifying participants’ common variant genetic risk for major depression (MDD), attention-deficit hyperactivity disorder (ADHD), bipolar disorder (BD), and schizophrenia (SCZ). Findings: Most participants (78.4%) reported ever having gambled, and 1.3% of those reported lifetime DG. Polygenic risk for BD was associated with decreased odds of lifetime gambling, OR = 0.93 [0.87, 0.99], p = .045, pseudo-R2(%) = 0.12. The SCZ PRS was associated with increased odds of DG, OR = 1.54 [1.07, 2.21], p = .020, pseudo-R2 (%) = 0.85. Polygenic risk for MDD and ADHD were not related to either gambling outcome. Conclusions: Common variant risk for SCZ is associated with DG. Investigating features common to both SCZ and DG might generate valuable clues about the genetically-influenced liabilities to DG.


Sign in / Sign up

Export Citation Format

Share Document