Phenome-wide association study (PheWAS) of colorectal cancer risk SNP effects on health outcomes in UK Biobank

Abstract Background Associations between colorectal cancer (CRC) and other health outcomes have been reported, but these may be subject to biases, or due to limitations of observational studies. Methods We set out to determine whether genetic predisposition to CRC is also associated with the risk of other phenotypes. Under the phenome-wide association study (PheWAS) and tree-structured phenotypic model (TreeWAS), we studied 334,385 unrelated White British individuals (excluding CRC patients) from the UK Biobank cohort. We generated a polygenic risk score (PRS) from CRC genome-wide association studies as a measure of CRC risk. We performed sensitivity analyses to test the robustness of the results and searched the Danish Disease Trajectory Browser (DTB) to replicate the observed associations. Results Eight PheWAS phenotypes and 21 TreeWAS nodes were associated with CRC genetic predisposition by PheWAS and TreeWAS, respectively. The PheWAS detected associations were from neoplasms and digestive system disease group (e.g. benign neoplasm of colon, anal and rectal polyp and diverticular disease). The results from the TreeWAS corroborated the results from the PheWAS. These results were replicated in the observational data within the DTB. Conclusions We show that benign colorectal neoplasms share genetic aetiology with CRC using PheWAS and TreeWAS methods. Additionally, CRC genetic predisposition is associated with diverticular disease.

Download Full-text

Identifying the potential role of insomnia on multimorbidity: A Mendelian randomization phenome-wide association study in UK Biobank

10.1101/2022.01.11.22269005 ◽

2022 ◽

Author(s):

Mark J Gibson ◽

Deborah A Lawlor ◽

Louise AC Millard

Keyword(s):

Health Outcomes ◽

Association Study ◽

Genetic Risk ◽

Association Studies ◽

Causal Effects ◽

Mendelian Randomisation ◽

Genome Wide Association Studies ◽

Uk Biobank ◽

Wide Range

Objectives: To identify the breadth of potential causal effects of insomnia on health outcomes and hence its possible role in multimorbidity. Design: Mendelian randomisation (MR) Phenome-wide association study (MR-PheWAS) with two-sample Mendelian randomisation follow-up. Setting: Individual data from UK Biobank and summary data from a number of genome-wide association studies. Participants: 336,975 unrelated white-British UK Biobank participants. Exposures: Standardised genetic risk of insomnia for the MR-PheWAS and genetically predicted insomnia for the two-sample MR follow-up, with insomnia instrumented by a genetic risk score (GRS) created from 129 single-nucleotide polymorphisms (SNPs). Main outcomes measures: 11,409 outcomes from UK Biobank extracted and processed by an automated pipeline (PHESANT). Potential causal effects (i.e., those passing a Bonferroni-corrected significance threshold) were followed up with two-sample MR in MR-Base, where possible. Results: 437 potential causal effects of insomnia were observed for a number of traits, including anxiety, stress, depression, mania, addiction, pain, body composition, immune, respiratory, endocrine, dental, musculoskeletal, cardiovascular and reproductive traits, as well as socioeconomic and behavioural traits. We were able to undertake two-sample MR for 71 of these 437 and found evidence of causal effects (with directionally concordant effect estimates across all analyses) for 25 of these. These included, for example, risk of anxiety disorders (OR=1.55 [95% confidence interval (CI): 1.30, 1.86] per category increase in insomnia), diseases of the oesophagus/stomach/duodenum (OR=1.32 [95% CI: 1.14, 1.53]) and spondylosis (OR=1.57 [95% CI: 1.22, 2.01]). Conclusion: Insomnia potentially causes a wide range of adverse health outcomes and behaviours. This has implications for developing interventions to prevent and treat a number of diseases in order to reduce multimorbidity and associated polypharmacy.

Download Full-text

Evaluation of glycemic traits in susceptibility to COVID-19 risk: a Mendelian randomization study

BMC Medicine ◽

10.1186/s12916-021-01944-3 ◽

2021 ◽

Vol 19 (1) ◽

Author(s):

Shiu Lun Au Yeung ◽

Jie V Zhao ◽

C Mary Schooling

Keyword(s):

Type 2 Diabetes ◽

Genetic Predisposition ◽

Mendelian Randomization ◽

Fasting Glucose ◽

Association Studies ◽

Sensitivity Analyses ◽

Genome Wide Association Studies ◽

Inverse Association ◽

Wide Confidence Interval

Abstract Background Observational studies suggest poorer glycemic traits and type 2 diabetes associated with coronavirus disease 2019 (COVID-19) risk although these findings could be confounded by socioeconomic position. We conducted a two-sample Mendelian randomization to clarify their role in COVID-19 risk and specific COVID-19 phenotypes (hospitalized and severe cases). Method We identified genetic instruments for fasting glucose (n = 133,010), 2 h glucose (n = 42,854), glycated hemoglobin (n = 123,665), and type 2 diabetes (74,124 cases and 824,006 controls) from genome wide association studies and applied them to COVID-19 Host Genetics Initiative summary statistics (17,965 COVID-19 cases and 1,370,547 population controls). We used inverse variance weighting to obtain the causal estimates of glycemic traits and genetic predisposition to type 2 diabetes in COVID-19 risk. Sensitivity analyses included MR-Egger and weighted median method. Results We found genetic predisposition to type 2 diabetes was not associated with any COVID-19 phenotype (OR: 1.00 per unit increase in log odds of having diabetes, 95%CI 0.97 to 1.04 for overall COVID-19; OR: 1.02, 95%CI 0.95 to 1.09 for hospitalized COVID-19; and OR: 1.00, 95%CI 0.93 to 1.08 for severe COVID-19). There were no strong evidence for an association of glycemic traits in COVID-19 phenotypes, apart from a potential inverse association for fasting glucose albeit with wide confidence interval. Conclusion We provide some genetic evidence that poorer glycemic traits and predisposition to type 2 diabetes unlikely increase the risk of COVID-19. Although our study did not indicate glycemic traits increase severity of COVID-19, additional studies are needed to verify our findings.

Download Full-text

Polygenic Risk Scores for Kidney Function and Their Associations with Circulating Proteome, and Incident Kidney Diseases

Journal of the American Society of Nephrology ◽

10.1681/asn.2020111599 ◽

2021 ◽

pp. ASN.2020111599

Author(s):

Zhi Yu ◽

Jin Jin ◽

Adrienne Tin ◽

Anna Köttgen ◽

Bing Yu ◽

...

Keyword(s):

Kidney Function ◽

Kidney Diseases ◽

Association Studies ◽

Polygenic Risk Score ◽

Genome Wide Association Studies ◽

Plasma Proteome ◽

Uk Biobank ◽

Polygenic Risk ◽

Genome Wide ◽

A Genome

Background: Genome-wide association studies (GWAS) have revealed numerous loci for kidney function (estimated glomerular filtration rate, eGFR). The relationship of polygenic predictors of eGFR, risk of incident adverse kidney outcomes, and the plasma proteome is not known. Methods: We developed a genome-wide polygenic risk score (PRS) for eGFR by applying the LDpred algorithm to summary statistics generated from a multiethnic meta-analysis of CKDGen Consortium GWAS (N=765,348) and UK Biobank GWAS (90% of the cohort; N=451,508), followed by best parameter selection using the remaining 10% of UK Biobank (N=45,158). We then tested the association of the PRS in the Atherosclerosis Risk in Communities (ARIC) study (N=8,866) with incident chronic kidney disease, kidney failure, and acute kidney injury. We also examined associations between the PRS and 4,877 plasma proteins measured at at middle age and older adulthood and evaluated mediation of PRS associations by eGFR. Results: The developed PRS showed significant associations with all outcomes with hazard ratios (95% CI) per 1 SD lower PRS ranged from 1.06 (1.01, 1.11) to 1.33 (1.28, 1.37). The PRS was significantly associated with 132 proteins at both time points. The strongest associations were with cystatin-C, collagen alpha-1(XV) chain, and desmocollin-2. Most proteins were higher at lower kidney function, except for 5 proteins including testican-2. Most correlations of the genetic PRS with proteins were mediated by eGFR. Conclusions: A PRS for eGFR is now sufficiently strong to capture risk for a spectrum of incident kidney diseases and broadly influences the plasma proteome, primarily mediated by eGFR.

Download Full-text

Personality, lifestyle and job satisfaction: causal association between neuroticism and job satisfaction using Mendelian randomisation in the UK biobank cohort

Translational Psychiatry ◽

10.1038/s41398-020-0691-3 ◽

2020 ◽

Vol 10 (1) ◽

Cited By ~ 1

Author(s):

Gull Rukh ◽

Junhua Dang ◽

Gaia Olivo ◽

Diana-Maria Ciuculete ◽

Mathias Rask-Andersen ◽

...

Keyword(s):

Physical Activity ◽

Job Satisfaction ◽

Association Studies ◽

Sensitivity Analyses ◽

Mendelian Randomisation ◽

Genome Wide Association Studies ◽

Single Variable ◽

Causal Association ◽

Uk Biobank ◽

The Uk

AbstractJob-related stress has been associated with poor health outcomes but little is known about the causal nature of these findings. We employed Mendelian randomisation (MR) approach to investigate the causal effect of neuroticism, education, and physical activity on job satisfaction. Trait-specific genetic risk score (GRS) based on recent genome wide association studies were used as instrumental variables (IV) using the UK Biobank cohort (N = 315,536). Both single variable and multivariable MR analyses were used to determine the effect of each trait on job satisfaction. We observed a clear evidence of a causal association between neuroticism and job satisfaction. In single variable MR, one standard deviation (1 SD) higher genetically determined neuroticism score (4.07 units) was associated with −0.31 units lower job satisfaction (95% confidence interval (CI): −0.38 to −0.24; P = 9.5 × 10−20). The causal associations remained significant after performing sensitivity analyses by excluding invalid genetic variants from GRSNeuroticism (β(95%CI): −0.28(−0.35 to −0.21); P = 3.4 x 10−15). Education (0.02; −0.08 to 0.12; 0.67) and physical activity (0.08; −0.34 to 0.50; 0.70) did not show any evidence for causal association with job satisfaction. When genetic instruments for neuroticism, education and physical activity were included together, the association of neuroticism score with job satisfaction was reduced by only −0.01 units, suggesting an independent inverse causal association between neuroticism score (P = 2.7 x 10−17) and job satisfaction. Our findings show an independent causal association between neuroticism score and job satisfaction. Physically active lifestyle may help to increase job satisfaction despite presence of high neuroticism scores. Our study highlights the importance of considering the confounding effect of negative personality traits for studies on job satisfaction.

Download Full-text

Childhood obesity and multiple sclerosis: A Mendelian randomization study

Multiple Sclerosis Journal ◽

10.1177/13524585211001781 ◽

2021 ◽

pp. 135245852110017

Author(s):

Adil Harroud ◽

Ruth E Mitchell ◽

Tom G Richardson ◽

John A Morris ◽

Vincenzo Forgetta ◽

...

Keyword(s):

Multiple Sclerosis ◽

Childhood Obesity ◽

Mendelian Randomization ◽

Pubertal Timing ◽

Association Studies ◽

Sensitivity Analyses ◽

Genome Wide Association Studies ◽

Uk Biobank ◽

Increased Risk ◽

Age At Puberty

Background: Higher childhood body mass index (BMI) has been associated with an increased risk of multiple sclerosis (MS). Objective: To evaluate whether childhood BMI has a causal influence on MS, and whether this putative effect is independent from early adult obesity and pubertal timing. Methods: We performed Mendelian randomization (MR) using summary genetic data on 14,802 MS cases and 26,703 controls. Large-scale genome-wide association studies provided estimates for BMI in childhood ( n = 47,541) and adulthood ( n = 322,154). In multivariable MR, we examined the direct effects of each timepoint and further adjusted for age at puberty. Findings were replicated using the UK Biobank ( n = 453,169). Results: Higher genetically predicted childhood BMI was associated with increased odds of MS (odds ratio (OR) = 1.26/SD BMI increase, 95% confidence interval (CI): 1.07–1.50). However, there was little evidence of a direct effect after adjusting for adult BMI (OR = 1.03, 95% CI: 0.70–1.53). Conversely, the effect of adult BMI persisted independent of childhood BMI (OR = 1.43; 95% CI: 1.01–2.03). The addition of age at puberty did not alter the findings. UK Biobank analyses showed consistent results. Sensitivity analyses provided no evidence of pleiotropy. Conclusion: Genetic evidence supports an association between childhood obesity and MS susceptibility, mediated by persistence of obesity into early adulthood but independent of pubertal timing.

Download Full-text

EraSOR: Erase Sample Overlap in polygenic score analyses

10.1101/2021.12.10.472164 ◽

2021 ◽

Author(s):

Shing Wan Choi ◽

Timothy Shin Heng Mak ◽

Clive J. Hoggart ◽

Paul F. O'Reilly

Keyword(s):

Association Studies ◽

Polygenic Risk Score ◽

Genome Wide Association Studies ◽

Summary Statistics ◽

Uk Biobank ◽

Type 1 Error ◽

Wide Range ◽

Close Relatedness ◽

Target Data

Background: Polygenic risk score (PRS) analyses are now routinely applied in biomedical research, with great hope that they will aid in our understanding of disease aetiology and contribute to personalized medicine. The continued growth of multi-cohort genome-wide association studies (GWASs) and large-scale biobank projects has provided researchers with a wealth of GWAS summary statistics and individual-level data suitable for performing PRS analyses. However, as the size of these studies increase, the risk of inter-cohort sample overlap and close relatedness increases. Ideally sample overlap would be identified and removed directly, but this is typically not possible due to privacy laws or consent agreements. This sample overlap, whether known or not, is a major problem in PRS analyses because it can lead to inflation of type 1 error and, thus, erroneous conclusions in published work. Results: Here, for the first time, we report the scale of the sample overlap problem for PRS analyses by generating known sample overlap across sub-samples of the UK Biobank data, which we then use to produce GWAS and target data to mimic the effects of inter-cohort sample overlap. We demonstrate that inter-cohort overlap results in a significant and often substantial inflation in the observed PRS-trait association, coefficient of determination (R2) and false-positive rate. This inflation can be high even when the absolute number of overlapping individuals is small if this makes up a notable fraction of the target sample. We develop and introduce EraSOR (Erase Sample Overlap and Relatedness), a software for adjusting inflation in PRS prediction and association statistics in the presence of sample overlap or close relatedness between the GWAS and target samples. A key component of the EraSOR approach is inference of the degree of sample overlap from the intercept of a bivariate LD score regression applied to the GWAS and target data, making it powered in settings where both have sample sizes over 1,000 individuals. Through extensive benchmarking using UK Biobank and HapGen2 simulated genotype-phenotype data, we demonstrate that PRSs calculated using EraSOR-adjusted GWAS summary statistics are robust to inter-cohort overlap in a wide range of realistic scenarios and are even robust to high levels of residual genetic and environmental stratification. Conclusion: The results of all PRS analyses for which sample overlap cannot be definitively ruled out should be considered with caution given high type 1 error observed in the presence of even low overlap between base and target cohorts. Given the strong performance of EraSOR in eliminating inflation caused by sample overlap in PRS studies with large (>5k) target samples, we recommend that EraSOR be used in all future such PRS studies to mitigate the potential effects of inter-cohort overlap and close relatedness.

Download Full-text

Frequent Daytime Napping is Detrimental to Human Health: A phenotype-wide Mendelian Randomization Study

10.1101/2020.01.20.20017723 ◽

2020 ◽

Author(s):

Lanlan Chen ◽

Aowen Tian ◽

Zhipeng Liu ◽

Miaoran Zhang ◽

Xingchen Pan ◽

...

Keyword(s):

Health Outcomes ◽

Human Health ◽

Mendelian Randomization ◽

Wide Spectrum ◽

Association Studies ◽

Genome Wide Association Studies ◽

Uk Biobank ◽

Major Depressive ◽

Genome Wide ◽

The Uk

ABSTRACTBackgroundIt remains controversial whether daytime napping is beneficial for human health.ObjectiveTo examine the causal relationship between daytime napping and the risk for various human diseases.DesignPhenotype-wide Mendelian randomization study.SettingNon-UK Biobank cohorts reported in published genome-wide association studies (GWAS) provided the outcome phenotypes in the discovery stage. The UK Biobank cohort provided the outcome phenotypes in the validation stage.ParticipantsThe UK Biobank GWAS included 361,194 European-ancestry residents in the UK. Non-UKBB GWAS included various numbers of participants.ExposureSelf-reported daytime napping frequency.Main outcome measureA wide-spectrum of human health outcomes including obesity, major depressive disorder, and high cholesterol.MethodsWe examined the causal relationship between daytime napping frequency in the UK Biobank as exposure and a panel of 1,146 health outcomes reported in genome-wide association studies (GWAS), using a two-sample Mendelian randomization analysis. The significant findings were further validated in the UK Biobank health outcomes of 4,203 human traits and diseases. The causal effects were estimated using a fixed-effect inverse variance weighted model. MR-Egger intercept test was applied to detect horizontal pleiotropy, along with Cochran’s Q test to assess heterogeneity among the causal effects of IVs.FindingsThere were significant causal relationships between daytime napping frequency and a wide spectrum of human health outcomes. In particular, we validated that frequent daytime napping increased the risks of major depressive disorder, obesity and abnormal lipid profile.InterpretationThe current study showed that frequent daytime napping mainly had adverse impacts on physical and mental health. Cautions should be taken for health recommendations on daytime napping. Further studies are necessary to precisely define the best daytime napping strategies.

Download Full-text

Precision Colorectal Cancer Screening with Polygenic Risk Score

10.1101/2020.08.19.20177931 ◽

2020 ◽

Author(s):

Tonis Tasa ◽

Mikk Puustusmaa ◽

Neeme Tonisson ◽

Berit Kolk ◽

Peeter Padrik

Keyword(s):

Colorectal Cancer ◽

Risk Score ◽

Association Studies ◽

Absolute Risk ◽

Specific Model ◽

Background Information ◽

Polygenic Risk Score ◽

Genome Wide Association Studies ◽

Polygenic Risk ◽

Common Cancer

Colorectal cancer (CRC) is the second most common cancer in women and third most common cancer in men. Genome-wide association studies have identified numerous genetic variants (SNPs) independently associated with CRC. The effects of such SNPs can be combined into a single polygenic risk score (PRS). Stratification of individuals according to PRS could be introduced to primary and secondary prevention. Our aim was to combine risk stratification of a sex-specific PRS model with recommendations for individualized CRC screening. Previously published PRS models for predicting the risk of CRC were collected from the literature. These were validated on the UK Biobank (UKBB) consisting of a total of 458 696 quality-controlled genotypes with 1810 and 1348 prevalent male cases, and 2410 and 1810 incident male and female cases. The best performing sex-specific model was selected based on the AUC in prevalent data and independently validated in the incident dataset. Using Estonian CRC background information, we performed absolute risk simulations and examined the ability of PRS in risk stratifying individual screening recommendations. The best-performing model included 91 SNPs. The C-index of the best performing model in the dataset was 0.613 (SE = 0.007) and hazard ratio (HR) per unit of PRS was 1.53 (1.47 - 1.59) for males. Respective metrics for females were 0.617 (SE = 0.006) and 1.50 (1.44 - 1.58). PRS risk simulations showed that a genetically average 50-year-old female doubles her risk by age 58 (55 in males) and triples it by age 63 (59 in males). In addition, the best performing PRS model was able to identify individuals in one of seven groups proposed by Naber et al. for different coloscopy screening recommendation regimens. We have combined PRS-based recommendations for individual screening attendance. Our approach is easily adaptable to other nationalities by using population-specific background data of other genetically similar populations.

Download Full-text

No Clinically Relevant Effect of Heart Rate Increase and Heart Rate Recovery During Exercise on Cardiovascular Disease: A Mendelian Randomization Analysis

Frontiers in Genetics ◽

10.3389/fgene.2021.569323 ◽

2021 ◽

Vol 12 ◽

Author(s):

Josephine Mensah-Kane ◽

Amand F. Schmidt ◽

Aroon D. Hingorani ◽

Chris Finan ◽

Yutang Chen ◽

...

Keyword(s):

Heart Rate ◽

Mendelian Randomization ◽

Heart Rate Recovery ◽

Association Studies ◽

Sensitivity Analyses ◽

Future Research ◽

Genome Wide Association Studies ◽

Uk Biobank ◽

Relevant Effect ◽

Cv Disease

BackgroundReduced heart rate (HR) increase (HRI), recovery (HRR), and higher resting HR are associated with cardiovascular (CV) disease, but causal inferences have not been deduced. We investigated causal effects of HRI, HRR, and resting HR on CV risk, all-cause mortality (ACM), atrial fibrillation (AF), coronary artery disease (CAD), and ischemic stroke (IS) using Mendelian Randomization.Methods11 variants for HRI, 11 for HRR, and two sets of 46 and 414 variants for resting HR were obtained from four genome-wide association studies (GWASs) on UK Biobank. We performed a lookup on GWASs for CV risk and ACM in UK Biobank (N = 375,367, 5.4% cases and N = 393,165, 4.4% cases, respectively). For CAD, AF, and IS, we used publicly available summary statistics. We used a random-effects inverse-variance weighted (IVW) method and sensitivity analyses to estimate causality.ResultsIVW showed a nominally significant effect of HRI on CV events (odds ratio [OR] = 1.0012, P = 4.11 × 10–2) and on CAD and AF. Regarding HRR, IVW was not significant for any outcome. The IVW method indicated statistically significant associations of resting HR with AF (OR = 0.9825, P = 9.8 × 10–6), supported by all sensitivity analyses, and a nominally significant association with IS (OR = 0.9926, P = 9.82 × 10–3).ConclusionOur findings suggest no strong evidence of an association between HRI and HRR and any outcome and confirm prior work reporting a highly significant effect of resting HR on AF. Future research is required to explore HRI and HRR associations further using more powerful predictors, when available.

Download Full-text

Transcriptome-wide association study in UK Biobank Europeans identifies associations with blood cell traits.

10.1101/2021.08.03.453690 ◽

2021 ◽

Author(s):

Bryce Rowland ◽

Sanan Venkatesh ◽

Manuel Tardaguila ◽

Jia Wen ◽

Jonathan D Rosen ◽

...

Keyword(s):

Association Study ◽

Target Genes ◽

Prediction Models ◽

Association Studies ◽

European Ancestry ◽

Genome Wide Association Studies ◽

Uk Biobank ◽

Rna Seq ◽

Genome Wide ◽

Increased Sensitivity

Previous genome-wide association studies (GWAS) of hematological traits have identified over 10,000 distinct trait-specific risk loci, but the underlying causal mechanisms at these loci remain incompletely characterized. We performed a transcriptome-wide association study (TWAS) of 29 hematological traits in 399,835 UK Biobank (UKB) participants of European ancestry using gene expression prediction models trained from whole blood RNA-seq data in 922 individuals. We discovered 557 TWAS signals associated with hematological traits distinct from previously discovered GWAS variants, including 10 completely novel gene-trait pairs corresponding to 9 unique genes. Among the 557 associations, 301 were available for replication in a cohort of 141,286 participants of European ancestry from the Million Veteran Program (MVP). Of these 301 associations, 199 replicated at a nominal threshold (α = 0.05) and 108 replicated at a strict Bonferroni adjusted threshold (α = 0.05/301). Using our TWAS results, we systematically assigned 4,261 out of 16,900 previously identified hematological trait GWAS variants to putative target genes. Compared to coloc, our TWAS results show reduced specificity and increased sensitivity to assign variants to target genes.

Download Full-text