scholarly journals Estimating the causal effect of genetic liability to prevalent disease on hospital costs using Mendelian Randomization

Author(s):  
Padraig Dixon ◽  
Sean Harrison ◽  
William Hollingworth ◽  
Neil M Davies ◽  
George Davey Smith

BACKGROUND Accurate measurement of the effects of disease status on healthcare cost is important in the pragmatic evaluation of interventions but is complicated by endogeneity biases due to omitted variables and reverse causality. Mendelian Randomization, the use of random perturbations in germline genetic variation as instrumental variables, can avoid these limitations. We report a novel Mendelian Randomization analysis of the causal effect of liability to disease on healthcare costs. METHODS We used Mendelian Randomization to model the causal impact on inpatient hospital costs of liability to six highly prevalent diseases: asthma, eczema, migraine, coronary heart disease, type 2 diabetes, and major depressive disorder. We identified genetic variants from replicated genome-wide associations studies and estimated their association with inpatient hospital costs using data from UK Biobank, a large prospective cohort study of individuals linked to records of hospital care. We assessed potential violations of the instrumental variable assumptions, particularly the exclusion restriction (i.e. variants affecting costs through alternative paths). We also conducted new genome wide association studies of hospital costs within the UK Biobank cohort as a further split sample sensitivity analysis. RESULTS We analyzed data on 307,032 individuals. Genetic variants explained only a small portion of the variance in each disease phenotype. Liability to coronary heart disease had substantial impacts (mean per person per year increase in costs from allele score Mendelian Randomization models: 712 pounds sterling (95% confidence interval: 238 pounds to 1,186 pounds)) on inpatient hospital costs in causal analysis, but other results were imprecise. There was concordance of findings across varieties of sensitivity analyses, including stratification by sex, and those obtained from the split sample analysis. CONCLUSION A novel Mendelian Randomization analysis of the causal effect of liability to disease on healthcare cost demonstrates that this type of analysis is feasible and informative in this context. There was concordance across data sources and across methods bearing different assumptions. Selection into the relatively healthy UK Biobank cohort and the modest proportion of variance in disease status accounted for by the allele scores reduced the precision of our estimates. We therefore could not exclude the possibility of substantial costs due to these diseases.

2020 ◽  
Author(s):  
Sean Harrison ◽  
Neil M Davies ◽  
Laura D Howe ◽  
Amanda Hughes

AbstractMen with more advantaged socioeconomic position (SEP) and better health have been observed to have higher levels of testosterone. It is unclear whether these associations arise because testosterone has a causal impact on SEP and health. In 306,248 participants of UK Biobank, we performed sex- stratified genome-wide association analysis to identify genetic variants associated with testosterone. Using the identified variants, we performed Mendelian randomization analysis of the influence of testosterone on socioeconomic position, including income, employment status, area-level deprivation, and educational qualifications; on health, including self-rated health and BMI, and on risk-taking behaviour. We found little evidence that testosterone affected socioeconomic position, health, or risk-taking. Our results therefore suggest it is unlikely that testosterone meaningfully affects these outcomes in men or women. Differences between Mendelian randomization and multivariable-adjusted estimates suggest previously reported associations with socioeconomic position and health may be due to residual confounding or reverse causation.


Author(s):  
Shuai Yuan ◽  
Maria Bruzelius ◽  
Susanna C. Larsson

AbstractWhether renal function is causally associated with venous thromboembolism (VTE) is not yet fully elucidated. We conducted a two-sample Mendelian randomization (MR) study to determine the causal effect of renal function, measured as estimated glomerular filtration rate (eGFR), on VTE. Single-nucleotide polymorphisms associated with eGFR were selected as instrumental variables at the genome-wide significance level (p < 5 × 10−8) from a meta-analysis of 122 genome-wide association studies including up to 1,046,070 individuals. Summary-level data for VTE were obtained from the FinnGen consortium (6913 VTE cases and 169,986 non-cases) and UK Biobank study (4620 VTE cases and 356,574 non-cases). MR estimates were calculated using the random-effects inverse-variance weighted method and combined using fixed-effects meta-analysis. Genetically predicted decreased eGFR was significantly associated with an increased risk of VTE in both FinnGen and UK Biobank. For one-unit decrease in log-transformed eGFR, the odds ratios of VTE were 2.93 (95% confidence interval (CI) 1.25, 6.84) and 4.46 (95% CI 1.59, 12.5) when using data from FinnGen and UK Biobank, respectively. The combined odds ratio was 3.47 (95% CI 1.80, 6.68). Results were consistent in all sensitivity analyses and no horizontal pleiotropy was detected. This MR-study supported a casual role of impaired renal function in VTE.


2019 ◽  
Author(s):  
Emily Jamieson ◽  
Roxanna Korologou-Linden ◽  
Robyn E. Wootton ◽  
Anna L. Guyatt ◽  
Thomas Battram ◽  
...  

AbstractWhether smoking-associated DNA methylation has a causal effect on lung function has not been thoroughly evaluated. We investigated the causal effects of 474 smoking-associated CpGs on forced expiratory volume in one second (FEV1) in two-sample Mendelian randomization (MR) using methylation quantitative trait loci and genome-wide association data for FEV1. We found evidence of a possible causal effect for DNA methylation on FEV1 at 18 CpGs (p<1.2×10−4). Replication analysis supported a causal effect at three CpGs (cg21201401 (ZGPAT), cg19758448 (PGAP3) and cg12616487 (AHNAK) (p<0.0028). DNA methylation did not clearly mediate the effect of smoking on FEV1, although DNA methylation at some sites may influence lung function via effects on smoking. Using multiple-trait colocalization, we found evidence of shared causal variants between lung function, gene expression and DNA methylation. Findings highlight potential therapeutic targets for improving lung function and possibly smoking cessation, although large, tissue-specific datasets are required to confirm these results.


2020 ◽  
Vol 70 ◽  
pp. 102300
Author(s):  
Padraig Dixon ◽  
William Hollingworth ◽  
Sean Harrison ◽  
Neil M. Davies ◽  
George Davey Smith

2019 ◽  
Vol 49 (4) ◽  
pp. 1147-1158 ◽  
Author(s):  
Jessica M B Rees ◽  
Christopher N Foley ◽  
Stephen Burgess

Abstract Background Factorial Mendelian randomization is the use of genetic variants to answer questions about interactions. Although the approach has been used in applied investigations, little methodological advice is available on how to design or perform a factorial Mendelian randomization analysis. Previous analyses have employed a 2 × 2 approach, using dichotomized genetic scores to divide the population into four subgroups as in a factorial randomized trial. Methods We describe two distinct contexts for factorial Mendelian randomization: investigating interactions between risk factors, and investigating interactions between pharmacological interventions on risk factors. We propose two-stage least squares methods using all available genetic variants and their interactions as instrumental variables, and using continuous genetic scores as instrumental variables rather than dichotomized scores. We illustrate our methods using data from UK Biobank to investigate the interaction between body mass index and alcohol consumption on systolic blood pressure. Results Simulated and real data show that efficiency is maximized using the full set of interactions between genetic variants as instruments. In the applied example, between 4- and 10-fold improvement in efficiency is demonstrated over the 2 × 2 approach. Analyses using continuous genetic scores are more efficient than those using dichotomized scores. Efficiency is improved by finding genetic variants that divide the population at a natural break in the distribution of the risk factor, or else divide the population into more equal-sized groups. Conclusions Previous factorial Mendelian randomization analyses may have been underpowered. Efficiency can be improved by using all genetic variants and their interactions as instrumental variables, rather than the 2 × 2 approach.


PLoS Genetics ◽  
2021 ◽  
Vol 17 (4) ◽  
pp. e1009525
Author(s):  
Mark Gormley ◽  
James Yarmolinsky ◽  
Tom Dudding ◽  
Kimberley Burrows ◽  
Richard M. Martin ◽  
...  

Head and neck squamous cell carcinoma (HNSCC), which includes cancers of the oral cavity and oropharynx, is a cause of substantial global morbidity and mortality. Strategies to reduce disease burden include discovery of novel therapies and repurposing of existing drugs. Statins are commonly prescribed for lowering circulating cholesterol by inhibiting HMG-CoA reductase (HMGCR). Results from some observational studies suggest that statin use may reduce HNSCC risk. We appraised the relationship of genetically-proxied cholesterol-lowering drug targets and other circulating lipid traits with oral (OC) and oropharyngeal (OPC) cancer risk using two-sample Mendelian randomization (MR). For the primary analysis, germline genetic variants in HMGCR, NPC1L1, CETP, PCSK9 and LDLR were used to proxy the effect of low-density lipoprotein cholesterol (LDL-C) lowering therapies. In secondary analyses, variants were used to proxy circulating levels of other lipid traits in a genome-wide association study (GWAS) meta-analysis of 188,578 individuals. Both primary and secondary analyses aimed to estimate the downstream causal effect of cholesterol lowering therapies on OC and OPC risk. The second sample for MR was taken from a GWAS of 6,034 OC and OPC cases and 6,585 controls (GAME-ON). Analyses were replicated in UK Biobank, using 839 OC and OPC cases and 372,016 controls and the results of the GAME-ON and UK Biobank analyses combined in a fixed-effects meta-analysis. We found limited evidence of a causal effect of genetically-proxied LDL-C lowering using HMGCR, NPC1L1, CETP or other circulating lipid traits on either OC or OPC risk. Genetically-proxied PCSK9 inhibition equivalent to a 1 mmol/L (38.7 mg/dL) reduction in LDL-C was associated with an increased risk of OC and OPC combined (OR 1.8 95%CI 1.2, 2.8, p = 9.31 x10-05), with good concordance between GAME-ON and UK Biobank (I2 = 22%). Effects for PCSK9 appeared stronger in relation to OPC (OR 2.6 95%CI 1.4, 4.9) than OC (OR 1.4 95%CI 0.8, 2.4). LDLR variants, resulting in genetically-proxied reduction in LDL-C equivalent to a 1 mmol/L (38.7 mg/dL), reduced the risk of OC and OPC combined (OR 0.7, 95%CI 0.5, 1.0, p = 0.006). A series of pleiotropy-robust and outlier detection methods showed that pleiotropy did not bias our findings. We found limited evidence for a role of cholesterol-lowering in OC and OPC risk, suggesting previous observational results may have been confounded. There was some evidence that genetically-proxied inhibition of PCSK9 increased risk, while lipid-lowering variants in LDLR, reduced risk of combined OC and OPC. This result suggests that the mechanisms of action of PCSK9 on OC and OPC risk may be independent of its cholesterol lowering effects; however, this was not supported uniformly across all sensitivity analyses and further replication of this finding is required.


Biostatistics ◽  
2020 ◽  
Author(s):  
Andrew J Grant ◽  
Stephen Burgess

Summary Valid estimation of a causal effect using instrumental variables requires that all of the instruments are independent of the outcome conditional on the risk factor of interest and any confounders. In Mendelian randomization studies with large numbers of genetic variants used as instruments, it is unlikely that this condition will be met. Any given genetic variant could be associated with a large number of traits, all of which represent potential pathways to the outcome which bypass the risk factor of interest. Such pleiotropy can be accounted for using standard multivariable Mendelian randomization with all possible pleiotropic traits included as covariates. However, the estimator obtained in this way will be inefficient if some of the covariates do not truly sit on pleiotropic pathways to the outcome. We present a method that uses regularization to identify which out of a set of potential covariates need to be accounted for in a Mendelian randomization analysis in order to produce an efficient and robust estimator of a causal effect. The method can be used in the case where individual-level data are not available and the analysis must rely on summary-level data only. It can be used where there are any number of potential pleiotropic covariates up to the number of genetic variants less one. We show the results of simulation studies that demonstrate the performance of the proposed regularization method in realistic settings. We also illustrate the method in an applied example which looks at the causal effect of urate plasma concentration on coronary heart disease.


2019 ◽  
Author(s):  
Padraig Dixon ◽  
William Hollingworth ◽  
Sean Harrison ◽  
Neil M Davies ◽  
George Davey Smith

AbstractEstimates of the marginal effect of measures of adiposity such as body mass index (BMI) on healthcare costs are important for the formulation and evaluation of policies targeting adverse weight profiles. Many existing estimates of this association are affected by endogeneity bias caused by simultaneity, measurement error and omitted variables. The contribution of this study is to avoid this bias by using a novel identification strategy – random germline genetic variation in an instrumental variable analysis – to identify the presence and magnitude of the causal effect of BMI on inpatient hospital costs. We also use data on genetic variants to undertake much richer testing of the sensitivity of results to potential violations of the instrumental variable assumptions than is possible with existing approaches. Using data on over 300,000 individuals, we found effect sizes for the marginal unit of BMI more than 50% larger than multivariable effect sizes. These effects attenuated under sensitivity analyses, but remained larger than multivariable estimates for all but one estimator. There was little evidence for non-linear effects of BMI on hospital costs. Within-family estimates, intended to address dynastic biases, were null but suffered from low power. This paper is the first to use genetic variants in a Mendelian Randomization framework to estimate the causal effect of BMI (or any other disease/trait) on healthcare costs. This type of analysis can be used to inform the cost-effectiveness of interventions and policies targeting the prevention and treatment of overweight and obesity, and for setting research priorities.


2018 ◽  
Vol 48 (3) ◽  
pp. 713-727 ◽  
Author(s):  
Eleanor Sanderson ◽  
George Davey Smith ◽  
Frank Windmeijer ◽  
Jack Bowden

Abstract Background Mendelian randomization (MR) is a powerful tool in epidemiology that can be used to estimate the causal effect of an exposure on an outcome in the presence of unobserved confounding, by utilizing genetic variants that are instrumental variables (IVs) for the exposure. This has been extended to multivariable MR (MVMR) to estimate the effect of two or more exposures on an outcome. Methods and results We use simulations and theory to clarify the interpretation of estimated effects in a MVMR analysis under a range of underlying scenarios, where a secondary exposure acts variously as a confounder, a mediator, a pleiotropic pathway and a collider. We then describe how instrument strength and validity can be assessed for an MVMR analysis in the single-sample setting, and develop tests to assess these assumptions in the popular two-sample summary data setting. We illustrate our methods using data from UK Biobank to estimate the effect of education and cognitive ability on body mass index. Conclusion MVMR analysis consistently estimates the direct causal effect of an exposure, or exposures, of interest and provides a powerful tool for determining causal effects in a wide range of scenarios with either individual- or summary-level data.


Sign in / Sign up

Export Citation Format

Share Document