scholarly journals Evaluating the Impact of Physiological Variability in Genome-Wide Association Studies of Resting Heart Rate

Author(s):  
Stefan van Duijvenboden ◽  
Julia Ramírez ◽  
William J. Young ◽  
Andrew Tinker ◽  
Patricia B. Munroe ◽  
...  
2013 ◽  
Vol 37 (4) ◽  
pp. 383-392 ◽  
Author(s):  
Karla J. Lindquist ◽  
Eric Jorgenson ◽  
Thomas J. Hoffmann ◽  
John S. Witte

2021 ◽  
Vol 12 ◽  
Author(s):  
Michal Marczyk ◽  
Agnieszka Macioszek ◽  
Joanna Tobiasz ◽  
Joanna Polanska ◽  
Joanna Zyla

A typical genome-wide association study (GWAS) analyzes millions of single-nucleotide polymorphisms (SNPs), several of which are in a region of the same gene. To conduct gene set analysis (GSA), information from SNPs needs to be unified at the gene level. A widely used practice is to use only the most relevant SNP per gene; however, there are other methods of integration that could be applied here. Also, the problem of nonrandom association of alleles at two or more loci is often neglected. Here, we tested the impact of incorporation of different integrations and linkage disequilibrium (LD) correction on the performance of several GSA methods. Matched normal and breast cancer samples from The Cancer Genome Atlas database were used to evaluate the performance of six GSA algorithms: Coincident Extreme Ranks in Numerical Observations (CERNO), Gene Set Enrichment Analysis (GSEA), GSEA-SNP, improved GSEA for GWAS (i-GSEA4GWAS), Meta-Analysis Gene-set Enrichment of variaNT Associations (MAGENTA), and Over-Representation Analysis (ORA). Association of SNPs to phenotype was calculated using modified McNemar’s test. Results for SNPs mapped to the same gene were integrated using Fisher and Stouffer methods and compared with the minimum p-value method. Four common measures were used to quantify the performance of all combinations of methods. Results of GSA analysis on GWAS were compared to the one performed on gene expression data. Comparing all evaluation metrics across different GSA algorithms, integrations, and LD correction, we highlighted CERNO, and MAGENTA with Stouffer as the most efficient. Applying LD correction increased prioritization and specificity of enrichment outcomes for all tested algorithms. When Fisher or Stouffer were used with LD, sensitivity and reproducibility were also better. Using any integration method was beneficial in comparison with a minimum p-value method in specific combinations. The correlation between GSA results from genomic and transcriptomic level was the highest when Stouffer integration was combined with LD correction. We thoroughly evaluated different approaches to GSA in GWAS in terms of performance to guide others to select the most effective combinations. We showed that LD correction and Stouffer integration could increase the performance of enrichment analysis and encourage the usage of these techniques.


2020 ◽  
Vol 29 (16) ◽  
pp. 2803-2811
Author(s):  
James P Cook ◽  
Anubha Mahajan ◽  
Andrew P Morris

Abstract The UK Biobank is a prospective study of more than 500 000 participants, which has aggregated data from questionnaires, physical measures, biomarkers, imaging and follow-up for a wide range of health-related outcomes, together with genome-wide genotyping supplemented with high-density imputation. Previous studies have highlighted fine-scale population structure in the UK on a North-West to South-East cline, but the impact of unmeasured geographical confounding on genome-wide association studies (GWAS) of complex human traits in the UK Biobank has not been investigated. We considered 368 325 white British individuals from the UK Biobank and performed GWAS of their birth location. We demonstrate that widely used approaches to adjust for population structure, including principal component analysis and mixed modelling with a random effect for a genetic relationship matrix, cannot fully account for the fine-scale geographical confounding in the UK Biobank. We observe significant genetic correlation of birth location with a range of lifestyle-related traits, including body-mass index and fat mass, hypertension and lung function, even after adjustment for population structure. Variants driving associations with birth location are also strongly associated with many of these lifestyle-related traits after correction for population structure, indicating that there could be environmental factors that are confounded with geography that have not been adequately accounted for. Our findings highlight the need for caution in the interpretation of lifestyle-related trait GWAS in UK Biobank, particularly in loci demonstrating strong residual association with birth location.


2014 ◽  
Vol 96 (5) ◽  
pp. e38-1-10 ◽  
Author(s):  
Nandina Paria ◽  
Lawson A Copley ◽  
John A Herring ◽  
Harry KW Kim ◽  
B Stephens Richards ◽  
...  

Addiction ◽  
2021 ◽  
Author(s):  
Cecilia Dao ◽  
Hang Zhou ◽  
Aeron Small ◽  
Kirsha S. Gordon ◽  
Boyang Li ◽  
...  

2014 ◽  
Vol 115 (suppl_1) ◽  
Author(s):  
Manikandan Panchatcharam ◽  
Sumitra Miriyala ◽  
Natalie D Hendrix ◽  
Diana Escalante-Alcalde

A recent meta-analysis of Genome-Wide Association Studies (GWAS) of coronary artery disease (CAD) involving over 86,000 individuals identified PPAP2B (encodes lipid phosphate phosphatase-3; LPP3) as a new loci (SNP rs17114036; P= 3.81 X 10 −19 ) that independently predicts CAD. LPP3 is an integral membrane enzyme that dephosphorylates lysophosphatidic acid, sphingosine-1-phosphate and related bioactive lipids. Strikingly, we found that targeted inactivation of LPP3 in endothelial cells results in early embryonic lethality, in part to due to vascular patterning defects. In addition, we found that constitutive inactivation of LPP3 in the myocardium results in cardiac dysfunction, indicating that dysregulation of LPP3-dependent cardiomyocyte cell function. Heart rate was significantly higher in conditional Ppap2b Δ mice (P<0.001), which may indicate a role for LPP3 in regulating heart rate and/or function. Further, we found that myocardial LPP3 levels are altered following myocardial infarction in mice and these results are in accord with our myocardial infarct data from patient that had down-regulated myocardial LPP3 levels. The major rs17114036A allele was associated with a 1.17 odds ratio for CAD. This SNP is located in the final intron of the six exon PPAP2B gene. At least seven SNPs are in robust linkage disequilbrium (r2>0.9) with rs17114036. The sequence surrounding the SNP rs17114036 matches a U1 spliceosome recognition sequence, and the variant base is located at the final position of a sequence with high homology to a U1 spliceosome 5’ spice site recognition motif. Binding of the U1 spliceosome may alter mRNA stability or processing, perhaps by masking cryptic splice sites, thus enabling efficient splicing or promoting polyadenylation of the mature message. To address the polymorphism, we transfected MDA MB 453 cells, which are homozygous for the “T’ allele of rs17114036 SNP, with a short synthetic 2’-O-methyl phosphorothioate RNA. We found a dramatic decrease in both LPP3 protein expression and LPP3 mRNA levels. These results imply that the region containing the rs17114036 SNP may be important for proper processing and/or stability of the LPP3 transcript thereby functionally validating the GWAS study.


Sign in / Sign up

Export Citation Format

Share Document