Robust Reference Powered Association Test of genome-wide association studies

ABSTRACT Genome-wide association studies (GWAS) have identified abundant genetic susceptibility loci, although they have fallen far short of earlier expectations because of low statistical power and false-positive results. Effective statistical methods are required to further improve the analysis of massive GWAS data. Here we present a new statistic (Robust Reference Powered Association Test, http://drwang.top/gwas.html) that uses large public databases as a reference to reduce concern about potential population stratification. To evaluate the performance of this statistic in various situations, we simulated multiple combinations of sample sizes and frequencies to compute statistical power. Furthermore, we applied our method to several real datasets (psoriasis genome-wide association datasets and a schizophrenia genome-wide association dataset) to evaluate its performance. Careful analyses indicated that our newly developed statistic outperformed several previously developed GWAS applications. Importantly, this statistic is more robust than the naive merging method in the presence of small control-reference differentiation and is therefore likely to detect more association signals.
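
The power simulation and the "naive merging" baseline mentioned in the abstract can be illustrated with a minimal sketch. This is not the Robust Reference Powered Association Test itself, whose formula the abstract does not give; it only simulates an ordinary 1-df allelic chi-square test, once with the study's own controls and once after pooling those controls with a public reference panel. All function names, sample sizes, frequencies, and the significance level below are assumptions chosen for illustration.

```python
import math
import random


def allelic_chi2_pvalue(case_alt, case_n, ctrl_alt, ctrl_n):
    """P-value of the 1-df Pearson chi-square test on a 2x2 allele-count table.

    *_alt is the number of alternate alleles, *_n the total number of alleles.
    """
    total = case_n + ctrl_n
    alt = case_alt + ctrl_alt
    if alt == 0 or alt == total:
        return 1.0  # monomorphic in the sample: no evidence either way
    diff = case_alt * (ctrl_n - ctrl_alt) - ctrl_alt * (case_n - case_alt)
    chi2 = total * diff ** 2 / (case_n * ctrl_n * alt * (total - alt))
    # Survival function of a chi-square variable with 1 degree of freedom.
    return math.erfc(math.sqrt(chi2 / 2.0))


def draw_alt_alleles(n_people, freq, rng):
    """Simulate the alternate-allele count among 2 * n_people alleles."""
    return sum(rng.random() < freq for _ in range(2 * n_people))


def estimate_power(case_freq, ctrl_freq, ref_freq,
                   n_cases, n_ctrls, n_ref,
                   alpha=0.05, n_sim=200, seed=1):
    """Return (power with own controls, power after naive merging)."""
    rng = random.Random(seed)
    hits_own = hits_merged = 0
    for _ in range(n_sim):
        case_alt = draw_alt_alleles(n_cases, case_freq, rng)
        ctrl_alt = draw_alt_alleles(n_ctrls, ctrl_freq, rng)
        ref_alt = draw_alt_alleles(n_ref, ref_freq, rng)
        p_own = allelic_chi2_pvalue(case_alt, 2 * n_cases,
                                    ctrl_alt, 2 * n_ctrls)
        # Naive merging: treat reference samples as extra controls.
        p_merged = allelic_chi2_pvalue(case_alt, 2 * n_cases,
                                       ctrl_alt + ref_alt,
                                       2 * (n_ctrls + n_ref))
        hits_own += p_own < alpha
        hits_merged += p_merged < alpha
    return hits_own / n_sim, hits_merged / n_sim


if __name__ == "__main__":
    # Hypothetical scenario: reference frequency equals control frequency.
    own, merged = estimate_power(0.25, 0.20, 0.20, 500, 500, 2000)
    print(f"power with study controls only: {own:.2f}")
    print(f"power after naive merging:      {merged:.2f}")
```

Setting `ref_freq` slightly away from `ctrl_freq` in this sketch mimics the control-reference differentiation that the abstract says the naive merging method handles poorly.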

Identification of six novel susceptibility loci for dyslipidemia by longitudinal exome-wide association studies in Japanese

European Heart Journal ◽

10.1093/ehjci/ehaa946.3756 ◽

2020 ◽

Vol 41 (Supplement_2) ◽

Author(s):

M Oguri ◽

K Kato ◽

H Horibe ◽

T Fujimaki ◽

J Sakuma ◽

...

Keyword(s):

Serum Lipid ◽

Ldl Cholesterol ◽

Association Studies ◽

Density Lipoprotein ◽

Hdl Cholesterol ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Susceptibility Loci ◽

Cross Sectional ◽

Genome Wide

Abstract

Background: The circulating concentrations of triglycerides, high density lipoprotein (HDL)-cholesterol, and low density lipoprotein (LDL)-cholesterol have a substantial genetic component. Although previous genome-wide association studies identified various genes and loci related to plasma lipid levels, those studies were conducted in a cross-sectional manner.

Purpose: The purpose of the study was to identify genetic variants that confer susceptibility to hypertriglyceridemia, hypo-HDL-cholesterolemia, and hyper-LDL-cholesterolemia in Japanese. We have now performed longitudinal exome-wide association studies (EWASs) to identify novel loci for dyslipidemia by examining temporal changes in serum lipid profiles.

Methods: Longitudinal EWASs (mean follow-up period, 5 years) for hypertriglyceridemia (2056 cases, 3966 controls), hypo-HDL-cholesterolemia (698 cases, 5324 controls), and hyper-LDL-cholesterolemia (2769 cases, 3251 controls) were performed with Illumina Human Exome arrays. The relation of genotypes of 24,691 single nucleotide polymorphisms (SNPs) that passed quality control to dyslipidemia-related traits was examined with the generalized estimating equation (GEE). To compensate for multiple comparisons of genotypes with each of the three conditions, we applied Bonferroni's correction for statistical significance of association. Replication studies with cross-sectional data were performed for hypertriglyceridemia (2685 cases, 4703 controls), hypo-HDL-cholesterolemia (1947 cases, 6146 controls), and hyper-LDL-cholesterolemia (1719 cases, 5833 controls).

Results: Longitudinal EWASs revealed that 30 SNPs were significantly (P < 2.03 × 10^-6 by GEE) associated with hypertriglyceridemia, 46 SNPs with hypo-HDL-cholesterolemia, and 25 SNPs with hyper-LDL-cholesterolemia. After examination of the relation of identified SNPs to serum lipid profiles, linkage disequilibrium, and results of previous genome-wide association studies, we newly identified rs74416240 of TCHP, rs925368 of GIT2, rs7969300 of ATXN2, and rs12231744 of NAA25 as susceptibility loci for hypo-HDL-cholesterolemia; and rs34902660 of SLC17A3 and rs1042127 of CDSN for hyper-LDL-cholesterolemia. These SNPs were not in linkage disequilibrium with those previously reported to be associated with dyslipidemia, indicating independent effects of the SNPs identified in the present study on serum concentrations of HDL-cholesterol or LDL-cholesterol in Japanese. According to allele frequency data from the 1000 Genomes project database, five of the six identified SNPs were monomorphic or rare variants in European populations. In the replication study, all six SNPs were associated with dyslipidemia-related phenotypes.

Conclusion: We have thus identified six novel loci that confer susceptibility to hypo-HDL-cholesterolemia or hyper-LDL-cholesterolemia. Determination of genotypes for the SNPs at these loci may prove informative for assessment of the genetic risk for dyslipidemia in Japanese.

Funding Acknowledgement: Type of funding source: None
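
The significance threshold in the abstract above is consistent with a Bonferroni correction over the 24,691 SNPs that passed quality control. The short check below assumes a family-wise significance level of 0.05, which the abstract does not state explicitly; the function name is illustrative only.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under Bonferroni's correction."""
    return alpha / n_tests


N_SNPS = 24_691          # SNPs passing quality control (from the abstract)
FAMILY_ALPHA = 0.05      # assumed family-wise level, not stated in the abstract

threshold = bonferroni_threshold(FAMILY_ALPHA, N_SNPS)

if __name__ == "__main__":
    # Compare with the reported cutoff of P < 2.03e-6.
    print(f"per-SNP threshold: {threshold:.2e}")
```

Under that assumption the per-SNP threshold is 0.05 / 24,691, which agrees with the reported cutoff to within rounding.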

Rare Coding Variants and Breast Cancer Risk: Evaluation of Susceptibility Loci Identified in Genome-Wide Association Studies

Cancer Epidemiology Biomarkers & Prevention ◽

10.1158/1055-9965.epi-13-1043 ◽

2014 ◽

Vol 23 (4) ◽

pp. 622-628 ◽

Cited By ~ 19

Author(s):

Yanfeng Zhang ◽

Jirong Long ◽

Wei Lu ◽

Xiao-Ou Shu ◽

Qiuyin Cai ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Evaluation ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Susceptibility Loci ◽

Genome Wide ◽

Coding Variants

Statistical power and utility of meta-analysis methods for cross-phenotype genome-wide association studies

PLoS ONE ◽

10.1371/journal.pone.0193256 ◽

2018 ◽

Vol 13 (3) ◽

pp. e0193256 ◽

Cited By ~ 13

Author(s):

Zhaozhong Zhu ◽

Verneri Anttila ◽

Jordan W. Smoller ◽

Phil H. Lee

Keyword(s):

Statistical Power ◽

Association Studies ◽

Meta Analysis ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Analysis Methods ◽

Genome Wide

Genome-wide association studies for corneal and refractive astigmatism in UK Biobank demonstrate a shared role for myopia susceptibility loci

Human Genetics ◽

10.1007/s00439-018-1942-8 ◽

2018 ◽

Vol 137 (11-12) ◽

pp. 881-896 ◽

Cited By ~ 11

Author(s):

Rupal L. Shah ◽

Jeremy A. Guggenheim

Keyword(s):

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Uk Biobank ◽

Susceptibility Loci ◽

Genome Wide

Statistical Power of Model Selection Strategies for Genome-Wide Association Studies

PLoS Genetics ◽

10.1371/journal.pgen.1000582 ◽

2009 ◽

Vol 5 (7) ◽

pp. e1000582 ◽

Cited By ~ 14

Author(s):

Zheyang Wu ◽

Hongyu Zhao

Keyword(s):

Model Selection ◽

Statistical Power ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Selection Strategies ◽

Genome Wide

Combining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies

Scientific Reports ◽

10.1038/srep36671 ◽

2016 ◽

Vol 6 (1) ◽

Cited By ~ 20

Author(s):

Bettina Mieth ◽

Marius Kloft ◽

Juan Antonio Rodríguez ◽

Sören Sonnenburg ◽

Robin Vobruba ◽

...

Keyword(s):

Machine Learning ◽

Hypothesis Testing ◽

Statistical Power ◽

Association Studies ◽

Multiple Hypothesis Testing ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Multiple Hypothesis ◽

Genome Wide

Retrospective Association Analysis of Longitudinal Binary Traits Identifies Important Loci and Pathways in Cocaine Use

Genetics ◽

10.1534/genetics.119.302598 ◽

2019 ◽

Vol 213 (4) ◽

pp. 1225-1236 ◽

Cited By ~ 1

Author(s):

Weimiao Wu ◽

Zhong Wang ◽

Ke Xu ◽

Xinyu Zhang ◽

Amei Amei ◽

...

Keyword(s):

Association Analysis ◽

Binary Data ◽

Association Studies ◽

Association Test ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Cocaine Use ◽

Genome Wide ◽

A Genome ◽

Time Varying Covariates

Longitudinal phenotypes have become increasingly available in genome-wide association studies (GWAS) and electronic health record-based studies for identification of genetic variants that influence complex traits over time. For longitudinal binary data, there remain significant challenges in gene mapping, including misspecification of the model for phenotype distribution due to ascertainment. Here, we propose L-BRAT (Longitudinal Binary-trait Retrospective Association Test), a retrospective, generalized estimating equation-based method for genetic association analysis of longitudinal binary outcomes. We also develop RGMMAT, a retrospective, generalized linear mixed model-based association test. Both tests are retrospective score approaches in which genotypes are treated as random conditional on phenotype and covariates. They allow both static and time-varying covariates to be included in the analysis. Through simulations, we illustrate that retrospective association tests are robust to ascertainment and other types of phenotype model misspecification, and gain power over previous association methods. We applied L-BRAT and RGMMAT to a genome-wide association analysis of repeated measures of cocaine use in a longitudinal cohort. Pathway analysis implicated association with opioid signaling and axonal guidance signaling pathways. Lastly, we replicated important pathways in an independent cocaine dependence case-control GWAS. Our results illustrate that L-BRAT is able to detect important loci and pathways in a genome scan and to provide insights into the genetic architecture of cocaine use.

12 new susceptibility loci for prostate cancer identified by genome-wide association study in Japanese population

Nature Communications ◽

10.1038/s41467-019-12267-6 ◽

2019 ◽

Vol 10 (1) ◽

Cited By ~ 17

Author(s):

Ryo Takata ◽

Atsushi Takahashi ◽

Masashi Fujita ◽

Yukihide Momozawa ◽

Edward J. Saunders ◽

...

Keyword(s):

Prostate Cancer ◽

Japanese Population ◽

Genome Wide Association Study ◽

Association Studies ◽

European Population ◽

Genome Wide Association ◽

High Risk Population ◽

Genome Wide Association Studies ◽

Susceptibility Loci ◽

Genome Wide

Abstract: Genome-wide association studies (GWAS) have identified ~170 genetic loci associated with prostate cancer (PCa) risk, but most of them were identified in European populations. Here we performed a GWAS and replication study using a large Japanese cohort (9,906 cases and 83,943 male controls) to identify novel susceptibility loci associated with PCa risk. We found 12 novel loci for PCa, including rs1125927 (TMEM17, P = 3.95 × 10^-16), rs73862213 (GATA2, P = 5.87 × 10^-23), rs77911174 (ZMIZ1, P = 5.28 × 10^-20), and rs138708 (SUN2, P = 1.13 × 10^-15), seven of which had very low minor allele frequencies in European populations. Furthermore, we stratified the polygenic risk for Japanese PCa patients by using 82 SNPs that were significantly associated with Japanese PCa risk in our study, and found that early-onset cases and cases with a family history of PCa were enriched in the genetically high-risk population. Our study provides important insight into the genetic mechanisms of PCa and facilitates PCa risk stratification in the Japanese population.
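
Polygenic risk stratification of the kind described above is commonly done with a weighted sum of risk-allele counts. The abstract does not say how the 82-SNP score was constructed, so the sketch below assumes that common convention; the function names, the three-SNP weights, and the genotypes are hypothetical placeholders, not values from the study.

```python
def polygenic_risk_score(dosages, weights):
    """Weighted sum of risk-allele counts (0, 1, or 2 per SNP).

    weights are per-SNP effect sizes, e.g. log odds ratios (assumed here).
    """
    if len(dosages) != len(weights):
        raise ValueError("need one weight per SNP")
    return sum(d * w for d, w in zip(dosages, weights))


def top_fraction(scores, fraction=0.1):
    """Indices of individuals whose scores fall in the top `fraction`."""
    k = max(1, round(len(scores) * fraction))
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(order[:k])


if __name__ == "__main__":
    # Hypothetical effect sizes for three SNPs (a real score would use 82).
    weights = [0.10, 0.20, 0.30]
    # Hypothetical risk-allele counts for four individuals.
    cohort = [[0, 1, 2], [2, 2, 2], [0, 0, 0], [1, 0, 1]]
    scores = [polygenic_risk_score(g, weights) for g in cohort]
    print("scores:", scores)
    print("high-risk individuals:", top_fraction(scores, fraction=0.25))
```

Individuals returned by `top_fraction` would correspond to a genetically high-risk group, in which the enrichment of early-onset or family-history cases could then be examined.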
