Comprehensive evaluation of mapping complex traits in wheat using genome-wide association studies

SummaryGenome-wide association studies (GWAS) have been used to study the genetic basis of a wide variety of complex diseases and other traits. However, for most traits it remains difficult to interpret what genes and biological processes are impacted by the top hits. Here, as a contrast, we describe UK Biobank GWAS results for three molecular traits—urate, IGF-1, and testosterone—that are biologically simpler than most diseases, and for which we know a great deal in advance about the core genes and pathways. Unlike most GWAS of complex traits, for all three traits we find that most top hits are readily interpretable. We observe huge enrichment of significant signals near genes involved in the relevant biosynthesis, transport, or signaling pathways. We show how GWAS data illuminate the biology of variation in each trait, including insights into differences in testosterone regulation between females and males. Meanwhile, in other respects the results are reminiscent of GWAS for more-complex traits. In particular, even these molecular traits are highly polygenic, with most of the variance coming not from core genes, but from thousands to tens of thousands of variants spread across most of the genome. Given that diseases are often impacted by many distinct biological processes, including these three, our results help to illustrate why so many variants can affect risk for any given disease.

Download Full-text

GWAS-Flow: A GPU accelerated framework for efficient permutation based genome-wide association studies

10.1101/783100 ◽

2019 ◽

Cited By ~ 2

Author(s):

Jan A. Freudenthal ◽

Markus J. Ankenbrand ◽

Dominik G. Grimm ◽

Arthur Korte

Keyword(s):

Complex Traits ◽

Mixed Model ◽

Linear Mixed Model ◽

Association Studies ◽

Large Datasets ◽

Genome Wide Association ◽

Small Data ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Non Gaussian

AbstractMotivationGenome-wide association studies (GWAS) are one of the most commonly used methods to detect associations between complex traits and genomic polymorphisms. As both genotyping and phenotyping of large populations has become easier, typical modern GWAS have to cope with massive amounts of data. Thus, the computational demand for these analyses grew remarkably during the last decades. This is especially true, if one wants to implement permutation-based significance thresholds, instead of using the naïve Bonferroni threshold. Permutation-based methods have the advantage to provide an adjusted multiple hypothesis correction threshold that takes the underlying phenotypic distribution into account and will thus remove the need to find the correct transformation for non Gaussian phenotypes. To enable efficient analyses of large datasets and the possibility to compute permutation-based significance thresholds, we used the machine learning framework TensorFlow to develop a linear mixed model (GWAS-Flow) that can make use of the available CPU or GPU infrastructure to decrease the time of the analyses especially for large datasets.ResultsWe were able to show that our application GWAS-Flow outperforms custom GWAS scripts in terms of speed without loosing accuracy. Apart from p-values, GWAS-Flow also computes summary statistics, such as the effect size and its standard error for each individual marker. The CPU-based version is the default choice for small data, while the GPU-based version of GWAS-Flow is especially suited for the analyses of big data.AvailabilityGWAS-Flow is freely available on GitHub (https://github.com/Joyvalley/GWAS_Flow) and is released under the terms of the MIT-License.

Download Full-text

Advances in genome-wide association studies of complex traits in rice

Theoretical and Applied Genetics ◽

10.1007/s00122-019-03473-3 ◽

2019 ◽

Vol 133 (5) ◽

pp. 1415-1425 ◽

Cited By ~ 1

Author(s):

Qin Wang ◽

Jiali Tang ◽

Bin Han ◽

Xuehui Huang

Keyword(s):

Complex Traits ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

Download Full-text

Impacts of Population Structure and Analytical Models in Genome-Wide Association Studies of Complex Traits in Forest Trees: A Case Study in Eucalyptus globulus

PLoS ONE ◽

10.1371/journal.pone.0081267 ◽

2013 ◽

Vol 8 (11) ◽

pp. e81267 ◽

Cited By ~ 40

Author(s):

Eduardo P. Cappa ◽

Yousry A. El-Kassaby ◽

Martín N. Garcia ◽

Cintia Acuña ◽

Nuno M. G. Borralho ◽

...

Keyword(s):

Population Structure ◽

Complex Traits ◽

Eucalyptus Globulus ◽

Association Studies ◽

Genome Wide Association ◽

Analytical Models ◽

Genome Wide Association Studies ◽

Forest Trees ◽

Genome Wide

Download Full-text

Integrative Tissue-Specific Functional Annotations in the Human Genome Provide Novel Insights on Many Complex Traits and Improve Signal Prioritization in Genome Wide Association Studies

PLoS Genetics ◽

10.1371/journal.pgen.1005947 ◽

2016 ◽

Vol 12 (4) ◽

pp. e1005947 ◽

Cited By ~ 56

Author(s):

Qiongshi Lu ◽

Ryan Lee Powles ◽

Qian Wang ◽

Beixin Julie He ◽

Hongyu Zhao

Keyword(s):

Human Genome ◽

Complex Traits ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Tissue Specific ◽

Functional Annotations ◽

Genome Wide

Download Full-text

Genome-wide association and genomic selection in animal breedingThis article is one of a selection of papers from the conference “Exploiting Genome-wide Association in Oilseed Brassicas: a model for genetic improvement of major OECD crops for sustainable farming”.

Genome ◽

10.1139/g10-076 ◽

2010 ◽

Vol 53 (11) ◽

pp. 876-883 ◽

Cited By ~ 135

Author(s):

Ben Hayes ◽

Mike Goddard

Keyword(s):

Genomic Selection ◽

Complex Traits ◽

Association Studies ◽

Genome Wide Association ◽

Relationship Matrix ◽

Genome Wide Association Studies ◽

Simple Method ◽

Breeding Values ◽

Genome Wide ◽

A Genome

Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

Download Full-text

Genome-wide association studies for complex traits: consensus, uncertainty and challenges

Nature Reviews Genetics ◽

10.1038/nrg2344 ◽

2008 ◽

Vol 9 (5) ◽

pp. 356-369 ◽

Cited By ~ 1854

Author(s):

Mark I. McCarthy ◽

Gonçalo R. Abecasis ◽

Lon R. Cardon ◽

David B. Goldstein ◽

Julian Little ◽

...

Keyword(s):

Complex Traits ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

Download Full-text

Causal Inference for Heritable Phenotypic Risk Factors Using Heterogeneous Genetic Instruments

10.1101/2020.05.06.077982 ◽

2020 ◽

Author(s):

Jingshu Wang ◽

Qingyuan Zhao ◽

Jack Bowden ◽

Gilbran Hemani ◽

George Davey Smith ◽

...

Keyword(s):

Risk Factors ◽

Complex Traits ◽

Mendelian Randomization ◽

Causal Effect ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genetic Associations ◽

Genome Wide ◽

Genetic Instruments

Over a decade of genome-wide association studies have led to the finding that significant genetic associations tend to spread across the genome for complex traits. The extreme polygenicity where "all genes affect every complex trait" complicates Mendelian Randomization studies, where natural genetic variations are used as instruments to infer the causal effect of heritable risk factors. We reexamine the assumptions of existing Mendelian Randomization methods and show how they need to be clarified to allow for pervasive horizontal pleiotropy and heterogeneous effect sizes. We propose a comprehensive framework GRAPPLE (Genome-wide mR Analysis under Pervasive PLEiotropy) to analyze the causal effect of target risk factors with heterogeneous genetic instruments and identify possible pleiotropic patterns from data. By using summary statistics from genome-wide association studies, GRAPPLE can efficiently use both strong and weak genetic instruments, detect the existence of multiple pleiotropic pathways, adjust for confounding risk factors, and determine the causal direction. With GRAPPLE, we analyze the effect of blood lipids, body mass index, and systolic blood pressure on 25 disease outcomes, gaining new information on their causal relationships and the potential pleiotropic pathways.

Download Full-text