scholarly journals A Multi-Locus Association Model Framework for Nested Association Mapping With Discriminating QTL Effects in Various Subpopulations

2021 ◽  
Vol 11 ◽  
Author(s):  
Suhong Bu ◽  
Weiren Wu ◽  
Yuan-Ming Zhang

Nested association mapping (NAM) has been an invaluable approach for plant genetics community and can dissect the genetic architecture of complex traits. As the most popular NAM analysis strategy, joint multifamily mapping can combine all information from diverse genetic backgrounds and increase population size. However, it is influenced by the genetic heterogeneity of quantitative trait locus (QTL) across various subpopulations. Multi-locus association mapping has been proven to be powerful in many cases of QTL mapping and genome-wide association studies. Therefore, we developed a multi-locus association model of multiple families in the NAM population, which could discriminate the effects of QTLs in all subpopulations. A series of simulations with a real maize NAM genomic data were implemented. The results demonstrated that the new method improves the statistical power in QTL detection and the accuracy in QTL effect estimation. The new approach, along with single-family linkage mapping, was used to identify QTLs for three flowering time traits in the maize NAM population. As a result, most QTLs detected in single family linkage mapping were identified by the new method. In addition, the new method also mapped some new QTLs with small effects, although their functions need to be identified in the future.

2017 ◽  
Author(s):  
Haohan Wang ◽  
Xiang Liu ◽  
Yunpeng Xiao ◽  
Ming Xu ◽  
Eric P. Xing

AbstractGenome-wide Association Study has presented a promising way to understand the association between human genomes and complex traits. Many simple polymorphic loci have been shown to explain a significant fraction of phenotypic variability. However, challenges remain in the non-triviality of explaining complex traits associated with multifactorial genetic loci, especially considering the confounding factors caused by population structure, family structure, and cryptic relatedness. In this paper, we propose a Squared-LMM (LMM2) model, aiming to jointly correct population and genetic confounding factors. We offer two strategies of utilizing LMM2 for association mapping: 1) It serves as an extension of univariate LMM, which could effectively correct population structure, but consider each SNP in isolation. 2) It is integrated with the multivariate regression model to discover association relationship between complex traits and multifactorial genetic loci. We refer to this second model as sparse Squared-LMM (sLMM2). Further, we extend LMM2/sLMM2 by raising the power of our squared model to the LMMn/sLMMn model. We demonstrate the practical use of our model with synthetic phenotypic variants generated from genetic loci of Arabidopsis Thaliana. The experiment shows that our method achieves a more accurate and significant prediction on the association relationship between traits and loci. We also evaluate our models on collected phenotypes and genotypes with the number of candidate genes that the models could discover. The results suggest the potential and promising usage of our method in genome-wide association studies.


2016 ◽  
Author(s):  
Jimmy Z Liu ◽  
Yaniv Erlich ◽  
Joseph K Pickrell

AbstractThe case-control association study is a powerful method for identifying genetic variants that influence disease risk. However, the collection of cases can be time-consuming and expensive; if a disease occurs late in life or is rapidly lethal, it may be more practical to identify family members of cases. Here, we show that replacing cases with their first-degree relatives enables genome-wide association studies by proxy (GWAX). In randomly-ascertained cohorts, this approach enables previously infeasible studies of diseases that are absent (or nearly absent) in the cohort. As an illustration, we performed GWAX of 12 common diseases in 116,196 individuals from the UK Biobank. By combining these results with published GWAS summary statistics in a meta-analysis, we replicated established risk loci and identified 17 newly associated risk loci: four in Alzheimer’s disease, eight in coronary artery disease, and five in type 2 diabetes. In addition to informing disease biology, our results demonstrate the utility of association mapping using family history of disease as a phenotype to be mapped. We anticipate that this approach will prove useful in future genetic studies of complex traits in large population cohorts.


2018 ◽  
Author(s):  
Ping Zeng ◽  
Xinjie Hao ◽  
Xiang Zhou

AbstractMotivationGenome-wide association studies (GWASs) have identified many genetic loci associated with complex traits. A substantial fraction of these identified loci are associated with multiple traits – a phenomena known as pleiotropy. Identification of pleiotropic associations can help characterize the genetic relationship among complex traits and can facilitate our understanding of disease etiology. Effective pleiotropic association mapping requires the development of statistical methods that can jointly model multiple traits with genome-wide SNPs together.ResultsWe develop a joint modeling method, which we refer to as the integrative MApping of Pleiotropic association (iMAP). iMAP models summary statistics from GWASs, uses a multivariate Gaussian distribution to account for phenotypic correlation, simultaneously infers genome-wide SNP association pattern using mixture modeling, and has the potential to reveal causal relationship between traits. Importantly, iMAP integrates a large number of SNP functional annotations to substantially improve association mapping power, and, with a sparsity-inducing penalty, is capable of selecting informative annotations from a large, potentially noninformative set. To enable scalable inference of iMAP to association studies with hundreds of thousands of individuals and millions of SNPs, we develop an efficient expectation maximization algorithm based on an approximate penalized regression algorithm. With simulations and comparisons to existing methods, we illustrate the benefits of iMAP both in terms of high association mapping power and in terms of accurate estimation of genome-wide SNP association patterns. Finally, we apply iMAP to perform a joint analysis of 48 traits from 31 GWAS consortia together with 40 tissue-specific SNP annotations generated from the Roadmap Project. iMAP is freely available at www.xzlab.org/software.html.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Chao-Yu Guo ◽  
Reng-Hong Wang ◽  
Hsin-Chou Yang

AbstractAfter the genome-wide association studies (GWAS) era, whole-genome sequencing is highly engaged in identifying the association of complex traits with rare variations. A score-based variance-component test has been proposed to identify common and rare genetic variants associated with complex traits while quickly adjusting for covariates. Such kernel score statistic allows for familial dependencies and adjusts for random confounding effects. However, the etiology of complex traits may involve the effects of genetic and environmental factors and the complex interactions between genes and the environment. Therefore, in this research, a novel method is proposed to detect gene and gene-environment interactions in a complex family-based association study with various correlated structures. We also developed an R function for the Fast Gene-Environment Sequence Kernel Association Test (FGE-SKAT), which is freely available as supplementary material for easy GWAS implementation to unveil such family-based joint effects. Simulation studies confirmed the validity of the new strategy and the superior statistical power. The FGE-SKAT was applied to the whole genome sequence data provided by Genetic Analysis Workshop 18 (GAW18) and discovered concordant and discordant regions compared to the methods without considering gene by environment interactions.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Gregory R. Keele ◽  
Jeremy W. Prokop ◽  
Hong He ◽  
Katie Holl ◽  
John Littrell ◽  
...  

AbstractChronic kidney disease (CKD), which can ultimately progress to kidney failure, is influenced by genetics and the environment. Genes identified in human genome wide association studies (GWAS) explain only a small proportion of the heritable variation and lack functional validation, indicating the need for additional model systems. Outbred heterogeneous stock (HS) rats have been used for genetic fine-mapping of complex traits, but have not previously been used for CKD traits. We performed GWAS for urinary protein excretion (UPE) and CKD related serum biochemistries in 245 male HS rats. Quantitative trait loci (QTL) were identified using a linear mixed effect model that tested for association with imputed genotypes. Candidate genes were identified using bioinformatics tools and targeted RNAseq followed by testing in a novel in vitro model of human tubule, hypoxia-induced damage. We identified two QTL for UPE and five for serum biochemistries. Protein modeling identified a missense variant within Septin 8 (Sept8) as a candidate for UPE. Sept8/SEPTIN8 expression increased in HS rats with elevated UPE and tubulointerstitial injury and in the in vitro hypoxia model. SEPTIN8 is detected within proximal tubule cells in human kidney samples and localizes with acetyl-alpha tubulin in the culture system. After hypoxia, SEPTIN8 staining becomes diffuse and appears to relocalize with actin. These data suggest a role of SEPTIN8 in cellular organization and structure in response to environmental stress. This study demonstrates that integration of a rat genetic model with an environmentally induced tubule damage system identifies Sept8/SEPTIN8 and informs novel aspects of the complex gene by environmental interactions contributing to CKD risk.


2016 ◽  
Vol 283 (1835) ◽  
pp. 20160569 ◽  
Author(s):  
M. E. Goddard ◽  
K. E. Kemper ◽  
I. M. MacLeod ◽  
A. J. Chamberlain ◽  
B. J. Hayes

Complex or quantitative traits are important in medicine, agriculture and evolution, yet, until recently, few of the polymorphisms that cause variation in these traits were known. Genome-wide association studies (GWAS), based on the ability to assay thousands of single nucleotide polymorphisms (SNPs), have revolutionized our understanding of the genetics of complex traits. We advocate the analysis of GWAS data by a statistical method that fits all SNP effects simultaneously, assuming that these effects are drawn from a prior distribution. We illustrate how this method can be used to predict future phenotypes, to map and identify the causal mutations, and to study the genetic architecture of complex traits. The genetic architecture of complex traits is even more complex than previously thought: in almost every trait studied there are thousands of polymorphisms that explain genetic variation. Methods of predicting future phenotypes, collectively known as genomic selection or genomic prediction, have been widely adopted in livestock and crop breeding, leading to increased rates of genetic improvement.


2021 ◽  
Vol 42 (1) ◽  
Author(s):  
Dinesh K. Saini ◽  
Yuvraj Chopra ◽  
Jagmohan Singh ◽  
Karansher S. Sandhu ◽  
Anand Kumar ◽  
...  

Circulation ◽  
2008 ◽  
Vol 118 (suppl_18) ◽  
Author(s):  
Margarete Mehrabian ◽  
Charles Farber ◽  
Peter Langfelder ◽  
Anatole Ghazalpour ◽  
Zhiqiang Zhou ◽  
...  

A recent meta-analysis of three large genome-wide association studies for HDL cholesterol levels revealed several highly significant associations, but altogether these explained less than 10% of the population variance of HDL. Since HDL levels are highly heritable, with heritability estimated at 50–70% in many studies, there are clearly many additional genes, and probably complex genetic and environmental interactions, involved in HDL metabolism. Thus, if “personalized medicine” is to become a reality, these complex factors must be addressed. Combined genetic-genomic approaches have rejuvenated the analysis of complex traits using mouse models, and here report an integrative genomic study of HDL in a large mouse cross. We previously reported the identification of loci associated with HDL cholesterol concentrations using a CXB F2 intercross. We have now generated a much larger CXB cross, consisting of 438 mice, and have integrated genome wide gene expression analysis of liver and adipose with quantitative trait locus (QTL) mapping and causality modeling. These studies were carried out on mice fed a low fat, chow diet and then switched to a high fat, ’Western’ diet. QTL analysis on the clinical traits using R/QTL (http://cran.r-project.org/) revealed a complex inheritance pattern with significant LOD scores at 9 loci, on chromosomes 1,2,4,5,8,9,10,16,18. Of these loci, 6 (chr: 1,4,5,10,16,18) were seen to be involved in genetic-dietary regulation of HDL cholesterol. Expression QTLs (eQTL) were determined using Agilent microarrays for 23,624 transcripts. Genes expressed within a 1-LOD support interval or correlated with HDL (p<2.7E-11) in both adipose and liver were identified. Using Network Edge Orienting (NEO) methods, causal relationships between the identified genes, related QTL peak markers and HDL levels were accessed. The genes were then ranked based on the NEO scores. In liver the highest ranked genes were associated with mitochondrial, ER and golgi trafficking. In adipose, on the other hand, pathways associated with cell signaling, transcription regulation and protein ubiquitation were predicted to be causal for HDL levels. In conclusion, our results reveal a large number of novel pathways and candidate genes for plasma lipid metabolism. This research has received full or partial funding support from the American Heart Association, AHA Western States Affiliate (California, Nevada & Utah).


Sign in / Sign up

Export Citation Format

Share Document