scholarly journals Genome-wide association study in two cohorts from a multi-generational mouse advanced intercross line highlights the difficulty of replication

2018 ◽  
Author(s):  
Xinzhu Zhou ◽  
Celine L. St. Pierre ◽  
Natalia M. Gonzales ◽  
Riyan Cheng ◽  
Apurva Chitre ◽  
...  

AbstractReplication is considered to be critical for genome-wide association studies (GWAS) in humans, but is not routinely performed in model organisms. We explored replication using an advanced intercross line (AIL) which is the simplest possible multigenerational intercross. We re-genotyped a previously published cohort of LG/J x SM/J AIL mice (F34; n=428) using a denser marker set and also genotyped a novel cohort of AIL mice (F39-43; n=600) for the first time. We identified 110 significant loci in the F34 cohort, 36 of which were new discoveries attributable to the denser marker set; we also identified 27 novel significant loci in the F39-43 cohort. For traits measured in both cohorts (locomotor activity, body weight, and coat color), the genetic correlations were high, although, the F39-43 cohort showed systematically lower SNP-heritability estimates. We then attempted to replicate loci identified in either F34 or F39-43 in the other cohort. Albino coat color was robustly replicated; we observed only partial replication of associations for locomotor activity and body weight. Finally, we performed a mega-analysis of locomotor activity and body weight by combining F34 and F39-43 cohorts (n=1,028), which identified four novel loci. The incomplete replication was inconsistent with simulations we performed to estimate our power to replicate. This may reflect: 1) false positives errors in the discovery cohort, 2) environmental or genetic heterogeneity between the two samples, or 3) the systematic over estimation of the effect sizes at significant loci (“Winner’s Curse”). Our results demonstrate that it is difficult to replicate GWAS results even when using similarly sized discovery and replication cohorts drawn from the same population.

2020 ◽  
Vol 10 (3) ◽  
pp. 951-965 ◽  
Author(s):  
Xinzhu Zhou ◽  
Celine L. St. Pierre ◽  
Natalia M. Gonzales ◽  
Jennifer Zou ◽  
Riyan Cheng ◽  
...  

There has been extensive discussion of the “Replication Crisis” in many fields, including genome-wide association studies (GWAS). We explored replication in a mouse model using an advanced intercross line (AIL), which is a multigenerational intercross between two inbred strains. We re-genotyped a previously published cohort of LG/J x SM/J AIL mice (F34; n = 428) using a denser marker set and genotyped a new cohort of AIL mice (F39-43; n = 600) for the first time. We identified 36 novel genome-wide significant loci in the F34 and 25 novel loci in the F39-43 cohort. The subset of traits that were measured in both cohorts (locomotor activity, body weight, and coat color) showed high genetic correlations, although the SNP heritabilities were slightly lower in the F39-43 cohort. For this subset of traits, we attempted to replicate loci identified in either F34 or F39-43 in the other cohort. Coat color was robustly replicated; locomotor activity and body weight were only partially replicated, which was inconsistent with our power simulations. We used a random effects model to show that the partial replications could not be explained by Winner’s Curse but could be explained by study-specific heterogeneity. Despite this heterogeneity, we performed a mega-analysis by combining F34 and F39-43 cohorts (n = 1,028), which identified four novel loci associated with locomotor activity and body weight. These results illustrate that even with the high degree of genetic and environmental control possible in our experimental system, replication was hindered by study-specific heterogeneity, which has broad implications for ongoing concerns about reproducibility.


2018 ◽  
Author(s):  
Geneviève Galarneau ◽  
Pierre Fontanillas ◽  
Caterina Clementi ◽  
Tina Hu-Seliger ◽  
David-Emlyn Parfitt ◽  
...  

AbstractEndometriosis affects ∼10% of women of reproductive age. It is characterized by the growth of endometrial-like tissue outside the uterus and is frequently associated with severe pain and infertility. We performed the largest endometriosis genome-wide association study (GWAS) to date, with 37,183 cases and 251,258 controls. All women were of European ancestry. We also performed the first GWAS of endometriosis-related infertility, including 2,969 cases and 3,770 controls. Our endometriosis GWAS study replicated, at genome-wide significance, seven loci identified in previous endometriosis GWASs (CELA3A-CDC42, SYNE1, KDR, FSHB-ARL14EP, GREB1, ID4, and CEP112) and identified seven new candidate loci with genome-wide significance (NGF, ATP1B1-F5, CD109, HEY2, OSR2-VPS13B, WT1, and TEX11-SLC7A3). No loci demonstrated genome-wide significance for endometriosis-related infertility, however, the three most strongly associated loci (MCTP1, EPS8L3-CSF1, and LPIN1) were in or near genes associated with female fertility or embryonic lethality in model organisms. These results reveal new candidate genes with potential involvement in the pathophysiology of endometriosis and endometriosis-related infertility.


2018 ◽  
Author(s):  
Kristin M. Mignogna ◽  
Silviu A. Bacanu ◽  
Brien P. Riley ◽  
Aaron R. Wolen ◽  
Michael F. Miles

AbstractGenome-wide association studies on alcohol dependence, by themselves, have yet to account for the estimated heritability of the disorder and provide incomplete mechanistic understanding of this complex trait. Integrating brain ethanol-responsive gene expression networks from model organisms with human genetic data on alcohol dependence could aid in identifying dependence-associated genes and functional networks in which they are involved. This study used a modification of the Edge-Weighted Dense Module Searching for genome-wide association studies (EW-dmGWAS) approach to co-analyze whole-genome gene expression data from ethanol-exposed mouse brain tissue, human protein-protein interaction databases and alcohol dependence-related genome-wide association studies. Results revealed novel ethanol-regulated and alcohol dependence-associated gene networks in prefrontal cortex, nucleus accumbens, and ventral tegmental area. Three of these networks were overrepresented with genome-wide association signals from an independent dataset. These networks were significantly overrepresented for gene ontology categories involving several mechanisms, including actin filament-based activity, transcript regulation, Wnt and Syndecan-mediated signaling, and ubiquitination. Together, these studies provide novel insight for brain mechanisms contributing to alcohol dependence.


2020 ◽  
Vol 15 (11) ◽  
pp. 1643-1656
Author(s):  
Adrienne Tin ◽  
Anna Köttgen

The past few years have seen major advances in genome-wide association studies (GWAS) of CKD and kidney function–related traits in several areas: increases in sample size from >100,000 to >1 million, enabling the discovery of >250 associated genetic loci that are highly reproducible; the inclusion of participants not only of European but also of non-European ancestries; and the use of advanced computational methods to integrate additional genomic and other unbiased, high-dimensional data to characterize the underlying genetic architecture and prioritize potentially causal genes and variants. Together with other large-scale biobank and genetic association studies of complex traits, these GWAS of kidney function–related traits have also provided novel insight into the relationship of kidney function to other diseases with respect to their genetic associations, genetic correlation, and directional relationships. A number of studies also included functional experiments using model organisms or cell lines to validate prioritized potentially causal genes and/or variants. In this review article, we will summarize these recent GWAS of CKD and kidney function–related traits, explain approaches for downstream characterization of associated genetic loci and the value of such computational follow-up analyses, and discuss related challenges along with potential solutions to ultimately enable improved treatment and prevention of kidney diseases through genetics.


2018 ◽  
Vol 9 ◽  
Author(s):  
Anahit Nazari-Ghadikolaei ◽  
Hassan Mehrabani-Yeganeh ◽  
Seyed R. Miarei-Aashtiani ◽  
Elizabeth A. Staiger ◽  
Amir Rashidi ◽  
...  

2017 ◽  
Author(s):  
Natalia M. Gonzales ◽  
Jungkyun Seo ◽  
Ana Isabel Hernandez-Cordero ◽  
Celine L. St. Pierre ◽  
Jennifer S. Gregory ◽  
...  

AbstractThe LG/J x SM/J advanced intercross line of mice (LG x SM AIL) is a multigenerational outbred population. High minor allele frequencies, a simple genetic background, and the fully sequenced LG and SM genomes make it a powerful population for genome-wide association studies. Here we use 1,063 AIL mice to identify 126 significant associations for 50 traits relevant to human health and disease. We also identify thousands of cis- and trans-eQTLs in the hippocampus, striatum, and prefrontal cortex of ∼200 mice. We replicate an association between locomotor activity and Csmd1, which we identified in an earlier generation of this AIL, and show that Csmd1 mutant mice recapitulate the locomotor phenotype. Our results demonstrate the utility of the LG x SM AIL as a mapping population, identify numerous novel associations, and shed light on the genetic architecture of mammalian behavior.


2018 ◽  
Author(s):  
Apurva S. Chitre ◽  
Oksana Polesskaya ◽  
Katie Holl ◽  
Jianjun Gao ◽  
Riyan Cheng ◽  
...  

AbstractObjectiveObesity is influenced by genetic and environmental factors. Despite success of human genome wide association studies (GWAS), the specific genes that confer obesity remain largely unknown. The objective of this study was to use outbred rats to identify genetic loci underlying obesity and related morphometric and metabolic traits.MethodsWe measured obesity-relevant traits including body weight, body length, body mass index, fasting glucose, and retroperitoneal, epididymal, and parametrial fat pad weight in 3,173 male and female adult N/NIH heterogeneous stock (HS) rats across three institutions, providing data for the largest rat GWAS to date. Genetic loci were identified using a linear mixed model that accounted for the complex family relationships of the HS and covariate to account for differences among the three phenotyping centers.ResultsWe identified 32 independent loci, several of which contained only a single gene (e.g. Epha5, Nrg1 and Klhl14) or obvious candidate genes (Adcy3, Prlhr). There were strong phenotypic and genetic correlations among obesity-related traits, and extensive pleiotropy at individual loci.ConclusionsThese studies demonstrate utility of HS rats for investigating the genetics of obesity related traits across institutions and identify several candidate genes for future functional testing.


2020 ◽  
Author(s):  
L. Garreta ◽  
I. Cerón-Souza ◽  
M.R. Palacio ◽  
P.H. Reyes-Herrera

AbstractSummaryThe Genome-Wide Association Studies (GWAS) are essential to determine the genetic bases of either ecological or economic phenotypic variation across individuals within populations of model and non-model organisms. For this research question, current practice is the replication of the GWAS testing different parameters and models to validate the reproducibility of results. However, straightforward methodologies that manage both replication and tetraploid data are still missing. To solve this problem, we designed the MultiGWAS, a tool that does GWAS for diploid and tetraploid organisms by executing in parallel four software, two for polyploid data (GWASpoly and SHEsis) and two for diploids data (PLINK and TASSEL). MultiGWAS has several advantages. It runs either in the command line or in an interface. It manages different genotype formats, including VCF. It executes both the full and naïve models using several quality filters. Besides, it calculates a score to choose the best gene action model across GWASPoly and TASSEL. Finally, it generates several reports that facilitate the identification of false associations from both the significant and the best-ranked association SNP among the four software. We tested MultiGWAS with tetraploid potato data. The execution demonstrated that the Venn diagram and the other companion reports (i.e., Manhattan and QQ plots, heatmaps for associated SNP profiles, and chord diagrams to trace associated SNP by chromosomes) were useful to identify associated SNP shared among different models and parameters. Therefore, we confirmed that MultiGWAS is a suitable wrapping tool that successfully handles GWAS replication in both diploid and tetraploid [email protected]


Sign in / Sign up

Export Citation Format

Share Document