scholarly journals LabWAS: Novel findings and study design recommendations from a meta-analysis of clinical labs in two independent biobanks

PLoS Genetics ◽  
2020 ◽  
Vol 16 (11) ◽  
pp. e1009077
Author(s):  
Jeffery A. Goldstein ◽  
Joshua S. Weinstock ◽  
Lisa A. Bastarache ◽  
Daniel B. Larach ◽  
Lars G. Fritsche ◽  
...  

Phenotypes extracted from Electronic Health Records (EHRs) are increasingly prevalent in genetic studies. EHRs contain hundreds of distinct clinical laboratory test results, providing a trove of health data beyond diagnoses. Such lab data is complex and lacks a ubiquitous coding scheme, making it more challenging than diagnosis data. Here we describe the first large-scale cross-health system genome-wide association study (GWAS) of EHR-based quantitative laboratory-derived phenotypes. We meta-analyzed 70 lab traits matched between the BioVU cohort from the Vanderbilt University Health System and the Michigan Genomics Initiative (MGI) cohort from Michigan Medicine. We show high replication of known association for these traits, validating EHR-based measurements as high-quality phenotypes for genetic analysis. Notably, our analysis provides the first replication for 699 previous GWAS associations across 46 different traits. We discovered 31 novel associations at genome-wide significance for 22 distinct traits, including the first reported associations for two lab-based traits. We replicated 22 of these novel associations in an independent tranche of BioVU samples. The summary statistics for all association tests are freely available to benefit other researchers. Finally, we performed mirrored analyses in BioVU and MGI to assess competing analytic practices for EHR lab traits. We find that using the mean of all available lab measurements provides a robust summary value, but alternate summarizations can improve power in certain circumstances. This study provides a proof-of-principle for cross health system GWAS and is a framework for future studies of quantitative EHR lab traits.

Author(s):  
Jeffery A. Goldstein ◽  
Joshua S. Weinstock ◽  
Lisa A. Bastarache ◽  
Daniel B. Larach ◽  
Lars G. Fritsche ◽  
...  

ABSTRACTPhenotypes extracted from Electronic Health Records (EHRs) are increasingly prevalent in genetic studies. EHRs contain hundreds of distinct clinical laboratory test results, providing a trove of health data beyond diagnoses. Such lab data is complex and lacks a ubiquitous coding scheme, making it more challenging than diagnosis data. Here we describe the first large-scale cross-health system genome-wide association study (GWAS) of EHR-based quantitative lab measurements. We meta-analyzed 70 labs matched between the BioVU cohort from the Vanderbilt University Health System and the Michigan Genomics Initiative (MGI) cohort from Michigan Medicine. We show high replication of known association for these labs, validating EHR-based measurements as high-quality phenotypes for genetic analysis. Notably, our analysis provides the first replication for 700 previous GWAS associations across 46 different labs. We discovered 31 novel associations at genome-wide significance for 22 distinct labs, including the first reported associations for two labs. We replicated 22 of these novel associations in an independent tranche of BioVU samples. The summary statistics for all association tests are available through an interactive webtool to benefit other researchers. Finally, we performed mirrored analyses in BioVU and MGI to assess competing analytic practices for lab data. We find that using the mean of all available lab measurements provides a robust summary value, but alternate summarizations can improve power in certain labs. This study provides a proof-of-principle for cross health system GWAS and is a framework for future studies of quantitative traits in EHRs.


2018 ◽  
Vol 21 (6) ◽  
pp. 538-545 ◽  
Author(s):  
W. D. Hill

Lam et al. (2018) respond to a commentary of their paper entitled ‘Large-Scale Cognitive GWAS Meta-Analysis Reveals Tissue-Specific Neural Expression and Potential Nootropic Drug Targets’ Lam et al. (2017). While Lam et al. (2018) have now provided the recommended quality control metrics for their paper, problems remain. Specifically, Lam et al. (2018) do not dispute that the results of their multi-trait analysis of genome-wide association study (MTAG) analysis has produced a phenotype with a genetic correlation of one with three measures of education, but do claim the associations found are specific to the trait of cognitive ability. In this brief paper, it is empirically demonstrated that the phenotype derived by Lam et al. (2017) is more genetically similar to education than cognitive ability. In addition, it is shown that of the genome-wide significant loci identified by Lam et al. (2017) are loci that are associated with education rather than with cognitive ability.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Daniel McGuire ◽  
◽  
Yu Jiang ◽  
Mengzhen Liu ◽  
J. Dylan Weissenkampen ◽  
...  

AbstractGenome-wide association meta-analysis (GWAMA) is an effective approach to enlarge sample sizes and empower the discovery of novel associations between genotype and phenotype. Independent replication has been used as a gold-standard for validating genetic associations. However, as current GWAMA often seeks to aggregate all available datasets, it becomes impossible to find a large enough independent dataset to replicate new discoveries. Here we introduce a method, MAMBA (Meta-Analysis Model-based Assessment of replicability), for assessing the “posterior-probability-of-replicability” for identified associations by leveraging the strength and consistency of association signals between contributing studies. We demonstrate using simulations that MAMBA is more powerful and robust than existing methods, and produces more accurate genetic effects estimates. We apply MAMBA to a large-scale meta-analysis of addiction phenotypes with 1.2 million individuals. In addition to accurately identifying replicable common variant associations, MAMBA also pinpoints novel replicable rare variant associations from imputation-based GWAMA and hence greatly expands the set of analyzable variants.


2021 ◽  
Author(s):  
Kazuyoshi Ishigaki ◽  
Saori Sakaue ◽  
Chikashi Terao ◽  
Yang Luo ◽  
Kyuto Sonehara ◽  
...  

AbstractTrans-ancestry genetic research promises to improve power to detect genetic signals, fine-mapping resolution, and performances of polygenic risk score (PRS). We here present a large-scale genome-wide association study (GWAS) of rheumatoid arthritis (RA) which includes 276,020 samples of five ancestral groups. We conducted a trans-ancestry meta-analysis and identified 124 loci (P < 5 × 10-8), of which 34 were novel. Candidate genes at the novel loci suggested essential roles of the immune system (e.g., TNIP2 and TNFRSF11A) and joint tissues (e.g., WISP1) in RA etiology. Trans-ancestry fine mapping identified putatively causal variants with biological insights (e.g., LEF1). Moreover, PRS based on trans-ancestry GWAS outperformed PRS based on single-ancestry GWAS and had comparable performance between European and East Asian populations. Our study provides multiple insights into the etiology of RA and improves genetic predictability of RA.


2020 ◽  
Author(s):  
Dirk Smit

The ENIGMA-EEG working group was established to enable large scale international collaborations among cohorts who investigate the genetics of brain function measured with electroencephalography (EEG). The collaboration resulted in the currently largest genome-wide association study of oscillatory brain activity in EEG recordings by meta-analyzing the results across five participating cohorts’ results. Our endeavor has resulted in the first genome-wide significant hits for oscillatory brain function, and significant genes that were previously associated with psychiatric disorders. Our results have provided insight into the influence that psychitaric liability genes have on the functioning brain. In this overview, we also highlight how we have tackled methodological issues surrounding genetic meta-analysis of EEG features, and identify possible sources of heterogeneity across cohorts, which could affect the results of our meta-analysis. We discuss the importance of harmonizing EEG signal processing, cleaning, and feature extraction. Finally, we explain our selection of EEG features to be investigated in our future studies, e.g. temporal dynamics of oscillations and the connectivity network based on synchronization of oscillations. We argue that these represent some of the most important characteristics of the functioning brain. We conclude that disentangling the genetics of EEG will elucidate effects that genes have on brain function, as well as pathways from genes to neurological and psychiatric disorders.


2020 ◽  
Vol 7 (12) ◽  
pp. 1032-1045 ◽  
Author(s):  
Emma C Johnson ◽  
Ditte Demontis ◽  
Thorgeir E Thorgeirsson ◽  
Raymond K Walters ◽  
Renato Polimanti ◽  
...  

2017 ◽  
Author(s):  
William David Hill

Lam et al. (2017) reported a large-scale genome-wide association study (GWAS) of cognitive ability. They used the new analytical method of Multi-Trait Analysis of GWAS (MTAG) (Turley et al., 2017) to combine GWAS data sets on the correlated phenotypes of cognitive ability and education, deriving 70 loci that they described as “trait specific” to cognitive ability. The purpose of this short commentary is to examine whether the use of MTAG, in this case (Lam et al., 2017), has resulted in a phenotype more similar to education than cognitive ability.


Author(s):  
Kristin Tsuo ◽  
Wei Zhou ◽  
Ying Wang ◽  
Masahiro Kanai ◽  
Shinichi Namba ◽  
...  

Asthma is a complex disease that affects millions of people and varies in prevalence by an order of magnitude across geographic regions and populations. However, the extent to which genetic variation contributes to these disparities is unclear, as studies probing the genetics of asthma have been primarily limited to populations of European (EUR) descent. As part of the Global Biobank Meta-analysis Initiative (GBMI), we conducted the largest genome-wide association study of asthma to date (153,763 cases and 1,647,022 controls) via meta-analysis across 18 biobanks spanning multiple countries and ancestries. Altogether, we discovered 180 genome-wide significant loci (p < 5x10-8) associated with asthma, 49 of which are not previously reported. We replicate well-known associations such as IL1RL1 and STAT6, and find that overall the novel associations have smaller effects than previously-discovered loci, highlighting our substantial increase in statistical power. Despite the considerable range in prevalence among biobanks, from 3% to 24%, the genetic effects of associated loci are largely consistent across the biobanks and ancestries. To further investigate the polygenic architecture of asthma, we construct polygenic risk scores (PRS) using a multi-ancestry approach, which yields higher predictive power for asthma in non-EUR populations compared to PRS derived from previous asthma meta-analyses and using other methods. Additionally, we find considerable genetic overlap between asthma and chronic obstructive pulmonary disease (COPD) across ancestries but minimal overlap in enriched biological pathways. Our work underscores the multifactorial nature of asthma development and offers insight into the shared genetic architecture of asthma that may be differentially perturbed by environmental factors and contribute to variation in prevalence.


2021 ◽  
Author(s):  
Alexandre Pereira ◽  
Taniela M Bes ◽  
Mariliza Velho ◽  
Emanuelle Marques ◽  
Cinthia Jannes ◽  
...  

The Covid-19 pandemic has changed the paradigms for disease surveillance and rapid deployment of scientific-based evidence for understanding disease biology, susceptibility, and treatment. We have organized a large-scale genome-wide association study in Sars-Cov-2 infected individuals in Sao Paulo, Brazil, one of the most affected areas of the pandemic in the country, itself one of the most affected in the world. Here we present the results of the initial analysis in the first 5,233 participants of the BRACOVID study. We have conducted a GWAS for Covid-19 hospitalization enrolling 3533 cases (hospitalized Covid-19 participants) and 1700 controls (non-hospitalized Covid-19 participants). Models were adjusted by age, sex and the 4 first principal components. A meta-analysis was also conducted merging BRACOVID hospitalization data with the Human Genetic Initiative (HGI) Consortia results. BRACOVID results validated most loci previously identified in the HGI meta-analysis. In addition, no significant heterogeneity according to ancestral group within the Brazilian population was observed for the two most important Covid-19 severity associated loci: 3p21.31 and Chr21 near IFNAR2. Using only data provided by BRACOVID a new genome-wide significant locus was identified on Chr1 near the genes DSTYK and RBBP5. The associated haplotype has also been previously associated with a number of blood cell related traits and might play a role in modulating the immune response in Covid-19 cases.


Sign in / Sign up

Export Citation Format

Share Document