scholarly journals Genome-wide survey of parent-of-origin specific associations across clinical traits derived from electronic health records

Author(s):  
Hye In Kim ◽  
Bin Ye ◽  
Jeffrey Staples ◽  
Anthony Marcketta ◽  
Chuan Gao ◽  
...  
2020 ◽  
Author(s):  
Hye In Kim ◽  
Bin Ye ◽  
Jeffrey Staples ◽  
Anthony Marcketta ◽  
Chuan Gao ◽  
...  

AbstractParent-of-origin (PoO) effects refer to the differential phenotypic impact of genetic variants dependent on their parental inheritance. Genetic variants in imprinted genes can have PoO specific effects on complex traits, but these effects may be poorly captured by models that do not differentiate the parental origin of the variant. The aim of this study was to screen genome-wide imputed sequence for PoO effects on electronic health records (EHR) derived clinical traits in 134,049 individuals of European ancestry from the DiscovEHR study.Using pairwise kinship estimates from genetic data and demographic data, we identified 22,051 offspring with at least one parent present in the DiscovEHR study. We then assigned the PoO of ∼9 million variants in the heterozygous offspring using two methods. First, when one of the parental genotypes was homozygous, we determined PoO based on apparent Mendelian segregation. Second, we estimated PoO by comparing parental and offspring haplotypes around the variant allele. Using these PoO assignments, we performed genome-wide PoO association analyses across 154 quantitative traits including lab test results and biometric measures and 612 binary traits of ICD10 3-digit codes extracted from EHR in the DiscovEHR study. Out of 732 PoO associations meeting a significance threshold of P <5×10−8, we attempted to replicate 274 PoO associations in the UK Biobank study, consisting of 462,453 individuals and including 5,015 offspring with at least one parent, and replicated 9 PoO associations with nominal significance threshold P <0.05.In summary, the current study characterizes PoO effects of genetic variants genome-wide on a broad range of clinical traits derived from EHR in a large population study enriched for familial relationships. Our results suggest that 1) PoO specific effects are frequently captured by a standard additive model and that 2) statistical power to detect PoO specific effects remains modest even in large studies. Nonetheless, accurately modeling PoO effects of genetic variants has the potential to improve our understanding of the mechanism of the association and finding new associations that are not captured by the additive model.


2016 ◽  
Vol 49 (1) ◽  
pp. 54-64 ◽  
Author(s):  
Thomas J Hoffmann ◽  
Georg B Ehret ◽  
Priyanka Nandakumar ◽  
Dilrini Ranatunga ◽  
Catherine Schaefer ◽  
...  

2018 ◽  
Vol 83 (12) ◽  
pp. 1005-1011 ◽  
Author(s):  
Thomas H. McCoy ◽  
Victor M. Castro ◽  
Kamber L. Hart ◽  
Amelia M. Pellegrini ◽  
Sheng Yu ◽  
...  

2017 ◽  
Vol 11 (2) ◽  
pp. 112-122 ◽  
Author(s):  
Jamie R. Robinson ◽  
Joshua C. Denny ◽  
Dan M. Roden ◽  
Sara L. Van Driest

2021 ◽  
Author(s):  
Cameron J. Fairfield ◽  
Thomas M. Drake ◽  
Riinu Pius ◽  
Andrew D. Bretherick ◽  
Archie Campbell ◽  
...  

PLoS Genetics ◽  
2016 ◽  
Vol 12 (10) ◽  
pp. e1006371 ◽  
Author(s):  
Thomas J. Hoffmann ◽  
Bronya J. Keats ◽  
Noriko Yoshikawa ◽  
Catherine Schaefer ◽  
Neil Risch ◽  
...  

PLoS ONE ◽  
2017 ◽  
Vol 12 (2) ◽  
pp. e0171745 ◽  
Author(s):  
Logan Dumitrescu ◽  
Marylyn D. Ritchie ◽  
Joshua C. Denny ◽  
Nihal M. El Rouby ◽  
Caitrin W. McDonough ◽  
...  

2018 ◽  
Author(s):  
Karen A. Schlauch ◽  
Robert W. Read ◽  
Gai Elhanan ◽  
William J Metcalf ◽  
Anthony D. Slonim ◽  
...  

AbstractIn this study, we perform a full genome-wide association study (GWAS) to identify statistically significantly associated single nucleotide polymorphisms (SNPs) with three red blood cell (RBC) components and follow it with two independent PheWASs to examine associations between phenotypic data (case-control status of diagnoses or disease), significant SNPs, and RBC component levels. We first identified associations between the three RBC components: mean platelet volume (MPV), mean corpuscular volume (MCV), and platelet counts (PC), and the genotypes of approximately 500,000 SNPs on the Illumina Infimum® DNA Human OmniExpress-24 BeadChip using a single cohort of 4,700 Northern Nevadans. Twenty-one SNPs in five major genomic regions were found to be statistically significantly associated with MPV, two regions with MCV, and one region with PC, with p<5x10-8. Twenty-nine SNPs and nine chromosomal regions were identified in 30 previous GWASs, with effect sizes of similar magnitude and direction as found in our cohort. The two strongest associations were SNP rs1354034 with MPV (p=2.4x10-13) and rs855791 with MCV (p=5.2x10-12). We then examined possible associations between these significant SNPs and incidence of 1,488 phenotype groups mapped from International Classification of Disease version 9 and 10 (ICD9 and ICD10) codes collected in the extensive electronic health record (EHR) database associated with Healthy Nevada Project consented participants. Further leveraging data collected in the EHR, we performed an additional PheWAS to identify associations between continuous red blood cell (RBC) component measures and incidence of specific diagnoses. The first PheWAS illuminated whether SNPs associated with RBC components in our cohort were linked with other hematologic phenotypic diagnoses or diagnoses of other nature. Although no SNPs from our GWAS were identified as strongly associated to other phenotypic components, a number of associations were identified with p-values ranging between 1x10-3 and 1x10-4 with traits such as respiratory failure, sleep disorders, hypoglycemia, hyperglyceridemia, GERD and IBS. The second PheWAS examined possible phenotypic predictors of abnormal RBC component measures: a number of hematologic phenotypes such as thrombocytopenia, anemias, hemoglobinopathies and pancytopenia were found to be strongly associated to RBC component measures; additional phenotypes such as (morbid) obesity, malaise and fatigue, alcoholism, and cirrhosis were also identified to be possible predictors of RBC component measures.Author SummaryThe combination of electronic health records and genomic data have the capability to revolutionize personalized medicine. Each separately contains invaluable data; however, combined, the two are able to identify new discoveries that may have long-term health benefits. The Healthy Nevada Project is a non-profit initiative between Renown Medical Center and the Desert Research Institute in Reno, NV. The project has so far collected a cohort of 6,500 Northern Nevadans, with extensive medical electronic health records in the Renown Health database. Combining the genotypes of these participants with the clinical data, this study’s aim is to find associations between genotypes (genes) and phenotypes (diagnoses and lab records). Here, we identify and examine clinical associations with red blood cell components such as platelet counts and mean platelet volume. These are components that have clinical relevance for several diseases, such as anemia, atherothrombosis and cancer. Our results from genome wide association studies mirror previous studies, and identify new associations. The extensive electronic health records enabled us to perform phenome wide associations to discover strong associations with hematologic components, as well as other important traits and diagnoses.


Sign in / Sign up

Export Citation Format

Share Document