scholarly journals Disease association with frequented regions of genotype graphs

Author(s):  
Samuel Hokin ◽  
Alan Cleary ◽  
Joann Mudge

Complex diseases, with many associated genetic and environmental factors, are a challenging target for genomic risk assessment. Genome-wide association studies (GWAS) associate disease status with, and compute risk from, individual common variants, which can be problematic for diseases with many interacting or rare variants. In addition, GWAS typically employ a reference genome which is not built from the subjects of the study, whose genetic background may differ from the reference and whose genetic characterization may be limited. We present a complementary method based on disease association with collections of genotypes, called frequented regions, on a pangenomic graph built from subjects' genomes. We introduce the pangenomic genotype graph, which is better suited than sequence graphs to human disease studies. Our method draws out collections of features, across multiple genomic segments, which are associated with disease status. We show that the frequented regions method consistently improves machine-learning classification of disease status over GWAS classification, allowing incorporation of rare or interacting variants. Notably, genomic segments that have few or no variants of genome-wide significance (p<5x10-8) provide much-improved classification with frequented regions, encouraging their application across the entire genome. Frequented regions may also be utilized for purposes such as choice of treatment in addition to prediction of disease risk.

2021 ◽  
Author(s):  
Aleksejs Sazonovs ◽  
Christine R Stevens ◽  
Guhan R Venkataraman ◽  
Kai Yuan ◽  
Brandon Avila ◽  
...  

Genome-wide association studies (GWAS) have identified hundreds of loci associated with Crohns disease (CD); however, as with all complex diseases, deriving pathogenic mechanisms from these non-coding GWAS discoveries has been challenging. To complement GWAS and better define actionable biological targets, we analysed sequence data from more than 30,000 CD cases and 80,000 population controls. We observe rare coding variants in established CD susceptibility genes as well as ten genes where coding variation directly implicates the gene in disease risk for the first time.


2019 ◽  
Author(s):  
Wilson Lek Wen Tan ◽  
Chukwuemeka George Anene-Nzelu ◽  
Eleanor Wong ◽  
Hui San Tan ◽  
Pan Bangfen ◽  
...  

AbstractIdentifying genetic markers for heterogeneous complex diseases such as heart failure has been challenging, and may require prohibitively large cohort sizes in genome-wide association studies (GWAS) in order to demonstrate statistical significance1. On the other hand, chromatin quantitative trait loci (QTL), elucidated by direct epigenetic profiling of specific human tissues, may contribute towards prioritising variants for disease-association. Here, we captured non-coding genetic variants by performing enhancer H3K27ac ChIP-seq in 70 human control and end-stage failing hearts, mapping out a comprehensive catalogue of 47,321 putative human heart enhancers. 3,897 differential acetylation peaks (FDR < 0.05) pointed to recognizable pathways altered in heart failure (HF). To identify cardiac histone acetylation QTLs (haQTLs), we regressed out confounding factors including HF disease status, and employed the G-SCI test2 to call out 1,680 haQTLs (FDR < 0.1). A subset of these showed significant association to gene expression, either in cis (180), or through long range interactions (81), identified by Hi-C and Hi-ChIP performed on a subset of hearts. Furthermore, a concordant relationship was found between the gain or disruption of specific transcription factor (TF) binding motifs, inferred from alternative alleles at the haQTLs, associated with altered H3K27ac peak heights. Finally, colocalisation of our haQTLs with heart-related GWAS datasets allowed us to identify 62 unique loci. Disease-association for these new loci may indeed be mediated through modification of H3K27-acetylation enrichment and their corresponding gene expression differences.


2020 ◽  
Vol 66 (1) ◽  
pp. 11-23
Author(s):  
Yukihide Momozawa ◽  
Keijiro Mizukami

AbstractGenome-wide association studies have identified >10,000 genetic variants associated with various phenotypes and diseases. Although the majority are common variants, rare variants with >0.1% of minor allele frequency have been investigated by imputation and using disease-specific custom SNP arrays. Rare variants sequencing analysis mainly revealed have played unique roles in the genetics of complex diseases in humans due to their distinctive features, in contrast to common variants. Unique roles are hypothesis-free evidence for gene causality, a precise target of functional analysis for understanding disease mechanisms, a new favorable target for drug development, and a genetic marker with high disease risk for personalized medicine. As whole-genome sequencing continues to identify more rare variants, the roles associated with rare variants will also increase. However, a better estimation of the functional impact of rare variants across whole genome is needed to enhance their contribution to improvements in human health.


Nature ◽  
2021 ◽  
Vol 590 (7845) ◽  
pp. 290-299 ◽  
Author(s):  
Daniel Taliun ◽  
◽  
Daniel N. Harris ◽  
Michael D. Kessler ◽  
Jedidiah Carlson ◽  
...  

AbstractThe Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.


2020 ◽  
Author(s):  
Celine Charon ◽  
Rodrigue Allodji ◽  
Vincent Meyer ◽  
Jean-François Deleuze

Abstract Quality control methods for genome-wide association studies and fine mapping are commonly used for imputation, however, they result in loss of many single nucleotide polymorphisms (SNPs). To investigate the consequences of filtration on imputation, we studied the direct effects on the number of markers, their allele frequencies, imputation quality scores and post-filtration events. We pre-phrased 1,031 genotyped individuals from diverse ethnicities and compared the imputed variants to 1,089 NCBI recorded individuals for additional validation.Without variant pre-filtration based on quality control (QC), we observed no impairment in the imputation of SNPs that failed QC whereas with pre-filtration there was an overall loss of information. Significant differences between frequencies with and without pre-filtration were found only in the range of very rare (5E-04-1E-03) and rare variants (1E-03-5E-03) (p < 1E-04). Increasing the post-filtration imputation quality score from 0.3 to 0.8 reduced the number of single nucleotide variants (SNVs) <0.001 2.5 fold with or without QC pre-filtration and halved the number of very rare variants (5E-04). As a result, to maintain confidence and enough SNVs, we propose here a 2-step post-filtration approach to increase the number of very rare and rare variants compared to conservative post-filtration methods.


2019 ◽  
Vol 3 (Supplement_1) ◽  
pp. S221-S221
Author(s):  
Luke C Pilling ◽  
Luigi Ferrucci ◽  
David Melzer

Abstract Thousands of loci across the genome have been identified for specific diseases in genome-wide association studies (GWAS), yet very few are associated with lifespan itself. We hypothesized that specific biological pathways transcend individual diseases and affect health and lifespan more broadly. Using the published results for the most recent GWAS for 10 key age-related diseases (including coronary artery disease, type-2 diabetes, and several cancers) we identified 22 loci with a strong genetic association with at least three of the diseases. These multi-trait aging loci include known genes affecting multiple diverse health end points, such as CDKN2A/B (9p21.3) and APOE. There are also novel multi-trait genes including SH2B3 and CASC8, likely involved in hallmark pathways of aging biology, including telomere shortening and inflammation. Several of these loci involve trade-offs between chronic disease risk and cancer.


TH Open ◽  
2020 ◽  
Vol 04 (04) ◽  
pp. e322-e331
Author(s):  
Eric Manderstedt ◽  
Christina Lind-Halldén ◽  
Stefan Lethagen ◽  
Christer Halldén

AbstractGenome-wide association studies (GWASs) have identified genes that affect plasma von Willebrand factor (VWF) levels. ABO showed a strong effect, whereas smaller effects were seen for VWF, STXBP5, STAB2, SCARA5, STX2, TC2N, and CLEC4M. This study screened comprehensively for both common and rare variants in these eight genes by resequencing their coding sequences in 104 Swedish von Willebrand disease (VWD) patients. The common variants previously associated with the VWF level were all accumulated in the VWD patients compared to three control populations. The strongest effect was detected for blood group O coded for by the ABO gene (71 vs. 38% of genotypes). The other seven VWF level associated alleles were enriched in the VWD population compared to control populations, but the differences were small and not significant. The sequencing detected a total of 146 variants in the eight genes. Excluding 70 variants in VWF, 76 variants remained. Of the 76 variants, 54 had allele frequencies > 0.5% and have therefore been investigated for their association with the VWF level in previous GWAS. The remaining 22 variants with frequencies < 0.5% are less likely to have been evaluated previously. PolyPhen2 classified 3 out of the 22 variants as probably or possibly damaging (two in STAB2 and one in STX2); the others were either synonymous or benign. No accumulation of low frequency (0.05–0.5%) or rare variants (<0.05%) in the VWD population compared to the gnomAD (Genome Aggregation Database) population was detected. Thus, rare variants in these genes do not contribute to the low VWF levels observed in VWD patients.


2011 ◽  
Vol 26 (S2) ◽  
pp. 1346-1346
Author(s):  
D. Benmessaoud ◽  
A.-M. Lepagnol-Bestel ◽  
M. Delepine ◽  
J. Hager ◽  
J.-M. Moalic ◽  
...  

Genome wide association studies (GWAS) of Schizophrenia (SZ) patients have identified common variants in ten genes including SMARCA2 (Koga et al., HMG, 2009). We found that the SZ-GWAS genes are part of an interacting network centered on SMARCA2 (Loe-Mie et al., HMG, 2010). Furthermore, SMARCA2 was found disrupted in SZ (Walsh et al., Science, 2008). SMARCA2 encodes the ATPase (BRM) of the SWI/SNF chromatin remodeling complex that is at the interface of genome and environmental adaptation.Taking advantage of an Algerian trio cohort of one hundred SZ patients (Benmessaoud et al., BMC Psychiatry, 2008), we replicated the association of SNP rs2296212 localized in exon 33, already shown associated in Koga study and resulting in D1546E amino acid change in the SMARCA2 protein. We studied SMARCA2 codons and found that exon 33 displays a signature of positive evolution in the primate lineage.Our working hypothesis is that the coding regions displaying positive selection are target of novel rare variants. To address this question, we sequenced two exons displaying positive evolution and one exon without evidence of positive evolution.We found (i) that rare variants are significantly in excess in SZ-patients compared to their parents (p = 0.038, Fisher test) and (ii) a higher proportion of rare variants in the primate-accelerated exons compared with the non-evolutionary exon in SZ-patients (p = 0.032, Fisher test).SMARCA2 exon sequencing and whole exome sequencing from patients harboring SNP rs2296212 common variant are under progress. Altogether, these results are expected to give new insights into the genetic architecture of SZ.


2020 ◽  
Vol 21 (12) ◽  
pp. 4269 ◽  
Author(s):  
Victoria L. Halperin Kuhns ◽  
Owen M. Woodward

Hyperuricemia, or elevated serum urate, causes urate kidney stones and gout and also increases the incidence of many other conditions including renal disease, cardiovascular disease, and metabolic syndrome. As we gain mechanistic insight into how urate contributes to human disease, a clear sex difference has emerged in the physiological regulation of urate homeostasis. This review summarizes our current understanding of urate as a disease risk factor and how being of the female sex appears protective. Further, we review the mechanisms of renal handling of urate and the significant contributions from powerful genome-wide association studies of serum urate. We also explore the role of sex in the regulation of specific renal urate transporters and the power of new animal models of hyperuricemia to inform on the role of sex and hyperuricemia in disease pathogenesis. Finally, we advocate the use of sex differences in urate handling as a potent tool in gaining a further understanding of physiological regulation of urate homeostasis and for presenting new avenues for treating the constellation of urate related pathologies.


Sign in / Sign up

Export Citation Format

Share Document