scholarly journals Genetic regulatory signatures underlying islet gene expression and type 2 diabetes

2017 ◽  
Vol 114 (9) ◽  
pp. 2301-2306 ◽  
Author(s):  
Arushi Varshney ◽  
Laura J. Scott ◽  
Ryan P. Welch ◽  
Michael R. Erdos ◽  
Peter S. Chines ◽  
...  

Genome-wide association studies (GWAS) have identified >100 independent SNPs that modulate the risk of type 2 diabetes (T2D) and related traits. However, the pathogenic mechanisms of most of these SNPs remain elusive. Here, we examined genomic, epigenomic, and transcriptomic profiles in human pancreatic islets to understand the links between genetic variation, chromatin landscape, and gene expression in the context of T2D. We first integrated genome and transcriptome variation across 112 islet samples to produce dense cis-expression quantitative trait loci (cis-eQTL) maps. Additional integration with chromatin-state maps for islets and other diverse tissue types revealed that cis-eQTLs for islet-specific genes are specifically and significantly enriched in islet stretch enhancers. High-resolution chromatin accessibility profiling using assay for transposase-accessible chromatin sequencing (ATAC-seq) in two islet samples enabled us to identify specific transcription factor (TF) footprints embedded in active regulatory elements, which are highly enriched for islet cis-eQTL. Aggregate allelic bias signatures in TF footprints enabled us de novo to reconstruct TF binding affinities genetically, which support the high-quality nature of the TF footprint predictions. Interestingly, we found that T2D GWAS loci were strikingly and specifically enriched in islet Regulatory Factor X (RFX) footprints. Remarkably, within and across independent loci, T2D risk alleles that overlap with RFX footprints uniformly disrupt the RFX motifs at high-information content positions. Together, these results suggest that common regulatory variations have shaped islet TF footprints and the transcriptome and that a confluent RFX regulatory grammar plays a significant role in the genetic component of T2D predisposition.

2021 ◽  
Author(s):  
John T Walker ◽  
Diane C Saunders ◽  
Vivek Rai ◽  
Chunhua Dai ◽  
Peter Orchard ◽  
...  

A hallmark of type 2 diabetes (T2D), a major cause of world-wide morbidity and mortality, is dysfunction of insulin-producing pancreatic islet β cells. T2D genome-wide association studies (GWAS) have identified hundreds of signals, mostly in the non-coding genome and overlapping β cell regulatory elements, but translating these into biological mechanisms has been challenging. To identify early disease-driving events, we performed single cell spatial proteomics, sorted cell transcriptomics, and assessed islet physiology on pancreatic tissue from short-duration T2D and control donors. Here, through integrative analyses of these diverse modalities, we show that multiple gene regulatory modules are associated with early-stage T2D β cell-intrinsic defects. One notable example is the transcription factor RFX6, which we show is a highly connected β cell hub gene that is reduced in T2D and governs a gene regulatory network associated with insulin secretion defects and T2D GWAS variants. We validated the critical role of RFX6 in β cells through direct perturbation in primary human islets followed by physiological and single nucleus multiome profiling, which showed reduced dynamic insulin secretion and large-scale changes in the β cell transcriptome and chromatin accessibility landscape. Understanding the molecular mechanisms of complex, systemic diseases necessitates integration of signals from multiple molecules, cells, organs, and individuals and thus we anticipate this approach will be a useful template to identify and validate key regulatory networks and master hub genes for other diseases or traits with GWAS data.


eLife ◽  
2018 ◽  
Vol 7 ◽  
Author(s):  
Matthias Thurner ◽  
Martijn van de Bunt ◽  
Jason M Torres ◽  
Anubha Mahajan ◽  
Vibe Nylander ◽  
...  

Human genetic studies have emphasised the dominant contribution of pancreatic islet dysfunction to development of Type 2 Diabetes (T2D). However, limited annotation of the islet epigenome has constrained efforts to define the molecular mechanisms mediating the, largely regulatory, signals revealed by Genome-Wide Association Studies (GWAS). We characterised patterns of chromatin accessibility (ATAC-seq, n = 17) and DNA methylation (whole-genome bisulphite sequencing, n = 10) in human islets, generating high-resolution chromatin state maps through integration with established ChIP-seq marks. We found enrichment of GWAS signals for T2D and fasting glucose was concentrated in subsets of islet enhancers characterised by open chromatin and hypomethylation, with the former annotation predominant. At several loci (including CDC123, ADCY5, KLHDC5) the combination of fine-mapping genetic data and chromatin state enrichment maps, supplemented by allelic imbalance in chromatin accessibility pinpointed likely causal variants. The combination of increasingly-precise genetic and islet epigenomic information accelerates definition of causal mechanisms implicated in T2D pathogenesis.


2017 ◽  
Author(s):  
Matthias Thurner ◽  
Martijn van de Bunt ◽  
Jason M Torres ◽  
Anubha Mahajan ◽  
Vibe Nylander ◽  
...  

AbstractHuman genetic studies have emphasised the dominant contribution of pancreatic islet dysfunction to development of Type 2 Diabetes (T2D). However, limited annotation of the islet epigenome has constrained efforts to define the molecular mechanisms mediating the, largely regulatory, signals revealed by Genome-Wide Association Studies (GWAS). We characterised patterns of chromatin accessibility (ATAC-seq, n=17) and DNA methylation (whole-genome bisulphite sequencing, n=10) in human islets, generating high-resolution chromatin state maps through integration with established ChIP-seq marks. We found enrichment of GWAS signals for T2D and fasting glucose was concentrated in subsets of islet enhancers characterised by open chromatin and hypomethylation, with the former annotation predominant. At several loci (including CDC123, ADCY5, KLHDC5) the combination of fine-mapping genetic data and chromatin state enrichment maps, supplemented by allelic imbalance in chromatin accessibility pinpointed likely causal variants. The combination of increasingly-precise genetic and islet epigenomic information accelerates definition of causal mechanisms implicated in T2D pathogenesis.


2018 ◽  
Author(s):  
Angli Xue ◽  
Yang Wu ◽  
Zhihong Zhu ◽  
Futao Zhang ◽  
Kathryn E Kemper ◽  
...  

AbstractWe conducted a meta-analysis of genome-wide association studies (GWAS) with ∼16 million genotyped/imputed genetic variants in 62,892 type 2 diabetes (T2D) cases and 596,424 controls of European ancestry. We identified 139 common and 4 rare (minor allele frequency < 0.01) variants associated with T2D, 42 of which (39 common and 3 rare variants) were independent of the known variants. Integration of the gene expression data from blood (n = 14,115 and 2,765) and other T2D-relevant tissues (n = up to 385) with the GWAS results identified 33 putative functional genes for T2D, three of which were targeted by approved drugs. A further integration of DNA methylation (n = 1,980) and epigenomic annotations data highlighted three putative T2D genes (CAMK1D, TP53INP1 and ATP5G1) with plausible regulatory mechanisms whereby a genetic variant exerts an effect on T2D through epigenetic regulation of gene expression. We further found evidence that the T2D-associated loci have been under purifying selection.


2020 ◽  
Vol 48 (W1) ◽  
pp. W193-W199 ◽  
Author(s):  
Nina Baumgarten ◽  
Dennis Hecker ◽  
Sivarajan Karunanithi ◽  
Florian Schmidt ◽  
Markus List ◽  
...  

Abstract A current challenge in genomics is to interpret non-coding regions and their role in transcriptional regulation of possibly distant target genes. Genome-wide association studies show that a large part of genomic variants are found in those non-coding regions, but their mechanisms of gene regulation are often unknown. An additional challenge is to reliably identify the target genes of the regulatory regions, which is an essential step in understanding their impact on gene expression. Here we present the EpiRegio web server, a resource of regulatory elements (REMs). REMs are genomic regions that exhibit variations in their chromatin accessibility profile associated with changes in expression of their target genes. EpiRegio incorporates both epigenomic and gene expression data for various human primary cell types and tissues, providing an integrated view of REMs in the genome. Our web server allows the analysis of genes and their associated REMs, including the REM’s activity and its estimated cell type-specific contribution to its target gene’s expression. Further, it is possible to explore genomic regions for their regulatory potential, investigate overlapping REMs and by that the dissection of regions of large epigenomic complexity. EpiRegio allows programmatic access through a REST API and is freely available at https://epiregio.de/.


Nutrients ◽  
2021 ◽  
Vol 13 (12) ◽  
pp. 4208
Author(s):  
Nooshin Ghodsian ◽  
Eloi Gagnon ◽  
Jérôme Bourgault ◽  
Émilie Gobeil ◽  
Hasanga D. Manikpurage ◽  
...  

Hepatokines are liver-derived proteins that may influence metabolic pathways such as insulin sensitivity. Recently, Sparc-related modular calcium-binding protein 1 (SMOC1) was identified as glucose-responsive hepatokine that is dysregulated in the setting of non-alcoholic fatty liver disease (NAFLD). While SMOC1 may influence glucose-insulin homeostasis in rodents, it is unknown if SMOC1 is influenced by NAFLD in humans. It is also unknown if SMOC1 is causally associated with metabolic and disease traits in humans. Therefore, we aimed to determine the effect of NAFLD on SMOC1 gene expression in the liver and aimed to explore the potential causal associations of SMOC1 levels with NAFLD, T2D, and glycemic traits in humans. Using an RNA sequencing dataset from a cohort of 216 patients with NAFLD, we assessed SMOC1 expression levels across the NAFLD spectrum. We performed a series of bidirectional inverse-variance weighted Mendelian randomization (MR) analyses on blood SMOC1 levels using two sources of genome-wide association studies (GWAS) (Fenland study, n = 10,708 and INTERVAL study, n = 3301). We utilized GWAS summary statistics for NAFLD in 8434 cases and 770,180 controls, as well as publicly available GWAS for type 2 diabetes (T2D), body mass index (BMI), waist-to-hip ratio (WHR), fasting blood insulin (FBI), fasting blood glucose (FBG), homeostatic Model Assessment of Insulin Resistance (HOMA-B and HOMA-IR), and hemoglobin A1c (HbA1C). We found that SMOC1 expression showed no significant differences across NAFLD stages. We also identified that the top single-nucleotide polymorphism associated with blood SMOC1 levels, was associated with SMOC1 gene expression in the liver, but not in other tissues. Using MR, we did not find any evidence that genetically predicted NAFLD, T2D, and glycemic traits influenced SMOC1 levels. We also did not find evidence that blood SMOC1 levels were causally associated with T2D, NAFLD, and glycemic traits. In conclusion, the hepatokine SMOC1 does not appear to be modulated by the presence of NAFLD and may not regulate glucose-insulin homeostasis in humans. Results of this study suggest that blood factors regulating metabolism in rodents may not always translate to human biology.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Gloriia Novikova ◽  
Manav Kapoor ◽  
Julia TCW ◽  
Edsel M. Abud ◽  
Anastasia G. Efthymiou ◽  
...  

AbstractGenome-wide association studies (GWAS) have identified more than 40 loci associated with Alzheimer’s disease (AD), but the causal variants, regulatory elements, genes and pathways remain largely unknown, impeding a mechanistic understanding of AD pathogenesis. Previously, we showed that AD risk alleles are enriched in myeloid-specific epigenomic annotations. Here, we show that they are specifically enriched in active enhancers of monocytes, macrophages and microglia. We integrated AD GWAS with myeloid epigenomic and transcriptomic datasets using analytical approaches to link myeloid enhancer activity to target gene expression regulation and AD risk modification. We identify AD risk enhancers and nominate candidate causal genes among their likely targets (including AP4E1, AP4M1, APBB3, BIN1, MS4A4A, MS4A6A, PILRA, RABEP1, SPI1, TP53INP1, and ZYX) in twenty loci. Fine-mapping of these enhancers nominates candidate functional variants that likely modify AD risk by regulating gene expression in myeloid cells. In the MS4A locus we identified a single candidate functional variant and validated it in human induced pluripotent stem cell (hiPSC)-derived microglia and brain. Taken together, this study integrates AD GWAS with multiple myeloid genomic datasets to investigate the mechanisms of AD risk alleles and nominates candidate functional variants, regulatory elements and genes that likely modulate disease susceptibility.


2021 ◽  
Author(s):  
Minako Imamura ◽  
Atsushi Takahashi ◽  
Masatoshi Matsunami ◽  
Momoko Horikoshi ◽  
Minoru Iwata ◽  
...  

Abstract Several reports have suggested that genetic susceptibility contributes to the development and progression of diabetic retinopathy. We aimed to identify genetic loci that confer susceptibility to diabetic retinopathy in Japanese patients with type 2 diabetes. We analysed 5 790 508 single nucleotide polymorphisms (SNPs) in 8880 Japanese patients with type 2 diabetes, 4839 retinopathy cases and 4041 controls, as well as 2217 independent Japanese patients with type 2 diabetes, 693 retinopathy cases, and 1524 controls. The results of these two genome-wide association studies (GWAS) were combined with an inverse variance meta-analysis (Stage-1), followed by de novo genotyping for the candidate SNP loci (p &lt; 1.0 × 10−4) in an independent case–control study (Stage-2, 2260 cases and 723 controls). After combining the association data (Stage-1 and -2) using meta-analysis, the associations of two loci reached a genome-wide significance level: rs12630354 near STT3B on chromosome 3, p = 1.62 × 10−9, odds ratio (OR) = 1.17, 95% confidence interval (CI) 1.11–1.23, and rs140508424 within PALM2 on chromosome 9, p = 4.19 × 10−8, OR = 1.61, 95% CI 1.36–1.91. However, the association of these two loci were not replicated in Korean, European, or African American populations. Gene-based analysis using Stage-1 GWAS data identified a gene-level association of EHD3 with susceptibility to diabetic retinopathy (p = 2.17 × 10−6). In conclusion, we identified two novel SNP loci, STT3B and PALM2, and a novel gene, EHD3, that confers susceptibility to diabetic retinopathy; however, further replication studies are required to validate these associations.


Author(s):  
Guanghao Qi ◽  
Nilanjan Chatterjee

Abstract Background Previous studies have often evaluated methods for Mendelian randomization (MR) analysis based on simulations that do not adequately reflect the data-generating mechanisms in genome-wide association studies (GWAS) and there are often discrepancies in the performance of MR methods in simulations and real data sets. Methods We use a simulation framework that generates data on full GWAS for two traits under a realistic model for effect-size distribution coherent with the heritability, co-heritability and polygenicity typically observed for complex traits. We further use recent data generated from GWAS of 38 biomarkers in the UK Biobank and performed down sampling to investigate trends in estimates of causal effects of these biomarkers on the risk of type 2 diabetes (T2D). Results Simulation studies show that weighted mode and MRMix are the only two methods that maintain the correct type I error rate in a diverse set of scenarios. Between the two methods, MRMix tends to be more powerful for larger GWAS whereas the opposite is true for smaller sample sizes. Among the other methods, random-effect IVW (inverse-variance weighted method), MR-Robust and MR-RAPS (robust adjust profile score) tend to perform best in maintaining a low mean-squared error when the InSIDE assumption is satisfied, but can produce large bias when InSIDE is violated. In real-data analysis, some biomarkers showed major heterogeneity in estimates of their causal effects on the risk of T2D across the different methods and estimates from many methods trended in one direction with increasing sample size with patterns similar to those observed in simulation studies. Conclusion The relative performance of different MR methods depends heavily on the sample sizes of the underlying GWAS, the proportion of valid instruments and the validity of the InSIDE assumption. Down-sampling analysis can be used in large GWAS for the possible detection of bias in the MR methods.


Sign in / Sign up

Export Citation Format

Share Document