scholarly journals Author response: Deep learning models predict regulatory variants in pancreatic islets and refine type 2 diabetes association signals

2020 ◽  
Author(s):  
Agata Wesolowska-Andersen ◽  
Grace Zhuo Yu ◽  
Vibe Nylander ◽  
Fernando Abaitua ◽  
Matthias Thurner ◽  
...  
eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Agata Wesolowska-Andersen ◽  
Grace Zhuo Yu ◽  
Vibe Nylander ◽  
Fernando Abaitua ◽  
Matthias Thurner ◽  
...  

Genome-wide association analyses have uncovered multiple genomic regions associated with T2D, but identification of the causal variants at these remains a challenge. There is growing interest in the potential of deep learning models - which predict epigenome features from DNA sequence - to support inference concerning the regulatory effects of disease-associated variants. Here, we evaluate the advantages of training convolutional neural network (CNN) models on a broad set of epigenomic features collected in a single disease-relevant tissue – pancreatic islets in the case of type 2 diabetes (T2D) - as opposed to models trained on multiple human tissues. We report convergence of CNN-based metrics of regulatory function with conventional approaches to variant prioritization – genetic fine-mapping and regulatory annotation enrichment. We demonstrate that CNN-based analyses can refine association signals at T2D-associated loci and provide experimental validation for one such signal. We anticipate that these approaches will become routine in downstream analyses of GWAS.


2019 ◽  
Author(s):  
Agata Wesolowska-Andersen ◽  
Grace Zhuo Yu ◽  
Vibe Nylander ◽  
Fernando Abaitua ◽  
Matthias Thurner ◽  
...  

AbstractGenome-wide association analyses have uncovered multiple genomic regions associated with T2D, but identification of the causal variants at these remains a challenge. There is growing interest in the potential of deep learning models - which predict epigenome features from DNA sequence - to support inference concerning the regulatory effects of disease-associated variants. Here, we evaluate the advantages of training convolutional neural network (CNN) models on a broad set of epigenomic features collected in a single disease-relevant tissue – pancreatic islets in the case of type 2 diabetes (T2D) - as opposed to models trained on multiple human tissues. We report convergence of CNN-based metrics of regulatory function with conventional approaches to variant prioritization – genetic fine-mapping and regulatory annotation enrichment. We demonstrate that CNN-based analyses can refine association signals at T2D-associated loci and provide experimental validation for one such signal. We anticipate that these approaches will become routine in downstream analyses of GWAS.


2021 ◽  
Vol 33 (2) ◽  
pp. 93-114
Author(s):  
Mallika G.C. ◽  
Abeer Alsadoon ◽  
Duong Thu Hang Pham ◽  
Salma Hameedi Abdullah ◽  
Ha Thi Mai ◽  
...  

Type 2 Diabetes (T2DM) makes up about 90% of diabetes cases, as well as tough restriction on continuous monitoring and detecting become one of key aspects in T2DM. This research aims to develop an ensemble of several machine learning and deep learning models for early detection of T2DM with high accuracy. With high diversity of models, the ensemble will provide more excessive performance than single models. Methodology: The proposed system is modified enhanced ensemble of machine learning models for T2DM prediction. It is composed of Logistic Regression, Random Forest, SVM and Deep Neural Network models to generate a modified ensemble model. Results: The output of each model in the modified ensemble is used to figure out the final output of the system. The datasets being used for these models include Practice Fusion HER, Pima Indians diabetic's data, UCI AIM94 Dataset and CA Diabetes Prevalence 2014. In comparison to the previous solutions, the proposed ensemble model solution exposes the effectiveness of accuracy, sensitivity, and specificity. It provides an accuracy of 87.5% from 83.51% in average, sensitivity of 35.8% from 29.59% as well as specificity of 98.9% from 96.27%. The processing time of the proposed model solution with 96.6ms is faster than the state-of-the-art with 97.5ms. Conclusion: The proposed modified enhanced system in this work improves the overall prediction capability of T2DM using an ensemble of several machine learning and deep learning models. A majority voting scheme utilizes the output from several models to make the final accurate prediction. Regularization function in this work is modified in order to include the regularization of all the models in ensemble, that helps prevent the overfitting and encourages the generalization capacity of the proposed system.


2019 ◽  
Author(s):  
Vivek Rai ◽  
Daniel X. Quang ◽  
Michael R. Erdos ◽  
Darren A. Cusanovich ◽  
Riza M. Daza ◽  
...  

ABSTRACTObjectiveType 2 diabetes (T2D) is a complex disease characterized by pancreatic islet dysfunction, insulin resistance, and disruption of blood glucose levels. Genome wide association studies (GWAS) have identified >400 independent signals that encode genetic predisposition. More than 90% of the associated single nucleotide polymorphisms (SNPs) localize to non-coding regions and are enriched in chromatin-defined islet enhancer elements, indicating a strong transcriptional regulatory component to disease susceptibility. Pancreatic islets are a mixture of cell types that express distinct hormonal programs, and so each cell type may contribute differentially to the underlying regulatory processes that modulate T2D-associated transcriptional circuits. Existing chromatin profiling methods such as ATAC-seq and DNase-seq, applied to islets in bulk, produce aggregate profiles that mask important cellular and regulatory heterogeneity.MethodsWe present genome-wide single cell chromatin accessibility profiles in >1,600 cells derived from a human pancreatic islet sample using single-cell-combinatorial-indexing ATAC-seq (sci-ATAC-seq). We also developed a deep learning model based on the U-Net architecture to accurately predict open chromatin peak calls in rare cell populations.ResultsWe show that sci-ATAC-seq profiles allow us to deconvolve alpha, beta, and delta cell populations and identify cell-type-specific regulatory signatures underlying T2D. Particularly, we find that T2D GWAS SNPs are significantly enriched in beta cell-specific and cross cell-type shared islet open chromatin, but not in alpha or delta cell-specific open chromatin. We also demonstrate, using less abundant delta cells, that deep-learning models can improve signal recovery and feature reconstruction of rarer cell populations. Finally, we use co-accessibility measures to nominate the cell-specific target genes at 104 non-coding T2D GWAS signals.ConclusionsCollectively, we identify the islet cell type of action across genetic signals of T2D predisposition and provide higher-resolution mechanistic insights into genetically encoded risk pathways.


2020 ◽  
Vol 32 ◽  
pp. 109-121 ◽  
Author(s):  
Vivek Rai ◽  
Daniel X. Quang ◽  
Michael R. Erdos ◽  
Darren A. Cusanovich ◽  
Riza M. Daza ◽  
...  

2017 ◽  
Vol 7 (1) ◽  
Author(s):  
Sophie Bauer ◽  
Charlotte Wennberg Huldt ◽  
Kajsa P. Kanebratt ◽  
Isabell Durieux ◽  
Daniela Gunne ◽  
...  

Diabetologia ◽  
2012 ◽  
Vol 55 (7) ◽  
pp. 1985-1994 ◽  
Author(s):  
J. Taneera ◽  
Z. Jin ◽  
Y. Jin ◽  
S. J. Muhammed ◽  
E. Zhang ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document