A Mutation in the Drosophila melanogaster eve Stripe 2 Minimal Enhancer Is Buffered by Flanking Sequences

Abstract Enhancers are DNA sequences composed of transcription factor binding sites that drive complex patterns of gene expression in space and time. Until recently, studying enhancers in their genomic context was technically challenging. Therefore, minimal enhancers, the shortest pieces of DNA that can drive an expression pattern that resembles a gene’s endogenous pattern, are often used to study features of enhancer function. However, evidence suggests that some enhancers require sequences outside the minimal enhancer to maintain function under environmental perturbations. We hypothesized that these additional sequences also prevent misexpression caused by a transcription factor binding site mutation within a minimal enhancer. Using the Drosophila melanogaster even-skipped stripe 2 enhancer as a case study, we tested the effect of a Giant binding site mutation (gt-2) on the expression patterns driven by minimal and extended enhancer reporter constructs. We found that, in contrast to the misexpression caused by the gt-2 binding site mutation in the minimal enhancer, the same gt-2 binding site mutation in the extended enhancer did not have an effect on expression. The buffering of expression levels, but not expression pattern, is partially explained by an additional Giant binding site outside the minimal enhancer. Mutating the gt-2 binding site in the endogenous locus had no significant effect on stripe 2 expression. Our results indicate that rules derived from mutating enhancer reporter constructs may not represent what occurs in the endogenous context.

DeepGRN: Prediction of transcription factor binding site across cell-types using attention-based deep neural networks

10.21203/rs.3.rs-19323/v3 ◽

2021 ◽

Author(s):

Chen Chen ◽

Jie Hou ◽

Xiaowen Shi ◽

Hua Yang ◽

James A. Birchler ◽

...

Keyword(s):

Transcription Factor ◽

Transcription Factors ◽

Binding Site ◽

Transcription Factor Binding Site ◽

DNA Sequences ◽

Binding Sites ◽

Transcription Factor Binding ◽

Parallel Sequencing ◽

Factor Binding Site ◽

Factor Binding

Abstract Background Due to the complexity of the biological systems, the prediction of the potential DNA binding sites for transcription factors remains a difficult problem in computational biology. Genomic DNA sequences and experimental results from parallel sequencing provide available information about the affinity and accessibility of genome and are commonly used features in binding sites prediction. The attention mechanism in deep learning has shown its capability to learn long-range dependencies from sequential data, such as sentences and voices. Until now, no study has applied this approach in binding site inference from massively parallel sequencing data. The successful applications of attention mechanism in similar input contexts motivate us to build and test new methods that can accurately determine the binding sites of transcription factors. Results In this study, we propose a novel tool (named DeepGRN) for transcription factors binding site prediction based on the combination of two components: single attention module and pairwise attention module. The performance of our methods is evaluated on the ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge datasets. The results show that DeepGRN achieves higher unified scores in 6 of 13 targets than any of the top four methods in the DREAM challenge. We also demonstrate that the attention weights learned by the model are correlated with potential informative inputs, such as DNase-Seq coverage and motifs, which provide possible explanations for the predictive improvements in DeepGRN. Conclusions DeepGRN can automatically and effectively predict transcription factor binding sites from DNA sequences and DNase-Seq coverage. Furthermore, the visualization techniques we developed for the attention modules help to interpret how critical patterns from different types of input features are recognized by our model.
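
The abstract refers to attention weights that can be compared with inputs such as DNase-Seq coverage and motifs. As a rough illustration of what an attention weight over sequence positions is, the sketch below applies generic softmax attention pooling to a toy input. It is not the DeepGRN architecture: the function names, the two made-up features per position, and the fixed scoring vector are all assumptions for illustration, whereas a real model would learn its scoring parameters.

```python
import math

def softmax(scores):
    """Turn raw scores into non-negative weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(features, scoring_weights):
    """Generic attention pooling over positions (hypothetical helper).

    features: one feature vector per sequence position.
    scoring_weights: vector used to score each position; fixed here,
    learned in a real model.
    Returns (attention weights, weighted average of the feature vectors).
    """
    scores = [sum(f * w for f, w in zip(vec, scoring_weights)) for vec in features]
    weights = softmax(scores)
    dim = len(features[0])
    pooled = [
        sum(weights[i] * features[i][d] for i in range(len(features)))
        for d in range(dim)
    ]
    return weights, pooled

# Toy data (invented): 4 positions, features = [motif match, DNase coverage].
features = [[0.1, 0.0], [0.9, 0.8], [0.2, 0.1], [0.0, 0.3]]
scoring_weights = [2.0, 1.0]

weights, pooled = attention_pool(features, scoring_weights)
for position, weight in enumerate(weights):
    print(f"position {position}: attention weight {weight:.3f}")
```

In this toy input the position with the strongest motif match and coverage receives the largest weight, which is the kind of correspondence an attention visualization is meant to expose.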

DeepGRN: prediction of transcription factor binding site across cell-types using attention-based deep neural networks

BMC Bioinformatics ◽

10.1186/s12859-020-03952-1 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Chen Chen ◽

Jie Hou ◽

Xiaowen Shi ◽

Hua Yang ◽

James A. Birchler ◽

...

Keyword(s):

Transcription Factor ◽

Transcription Factors ◽

Binding Site ◽

Transcription Factor Binding Site ◽

DNA Sequences ◽

Binding Sites ◽

Transcription Factor Binding ◽

Parallel Sequencing ◽

Factor Binding Site ◽

Factor Binding

Abstract Background Due to the complexity of the biological systems, the prediction of the potential DNA binding sites for transcription factors remains a difficult problem in computational biology. Genomic DNA sequences and experimental results from parallel sequencing provide available information about the affinity and accessibility of genome and are commonly used features in binding sites prediction. The attention mechanism in deep learning has shown its capability to learn long-range dependencies from sequential data, such as sentences and voices. Until now, no study has applied this approach in binding site inference from massively parallel sequencing data. The successful applications of attention mechanism in similar input contexts motivate us to build and test new methods that can accurately determine the binding sites of transcription factors. Results In this study, we propose a novel tool (named DeepGRN) for transcription factors binding site prediction based on the combination of two components: single attention module and pairwise attention module. The performance of our methods is evaluated on the ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge datasets. The results show that DeepGRN achieves higher unified scores in 6 of 13 targets than any of the top four methods in the DREAM challenge. We also demonstrate that the attention weights learned by the model are correlated with potential informative inputs, such as DNase-Seq coverage and motifs, which provide possible explanations for the predictive improvements in DeepGRN. Conclusions DeepGRN can automatically and effectively predict transcription factor binding sites from DNA sequences and DNase-Seq coverage. Furthermore, the visualization techniques we developed for the attention modules help to interpret how critical patterns from different types of input features are recognized by our model.

Prediction of transcription factor binding site across cell-types using attention-based deep neural networks

10.21203/rs.3.rs-19323/v1 ◽

2020 ◽

Author(s):

Chen Chen ◽

Jie Hou ◽

Xiaowen Shi ◽

Hua Yang ◽

James A. Birchler ◽

...

Keyword(s):

Transcription Factor ◽

Transcription Factors ◽

Binding Site ◽

Transcription Factor Binding Site ◽

DNA Sequences ◽

Binding Sites ◽

Transcription Factor Binding ◽

Parallel Sequencing ◽

Factor Binding Site ◽

Factor Binding

Abstract Background Due to the complexity of the biological systems, the prediction of the potential DNA binding sites for transcription factors remains a difficult problem in computational biology. Genomic DNA sequences and experimental results from parallel sequencing provide available information about the affinity and accessibility of genome and are commonly used features in binding sites prediction. The attention mechanism in deep learning has shown its capability to learn long-range dependencies from sequential data, such as sentences and voices. Until now, no study has applied this approach in binding site inference from massively parallel sequencing data. The successful applications of attention mechanism in similar input contexts motivate us to build and test new methods that can accurately determine the binding sites of transcription factors. Results In this study, we propose a novel tool (named DeepGRN) for transcription factors binding site prediction based on the combination of two components: single attention module and pairwise attention module. The performance of our methods is evaluated on the ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge datasets. The results show that DeepGRN achieves higher unified scores in 6 of 13 targets than any of the top four methods in the DREAM challenge. We also demonstrate that the attention weights learned by the model are correlated with potential informative inputs, such as DNase-Seq coverage and motifs, which provide possible explanations for the predictive improvements in DeepGRN. Conclusions DeepGRN can automatically and effectively predict transcription factor binding sites from DNA sequences and DNase-Seq coverage. Furthermore, the visualization techniques we developed for the attention modules help to interpret how critical patterns from different types of input features are recognized by our model.

An intuitionistic approach to scoring DNA sequences against transcription factor binding site motifs

BMC Bioinformatics ◽

10.1186/1471-2105-11-551 ◽

2010 ◽

Vol 11 (1) ◽

Cited By ~ 5

Author(s):

Fernando Garcia-Alcalde ◽

Armando Blanco ◽

Adrian J Shepherd

Keyword(s):

Transcription Factor ◽

Binding Site ◽

Transcription Factor Binding Site ◽

DNA Sequences ◽

Transcription Factor Binding ◽

Factor Binding Site ◽

Factor Binding

Transcription Factor Binding Site Mutation

10.32388/qrst98 ◽

2020 ◽

Author(s):

Keyword(s):

Transcription Factor ◽

Binding Site ◽

Transcription Factor Binding Site ◽

Transcription Factor Binding ◽

Site Mutation ◽

Factor Binding Site ◽

Factor Binding

DeepGRN: Prediction of transcription factor binding site across cell-types using attention-based deep neural networks

10.21203/rs.3.rs-19323/v2 ◽

2020 ◽

Author(s):

Chen Chen ◽

Jie Hou ◽

Xiaowen Shi ◽

Hua Yang ◽

James A. Birchler ◽

...

Keyword(s):

Transcription Factor ◽

Transcription Factors ◽

Binding Site ◽

Transcription Factor Binding Site ◽

DNA Sequences ◽

Binding Sites ◽

Transcription Factor Binding ◽

Parallel Sequencing ◽

Factor Binding Site ◽

Factor Binding

Abstract Background Due to the complexity of the biological systems, the prediction of the potential DNA binding sites for transcription factors remains a difficult problem in computational biology. Genomic DNA sequences and experimental results from parallel sequencing provide available information about the affinity and accessibility of genome and are commonly used features in binding sites prediction. The attention mechanism in deep learning has shown its capability to learn long-range dependencies from sequential data, such as sentences and voices. Until now, no study has applied this approach in binding site inference from massively parallel sequencing data. The successful applications of attention mechanism in similar input contexts motivate us to build and test new methods that can accurately determine the binding sites of transcription factors. Results In this study, we propose a novel tool (named DeepGRN) for transcription factors binding site prediction based on the combination of two components: single attention module and pairwise attention module. The performance of our methods is evaluated on the ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge datasets. The results show that DeepGRN achieves higher unified scores in 6 of 13 targets than any of the top four methods in the DREAM challenge. We also demonstrate that the attention weights learned by the model are correlated with potential informative inputs, such as DNase-Seq coverage and motifs, which provide possible explanations for the predictive improvements in DeepGRN. Conclusions DeepGRN can automatically and effectively predict transcription factor binding sites from DNA sequences and DNase-Seq coverage. Furthermore, the visualization techniques we developed for the attention modules help to interpret how critical patterns from different types of input features are recognized by our model.
