scholarly journals MoRAine - A web server for fast computational transcription factor binding motif re-annotation

2008 ◽  
Vol 5 (2) ◽  
Author(s):  
Jan Baumbach ◽  
Tobias Wittkop ◽  
Jochen Weile ◽  
Thomas Kohl ◽  
Sven Rahmann

SummaryBackground: A precise experimental identification of transcription factor binding motifs (TFBMs), accurate to a single base pair, is time-consuming and difficult. For several databases, TFBM annotations are extracted from the literature and stored 5ʹ → 3ʹ relative to the target gene. Mixing the two possible orientations of a motif results in poor information content of subsequently computed position frequency matrices (PFMs) and sequence logos. Since these PFMs are used to predict further TFBMs, we address the question if the TFBMs underlying a PFM can be re-annotated automatically to improve both the information content of the PFM and subsequent classification performance.Results: We present MoRAine, an algorithm that re-annotates transcription factor binding motifs. Each motif with experimental evidence underlying a PFM is compared against each other such motif. The goal is to re-annotate TFBMs by possibly switching their strands and shifting them a few positions in order to maximize the information content of the resulting adjusted PFM. We present two heuristic strategies to perform this optimization and subsequently show that MoRAine significantly improves the corresponding sequence logos. Furthermore, we justify the method by evaluating specificity, sensitivity, true positive, and false positive rates of PFM-based TFBM predictions for E. coli using the original database motifs and the MoRAine-adjusted motifs. The classification performance is considerably increased if MoRAine is used as a preprocessing step.Conclusions: MoRAine is integrated into a publicly available web server and can be used online or downloaded as a stand-alone version from http://moraine.cebitec.uni-bielefeld.de.

2018 ◽  
Author(s):  
Werner Pieter Veldsman

AbstractExperimental validation of computationally predicted transcription factor binding motifs is desirable. Increased RNA levels in the vicinity of predicted protein-chromosomal binding motifs intuitively suggest regulatory activity. With this intuition in mind, the approach presented here juxtaposes publicly available experimentally derived GRID-seq data with binding motif predictions computationally determined by deep learning models. The aim is to demonstrate the feasibility of using RNA-sequencing data to improve binding motif prediction accuracy. Publicly available GRID-seq scores and computed DeepBind scores could be aggregated by chromosomal region and anomalies within the aggregated data could be detected using mahalanobis distance analysis. A mantel’s test of matrices containing pairwise hamming distances showed significant differences between 1) randomly ranked sequences, 2) sequences ranked by non-GRID-seq assisted scores, and 3) sequences ranked by GRID-seq assisted scores. Plots of mahalanobis ranked binding motifs revealed an inversely proportional relationship between GRID-seq scores and DeepBind scores. Data points with high DeepBind scores but low GRID-seq scores had no DNAse hypersensitivity clusters annotated to their respective sequences. However, DNase hypersensitivity was observed for high scoring DeepBind motifs with moderate GRID-seq scores. Binding motifs of interest were recognized by their deviance from the inversely proportional tendency, and the underlying context sequences of these predicted motifs were on occasion associated with DNAse hypersensitivity unlike the most highly ranked motif scores when DeepBind was used in isolation. This article presents a novel combinatory approach to predict functional protein-chromosomal binding motifs. The two underlying methods are based on recent developments in the fields of RNA sequencing and deep learning, respectively. They are shown to be suited for synergistic use, which broadens the scope of their respective applications.


2020 ◽  
Vol 30 (5) ◽  
pp. 736-748
Author(s):  
Luca Mariani ◽  
Kathryn Weinand ◽  
Stephen S. Gisselbrecht ◽  
Martha L. Bulyk

BMC Genomics ◽  
2016 ◽  
Vol 17 (S2) ◽  
Author(s):  
Ilya E. Vorontsov ◽  
Grigory Khimulya ◽  
Elena N. Lukianova ◽  
Daria D. Nikolaeva ◽  
Irina A. Eliseeva ◽  
...  

2010 ◽  
Vol 39 (3) ◽  
pp. 808-824 ◽  
Author(s):  
Alejandra Medina-Rivera ◽  
Cei Abreu-Goodger ◽  
Morgane Thomas-Chollier ◽  
Heladia Salgado ◽  
Julio Collado-Vides ◽  
...  

2001 ◽  
Vol 276 (30) ◽  
pp. 27825-27830 ◽  
Author(s):  
Christopher C. Cioffi ◽  
Darlene L. Middleton ◽  
Melanie R. Wilson ◽  
Norman W. Miller ◽  
L. William Clem ◽  
...  

2007 ◽  
Vol 5 (3-4) ◽  
pp. 158-165 ◽  
Author(s):  
Andy B. Chen ◽  
Kazunori Hamamura ◽  
Guohua Wang ◽  
Weirong Xing ◽  
Subburaman Mohan ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document