Transcription factor and microRNA motif discovery: The Amadeus platform and a compendium of metazoan target sets

C. Linhart; Y. Halperin; R. Shamir

doi:10.1101/gr.076117.108

MD-SVM: a novel SVM-based algorithm for the motif discovery of transcription factor binding sites

BMC Bioinformatics ◽

10.1186/s12859-019-2735-3 ◽

2019 ◽

Vol 20 (S7) ◽

Cited By ~ 3

Author(s):

Jialu Hu ◽

Jingru Wang ◽

Jianan Lin ◽

Tianwei Liu ◽

Yuanke Zhong ◽

...

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Motif Discovery ◽

Transcription Factor Binding Sites ◽

Transcription Factor Binding ◽

Factor Binding

Download Full-text

Comparative Analysis of Regulatory Motif Discovery Tools for Transcription Factor Binding Sites

Genomics Proteomics & Bioinformatics ◽

10.1016/s1672-0229(07)60023-0 ◽

2007 ◽

Vol 5 (2) ◽

pp. 131-142 ◽

Cited By ~ 19

Author(s):

Wei Wei ◽

Xiao-Dan Yu

Keyword(s):

Transcription Factor ◽

Comparative Analysis ◽

Binding Sites ◽

Motif Discovery ◽

Transcription Factor Binding Sites ◽

Transcription Factor Binding ◽

Regulatory Motif ◽

Factor Binding

Download Full-text

DREME: motif discovery in transcription factor ChIP-seq data

Bioinformatics ◽

10.1093/bioinformatics/btr261 ◽

2011 ◽

Vol 27 (12) ◽

pp. 1653-1659 ◽

Cited By ~ 607

Author(s):

Timothy L. Bailey

Keyword(s):

Transcription Factor ◽

Motif Discovery

Download Full-text

MARS: Motif Assessment and Ranking Suite for transcription factor binding motifs

10.1101/065615 ◽

2016 ◽

Cited By ~ 1

Author(s):

Caleb Kipkurui Kibet ◽

Philip Machanick

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Motif Discovery ◽

High Throughput Sequencing ◽

Rank Correlation ◽

Transcription Factor Binding ◽

Factor Binding ◽

Sequencing Technologies ◽

Benchmark Database ◽

On Chip

AbstractWe describe MARS (Motif Assessment and Ranking Suite), a web-based suite of tools used to evaluate and rank PWM-based motifs. The increased number of learned motif models that are spread across databases and in different PWM formats, leading to a choice dilemma among the users, is our motivation. This increase has been driven by the difficulty of modelling transcription factor binding sites and the advance in high-throughput sequencing technologies at a continually reducing cost. Therefore, several experimental techniques have been developed resulting in diverse motif-finding algorithms and databases. We collate a wide variety of available motifs into a benchmark database, including the corresponding experimental ChIP-seq and PBM data obtained from ENCODE and UniPROBE databases, respectively. The implemented tools include: a data-independent consistency-based motif assessment and ranking (CB-MAR), which is based on the idea that ‘correct motifs’ are more similar to each other while incorrect motifs will differ from each other; and a scoring and classification-based algorithms, which rank binding models by their ability to discriminate sequences known to contain binding sites from those without. The CB-MAR and scoring techniques have a 0.86 and 0.73 median rank correlation using ChIP-seq and PBM respectively. Best motifs selected by CB-MAR achieve a mean AUC of 0.75, comparable to those ranked by held out data at 0.76 – this is based on ChIP-seq motif discovery using five algorithms on 110 transcription factors. We have demonstrated the benefit of this web server in motif choice and ranking, as well as in motif discovery. It can be accessed at http://www.bioinf.ict.ru.ac.za/.

Download Full-text

RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections

10.1101/065565 ◽

2016 ◽

Cited By ~ 1

Author(s):

Jaime Abraham Castro-Mondragon ◽

Sébastien Jaeger ◽

Denis Thieffry ◽

Morgane Thomas-Chollier ◽

Jacques van Helden

Keyword(s):

Transcription Factor ◽

Motif Discovery ◽

Binding Motif ◽

Data Sets ◽

Transcription Factor Binding Motif ◽

Biologically Relevant ◽

Manual Curation ◽

Versatile Tool ◽

Multiple Motif ◽

Multiple Trees

ABSTRACTTranscription Factor (TF) databases contain multitudes of motifs from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq peaks) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant collections of motifs. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools and highlights biologically relevant variations of similar motifs. By clustering 24 entire databases (>7,500 motifs), we show that matrix-clustering correctly groups motifs belonging to the same TF families, and can drastically reduce motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines.

Download Full-text

Bayesian multiple-instance motif discovery with BAMBI: inference of recombinase and transcription factor binding sites

Nucleic Acids Research ◽

10.1093/nar/gkr745 ◽

2011 ◽

Vol 39 (21) ◽

pp. e146-e146 ◽

Cited By ~ 7

Author(s):

Guido H. Jajamovich ◽

Xiaodong Wang ◽

Adam P. Arkin ◽

Michael S. Samoilov

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Motif Discovery ◽

Transcription Factor Binding Sites ◽

Transcription Factor Binding ◽

Factor Binding

Download Full-text

A novel motif-discovery algorithm to identify co-regulatory motifs in large transcription factor and microRNA co-regulatory networks in human

Bioinformatics ◽

10.1093/bioinformatics/btv159 ◽

2015 ◽

Vol 31 (14) ◽

pp. 2348-2355 ◽

Cited By ~ 23

Author(s):

Cheng Liang ◽

Yue Li ◽

Jiawei Luo ◽

Zhaolei Zhang

Keyword(s):

Transcription Factor ◽

Regulatory Networks ◽

Motif Discovery ◽

Regulatory Motifs ◽

Motif Discovery Algorithm

Download Full-text

Combining comparative genomics with de novo motif discovery to identify human transcription factor DNA-binding motifs

BMC Bioinformatics ◽

10.1186/1471-2105-7-s4-s21 ◽

2006 ◽

Vol 7 (S4) ◽

Cited By ~ 6

Author(s):

Linyong Mao ◽

W Jim Zheng

Keyword(s):

Transcription Factor ◽

Comparative Genomics ◽

Dna Binding ◽

Motif Discovery ◽

De Novo ◽

Binding Motifs ◽

Dna Binding Motifs ◽

De Novo Motif Discovery ◽

Human Transcription Factor

Download Full-text

High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints

PLoS Computational Biology ◽

10.1371/journal.pcbi.1002638 ◽

2012 ◽

Vol 8 (8) ◽

pp. e1002638 ◽

Cited By ~ 167

Author(s):

Yuchun Guo ◽

Shaun Mahony ◽

David K. Gifford

Keyword(s):

Transcription Factor ◽

High Resolution ◽

Motif Discovery ◽

Genome Wide ◽

Binding Event ◽

Binding Constraints

Download Full-text

Motif discovery for transcription factor binding sites using a priori information on potential similarity regions at different resolution scales

New Biotechnology ◽

10.1016/j.nbt.2010.01.059 ◽

2010 ◽

Vol 27 ◽

pp. S41

Author(s):

I.V. Kulakovskiy ◽

V.A. Boeva ◽

A.V. Favorov ◽

V.J. Makeev

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Motif Discovery ◽

A Priori ◽

Transcription Factor Binding Sites ◽

Transcription Factor Binding ◽

A Priori Information ◽

Factor Binding ◽

Priori Information

Download Full-text