A Graph Theoretic Approach for the Feature Extraction of Transcription Factor Binding Sites

Theoretic Model ◽

Theoretic Approach ◽

Promoter Regions ◽

Factor Binding ◽

Graph Theoretic

Identifying transcription factor binding sites with experimental methods is often expensive and time-consuming. Although many computational approaches and tools have been developed for this problem, their prediction accuracy remains unsatisfactory. In this paper, we develop a new computational approach that uses a graph theoretic model to represent the relationships among all short sequence segments in the promoter regions. Based on this model, finding the locations of transcription factor binding sites is reduced to computing maximum weighted cliques in a graph with weighted edges. We have implemented this approach and used it to predict the binding sites in two organisms, Caenorhabditis elegans and Mus musculus. We compared the prediction accuracy with that of the Gibbs Motif Sampler. We found that the accuracy of our approach is higher than or comparable with that of the Gibbs Motif Sampler for most of the tested data, and that it can accurately identify binding sites in cases where the Gibbs Motif Sampler has difficulty predicting their locations.
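To make the reduction to maximum weighted cliques concrete, here is a minimal sketch, not the authors' implementation. It assumes that each vertex is a length-k segment of one promoter sequence, that edges join segments from different sequences, and that an edge weight is the number of matching positions; the abstract specifies none of these details. The function names (`kmers`, `similarity`, `best_clique`) are hypothetical, and the exhaustive search is only feasible for toy inputs.

```python
from itertools import product


def kmers(seq, k):
    """Return every (start, segment) pair of length k in seq."""
    return [(i, seq[i:i + k]) for i in range(len(seq) - k + 1)]


def similarity(a, b):
    """Assumed edge weight: number of positions where two segments agree."""
    return sum(x == y for x, y in zip(a, b))


def best_clique(seqs, k):
    """Exhaustively pick one segment per sequence so that the summed
    pairwise edge weight of the chosen segments is maximal.

    Choosing one vertex per sequence yields a clique in the assumed graph,
    because segments from different sequences are always connected.
    """
    candidates = [kmers(s, k) for s in seqs]
    best_score, best_pick = -1, None
    for pick in product(*candidates):
        score = sum(
            similarity(pick[i][1], pick[j][1])
            for i in range(len(pick))
            for j in range(i + 1, len(pick))
        )
        if score > best_score:
            best_score, best_pick = score, pick
    return best_score, [pos for pos, _ in best_pick]


# Toy promoters that share the planted segment "TGACG".
toy = ["AATGACGT", "TGACGTCC", "CCTGACGA"]
print(best_clique(toy, 5))  # → (15, [2, 0, 2])
```

The search enumerates every combination of segments, so its cost grows exponentially with the number of sequences; a practical tool would need a heuristic or a dedicated clique solver.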

Faculty Opinions recommendation of Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1017662.206496 ◽

2004 ◽

Author(s):

Carlos F Barbas

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Noncoding RNAs ◽

Human Chromosomes ◽

Faculty Opinions recommendation of Position specific variation in the rate of evolution in transcription factor binding sites.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1022543.258615 ◽

2004 ◽

Author(s):

Emmanouil Dermitzakis

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Factor Binding ◽

Rate Of Evolution ◽

Specific Variation

Faculty Opinions recommendation of Divergence of transcription factor binding sites across related yeast species.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1089279.548953 ◽

2007 ◽

Author(s):

Emmanouil Dermitzakis

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Yeast Species ◽

Faculty Opinions recommendation of Genome-wide inference of natural selection on human transcription factor binding sites.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.718018456.793479473 ◽

2013 ◽

Author(s):

Peter Keightley

Keyword(s):

Transcription Factor ◽

Natural Selection ◽

Binding Sites ◽

Factor Binding ◽

Genome Wide ◽

Human Transcription Factor

Gene Variants and Haplotypes Modifying Transcription Factor Binding Sites in the Human Cyclooxygenase 1 and 2 (PTGS1 and PTGS2) Genes

Current Drug Metabolism ◽

10.2174/138920021502140327180336 ◽

2014 ◽

Vol 15 (2) ◽

pp. 182-195 ◽

Cited By ~ 13

Author(s):

Jose Agundez ◽

David Gonzalez-Alvarez ◽

Miguel Vega-Rodriguez ◽

Emilia Botello ◽

Elena Garcia-Martin

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Gene Variants ◽

Factor Binding ◽

Cyclooxygenase 1

Structure-Based Ab Initio Prediction of Transcription Factor–Binding Sites

Methods in Molecular Biology - Computational Systems Biology ◽

10.1007/978-1-59745-243-4_2 ◽

2009 ◽

pp. 23-41 ◽

Cited By ~ 8

Author(s):

L. Angela Liu ◽

Joel S. Bader

Keyword(s):

Transcription Factor ◽

Ab Initio ◽

Binding Sites ◽

Factor Binding ◽

Ab Initio Prediction

Prediction of Transcription Factor Binding Sites of SP1 on Human Chromosome1

Applied Sciences ◽

10.3390/app11115123 ◽

2021 ◽

Vol 11 (11) ◽

pp. 5123

Author(s):

Maiada M. Mahmoud ◽

Nahla A. Belal ◽

Aliaa Youssif

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Messenger RNA ◽

Area Under The Curve ◽

Noisy Data ◽

Classification Problem ◽

K Nearest Neighbors ◽

Transcription factors (TFs) are proteins that control the transcription of a gene from DNA to messenger RNA (mRNA). TFs bind to a specific DNA sequence called a binding site. Transcription factor binding sites have not yet been completely identified, and identifying them is a challenge that can be approached computationally, where it is treated as a classification problem in machine learning. In this paper, the prediction of transcription factor binding sites of SP1 on human chromosome 1 is presented using different classification techniques, and a model using voting is proposed. The highest Area Under the Curve (AUC) achieved is 0.97 using K-Nearest Neighbors (KNN), and 0.95 using the proposed voting technique. However, the proposed voting technique is more efficient with noisy data. This study highlights the applicability and significance of the voting technique for the prediction of binding sites, and shows that KNN performs best on this type of data.
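The idea of combining classifiers by voting can be sketched as follows. This is an illustrative toy, not the pipeline from the paper: the abstract does not state the features, the component classifiers, or the voting scheme, so the Hamming-distance KNN, the GC-content rule with its 0.6 threshold, the hard majority vote, and the six training sequences are all assumptions made for the example.

```python
from collections import Counter


def hamming(a, b):
    """Number of mismatching positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def knn_predict(train, query, k):
    """Label the query by majority among its k nearest training sequences."""
    nearest = sorted(train, key=lambda item: hamming(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def gc_rule(query, threshold=0.6):
    """Hypothetical third voter: predict 1 when the GC fraction is high."""
    gc = sum(c in "GC" for c in query) / len(query)
    return 1 if gc >= threshold else 0


def ensemble_predict(train, query):
    """Hard voting: each classifier casts one vote, the majority label wins."""
    votes = [knn_predict(train, query, 1), knn_predict(train, query, 3), gc_rule(query)]
    return Counter(votes).most_common(1)[0][0]


# Made-up training data: (sequence, label), where 1 means "binding site".
train = [
    ("GGGCGG", 1), ("GGGCGT", 1), ("CGGCGG", 1),
    ("ATTAAT", 0), ("TTATAC", 0), ("AATTGA", 0),
]
print(ensemble_predict(train, "GGGCGA"), ensemble_predict(train, "ATTTAT"))
```

With three voters and two labels a tie cannot occur, which keeps the majority rule unambiguous in this sketch.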

In silico simulations of occurrence of transcription factor binding sites in bacterial genomes

BMC Evolutionary Biology ◽

10.1186/s12862-019-1381-8 ◽

2019 ◽

Vol 19 (1) ◽

Author(s):

Jan Mrázek ◽

Anna C. Karls

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

In Silico ◽

Bacterial Genomes ◽

ConTra: a promoter alignment analysis tool for identification of transcription factor binding sites across species

Nucleic Acids Research ◽

10.1093/nar/gkn195 ◽

2008 ◽

Vol 36 (suppl_2) ◽

pp. W128-W132 ◽

Cited By ~ 32

Author(s):

Bart Hooghe ◽

Paco Hulpiau ◽

Frans van Roy ◽

Pieter De Bleser

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Analysis Tool ◽

Factor Binding ◽

Alignment Analysis