scholarly journals BICORN: An R package for integrative inference of de novo cis-regulatory modules

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Xi Chen ◽  
Jinghua Gu ◽  
Andrew F. Neuwald ◽  
Leena Hilakivi-Clarke ◽  
Robert Clarke ◽  
...  

Abstract Genome-wide transcription factor (TF) binding signal analyses reveal co-localization of TF binding sites, based on which cis-regulatory modules (CRMs) can be inferred. CRMs play a key role in understanding the cooperation of multiple TFs under specific conditions. However, the functions of CRMs and their effects on nearby gene transcription are highly dynamic and context-specific and therefore are challenging to characterize. BICORN (Bayesian Inference of COoperative Regulatory Network) builds a hierarchical Bayesian model and infers context-specific CRMs based on TF-gene binding events and gene expression data for a particular cell type. BICORN automatically searches for a list of candidate CRMs based on the input TF bindings at regulatory regions associated with genes of interest. Applying Gibbs sampling, BICORN iteratively estimates model parameters of CRMs, TF activities, and corresponding regulation on gene transcription, which it models as a sparse network of functional CRMs regulating target genes. The BICORN package is implemented in R (version 3.4 or later) and is publicly available on the CRAN server at https://cran.r-project.org/web/packages/BICORN/index.html.

2019 ◽  
Author(s):  
Xi Chen

AbstractBICORN is an R package developed to integrate prior transcription factor binding information and gene expression data for cis-regulatory module (CRM) inference. BICORN searches for a list of candidate CRMs from binary bindings on potential target genes. Applying Gibbs sampling, BICORN samples CRMs for each gene using the fitting performance of transcription factor activities and regulation strengths of TFs in each CRM on gene expression. Consequently, sparse regulatory networks are inferred as functional CRMs regulating target genes. The BICORN package is implemented in R and is available at https://cran.r-project.org/web/packages/BICORN/index.html.


2017 ◽  
Author(s):  
Tomasz Dzida ◽  
Mudassar Iqbal ◽  
Iryna Charapitsa ◽  
George Reid ◽  
Henk Stunnenberg ◽  
...  

We have developed a machine learning approach to predict context specific enhancer-promoter interactions using evidence from changes in genomic protein occupancy over time. The occupancy of estrogen receptor alpha (ERα), RNA polymerase (Pol II) and histone marks H2AZ and H3K4me3 were measured over time using ChIP-Seq experiments in MCF7 cells stimulated with estrogen. A Bayesian classifier was developed which uses the correlation of temporal binding patterns at enhancers and promoters and genomic proximity as features to predict interactions. This method was trained using experimentally determined interactions from the same system and was shown to achieve much higher precision than predictions based on the genomic proximity of nearest ERα binding. We use the method to identify a genome-wide confident set of ERα target genes and their regulatory enhancers genome-wide. Validation with publicly available GRO-Seq data demonstrates that our predicted targets are much more likely to show early nascent transcription than predictions based on genomic ERα binding proximity alone.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Xi Chen ◽  
Jinghua Gu ◽  
Andrew F. Neuwald ◽  
Leena Hilakivi-Clarke ◽  
Robert Clarke ◽  
...  
Keyword(s):  
De Novo ◽  

An amendment to this paper has been published and can be accessed via a link at the top of the paper.


2011 ◽  
Vol 23 (1) ◽  
pp. 75 ◽  
Author(s):  
Thomas Werner

Reproduction and fertility are controlled by specific events naturally linked to oocytes, testes and early embryonal tissues. A significant part of these events involves gene expression, especially transcriptional control and alternative transcription (alternative promoters and alternative splicing). While methods to analyse such events for carefully predetermined target genes are well established, until recently no methodology existed to extend such analyses into a genome-wide de novo discovery process. With the arrival of next generation sequencing (NGS) it becomes possible to attempt genome-wide discovery in genomic sequences as well as whole transcriptomes at a single nucleotide level. This does not only allow identification of the primary changes (e.g. alternative transcripts) but also helps to elucidate the regulatory context that leads to the induction of transcriptional changes. This review discusses the basics of the new technological and scientific concepts arising from NGS, prominent differences from microarray-based approaches and several aspects of its application to reproduction and fertility research. These concepts will then be illustrated in an application example of NGS sequencing data analysis involving postimplantation endometrium tissue from cows.


2018 ◽  
Author(s):  
Viren Amin ◽  
Murat Can Cobanoglu

AbstractWe present EPEE (Effector and Perturbation Estimation Engine), a method for differential analysis of transcription factor (TF) activity from gene expression data. EPEE addresses two principal challenges in the field, namely incorporating context-specific TF-gene regulatory networks, and accounting for the fact that TF activity inference is intrinsically coupled for all TFs that share targets. Our validations in well-studied immune and cancer contexts show that addressing the overlap challenge and using state-of-the-art regulatory networks enable EPEE to consistently produce accurate results. (Accessible at: https://github.com/Cobanoglu-Lab/EPEE)


2020 ◽  
pp. jbc.RA120.015896
Author(s):  
Fabiana Passaro ◽  
Ilaria De Martino ◽  
Federico Zambelli ◽  
Giorgia Di Benedetto ◽  
Matteo Barbato ◽  
...  

The Yes-associated protein YAP, one of the major effectors of the Hippo pathway together with its related protein TAZ, mediates a range of cellular processes from proliferation and death to morphogenesis. YAP and TAZ regulate a large number of target genes, acting as co-activators of DNA-binding transcription factors or as negative regulators of transcription by interacting with the nucleosome remodeling and histone deacetylase complexes. YAP is expressed in self-renewing embryonic stem cells (ESCs), although it is still debated whether it plays any crucial roles in the control of either stemness or differentiation. Here we show that the transient downregulation of YAP in mouse ESCs perturbs cellular homeostasis, leading to the inability to differentiate properly. Bisulfite genomic sequencing revealed that this transient knockdown caused a genome-wide alteration of the DNA methylation remodeling that takes place during the early steps of differentiation, suggesting that the phenotype we observed might be due to the dysregulation of some of the mechanisms involved in regulation of ESC exit from pluripotency. By gene expression analysis we identified two molecules which could have a role in the altered genome-wide methylation profile: the long non-coding RNA Ephemeron, whose rapid upregulation is crucial for ESCs transition into epiblast, and the methyltransferase-like protein Dnmt3l, which, during the embryo development, cooperates with Dnmt3a and Dnmt3b to contribute to the de novo DNA methylation that governs early steps of ESC differentiation. These data suggest a new role for YAP in the governance of the epigenetic dynamics of exit from pluripotency.


2018 ◽  
Vol 62 (11-12) ◽  
pp. 723-732 ◽  
Author(s):  
Julie Carnesecchi ◽  
Pedro B. Pinto ◽  
Ingrid Lohmann

Hox transcription factors (TFs) function as key determinants in the specification of cell fates during development. They do so by triggering entire morphogenetic cascades through the activation of specific target genes. In contrast to their fundamental role in development, the molecular mechanisms employed by Hox TFs are still poorly understood. In recent years, a new picture has emerged regarding the function of Hox proteins in gene regulation. Initial studies have primarily focused on understanding how Hox TFs recognize and bind specific enhancers to activate defined Hox targets. However, genome-wide studies on the interactions and dynamics of Hox proteins have revealed a more elaborate function of the Hox factors. It is now known that Hox proteins are involved in several steps of gene expression with potential regulatory functions in the modification of the chromatin landscape and its accessibility, recognition and activation of specific cis-regulatory modules, assembly and activation of promoter transcription complexes and mRNA processing. In the coming years, the characterization of the molecular activity of Hox TFs in these mechanisms will greatly contribute to our general understanding of Hox activity.


2017 ◽  
Author(s):  
Yijie Wang ◽  
Dong-Yeon Cho ◽  
Hangnoh Lee ◽  
Justin Fear ◽  
Brian Oliver ◽  
...  

AbstractUnderstanding gene regulation is a fundamental step towards understanding of how cells function and respond to environmental cues and perturbations. An important step in this direction is the ability to infer the transcription factor (TF)-gene regulatory network (GRN). However gene regulatory networks are typically constructed disregarding the fact that regulatory programs are conditioned on tissue type, developmental stage, sex, and other factors. Due to lack of the biological context specificity, these context-agnostic networks may not provide insight for revealing the precise actions of genes for a specific biological system under concern. Collecting multitude of features required for a reliable construction of GRNs such as physical features (TF binding, chromatin accessibility) and functional features (correlation of expression or chromatin patterns) for every context of interest is costly. Therefore we need methods that is able to utilize the knowledge about a context-agnostic network (or a network constructed in a related context) for construction of a context specific regulatory network.To address this challenge we developed a computational approach that utilizes expression data obtained in a specific biological context such as a particular development stage, sex, tissue type and a GRN constructed in a different but related context (alternatively an incomplete or a noisy network for the same context) to construct a context specific GRN. Our method, NetREX, is inspired by network component analysis (NCA) that estimates TF activities and their influences on target genes given predetermined topology of a TF-gene network. To predict a network under a different condition, NetREX removes the restriction that the topology of the TF-gene network is fixed and allows for adding and removing edges to that network. To solve the corresponding optimization problem, which is non-convex and non-smooth, we provide a general mathematical framework allowing use of the recently proposed Proximal Alternative Linearized Maximization technique and prove that our formulation has the properties required for convergence.We tested our NetREX on simulated data and subsequently applied it to gene expression data in adult females from 99 hemizygotic lines of the Drosophila deletion (DrosDel) panel. The networks predicted by NetREX showed higher biological consistency than alternative approaches. In addition, we used the list of recently identified targets of the Doublesex (DSX) transcription factor to demonstrate the predictive power of our method.


Blood ◽  
2013 ◽  
Vol 122 (21) ◽  
pp. 1845-1845
Author(s):  
Mariateresa Fulciniti ◽  
Manoj Bashin ◽  
Mehmet Kemal Samur ◽  
Rajya Bandi ◽  
Parantu K Shah ◽  
...  

Abstract Transcription factors (TFs) are important oncogenic regulator and are altered during tumor initiation and progression. Our oncogenomic analysis of gene expression data from large clinically-annotated patient samples identified TF Dp1 as one of the most important gene affecting both overall and event-free survival in multiple myeloma (MM). Elevated Dp1 expression was predictive of adverse clinical outcome, independent of Dp1 protein partners, E2Fs and RB, suggesting direct impact of Dp1 and providing the rationale to further evaluate its specific role in MM. We have observed high level of Dp1 expression and activity in MM cells which was further induced after interaction with bone marrow stromal cells (BMSC). Moreover, Dp1 knock-down using specific sh-RNA decreased MM cell growth in 5 MM cell lines with different genetic background, with a concomitant G1 arrest and late induction of apoptosis. These data suggest a role for Dp1 in MM cell proliferation and survival and established a rationale to identify its molecular impact. We have further characterized Dp1 activity using chromatin immunoprecipitation with Dp1 or E2F1 specific antibody followed by genome wide sequencing (ChIP-Seq) to identify Dp1-binding regions in MM. We have identified 2783 exclusive Dp1 binding regions in two MM cell lines. Examination of Dp1 and E2F1 binding revealed that Dp1 co-occupies 65% of the binding sites with E2F1. The DAVID gene set enrichment analysis showed that identified genes were related to cell cycle, as well as to transcriptional and translational processes. To assess the functional consequences of Dp1 DNA binding, the ChIP-Seq data were supplemented with gene expression profile of MM1S cells following shRNA-mediated Dp1 and E2F knock-downs. Integrated analysis incorporating ChIP-seq and expression data identified Dp1 response program in MM. 805 (46%) of 1752 differentially expressed genes also have binding sites for Dp1 and likely are direct transcriptional targets of Dp1 in MM. Enrichment analysis of direct targets revealed that the most strongly enriched pathways for both Dp1 and E2F1 genes combined were related to the cell cycle, especially DNA replication, repair, and metabolism. Interestingly, pathway analysis identified ‘‘regulation of RNA metabolic processes’’ (40 target genes), ‘‘RNA processing’’ (93 target genes) ‘‘RNA splicing’’ (95 genes), and ‘‘RNA binding’’ (53 genes) as statistically significant RNA-related categories enriched among Dp1 target genes, suggests role of Dp1 in RNA splicing. Based on our previous data showing that dysregulated alternate splicing (AS) has significant impact on overall clinical outcome MM, we evaluated the expression of Dp1-modulated splicing factors in our clinically annotated cohort of MM patients and 5 normal PCs. We identified 23 SFs upregulated in MM compared to normal plasma cells. Importantly, the increased expression of 12 of these SFs was linked with poor prognosis in this cohort of myeloma patients. Our data show for the first time that SFs are upregulated in myeloma and link to clinical outcome. To evaluate the impact of Dp1 on alternate splicing (AS), we performed genome-wide analysis of alternate splicing in total RNA from Dp1 silenced MM1S cells using Human Exon1 ST arrays. Splicing profiles showed that Dp1 knock down causes widespread changes in AS. We have identified 3683 genes whose one exon has splicing index more than 1.5 in in shDP1 compared to control pLKO.1-transduced MM1S cells, suggesting impact of Dp1 silencing on alternate splicing. We are now evaluating impact of a peptide able to disrupt Dp1-E2F1 binding with consequent effect on MM cell growth and alternate splicing. In conclusion, our investigation showed that the Dp1/E2F1 signaling pathway plays significant role in myeloma and can directly activate transcription of specific SFs with effect on alternate splicing and potential functional, clinical and therapeutic implications in myeloma. Disclosures: No relevant conflicts of interest to declare.


Sign in / Sign up

Export Citation Format

Share Document