Causal gene regulatory network inference using enhancer activity as a causal anchor

AbstractMotivationTranscription control plays a crucial role in establishing a unique gene expression signature for each of the hundreds of mammalian cell types. Though gene expression data has been widely used to infer the cellular regulatory networks, the methods mainly infer correlations rather than causality. We propose that a causal inference framework successfully used for eQTL data can be extended to infer causal regulatory networks using enhancers as causal anchors and enhancer RNA expression as a readout of enhancer activity.ResultsWe developed statistical models and likelihood-ratio tests to infer causal gene regulatory networks using enhancer RNA (eRNA) expression information as a causal anchor and applied the framework to eRNA and transcript expression data from the FANTOM consortium. Predicted causal targets of transcription factors (TFs) in mouse embryonic stem cells, macrophages and erythroblastic leukemia overlapped significantly with experimentally validated targets from ChIP-seq and perturbation data. We further improved the model by taking into account that some TFs might act in a quantitative, dosage-dependent manner, whereas others might act predominantly in a binary on/off fashion. We predicted TF targets from concerted variation of eRNA and TF and target promoter expression levels within a single cell type as well as across multiple cell types. Importantly, TFs with high-confidence predictions were largely different between these two analyses, demonstrating that variability within a cell type is highly relevant for target prediction of cell type specific factors. Finally, we generated a compendium of high-confidence TF targets across diverse human cell and tissue types.AvailabilityMethods have been implemented in the Findr software, available at https://github.com/lingfeiwang/[email protected], [email protected]

Download Full-text

Causal Transcription Regulatory Network Inference Using Enhancer Activity as a Causal Anchor

International Journal of Molecular Sciences ◽

10.3390/ijms19113609 ◽

2018 ◽

Vol 19 (11) ◽

pp. 3609 ◽

Cited By ~ 2

Author(s):

Deepti Vipin ◽

Lingfei Wang ◽

Guillaume Devailly ◽

Tom Michoel ◽

Anagha Joshi

Keyword(s):

Gene Expression ◽

Regulatory Networks ◽

Network Inference ◽

Embryonic Stem ◽

Gene Expression Signature ◽

Cell Types ◽

Dependent Manner ◽

Expression Data ◽

Cell Type ◽

High Confidence

Transcription control plays a crucial role in establishing a unique gene expression signature for each of the hundreds of mammalian cell types. Though gene expression data have been widely used to infer cellular regulatory networks, existing methods mainly infer correlations rather than causality. We developed statistical models and likelihood-ratio tests to infer causal gene regulatory networks using enhancer RNA (eRNA) expression information as a causal anchor and applied the framework to eRNA and transcript expression data from the FANTOM Consortium. Predicted causal targets of transcription factors (TFs) in mouse embryonic stem cells, macrophages and erythroblastic leukaemia overlapped significantly with experimentally-validated targets from ChIP-seq and perturbation data. We further improved the model by taking into account that some TFs might act in a quantitative, dosage-dependent manner, whereas others might act predominantly in a binary on/off fashion. We predicted TF targets from concerted variation of eRNA and TF and target promoter expression levels within a single cell type, as well as across multiple cell types. Importantly, TFs with high-confidence predictions were largely different between these two analyses, demonstrating that variability within a cell type is highly relevant for target prediction of cell type-specific factors. Finally, we generated a compendium of high-confidence TF targets across diverse human cell and tissue types.

Download Full-text

Faculty Opinions recommendation of Predicting gene regulatory networks by combining spatial and temporal gene expression data in Arabidopsis root stem cells.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.729074122.793536255 ◽

2017 ◽

Author(s):

Elena Alvarez-Buylla ◽

Monica Garcia

Keyword(s):

Gene Expression ◽

Stem Cells ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Expression Data ◽

Arabidopsis Root ◽

Temporal Gene Expression ◽

Gene Regulatory

Download Full-text

Current Development and Review of Dynamic Bayesian Network-Based Methods for Inferring Gene Regulatory Networks from Gene Expression Data

Current Bioinformatics ◽

10.2174/1574893609666140421210333 ◽

2014 ◽

Vol 9 (5) ◽

pp. 531-539 ◽

Cited By ~ 6

Author(s):

Lian Chai ◽

Mohd Mohamad ◽

Safaai Deris ◽

Chuii Chong ◽

Yee Choon ◽

...

Keyword(s):

Gene Expression ◽

Bayesian Network ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Dynamic Bayesian Network ◽

Expression Data ◽

Current Development ◽

Gene Regulatory

Download Full-text

Identification of gene regulatory networks from time course gene expression data

2010 Annual International Conference of the IEEE Engineering in Medicine and Biology ◽

10.1109/iembs.2010.5626506 ◽

2010 ◽

Author(s):

Fang-Xiang Wu ◽

Li-Zhi Liu ◽

Zhang-Hang Xia

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Time Course ◽

Expression Data ◽

Gene Regulatory

Download Full-text

Inference of Gene Regulatory Networks by Topological Prior Information and Data Integration

Biotechnology ◽

10.4018/978-1-5225-8903-7.ch010 ◽

2019 ◽

pp. 265-304

Author(s):

David Correa Martins Jr. ◽

Fabricio Martins Lopes ◽

Shubhra Sankar Ray

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Prior Information ◽

Heterogeneous Data ◽

Data Sources ◽

Expression Data ◽

Heterogeneous Data Sources ◽

Gene Regulatory

The inference of Gene Regulatory Networks (GRNs) is a very challenging problem which has attracted increasing attention since the development of high-throughput sequencing and gene expression measurement technologies. Many models and algorithms have been developed to identify GRNs using mainly gene expression profile as data source. As the gene expression data usually has limited number of samples and inherent noise, the integration of gene expression with several other sources of information can be vital for accurately inferring GRNs. For instance, some prior information about the overall topological structure of the GRN can guide inference techniques toward better results. In addition to gene expression data, recently biological information from heterogeneous data sources have been integrated by GRN inference methods as well. The objective of this chapter is to present an overview of GRN inference models and techniques with focus on incorporation of prior information such as, global and local topological features and integration of several heterogeneous data sources.

Download Full-text

PAGeneRN

Data Analytics in Medicine ◽

10.4018/978-1-7998-1204-3.ch055 ◽

2020 ◽

pp. 1052-1075 ◽

Cited By ~ 1

Author(s):

Dina Elsayad ◽

A. Ali ◽

Howida A. Shedeed ◽

Mohamed F. Tolba

Keyword(s):

Gene Expression ◽

Data Analysis ◽

Network Analysis ◽

Gene Regulatory Network ◽

Gene Expression Data ◽

Regulatory Network ◽

Regulatory Networks ◽

Expression Data ◽

Gene Expression Data Analysis ◽

Gene Regulatory

The gene expression analysis is an important research area of Bioinformatics. The gene expression data analysis aims to understand the genes interacting phenomena, gene functionality and the genes mutations effect. The Gene regulatory network analysis is one of the gene expression data analysis tasks. Gene regulatory network aims to study the genes interactions topological organization. The regulatory network is critical for understanding the pathological phenotypes and the normal cell physiology. There are many researches that focus on gene regulatory network analysis but unfortunately some algorithms are affected by data size. Where, the algorithm runtime is proportional to the data size, therefore, some parallel algorithms are presented to enhance the algorithms runtime and efficiency. This work presents a background, mathematical models and comparisons about gene regulatory networks analysis different techniques. In addition, this work proposes Parallel Architecture for Gene Regulatory Network (PAGeneRN).

Download Full-text

Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data

eLife ◽

10.7554/elife.26476 ◽

2017 ◽

Vol 6 ◽

Cited By ~ 107

Author(s):

Julien Racle ◽

Kaat de Jonge ◽

Petra Baumgaertner ◽

Daniel E Speiser ◽

David Gfeller

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Immune Cell ◽

Expression Profiles ◽

Cell Types ◽

Response To Therapy ◽

Expression Data ◽

Cell Type ◽

Tumor Gene Expression ◽

Tumor Gene

Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org).

Download Full-text