Data-driven Gene Regulatory Networks Inference Based on Classification Algorithms

Sergio Peignier; Pauline Schmitt; Federica Calevro

doi:10.1142/s0218213021500226

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213021500226 ◽

2021 ◽

Vol 30 (04) ◽

pp. 2150022

Author(s):

Sergio Peignier ◽

Pauline Schmitt ◽

Federica Calevro

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Target Genes ◽

Data Driven ◽

Classification Algorithms ◽

New Family ◽

Gene Regulatory ◽

High Throughput Gene Expression ◽

Inference Methods

Inferring Gene Regulatory Networks from high-throughput gene expression data is a challenging problem addressed by the systems biology community. Most approaches that aim to unravel gene regulation mechanisms in a data-driven way analyze gene expression datasets to score potential regulatory links between transcription factors and target genes. So far, three major families of approaches have been proposed to score regulatory links. These methods rely respectively on correlation measures, mutual information metrics, and regression algorithms. In this paper we present a new family of data-driven inference methods. This new family, inspired by the regression-based paradigm, relies on the use of classification algorithms. This paper assesses and advocates for the use of this paradigm as a promising new approach to infer gene regulatory networks. Indeed, the development and assessment of five new inference methods based on well-known classification algorithms show that the classification-based inference family exhibits good results when compared to well-established paradigms.
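The abstract does not name the five classifiers or the discretization step, so the following is only a minimal sketch of the general idea, under stated assumptions: the target gene's expression is binarized at its median (an assumption), and each candidate transcription factor is scored by the training accuracy of a one-threshold classifier (a decision stump, used here as a stand-in for the paper's classifiers). The function names and the toy data are hypothetical.

```python
from statistics import median

def binarize(values):
    """Label a sample 1 if the target gene is above its median, else 0 (assumed discretization)."""
    m = median(values)
    return [1 if v > m else 0 for v in values]

def stump_accuracy(feature, labels):
    """Best training accuracy of a one-threshold classifier built on a single TF."""
    n = len(labels)
    best = 0.0
    for threshold in sorted(set(feature)):
        predicted = [1 if f > threshold else 0 for f in feature]
        correct = sum(p == y for p, y in zip(predicted, labels))
        # Allow either polarity so repressors can score as well as activators.
        best = max(best, correct / n, (n - correct) / n)
    return best

def score_links(expression, tfs, targets):
    """Return {(tf, target): score} for every TF-target pair, skipping self-links."""
    scores = {}
    for target in targets:
        labels = binarize(expression[target])
        for tf in tfs:
            if tf != target:
                scores[(tf, target)] = stump_accuracy(expression[tf], labels)
    return scores

# Toy data invented for illustration: geneA follows tf1, tf2 is unrelated.
expression = {
    "tf1":   [0.1, 0.4, 0.9, 1.2, 1.8, 2.5],
    "tf2":   [1.0, 0.2, 1.5, 0.3, 1.1, 0.4],
    "geneA": [0.2, 0.5, 1.0, 1.1, 2.0, 2.4],
}
scores = score_links(expression, tfs=["tf1", "tf2"], targets=["geneA"])
ranked = sorted(scores, key=scores.get, reverse=True)  # strongest candidate links first
```

In this toy case the link from tf1 to geneA ranks first because a single threshold on tf1 separates the high and low states of geneA. A real pipeline would presumably use held-out data or a classifier's feature importances rather than training accuracy.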

ComHub: Community predictions of hubs in gene regulatory networks

10.1101/840959 ◽

2019 ◽

Author(s):

Julia Åkesson ◽

Zelmina Lubovac-Pilav ◽

Rasmus Magnusson ◽

Mika Gustafsson

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Drug Targets ◽

Regulatory Networks ◽

Network Inference ◽

Target Genes ◽

Data Sets ◽

Data Set ◽

Gene Regulatory ◽

Inference Methods

Abstract. Summary: Hub transcription factors, regulating many target genes in gene regulatory networks (GRNs), play important roles as disease regulators and potential drug targets. However, while numerous methods have been developed to predict individual regulator-gene interactions from gene expression data, few methods focus on inferring these hubs. We have developed ComHub, a tool to predict hubs in GRNs. ComHub makes a community prediction of hubs by averaging over predictions by a compendium of network inference methods. Benchmarking ComHub against the DREAM5 challenge data and an independent data set of human gene expression showed robust performance of ComHub across all data sets. Lastly, we implemented ComHub to work with both predefined networks and to do standard network inference, which we believe will make it generally applicable. Availability: Code is available at https://gitlab.com/Gustafsson-lab/[email protected], [email protected]

ComHub: Community predictions of hubs in gene regulatory networks

BMC Bioinformatics ◽

10.1186/s12859-021-03987-y ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Julia Åkesson ◽

Zelmina Lubovac-Pilav ◽

Rasmus Magnusson ◽

Mika Gustafsson

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Drug Targets ◽

Regulatory Networks ◽

Network Inference ◽

Target Genes ◽

Gene Regulatory ◽

Inference Methods ◽

Different Sources ◽

Potential Drug Targets

Abstract. Background: Hub transcription factors, regulating many target genes in gene regulatory networks (GRNs), play important roles as disease regulators and potential drug targets. However, while numerous methods have been developed to predict individual regulator-gene interactions from gene expression data, few methods focus on inferring these hubs. Results: We have developed ComHub, a tool to predict hubs in GRNs. ComHub makes a community prediction of hubs by averaging over predictions by a compendium of network inference methods. Benchmarking ComHub against the DREAM5 challenge data and two independent gene expression datasets showed a robust performance of ComHub over all datasets. Conclusions: In contrast to other evaluated methods, ComHub consistently scored among the top performing methods on data from different sources. Lastly, we implemented ComHub to work with both predefined networks and to perform stand-alone network inference, which will make the method generally applicable.
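The community averaging described above can be sketched roughly as follows. This is not ComHub's actual code or API: the abstract does not say how each method's output is turned into a hub score, so the sketch assumes that a regulator's score is its out-degree among a method's `top_k` highest-scoring edges, averaged over methods. All names (`outdegrees`, `community_hubs`, `top_k`) and the toy predictions are hypothetical.

```python
from collections import Counter

def outdegrees(edges, top_k):
    """Count targets per regulator among one method's top_k highest-scoring edges."""
    top = sorted(edges, key=lambda e: e[2], reverse=True)[:top_k]
    return Counter(tf for tf, _target, _score in top)

def community_hubs(predictions, top_k):
    """Average each regulator's out-degree over all methods (assumed scheme)."""
    per_method = [outdegrees(edges, top_k) for edges in predictions.values()]
    regulators = set().union(*per_method)
    return {
        tf: sum(counts[tf] for counts in per_method) / len(per_method)
        for tf in regulators
    }

# Toy predictions invented for illustration: (regulator, target, score) per method.
predictions = {
    "method_a": [("tf1", "g1", 0.9), ("tf1", "g2", 0.8),
                 ("tf2", "g1", 0.7), ("tf2", "g3", 0.1)],
    "method_b": [("tf1", "g1", 0.6), ("tf2", "g2", 0.9),
                 ("tf1", "g3", 0.8), ("tf1", "g2", 0.2)],
}
hubs = community_hubs(predictions, top_k=3)
top_hub = max(hubs, key=hubs.get)
```

Averaging over methods means a regulator is ranked as a hub only if several methods agree on it, which is one plausible reason a community score could be more stable across data sets than any single method.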

Identifying genetic modulators of the connectivity between transcription factors and their transcriptional targets

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1517140113 ◽

2016 ◽

Vol 113 (13) ◽

pp. E1835-E1843 ◽

Cited By ~ 8

Author(s):

Mina Fazlollahi ◽

Ivor Muroff ◽

Eunjee Lee ◽

Helen C. Causton ◽

Harmen J. Bussemaker

Keyword(s):

Gene Expression ◽

Transcription Factors ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Target Genes ◽

Regulation Of Gene Expression ◽

Nonsynonymous Mutation ◽

Gene Regulatory ◽

Genetic Modulators

Regulation of gene expression by transcription factors (TFs) is highly dependent on genetic background and interactions with cofactors. Identifying specific context factors is a major challenge that requires new approaches. Here we show that exploiting natural variation is a potent strategy for probing functional interactions within gene regulatory networks. We developed an algorithm to identify genetic polymorphisms that modulate the regulatory connectivity between specific transcription factors and their target genes in vivo. As a proof of principle, we mapped connectivity quantitative trait loci (cQTLs) using parallel genotype and gene expression data for segregants from a cross between two strains of the yeast Saccharomyces cerevisiae. We identified a nonsynonymous mutation in the DIG2 gene as a cQTL for the transcription factor Ste12p and confirmed this prediction empirically. We also identified three polymorphisms in TAF13 as putative modulators of regulation by Gcn4p. Our method has potential for revealing how genetic differences among individuals influence gene regulatory networks in any organism for which gene expression and genotype data are available along with information on binding preferences for transcription factors.

Gene regulatory network reconstruction using single-cell RNA sequencing of barcoded genotypes in diverse environments

eLife ◽

10.7554/elife.51254 ◽

2020 ◽

Vol 9 ◽

Cited By ~ 16

Author(s):

Christopher A Jackson ◽

Dayanne M Castro ◽

Giuseppe-Antonio Saldi ◽

Richard Bonneau ◽

David Gresham

Keyword(s):

Gene Expression ◽

Single Cell ◽

Rna Sequencing ◽

Gene Regulatory Network ◽

Gene Regulatory Networks ◽

Regulatory Network ◽

Regulatory Networks ◽

Target Genes ◽

Single Cell Rna Sequencing ◽

Gene Regulatory

Understanding how gene expression programs are controlled requires identifying regulatory relationships between transcription factors and target genes. Gene regulatory networks are typically constructed from gene expression data acquired following genetic perturbation or environmental stimulus. Single-cell RNA sequencing (scRNAseq) captures the gene expression state of thousands of individual cells in a single experiment, offering advantages in combinatorial experimental design, large numbers of independent measurements, and accessing the interaction between the cell cycle and environmental responses that is hidden by population-level analysis of gene expression. To leverage these advantages, we developed a method for scRNAseq in budding yeast (Saccharomyces cerevisiae). We pooled diverse transcriptionally barcoded gene deletion mutants in 11 different environmental conditions and determined their expression state by sequencing 38,285 individual cells. We benchmarked a framework for learning gene regulatory networks from scRNAseq data that incorporates multitask learning and constructed a global gene regulatory network comprising 12,228 interactions.

Evolution of Mendelian dominance in gene regulatory networks associated with phenotypic robustness

10.1101/2021.01.11.426187 ◽

2021 ◽

Author(s):

Kenji Okubo ◽

Kunihiko Kaneko

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Target Genes ◽

Gene Expression Pattern ◽

Mendelian Inheritance ◽

Gene Expression Dynamics ◽

Phenotypic Robustness ◽

Robustness To Noise ◽

Gene Regulatory

Abstract: Mendelian inheritance is a fundamental law of genetics. Considering two alleles in a diploid, a phenotype of a heterotype is dominated by a particular homotype according to the law of dominance. This picture is usually based on simple genotype-phenotype mapping in which one gene regulates one phenotype. However, in reality, some interactions between genes can result in deviation from Mendelian dominance. Here, by using the numerical evolution of diploid gene regulatory networks (GRNs), we discuss whether Mendelian dominance evolves beyond the classical case of one-to-one genotype-phenotype mapping. We examine whether complex genotype-phenotype mapping can achieve Mendelian dominance through the evolution of the GRN with interacting genes. Specifically, we extend the GRN model to a diploid case, in which two GRN matrices are added to give gene expression dynamics, and simulate evolution with meiosis and recombination. Our results reveal that Mendelian dominance evolves even under complex genotype-phenotype mapping. This dominance is achieved via a group of genotypes that differ from each other but have a common phenotype given by the expression of target genes. Calculating the degree of dominance shows that it increases through the evolution, correlating closely with the decrease in phenotypic fluctuations and the increase in robustness to initial noise. This evolution of Mendelian dominance is associated with phenotypic robustness against meiosis-induced genome mixing, whereas sexual recombination arising from the mixing of chromosomes from the parents further enhances dominance and robustness. Owing to this dominance, the robustness to genetic differences increases, while the optimal fitness is sustained up to a large difference between the two genomes. In summary, Mendelian dominance is achieved by groups of genotypes that are associated with the increase in phenotypic robustness to noise.

Author summary: Mendelian dominance is one of the most fundamental laws in genetics. When two conflicting characters occur in a single diploid, the dominant character is always chosen. Assuming that one gene makes one character, this law is simple to grasp. However, in reality, phenotypes are generated via interactions between several genes, which may alter Mendel’s dominance law. The evolution of robustness to noise and mutations has been investigated extensively using complex expression dynamics with gene regulatory networks. Here, we applied gene-expression dynamics with complex interactions to the case of a diploid and simulated the evolution of the gene regulatory network to generate the optimal phenotype given by a certain gene expression pattern. Interestingly, after evolution, Mendelian dominance is achieved via a group of genes. This group-based Mendelian dominance is shaped by phenotype insensitivity to genome mixing by meiosis and evolves concurrently with the robustness to noise. By focusing on the influence of phenotypic robustness, which has received considerable attention recently, our result provides a novel perspective as to why Mendel’s law of dominance is commonly observed.
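The diploid model mentioned in the abstract, in which two GRN matrices are added to give the expression dynamics, might look roughly like the sketch below. Only the matrix addition comes from the abstract. The tanh update rule, the gain `beta`, the number of steps, and the toy matrices are assumptions chosen for illustration, and the evolutionary loop with meiosis and recombination is not shown.

```python
import math

def add_matrices(a, b):
    """Element-wise sum of the two haploid regulatory matrices."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]

def step(expression, matrix, beta=2.0):
    """One update of all genes; the tanh response and beta are assumed, not from the source."""
    return [
        math.tanh(beta * sum(w * x for w, x in zip(row, expression)))
        for row in matrix
    ]

def phenotype(maternal, paternal, initial, n_steps=50):
    """Iterate the dynamics driven by the summed matrices and return the final expression."""
    combined = add_matrices(maternal, paternal)
    x = list(initial)
    for _ in range(n_steps):
        x = step(x, combined)
    return x

# Toy two-gene genomes invented for illustration.
# Row i lists the regulatory inputs to gene i.
maternal = [[1.0, 0.0], [0.5, 0.0]]
paternal = [[1.0, 0.0], [-0.2, 0.0]]
result = phenotype(maternal, paternal, initial=[0.5, 0.0])
```

Here the two genomes disagree on how gene 0 regulates gene 1, and the phenotype depends only on their sum, so different genotype pairs with the same summed matrix give the same expression pattern.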

ALGORITHMS FOR RECONSTRUCTION OF GENE REGULATORY NETWORKS FROM HIGH-THROUGHPUT GENE EXPRESSION DATA

10.37099/mtu.dc.etdr/722 ◽

2018 ◽

Author(s):

Wenping Deng

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

High Throughput ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Expression Data ◽

Gene Regulatory ◽

High Throughput Gene Expression

Systematic discovery and perturbation of regulatory genes in human T cells reveals the architecture of immune networks

10.1101/2021.04.18.440363 ◽

2021 ◽

Author(s):

Jacob W Freimer ◽

Oren Shaked ◽

Sahin Naqvi ◽

Nasa Sinnott-Armstrong ◽

Arwa Kathiria ◽

...

Keyword(s):

Gene Expression ◽

T Cells ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Target Genes ◽

Regulatory Genes ◽

Disease Genes ◽

Human T Cells ◽

Gene Regulatory ◽

Upstream Regulators

Complex gene regulatory networks ensure that important genes are expressed at precise levels. When gene expression is sufficiently perturbed it can lead to disease. To understand how gene expression disruptions percolate through a network, we must first map connections between regulatory genes and their downstream targets. However, we lack comprehensive knowledge of the upstream regulators of most genes. Here we developed an approach for systematic discovery of upstream regulators of critical immune factors - IL2RA, IL-2, and CTLA4 - in primary human T cells. Then, we mapped the network of these regulators' target genes and enhancers using CRISPR perturbations, RNA-Seq, and ATAC-Seq. These regulators form densely interconnected networks with extensive feedback loops. Furthermore, this network is highly enriched for immune-associated disease variants and genes. These results provide insight into how immune-associated disease genes are regulated in T cells and broader principles about the structure of human gene regulatory networks.

Faculty Opinions recommendation of Predicting gene regulatory networks by combining spatial and temporal gene expression data in Arabidopsis root stem cells.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.729074122.793536255 ◽

2017 ◽

Author(s):

Elena Alvarez-Buylla ◽

Monica Garcia

Keyword(s):

Gene Expression ◽

Stem Cells ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Expression Data ◽

Arabidopsis Root ◽

Temporal Gene Expression ◽

Gene Regulatory

Current Development and Review of Dynamic Bayesian Network-Based Methods for Inferring Gene Regulatory Networks from Gene Expression Data

Current Bioinformatics ◽

10.2174/1574893609666140421210333 ◽

2014 ◽

Vol 9 (5) ◽

pp. 531-539 ◽

Cited By ~ 6

Author(s):

Lian Chai ◽

Mohd Mohamad ◽

Safaai Deris ◽

Chuii Chong ◽

Yee Choon ◽

...

Keyword(s):

Gene Expression ◽

Bayesian Network ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Dynamic Bayesian Network ◽

Expression Data ◽

Current Development ◽

Gene Regulatory

GATA-targeted compounds modulate cardiac subtype cell differentiation in dual reporter stem cell line

Stem Cell Research & Therapy ◽

10.1186/s13287-021-02259-z ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Mika J. Välimäki ◽

Robert S. Leigh ◽

Sini M. Kinnunen ◽

Alexander R. March ◽

Ana Hernández de Sande ◽

...

Keyword(s):

Gene Expression ◽

Progenitor Cells ◽

Cell Fate ◽

Reporter Gene ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Cell Fate Decisions ◽

Dual Reporter ◽

Gene Regulatory ◽

Reporter Gene Assays

Abstract. Background: Pharmacological modulation of cell fate decisions and developmental gene regulatory networks holds promise for the treatment of heart failure. Compounds that target tissue-specific transcription factors could overcome non-specific effects of small molecules and lead to the regeneration of heart muscle following myocardial infarction. Due to cellular heterogeneity in the heart, the activation of gene programs representing specific atrial and ventricular cardiomyocyte subtypes would be highly desirable. Chemical compounds that modulate atrial and ventricular cell fate could be used to improve subtype-specific differentiation of endogenous or exogenously delivered progenitor cells in order to promote cardiac regeneration. Methods: Transcription factor GATA4-targeted compounds that have previously shown in vivo efficacy in cardiac injury models were tested for stage-specific activation of atrial and ventricular reporter genes in differentiating pluripotent stem cells using a dual reporter assay. Chemically induced gene expression changes were characterized by qRT-PCR, global run-on sequencing (GRO-seq) and immunoblotting, and the network of cooperative proteins of GATA4 and NKX2-5 were further explored by the examination of the GATA4 and NKX2-5 interactome by BioID. Reporter gene assays were conducted to examine combinatorial effects of GATA-targeted compounds and bromodomain and extraterminal domain (BET) inhibition on chamber-specific gene expression. Results: GATA4-targeted compounds 3i-1000 and 3i-1103 were identified as differential modulators of atrial and ventricular gene expression. More detailed structure-function analysis revealed a distinct subclass of GATA4/NKX2-5 inhibitory compounds with an acetyl lysine-like domain that contributed to ventricular cells (%Myl2-eGFP+). Additionally, BioID analysis indicated broad interaction between GATA4 and BET family of proteins, such as BRD4. This indicated the involvement of epigenetic modulators in the regulation of GATA-dependent transcription. In this line, reporter gene assays with combinatorial treatment of 3i-1000 and the BET bromodomain inhibitor (+)-JQ1 demonstrated the cooperative role of GATA4 and BRD4 in the modulation of chamber-specific cardiac gene expression. Conclusions: Collectively, these results indicate the potential for therapeutic alteration of cell fate decisions and pathological gene regulatory networks by GATA4-targeted compounds modulating chamber-specific transcriptional programs in multipotent cardiac progenitor cells and cardiomyocytes. The compound scaffolds described within this study could be used to develop regenerative strategies for myocardial regeneration.