Single-Cell Analysis of Alternative Splicing and Gene Regulatory Network Reveals Remarkable Expression and Regulation Dynamics During Early Embryonic Development

Abstract BackgroundSingle-cell RNA-seq (scRNA-seq) technologies greatly revolutionized our understanding of cell-to-cell variability of gene expression. Although several studies investigated the expression profile of early embryos, they mainly focused on the expression changes at gene level. Here we systematically explored the gene expression dynamics of human early embryonic development from expression level, alternative splicing, isoform switching and expression regulatory network. ResultsWe found that the genes involved in significant changes of these three aspects are all gradually decreased along embryonic development from E3 to E7 stage. Moreover, these three types of variations are complementary for profiling expression dynamics and they vary greatly across embryonic development as well as between different sexes. Strikingly, only a small number of genes exhibited prominent expression level changes between male and female embryos in E3 stage, whereas many more genes showed variations in alternative splicing and major isoform switching. Additionally, we identified functionally important specific gene regulatory modules for each stage and revealed dynamic usage of transcription factor binding motifs (TFBMs). ConclusionsCollectively, our study gain insights into the expression dynamics of early embryonic development from expression level, alternative splicing, isoform switching and gene regulatory networks, which could benefit the understanding of underlying mechanism of embryonic development.

Download Full-text

Predicting genotype-specific gene regulatory networks

10.1101/2021.01.18.427134 ◽

2021 ◽

Author(s):

Deborah Weighill ◽

Marouen Ben Guebila ◽

Kimberly Glass ◽

John Quackenbush ◽

John Platig

Keyword(s):

Gene Expression ◽

Gene Regulatory Network ◽

Gene Regulatory Networks ◽

Genetic Variants ◽

Regulatory Network ◽

Regulatory Networks ◽

Specific Gene ◽

Disease Etiology ◽

Gene Regulatory ◽

The Impact

AbstractThe majority of disease-associated genetic variants are thought to have regulatory effects, including the disruption of transcription factor (TF) binding and the alteration of downstream gene expression. Identifying how a person’s genotype affects their individual gene regulatory network has the potential to provide important insights into disease etiology and to enable improved genotype-specific disease risk assessments and treatments. However, the impact of genetic variants is generally not considered when constructing gene regulatory networks. To address this unmet need, we developed EGRET (Estimating the Genetic Regulatory Effect on TFs), which infers a genotype-specific gene regulatory network (GRN) for each individual in a study population by using message passing to integrate genotype-informed TF motif predictions - derived from individual genotype data, the predicted effects of variants on TF binding and gene expression, and TF motif predictions - with TF protein-protein interactions and gene expression. Comparing EGRET networks for two blood-derived cell lines identified genotype-associated cell-line specific regulatory differences which were subsequently validated using allele-specific expression, chromatin accessibility QTLs, and differential TF binding from ChIP-seq. In addition, EGRET GRNs for three cell types across 119 individuals captured regulatory differences associated with disease in a cell-type-specific manner. Our analyses demonstrate that EGRET networks can capture the impact of genetic variants on complex phenotypes, supporting a novel fine-scale stratification of individuals based on their genetic background. EGRET is available through the Network Zoo R package (netZooR v0.9; netzoo.github.io).

Download Full-text

PAGeneRN

Data Analytics in Medicine ◽

10.4018/978-1-7998-1204-3.ch055 ◽

2020 ◽

pp. 1052-1075 ◽

Cited By ~ 1

Author(s):

Dina Elsayad ◽

A. Ali ◽

Howida A. Shedeed ◽

Mohamed F. Tolba

Keyword(s):

Gene Expression ◽

Data Analysis ◽

Network Analysis ◽

Gene Regulatory Network ◽

Gene Expression Data ◽

Regulatory Network ◽

Regulatory Networks ◽

Expression Data ◽

Gene Expression Data Analysis ◽

Gene Regulatory

The gene expression analysis is an important research area of Bioinformatics. The gene expression data analysis aims to understand the genes interacting phenomena, gene functionality and the genes mutations effect. The Gene regulatory network analysis is one of the gene expression data analysis tasks. Gene regulatory network aims to study the genes interactions topological organization. The regulatory network is critical for understanding the pathological phenotypes and the normal cell physiology. There are many researches that focus on gene regulatory network analysis but unfortunately some algorithms are affected by data size. Where, the algorithm runtime is proportional to the data size, therefore, some parallel algorithms are presented to enhance the algorithms runtime and efficiency. This work presents a background, mathematical models and comparisons about gene regulatory networks analysis different techniques. In addition, this work proposes Parallel Architecture for Gene Regulatory Network (PAGeneRN).

Download Full-text

Inference of differential gene regulatory networks based on gene expression and genetic perturbation data

Bioinformatics ◽

10.1093/bioinformatics/btz529 ◽

2019 ◽

Vol 36 (1) ◽

pp. 197-204 ◽

Cited By ~ 2

Author(s):

Xin Zhou ◽

Xiaodong Cai

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Structural Equation ◽

Regulatory Networks ◽

Supplementary Information ◽

Specific Gene ◽

Joint Inference ◽

Perturbation Data ◽

Gene Regulatory ◽

The Difference

Abstract Motivation Gene regulatory networks (GRNs) of the same organism can be different under different conditions, although the overall network structure may be similar. Understanding the difference in GRNs under different conditions is important to understand condition-specific gene regulation. When gene expression and other relevant data under two different conditions are available, they can be used by an existing network inference algorithm to estimate two GRNs separately, and then to identify the difference between the two GRNs. However, such an approach does not exploit the similarity in two GRNs, and may sacrifice inference accuracy. Results In this paper, we model GRNs with the structural equation model (SEM) that can integrate gene expression and genetic perturbation data, and develop an algorithm named fused sparse SEM (FSSEM), to jointly infer GRNs under two conditions, and then to identify difference of the two GRNs. Computer simulations demonstrate that the FSSEM algorithm outperforms the approaches that estimate two GRNs separately. Analysis of a dataset of lung cancer and another dataset of gastric cancer with FSSEM inferred differential GRNs in cancer versus normal tissues, whose genes with largest network degrees have been reported to be implicated in tumorigenesis. The FSSEM algorithm provides a valuable tool for joint inference of two GRNs and identification of the differential GRN under two conditions. Availability and implementation The R package fssemR implementing the FSSEM algorithm is available at https://github.com/Ivis4ml/fssemR.git. It is also available on CRAN. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Identification of context-specific gene regulatory networks with GEMULA—gene expression modeling using LAsso

Bioinformatics ◽

10.1093/bioinformatics/btr641 ◽

2011 ◽

Vol 28 (2) ◽

pp. 214-221 ◽

Cited By ~ 26

Author(s):

Geert Geeven ◽

Ronald E. van Kesteren ◽

August B. Smit ◽

Mathisca C. M. de Gunst

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Specific Gene ◽

Gene Regulatory ◽

Context Specific

Download Full-text

An estimation method for a cellular-state-specific gene regulatory network along tree-structured gene expression profiles

Gene ◽

10.1016/j.gene.2012.11.090 ◽

2013 ◽

Vol 518 (1) ◽

pp. 17-25 ◽

Cited By ~ 3

Author(s):

Ryo Araki ◽

Shigeto Seno ◽

Yoichi Takenaka ◽

Hideo Matsuda

Keyword(s):

Gene Expression ◽

Gene Regulatory Network ◽

Regulatory Network ◽

Expression Profiles ◽

Estimation Method ◽

Gene Expression Profiles ◽

Specific Gene ◽

Gene Regulatory

Download Full-text

Identification of key tissue-specific, biological processes by integrating enhancer information in maize gene regulatory networks

10.1101/2020.06.16.155481 ◽

2020 ◽

Author(s):

Maud Fagny ◽

Marieke Lydia Kuijjer ◽

Maike Stam ◽

Johann Joets ◽

Olivier Turc ◽

...

Keyword(s):

Gene Expression ◽

Transcription Factor ◽

Binding Sites ◽

Regulatory Network ◽

Tissue Specificity ◽

Target Genes ◽

Specific Gene ◽

Tissue Specific ◽

Genome Wide ◽

Gene Regulatory

AbstractEnhancers are important regulators of gene expression during numerous crucial processes including tissue differentiation across development. In plants, their recent molecular characterization revealed their capacity to activate the expression of several target genes through the binding of transcription factors. Nevertheless, identifying these target genes at a genome-wide level remains a challenge, in particular in species with large genomes, where enhancers and target genes can be hundreds of kilobases away. Therefore, the contribution of enhancers to regulatory network is still poorly understood in plants. In this study, we investigate the enhancer-driven regulatory network of two maize tissues at different stages: leaves at seedling stage and husks (bracts) at flowering. Using a systems biology approach, we integrate genomic, epigenomic and transcriptomic data to model the regulatory relationship between transcription factors and their potential target genes. We identify regulatory modules specific to husk and V2-IST, and show that they are involved in distinct functions related to the biology of each tissue. We evidence enhancers exhibiting binding sites for two distinct transcription factor families (DOF and AP2/ERF) that drive the tissue-specificity of gene expression in seedling immature leaf and husk. Analysis of the corresponding enhancer sequences reveals that two different transposable element families (TIR transposon Mutator and MITE Pif/Harbinger) have shaped the regulatory network in each tissue, and that MITEs have provided new transcription factor binding sites that are involved in husk tissue-specificity.SignificanceEnhancers play a major role in regulating tissue-specific gene expression in higher eukaryotes, including angiosperms. While molecular characterization of enhancers has improved over the past years, identifying their target genes at the genome-wide scale remains challenging. Here, we integrate genomic, epigenomic and transcriptomic data to decipher the tissue-specific gene regulatory network controlled by enhancers at two different stages of maize leaf development. Using a systems biology approach, we identify transcription factor families regulating gene tissue-specific expression in husk and seedling leaves, and characterize the enhancers likely to be involved. We show that a large part of maize enhancers is derived from transposable elements, which can provide novel transcription factor binding sites crucial to the regulation of tissue-specific biological functions.

Download Full-text

Neural Gene Network Constructor: A Neural Based Model for Reconstructing Gene Regulatory Network

10.1101/842369 ◽

2019 ◽

Author(s):

Zhang Zhang ◽

Lifei Wang ◽

Shuo Wang ◽

Ruyi Tao ◽

Jingshu Xiao ◽

...

Keyword(s):

Gene Expression ◽

Gene Regulatory Network ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Network Structure ◽

Regulatory Network ◽

Gene Network ◽

Regulatory Networks ◽

Expression Data ◽

Gene Regulatory

SummaryReconstructing gene regulatory networks (GRNs) and inferring the gene dynamics are important to understand the behavior and the fate of the normal and abnormal cells. Gene regulatory networks could be reconstructed by experimental methods or from gene expression data. Recent advances in Single Cell RNA sequencing technology and the computational method to reconstruct trajectory have generated huge scRNA-seq data tagged with additional time labels. Here, we present a deep learning model “Neural Gene Network Constructor” (NGNC), for inferring gene regulatory network and reconstructing the gene dynamics simultaneously from time series gene expression data. NGNC is a model-free heterogenous model, which can reconstruct any network structure and non-linear dynamics. It consists of two parts: a network generator which incorporating gumbel softmax technique to generate candidate network structure, and a dynamics learner which adopting multiple feedforward neural networks to predict the dynamics. We compare our model with other well-known frameworks on the data set generated by GeneNetWeaver, and achieve the state of the arts results both on network reconstruction and dynamics learning.

Download Full-text

Expression pattern determines regulatory logic

PLoS ONE ◽

10.1371/journal.pone.0244864 ◽

2021 ◽

Vol 16 (1) ◽

pp. e0244864

Author(s):

Carlos Mora-Martinez

Keyword(s):

Gene Expression ◽

Gene Regulatory Networks ◽

Dna Sequences ◽

In Silico ◽

Regulatory Networks ◽

Expression Patterns ◽

Cell Types ◽

Evolutionary Strategy ◽

Specific Gene ◽

Gene Regulatory

Large amounts of effort have been invested in trying to understand how a single genome is able to specify the identity of hundreds of cell types. Inspired by some aspects of Caenorhabditis elegans biology, we implemented an in silico evolutionary strategy to produce gene regulatory networks (GRNs) that drive cell-specific gene expression patterns, mimicking the process of terminal cell differentiation. Dynamics of the gene regulatory networks are governed by a thermodynamic model of gene expression, which uses DNA sequences and transcription factor degenerate position weight matrixes as input. In a version of the model, we included chromatin accessibility. Experimentally, it has been determined that cell-specific and broadly expressed genes are regulated differently. In our in silico evolved GRNs, broadly expressed genes are regulated very redundantly and the architecture of their cis-regulatory modules is different, in accordance to what has been found in C. elegans and also in other systems. Finally, we found differences in topological positions in GRNs between these two classes of genes, which help to explain why broadly expressed genes are so resilient to mutations. Overall, our results offer an explanatory hypothesis on why broadly expressed genes are regulated so redundantly compared to cell-specific genes, which can be extrapolated to phenomena such as ChIP-seq HOT regions.

Download Full-text

Inferring Gene Regulatory Networks from Genetical Genomics Data

Handbook of Research on Computational Methodologies in Gene Regulatory Networks ◽

10.4018/978-1-60566-685-3.ch004 ◽

2010 ◽

pp. 79-107 ◽

Cited By ~ 2

Author(s):

Bing Liu ◽

Ina Hoeschele ◽

Alberto de la Fuente

Keyword(s):

Gene Expression ◽

Gene Regulatory Network ◽

Regulatory Network ◽

Regulatory Networks ◽

Network Inference ◽

Dna Marker ◽

Search Space ◽

Genetical Genomics ◽

Gene Regulatory Network Inference ◽

Gene Regulatory

In this chapter, we review the current state of Gene Regulatory Network inference based on ‘Genetical Genomics’ experiments (Brem & Kruglyak, 2005; Brem, Yvert, Clinton & Kruglyak, 2002; Jansen, 2003; Jansen & Nap, 2001; Schadt et al., 2003) as a special case of causal network inference in ‘Systems Genetics’ (Threadgill, 2006). In a Genetical Genomics experiment, a segregating or genetically randomized population is DNA marker genotyped and gene-expression profiled on a genomewide scale. The genotypes are regarded as natural, multifactorial perturbations resulting in different gene-expression ‘phenotypes’, and causal relationships can therefore be established between the measured genotypes and the gene-expression phenotypes. In this chapter, we review different computational approaches to Gene Regulatory Network inference based on the joint analysis of DNA marker and expression data and additionally of DNA sequence information if available. This includes different methods for expression QTL mapping, selection of regulator-target pairs, construction of an encompassing network, which strongly constrains the network search space, and pairwise and multivariate methods for Gene Regulatory Network inference, such as Bayesian Networks and Structural Equation Modeling.

Download Full-text