Learning Gene Regulatory Networks with High-Dimensional Heterogeneous Data

The inference of Gene Regulatory Networks (GRNs) is a very challenging problem which has attracted increasing attention since the development of high-throughput sequencing and gene expression measurement technologies. Many models and algorithms have been developed to identify GRNs using mainly gene expression profile as data source. As the gene expression data usually has limited number of samples and inherent noise, the integration of gene expression with several other sources of information can be vital for accurately inferring GRNs. For instance, some prior information about the overall topological structure of the GRN can guide inference techniques toward better results. In addition to gene expression data, recently biological information from heterogeneous data sources have been integrated by GRN inference methods as well. The objective of this chapter is to present an overview of GRN inference models and techniques with focus on incorporation of prior information such as, global and local topological features and integration of several heterogeneous data sources.

Download Full-text

A Linear Programming Framework for Inferring Gene Regulatory Networks by Integrating Heterogeneous Data

Handbook of Research on Computational Methodologies in Gene Regulatory Networks ◽

10.4018/978-1-60566-685-3.ch019 ◽

2010 ◽

pp. 450-475 ◽

Cited By ~ 1

Author(s):

Yong Wang ◽

Rui-Sheng Wang ◽

Trupti Joshi ◽

Dong Xu ◽

Xiang-Sun Zhang ◽

...

Keyword(s):

Linear Programming ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Heterogeneous Data ◽

Data Sources ◽

Protein Interaction Data ◽

Interaction Data ◽

Protein Protein Interaction ◽

Programming Framework ◽

Gene Regulatory

There exist many heterogeneous data sources that are closely related to gene regulatory networks. These data sources provide rich information for depicting complex biological processes at different levels and from different aspects. Here, we introduce a linear programming framework to infer the gene regulatory networks. Within this framework, we extensively integrate the available information derived from multiple time-course expression datasets, ChIP-chip data, regulatory motif-binding patterns, protein-protein interaction data, protein-small molecule interaction data, and documented regulatory relationships in literature and databases. Results on synthetic and real experimental data both demonstrate that the linear programming framework allows us to recover gene regulations in a more robust and reliable manner.

Download Full-text

Inference of Gene Regulatory Networks by Topological Prior Information and Data Integration

Advances in Medical Technologies and Clinical Practice - Emerging Research in the Analysis and Modeling of Gene Regulatory Networks ◽

10.4018/978-1-5225-0353-8.ch001 ◽

2016 ◽

pp. 1-51

Author(s):

David Correa Martins Jr. ◽

Fabricio Martins Lopes ◽

Shubhra Sankar Ray

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Prior Information ◽

Heterogeneous Data ◽

Data Sources ◽

Expression Data ◽

Heterogeneous Data Sources ◽

Gene Regulatory

The inference of Gene Regulatory Networks (GRNs) is a very challenging problem which has attracted increasing attention since the development of high-throughput sequencing and gene expression measurement technologies. Many models and algorithms have been developed to identify GRNs using mainly gene expression profile as data source. As the gene expression data usually has limited number of samples and inherent noise, the integration of gene expression with several other sources of information can be vital for accurately inferring GRNs. For instance, some prior information about the overall topological structure of the GRN can guide inference techniques toward better results. In addition to gene expression data, recently biological information from heterogeneous data sources have been integrated by GRN inference methods as well. The objective of this chapter is to present an overview of GRN inference models and techniques with focus on incorporation of prior information such as, global and local topological features and integration of several heterogeneous data sources.

Download Full-text

Multistability and Multicellularity: Cell Fates as High-Dimensional Attractors of Gene Regulatory Networks

Computational Systems Biology ◽

10.1016/b978-012088786-6/50033-2 ◽

2006 ◽

pp. 293-326 ◽

Cited By ~ 1

Author(s):

Sui Huang

Keyword(s):

Gene Regulatory Networks ◽

Regulatory Networks ◽

High Dimensional ◽

Cell Fates ◽

Gene Regulatory

Download Full-text

Independence screening for high dimensional nonlinear additive ODE models with applications to dynamic gene regulatory networks

Statistics in Medicine ◽

10.1002/sim.7669 ◽

2018 ◽

Vol 37 (17) ◽

pp. 2630-2644

Author(s):

Hongqi Xue ◽

Shuang Wu ◽

Yichao Wu ◽

Juan C. Ramirez Idarraga ◽

Hulin Wu

Keyword(s):

Gene Regulatory Networks ◽

Regulatory Networks ◽

High Dimensional ◽

Ode Models ◽

Gene Regulatory

Download Full-text

A Bayesian framework that integrates heterogeneous data for inferring gene regulatory networks

Frontiers in Bioengineering and Biotechnology ◽

10.3389/fbioe.2014.00013 ◽

2014 ◽

Vol 2 ◽

Cited By ~ 12

Author(s):

Tapesh Santra

Keyword(s):

Gene Regulatory Networks ◽

Regulatory Networks ◽

Bayesian Framework ◽

Heterogeneous Data ◽

Gene Regulatory

Download Full-text

Interplay between Path and Speed in Decision Making by High-Dimensional Stochastic Gene Regulatory Networks

PLoS ONE ◽

10.1371/journal.pone.0040085 ◽

2012 ◽

Vol 7 (7) ◽

pp. e40085 ◽

Cited By ~ 8

Author(s):

Nuno R. Nené ◽

Alexey Zaikin

Keyword(s):

Decision Making ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

High Dimensional ◽

Gene Regulatory

Download Full-text

A general index for linear and nonlinear correlations for high dimensional genomic data

BMC Genomics ◽

10.1186/s12864-020-07246-x ◽

2020 ◽

Vol 21 (1) ◽

Author(s):

Zhihao Yao ◽

Jing Zhang ◽

Xiufen Zou

Keyword(s):

Gene Regulatory Networks ◽

Regulatory Networks ◽

High Dimensional Data ◽

Kernel Functions ◽

High Dimensional ◽

Vector Correlation ◽

General Index ◽

Gene Regulatory ◽

Linear And Nonlinear ◽

Rv Coefficient

Abstract Background With the advance of high throughput sequencing, high-dimensional data are generated. Detecting dependence/correlation between these datasets is becoming one of most important issues in multi-dimensional data integration and co-expression network construction. RNA-sequencing data is widely used to construct gene regulatory networks. Such networks could be more accurate when methylation data, copy number aberration data and other types of data are introduced. Consequently, a general index for detecting relationships between high-dimensional data is indispensable. Results We proposed a Kernel-Based RV-coefficient, named KBRV, for testing both linear and nonlinear correlation between two matrices by introducing kernel functions into RV2 (the modified RV-coefficient). Permutation test and other validation methods were used on simulated data to test the significance and rationality of KBRV. In order to demonstrate the advantages of KBRV in constructing gene regulatory networks, we applied this index on real datasets (ovarian cancer datasets and exon-level RNA-Seq data in human myeloid differentiation) to illustrate its superiority over vector correlation. Conclusions We concluded that KBRV is an efficient index for detecting both linear and nonlinear relationships in high dimensional data. The correlation method for high dimensional data has possible applications in the construction of gene regulatory network.

Download Full-text

High-dimensional Bayesian network inference from systems genetics data using genetic node ordering

10.1101/501460 ◽

2018 ◽

Author(s):

Lingfei Wang ◽

Pieter Audenaert ◽

Tom Michoel

Keyword(s):

Genetic Variation ◽

Bayesian Network ◽

Gene Regulatory Networks ◽

Gene Networks ◽

Regulatory Networks ◽

Network Inference ◽

High Dimensional ◽

Systems Genetics ◽

Gene Regulatory ◽

Bayesian Network Inference

AbstractStudying the impact of genetic variation on gene regulatory networks is essential to understand the biological mechanisms by which genetic variation causes variation in phenotypes. Bayesian networks provide an elegant statistical approach for multi-trait genetic mapping and modelling causal trait relationships. However, inferring Bayesian gene networks from high-dimensional genetics and genomics data is challenging, because the number of possible networks scales super-exponentially with the number of nodes, and the computational cost of conventional Bayesian network inference methods quickly becomes prohibitive. We propose an alternative method to infer high-quality Bayesian gene networks that easily scales to thousands of genes. Our method first reconstructs a node ordering by conducting pairwise causal inference tests between genes, which then allows to infer a Bayesian network via a series of independent variable selection problems, one for each gene. We demonstrate using simulated and real systems genetics data that this results in a Bayesian network with equal, and sometimes better, likelihood than the conventional methods, while having a significantly higher over-lap with groundtruth networks and being orders of magnitude faster. Moreover our method allows for a unified false discovery rate control across genes and individual edges, and thus a rigorous and easily interpretable way for tuning the sparsity level of the inferred network. Bayesian network inference using pairwise node ordering is a highly efficient approach for reconstructing gene regulatory networks when prior information for the inclusion of edges exists or can be inferred from the available data.

Download Full-text