Constrained Covariance Matrices With a Biologically Realistic Structure: Comparison of Methods for Generating High-Dimensional Gaussian Graphical Models

Abstract Background: In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes (‘high dimensional problem’). Shrinkage methods address this issue by learning a regularized GGM. However, it is an open question how the shrinkage affects the final result and its interpretation.Results: We show that the shrinkage biases the partial correlation in a non-linear way. This bias does not only change the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as the ‘un-shrunk’ partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. We apply the ‘un-shrunk’ method to two gene expression datasets from Escherichia coli and Mus musculus.Conclusions: GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the “high-dimensional” problem. Besides it advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effects caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.

Download Full-text

The ‘un-shrunk’ partial correlation in Gaussian graphical models

BMC Bioinformatics ◽

10.1186/s12859-021-04313-2 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Victor Bernal ◽

Rainer Bischoff ◽

Peter Horvatovich ◽

Victor Guryev ◽

Marco Grzegorczyk

Keyword(s):

Graphical Models ◽

Regulatory Networks ◽

Partial Correlation ◽

High Dimensional ◽

Dimensional Problem ◽

Gaussian Graphical Models ◽

High Dimensional Problem ◽

Non Linear ◽

Partial Correlations ◽

Molecular Profiles

Abstract Background In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes (‘high dimensional problem’). Shrinkage methods address this issue by learning a regularized GGM. However, it remains open to study how the shrinkage affects the final result and its interpretation. Results We show that the shrinkage biases the partial correlation in a non-linear way. This bias does not only change the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as ‘un-shrinking’ the partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. This is demonstrated on two gene expression datasets from Escherichia coli and Mus musculus. Conclusions GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the ‘high-dimensional problem’. Besides it advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effects caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.

Download Full-text

A two-step method for estimating high-dimensional Gaussian graphical models

Science China Mathematics ◽

10.1007/s11425-017-9438-5 ◽

2020 ◽

Vol 63 (6) ◽

pp. 1203-1218

Author(s):

Yuehan Yang ◽

Ji Zhu

Keyword(s):

Graphical Models ◽

High Dimensional ◽

Gaussian Graphical Models ◽

Step Method

Download Full-text

Inferring Two-Level Hierarchical Gaussian Graphical Models to Discover Shared and Context-Specific Conditional Dependencies from High-Dimensional Heterogeneous Data

SN Computer Science ◽

10.1007/s42979-020-00224-w ◽

2020 ◽

Vol 1 (4) ◽

Author(s):

Mohammad S. Rahman ◽

Ann E. Nicholson ◽

Gholamreza Haffari

Keyword(s):

Graphical Models ◽

Heterogeneous Data ◽

High Dimensional ◽

Gaussian Graphical Models ◽

Context Specific

Download Full-text

Iterative Reconstruction of High-Dimensional Gaussian Graphical Models Based on a New Method to Estimate Partial Correlations under Constraints

PLoS ONE ◽

10.1371/journal.pone.0060536 ◽

2013 ◽

Vol 8 (4) ◽

pp. e60536

Author(s):

Vincent Guillemot ◽

Andreas Bender ◽

Anne-Laure Boulesteix

Keyword(s):

Graphical Models ◽

Iterative Reconstruction ◽

New Method ◽

High Dimensional ◽

Gaussian Graphical Models ◽

Partial Correlations

Download Full-text

High-dimensional joint estimation of multiple directed Gaussian graphical models

Electronic Journal of Statistics ◽

10.1214/20-ejs1724 ◽

2020 ◽

Vol 14 (1) ◽

pp. 2439-2483

Author(s):

Yuhao Wang ◽

Santiago Segarra ◽

Caroline Uhler

Keyword(s):

Graphical Models ◽

Joint Estimation ◽

High Dimensional ◽

Gaussian Graphical Models

Download Full-text

High-Dimensional Sparse Graph Estimation by Integrating DTW-D Into Bayesian Gaussian Graphical Models

IEEE Access ◽

10.1109/access.2018.2849213 ◽

2018 ◽

Vol 6 ◽

pp. 34279-34287

Author(s):

Ying Li ◽

Xiaojun Xu ◽

Jianbo Li

Keyword(s):

Graphical Models ◽

High Dimensional ◽

Sparse Graph ◽

Gaussian Graphical Models ◽

Graph Estimation

Download Full-text

Block-Diagonal Covariance Selection for High-Dimensional Gaussian Graphical Models

Journal of the American Statistical Association ◽

10.1080/01621459.2016.1247002 ◽

2017 ◽

Vol 113 (521) ◽

pp. 306-314 ◽

Cited By ~ 5

Author(s):

Emilie Devijver ◽

Mélina Gallopin

Keyword(s):

Graphical Models ◽

High Dimensional ◽

Gaussian Graphical Models ◽

Covariance Selection ◽

Selection For ◽

Block Diagonal

Download Full-text

Uniform inference in high-dimensional Gaussian graphical models

10.1920/wp.cem.2019.2919 ◽

2019 ◽

Author(s):

Victor Chernozhukov ◽

Martin Spindler ◽

Jannis Kück ◽

Sven Klaassen

Keyword(s):

Graphical Models ◽

High Dimensional ◽

Gaussian Graphical Models

Download Full-text

A Stepwise Approach for High-Dimensional Gaussian Graphical Models

10.52933/jdssv.v1i2.11 ◽

2021 ◽

Vol 1 (2) ◽

Author(s):

Ruben Zamar ◽

Marcelo Ruiz ◽

Ginette Lafit ◽

Javier Nogales

Keyword(s):

Graphical Models ◽

Partial Correlation ◽

Pearson Correlation ◽

Real Life ◽

Correlation Coefficients ◽

High Dimensional ◽

Prediction Errors ◽

Gaussian Graphical Models ◽

Stepwise Approach ◽

Linear Predictors

We present a stepwise approach to estimate high dimensional Gaussian graphical models. We exploit the relation between the partial correlation coefficients and the distribution of the prediction errors, and parametrize the model in terms of the Pearson correlation coefficients between the prediction errors of the nodes’ best linear predictors. We propose a novel stepwise algorithm for detecting pairs of conditionally dependent variables. We compare the proposed algorithm with existing methods including graphical lasso (Glasso), constrained `l1-minimization(CLIME) and equivalent partial correlation (EPC), via simulation studies and real life applications. In our simulation study we consider several model settings and report the results using different performance measures that look at desirable features of the recovered graph.

Download Full-text