Mixture of Conditional Gaussian Graphical Models for Unlabelled Heterogeneous Populations in the Presence of Co-factors

Abstract Given the complex exposures from both exogenous and endogenous sources that an individual experiences during life, exposome-wide association studies that interrogate levels of small molecules in biospecimens have been proposed for discovering causes of chronic diseases. We conducted a study to explore associations between environmental chemicals and endogenous molecules using Gaussian graphical models (GGMs) of non-targeted metabolomics data measured in a cohort of California women firefighters and office workers. GGMs revealed many exposure-metabolite associations, including that exposures to mono-hydroxyisononyl phthalate, ethyl paraben and 4-ethylbenzoic acid were associated with metabolites involved in steroid hormone biosynthesis, and perfluoroalkyl substances were linked to bile acids (hormones that regulate cholesterol and glucose metabolism) and inflammatory signaling molecules. Some hypotheses generated from these findings were confirmed by analysis of data from the National Health and Nutrition Examination Survey. Taken together, our findings demonstrate a novel approach to discovering associations between chemical exposures and biological processes of potential relevance for disease causation.

Objective Bayesian model selection in Gaussian graphical models

Biometrika ◽

10.1093/biomet/asp017 ◽

2009 ◽

Vol 96 (3) ◽

pp. 497-512 ◽

Cited By ~ 43

Author(s):

C. M. Carvalho ◽

J. G. Scott

Keyword(s):

Model Selection ◽

Graphical Models ◽

Bayesian Model ◽

Bayesian Model Selection ◽

Gaussian Graphical Models

A Metropolis-Hastings based method for sampling from the G-Wishart distribution in Gaussian graphical models

Electronic Journal of Statistics ◽

10.1214/11-ejs594 ◽

2011 ◽

Vol 5 (0) ◽

pp. 18-30 ◽

Cited By ~ 13

Author(s):

Nicholas Mitsakakis ◽

Hélène Massam ◽

Michael D. Escobar

Keyword(s):

Graphical Models ◽

Wishart Distribution ◽

Gaussian Graphical Models

Gaussian Graphical Models

Handbook of Graphical Models ◽

10.1201/9780429463976-9 ◽

2018 ◽

pp. 217-238

Keyword(s):

Graphical Models ◽

Gaussian Graphical Models

The ‘Un-Shrunk’ Partial Correlation in Gaussian Graphical Models

10.21203/rs.3.rs-76682/v1 ◽

2020 ◽

Author(s):

Victor Bernal ◽

Rainer Bischoff ◽

Peter Horvatovich ◽

Victor Guryev ◽

Marco Grzegorczyk

Keyword(s):

Graphical Models ◽

Regulatory Networks ◽

Partial Correlation ◽

High Dimensional ◽

Dimensional Problem ◽

Gaussian Graphical Models ◽

High Dimensional Problem ◽

Non Linear ◽

Partial Correlations ◽

Molecular Profiles

Abstract Background: In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes (‘high dimensional problem’). Shrinkage methods address this issue by learning a regularized GGM. However, it is an open question how the shrinkage affects the final result and its interpretation. Results: We show that the shrinkage biases the partial correlation in a non-linear way. This bias not only changes the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as the ‘un-shrunk’ partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. We apply the ‘un-shrunk’ method to two gene expression datasets from Escherichia coli and Mus musculus. Conclusions: GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the ‘high-dimensional’ problem. Besides its advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effect caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.

The ‘un-shrunk’ partial correlation in Gaussian graphical models

BMC Bioinformatics ◽

10.1186/s12859-021-04313-2 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Victor Bernal ◽

Rainer Bischoff ◽

Peter Horvatovich ◽

Victor Guryev ◽

Marco Grzegorczyk

Keyword(s):

Graphical Models ◽

Regulatory Networks ◽

Partial Correlation ◽

High Dimensional ◽

Dimensional Problem ◽

Gaussian Graphical Models ◽

High Dimensional Problem ◽

Non Linear ◽

Partial Correlations ◽

Molecular Profiles

Abstract Background: In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes (‘high dimensional problem’). Shrinkage methods address this issue by learning a regularized GGM. However, it remains an open question how the shrinkage affects the final result and its interpretation. Results: We show that the shrinkage biases the partial correlation in a non-linear way. This bias not only changes the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as ‘un-shrinking’ the partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. This is demonstrated on two gene expression datasets from Escherichia coli and Mus musculus. Conclusions: GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the ‘high-dimensional problem’. Besides its advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effect caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.
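
As a rough illustration of the effect described in this abstract, the sketch below computes partial correlations from the inverse of a correlation matrix and shows how they move as a linear shrinkage intensity grows. It is not the authors' ‘un-shrunk’ correction. The shrinkage target (the identity matrix), the intensity values, the function names and the 3-variable matrix `R` are all assumptions chosen for illustration.

```python
import numpy as np


def partial_correlations(corr):
    """Partial correlations from the inverse of a correlation matrix.

    Uses the standard identity rho_ij = -omega_ij / sqrt(omega_ii * omega_jj),
    where omega is the precision (inverse correlation) matrix.
    """
    precision = np.linalg.inv(corr)
    d = np.sqrt(np.diag(precision))
    pcor = -precision / np.outer(d, d)
    np.fill_diagonal(pcor, 1.0)
    return pcor


def shrunk_correlation(corr, lam):
    """Linear shrinkage towards the identity (an assumed target)."""
    p = corr.shape[0]
    return (1.0 - lam) * corr + lam * np.eye(p)


# Hypothetical correlation matrix for three variables (illustrative values only).
R = np.array([[1.0, 0.6, 0.3],
              [0.6, 1.0, 0.5],
              [0.3, 0.5, 1.0]])

# Print the three off-diagonal partial correlations for several intensities.
for lam in (0.0, 0.2, 0.5):
    pc = partial_correlations(shrunk_correlation(R, lam))
    print(lam, np.round(pc[np.triu_indices(3, k=1)], 3))
```

Because the matrix inverse is applied after shrinkage, the partial correlations are not simply rescaled by a common factor, which is one way to see why the resulting bias is non-linear in the shrinkage intensity.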
