scholarly journals BGGM: A R Package for Bayesian Gaussian Graphical Models

2019 ◽  
Author(s):  
Donald Ray Williams ◽  
Joris Mulder

Gaussian graphical models (GGM) allow for learning conditional independence structures that are encoded by partial correlations. Whereas there are several \proglang{R} packages for classical (i.e., frequentist) methods, there are only two that implement a Bayesian approach. These are exclusively focused on identifying the graphical structure; that is, detecting non-zero effects. The \proglang{R} package \pkg{BGGM} not only fills this gap, but it also includes novel Bayesian methodology for extending inference beyond identifying non-zero relations. \pkg{BGGM} is built around two Bayesian approaches for inference--estimation and hypothesis testing. The former focuses on the posterior distribution and includes extensions to assess predictability, as well as methodology to compare partial correlations. The latter includes methods for Bayesian hypothesis testing, in both exploratory and confirmatory contexts, with the novel matrix-$F$ prior distribution. This allows for testing order and equality constrained hypotheses, as well as a combination of both with the Bayes factor. Further, there are two approaches for comparing any number of GGMs with either the posterior predictive distribution or Bayesian hypothesis testing. This work describes the software implementation of these methods. We end by discussing future directions for \pkg{BGGM}.

2020 ◽  
Author(s):  
Donald Ray Williams ◽  
Joris Mulder

The R package BGGM provides tools for making Bayesian inference in Gaussian graphicalmodels (GGM). The methods are organized around two general approaches for Bayesian inference: (1) estimation and (2) hypothesis testing. The key distinction is that the formerfocuses on either the posterior or posterior predictive distribution (Gelman, Meng, & Stern,1996; see section 5 in Rubin, 1984), whereas the latter focuses on model comparison withthe Bayes factor (Jeffreys, 1961; Kass & Raftery, 1995).


2019 ◽  
Author(s):  
Donald Ray Williams ◽  
Philippe Rast ◽  
Luis Pericchi ◽  
Joris Mulder

Gaussian graphical models are commonly used to characterize conditional independence structures (i.e., networks) of psychological constructs. Recently attention has shifted from estimating single networks to those from various sub-populations. The focus is primarily to detect differences or demonstrate replicability. We introduce two novel Bayesian methods for comparing networks that explicitly address these aims. The first is based on the posterior predictive distribution, with Kullback-Leibler divergence as the discrepancy measure, that tests differences between two multivariate normal distributions. The second approach makes use of Bayesian model selection, with the Bayes factor, and allows for gaining evidence for invariant network structures. This overcomes limitations of current approaches in the literature that use classical hypothesis testing, where it is only possible to determine whether groups are significantly different from each other. With simulation we show the posterior predictive method is approximately calibrated under the null hypothesis ($\alpha = 0.05$) and has more power to detect differences than alternative approaches. We then examine the necessary sample sizes for detecting invariant network structures with Bayesian hypothesis testing, in addition to how this is influenced by the choice of prior distribution. The methods are applied to post-traumatic stress disorder symptoms that were measured in four groups. We end by summarizing our major contribution, that is proposing two novel methods for comparing GGMs, which extends beyond the social-behavioral sciences. The methods have been implemented in the R package BGGM.


2020 ◽  
Author(s):  
Josue E. Rodriguez ◽  
Donald Ray Williams ◽  
Philippe Rast ◽  
Joris Mulder

Network theory has emerged as a popular framework for conceptualizing psychological constructs and mental disorders. Initially, network analysis was motivated in part by the thought that it can be used for hypothesis generation. Although the customary approach for network modeling is inherently exploratory, we argue that there is untapped potential for confirmatory hypothesis testing. In this work, we bring to fruition the potential of Gaussian graphical models for generating testable hypotheses. This is accomplished by merging exploratory and confirmatory analyses into a cohesive framework built around Bayesian hypothesis testing of partial correlations. We first present a motivating example based on a customary, exploratory analysis, where it is made clear how information encoded by the conditional (in)dependence structure can be used to formulate hypotheses. Building upon this foundation, we then provide several empirical examples that unify exploratory and confirmatory testing in psychopathology symptom networks. In particular, we (1) estimate exploratory graphs; (2) derive hypotheses based on the most central structures; and (3) test those hypotheses in a confirmatory setting. Our confirmatory results uncovered an intricate web of relations, including an order to edge weights within comorbidity networks. This illuminates the rich and informative inferences that can be drawn with the proposed approach. We conclude with recommendations for applied researchers, in addition to discussing how our methodology answers recent calls to begin developing formal models related to the conditional (in)dependence structure of psychological networks.


2019 ◽  
Author(s):  
Donald Ray Williams ◽  
Joris Mulder

Gaussian graphical models (GGM; partial correlation networks) have become increasingly popular in the social and behavioral sciences for studying conditional (in)dependencies between variables. In this work, we introduce exploratory and confirmatory Bayesian tests for partial correlations. For the former, we first extend the customary GGM formulation that focuses on conditional dependence to also consider the null hypothesis of conditional independence for each partial correlation. Here a novel testing strategy is introduced that can provide evidence for a null, negative, or positive effect. We then introduce a test for hypotheses with order constraints on partial correlations. This allows for testing theoretical and clinical expectations in GGMs. The novel matrix$-F$ prior distribution is described that provides increased flexibility in specification compared to the Wishart prior. The methods are applied to PTSD symptoms. In several applications, we demonstrate how the exploratory and confirmatory approaches can work in tandem: hypotheses are formulated from an initial analysis and then tested in an independent dataset. The methodology is implemented in the R package BGGM.


2018 ◽  
Author(s):  
Donald Ray Williams

Gaussian graphical models (GGM; ``networks'') allow for estimating conditional independence structures that are encoded by partial correlations. This is accomplished by identifying non-zero relations in the inverse of the covariance matrix. In psychology the default estimation method uses $\ell_1$-regularization, where the accompanying inferences are restricted to frequentist objectives. Bayesian methods remain relatively uncommon in practice and methodological literatures.To date, they have not yet been used for estimation and inference in the psychological network literature. In this work, I introduce Bayesian methodology that is specifically designed for the most common psychological applications. The graphical structure is determined with posterior probabilities, which allow for assessing conditional dependent and independent relations. Additional methods are provided for extending inference to specific aspects within- and between-networks, including partial correlation differences and Bayesian methodology to quantify network predictability. I first demonstrate that the decision rule based on posterior probabilities can be calibrated to the desired level of specificity. The proposed techniques are then demonstrated in several illustrative examples. The methods have been implemented in the R package BGGM.


2020 ◽  
Author(s):  
Victor Bernal ◽  
Rainer Bischoff ◽  
Peter Horvatovich ◽  
Victor Guryev ◽  
Marco Grzegorczyk

Abstract Background: In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes (‘high dimensional problem’). Shrinkage methods address this issue by learning a regularized GGM. However, it is an open question how the shrinkage affects the final result and its interpretation.Results: We show that the shrinkage biases the partial correlation in a non-linear way. This bias does not only change the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as the ‘un-shrunk’ partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. We apply the ‘un-shrunk’ method to two gene expression datasets from Escherichia coli and Mus musculus.Conclusions: GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the “high-dimensional” problem. Besides it advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effects caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Victor Bernal ◽  
Rainer Bischoff ◽  
Peter Horvatovich ◽  
Victor Guryev ◽  
Marco Grzegorczyk

Abstract Background In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes (‘high dimensional problem’). Shrinkage methods address this issue by learning a regularized GGM. However, it remains open to study how the shrinkage affects the final result and its interpretation. Results We show that the shrinkage biases the partial correlation in a non-linear way. This bias does not only change the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as ‘un-shrinking’ the partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. This is demonstrated on two gene expression datasets from Escherichia coli and Mus musculus. Conclusions GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the ‘high-dimensional problem’. Besides it advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effects caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.


2021 ◽  
Author(s):  
Joran Jongerling ◽  
Sacha Epskamp ◽  
Donald Ray Williams

Gaussian Graphical Models (GGMs) are often estimated using regularized estimation and the graphical LASSO (GLASSO). However, the GLASSO has difficulty estimating(uncertainty in) centrality indices of nodes. Regularized Bayesian estimation might provide a solution, as it is better suited to deal with bias in the sampling distribution ofcentrality indices. This study therefore compares estimation of GGMs with a Bayesian GLASSO- and a Horseshoe prior to estimation using the frequentist GLASSO in an extensive simulation study. Results showed that out of the two Bayesian estimation methods, the Bayesian GLASSO performed best. In addition, the Bayesian GLASSOperformed better than the frequentist GLASSO with respect to bias in edge weights, centrality measures, correlation between estimated and true partial correlations, andspecificity. With respect to sensitivity the frequentist GLASSO performs better.However, sensitivity of the Bayesian GLASSO is close to that of the frequentist GLASSO (except for the smallest N used in the simulations) and tends to be favored over the frequentist GLASSO in terms of F1. With respect to uncertainty in the centrality measures, the Bayesian GLASSO shows good coverage for strength andcloseness centrality. Uncertainty in betweenness centrality is estimated less well, and typically overestimated by the Bayesian GLASSO.


Sign in / Sign up

Export Citation Format

Share Document