From heterogeneous healthcare data to disease-specific biomarker networks: A hierarchical Bayesian network approach

In this work, we introduce an entirely data-driven and automated approach to reveal disease-associated biomarker and risk factor networks from heterogeneous and high-dimensional healthcare data. Our workflow is based on Bayesian networks, which are a popular tool for analyzing the interplay of biomarkers. Usually, data require extensive manual preprocessing and dimension reduction to allow for effective learning of Bayesian networks. For heterogeneous data, this preprocessing is hard to automatize and typically requires domain-specific prior knowledge. We here combine Bayesian network learning with hierarchical variable clustering in order to detect groups of similar features and learn interactions between them entirely automated. We present an optimization algorithm for the adaptive refinement of such group Bayesian networks to account for a specific target variable, like a disease. The combination of Bayesian networks, clustering, and refinement yields low-dimensional but disease-specific interaction networks. These networks provide easily interpretable, yet accurate models of biomarker interdependencies. We test our method extensively on simulated data, as well as on data from the Study of Health in Pomerania (SHIP-TREND), and demonstrate its effectiveness using non-alcoholic fatty liver disease and hypertension as examples. We show that the group network models outperform available biomarker scores, while at the same time, they provide an easily interpretable interaction network.

Download Full-text

AIS-BN: An Adaptive Importance Sampling Algorithm for Evidential Reasoning in Large Bayesian Networks

Journal of Artificial Intelligence Research ◽

10.1613/jair.764 ◽

2000 ◽

Vol 13 ◽

pp. 155-188 ◽

Cited By ~ 93

Author(s):

J. Cheng ◽

M. J. Druzdzel

Keyword(s):

Bayesian Networks ◽

Bayesian Network ◽

Importance Sampling ◽

Network Models ◽

Evidential Reasoning ◽

The Other ◽

Sampling Algorithm ◽

Importance Function ◽

Sampling Algorithms ◽

Bayesian Network Models

Stochastic sampling algorithms, while an attractive alternative to exact algorithms in very large Bayesian network models, have been observed to perform poorly in evidential reasoning with extremely unlikely evidence. To address this problem, we propose an adaptive importance sampling algorithm, AIS-BN, that shows promising convergence rates even under extreme conditions and seems to outperform the existing sampling algorithms consistently. Three sources of this performance improvement are (1) two heuristics for initialization of the importance function that are based on the theoretical properties of importance sampling in finite-dimensional integrals and the structural advantages of Bayesian networks, (2) a smooth learning method for the importance function, and (3) a dynamic weighting function for combining samples from different stages of the algorithm. We tested the performance of the AIS-BN algorithm along with two state of the art general purpose sampling algorithms, likelihood weighting (Fung & Chang, 1989; Shachter & Peot, 1989) and self-importance sampling (Shachter & Peot, 1989). We used in our tests three large real Bayesian network models available to the scientific community: the CPCS network (Pradhan et al., 1994), the PathFinder network (Heckerman, Horvitz, & Nathwani, 1990), and the ANDES network (Conati, Gertner, VanLehn, & Druzdzel, 1997), with evidence as unlikely as 10^-41. While the AIS-BN algorithm always performed better than the other two algorithms, in the majority of the test cases it achieved orders of magnitude improvement in precision of the results. Improvement in speed given a desired precision is even more dramatic, although we are unable to report numerical results here, as the other algorithms almost never achieved the precision reached even by the first few iterations of the AIS-BN algorithm.

Download Full-text

Bayesian Network: Probabilistic Reasoning, Statistical Learning, and Applications

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2004.p0093 ◽

2004 ◽

Vol 8 (2) ◽

pp. 93-99 ◽

Cited By ~ 1

Author(s):

Yoichi Motomura ◽

Keyword(s):

Information Processing ◽

Bayesian Networks ◽

Bayesian Network ◽

Statistical Learning ◽

Probabilistic Models ◽

Probabilistic Reasoning ◽

Network Models ◽

Intelligent Information Processing ◽

Intelligent Information ◽

Bayesian Network Models

Bayesian networks are probabilistic models that can be used for prediction and decision-making in the presence of uncertainty. For intelligent information processing, probabilistic reasoning based on Bayesian networks can be used to cope with uncertainty in real-world domains. In order to apply this, we need appropriate models and statistical learning methods to obtain models. We start by reviewing Bayesian network models, probabilistic reasoning, statistical learning, and related researches. Then, we introduce applications for intelligent information processing using Bayesian networks.

Download Full-text

Fraud Detection of Bulk Cargo Theft in Port Using Bayesian Network Models

Applied Sciences ◽

10.3390/app10031056 ◽

2020 ◽

Vol 10 (3) ◽

pp. 1056

Author(s):

Rongjia Song ◽

Lei Huang ◽

Weiping Cui ◽

María Óskarsdóttir ◽

Jan Vanthienen

Keyword(s):

Bayesian Networks ◽

Bayesian Network ◽

Predictive Models ◽

Domain Knowledge ◽

Fraud Detection ◽

Network Models ◽

Risk Elements ◽

Data Driven Approach ◽

Binary Classifiers ◽

Cargo Theft

The fraud detection of cargo theft has been a serious issue in ports for a long time. Traditional research in detecting theft risk is expert- and survey-based, which is not optimal for proactive prediction. As we move into a pervasive and ubiquitous paradigm, the implications of external environment and system behavior are continuously captured as multi-source data. Therefore, we propose a novel data-driven approach for formulating predictive models for detecting bulk cargo theft in ports. More specifically, we apply various feature-ranking methods and classification algorithms for selecting an effective feature set of relevant risk elements. Then, implicit Bayesian networks are derived with the features to graphically present the relationship with the risk elements of fraud. Thus, various binary classifiers are compared to derive a suitable predictive model, and Bayesian network performs best overall. The resulting Bayesian networks are then comparatively analyzed based on the outcomes of model validation and testing, as well as essential domain knowledge. The experimental results show that predictive models are effective, with both accuracy and recall values greater than 0.8. These predictive models are not only useful for understanding the dependency between relevant risk elements, but also for supporting the strategy optimization of risk management.

Download Full-text

Inference of disease-specific gene interaction network using a Bayesian network learned by genetic algorithm

Proceedings of the 30th Annual ACM Symposium on Applied Computing - SAC '15 ◽

10.1145/2695664.2695944 ◽

2015 ◽

Author(s):

Daye Jeong ◽

Yunku Yeu ◽

Jaegyoon Ahn ◽

Youngmi Yoon ◽

Sanghyun Park

Keyword(s):

Genetic Algorithm ◽

Bayesian Network ◽

Interaction Network ◽

Gene Interaction ◽

Specific Gene ◽

Gene Interaction Network ◽

Disease Specific

Download Full-text

Resilience Assessment of Wind Farms in the Arctic with the Application of Bayesian Networks

Energies ◽

10.3390/en14154439 ◽

2021 ◽

Vol 14 (15) ◽

pp. 4439

Author(s):

Albara M. Mustafa ◽

Abbas Barabadi

Keyword(s):

Bayesian Networks ◽

Bayesian Network ◽

Wind Farm ◽

Network Models ◽

Wind Farms ◽

Weather Conditions ◽

Operating Conditions ◽

The Arctic ◽

Black Swan

Infrastructure systems, such as wind farms, are prone to various human-induced and natural disruptions such as extreme weather conditions. There is growing concern among decision makers about the ability of wind farms to withstand and regain their performance when facing disruptions, in terms of resilience-enhanced strategies. This paper proposes a probabilistic model to calculate the resilience of wind farms facing disruptive weather conditions. In this study, the resilience of wind farms is considered to be a function of their reliability, maintainability, supportability, and organizational resilience. The relationships between these resilience variables can be structured using Bayesian network models. The use of Bayesian networks allows for analyzing different resilience scenarios. Moreover, Bayesian networks can be used to quantify resilience, which is demonstrated in this paper with a case study of a wind farm in Arctic Norway. The results of the case study show that the wind farm is highly resilient under normal operating conditions, and slightly degraded under Arctic operating conditions. Moreover, the case study introduced the calculation of wind farm resilience under Arctic black swan conditions. A black swan scenario is an unknowable unknown scenario that can affect a system with low probability and very high extreme consequences. The results of the analysis show that the resilience of the wind farm is significantly degraded when operating under Arctic black swan conditions. In addition, a backward propagation of the Bayesian network illustrates the percentage of improvement required in each resilience factor in order to attain a certain level of resilience of the wind farm under Arctic black swan conditions.

Download Full-text

DNA methylation signatures in non-alcoholic fatty liver disease reveal disease-specific signatures and molecular remodelling drivers after weight loss in non-alcoholic fatty liver disease

Zeitschrift für Gastroenterologie ◽

10.1055/s-0032-1331961 ◽

2013 ◽

Vol 51 (01) ◽

Author(s):

M Ahrens ◽

O Ammerpohl ◽

W von Schönfels ◽

T Itzel ◽

A Teufel ◽

...

Keyword(s):

Dna Methylation ◽

Weight Loss ◽

Liver Disease ◽

Fatty Liver ◽

Fatty Liver Disease ◽

Alcoholic Fatty Liver ◽

Alcoholic Fatty Liver Disease ◽

Disease Specific

Download Full-text

Construction and analysis of protein-protein interaction network of non-alcoholic fatty liver disease

Computers in Biology and Medicine ◽

10.1016/j.compbiomed.2021.104243 ◽

2021 ◽

Vol 131 ◽

pp. 104243

Author(s):

Athina I. Amanatidou ◽

George V. Dedoussis

Keyword(s):

Liver Disease ◽

Fatty Liver ◽

Protein Interaction ◽

Protein Interaction Network ◽

Fatty Liver Disease ◽

Interaction Network ◽

Alcoholic Fatty Liver ◽

Protein Protein Interaction ◽

Alcoholic Fatty Liver Disease ◽

Protein Protein Interaction Network

Download Full-text

Black-Box Test-Cost Reduction Based on Bayesian Network Models

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ◽

10.1109/tcad.2020.2994257 ◽

2020 ◽

pp. 1-1

Author(s):

Renjian Pan ◽

Zhaobo Zhang ◽

Xin Li ◽

Krishnendu Chakrabarty ◽

Xinli Gu

Keyword(s):

Bayesian Network ◽

Cost Reduction ◽

Network Models ◽

Black Box ◽

Test Cost ◽

Test Cost Reduction ◽

Bayesian Network Models

Download Full-text

Application of Bayesian Network in Safety Evaluation of Metro Construction

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.838-841.1463 ◽

2013 ◽

Vol 838-841 ◽

pp. 1463-1468

Author(s):

Xiang Ke Liu ◽

Zhi Shen Wang ◽

Hai Liang Wang ◽

Jun Tao Wang

Keyword(s):

Bayesian Networks ◽

Bayesian Network ◽

Posterior Probability ◽

Fault Tree ◽

Fault Tree Analysis ◽

Safety Evaluation ◽

Tree Analysis ◽

Tunnel Blasting ◽

Better Than

The paper introduced the Bayesian networks briefly and discussed the algorithm of transforming fault tree into Bayesian networks at first, then regarded the structures impaired caused by tunnel blasting construction as a example, introduced the built and calculated method of the Bayesian networks by matlab. Then assumed the probabilities of essential events, calculated the probability of top event and the posterior probability of each essential events by the Bayesian networks. After that the paper contrast the characteristics of fault tree analysis and the Bayesian networks, Identified that the Bayesian networks is better than fault tree analysis in safety evaluation in some case, and provided a valid way to assess risk in metro construction.

Download Full-text

Modeling the City Distribution System Reliability with Bayesian Networks to Identify Influence Factors

Scientific Programming ◽

10.1155/2016/7109235 ◽

2016 ◽

Vol 2016 ◽

pp. 1-9 ◽

Cited By ~ 2

Author(s):

Hao Zhang ◽

Liyu Zhu ◽

Shensi Xu

Keyword(s):

Bayesian Networks ◽

Bayesian Network ◽

System Reliability ◽

Distribution System ◽

Complex Problem ◽

Practical Significance ◽

Correlation Degree ◽

Urban Distribution ◽

Distribution System Reliability ◽

The City

Under the increasingly uncertain economic environment, the research on the reliability of urban distribution system has great practical significance for the integration of logistics and supply chain resources. This paper summarizes the factors that affect the city logistics distribution system. Starting from the research of factors that influence the reliability of city distribution system, further construction of city distribution system reliability influence model is built based on Bayesian networks. The complex problem is simplified by using the sub-Bayesian network, and an example is analyzed. In the calculation process, we combined the traditional Bayesian algorithm and the Expectation Maximization (EM) algorithm, which made the Bayesian model able to lay a more accurate foundation. The results show that the Bayesian network can accurately reflect the dynamic relationship among the factors affecting the reliability of urban distribution system. Moreover, by changing the prior probability of the node of the cause, the correlation degree between the variables that affect the successful distribution can be calculated. The results have significant practical significance on improving the quality of distribution, the level of distribution, and the efficiency of enterprises.

Download Full-text