scholarly journals From heterogeneous healthcare data to disease-specific biomarker networks: A hierarchical Bayesian network approach

2021 ◽  
Vol 17 (2) ◽  
pp. e1008735
Author(s):  
Ann-Kristin Becker ◽  
Marcus Dörr ◽  
Stephan B. Felix ◽  
Fabian Frost ◽  
Hans J. Grabe ◽  
...  

In this work, we introduce an entirely data-driven and automated approach to reveal disease-associated biomarker and risk factor networks from heterogeneous and high-dimensional healthcare data. Our workflow is based on Bayesian networks, which are a popular tool for analyzing the interplay of biomarkers. Usually, data require extensive manual preprocessing and dimension reduction to allow for effective learning of Bayesian networks. For heterogeneous data, this preprocessing is hard to automatize and typically requires domain-specific prior knowledge. We here combine Bayesian network learning with hierarchical variable clustering in order to detect groups of similar features and learn interactions between them entirely automated. We present an optimization algorithm for the adaptive refinement of such group Bayesian networks to account for a specific target variable, like a disease. The combination of Bayesian networks, clustering, and refinement yields low-dimensional but disease-specific interaction networks. These networks provide easily interpretable, yet accurate models of biomarker interdependencies. We test our method extensively on simulated data, as well as on data from the Study of Health in Pomerania (SHIP-TREND), and demonstrate its effectiveness using non-alcoholic fatty liver disease and hypertension as examples. We show that the group network models outperform available biomarker scores, while at the same time, they provide an easily interpretable interaction network.

2000 ◽  
Vol 13 ◽  
pp. 155-188 ◽  
Author(s):  
J. Cheng ◽  
M. J. Druzdzel

Stochastic sampling algorithms, while an attractive alternative to exact algorithms in very large Bayesian network models, have been observed to perform poorly in evidential reasoning with extremely unlikely evidence. To address this problem, we propose an adaptive importance sampling algorithm, AIS-BN, that shows promising convergence rates even under extreme conditions and seems to outperform the existing sampling algorithms consistently. Three sources of this performance improvement are (1) two heuristics for initialization of the importance function that are based on the theoretical properties of importance sampling in finite-dimensional integrals and the structural advantages of Bayesian networks, (2) a smooth learning method for the importance function, and (3) a dynamic weighting function for combining samples from different stages of the algorithm. We tested the performance of the AIS-BN algorithm along with two state of the art general purpose sampling algorithms, likelihood weighting (Fung & Chang, 1989; Shachter & Peot, 1989) and self-importance sampling (Shachter & Peot, 1989). We used in our tests three large real Bayesian network models available to the scientific community: the CPCS network (Pradhan et al., 1994), the PathFinder network (Heckerman, Horvitz, & Nathwani, 1990), and the ANDES network (Conati, Gertner, VanLehn, & Druzdzel, 1997), with evidence as unlikely as 10^-41. While the AIS-BN algorithm always performed better than the other two algorithms, in the majority of the test cases it achieved orders of magnitude improvement in precision of the results. Improvement in speed given a desired precision is even more dramatic, although we are unable to report numerical results here, as the other algorithms almost never achieved the precision reached even by the first few iterations of the AIS-BN algorithm.


Author(s):  
Yoichi Motomura ◽  

Bayesian networks are probabilistic models that can be used for prediction and decision-making in the presence of uncertainty. For intelligent information processing, probabilistic reasoning based on Bayesian networks can be used to cope with uncertainty in real-world domains. In order to apply this, we need appropriate models and statistical learning methods to obtain models. We start by reviewing Bayesian network models, probabilistic reasoning, statistical learning, and related researches. Then, we introduce applications for intelligent information processing using Bayesian networks.


2020 ◽  
Vol 10 (3) ◽  
pp. 1056
Author(s):  
Rongjia Song ◽  
Lei Huang ◽  
Weiping Cui ◽  
María Óskarsdóttir ◽  
Jan Vanthienen

The fraud detection of cargo theft has been a serious issue in ports for a long time. Traditional research in detecting theft risk is expert- and survey-based, which is not optimal for proactive prediction. As we move into a pervasive and ubiquitous paradigm, the implications of external environment and system behavior are continuously captured as multi-source data. Therefore, we propose a novel data-driven approach for formulating predictive models for detecting bulk cargo theft in ports. More specifically, we apply various feature-ranking methods and classification algorithms for selecting an effective feature set of relevant risk elements. Then, implicit Bayesian networks are derived with the features to graphically present the relationship with the risk elements of fraud. Thus, various binary classifiers are compared to derive a suitable predictive model, and Bayesian network performs best overall. The resulting Bayesian networks are then comparatively analyzed based on the outcomes of model validation and testing, as well as essential domain knowledge. The experimental results show that predictive models are effective, with both accuracy and recall values greater than 0.8. These predictive models are not only useful for understanding the dependency between relevant risk elements, but also for supporting the strategy optimization of risk management.


Energies ◽  
2021 ◽  
Vol 14 (15) ◽  
pp. 4439
Author(s):  
Albara M. Mustafa ◽  
Abbas Barabadi

Infrastructure systems, such as wind farms, are prone to various human-induced and natural disruptions such as extreme weather conditions. There is growing concern among decision makers about the ability of wind farms to withstand and regain their performance when facing disruptions, in terms of resilience-enhanced strategies. This paper proposes a probabilistic model to calculate the resilience of wind farms facing disruptive weather conditions. In this study, the resilience of wind farms is considered to be a function of their reliability, maintainability, supportability, and organizational resilience. The relationships between these resilience variables can be structured using Bayesian network models. The use of Bayesian networks allows for analyzing different resilience scenarios. Moreover, Bayesian networks can be used to quantify resilience, which is demonstrated in this paper with a case study of a wind farm in Arctic Norway. The results of the case study show that the wind farm is highly resilient under normal operating conditions, and slightly degraded under Arctic operating conditions. Moreover, the case study introduced the calculation of wind farm resilience under Arctic black swan conditions. A black swan scenario is an unknowable unknown scenario that can affect a system with low probability and very high extreme consequences. The results of the analysis show that the resilience of the wind farm is significantly degraded when operating under Arctic black swan conditions. In addition, a backward propagation of the Bayesian network illustrates the percentage of improvement required in each resilience factor in order to attain a certain level of resilience of the wind farm under Arctic black swan conditions.


2013 ◽  
Vol 838-841 ◽  
pp. 1463-1468
Author(s):  
Xiang Ke Liu ◽  
Zhi Shen Wang ◽  
Hai Liang Wang ◽  
Jun Tao Wang

The paper introduced the Bayesian networks briefly and discussed the algorithm of transforming fault tree into Bayesian networks at first, then regarded the structures impaired caused by tunnel blasting construction as a example, introduced the built and calculated method of the Bayesian networks by matlab. Then assumed the probabilities of essential events, calculated the probability of top event and the posterior probability of each essential events by the Bayesian networks. After that the paper contrast the characteristics of fault tree analysis and the Bayesian networks, Identified that the Bayesian networks is better than fault tree analysis in safety evaluation in some case, and provided a valid way to assess risk in metro construction.


2016 ◽  
Vol 2016 ◽  
pp. 1-9 ◽  
Author(s):  
Hao Zhang ◽  
Liyu Zhu ◽  
Shensi Xu

Under the increasingly uncertain economic environment, the research on the reliability of urban distribution system has great practical significance for the integration of logistics and supply chain resources. This paper summarizes the factors that affect the city logistics distribution system. Starting from the research of factors that influence the reliability of city distribution system, further construction of city distribution system reliability influence model is built based on Bayesian networks. The complex problem is simplified by using the sub-Bayesian network, and an example is analyzed. In the calculation process, we combined the traditional Bayesian algorithm and the Expectation Maximization (EM) algorithm, which made the Bayesian model able to lay a more accurate foundation. The results show that the Bayesian network can accurately reflect the dynamic relationship among the factors affecting the reliability of urban distribution system. Moreover, by changing the prior probability of the node of the cause, the correlation degree between the variables that affect the successful distribution can be calculated. The results have significant practical significance on improving the quality of distribution, the level of distribution, and the efficiency of enterprises.


Sign in / Sign up

Export Citation Format

Share Document