Computation of Kullback–Leibler Divergence in Bayesian Networks

Entropy ◽  
2021 ◽  
Vol 23 (9) ◽  
pp. 1122
Author(s):  
Serafín Moral ◽  
Andrés Cano ◽  
Manuel Gómez-Olmedo

Kullback–Leibler divergence KL(p,q) is the standard measure of error when a true probability distribution p is approximated by a probability distribution q. Its efficient computation is essential in many tasks, such as approximate inference or measuring error when learning a distribution. For high-dimensional distributions, such as those associated with Bayesian networks, direct computation can be unfeasible. This paper considers the problem of efficiently computing the Kullback–Leibler divergence between two probability distributions, each encoded by a different Bayesian network, possibly with different structures. The approach is based on an auxiliary deletion algorithm that computes the necessary marginal distributions, using a cache of operations with potentials in order to reuse past computations whenever possible. The algorithms are tested with Bayesian networks from the bnlearn repository. Computer code in Python is provided, built on pgmpy, a library for working with probabilistic graphical models.
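The paper's cache-based deletion algorithm is what makes this tractable for large networks; as a point of reference only, the following is a minimal brute-force sketch of KL(p,q) between two small pgmpy networks. It enumerates the full joint, so it merely illustrates the quantity being computed, not the authors' method; the toy networks and all numbers are invented, and a recent pgmpy version is assumed.

```python
# Brute-force KL(p, q) between two tiny Bayesian networks (assumes pgmpy;
# in pgmpy 1.x the class is named DiscreteBayesianNetwork instead).
from functools import reduce
from itertools import product as cartesian
from math import log

from pgmpy.factors.discrete import TabularCPD
from pgmpy.models import BayesianNetwork


def joint_factor(model):
    """Multiply all CPDs of a network into a single joint factor."""
    return reduce(lambda a, b: a * b, (cpd.to_factor() for cpd in model.get_cpds()))


def kl_divergence(p_model, q_model):
    """KL(p, q) = sum_x p(x) log(p(x) / q(x)) over all joint states x."""
    p_joint, q_joint = joint_factor(p_model), joint_factor(q_model)
    variables = sorted(p_joint.state_names)          # both nets share variables
    states = [p_joint.state_names[v] for v in variables]
    kl = 0.0
    for assignment in cartesian(*states):
        kwargs = dict(zip(variables, assignment))
        p = p_joint.get_value(**kwargs)
        q = q_joint.get_value(**kwargs)
        if p > 0:                                    # 0 log 0 taken as 0
            kl += p * log(p / q)
    return kl


# Two toy networks over the same variables but with different structures.
p_net = BayesianNetwork([("A", "B")])
p_net.add_cpds(
    TabularCPD("A", 2, [[0.6], [0.4]]),
    TabularCPD("B", 2, [[0.7, 0.2], [0.3, 0.8]], evidence=["A"], evidence_card=[2]),
)
q_net = BayesianNetwork([("B", "A")])
q_net.add_cpds(
    TabularCPD("B", 2, [[0.5], [0.5]]),
    TabularCPD("A", 2, [[0.5, 0.5], [0.5, 0.5]], evidence=["B"], evidence_card=[2]),
)
assert p_net.check_model() and q_net.check_model()
print(kl_divergence(p_net, q_net))
```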

Author(s):  
M. Vidyasagar

This chapter provides an introduction to some elementary aspects of information theory, including entropy in its various forms. Entropy refers to the level of uncertainty associated with a random variable (or more precisely, the probability distribution of the random variable). When there are two or more random variables, it is worthwhile to study the conditional entropy of one random variable with respect to another. The last concept is relative entropy, also known as the Kullback–Leibler divergence, which measures the “disparity” between two probability distributions. The chapter first considers convex and concave functions before discussing the properties of the entropy function, conditional entropy, uniqueness of the entropy function, and the Kullback–Leibler divergence.
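To make these definitions concrete, here is a small illustrative computation (not from the chapter) of entropy, conditional entropy, and relative entropy for binary distributions, using the standard formulas H(X) = -Σ p log p, H(Y|X) = Σ_x p(x) H(Y|X=x), and KL(p,q) = Σ p log(p/q):

```python
from math import log2


def entropy(dist):
    """H(X) = -sum_x p(x) log2 p(x), with 0 log 0 taken as 0."""
    return -sum(p * log2(p) for p in dist if p > 0)


def kl(p, q):
    """KL(p, q) = sum_x p(x) log2(p(x) / q(x))."""
    return sum(pi * log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# A joint distribution over (X, Y) given as p(x) and p(y | x).
p_x = [0.5, 0.5]
p_y_given_x = [[0.9, 0.1], [0.3, 0.7]]

h_y_given_x = sum(px * entropy(row) for px, row in zip(p_x, p_y_given_x))
print(entropy(p_x))                # H(X) = 1 bit
print(h_y_given_x)                 # 0.5*H(0.9,0.1) + 0.5*H(0.3,0.7)
print(kl([0.9, 0.1], [0.5, 0.5]))  # "disparity" between two Bernoullis
```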


2011 ◽  
Vol 2011 ◽  
pp. 1-13 ◽  
Author(s):  
Linda Smail

Bayesian networks are graphical probabilistic models through which we can acquire, capitalize on, and exploit knowledge. Over the last decade they have become an important tool for research and applications in artificial intelligence and many other fields. This paper presents Bayesian networks and discusses the inference problem in such models. It states the problem and proposes a method to compute probability distributions, and it uses D-separation to simplify the computation of probabilities in Bayesian networks. Given a Bayesian network over a family of random variables, the paper presents a result on the computation of the probability distribution of a subset of these variables, using a computation algorithm and D-separation properties separately. It also shows the uniqueness of the obtained result.
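As an illustrative sketch (assuming the pgmpy library; the toy network and numbers are invented), the distribution of a subset of variables can be computed by variable elimination, and d-separation tells us which evidence is irrelevant to a query:

```python
from pgmpy.factors.discrete import TabularCPD
from pgmpy.inference import VariableElimination
from pgmpy.models import BayesianNetwork

# Toy chain A -> B -> C: given B, the variables A and C are d-separated.
model = BayesianNetwork([("A", "B"), ("B", "C")])
model.add_cpds(
    TabularCPD("A", 2, [[0.7], [0.3]]),
    TabularCPD("B", 2, [[0.8, 0.4], [0.2, 0.6]], evidence=["A"], evidence_card=[2]),
    TabularCPD("C", 2, [[0.9, 0.5], [0.1, 0.5]], evidence=["B"], evidence_card=[2]),
)
assert model.check_model()

infer = VariableElimination(model)
# Marginal of the subset {C}; d-separation implies A is irrelevant once B is known,
# so the answer would be unchanged by additionally conditioning on A.
print(infer.query(variables=["C"], evidence={"B": 0}))
```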


Author(s):  
Marco F. Ramoni ◽  
Paola Sebastiani

Born at the intersection of artificial intelligence, statistics, and probability, Bayesian networks (Pearl, 1988) are a representation formalism at the cutting edge of knowledge discovery and data mining (Heckerman, 1997). Bayesian networks belong to a more general class of models called probabilistic graphical models (Whittaker, 1990; Lauritzen, 1996) that arise from the combination of graph theory and probability theory, and their success rests on their ability to handle complex probabilistic models by decomposing them into smaller, amenable components. A probabilistic graphical model is defined by a graph, where nodes represent stochastic variables and arcs represent dependencies among such variables. These arcs are annotated with probability distributions shaping the interaction between the linked variables. A probabilistic graphical model is called a Bayesian network when the graph connecting its variables is a directed acyclic graph (DAG). This graph represents conditional independence assumptions that are used to factorize the joint probability distribution of the network variables, thus making the process of learning from a large database amenable to computation. A Bayesian network induced from data can be used to investigate distant relationships between variables, as well as to make predictions and explanations, by computing the conditional probability distribution of one variable given the values of some others.
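For instance, for the DAG A → B → C the factorization reads p(A, B, C) = p(A) p(B|A) p(C|B). A minimal sketch (all numbers invented) of computing the joint and a conditional from the local tables:

```python
from itertools import product

# Local tables for the DAG A -> B -> C (binary variables, invented numbers).
p_a = {0: 0.7, 1: 0.3}
p_b_given_a = {(0, 0): 0.8, (1, 0): 0.2, (0, 1): 0.4, (1, 1): 0.6}  # keyed (b, a)
p_c_given_b = {(0, 0): 0.9, (1, 0): 0.1, (0, 1): 0.5, (1, 1): 0.5}  # keyed (c, b)


def joint(a, b, c):
    """p(a, b, c) = p(a) p(b|a) p(c|b): the DAG factorization."""
    return p_a[a] * p_b_given_a[(b, a)] * p_c_given_b[(c, b)]


# Conditional p(C=1 | A=0): sum out B, then normalize over C.
num = sum(joint(0, b, 1) for b in (0, 1))
den = sum(joint(0, b, c) for b, c in product((0, 1), repeat=2))
print(num / den)
```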


Author(s):  
Baisravan HomChaudhuri

Abstract This paper focuses on distributionally robust controller design for avoiding dynamic and stochastic obstacles whose exact probability distribution is unknown. The true probability distribution of the disturbance associated with an obstacle, although unknown, is assumed to belong to an ambiguity set that includes all the probability distributions sharing the same first two moments. The controller thus focuses on ensuring the satisfaction of the probabilistic collision avoidance constraints for all probability distributions in the ambiguity set, making the solution robust to the true probability distribution of the stochastic obstacles. Techniques from robust optimization are used to model the distributionally robust probabilistic (chance) constraints as a semi-definite programming (SDP) problem with linear matrix inequality (LMI) constraints that can be solved in a computationally tractable fashion. Simulation results for a robot obstacle avoidance problem show the efficacy of the method.
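As a hedged illustration of the idea (not the paper's SDP/LMI formulation), a single moment-based distributionally robust chance constraint on a halfspace reduces, via the one-sided Chebyshev bound, to a deterministic safety margin: requiring aᵀx ≥ aᵀμ + κ‖Σ^{1/2}a‖ with κ = √((1-ε)/ε) guarantees P(aᵀw > aᵀx) ≤ ε for every distribution of the obstacle position w with mean μ and covariance Σ. A minimal cvxpy sketch with invented numbers:

```python
import cvxpy as cp
import numpy as np

# Invented obstacle moments and avoidance direction.
mu = np.array([2.0, 0.0])                 # mean obstacle position
Sigma = np.array([[0.3, 0.0], [0.0, 0.3]])  # obstacle position covariance
a = np.array([1.0, 0.0])                  # halfspace normal (keep-away direction)
eps = 0.05                                # allowed collision probability
kappa = np.sqrt((1 - eps) / eps)          # one-sided Chebyshev coefficient

# Deterministic margin equivalent to the worst-case chance constraint.
margin = a @ mu + kappa * np.linalg.norm(np.linalg.cholesky(Sigma) @ a)

x = cp.Variable(2)                        # robot position (decision variable)
goal = np.array([4.0, 0.0])
prob = cp.Problem(cp.Minimize(cp.sum_squares(x - goal)), [a @ x >= margin])
prob.solve()
print(x.value)  # closest point to the goal that is robustly safe
```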


Author(s):  
Tahrima Rahman ◽  
Shasha Jin ◽  
Vibhav Gogate

Recently there has been growing interest in learning, from data, probabilistic models that admit poly-time inference, called tractable probabilistic models. Although they generalize poorly compared to intractable models, they often yield more accurate estimates at prediction time. In this paper, we seek to further explore this trade-off between generalization performance and inference accuracy by proposing a novel, partially tractable representation called cutset Bayesian networks (CBNs). The main idea in CBNs is to partition the variables into two subsets X and Y, learn an (intractable) Bayesian network that represents P(X) and a tractable conditional model that represents P(Y|X). The hope is that the intractable model will help improve generalization while the tractable model, by leveraging Rao-Blackwellised sampling which combines exact inference and sampling, will help improve the prediction accuracy. To compactly model P(Y|X), we introduce a novel tractable representation called conditional cutset networks (CCNs) in which all conditional probability distributions are represented using calibrated classifiers, that is, classifiers which typically yield higher-quality probability estimates than conventional classifiers. We show via a rigorous experimental evaluation that CBNs and CCNs yield more accurate posterior estimates than their tractable as well as intractable counterparts.
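A hedged sketch of the Rao-Blackwellised idea (toy model with invented numbers, not the authors' CCN implementation): sample the cutset X from its model and, for each sample, evaluate the tractable conditional exactly, so the estimate of P(y) averages exact conditionals rather than sampled indicator variables, which reduces variance:

```python
import random

random.seed(0)

# Toy cutset model: X ~ Bernoulli(0.3) stands in for the intractable P(X);
# p_y_given_x stands in for the tractable conditional model P(Y=1 | X).
def sample_x():
    return 1 if random.random() < 0.3 else 0

p_y_given_x = {0: 0.2, 1: 0.9}

# Rao-Blackwellised estimate of P(Y=1): average the *exact* conditional
# P(Y=1 | x) over samples of x, instead of also sampling Y.
n = 10_000
estimate = sum(p_y_given_x[sample_x()] for _ in range(n)) / n
print(estimate)  # close to 0.7*0.2 + 0.3*0.9 = 0.41
```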




2011 ◽  
Vol 09 (supp01) ◽  
pp. 39-47
Author(s):  
ALESSIA ALLEVI ◽  
MARIA BONDANI ◽  
ALESSANDRA ANDREONI

We present the experimental reconstruction of the Wigner function of some optical states. The method is based on direct intensity measurements by non-ideal photodetectors operated in the linear regime. The signal state is mixed at a beam-splitter with a set of coherent probes of known complex amplitudes, and the probability distribution of the detected photons is measured. The Wigner function is given by a suitable sum of these probability distributions measured for different values of the probe. For comparison, the same data are analyzed to obtain the photon-number distributions and the corresponding Wigner functions.
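As a rough illustration (the ideal-detector special case with an invented distribution; the paper's reconstruction additionally corrects for non-unit detector efficiency), the Wigner function at a phase-space point probed by a coherent displacement α is an alternating-sign sum over the measured photon-number distribution, W(α) = (2/π) Σₙ (-1)ⁿ p(n; α):

```python
from math import exp, factorial, pi


def poisson_pn(mean, n):
    """Photon-number distribution of a coherent state (Poissonian)."""
    return exp(-mean) * mean**n / factorial(n)


def wigner_from_counts(pn):
    """W = (2/pi) * sum_n (-1)^n p(n): the photon-number parity formula."""
    return (2.0 / pi) * sum((-1) ** n * p for n, p in enumerate(pn))


# Sanity check: for a coherent state with mean photon number |alpha|^2 after
# displacement, the sum reproduces the Gaussian W = (2/pi) exp(-2|alpha|^2).
for mean in (0.0, 0.5, 1.0):
    pn = [poisson_pn(mean, n) for n in range(60)]
    print(mean, wigner_from_counts(pn), (2.0 / pi) * exp(-2.0 * mean))
```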


2021 ◽  
Vol 5 (1) ◽  
pp. 1-11
Author(s):  
Vitthal Anwat ◽  
Pramodkumar Hire ◽  
Uttam Pawar ◽  
Rajendra Gunjal

The Flood Frequency Analysis (FFA) method was introduced by Fuller in 1914 to understand the magnitude and frequency of floods. The present study is carried out using the two most widely accepted probability distributions for FFA, namely the Gumbel Extreme Value type I (GEVI) and the Log Pearson type III (LP-III). The Kolmogorov-Smirnov (KS) and Anderson-Darling (AD) tests were used to select the most suitable probability distribution at sites in the Damanganga Basin. Moreover, discharges were estimated for various return periods using GEVI and LP-III. The recurrence interval of the largest peak flood on record (Qmax) is 107 years (at Nanipalsan) and 146 years (at Ozarkhed) as per LP-III. The Flood Frequency Curves (FFC) indicate that LP-III is the best-fitted probability distribution for FFA of the Damanganga Basin. Therefore, discharges and return periods estimated by the LP-III probability distribution are more reliable and can be used for designing hydraulic structures.
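A minimal sketch of this workflow (invented discharge values and scipy's parameterizations, not the study's data or code): fit Gumbel to the annual peaks and Pearson III to their logarithms, test the fit, and read off return-period discharges from the fitted quantile functions.

```python
import numpy as np
from scipy import stats

# Hypothetical annual peak discharges (m^3/s); a real series comes from gauges.
peaks = np.array([850., 1200., 640., 2300., 980., 1500., 770., 3100., 1100., 1900.])

# Gumbel (EV-I) fit and Kolmogorov-Smirnov goodness of fit
# (the Anderson-Darling test is similarly available via stats.anderson).
loc, scale = stats.gumbel_r.fit(peaks)
print(stats.kstest(peaks, "gumbel_r", args=(loc, scale)))

# Log-Pearson III: fit Pearson III to log10 of the peaks.
logs = np.log10(peaks)
skew, lloc, lscale = stats.pearson3.fit(logs)

# Discharge for return period T corresponds to non-exceedance prob. 1 - 1/T.
for T in (10, 50, 100):
    p = 1 - 1 / T
    q_gumbel = stats.gumbel_r.ppf(p, loc, scale)
    q_lp3 = 10 ** stats.pearson3.ppf(p, skew, lloc, lscale)
    print(T, round(q_gumbel), round(q_lp3))
```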


Author(s):  
J. L. Cagney ◽  
S. S. Rao

Abstract The modeling of manufacturing errors in mechanisms is a significant task in validating practical designs. The use of probability distributions for errors can simulate manufacturing variations and real-world operations. This paper presents the mechanical error analysis of universal joint drivelines. Each error is simulated using a probability distribution; that is, a design of the mechanism is created by assigning random values to the errors. Each design is then evaluated by comparing the output error with a limiting value, and the reliability of the universal joint is estimated. For this, the design is considered a failure whenever the output error exceeds the specified limit. In addition, the problem of synthesis, which involves the allocation of tolerances (errors) for minimum manufacturing cost without violating a specified accuracy requirement of the output, is also considered. Three probability distributions (normal, Weibull, and beta) were used to simulate the random values of the errors. The similarity of the results given by the three distributions suggests that the use of the normal distribution would be acceptable for modeling the tolerances in most cases.
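A minimal Monte Carlo sketch of this kind of reliability estimate (toy output function and invented tolerances, not the paper's driveline model): sample the errors from each distribution, propagate them to an output error, and count the fraction of designs within the limit.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# Invented tolerances on three component errors (mm), one array per distribution.
normal_e = rng.normal(0.0, 0.05, size=(n, 3))
weibull_e = 0.05 * rng.weibull(2.0, size=(n, 3)) - 0.044  # shifted near zero mean
beta_e = 0.2 * (rng.beta(2.0, 2.0, size=(n, 3)) - 0.5)    # symmetric on [-0.1, 0.1]

LIMIT = 0.1  # specified accuracy requirement on the output error (mm)


def reliability(errors):
    """Fraction of sampled designs whose output error stays within LIMIT.

    The output error here is a toy linear combination of the individual errors;
    a real analysis would propagate them through the mechanism's kinematics.
    """
    output_error = errors @ np.array([1.0, 0.8, 1.2])
    return np.mean(np.abs(output_error) <= LIMIT)


for name, e in [("normal", normal_e), ("weibull", weibull_e), ("beta", beta_e)]:
    print(name, reliability(e))
```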

