Alleviating the independence assumptions of averaged one-dependence estimators by model weighting

Of numerous proposals to refine naive Bayes by weakening its attribute independence assumption, averaged one-dependence estimators (AODE) has been shown to be able to achieve significantly higher classification accuracy at a moderate cost in classification efficiency. However, all one-dependence estimators (ODEs) in AODE have the same weights and are treated equally. To address this issue, model weighting, which assigns discriminate weights to ODEs and then linearly combine their probability estimates, has been proved to be an efficient and effective approach. Most information-theoretic weighting metrics, including mutual information, Kullback-Leibler measure and the information gain, place more emphasis on the correlation between root attribute (value) and class variable. We argue that the topology of each ODE can be divided into a set of local directed acyclic graphs (DAGs) based on the independence assumption, and multivariate mutual information is introduced to measure the extent to which the DAGs fit data. Based on this premise, in this study we propose a novel weighted AODE algorithm, called AWODE, that adaptively selects weights to alleviate the independence assumption and make the learned probability distribution fit the instance. The proposed approach is validated on 40 benchmark datasets from UCI machine learning repository. The experimental results reveal that, AWODE achieves bias-variance trade-off and is a competitive alternative to single-model Bayesian learners (such as TAN and KDB) and other weighted AODEs (such as WAODE).

Download Full-text

Resolution of Probabilistic Weather Forecasts with Application in Disease Management

Phytopathology ◽

10.1094/phyto-07-16-0256-r ◽

2017 ◽

Vol 107 (2) ◽

pp. 158-162 ◽

Cited By ~ 2

Author(s):

G. Hughes ◽

N. McRoberts ◽

F. J. Burnett

Keyword(s):

Disease Management ◽

Mutual Information ◽

Disease Risk ◽

Information Gain ◽

Weather Forecast ◽

Weather Data ◽

Specific Information ◽

Information Theoretic ◽

Weather Forecasts ◽

Measuring Change

Predictive systems in disease management often incorporate weather data among the disease risk factors, and sometimes this comes in the form of forecast weather data rather than observed weather data. In such cases, it is useful to have an evaluation of the operational weather forecast, in addition to the evaluation of the disease forecasts provided by the predictive system. Typically, weather forecasts and disease forecasts are evaluated using different methodologies. However, the information theoretic quantity expected mutual information provides a basis for evaluating both kinds of forecast. Expected mutual information is an appropriate metric for the average performance of a predictive system over a set of forecasts. Both relative entropy (a divergence, measuring information gain) and specific information (an entropy difference, measuring change in uncertainty) provide a basis for the assessment of individual forecasts.

Download Full-text

An Information-Theoretic Framework for Optimal Design: Analysis of Protocols for Estimating Soft Tissue Parameters in Biaxial Experiments

Axioms ◽

10.3390/axioms10020079 ◽

2021 ◽

Vol 10 (2) ◽

pp. 79

Author(s):

Ankush Aggarwal ◽

Damiano Lombardi ◽

Sanjay Pant

Keyword(s):

Optimal Design ◽

Mutual Information ◽

Soft Tissues ◽

Information Gain ◽

Model Parameters ◽

Conditional Mutual Information ◽

Information Theoretic ◽

Information Theoretic Measures ◽

New Framework ◽

Biaxial Experiments

A new framework for optimal design based on the information-theoretic measures of mutual information, conditional mutual information and their combination is proposed. The framework is tested on the analysis of protocols—a combination of angles along which strain measurements can be acquired—in a biaxial experiment of soft tissues for the estimation of hyperelastic constitutive model parameters. The proposed framework considers the information gain about the parameters from the experiment as the key criterion to be maximised, which can be directly used for optimal design. Information gain is computed through k-nearest neighbour algorithms applied to the joint samples of the parameters and measurements produced by the forward and observation models. For biaxial experiments, the results show that low angles have a relatively low information content compared to high angles. The results also show that a smaller number of angles with suitably chosen combinations can result in higher information gains when compared to a larger number of angles which are poorly combined. Finally, it is shown that the proposed framework is consistent with classical approaches, particularly D-optimal design.

Download Full-text

Information Theoretic Causal Effect Quantification

Entropy ◽

10.3390/e21100975 ◽

2019 ◽

Vol 21 (10) ◽

pp. 975

Author(s):

Aleksander Wieczorek ◽

Volker Roth

Keyword(s):

Information Theory ◽

Time Series ◽

Causal Effect ◽

Directed Acyclic Graphs ◽

Second Step ◽

Information Theoretic ◽

Acyclic Graphs ◽

Outcome Framework ◽

Directed Information ◽

Chain Graphs

Modelling causal relationships has become popular across various disciplines. Most common frameworks for causality are the Pearlian causal directed acyclic graphs (DAGs) and the Neyman-Rubin potential outcome framework. In this paper, we propose an information theoretic framework for causal effect quantification. To this end, we formulate a two step causal deduction procedure in the Pearl and Rubin frameworks and introduce its equivalent which uses information theoretic terms only. The first step of the procedure consists of ensuring no confounding or finding an adjustment set with directed information. In the second step, the causal effect is quantified. We subsequently unify previous definitions of directed information present in the literature and clarify the confusion surrounding them. We also motivate using chain graphs for directed information in time series and extend our approach to chain graphs. The proposed approach serves as a translation between causality modelling and information theory.

Download Full-text

Weighted Independent Directed Acyclic Graphs for Resilient and Efficient Multipath Routing

International Journal of Mathematics and Soft Computing ◽

10.26708/ijmsc.2013.3.3.01 ◽

2013 ◽

Vol 3 (3) ◽

pp. 1

Author(s):

M. Ramakrishnan ◽

Sowmya Swaminathan ◽

Jayaganesan Chandresekaran

Keyword(s):

Multipath Routing ◽

Directed Acyclic Graphs ◽

Acyclic Graphs

Download Full-text

A Nettree for Simple Paths with Length Constraint and the Longest Path in Directed Acyclic Graphs

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2012.02194 ◽

2012 ◽

Vol 35 (10) ◽

pp. 2194 ◽

Cited By ~ 2

Author(s):

Yan LI ◽

Le SUN ◽

Huai-Zhong ZHU ◽

You-Xi WU

Keyword(s):

Directed Acyclic Graphs ◽

Length Constraint ◽

Acyclic Graphs ◽

Longest Path ◽

Simple Paths

Download Full-text

Updating Properties of Directed Acyclic Graphs on a Parallel Random Access Machine.

10.21236/ada162954 ◽

1985 ◽

Author(s):

Shaunak Pawagi ◽

I. V. Ramakrishnan

Keyword(s):

Random Access ◽

Directed Acyclic Graphs ◽

Random Access Machine ◽

Acyclic Graphs ◽

Parallel Random Access Machine

Download Full-text

Do galactic bars depend on environment?: an information theoretic analysis of Galaxy Zoo 2

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/staa3665 ◽

2020 ◽

Vol 501 (1) ◽

pp. 994-1001

Author(s):

Suman Sarkar ◽

Biswajit Pandey ◽

Snehasish Bhattacharjee

Keyword(s):

Spatial Distribution ◽

Mutual Information ◽

Local Density ◽

Statistical Significance ◽

Distribution Functions ◽

Cumulative Distribution ◽

Host Galaxy ◽

Data Sets ◽

Data Set ◽

Information Theoretic

ABSTRACT We use an information theoretic framework to analyse data from the Galaxy Zoo 2 project and study if there are any statistically significant correlations between the presence of bars in spiral galaxies and their environment. We measure the mutual information between the barredness of galaxies and their environments in a volume limited sample (Mr ≤ −21) and compare it with the same in data sets where (i) the bar/unbar classifications are randomized and (ii) the spatial distribution of galaxies are shuffled on different length scales. We assess the statistical significance of the differences in the mutual information using a t-test and find that both randomization of morphological classifications and shuffling of spatial distribution do not alter the mutual information in a statistically significant way. The non-zero mutual information between the barredness and environment arises due to the finite and discrete nature of the data set that can be entirely explained by mock Poisson distributions. We also separately compare the cumulative distribution functions of the barred and unbarred galaxies as a function of their local density. Using a Kolmogorov–Smirnov test, we find that the null hypothesis cannot be rejected even at $75{{\ \rm per\ cent}}$ confidence level. Our analysis indicates that environments do not play a significant role in the formation of a bar, which is largely determined by the internal processes of the host galaxy.

Download Full-text

Might temporal logic improve the specification of directed acyclic graphs (DAGs)?

Journal of Statistics and Data Science Education ◽

10.1080/26939169.2021.1936311 ◽

2021 ◽

pp. 1-18

Author(s):

George T.H. Ellison

Keyword(s):

Temporal Logic ◽

Directed Acyclic Graphs ◽

Acyclic Graphs

Download Full-text

Directed acyclic graphs: An under-utilized tool for child maltreatment research

Child Abuse & Neglect ◽

10.1016/j.chiabu.2019.02.011 ◽

2019 ◽

Vol 91 ◽

pp. 78-87 ◽

Cited By ~ 4

Author(s):

Anna E. Austin ◽

Tania A. Desrosiers ◽

Meghan E. Shanahan

Keyword(s):

Child Maltreatment ◽

Directed Acyclic Graphs ◽

Acyclic Graphs

Download Full-text

Scalable information-theoretic path planning for a rover-helicopter team in uncertain environments

International Journal of Advanced Robotic Systems ◽

10.1177/1729881421999587 ◽

2021 ◽

Vol 18 (2) ◽

pp. 172988142199958

Author(s):

Larkin Folsom ◽

Masahiro Ono ◽

Kyohei Otsu ◽

Hyoshin Park

Keyword(s):

Path Planning ◽

Lower Risk ◽

Information Gain ◽

Probability Distributions ◽

Single Agent ◽

Uncertain Environments ◽

Information Theoretic ◽

Highly Correlated ◽

Naive Approach

Mission-critical exploration of uncertain environments requires reliable and robust mechanisms for achieving information gain. Typical measures of information gain such as Shannon entropy and KL divergence are unable to distinguish between different bimodal probability distributions or introduce bias toward one mode of a bimodal probability distribution. The use of a standard deviation (SD) metric reduces bias while retaining the ability to distinguish between higher and lower risk distributions. Areas of high SD can be safely explored through observation with an autonomous Mars Helicopter allowing safer and faster path plans for ground-based rovers. First, this study presents a single-agent information-theoretic utility-based path planning method for a highly correlated uncertain environment. Then, an information-theoretic two-stage multiagent rapidly exploring random tree framework is presented, which guides Mars helicopter through regions of high SD to reduce uncertainty for the rover. In a Monte Carlo simulation, we compare our information-theoretic framework with a rover-only approach and a naive approach, in which the helicopter scouts ahead of the rover along its planned path. Finally, the model is demonstrated in a case study on the Jezero region of Mars. Results show that the information-theoretic helicopter improves the travel time for the rover on average when compared with the rover alone or with the helicopter scouting ahead along the rover’s initially planned route.

Download Full-text