scholarly journals Model Selection Using Information Theory and the MDL Principle

2004 ◽  
Vol 33 (2) ◽  
pp. 230-260 ◽  
Author(s):  
Robert A. Stine
2004 ◽  
Vol 5 (2) ◽  
pp. 229-241 ◽  
Author(s):  
James P. Hoffmann ◽  
Christopher D. Ellingwood ◽  
Osei M. Bonsu ◽  
Daniel E. Bentil

2020 ◽  
Vol 69 (6) ◽  
pp. 1163-1179 ◽  
Author(s):  
Kris V Parag ◽  
Christl A Donnelly

Abstract Estimating temporal changes in a target population from phylogenetic or count data is an important problem in ecology and epidemiology. Reliable estimates can provide key insights into the climatic and biological drivers influencing the diversity or structure of that population and evidence hypotheses concerning its future growth or decline. In infectious disease applications, the individuals infected across an epidemic form the target population. The renewal model estimates the effective reproduction number, R, of the epidemic from counts of observed incident cases. The skyline model infers the effective population size, N, underlying a phylogeny of sequences sampled from that epidemic. Practically, R measures ongoing epidemic growth while N informs on historical caseload. While both models solve distinct problems, the reliability of their estimates depends on p-dimensional piecewise-constant functions. If p is misspecified, the model might underfit significant changes or overfit noise and promote a spurious understanding of the epidemic, which might misguide intervention policies or misinform forecasts. Surprisingly, no transparent yet principled approach for optimizing p exists. Usually, p is heuristically set, or obscurely controlled via complex algorithms. We present a computable and interpretable p-selection method based on the minimum description length (MDL) formalism of information theory. Unlike many standard model selection techniques, MDL accounts for the additional statistical complexity induced by how parameters interact. As a result, our method optimizes p so that R and N estimates properly and meaningfully adapt to available data. It also outperforms comparable Akaike and Bayesian information criteria on several classification problems, given minimal knowledge of the parameter space, and exposes statistical similarities among renewal, skyline, and other models in biology. Rigorous and interpretable model selection is necessary if trustworthy and justifiable conclusions are to be drawn from piecewise models. [Coalescent processes; epidemiology; information theory; model selection; phylodynamics; renewal models; skyline plots]


2011 ◽  
Vol 159 (3) ◽  
pp. 414-427 ◽  
Author(s):  
S.M. Kirchner ◽  
T.F. Döring ◽  
L.H. Hiltunen ◽  
E. Virtanen ◽  
J.P.T. Valkonen

2021 ◽  
pp. 149-164
Author(s):  
Timothy E. Essington

The chapter “Model Selection” provides a brief overview of alternative ways of thinking about what is (are) the best model(s) and shows the core motivation behind the Akaike information criterion as a measure of the predictive ability of a model fitted via maximum likelihood. It then gives stepwise practical guidance for using information theory as a basis of model selection, including nested versus non-nested models, goodness of fit, and overdispersion. Advanced topics cover some of the philosophy of information theory, other types of information theory criteria, and other ways of evaluating the predictive ability of models. As an example, the chapter examines the case of the Western monarch butterfly (Danaus plexippus plexippus), which, over the past two decades, has experienced a 97% decline from its historical average abundance, declining by 86% from 2017 to 2018 alone. Undoubtedly, there is more than one cause—indeed, overwintering habitat loss and pesticide use are both believed to be important contributors to the decline. Adopting a hypothesis-evaluation framework makes it possible to consider multiple alternative hypotheses simultaneously and measure degrees of support for alternative hypotheses.


Entropy ◽  
2019 ◽  
Vol 21 (11) ◽  
pp. 1081 ◽  
Author(s):  
Sergey Oladyshkin ◽  
Wolfgang Nowak

We show a link between Bayesian inference and information theory that is useful for model selection, assessment of information entropy and experimental design. We align Bayesian model evidence (BME) with relative entropy and cross entropy in order to simplify computations using prior-based (Monte Carlo) or posterior-based (Markov chain Monte Carlo) BME estimates. On the one hand, we demonstrate how Bayesian model selection can profit from information theory to estimate BME values via posterior-based techniques. Hence, we use various assumptions including relations to several information criteria. On the other hand, we demonstrate how relative entropy can profit from BME to assess information entropy during Bayesian updating and to assess utility in Bayesian experimental design. Specifically, we emphasize that relative entropy can be computed avoiding unnecessary multidimensional integration from both prior and posterior-based sampling techniques. Prior-based computation does not require any assumptions, however posterior-based estimates require at least one assumption. We illustrate the performance of the discussed estimates of BME, information entropy and experiment utility using a transparent, non-linear example. The multivariate Gaussian posterior estimate includes least assumptions and shows the best performance for BME estimation, information entropy and experiment utility from posterior-based sampling.


2008 ◽  
Vol 23 (06) ◽  
pp. 787-802 ◽  
Author(s):  
PIA MUKHERJEE ◽  
DAVID PARKINSON

We give an overview of the recent progress in the field of cosmological model selection. Model selection statistics, such as those based on information theory and on Bayesian statistics are introduced and discussed. In the Bayesian framework, the marginalised model likelihood, or evidence, is the primary model selection statistic. We describe different methods of computing the evidence, and focus in particular on Nested Sampling. We describe the results of applying model selection methods to new cosmological data such as the CMB measurements by WMAP.


Sign in / Sign up

Export Citation Format

Share Document