Parsimonious model selection using information theory: a modified selection rule

Ecology ◽  
2021 ◽  
Author(s):  
Luke A. Yates ◽  
Shane A. Richards ◽  
Barry W. Brook
2004 ◽  
Vol 5 (2) ◽  
pp. 229-241 ◽  
Author(s):  
James P. Hoffmann ◽  
Christopher D. Ellingwood ◽  
Osei M. Bonsu ◽  
Daniel E. Bentil

2020 ◽  
Vol 69 (6) ◽  
pp. 1163-1179 ◽  
Author(s):  
Kris V Parag ◽  
Christl A Donnelly

Abstract Estimating temporal changes in a target population from phylogenetic or count data is an important problem in ecology and epidemiology. Reliable estimates can provide key insights into the climatic and biological drivers influencing the diversity or structure of that population and evidence hypotheses concerning its future growth or decline. In infectious disease applications, the individuals infected across an epidemic form the target population. The renewal model estimates the effective reproduction number, R, of the epidemic from counts of observed incident cases. The skyline model infers the effective population size, N, underlying a phylogeny of sequences sampled from that epidemic. Practically, R measures ongoing epidemic growth while N informs on historical caseload. While both models solve distinct problems, the reliability of their estimates depends on p-dimensional piecewise-constant functions. If p is misspecified, the model might underfit significant changes or overfit noise and promote a spurious understanding of the epidemic, which might misguide intervention policies or misinform forecasts. Surprisingly, no transparent yet principled approach for optimizing p exists. Usually, p is heuristically set, or obscurely controlled via complex algorithms. We present a computable and interpretable p-selection method based on the minimum description length (MDL) formalism of information theory. Unlike many standard model selection techniques, MDL accounts for the additional statistical complexity induced by how parameters interact. As a result, our method optimizes p so that R and N estimates properly and meaningfully adapt to available data. It also outperforms comparable Akaike and Bayesian information criteria on several classification problems, given minimal knowledge of the parameter space, and exposes statistical similarities among renewal, skyline, and other models in biology. Rigorous and interpretable model selection is necessary if trustworthy and justifiable conclusions are to be drawn from piecewise models. [Coalescent processes; epidemiology; information theory; model selection; phylodynamics; renewal models; skyline plots]


2011 ◽  
Vol 159 (3) ◽  
pp. 414-427 ◽  
Author(s):  
S.M. Kirchner ◽  
T.F. Döring ◽  
L.H. Hiltunen ◽  
E. Virtanen ◽  
J.P.T. Valkonen

2007 ◽  
Vol 26 (11) ◽  
pp. 1576-1584 ◽  
Author(s):  
R.Z. Freidlin ◽  
E. Ozarslan ◽  
M.E. Komlosh ◽  
Lin-Ching Chang ◽  
Cheng Guan Koay ◽  
...  

2007 ◽  
Author(s):  
Raisa Z. Freidlin ◽  
Michal E. Komlosh ◽  
Murray H. Loew ◽  
Peter J. Basser

2017 ◽  
Vol 52 (1) ◽  
pp. 341-363 ◽  
Author(s):  
Roy Kouwenberg ◽  
Agnieszka Markiewicz ◽  
Ralph Verhoeks ◽  
Remco C. J. Zwinkels

Exchange rate models with uncertain and incomplete information predict that investors focus on a small set of fundamentals that changes frequently over time. We design a model selection rule that captures the current set of fundamentals that best predicts the exchange rate. Out-of-sample tests show that the forecasts made by this rule significantly beat a random walk for 5 out of 10 currencies. Furthermore, the currency forecasts generate meaningful investment profits. We demonstrate that the strong performance of the model selection rule is driven by time-varying weights attached to a small set of fundamentals, in line with theory.


2021 ◽  
pp. 149-164
Author(s):  
Timothy E. Essington

The chapter “Model Selection” provides a brief overview of alternative ways of thinking about what is (are) the best model(s) and shows the core motivation behind the Akaike information criterion as a measure of the predictive ability of a model fitted via maximum likelihood. It then gives stepwise practical guidance for using information theory as a basis of model selection, including nested versus non-nested models, goodness of fit, and overdispersion. Advanced topics cover some of the philosophy of information theory, other types of information theory criteria, and other ways of evaluating the predictive ability of models. As an example, the chapter examines the case of the Western monarch butterfly (Danaus plexippus plexippus), which, over the past two decades, has experienced a 97% decline from its historical average abundance, declining by 86% from 2017 to 2018 alone. Undoubtedly, there is more than one cause—indeed, overwintering habitat loss and pesticide use are both believed to be important contributors to the decline. Adopting a hypothesis-evaluation framework makes it possible to consider multiple alternative hypotheses simultaneously and measure degrees of support for alternative hypotheses.


Sign in / Sign up

Export Citation Format

Share Document