Annual Review of Statistics and Its Application
Latest Publications


TOTAL DOCUMENTS

175
(FIVE YEARS 77)

H-INDEX

27
(FIVE YEARS 8)

Published By Annual Reviews

2326-831x, 2326-8298

Author(s):  
Leah F. South ◽  
Marina Riabiz ◽  
Onur Teymur ◽  
Chris J. Oates

Markov chain Monte Carlo is the engine of modern Bayesian statistics, being used to approximate the posterior and derived quantities of interest. Despite this, the issue of how the output from a Markov chain is postprocessed and reported is often overlooked. Convergence diagnostics can be used to control bias via burn-in removal, but these do not account for (common) situations where a limited computational budget engenders a bias-variance trade-off. The aim of this article is to review state-of-the-art techniques for postprocessing Markov chain output. Our review covers methods based on discrepancy minimization, which directly address the bias-variance trade-off, as well as general-purpose control variate methods for approximating expected quantities of interest. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Noel Cressie ◽  
Matthew Sainsbury-Dale ◽  
Andrew Zammit-Mangion

Spatial statistics is concerned with the analysis of data that have spatial locations associated with them, and those locations are used to model statistical dependence between the data. The spatial data are treated as a single realization from a probability model that encodes the dependence through both fixed effects and random effects, where randomness is manifest in the underlying spatial process and in the noisy, incomplete measurement process. The focus of this review article is on the use of basis functions to provide an extremely flexible and computationally efficient way to model spatial processes that are possibly highly nonstationary. Several examples of basis-function models are provided to illustrate how they are used in Gaussian, non-Gaussian, multivariate, and spatio-temporal settings, with applications in geophysics. Our aim is to emphasize the versatility of these spatial-statistical models and to demonstrate that they are now center-stage in a number of application domains. The review concludes with a discussion and illustration of software currently available to fit spatial-basis-function models and implement spatial-statistical prediction. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Ali Shojaie ◽  
Emily B. Fox

Introduced more than a half-century ago, Granger causality has become a popular tool for analyzing time series data in many application domains, from economics and finance to genomics and neuroscience. Despite this popularity, the validity of this framework for inferring causal relationships among time series has remained the topic of continuous debate. Moreover, while the original definition was general, limitations in computational tools have constrained the applications of Granger causality to primarily simple bivariate vector autoregressive processes. Starting with a review of early developments and debates, this article discusses recent advances that address various shortcomings of the earlier approaches, from models for high-dimensional time series to more recent developments that account for nonlinear and non-Gaussian observations and allow for subsampled and mixed-frequency time series. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Bianca L. De Stavola ◽  
Moritz Herle ◽  
Andrew Pickles

We describe the principles of counterfactual thinking in providing more precise definitions of causal effects and some of the implications of this work for the way in which causal questions in life course research are framed and evidence evaluated. Terminology is explained and examples of common life course analyses are discussed that focus on the timing of exposures, the mediation of their effects, observed and unobserved confounders, and measurement error. The examples are illustrated by analyses using singleton and twin cohort data. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Arun K. Kuchibhotla ◽  
John E. Kolassa ◽  
Todd A. Kuffner

We discuss inference after data exploration, with a particular focus on inference after model or variable selection. We review three popular approaches to this problem: sample splitting, simultaneous inference, and conditional selective inference. We explain how each approach works and highlight its advantages and disadvantages. We also provide an illustration of these post-selection inference approaches. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Andrew C. Harvey

The construction of score-driven filters for nonlinear time series models is described, and they are shown to apply over a wide range of disciplines. Their theoretical and practical advantages over other methods are highlighted. Topics covered include robust time series modeling, conditional heteroscedasticity, count data, dynamic correlation and association, censoring, circular data, and switching regimes. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Thomas H. Scheike ◽  
Klaus Kähler Holst

Familial aggregation refers to the fact that a particular disease may be overrepresented in some families due to genetic or environmental factors. When studying such phenomena, it is clear that one important aspect is the age of onset of the disease in question, and in addition, the data will typically be right-censored. Therefore, one must apply lifetime data methods to quantify such dependence and to separate it into different sources using polygenic modeling. Another important point is that the occurrence of a particular disease can be prevented by death—that is, competing risks—and therefore, the familial aggregation should be studied in a model that allows for both death and the occurrence of the disease. We here demonstrate how polygenic modeling can be done for both survival data and competing risks data dealing with right-censoring. The competing risks modeling that we focus on is closely related to the liability threshold model. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Léo R. Belzile ◽  
Anthony C. Davison ◽  
Jutta Gampe ◽  
Holger Rootzén ◽  
Dmitrii Zholud

There is sustained and widespread interest in understanding the limit, if there is any, to the human life span. Apart from its intrinsic and biological interest, changes in survival in old age have implications for the sustainability of social security systems. A central question is whether the endpoint of the underlying lifetime distribution is finite. Recent analyses of data on the oldest human lifetimes have led to competing claims about survival and to some controversy, due in part to incorrect statistical analysis. This article discusses the particularities of such data, outlines correct ways of handling them, and presents suitable models and methods for their analysis. We provide a critical assessment of some earlier work and illustrate the ideas through reanalysis of semisupercentenarian lifetime data. Our analysis suggests that remaining life length after age 109 is exponentially distributed and that any upper limit lies well beyond the highest lifetime yet reliably recorded. Lower limits to 95% confidence intervals for the human life span are about 130 years, and point estimates typically indicate no upper limit at all. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
Francesco Bartolucci ◽  
Silvia Pandolfi ◽  
Fulvia Pennoni

We review the discrete latent variable approach, which is very popular in statistics and related fields. It allows us to formulate interpretable and flexible models that can be used to analyze complex datasets in the presence of articulated dependence structures among variables. Specific models including discrete latent variables are illustrated, such as finite mixture, latent class, hidden Markov, and stochastic block models. Algorithms for maximum likelihood and Bayesian estimation of these models are reviewed, focusing, in particular, on the expectation–maximization algorithm and the Markov chain Monte Carlo method with data augmentation. Model selection, particularly concerning the number of support points of the latent distribution, is discussed. The approach is illustrated by summarizing applications available in the literature; a brief review of the main software packages to handle discrete latent variable models is also provided. Finally, some possible developments in this literature are suggested. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Author(s):  
A. Philip Dawid ◽  
Monica Musio

We describe and contrast two distinct problem areas for statistical causality: studying the likely effects of an intervention (effects of causes) and studying whether there is a causal link between the observed exposure and outcome in an individual case (causes of effects). For each of these, we introduce and compare various formal frameworks that have been proposed for that purpose, including the decision-theoretic approach, structural equations, structural and stochastic causal models, and potential outcomes. We argue that counterfactual concepts are unnecessary for studying effects of causes but are needed for analyzing causes of effects. They are, however, subject to a degree of arbitrariness, which can be reduced, though not in general eliminated, by taking account of additional structure in the problem. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 9 is March 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Sign in / Sign up

Export Citation Format

Share Document