scholarly journals Solutions to problems of nonexistence of parameter estimates and sparse data bias in Poisson regression

2021 ◽  
pp. 096228022110654
Author(s):  
Ashwini Joshi ◽  
Angelika Geroldinger ◽  
Lena Jiricka ◽  
Pralay Senchaudhuri ◽  
Christopher Corcoran ◽  
...  

Poisson regression can be challenging with sparse data, in particular with certain data constellations where maximum likelihood estimates of regression coefficients do not exist. This paper provides a comprehensive evaluation of methods that give finite regression coefficients when maximum likelihood estimates do not exist, including Firth’s general approach to bias reduction, exact conditional Poisson regression, and a Bayesian estimator using weakly informative priors that can be obtained via data augmentation. Furthermore, we include in our evaluation a new proposal for a modification of Firth’s approach, improving its performance for predictions without compromising its attractive bias-correcting properties for regression coefficients. We illustrate the issue of the nonexistence of maximum likelihood estimates with a dataset arising from the recent outbreak of COVID-19 and an example from implant dentistry. All methods are evaluated in a comprehensive simulation study under a variety of realistic scenarios, evaluating their performance for prediction and estimation. To conclude, while exact conditional Poisson regression may be confined to small data sets only, both the modification of Firth’s approach and the Bayesian estimator are universally applicable solutions with attractive properties for prediction and estimation. While the Bayesian method needs specification of prior variances for the regression coefficients, the modified Firth approach does not require any user input.

2021 ◽  
Vol 50 (3) ◽  
pp. 41-53
Author(s):  
Andre Menezes ◽  
Josmar Mazucheli ◽  
F. Alqallaf ◽  
M. E. Ghitany

It is well known that the maximum likelihood estimates (MLEs) have appealing statistical properties. Under fairly mild conditions their asymptotic distribution is normal, and no other estimator has a smaller asymptotic variance.However, in finite samples the maximum likelihood estimates are often biased estimates and the bias disappears as the sample size grows.Mazucheli, Menezes, and Ghitany (2018b) introduced a two-parameter unit-Weibull distribution which is useful for modeling data on the unit interval, however its MLEs are biased in finite samples.In this paper, we adopt three approaches for bias reduction of the MLEs of the parameters of unit-Weibull distribution.The first approach is the analytical methodology suggested by Cox and Snell (1968), the second is based on parametric bootstrap resampling method, and the third is the preventive approach introduced by Firth (1993).The results from Monte Carlo simulations revealed that the biases of the estimates should not be ignored and the bias reduction approaches are equally efficient. However, the first approach is easier to implement.Finally, applications to two real data sets are presented for illustrative purposes.


1992 ◽  
Vol 31 (03) ◽  
pp. 215-218
Author(s):  
J. Y. Wan ◽  
A. T. Galecki

Abstract:A Poisson regression model is proposed for the analysis of incidence rates presented in a two-way table classified by two categorical variables. It is shown that the likelihood function is the same as that using Glasser’s exponential covariate model. An algorithm is given to solve the maximum likelihood estimates of the regression parameters. The model is evaluated via deviance and the method is illustrated with an example. Some extensions of the model are discussed.


2002 ◽  
Vol 27 (2) ◽  
pp. 147-161 ◽  
Author(s):  
David Rindskopf

Infinite parameter estimates in logistic regression are commonly thought of as a problem. This article shows that in principle an analyst should be happy to have an infinite slope in logistic regression, because it indicates that a predictor is perfect. Using simple approaches, hypothesis tests may be performed and confidence intervals calculated even when a slope is infinite. Some functions of parameters that are infinite are still well defined, and reasonable estimates of these quantities (in particular, LD50) may be obtained even when the maximum likelihood estimates do not, in a strict sense, exist. Surprisingly, these techniques can provide more reasonable and useful results than the most popular alternative method, exact logistic regression.


2016 ◽  
Vol 47 (1) ◽  
pp. 1-41 ◽  
Author(s):  
Greg Taylor

AbstractThe cross-classified chain ladder has a number of versions, depending on the distribution to which observations are subject. The simplest case is that of Poisson distributed observations, and then maximum likelihood estimates of parameters are explicit. Most other cases, however, including Bayesian chain ladder models, lead to implicit MAP (Bayesian) or MLE (non-Bayesian) solutions for these parameter estimates, raising questions as to their existence and uniqueness. The present paper investigates these questions in the case where observations are distributed according to some member of the exponential dispersion family.


Sign in / Sign up

Export Citation Format

Share Document