A Study of Count Regression Models for Mortality Rate

This paper discusses how overdispersed count data to be fit. Poisson regression model, Negative Binomial 1 regression model (NEGBIN 1) and Negative Binomial regression 2 (NEGBIN 2) model were proposed to fit mortality rate data. The method used is comparing the values of Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) to find out which method suits the data the most. The results show that the data indeed display higher variability. Among the three models, the model preferred is NEGBIN 1 model.

Download Full-text

Selecting the best model to fit the Rainfall Count data Using Some Zero Type models with application

Journal of Al-Qadisiyah for Computer Science and Mathematics ◽

10.29304/jqcm.2019.11.2.555 ◽

2019 ◽

Vol 11 (2) ◽

pp. 28-41

Author(s):

Luay Habeeb Hashim ◽

Ahmad Naeem Flaih

Keyword(s):

Linear Regression ◽

Regression Model ◽

Count Data ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Skewed Distribution ◽

Binomial Model ◽

Poisson Regression Model ◽

Health Marketing ◽

Binomial Regression

28 Counts data models cope with the response variable counts, where the number of times that a certain event occurs in a fixed point is called count data, its observations consists of non-negative integers values {0,1,2,…}. Because of the nature of count data, the response variables are usually considered doing not follow normal distribution. Therefore, linear regression is not an appropriate method to analysis count data due to the skewed distribution. Hence, using linear regression model to analysis count data is likely to bias the results, under these limitations, Poisson regression model and “Negative binomial regression” are likely the appropriate models to analysis count data. Sometimes researchers may Counts more zeros than the expected. Count data with many Zeros leads to a concept called “Zero-inflation”. Data with abundant zeros are especially popular in health, marketing, finance, econometric, ecology, statistics quality control, geographical, and environmental fields when counting the occurrence of certain behavioral and natural events, such as frequency of alcohol use, take drugs, number of cigarettes smoked, the occurrence of earthquakes, rainfall, and etc. Some models have been used to analyzing count data such as the “zero- altered Poisson” (ZAP) model and the “negative binomial” model. In this paper, the models, Poisson, Negative Binomial, ZAP, and ZANB were been used to analyze rainfall data.

Download Full-text

Comparison between Efficiency of Poisson Regression Model and Negative Binomial Regression in the Analysis of Factors Affecting Mortality from Cardiovascular Diseases in Yazd Province in 2017

The Journal of Tolooebehdasht ◽

10.18502/tbj.v19i5.5173 ◽

2021 ◽

Author(s):

Mohammad Mirjani Arjenan ◽

Mohsen Askarshahi ◽

Mahmud Vakili

Keyword(s):

Cardiovascular Diseases ◽

Regression Model ◽

Poisson Regression ◽

Regression Models ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Poisson Regression Model ◽

Registration System ◽

Factors Affecting ◽

Binomial Regression

Introduction: Despite the advances in cardiovascular diseases, death caused by these diseases is still considered as the leading cause of mortality. In this study, some of the effective factors on the deaths caused by cardiovascular diseases were investigated Methods: This cross-sectional analytical study investigated the efficacy of Poisson regression models and negative binomial regression models on factors affecting mortality from cardiovascular diseases. The death data were extracted from the death registration system for Yazd province in 2017.Gender, age, education, occupation, location, and city of death were also extracted for each deceased. The two regression models were then fitted to the data Results: A total of 5,015 deaths were recorded, of which 1,642 were due to cardiovascular diseases. Cardiovascular disease mortality rates were significant using negative binomial regression in terms of the educational variables, place of residence, type of residence, and age. Death rates caused by cardiovascular diseases were not significant for age and occupational, educational, and residential variables. Conclusion: If the time of death is considered as an offset variable, the regression model of two negative sentences is more effective in showing the factors affecting death due to cardiovascular diseases according to AIC and BIC criteria. In the case that the total number of deaths is considered as the offset variable, the Poisson regression model is more efficient.

Download Full-text

Count Models for Identifying Factors Associated with Child Mortality in Ethiopia

10.21203/rs.3.rs-222761/v1 ◽

2021 ◽

Author(s):

Endale Alemayehu Ali ◽

Tsigereda Tilahun

Keyword(s):

Mortality Rate ◽

Regression Model ◽

Child Mortality ◽

Regression Models ◽

Negative Binomial ◽

Average Rate ◽

Negative Binomial Regression ◽

Negative Binomial Regression Model ◽

Count Models ◽

Binomial Regression

Abstract Introduction: The under-five mortality rate, often known by its acronym U5MR or simply as the child mortality rate, indicates the probability of dying between births exactly five years of age, expressed per 1,000 live births. In comparison, the probability of dying after the first month and before reaching age 1 was 12 per 1,000, the probability of dying after age 1 and before age 5 was 10 per 1,000, and the probability of dying after age 5 and before age 15 was 7 per 1,000. Objectives: The study was aimed to determine the major factors of child mortality in Ethiopia using different counting models. In detail the study has the objective of identifying the risk factors of child mortality in Ethiopia and also to prioritize the best counting models that fit the data well. Methods: The Ethiopian demographic and health survey of 2016 was used for this study. About 10641 women aged between 15-49 were included in the survey. To analyze the data, counting models like the Poisson regression model, negative binomial model, zero-inflated regression models, and zero-inflated negative binomial regression model were applicable. Results: The results of the study indicated that of the total 10641 women respondents, 7576 (71.2%) have not faced the problem of child mortality. Thus, this result has the clue that the count models, especially the models that can handle the dispersion may be applicable. The average rate of child mortality is less than the variance of child mortality and this indicated that there is an over-dispersion of the data. Of all the candidate models, a zero-inflated negative binomial regression model was found to be the best model since it has a minimum AIC(15517). The coefficient table of the best model indicated that of child mortality for the women from rural residence is 1.2532 greater than those from urban with a 95% confidence interval (0.0905, 0.3610). Conclusion: The model comparison technique is indicated that the zero-inflated negative binomial regression models were the best mode that fit the data well. Under this model, the residency of women, birth order, Preceding Birth Interval, Size of a child at birth (smaller than average), and number of household members are significant variables in determining the status of child mortality in Ethiopia

Download Full-text

Statistical models for analyzing count data: predictors of length of stay among HIV patients in Portugal using a multilevel model

BMC Health Services Research ◽

10.1186/s12913-021-06389-1 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Ahmed Nabil Shaaban ◽

Bárbara Peleteiro ◽

Maria Rosario O. Martins

Keyword(s):

Length Of Stay ◽

Regression Model ◽

Random Effects ◽

Count Data ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Comprehensive Approach ◽

Negative Binomial Regression Model ◽

Hiv Patients ◽

Binomial Regression

Abstract Background This study offers a comprehensive approach to precisely analyze the complexly distributed length of stay among HIV admissions in Portugal. Objective To provide an illustration of statistical techniques for analysing count data using longitudinal predictors of length of stay among HIV hospitalizations in Portugal. Method Registered discharges in the Portuguese National Health Service (NHS) facilities Between January 2009 and December 2017, a total of 26,505 classified under Major Diagnostic Category (MDC) created for patients with HIV infection, with HIV/AIDS as a main or secondary cause of admission, were used to predict length of stay among HIV hospitalizations in Portugal. Several strategies were applied to select the best count fit model that includes the Poisson regression model, zero-inflated Poisson, the negative binomial regression model, and zero-inflated negative binomial regression model. A random hospital effects term has been incorporated into the negative binomial model to examine the dependence between observations within the same hospital. A multivariable analysis has been performed to assess the effect of covariates on length of stay. Results The median length of stay in our study was 11 days (interquartile range: 6–22). Statistical comparisons among the count models revealed that the random-effects negative binomial models provided the best fit with observed data. Admissions among males or admissions associated with TB infection, pneumocystis, cytomegalovirus, candidiasis, toxoplasmosis, or mycobacterium disease exhibit a highly significant increase in length of stay. Perfect trends were observed in which a higher number of diagnoses or procedures lead to significantly higher length of stay. The random-effects term included in our model and refers to unexplained factors specific to each hospital revealed obvious differences in quality among the hospitals included in our study. Conclusions This study provides a comprehensive approach to address unique problems associated with the prediction of length of stay among HIV patients in Portugal.

Download Full-text

Managing Inflation

Crime & Delinquency ◽

10.1177/0011128716679796 ◽

2016 ◽

Vol 63 (1) ◽

pp. 77-87 ◽

Cited By ~ 5

Author(s):

William H. Fisher ◽

Stephanie W. Hartwell ◽

Xiaogang Deng

Keyword(s):

Count Data ◽

Regression Models ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Excess Zeros ◽

Binomial Regression ◽

Negative Binomial Models ◽

Using Data ◽

Over Dispersion ◽

Binomial Models

Poisson and negative binomial regression procedures have proliferated, and now are available in virtually all statistical packages. Along with the regression procedures themselves are procedures for addressing issues related to the over-dispersion and excessive zeros commonly observed in count data. These approaches, zero-inflated Poisson and zero-inflated negative binomial models, use logit or probit models for the “excess” zeros and count regression models for the counted data. Although these models are often appropriate on statistical grounds, their interpretation may prove substantively difficult. This article explores this dilemma, using data from a study of individuals released from facilities maintained by the Massachusetts Department of Correction.

Download Full-text

A Combined PLS and Negative Binomial Regression Model for Inferring Association Networks from Next-Generation Sequencing Count Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2017.2665495 ◽

2018 ◽

Vol 15 (3) ◽

pp. 760-773 ◽

Cited By ~ 2

Author(s):

Maiju Pesonen ◽

Jaakko Nevalainen ◽

Steven Potter ◽

Somnath Datta ◽

Susmita Datta

Keyword(s):

Next Generation Sequencing ◽

Regression Model ◽

Count Data ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Negative Binomial Regression Model ◽

Next Generation ◽

Binomial Regression ◽

Generation Sequencing

Download Full-text

PEMODELAN DENGAN GEOGRAPHICALLY WEIGHTED NEGATIVE BINOMIAL REGRESSION (Studi kasus: Banyaknya Penderita Kusta di Jawa Barat)

Xplore Journal of Statistics ◽

10.29244/xplore.v10i3.833 ◽

2021 ◽

Vol 10 (3) ◽

pp. 226-236

Author(s):

Khusnul Khotimah ◽

Itasia Dina Sulvianti ◽

Pika Silvianti

Keyword(s):

Regression Model ◽

Count Data ◽

Poisson Regression ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Kernel Weight ◽

Negative Binomial Regression Model ◽

West Java ◽

Binomial Regression ◽

Spatial Heterogenity

The number of leper in West Java is an example of the count data case. The analyzes commonly used in count data is Poisson regression. This research will determine the variables that influence the number of leper in West Java. The data used is the number of leper in West Java in 2019. This data has an overdispersion condition and spatial heterogenity. To handle overdispersion, the negative binomial regression model can be employed. While spatial heterogenity is overcome by adding adaptive bisquare kernel weight. This research resulted Geographically Weighted Negative Binomial Regression (GWNBR) with a weighting adaptive bisquare kernel classifies regency/city in West Java into ten groups based on the variables that sigfinicantly influence the number of leper. In general, the variable in the percentage of households with Clean and Healthy Behavior (PHBS) has a significant effect in all regency/city in West Java. Especially for Bogor Regency, Depok City, Bogor City, and Pangandaran Regency, the variable of the percentage of people poverty does not have a significant effect on the number leper.

Download Full-text

PENERAPAN REGRESI BINOMIAL NEGATIF UNTUK MENGATASI OVERDISPERSI PADA REGRESI POISSON

E-Jurnal Matematika ◽

10.24843/mtk.2013.v02.i02.p031 ◽

2013 ◽

Vol 2 (2) ◽

pp. 6

Author(s):

PUTU SUSAN PRADAWATI ◽

KOMANG GDE SUKARSA ◽

I GUSTI AYU MADE SRINADI

Keyword(s):

Regression Analysis ◽

Regression Model ◽

Poisson Regression ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Poisson Regression Model ◽

Response Variable ◽

Poisson Regression Analysis ◽

Binomial Regression ◽

The Mean

Poisson regression was used to analyze the count data which Poisson distributed. Poisson regression analysis requires state equidispersion, in which the mean value of the response variable is equal to the value of the variance. However, there are deviations in which the value of the response variable variance is greater than the mean. This is called overdispersion. If overdispersion happens and Poisson Regression analysis is being used, then underestimated standard errors will be obtained. Negative Binomial Regression can handle overdispersion because it contains a dispersion parameter. From the simulation data which experienced overdispersion in the Poisson Regression model it was found that the Negative Binomial Regression was better than the Poisson Regression model.

Download Full-text

Modeling COVID-19 Cases in Nigeria Using Some Selected Count Data Regression Models

International Journal of Healthcare and Medical Sciences ◽

10.32861/ijhms.64.64.73 ◽

2020 ◽

pp. 64-73

Author(s):

Samuel Olorunfemi Adams ◽

Muhammad Ardo Bamanga ◽

Samuel Olayemi Olanrewaju ◽

Haruna Umar Yahaya ◽

Rafiu Olayinka Akano

Keyword(s):

Regression Model ◽

Poisson Regression ◽

Regression Models ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Significance Level ◽

Generalized Poisson ◽

Binomial Regression ◽

Selection Test ◽

Generalized Poisson Regression

COVID-19 is currently threatening countries in the world. Presently in Nigeria, there are about 29,286 confirmed cases, 11,828 discharged and 654 deaths as of 6th July 2020. It is against this background that this study was targeted at modeling daily cases of COVID-19’s deaths in Nigeria using count regression models like; Poisson Regression (PR), Negative Binomial Regression (NBR) and Generalized Poisson Regression (GPR) model. The study aim at fitting an appropriate count Regression model to the confirmed, active and critical cases of COVID-19 in Nigeria after 118 days. The data for the study was extracted from the daily COVID-19 cases update released by the Nigeria Centre for Disease Control (NCDC) online database from February 28th, 2020 – 6th, July 2020. The extracted data were used in the simulation of Poisson, Negative Binomial, and Generalized Poisson Regression model with a program written in STATA version 14 and fitted to the data at a 5% significance level. The best model was selected based on the values of -2logL, AIC, and BIC selection test/criteria. The results obtained from the analysis revealed that the Poisson regression could not capture over-dispersion, so other forms of Poisson Regression models such as the Negative Binomial Regression and Generalized Poisson Regression were used in the estimation. Of the three count Regression models, Generalized Poisson Regression was the best model for fitting daily cumulative confirmed, active and critical COVID-19 cases in Nigeria when overdispersion is present in the predictors because it had the least -2log-Likelihood, AIC, and BIC. It was also discovered that active and critical cases have a positive and significant effect on the number of COVID-19 related deaths in Nigeria.

Download Full-text

Non-Central Negative Binomial Regression Model for Count Data

2020 8th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO) ◽

10.1109/icrito48877.2020.9197974 ◽

2020 ◽

Author(s):

Anwar Hassan ◽

Ishfaq S. Ahmad ◽

Peer Bilal Ahmad

Keyword(s):

Regression Model ◽

Count Data ◽

Negative Binomial ◽

Negative Binomial Regression ◽

Negative Binomial Regression Model ◽

Binomial Regression

Download Full-text