scholarly journals PENERAPAN REGRESI BINOMIAL NEGATIF UNTUK MENGATASI OVERDISPERSI PADA REGRESI POISSON

2013 ◽  
Vol 2 (2) ◽  
pp. 6
Author(s):  
PUTU SUSAN PRADAWATI ◽  
KOMANG GDE SUKARSA ◽  
I GUSTI AYU MADE SRINADI

Poisson regression was used to analyze the count data which Poisson distributed. Poisson regression analysis requires state equidispersion, in which the mean value of the response variable is equal to the value of the variance. However, there are deviations in which the value of the response variable variance is greater than the mean. This is called overdispersion. If overdispersion happens and Poisson Regression analysis is being used, then underestimated standard errors will be obtained. Negative Binomial Regression can handle overdispersion because it contains a dispersion parameter. From the simulation data which experienced overdispersion in the Poisson Regression model it was found that the Negative Binomial Regression was better than the Poisson Regression model.

2014 ◽  
Vol 3 (3) ◽  
pp. 107 ◽  
Author(s):  
NI MADE RARA KESWARI ◽  
I WAYAN SUMARJAYA ◽  
NI LUH PUTU SUCIPTAWATI

Poisson regression is a nonlinear regression that is often used to model count response variable and categorical, interval, or count regressor. This regression assumes equidispersion, i.e., the variance equals the mean. However, in practice, this assumption is often violated. One of this violation is overdispersion in which the variance is greater than the mean. There are several  methods to overcome overdispersion. Two of these methods are negative binomial regression and generalized Poisson regression. In this research, binomial negative regression and generalized Poisson regression statistically equally good in handling overdispersion.


Author(s):  
Mohammad Mirjani Arjenan ◽  
Mohsen Askarshahi ◽  
Mahmud Vakili

Introduction: Despite the advances in cardiovascular diseases, death caused by these diseases is still considered as the leading cause of mortality. In this study, some of the effective factors on the deaths caused by cardiovascular diseases were investigated Methods: This cross-sectional analytical study investigated the efficacy of Poisson regression models and negative binomial regression models on factors affecting mortality from cardiovascular diseases. The death data were extracted from the death registration system for Yazd province in 2017.Gender, age, education, occupation, location, and city of death were also extracted for each deceased. The two regression models were then fitted to the data Results:  A total of 5,015 deaths were recorded, of which 1,642 were due to cardiovascular diseases. Cardiovascular disease mortality rates were significant using negative binomial regression in terms of the educational variables, place of residence, type of residence, and age. Death rates caused by cardiovascular diseases were not significant for age and occupational, educational, and residential variables. Conclusion: If the time of death is considered as an offset variable, the regression model of two negative sentences is more effective in showing the factors affecting death due to cardiovascular diseases according to AIC and BIC criteria. In the case that the total number of deaths is considered as the offset variable, the Poisson regression model is more efficient.


2021 ◽  
Vol 13 (2) ◽  
pp. 57
Author(s):  
Kristy Kristy ◽  
Jajang Jajang ◽  
Nunung Nurhayati

Tuberculosis is an infectious disease caused by Mycobacterium tuberculosis. Banyumas Regency is one of the districts with quite high Tuberculosis cases in Central Java. This study aims to analyze the factors that affect the number of tuberculosis cases in Banyumas Regency using regression analysis of count data. Poisson regression is the simplest count data regression model that has the assumption of equidispersion, that is, the mean value equal to the variance. However, in its application, these assumption is often not fulfilled, for example, there are cases of overdispersion (variance value is greater than the mean). In this study, to overcome the case of overdispersion, an approach was used using Generalized Poisson Regression (GPR) and negative binomial regression. The results showed that the data on the number of tuberculosis cases in Banyumas Regency in 2019 was overdispersion. The data modeling of the number of tuberculosis cases in Banyumas Regency with the negative binomial regression model is better than the GPR model. Meanwhile, the only predictor variable that affects the number of tuberculosis cases in Banyumas Regency is the sex ratio of productive age (15-49 years).


2012 ◽  
Vol 57 (1) ◽  
Author(s):  
SEYED EHSAN SAFFAR ◽  
ROBIAH ADNAN ◽  
WILLIAM GREENE

A Poisson model typically is assumed for count data. In many cases, there are many zeros in the dependent variable and because of these many zeros, the mean and the variance values of the dependent variable are not the same as before. In fact, the variance value of the dependent variable will be much more than the mean value of the dependent variable and this is called over–dispersion. Therefore, Poisson model is not suitable anymore for this kind of data because of too many zeros. Thus, it is suggested to use a hurdle Poisson regression model to overcome over–dispersion problem. Furthermore, the response variable in such cases is censored for some values. In this paper, a censored hurdle Poisson regression model is introduced on count data with many zeros. In this model, we consider a response variable and one or more than one explanatory variables. The estimation of regression parameters using the maximum likelihood method is discussed and the goodness–of–fit for the regression model is examined. We study the effects of right censoring on estimated parameters and their standard errors via an example.


2021 ◽  
Vol 5 (1) ◽  
pp. 1-13
Author(s):  
Yopi Ariesia Ulfa ◽  
Agus M Soleh ◽  
Bagus Sartono

Based on data from the Directorate General of Disease Prevention and Control of the Ministry of Health of the Republic of Indonesia, in 2017, new leprosy cases that emerged on Java Island were the highest in Indonesia compared to the number of events on other islands. The purpose of this study is to compare Poisson regression to a negative binomial regression model to be applied to the data on the number of new cases of leprosy and to find out what explanatory variables have a significant effect on the number of new cases of leprosy in Java. This study's results indicate that a negative binomial regression model can overcome the Poisson regression model's overdispersion. Variables that significantly affect the number of new cases of leprosy based on the results of negative binomial regression modeling are total population, percentage of children under five years who had immunized with BCG, and percentage of the population with sustainable access to clean water.


2021 ◽  
Vol 10 (3) ◽  
pp. 226-236
Author(s):  
Khusnul Khotimah ◽  
Itasia Dina Sulvianti ◽  
Pika Silvianti

The number of leper in West Java is an example of the count data case. The analyzes commonly used in count data is Poisson regression. This research will determine the variables that influence the number of leper in West Java. The data used is the number of leper in West Java in 2019. This data has an overdispersion condition and spatial heterogenity. To handle overdispersion, the negative binomial regression model can be employed. While spatial heterogenity is overcome by adding adaptive bisquare kernel weight. This research resulted Geographically Weighted Negative Binomial Regression (GWNBR) with a weighting adaptive bisquare kernel classifies regency/city in West Java into ten groups based on the variables that sigfinicantly influence the number of leper. In general, the variable in the percentage of households with Clean and Healthy Behavior (PHBS) has a significant effect in all regency/city in West Java. Especially for Bogor Regency, Depok City, Bogor City, and Pangandaran Regency, the variable of the percentage of people poverty does not have a significant effect on the number leper.


Agriculture ◽  
2019 ◽  
Vol 9 (6) ◽  
pp. 117 ◽  
Author(s):  
Apri Wahyudi ◽  
John K. M. Kuwornu ◽  
Endro Gunawan ◽  
Avishek Datta ◽  
Loc T. Nguyen

This study assessed the factors influencing the frequency of purchases of locally-produced rice using data collected from a sample of 400 consumers in Jakarta Province in Indonesia. The empirical results of a Poisson regression model revealed that socio-economic characteristics of the consumers (i.e., gender, age, occupation, education, and income), characteristics of the product (i.e., label and color), and the product’s price and promotion significantly influenced consumers’ frequency of purchasing locally-produced rice. The implication is that increasing the quality of locally-produced rice, applying an appropriate marketing strategy such as offering a relatively lower-priced product compared to the price of imported rice, and product promotion are necessary for increasing the frequency of consumers’ purchases of locally-produced rice.


Author(s):  
Samuel Olorunfemi Adams ◽  
Muhammad Ardo Bamanga ◽  
Samuel Olayemi Olanrewaju ◽  
Haruna Umar Yahaya ◽  
Rafiu Olayinka Akano

COVID-19 is currently threatening countries in the world. Presently in Nigeria, there are about 29,286 confirmed cases, 11,828 discharged and 654 deaths as of 6th July 2020. It is against this background that this study was targeted at modeling daily cases of COVID-19’s deaths in Nigeria using count regression models like; Poisson Regression (PR), Negative Binomial Regression (NBR) and Generalized Poisson Regression (GPR) model. The study aim at fitting an appropriate count Regression model to the confirmed, active and critical cases of COVID-19 in Nigeria after 118 days. The data for the study was extracted from the daily COVID-19 cases update released by the Nigeria Centre for Disease Control (NCDC) online database from February 28th, 2020 – 6th, July 2020. The extracted data were used in the simulation of Poisson, Negative Binomial, and Generalized Poisson Regression model with a program written in STATA version 14 and fitted to the data at a 5% significance level. The best model was selected based on the values of -2logL, AIC, and BIC selection test/criteria. The results obtained from the analysis revealed that the Poisson regression could not capture over-dispersion, so other forms of Poisson Regression models such as the Negative Binomial Regression and Generalized Poisson Regression were used in the estimation. Of the three count Regression models, Generalized Poisson Regression was the best model for fitting daily cumulative confirmed, active and critical COVID-19 cases in Nigeria when overdispersion is present in the predictors because it had the least -2log-Likelihood, AIC, and BIC. It was also discovered that active and critical cases have a positive and significant effect on the number of COVID-19 related deaths in Nigeria.


Author(s):  
Luay Habeeb Hashim ◽  
Ahmad Naeem Flaih

28   Counts data models cope with the response variable counts, where the number of times that a certain event occurs in a fixed point is called count data, its observations consists of non-negative integers values {0,1,2,…}. Because of the nature of count data, the response variables are usually considered doing not follow normal distribution. Therefore, linear regression is not an appropriate method to analysis count data due to the skewed distribution. Hence, using linear regression model to analysis count data is likely to bias the results, under these limitations, Poisson regression model and “Negative binomial regression” are likely the appropriate models to analysis count data. Sometimes researchers may Counts more zeros than the expected. Count data with many Zeros leads to a concept called “Zero-inflation”. Data with abundant zeros are especially popular in health, marketing, finance, econometric, ecology, statistics quality control, geographical, and environmental fields when counting the occurrence of certain behavioral and natural events, such as frequency of alcohol use, take drugs, number of cigarettes smoked, the occurrence of earthquakes, rainfall, and etc. Some models have been used to analyzing count data such as the “zero- altered Poisson” (ZAP) model and the “negative binomial” model. In this paper, the models, Poisson, Negative Binomial, ZAP, and ZANB were been used to analyze rainfall data.


Sign in / Sign up

Export Citation Format

Share Document