scholarly journals Handling of Overdispersion in the Poisson Regression Model with Negative Binomial for the Number of New Cases of Leprosy in Java

2021 ◽  
Vol 5 (1) ◽  
pp. 1-13
Author(s):  
Yopi Ariesia Ulfa ◽  
Agus M Soleh ◽  
Bagus Sartono

Based on data from the Directorate General of Disease Prevention and Control of the Ministry of Health of the Republic of Indonesia, in 2017, new leprosy cases that emerged on Java Island were the highest in Indonesia compared to the number of events on other islands. The purpose of this study is to compare Poisson regression to a negative binomial regression model to be applied to the data on the number of new cases of leprosy and to find out what explanatory variables have a significant effect on the number of new cases of leprosy in Java. This study's results indicate that a negative binomial regression model can overcome the Poisson regression model's overdispersion. Variables that significantly affect the number of new cases of leprosy based on the results of negative binomial regression modeling are total population, percentage of children under five years who had immunized with BCG, and percentage of the population with sustainable access to clean water.

2021 ◽  
Vol 10 (3) ◽  
pp. 226-236
Author(s):  
Khusnul Khotimah ◽  
Itasia Dina Sulvianti ◽  
Pika Silvianti

The number of leper in West Java is an example of the count data case. The analyzes commonly used in count data is Poisson regression. This research will determine the variables that influence the number of leper in West Java. The data used is the number of leper in West Java in 2019. This data has an overdispersion condition and spatial heterogenity. To handle overdispersion, the negative binomial regression model can be employed. While spatial heterogenity is overcome by adding adaptive bisquare kernel weight. This research resulted Geographically Weighted Negative Binomial Regression (GWNBR) with a weighting adaptive bisquare kernel classifies regency/city in West Java into ten groups based on the variables that sigfinicantly influence the number of leper. In general, the variable in the percentage of households with Clean and Healthy Behavior (PHBS) has a significant effect in all regency/city in West Java. Especially for Bogor Regency, Depok City, Bogor City, and Pangandaran Regency, the variable of the percentage of people poverty does not have a significant effect on the number leper.


2019 ◽  
Vol 8 (1) ◽  
Author(s):  
Lulu Mahdiyah Sandjadirja ◽  
Muhammad Nur Aidi ◽  
Akbar Rizki

Poisson regression can be used to model rare events that consist of count data. Poisson regression application is carried out to find out external factors that affect the number of poor people in Indonesia by the province in 2016. The assumptions that must be met in this analysis are equdispersion. However, in real cases there is often a problem of overdispersion, ie the value of the variance is greater than the average value. High diversity can be caused by outliers. Expenditures on outliers have not been able to deal with the problem of overdispersion in Poisson Regression. One way to overcome this problem is to replace the Poisson distribution assumption with the Negative Binomial distribution. The results of the analysis show that the Negative Binomial Regression model without outliers is better than the Poisson Regression without outliers model indicated by a smaller AIC value. Based on the Negative Binomial Regression model without this outlier the external factors that affect the number of poor people in Indonesia by the province in 2016 are the percentage of households with floor conditions of houses with soil by province, population by province, percentage of unemployment to the total workforce by province and the percentage of the workforce against the working age population.


2021 ◽  
Vol 13 (2) ◽  
pp. 57
Author(s):  
Kristy Kristy ◽  
Jajang Jajang ◽  
Nunung Nurhayati

Tuberculosis is an infectious disease caused by Mycobacterium tuberculosis. Banyumas Regency is one of the districts with quite high Tuberculosis cases in Central Java. This study aims to analyze the factors that affect the number of tuberculosis cases in Banyumas Regency using regression analysis of count data. Poisson regression is the simplest count data regression model that has the assumption of equidispersion, that is, the mean value equal to the variance. However, in its application, these assumption is often not fulfilled, for example, there are cases of overdispersion (variance value is greater than the mean). In this study, to overcome the case of overdispersion, an approach was used using Generalized Poisson Regression (GPR) and negative binomial regression. The results showed that the data on the number of tuberculosis cases in Banyumas Regency in 2019 was overdispersion. The data modeling of the number of tuberculosis cases in Banyumas Regency with the negative binomial regression model is better than the GPR model. Meanwhile, the only predictor variable that affects the number of tuberculosis cases in Banyumas Regency is the sex ratio of productive age (15-49 years).


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Ahmed Nabil Shaaban ◽  
Bárbara Peleteiro ◽  
Maria Rosario O. Martins

Abstract Background This study offers a comprehensive approach to precisely analyze the complexly distributed length of stay among HIV admissions in Portugal. Objective To provide an illustration of statistical techniques for analysing count data using longitudinal predictors of length of stay among HIV hospitalizations in Portugal. Method Registered discharges in the Portuguese National Health Service (NHS) facilities Between January 2009 and December 2017, a total of 26,505 classified under Major Diagnostic Category (MDC) created for patients with HIV infection, with HIV/AIDS as a main or secondary cause of admission, were used to predict length of stay among HIV hospitalizations in Portugal. Several strategies were applied to select the best count fit model that includes the Poisson regression model, zero-inflated Poisson, the negative binomial regression model, and zero-inflated negative binomial regression model. A random hospital effects term has been incorporated into the negative binomial model to examine the dependence between observations within the same hospital. A multivariable analysis has been performed to assess the effect of covariates on length of stay. Results The median length of stay in our study was 11 days (interquartile range: 6–22). Statistical comparisons among the count models revealed that the random-effects negative binomial models provided the best fit with observed data. Admissions among males or admissions associated with TB infection, pneumocystis, cytomegalovirus, candidiasis, toxoplasmosis, or mycobacterium disease exhibit a highly significant increase in length of stay. Perfect trends were observed in which a higher number of diagnoses or procedures lead to significantly higher length of stay. The random-effects term included in our model and refers to unexplained factors specific to each hospital revealed obvious differences in quality among the hospitals included in our study. Conclusions This study provides a comprehensive approach to address unique problems associated with the prediction of length of stay among HIV patients in Portugal.


PLoS ONE ◽  
2021 ◽  
Vol 16 (8) ◽  
pp. e0254479
Author(s):  
Ta-Chien Chan ◽  
Jia-Hong Tang ◽  
Cheng-Yu Hsieh ◽  
Kevin J. Chen ◽  
Tsan-Hua Yu ◽  
...  

Background Sentinel physician surveillance in communities has played an important role in detecting early signs of epidemics. The traditional approach is to let the primary care physician voluntarily and actively report diseases to the health department on a weekly basis. However, this is labor-intensive work, and the spatio-temporal resolution of the surveillance data is not precise at all. In this study, we built up a clinic-based enhanced sentinel surveillance system named “Sentinel plus” which was designed for sentinel clinics and community hospitals to monitor 23 kinds of syndromic groups in Taipei City, Taiwan. The definitions of those syndromic groups were based on ICD-10 diagnoses from physicians. Methods Daily ICD-10 counts of two syndromic groups including ILI and EV-like syndromes in Taipei City were extracted from Sentinel plus. A negative binomial regression model was used to couple with lag structure functions to examine the short-term association between ICD counts and meteorological variables. After fitting the negative binomial regression model, residuals were further rescaled to Pearson residuals. We then monitored these daily standardized Pearson residuals for any aberrations from July 2018 to October 2019. Results The results showed that daily average temperature was significantly negatively associated with numbers of ILI syndromes. The ozone and PM2.5 concentrations were significantly positively associated with ILI syndromes. In addition, daily minimum temperature, and the ozone and PM2.5 concentrations were significantly negatively associated with the EV-like syndromes. The aberrational signals detected from clinics for ILI and EV-like syndromes were earlier than the epidemic period based on outpatient surveillance defined by the Taiwan CDC. Conclusions This system not only provides warning signals to the local health department for managing the risks but also reminds medical practitioners to be vigilant toward susceptible patients. The near real-time surveillance can help decision makers evaluate their policy on a timely basis.


Sign in / Sign up

Export Citation Format

Share Document