clustered binary data
Recently Published Documents


TOTAL DOCUMENTS

61
(FIVE YEARS 5)

H-INDEX

14
(FIVE YEARS 1)

PLoS ONE ◽  
2021 ◽  
Vol 16 (4) ◽  
pp. e0249604
Author(s):  
Chénangnon Frédéric Tovissodé ◽  
Aliou Diop ◽  
Romain Glèlè Kakaï

Binary Generalized Linear Mixed Model (GLMM) is the most common method used by researchers to analyze clustered binary data in biological and social sciences. The traditional approach to GLMMs causes substantial bias in estimates due to steady shape of logistic and normal distribution assumptions thereby resulting into wrong and misleading decisions. This study brings forward an approach governed by skew generalized t distributions that belong to a class of potentially skewed and heavy tailed distributions. Interestingly, both the traditional logistic and probit mixed models, as well as other available methods can be utilized within the skew generalized t-link model (SGTLM) frame. We have taken advantage of the Expectation-Maximization algorithm accelerated via parameter-expansion for model fitting. We evaluated the performance of this approach to GLMMs through a simulation experiment by varying sample size and data distribution. Our findings indicated that the proposed methodology outperforms competing approaches in estimating population parameters and predicting random effects, when the traditional link and normality assumptions are violated. In addition, empirical standard errors and information criteria proved useful for detecting spurious skewness and avoiding complex models for probit data. An application with respiratory infection data points out to the superiority of the SGTLM which turns to be the most adequate model. In future, studies should focus on integrating the demonstrated flexibility in other generalized linear mixed models to enhance robust modeling.


2021 ◽  
Vol 14 (3) ◽  
pp. 295-308
Author(s):  
Rong Xia ◽  
Christopher R. Friese ◽  
Mousumi Banerjee

2020 ◽  
Vol 29 (10) ◽  
pp. 3006-3018 ◽  
Author(s):  
Guogen Shan

Clustered binary data are commonly encountered in many medical research studies with several binary outcomes from each cluster. Asymptotic methods are traditionally used for confidence interval calculations. However, these intervals often have unsatisfactory performance with regards to coverage for a study with a small sample size or the actual proportion near the boundary. To improve the coverage probability, exact Buehler’s one-sided intervals may be utilized, but they are computationally intensive in this setting. Therefore, we propose using importance sampling to calculate confidence intervals that almost always guarantee the coverage. We conduct extensive simulation studies to compare the performance of the existing asymptotic intervals and the new accurate intervals using importance sampling. The new intervals based on the asymptotic Wilson score for sample space ordering perform better than others, and they are recommended for use in practice.


2019 ◽  
Vol 34 (4) ◽  
pp. 1765-1778
Author(s):  
Bei Wang ◽  
Yi Zheng ◽  
Kyle M. Irimata ◽  
Jeffrey R. Wilson

2018 ◽  
Vol 61 (3) ◽  
pp. 574-599
Author(s):  
Josep L. Carrasco ◽  
Yi Pan ◽  
Rosa Abellana

2017 ◽  
Vol 37 (6) ◽  
pp. 899-913 ◽  
Author(s):  
Stephen T. Wright ◽  
Louise M. Ryan ◽  
Tung Pham

2016 ◽  
Vol 27 (9) ◽  
pp. 2641-2656
Author(s):  
John Kwagyan ◽  
Victor Apprey

We establish a zero-inflated (random-effects) logistic-Gaussian model for clustered binary data in which members of clusters in one latent class have a zero response with probability one, and members of clusters in a second latent class yield correlated outcomes. Response probabilities in terms of random-effects models are formulated, and maximum marginal likelihood estimation procedures based on Gaussian quadrature are developed. Application to esophageal cancer data in Chinese families is presented.


Sign in / Sign up

Export Citation Format

Share Document