A37 Exact logistic regression for small-sample clustered binary data

Clustered binary data are commonly encountered in many medical research studies with several binary outcomes from each cluster. Asymptotic methods are traditionally used for confidence interval calculations. However, these intervals often have unsatisfactory performance with regards to coverage for a study with a small sample size or the actual proportion near the boundary. To improve the coverage probability, exact Buehler’s one-sided intervals may be utilized, but they are computationally intensive in this setting. Therefore, we propose using importance sampling to calculate confidence intervals that almost always guarantee the coverage. We conduct extensive simulation studies to compare the performance of the existing asymptotic intervals and the new accurate intervals using importance sampling. The new intervals based on the asymptotic Wilson score for sample space ordering perform better than others, and they are recommended for use in practice.

Download Full-text

Logistic Regression for Clustered Binary Data in Proband Studies with Application to Familial Aggregation of Sleep Disorders

Biometrics ◽

10.2307/2532384 ◽

1991 ◽

Vol 47 (4) ◽

pp. 1257 ◽

Cited By ~ 14

Author(s):

Tor D. Tosteson ◽

Bernard Rosner ◽

Susan Redline

Keyword(s):

Logistic Regression ◽

Sleep Disorders ◽

Binary Data ◽

Familial Aggregation ◽

Clustered Binary Data

Download Full-text

Robust Bayesian approach to logistic regression modeling in small sample size utilizing a weakly informative student’s t prior distribution

Communication in Statistics- Theory and Methods ◽

10.1080/03610926.2021.1912767 ◽

2021 ◽

pp. 1-11

Author(s):

Kenneth Chukwuemeka Asanya ◽

Mohamed Kharrat ◽

Akaninyene Udo Udom ◽

Emmanuel Torsen

Keyword(s):

Logistic Regression ◽

Sample Size ◽

Bayesian Approach ◽

Prior Distribution ◽

Small Sample Size ◽

Regression Modeling ◽

Small Sample ◽

Logistic Regression Modeling ◽

Student’S T

Download Full-text

Conservative confidence intervals for the intraclass correlation coefficient for clustered binary data

Journal of Applied Statistics ◽

10.1080/02664763.2021.1910939 ◽

2021 ◽

pp. 1-15

Author(s):

Guogen Shan

Keyword(s):

Correlation Coefficient ◽

Confidence Intervals ◽

Intraclass Correlation Coefficient ◽

Binary Data ◽

Intraclass Correlation ◽

Clustered Binary Data

Download Full-text

Analysis of Clustered Binary Data

Encyclopedia of Biopharmaceutical Statistics, Third Edition ◽

10.1201/b14674-11 ◽

2012 ◽

pp. 58-62

Author(s):

Valerie Durkalski

Keyword(s):

Binary Data ◽

Clustered Binary Data

Download Full-text

Estimating marginal proportions and intraclass correlations with clustered binary data

Biometrical Journal ◽

10.1002/bimj.201700230 ◽

2018 ◽

Vol 61 (3) ◽

pp. 574-599

Author(s):

Josep L. Carrasco ◽

Yi Pan ◽

Rosa Abellana

Keyword(s):

Binary Data ◽

Clustered Binary Data ◽

Intraclass Correlations

Download Full-text

Model-Based Classification of Clustered Binary Data with Non-ignorable Missing Values

Advances in Theoretical and Applied Statistics ◽

10.1007/978-3-642-35588-2_15 ◽

2013 ◽

pp. 155-165 ◽

Cited By ~ 1

Author(s):

Francesco Lagona

Keyword(s):

Binary Data ◽

Missing Values ◽

Clustered Binary Data ◽

Model Based

Download Full-text

Voluntary Kids’ Meal Beverage Standards: Are They Sufficient to Ensure Healthier Restaurant Practices and Consumer Choices?

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph17155275 ◽

2020 ◽

Vol 17 (15) ◽

pp. 5275

Author(s):

Phoebe Harpainter ◽

Sridharshi C. Hewawitharana ◽

Danielle L. Lee ◽

Anna C. Martin ◽

Wendi Gosliner ◽

...

Keyword(s):

Logistic Regression ◽

Low Income ◽

Small Sample ◽

Multilevel Logistic Regression ◽

Implementation Support ◽

Voluntary Standards ◽

Data Collector ◽

Consumer Choices ◽

Quick Service ◽

Optimal Implementation

Many quick-service restaurants (QSRs) instituted voluntary kids’ meal default beverage standards (standards) between 2013 to 2017. Little is known about impacts of standards on QSR drive-through practices and on customer choices. This study assessed differences in restaurant practices including kids’ meal beverages shown on menu boards, offered by cashiers, and selected by customers in QSRs with and without voluntary standards. Observations (n = 111) and customer surveys (n = 84) were conducted in 2018 at QSRs with standards (n = 70) and without (n = 41) in low-income California, U.S. neighborhoods. Kids’ meal beverages on menu boards (n = 149) and offered by cashiers (n = 185) at QSRs with and without standards were analyzed using multilevel logistic regression. Significantly more menu boards at QSRs with standards (n = 103) vs. without (n = 46) featured only milk, water or unsweetened juice (65.1% vs. 4.4%; p < 0.001). Most cashiers at QSRs with standards and QSRs without (53.1%, 62.5%) asked what drink the data collector wanted rather than first offering default beverages. A small sample of customer interviews found that customers at QSRs with standards most commonly ordered juice (37.0%); at QSRs without standards, soda (45.5%). Although menu boards showed healthier kids’ meal beverages at QSRs with standards than without, cashier behavior was inconsistent. Results suggest additional measures (legislation, implementation support, enforcement) may be needed to ensure optimal implementation.

Download Full-text

Refining Land-Cover Maps Based on Probabilistic Re-Classification in CCA Ordination Space

Remote Sensing ◽

10.3390/rs12182954 ◽

2020 ◽

Vol 12 (18) ◽

pp. 2954

Author(s):

Yue Wan ◽

Jingxiong Zhang ◽

Wenjing Yang ◽

Yunwei Tang

Keyword(s):

Logistic Regression ◽

Land Cover ◽

Reference Data ◽

Reference Sample ◽

Training Sample ◽

Small Sample ◽

Cca Ordination ◽

Sample Data ◽

Map Data ◽

Land Cover Maps

Due to spatial inhomogeneity of land-cover types and spectral confusions among them, land-cover maps suffer from misclassification errors. While much research has focused on improving image classification by re-processing source images with more advanced algorithms and/or using images of finer resolution, there is rarely any systematic work on re-processing existing maps to increase their accuracy. We propose refining existing maps to achieve accuracy gains by exploring and utilizing relationships between reference data, which are often already available or can be collected, and map data. For this, we make novel use of canonical correspondence analysis (CCA) to analyze reference-map class co-occurrences to facilitate probabilistic re-classification of map classes in CCA ordination space, a synthesized feature space constrained by map class occurrence patterns. Experiments using GlobeLand30 land-cover (2010) over Wuhan, China were carried out using reference sample data collected previously for accuracy assessment in the same area. Reference sample data were stratified by map classes and their spatial heterogeneity. To examine effects of model-training sample size on refinements, three subset samples (360, 720, and 1480 pixels) were selected from a pool of 3000 sample pixels (the full training sample). Logistic regression modeling was employed as a baseline method for comparisons. Performance evaluation was based on a test sample of 1020 pixels using a strict and relaxed definitions of agreement between reference classification and map classification, resulting in measures of types I and II, respectively. It was found that the CCA-based method is more accurate than logistic regression in general. With increasing sample sizes, refinements generally lead to greater accuracy gains. Heterogeneous sub-strata usually see greater accuracy gains than in homogeneous sub-strata. It was also revealed that accuracy gains in specific strata (map classes and sub-strata) are related to strata refinability. Regarding CCA-based refinements, a relatively small sample of 360 pixels achieved a 3% gain in both overall accuracy (OA) and F0.01 score (II). By using a selective strategy in which only refinable strata of cultivated land and forest are included in refinement, accuracy gains are further increased, with 5–11% gains in users’ accuracies (UAs) (II) and 4–10% gains in F0.01 scores (II). In conclusion, on condition of refinability, map refinement is well worth pursuing, as it increases accuracy of existing maps, extends utility of reference data, facilitates uncertainty-informed map representation, and enhances our understanding about relationships between reference data and map data and about their synthesis.

Download Full-text