A37 Exact logistic regression for small-sample clustered binary data

1993 ◽  
Vol 14 (5) ◽  
pp. 414
Author(s):  
Cyrus R. Mehta
2020 ◽  
Vol 29 (10) ◽  
pp. 3006-3018 ◽  
Author(s):  
Guogen Shan

Clustered binary data are commonly encountered in many medical research studies with several binary outcomes from each cluster. Asymptotic methods are traditionally used for confidence interval calculations. However, these intervals often have unsatisfactory performance with regards to coverage for a study with a small sample size or the actual proportion near the boundary. To improve the coverage probability, exact Buehler’s one-sided intervals may be utilized, but they are computationally intensive in this setting. Therefore, we propose using importance sampling to calculate confidence intervals that almost always guarantee the coverage. We conduct extensive simulation studies to compare the performance of the existing asymptotic intervals and the new accurate intervals using importance sampling. The new intervals based on the asymptotic Wilson score for sample space ordering perform better than others, and they are recommended for use in practice.


2018 ◽  
Vol 61 (3) ◽  
pp. 574-599
Author(s):  
Josep L. Carrasco ◽  
Yi Pan ◽  
Rosa Abellana

Author(s):  
Phoebe Harpainter ◽  
Sridharshi C. Hewawitharana ◽  
Danielle L. Lee ◽  
Anna C. Martin ◽  
Wendi Gosliner ◽  
...  

Many quick-service restaurants (QSRs) instituted voluntary kids’ meal default beverage standards (standards) between 2013 to 2017. Little is known about impacts of standards on QSR drive-through practices and on customer choices. This study assessed differences in restaurant practices including kids’ meal beverages shown on menu boards, offered by cashiers, and selected by customers in QSRs with and without voluntary standards. Observations (n = 111) and customer surveys (n = 84) were conducted in 2018 at QSRs with standards (n = 70) and without (n = 41) in low-income California, U.S. neighborhoods. Kids’ meal beverages on menu boards (n = 149) and offered by cashiers (n = 185) at QSRs with and without standards were analyzed using multilevel logistic regression. Significantly more menu boards at QSRs with standards (n = 103) vs. without (n = 46) featured only milk, water or unsweetened juice (65.1% vs. 4.4%; p < 0.001). Most cashiers at QSRs with standards and QSRs without (53.1%, 62.5%) asked what drink the data collector wanted rather than first offering default beverages. A small sample of customer interviews found that customers at QSRs with standards most commonly ordered juice (37.0%); at QSRs without standards, soda (45.5%). Although menu boards showed healthier kids’ meal beverages at QSRs with standards than without, cashier behavior was inconsistent. Results suggest additional measures (legislation, implementation support, enforcement) may be needed to ensure optimal implementation.


2020 ◽  
Vol 12 (18) ◽  
pp. 2954
Author(s):  
Yue Wan ◽  
Jingxiong Zhang ◽  
Wenjing Yang ◽  
Yunwei Tang

Due to spatial inhomogeneity of land-cover types and spectral confusions among them, land-cover maps suffer from misclassification errors. While much research has focused on improving image classification by re-processing source images with more advanced algorithms and/or using images of finer resolution, there is rarely any systematic work on re-processing existing maps to increase their accuracy. We propose refining existing maps to achieve accuracy gains by exploring and utilizing relationships between reference data, which are often already available or can be collected, and map data. For this, we make novel use of canonical correspondence analysis (CCA) to analyze reference-map class co-occurrences to facilitate probabilistic re-classification of map classes in CCA ordination space, a synthesized feature space constrained by map class occurrence patterns. Experiments using GlobeLand30 land-cover (2010) over Wuhan, China were carried out using reference sample data collected previously for accuracy assessment in the same area. Reference sample data were stratified by map classes and their spatial heterogeneity. To examine effects of model-training sample size on refinements, three subset samples (360, 720, and 1480 pixels) were selected from a pool of 3000 sample pixels (the full training sample). Logistic regression modeling was employed as a baseline method for comparisons. Performance evaluation was based on a test sample of 1020 pixels using a strict and relaxed definitions of agreement between reference classification and map classification, resulting in measures of types I and II, respectively. It was found that the CCA-based method is more accurate than logistic regression in general. With increasing sample sizes, refinements generally lead to greater accuracy gains. Heterogeneous sub-strata usually see greater accuracy gains than in homogeneous sub-strata. It was also revealed that accuracy gains in specific strata (map classes and sub-strata) are related to strata refinability. Regarding CCA-based refinements, a relatively small sample of 360 pixels achieved a 3% gain in both overall accuracy (OA) and F0.01 score (II). By using a selective strategy in which only refinable strata of cultivated land and forest are included in refinement, accuracy gains are further increased, with 5–11% gains in users’ accuracies (UAs) (II) and 4–10% gains in F0.01 scores (II). In conclusion, on condition of refinability, map refinement is well worth pursuing, as it increases accuracy of existing maps, extends utility of reference data, facilitates uncertainty-informed map representation, and enhances our understanding about relationships between reference data and map data and about their synthesis.


Sign in / Sign up

Export Citation Format

Share Document