intracluster correlation
Recently Published Documents


TOTAL DOCUMENTS

49
(FIVE YEARS 14)

H-INDEX

16
(FIVE YEARS 1)

Author(s):  
Monica Ewomazino Akokuwebe ◽  
Erhabor Sunday Idemudia

Background: An unhealthy body weight is an adverse effect of malnutrition associated with morbidity among women of childbearing age. While there is increasing attention being paid to the body weights of children and adolescents in Nigeria and South Africa, a major surge of unhealthy body weight in women has received less attention in both countries despite its predominance. The purpose of this study was to explore the prevalence of body weights (underweight, normal, overweight, and obese) and individual-level factors among women of childbearing age by urban–rural variations in Nigeria and South Africa. Methods: This study used the 2018 Nigeria Demographic Health Survey data (n = 41,821) and 2016 South Africa Demographic Health Survey (n = 8514). Bivariate, multilevel, and intracluster correlation coefficient analyses were used to determine individual-level factors associated with body weights across urban–rural variations. Results: The prevalence of being overweight or obese among women was 28.2% and 44.9%, respectively, in South Africa and 20.2% and 11.4% in Nigeria. A majority, 6.8%, of underweight women were rural residents in Nigeria compared to 0.8% in South Africa. The odds of being underweight were higher among women in Nigeria who were unemployed, with regional differences and according to breastfeeding status, while higher odds of being underweight were found among women from poorer households, with differences between provinces and according to cigarette smoking status in South Africa. On the other hand, significant odds of being overweight or obese among women in both Nigeria and South Africa were associated with increasing age, higher education, higher wealth index, weight above average, and traditional/modern contraceptive use. Unhealthy body weights were higher among women in clustering areas in Nigeria who were underweight (intracluster correlation coefficient (ICC = 0.0127), overweight (ICC = 0.0289), and obese (ICC = 0.1040). Similarly, women of childbearing age in clustering areas in South Africa had a lower risk of experiencing underweight (ICC = 0.0102), overweight (ICC = 0.0127), and obesity (ICC = 0.0819). Conclusions: These findings offer a deeper understanding of the close connection between body weights variations and individual factors. Addressing unhealthy body weights among women of childbearing age in Nigeria and South Africa is important in preventing disease burdens associated with body weights in promoting Sustainable Development Goal 3. Strategies for developing preventive sensitization interventions are imperative to extend the perspectives of the clustering effect of body weights on a country level when establishing social and behavioral modifications for body weight concerns in both countries.


2021 ◽  
pp. 174077452110598
Author(s):  
Lee Kennedy-Shaffer ◽  
Michael D Hughes

Background/Aims Generalized estimating equations are commonly used to fit logistic regression models to clustered binary data from cluster randomized trials. A commonly used correlation structure assumes that the intracluster correlation coefficient does not vary by treatment arm or other covariates, but the consequences of this assumption are understudied. We aim to evaluate the effect of allowing variation of the intracluster correlation coefficient by treatment or other covariates on the efficiency of analysis and show how to account for such variation in sample size calculations. Methods We develop formulae for the asymptotic variance of the estimated difference in outcome between treatment arms obtained when the true exchangeable correlation structure depends on the treatment arm and the working correlation structure used in the generalized estimating equations analysis is: (i) correctly specified, (ii) independent, or (iii) exchangeable with no dependence on treatment arm. These formulae require a known distribution of cluster sizes; we also develop simplifications for the case when cluster sizes do not vary and approximations that can be used when the first two moments of the cluster size distribution are known. We then extend the results to settings with adjustment for a second binary cluster-level covariate. We provide formulae to calculate the required sample size for cluster randomized trials using these variances. Results We show that the asymptotic variance of the estimated difference in outcome between treatment arms using these three working correlation structures is the same if all clusters have the same size, and this asymptotic variance is approximately the same when intracluster correlation coefficient values are small. We illustrate these results using data from a recent cluster randomized trial for infectious disease prevention in which the clusters are groups of households and modest in size (mean 9.6 individuals), with intracluster correlation coefficient values of 0.078 in the control arm and 0.057 in an intervention arm. In this application, we found a negligible difference between the variances calculated using structures (i) and (iii) and only a small increase (typically [Formula: see text]) for the independent correlation structure (ii), and hence minimal effect on power or sample size requirements. The impact may be larger in other applications if there is greater variation in the ICC between treatment arms or with an additional covariate. Conclusion The common approach of fitting generalized estimating equations with an exchangeable working correlation structure with a common intracluster correlation coefficient across arms likely does not substantially reduce the power or efficiency of the analysis in the setting of a large number of small or modest-sized clusters, even if the intracluster correlation coefficient varies by treatment arm. Our formulae, however, allow formal evaluation of this and may identify situations in which variation in intracluster correlation coefficient by treatment arm or another binary covariate may have a more substantial impact on power and hence sample size requirements.


2021 ◽  
pp. 174077452110466
Author(s):  
Monica Taljaard ◽  
Fan Li ◽  
Bo Qin ◽  
Caroline Cui ◽  
Leyi Zhang ◽  
...  

Background and Aims We need more pragmatic trials of interventions to improve care and outcomes for people living with Alzheimer’s disease and related dementias. However, these trials present unique methodological challenges in their design, analysis, and reporting—often, due to the presence of one or more sources of clustering. Failure to account for clustering in the design and analysis can lead to increased risks of Type I and Type II errors. We conducted a review to describe key methodological characteristics and obtain a “baseline assessment” of methodological quality of pragmatic trials in dementia research, with a view to developing new methods and practical guidance to support investigators and methodologists conducting pragmatic trials in this field. Methods We used a published search filter in MEDLINE to identify trials more likely to be pragmatic and identified a subset that focused on people living with Alzheimer’s disease or other dementias or included them as a defined subgroup. Pairs of reviewers extracted descriptive information and key methodological quality indicators from each trial. Results We identified N = 62 eligible primary trial reports published across 36 different journals. There were 15 (24%) individually randomized, 38 (61%) cluster randomized, and 9 (15%) individually randomized group treatment designs; 54 (87%) trials used repeated measures on the same individual and/or cluster over time and 17 (27%) had a multivariate primary outcome (e.g. due to measuring an outcome on both the patient and their caregiver). Of the 38 cluster randomized trials, 16 (42%) did not report sample size calculations accounting for the intracluster correlation and 13 (34%) did not account for intracluster correlation in the analysis. Of the 9 individually randomized group treatment trials, 6 (67%) did not report sample size calculations accounting for intracluster correlation and 8 (89%) did not account for it in the analysis. Of the 54 trials with repeated measurements, 45 (83%) did not report sample size calculations accounting for repeated measurements and 19 (35%) did not utilize at least some of the repeated measures in the analysis. No trials accounted for the multivariate nature of their primary outcomes in sample size calculation; only one did so in the analysis. Conclusion There is a need and opportunity to improve the design, analysis, and reporting of pragmatic trials in dementia research. Investigators should pay attention to the potential presence of one or more sources of clustering. While methods for longitudinal and cluster randomized trials are well developed, accessible resources and new methods for dealing with multiple sources of clustering are required. Involvement of a statistician with expertise in longitudinal and clustered designs is recommended.


PLoS ONE ◽  
2021 ◽  
Vol 16 (10) ◽  
pp. e0254330
Author(s):  
Jane Whelan ◽  
Helen Marshall ◽  
Thomas R. Sullivan

Cluster randomized trials (cRCT) to assess vaccine effectiveness incorporate indirect effects of vaccination, helping to inform vaccination policy. To calculate the sample size for a cRCT, an estimate of the intracluster correlation coefficient (ICC) is required. For infectious diseases, shared characteristics and social mixing behaviours may increase susceptibility and exposure, promote transmission and be a source of clustering. We present ICCs from a school-based cRCT assessing the effectiveness of a meningococcal B vaccine (Bexsero, GlaxoSmithKline) on reducing oropharyngeal carriage of Neisseria meningitidis (Nm) in 34,489 adolescents from 237 schools in South Australia in 2017/2018. We also explore the contribution of shared behaviours and characteristics to these ICCs. The ICC for carriage of disease-causing Nm genogroups (primary outcome) pre-vaccination was 0.004 (95% CI: 0.002, 0.007) and for all Nm was 0.007 (95%CI: 0.004, 0.011). Adjustment for social behaviours and personal characteristics reduced the ICC for carriage of disease-causing and all Nm genogroups by 25% (to 0.003) and 43% (to 0.004), respectively. ICCs are also reported for risk factors here, which may be outcomes in future research. Higher ICCs were observed for susceptibility and/or exposure variables related to Nm carriage (having a cold, spending ≥1 night out socializing or kissing ≥1 person in the previous week). In metropolitan areas, nights out socializing was a highly correlated behaviour. By contrast, smoking was a highly correlated behaviour in rural areas. A practical example to inform future cRCT sample size estimates is provided.


2021 ◽  
pp. 096228022110260
Author(s):  
Ariane M Mbekwe Yepnang ◽  
Agnès Caille ◽  
Sandra M Eldridge ◽  
Bruno Giraudeau

In cluster randomised trials, a measure of intracluster correlation such as the intraclass correlation coefficient (ICC) should be reported for each primary outcome. Providing intracluster correlation estimates may help in calculating sample size of future cluster randomised trials and also in interpreting the results of the trial from which they are derived. For a binary outcome, the ICC is known to be associated with its prevalence, which raises at least two issues. First, it questions the use of ICC estimates obtained on a binary outcome in a trial for sample size calculations in a subsequent trial in which the same binary outcome is expected to have a different prevalence. Second, it challenges the interpretation of ICC estimates because they do not solely depend on clustering level. Other intracluster correlation measures proposed for clustered binary data settings include the variance partition coefficient, the median odds ratio and the tetrachoric correlation coefficient. Under certain assumptions, the theoretical maximum possible value for an ICC associated with a binary outcome can be derived, and we proposed the relative deviation of an ICC estimate to this maximum value as another measure of the intracluster correlation. We conducted a simulation study to explore the dependence of these intracluster correlation measures on outcome prevalence and found that all are associated with prevalence. Even if all depend on prevalence, the tetrachoric correlation coefficient computed with Kirk’s approach was less dependent on the outcome prevalence than the other measures when the intracluster correlation was about 0.05. We also observed that for lower values, such as 0.01, the analysis of variance estimator of the ICC is preferred.


2021 ◽  
Vol 18 (2) ◽  
pp. 147-157
Author(s):  
Richard Hooper ◽  
Andrew J Copas

Background: Cluster randomised trials, like individually randomised trials, may benefit from a baseline period of data collection. We consider trials in which clusters prospectively recruit or identify participants as a continuous process over a given calendar period, and ask whether and for how long investigators should collect baseline data as part of the trial, in order to maximise precision. Methods: We show how to calculate and plot the variance of the treatment effect estimator for different lengths of baseline period in a range of scenarios, and offer general advice. Results: In some circumstances it is optimal not to include a baseline, while in others there is an optimal duration for the baseline. All other things being equal, the circumstances where it is preferable not to include a baseline period are those with a smaller recruitment rate, smaller intracluster correlation, greater decay in the intracluster correlation over time, or wider transition period between recruitment under control and intervention conditions. Conclusion: The variance of the treatment effect estimator can be calculated numerically, and plotted against the duration of baseline to inform design. It would be of interest to extend these investigations to cluster randomised trial designs with more than two randomised sequences of control and intervention condition, including stepped wedge designs.


2020 ◽  
Vol 17 (2) ◽  
pp. 176-183
Author(s):  
Siobhan P Brown ◽  
Abigail B Shoben

Background/aims In a stepped wedge study design, study clusters usually start with the baseline treatment and then cross over to the intervention at randomly determined times. Such designs are useful when the intervention must be delivered at the cluster level and are becoming increasingly common in practice. In these trials, if the outcome is death or serious morbidity, one may have an ethical imperative to monitor the trial and stop before maximum enrollment if the new therapy is proven to be beneficial. In addition, because formal monitoring allows for the stoppage of trials when a significant benefit for new therapy has been ruled out, their use can make a research program more efficient. However, use of the stepped wedge cluster randomized study design complicates the implementation of standard group sequential monitoring methods. Both the correlation of observations introduced by the clustered randomization and the timing of crossover from one treatment to the other impact the rate of information growth, an important component of an interim analysis. Methods We simulated cross-sectional stepped wedge study data in order to evaluate the impact of sequential monitoring on the Type I error and power when the true intracluster correlation is unknown. We studied the impact of varying intracluster correlations, treatment effects, methods of estimating the information growth, and boundary shapes. Results While misspecified information growth can impact both the Type I error and power of a study in some settings, we observed little inflation of the Type I error and only moderate reductions in power across a range of misspecified information growth patterns in our simulations. Conclusion Taking the study design into account and using either an estimate of the intracluster correlation from the ongoing study or other data in the same clusters should allow for easy implementation of group sequential methods in future stepped wedge designs.


2019 ◽  
Vol 18 (1) ◽  
Author(s):  
Pimnara Peerawaranun ◽  
Jordi Landier ◽  
Francois H. Nosten ◽  
Thuy-Nhien Nguyen ◽  
Tran Tinh Hien ◽  
...  

Abstract Background Sample size calculations for cluster randomized trials are a recognized methodological challenge for malaria research in pre-elimination settings. Positively correlated responses from the participants in the same cluster are a key feature in the estimated sample size required for a cluster randomized trial. The degree of correlation is measured by the intracluster correlation coefficient (ICC) where a higher coefficient suggests a closer correlation hence less heterogeneity within clusters but more heterogeneity between clusters. Methods Data on uPCR-detected Plasmodium falciparum and Plasmodium vivax infections from a recent cluster randomized trial which aimed at interrupting malaria transmission through mass drug administrations were used to calculate the ICCs for prevalence and incidence of Plasmodium infections. The trial was conducted in four countries in the Greater Mekong Subregion, Laos, Myanmar, Vietnam and Cambodia. Exact and simulation approaches were used to estimate ICC values for both the prevalence and the incidence of parasitaemia. In addition, the latent variable approach to estimate ICCs for the prevalence was utilized. Results The ICCs for prevalence ranged between 0.001 and 0.082 for all countries. The ICC from the combined 16 villages in the Greater Mekong Subregion were 0.26 and 0.21 for P. falciparum and P. vivax respectively. The ICCs for incidence of parasitaemia ranged between 0.002 and 0.075 for Myanmar, Cambodia and Vietnam. There were very high ICCs for incidence in the range of 0.701 to 0.806 in Laos during follow-up. Conclusion ICC estimates can help researchers when designing malaria cluster randomized trials. A high variability in ICCs and hence sample size requirements between study sites was observed. Realistic sample size estimates for cluster randomized malaria trials in the Greater Mekong Subregion have to assume high between cluster heterogeneity and ICCs. This work focused on uPCR-detected infections; there remains a need to develop more ICC references for trials designed around prevalence and incidence of clinical outcomes. Adequately powered trials are critical to estimate the benefit of interventions to malaria in a reliable and reproducible fashion. Trial registration: ClinicalTrials.govNCT01872702. Registered 7 June 2013. Retrospectively registered. https://clinicaltrials.gov/ct2/show/NCT01872702


Sign in / Sign up

Export Citation Format

Share Document