scholarly journals Systematic review finds major deficiencies in sample size methodology and reporting for stepped-wedge cluster randomised trials

BMJ Open ◽  
2016 ◽  
Vol 6 (2) ◽  
pp. e010166 ◽  
Author(s):  
James Martin ◽  
Monica Taljaard ◽  
Alan Girling ◽  
Karla Hemming
2020 ◽  
Author(s):  
Évèhouénou Lionel Adisso ◽  
Monica Taljaard ◽  
Louis-Paul Rivest ◽  
Hervé Tchala Vignon Zomahoun ◽  
Pierre Jacob Durand ◽  
...  

Abstract Background: The stepped wedge cluster randomised trial is an increasingly common trial design. The design can be useful for informing real-world clinical decision-making, including decisions about the effectiveness of interventions in particular subgroups. However, there is little existing guidance about how to perform subgroup analyses in the stepped wedge design. We aim to determine the prevalence of subgroup analyses and describe statistical methods used to perform them in stepped wedge cluster randomised trials.Methods: We will conduct a systematic review following the methodology recommended in the Cochrane Handbook for Systematic Reviews of Interventions. We report this protocol according to the PRISMA-P checklist. The protocol has been registered in the Open Science Framework. We will search for terms related to ‘stepped wedge’. Sources will be PubMed, Embase, PsycINFO, Web of Science, CINAHL, Cochrane Library, and Current Controlled Trials Register up to 16 October 2020. Studies will be eligible if they are written in English, involve human participants and are primary or secondary reports of planned or completed stepped wedge cluster randomised trials. Two reviewers will first screen the titles and abstracts, then full texts, to select studies that should be included in the review. Disagreements will be solved by consensus through discussion with a third reviewer. We will extract data related to study characteristics including presence or absence of subgroup analyses, characteristics of subgroup variables examined, statistical methods used to perform subgroup analyses, and adherence to the most consistently recommendations suggested for subgroup analyses in general including in clinical trials. We will perform a qualitative synthesis of the extracted data.Discussion: This protocol offers a reproducible and transparent procedure for a systematic review of the literature. It will provide a portrait of the frequency and types of subgroup analyses performed in stepped wedge cluster randomised trials. These results will inform the development of recommendations for subgroup analyses in such trials.Systematic review registration: This protocol has been registered on Open Science Framework, Registration ID: https://osf.io/2kwrz.


2017 ◽  
Vol 28 (3) ◽  
pp. 703-716 ◽  
Author(s):  
J Kasza ◽  
K Hemming ◽  
R Hooper ◽  
JNS Matthews ◽  
AB Forbes

Stepped wedge and cluster randomised crossover trials are examples of cluster randomised designs conducted over multiple time periods that are being used with increasing frequency in health research. Recent systematic reviews of both of these designs indicate that the within-cluster correlation is typically taken account of in the analysis of data using a random intercept mixed model, implying a constant correlation between any two individuals in the same cluster no matter how far apart in time they are measured: within-period and between-period intra-cluster correlations are assumed to be identical. Recently proposed extensions allow the within- and between-period intra-cluster correlations to differ, although these methods require that all between-period intra-cluster correlations are identical, which may not be appropriate in all situations. Motivated by a proposed intensive care cluster randomised trial, we propose an alternative correlation structure for repeated cross-sectional multiple-period cluster randomised trials in which the between-period intra-cluster correlation is allowed to decay depending on the distance between measurements. We present results for the variance of treatment effect estimators for varying amounts of decay, investigating the consequences of the variation in decay on sample size planning for stepped wedge, cluster crossover and multiple-period parallel-arm cluster randomised trials. We also investigate the impact of assuming constant between-period intra-cluster correlations instead of decaying between-period intra-cluster correlations. Our results indicate that in certain design configurations, including the one corresponding to the proposed trial, a correlation decay can have an important impact on variances of treatment effect estimators, and hence on sample size and power. An R Shiny app allows readers to interactively explore the impact of correlation decay.


BMJ Open ◽  
2017 ◽  
Vol 7 (11) ◽  
pp. e017151 ◽  
Author(s):  
Caroline Kristunas ◽  
Tom Morris ◽  
Laura Gray

ObjectivesTo investigate the extent to which cluster sizes vary in stepped-wedge cluster randomised trials (SW-CRT) and whether any variability is accounted for during the sample size calculation and analysis of these trials.SettingAny, not limited to healthcare settings.ParticipantsAny taking part in an SW-CRT published up to March 2016.Primary and secondary outcome measuresThe primary outcome is the variability in cluster sizes, measured by the coefficient of variation (CV) in cluster size. Secondary outcomes include the difference between the cluster sizes assumed during the sample size calculation and those observed during the trial, any reported variability in cluster sizes and whether the methods of sample size calculation and methods of analysis accounted for any variability in cluster sizes.ResultsOf the 101 included SW-CRTs, 48% mentioned that the included clusters were known to vary in size, yet only 13% of these accounted for this during the calculation of the sample size. However, 69% of the trials did use a method of analysis appropriate for when clusters vary in size. Full trial reports were available for 53 trials. The CV was calculated for 23 of these: the median CV was 0.41 (IQR: 0.22–0.52). Actual cluster sizes could be compared with those assumed during the sample size calculation for 14 (26%) of the trial reports; the cluster sizes were between 29% and 480% of that which had been assumed.ConclusionsCluster sizes often vary in SW-CRTs. Reporting of SW-CRTs also remains suboptimal. The effect of unequal cluster sizes on the statistical power of SW-CRTs needs further exploration and methods appropriate to studies with unequal cluster sizes need to be employed.


2016 ◽  
Vol 35 (26) ◽  
pp. 4718-4728 ◽  
Author(s):  
Richard Hooper ◽  
Steven Teerenstra ◽  
Esther de Hoop ◽  
Sandra Eldridge

2021 ◽  
pp. 174077452110208
Author(s):  
Elizabeth Korevaar ◽  
Jessica Kasza ◽  
Monica Taljaard ◽  
Karla Hemming ◽  
Terry Haines ◽  
...  

Background: Sample size calculations for longitudinal cluster randomised trials, such as crossover and stepped-wedge trials, require estimates of the assumed correlation structure. This includes both within-period intra-cluster correlations, which importantly differ from conventional intra-cluster correlations by their dependence on period, and also cluster autocorrelation coefficients to model correlation decay. There are limited resources to inform these estimates. In this article, we provide a repository of correlation estimates from a bank of real-world clustered datasets. These are provided under several assumed correlation structures, namely exchangeable, block-exchangeable and discrete-time decay correlation structures. Methods: Longitudinal studies with clustered outcomes were collected to form the CLustered OUtcome Dataset bank. Forty-four available continuous outcomes from 29 datasets were obtained and analysed using each correlation structure. Patterns of within-period intra-cluster correlation coefficient and cluster autocorrelation coefficients were explored by study characteristics. Results: The median within-period intra-cluster correlation coefficient for the discrete-time decay model was 0.05 (interquartile range: 0.02–0.09) with a median cluster autocorrelation of 0.73 (interquartile range: 0.19–0.91). The within-period intra-cluster correlation coefficients were similar for the exchangeable, block-exchangeable and discrete-time decay correlation structures. Within-period intra-cluster correlation coefficients and cluster autocorrelations were found to vary with the number of participants per cluster-period, the period-length, type of cluster (primary care, secondary care, community or school) and country income status (high-income country or low- and middle-income country). The within-period intra-cluster correlation coefficients tended to decrease with increasing period-length and slightly decrease with increasing cluster-period sizes, while the cluster autocorrelations tended to move closer to 1 with increasing cluster-period size. Using the CLustered OUtcome Dataset bank, an RShiny app has been developed for determining plausible values of correlation coefficients for use in sample size calculations. Discussion: This study provides a repository of intra-cluster correlations and cluster autocorrelations for longitudinal cluster trials. This can help inform sample size calculations for future longitudinal cluster randomised trials.


2017 ◽  
Vol 14 (1) ◽  
Author(s):  
Christopher Jarvis ◽  
Gian Luca Di Tanna ◽  
Daniel Lewis ◽  
Neal Alexander ◽  
W. John Edmunds

2017 ◽  
Vol 14 (5) ◽  
pp. 507-517 ◽  
Author(s):  
Michael J Grayling ◽  
James MS Wason ◽  
Adrian P Mander

Background/Aims: The stepped-wedge cluster randomised trial design has received substantial attention in recent years. Although various extensions to the original design have been proposed, no guidance is available on the design of stepped-wedge cluster randomised trials with interim analyses. In an individually randomised trial setting, group sequential methods can provide notable efficiency gains and ethical benefits. We address this by discussing how established group sequential methodology can be adapted for stepped-wedge designs. Methods: Utilising the error spending approach to group sequential trial design, we detail the assumptions required for the determination of stepped-wedge cluster randomised trials with interim analyses. We consider early stopping for efficacy, futility, or efficacy and futility. We describe first how this can be done for any specified linear mixed model for data analysis. We then focus on one particular commonly utilised model and, using a recently completed stepped-wedge cluster randomised trial, compare the performance of several designs with interim analyses to the classical stepped-wedge design. Finally, the performance of a quantile substitution procedure for dealing with the case of unknown variance is explored. Results: We demonstrate that the incorporation of early stopping in stepped-wedge cluster randomised trial designs could reduce the expected sample size under the null and alternative hypotheses by up to 31% and 22%, respectively, with no cost to the trial’s type-I and type-II error rates. The use of restricted error maximum likelihood estimation was found to be more important than quantile substitution for controlling the type-I error rate. Conclusion: The addition of interim analyses into stepped-wedge cluster randomised trials could help guard against time-consuming trials conducted on poor performing treatments and also help expedite the implementation of efficacious treatments. In future, trialists should consider incorporating early stopping of some kind into stepped-wedge cluster randomised trials according to the needs of the particular trial.


Sign in / Sign up

Export Citation Format

Share Document