A Review of Statistical Reporting in Dietetics Research (2010–2019): How is a Canadian Journal Doing?

Author(s):  
Holly Schaafsma ◽  
Holly Laasanen ◽  
Jasna Twynstra ◽  
Jamie A. Seabrook

Despite the widespread use of statistical techniques in quantitative research, methodological flaws and inadequate statistical reporting persist. The objective of this study is to evaluate the quality of statistical reporting and procedures in all original, quantitative articles published in the Canadian Journal of Dietetic Practice and Research (CJDPR) from 2010 to 2019 using a checklist created by our research team. In total, 107 articles were independently evaluated by 2 raters. The hypothesis or objective(s) was clearly stated in 97.2% of the studies. Over half (51.4%) of the articles reported the study design and 57.9% adequately described the statistical techniques used. Only 21.2% of the studies that required a prestudy sample size calculation reported one. Of the 281 statistical tests conducted, 88.3% of them were correct. P values >0.05–0.10 were reported as “statistically significant” and/or a “trend” in 11.4% of studies. While this evaluation reveals both strengths and areas for improvement in the quality of statistical reporting in CJDPR, we encourage dietitians to pursue additional statistical training and/or seek the assistance of a statistician. Future research should consider validating this new checklist and using it to evaluate the statistical quality of studies published in other nutrition journals and disciplines.

2012 ◽  
Vol 23 (5) ◽  
pp. 570-574 ◽  
Author(s):  
Nina Musurlieva ◽  
Maria Stoykova ◽  
Doychin Boyadjiev

The aim of the paper is to present the validation of a scale for assessing the impact of periodontal diseases on individuals' quality of life in Bulgaria. A pilot study was conducted among 30 patients diagnosed with periodontitis who visited the Department of Periodontology, Faculty of Dental Medicine, Medical University of Plovdiv, Bulgaria. The minimum sample size of 30 people was established based on a power analysis. The mean age of participants was 48.95 ± 11.85 years, being 56.67 ± 9.05 years for males and 43.33 ± 9.05 years for females. Standard interviews were conducted using a specific instrument: a self-designed questionnaire with a 5-point ranked scale, initially containing 11 questions. The interviews were repeated after 3 months with the same patients for retest analysis. The data were statistically processed using SPSS v.13 software. Results after the initial interviews: Cronbach's coefficient (α=0.882), Spearman-Brown coefficient (r_sb=0.998), average inter-item correlation coefficient (R=0.426), difficulty of the questions from 0.173 to 0.757 and discrimination power from 0.405 to 0.809. Results after the second interviews: α=0.883, r_sb=0.998, R=0.507, difficulty from 0.287 to 0.757 and discrimination power from 0.524 to 0.809. Two of the questions showed a low level of inter-item correlation with the rest of the items and were excluded. The final version of the questionnaire contained 9 questions. The validation showed that the developed scale is sufficiently reliable, and it will be used in the final research, the first to use such an instrument for measuring oral health-related quality of life in Bulgaria.
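As an illustration of the reliability coefficients named in this abstract, the following is a minimal sketch of how Cronbach's α and a Spearman-Brown corrected split-half coefficient are commonly computed. It is not the authors' SPSS procedure: the function names and the sample responses are hypothetical, and the formulas are the standard textbook ones (sample variances, two-half Spearman-Brown correction).

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # transpose: one tuple per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


def spearman_brown(r_half):
    """Spearman-Brown correction of a split-half correlation r_half."""
    return 2 * r_half / (1 + r_half)


# Hypothetical responses: 4 respondents x 3 items on a 5-point scale.
responses = [
    [1, 2, 1],
    [3, 3, 2],
    [4, 5, 4],
    [5, 5, 5],
]
print(round(cronbach_alpha(responses), 3))
print(round(spearman_brown(0.5), 3))
```

Values of α close to 1 indicate that the items vary together; the abstract's reported α of about 0.88 would fall in the range usually described as good internal consistency.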


2020 ◽  
Vol 99 (13) ◽  
pp. 1453-1460
Author(s):  
D. Qin ◽  
F. Hua ◽  
H. He ◽  
S. Liang ◽  
H. Worthington ◽  
...  

The objectives of this study were to assess the reporting quality and methodological quality of split-mouth trials (SMTs) published during the past 2 decades and to determine whether there has been an improvement in their quality over time. We searched the MEDLINE database via PubMed to identify SMTs published in 1998, 2008, and 2018. For each included SMT, we used the CONsolidated Standards Of Reporting Trials (CONSORT) 2010 guideline, CONSORT for within-person trial (WPT) extension, and a new 3-item checklist to assess its trial reporting quality (TRQ), WPT-specific reporting quality (WRQ), and SMT-specific methodological quality (SMQ), respectively. Multivariable generalized linear models were performed to analyze the quality of SMTs over time, adjusting for potential confounding factors. A total of 119 SMTs were included. The mean overall score for the TRQ (score range, 0 to 32), WRQ (0 to 15), and SMQ (0 to 3) was 15.77 (SD 4.51), 6.06 (2.06), and 1.12 (0.70), respectively. The primary outcome was clearly defined in only 28 SMTs (23.5%), and only 27 (22.7%) presented a replicable sample size calculation. Only 45 SMTs (37.8%) provided the rationale for using a split-mouth design. The correlation between body sites was reported in only 5 studies (4.2%) for sample size calculation and 4 studies (3.4%) for statistical results. Only 2 studies (1.7%) performed an appropriate sample size calculation, and 46 (38.7%) chose appropriate statistical methods, both accounting for the correlation among treatment groups and the clustering/multiplicity of measurements within an individual. Results of regression analyses suggested that the TRQ of SMTs improved significantly with time (P < 0.001), while there was no evidence of improvement in WRQ or SMQ. Both the reporting quality and methodological quality of SMTs still have much room for improvement. Concerted efforts are needed to improve the execution and reporting of SMTs.


2016 ◽  
Vol 124 (2) ◽  
pp. 558-568 ◽  
Author(s):  
Alireza Mansouri ◽  
Benjamin Cooper ◽  
Samuel M. Shin ◽  
Douglas Kondziolka

OBJECT Randomized-controlled trials (RCTs) are advocated to provide high-level medical evidence. However, in neurosurgery, there are barriers to conducting RCTs. The authors of this study sought to analyze the quality of neurosurgical RCTs since 2000 to determine the adequacy of their design and reporting. METHODS A search of the MEDLINE and EMBASE databases (2000–2014) was conducted. The medical subject heading (MeSH) terms used in the search included: “neurosurgery” OR “neurosurgical procedure,” “brain neoplasms,” “infarction” and “decompression,” “carotid stenosis,” “cerebral hemorrhage,” and “spinal fusion.” These studies were limited to RCTs, in humans, and in the English language. The Consolidated Standards for Reporting of Trials (CONSORT) and Jadad scales were used to assess the quality of RCT design and reporting. The standardized median times cited (median citations divided by years since publication) were used to assess impact. A pragmatic-explanatory continuum indicator summary-based scale was used to assess the design of the studies as primarily pragmatic or explanatory. RESULTS Sixty-one articles were identified, and the following subspecialties were the most common: vascular (23, 37%), followed by functional neurosurgery and neurooncology (both 13, 21%). The following nations were the primary leaders in RCTs: US (25 studies, 41%), Germany (8 studies, 13%), and the United Kingdom (7 studies, 11%). Median sample size was 100 (interquartile range [IQR] 41.5–279). The majority of the studies (40, 66%) had pragmatic objectives. The median number of times cited overall was 69 (IQR 20.5–193). The combined median CONSORT score was 36 (IQR 27.5–39). Blinding was most deficiently reported. 
Other areas with a relatively low quality of reporting were sample size calculation (34.2% of surgical, 38.5% of drug, and 20% of device studies), allocation concealment (28.9% of surgical, 23.1% of drug, and 50% of device studies), and protocol implementation (18.4% of surgical, 23% of drug, and 20% of device studies). The quality of reporting did not correlate with the study impact. All studies had a median Jadad score ≤ 3. Thirty-three pragmatic studies (83%) and 5 explanatory studies (25%) met the design objectives. All pragmatic studies based on drug and device trials met their objectives, while 74% of pragmatic surgical trials met their objectives. CONCLUSIONS The prevalence of neurosurgical RCTs is low. The quality of RCT design and reporting in neurosurgery is also low. Many study designs are not compatible with stated objectives. Pragmatic studies were more likely to meet design objectives. Given the role of RCTs as one of the highest levels of evidence, it is critical to improve on their methodology and reporting.


2016 ◽  
Vol 6 (9) ◽  
pp. 1753
Author(s):  
Khalid Al-Seghayer

English instructors’ perceptions of computer-assisted reading (CAR) influence their tendency to integrate it in the L2 reading classroom. Accordingly, this study assessed English instructors’ perceptions of the importance of electronic text in L2 reading and their general attitudes about the role and effectiveness of computers in teaching ESL/EFL. To this end, 70 ESL/EFL instructors at multiple universities responded to a two-part survey containing 34 items and open-ended questions about the respondents’ general attitudes about computers in language teaching and learning, and the importance of computer-assisted L2 reading instruction specifically. The researcher performed both descriptive and inferential statistical analyses on the data, including means and standard deviations, as well as paired t-tests and bootstrapped p-values. The results indicated that the participants recognized the importance of CAR in improving the quality of L2 reading instruction and developing learners’ reading skills. The implications of the study are discussed, and potential areas of future research are suggested.


2021 ◽  
Vol 9 ◽  
Author(s):  
Nan Yang ◽  
Huihui Zhang ◽  
Taoyi Deng ◽  
Jeff Jianfei Guo ◽  
Ming Hu

Objectives: This study aimed to find and appraise the available published pharmacoeconomic research on Traditional Chinese Medicine (TCM), to identify related issues and make suggestions for improvement in future research. Methods: After developing a search strategy and establishing inclusion and exclusion criteria, pharmacoeconomic studies on TCM were sourced from seven Chinese and English databases from inception to April 2020. Basic information about the studies and key pharmacoeconomic items of each study were extracted. The quality of each study was evaluated by using the British Medical Journal economic submissions checklist for authors and peer reviewers, focusing on factors such as study design, research time horizon, sample size, perspective, and evaluation methods. Results: A total of 431 published pharmacoeconomic articles with 434 studies on topics including cost-effectiveness, cost-benefit, cost-minimization, cost-utility, or combination analyses were identified and included in this review. Of these, 424 were published in Chinese and 7 in English. These studies conducted economic evaluations of 264 Chinese patent medicines and 70 types of TCM prescriptions for 143 diseases, including those of the central nervous, cardiovascular, respiratory, gynecological, and other systems. The studied TCMs included blood-activating agents (such as Xuesaitong tablet, Fufant Danshen tablet, and Danhong Injection), blood circulation promoting agents (such as Shuxuetong injection, Rupixiao tablet, and Fufang Danshen injection), and other therapeutic agents. The overall quality score of the studies was 0.62 (range 0.38 to 0.85). The mean quality score of studies in English was 0.72, which was higher than that of studies in Chinese with 0.62. Conclusions: The quality of pharmacoeconomic studies on TCM was generally low. Major concerns included study design, inappropriate pharmacoeconomic evaluation, insufficient sample size, or non-scientific assessment.
Enhanced methodological training and cooperation, the development of a targeted pharmacoeconomic evaluation guideline, and proposal of a reasonable health outcome index are warranted to improve quality of future studies.


2004 ◽  
Vol 15 (3) ◽  
pp. 231-237 ◽  
Author(s):  
Gláucia Maria Bovi Ambrosano ◽  
André Figueiredo Reis ◽  
Marcelo Giannini ◽  
Antônio Carlos Pereira

A descriptive survey was performed in order to assess the statistical content and quality of Brazilian and international dental journals, and compare their evolution throughout the last decades. The authors identified the reporting and accuracy of statistical techniques in 1000 papers published from 1970 to 2000 in seven dental journals: three Brazilian (Brazilian Dental Journal, Revista de Odontologia da Universidade de São Paulo and Revista de Odontologia da UNESP) and four international journals (Journal of the American Dental Association, Journal of Dental Research, Caries Research and Journal of Periodontology). Papers were divided into two time periods: from 1970 to 1989, and from 1990 to 2000. A slight increase in the number of articles that presented some form of statistical technique was noticed for Brazilian journals (from 61.0 to 66.7%), whereas for international journals, a significant increase was observed (65.8 to 92.6%). In addition, a decrease in the number of statistical errors was verified. The most commonly used statistical tests as well as the most frequent errors found in dental journals were assessed. Hopefully, this investigation will encourage dental educators to better plan the teaching of biostatistics, and to improve the statistical quality of submitted manuscripts.


Author(s):  
David Clark-Carter

This chapter explores why effect size needs to be taken into account when designing and reporting research. It gives an effect size for each of the standard statistical tests which health and clinical psychologists employ, and looks at the need to consider statistical power when choosing a sample size for a study and how statistical power can help to guide the advice which can be given when discussing future research.
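To make the link between effect size, statistical power, and sample size concrete, here is a minimal sketch of a prestudy sample size calculation for comparing two independent group means. It is an illustration only and is not taken from the chapter: the function name `n_per_group` is hypothetical, and the formula is the common normal approximation n = 2((z₁₋α/₂ + z_power) / d)², where d is Cohen's standardized mean difference. An exact t-based calculation gives slightly larger values.

```python
from math import ceil
from statistics import NormalDist


def n_per_group(d, alpha=0.05, power=0.80):
    """Approximate participants needed per group for a two-sided,
    two-sample comparison of means (normal approximation).

    d     : expected standardized effect size (Cohen's d), must be > 0
    alpha : two-sided type I error rate
    power : desired probability of detecting the effect (1 - type II error)
    """
    z = NormalDist()                       # standard normal distribution
    z_alpha = z.inv_cdf(1 - alpha / 2)     # critical value for the test
    z_power = z.inv_cdf(power)             # quantile matching desired power
    return ceil(2 * ((z_alpha + z_power) / d) ** 2)


# Smaller expected effects require much larger samples.
for effect in (0.2, 0.5, 0.8):
    print(effect, n_per_group(effect))
```

Because the required n scales with 1/d², halving the expected effect size roughly quadruples the sample needed, which is why reporting the assumed effect size alongside the calculation matters.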


2017 ◽  
Vol 33 (1) ◽  
pp. 103-110 ◽  
Author(s):  
Britta Olberg ◽  
Matthias Perleth ◽  
Katja Felgentraeger ◽  
Sandra Schulz ◽  
Reinhard Busse

Background: The aim of this study was to assess the quality of reporting sample size calculation and underlying design assumptions in pivotal trials of high-risk medical devices (MDs) for neurological conditions. Methods: Systematic review of research protocols for publicly registered randomized controlled trials (RCTs). In the absence of a published protocol, principal investigators were contacted for additional data. To be included, trials had to investigate a high-risk MD, registered between 2005 and 2015, with indications stroke, headache disorders, and epilepsy as case samples within central nervous system diseases. Extraction of key methodological parameters for sample size calculation was performed independently and peer-reviewed. Results: In a final sample of seventy-one eligible trials, we collected data from thirty-one trials. Eighteen protocols were obtained from the public domain or principal investigators. Data availability decreased during the extraction process, with almost all data available for stroke-related trials. Of the thirty-one trials with sample size information available, twenty-six reported a predefined calculation and underlying assumptions. Justification was given in twenty and evidence for parameter estimation in sixteen trials. Estimates were most often based on previous research, including RCTs and observational data. Observational data were predominantly represented by retrospective designs. Other references for parameter estimation indicated a lower level of evidence. Conclusions: Our systematic review of trials on high-risk MDs confirms previous research, which has documented deficiencies regarding data availability and a lack of reporting on sample size calculation. More effort is needed to ensure both relevant sources, that is, original research protocols, to be publicly available and reporting requirements to be standardized.


2008 ◽  
Vol 57 (1) ◽  
pp. 113-119 ◽  
Author(s):  
Mohammad Samir Hossain ◽  
Mohammad Zakaria Siddique

Death is the end of life. But Muslims believe death is an event between two lives, not an absolute cessation of life. Thus religiosity may influence how Muslims relate to death. To explore the impact of religious perception, and thus religiosity, a cross-sectional, descriptive, analytic and correlational study was conducted on 150 Muslims. Self-declared healthy Muslims, equally from both sexes (N = 150; age range, 20 to 50 years; minimum education, Bachelor), were selected by stratified sampling, with random selection within each stratum. Subjects, divided into five levels of religiosity, were assessed and scored for the presence of maladjustment symptoms and stage of adjustment with death. ANOVA and correlation coefficients were applied to the sets of data collected. All statistical tests were done at the level of 95% confidence (P < 0.05). Final results were higher than the table values used for ANOVA, and the correlation coefficients yielded P values of < 0.05, < 0.01, and < 0.001. Religiosity as a criterion of Muslims influenced the quality of adjustment with death positively. So we hypothesized that religiosity may help Muslims adjust to death.


2001 ◽  
Vol 95 (5) ◽  
pp. 1068-1073 ◽  
Author(s):  
Hwee Leng Pua ◽  
Jerrold Lerman ◽  
Mark W. Crawford ◽  
James G. Wright

Background The authors evaluated the quality of clinical trials published in four anesthesia journals during the 20-yr period from 1981-2000. Methods Trials published in four major anesthesia journals during the periods 1981-1985, 1991-1995, and the first 6 months of 2000 were grouped according to journal and year. Using random number tables, four trials were selected from all of the eligible clinical trials in each journal in each year for the periods 1981-1985 and 1991-1995, and five trials were selected from all of the trials in each journal in the first 6 months of 2000. Methods and results sections from the 160 trials from 1981-1985 and 1991-1995 were randomly ordered and distributed to three of the authors for blinded review of the quality of the study design according to 10 predetermined criteria (weighted equally, maximum score of 10): informed consent and ethics approval, eligibility criteria, sample size calculation, random allocation, method of randomization, blind assessment of outcome, adverse outcomes, statistical analysis, type I error, and type II error. After these trials were evaluated, 20 trials from the first 6 months of 2000 were randomly ordered, distributed, and evaluated as described. Results The mean (± SD) analysis scores pooled for the four journals increased from 5.5 ± 1.4 in 1981-1985 to 7.0 ± 1.1 in 1991-1995 (P < 0.00001) and to 7.8 ± 1.5 in 2000. For 7 of the 10 criteria, the percentage of trials from the four journals that fulfilled the criteria increased significantly between 1981-1985 and 1991-1995. During the 20-yr period, the reporting of sample size calculation and method of randomization increased threefold to fourfold, whereas the frequency of type I statistical errors remained unchanged. Conclusion Although the quality of clinical trials in four major anesthesia journals has increased steadily during the past two decades, specific areas of trial methodology require further attention.

