Comparison between 2000 and 2018 on the reporting of statistical significance and clinical relevance in physiotherapy clinical trials in six major physiotherapy journals: a meta-research design

BMJ Open ◽  
2022 ◽  
Vol 12 (1) ◽  
pp. e054875
Author(s):  
Arianne Verhagen ◽  
Peter William Stubbs ◽  
Poonam Mehta ◽  
David Kennedy ◽  
Anthony M Nasser ◽  
...  

Design: Meta-research.
Objective: To compare the prevalence of reporting p values, effect estimates and clinical relevance in physiotherapy randomised controlled trials (RCTs) published in the years 2000 and 2018.
Methods: We performed a meta-research study of physiotherapy RCTs obtained from six major physiotherapy peer-reviewed journals that were published in the years 2000 and 2018. We searched the databases Embase, Medline and PubMed in May 2019, and extracted data on the study characteristics and whether articles reported on statistical significance, effect estimates and confidence intervals (CIs) for baseline, between-group, and within-group differences, and clinical relevance. Data were presented using descriptive statistics and inferences were made based on proportions. A 20% difference between 2000 and 2018 was regarded as a meaningful difference.
Results: We found 140 RCTs: 39 were published in 2000 and 101 in 2018. Overall, there was a high prevalence (>90%) of reporting p values for the main (between-group) analysis, with no difference between years. Statistical significance testing was frequently used for evaluating baseline differences, increasing from 28% in 2000 to 61.4% in 2018. The prevalence of reporting effect estimates, CIs and the mention of clinical relevance increased from 2000 to 2018 by 26.6%, 34% and 32.8% respectively. Despite an increase in use in 2018, over 40% of RCTs failed to report effect estimates, CIs and clinical relevance of results.
Conclusion: The prevalence of using p values remains high in physiotherapy research. Although the proportion of reporting effect estimates, CIs and clinical relevance is higher in 2018 compared with 2000, many publications still fail to report and interpret study findings in this way.
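The year-to-year comparison described above reduces to computing, for each reporting item, the proportion of trials per publication year and the change between years against the 20% threshold. A minimal sketch of that calculation, using placeholder counts rather than the study's extracted data, might look like this:

```python
# Minimal sketch of the year-to-year prevalence comparison described above.
# The per-item counts below are placeholders, not the study's extracted data.

def prevalence(n_reporting: int, n_total: int) -> float:
    """Proportion of trials reporting a given item, as a percentage."""
    return 100.0 * n_reporting / n_total

n_2000, n_2018 = 39, 101      # RCTs included per year (from the abstract)
ci_2000, ci_2018 = 10, 60     # hypothetical counts of trials reporting CIs

p_2000 = prevalence(ci_2000, n_2000)
p_2018 = prevalence(ci_2018, n_2018)
change = p_2018 - p_2000

# The study treated a 20% difference between years as meaningful.
print(f"2000: {p_2000:.1f}%  2018: {p_2018:.1f}%  "
      f"change: {change:+.1f} points  meaningful: {abs(change) >= 20.0}")
```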

2018 ◽  
Author(s):  
Norbert Hirschauer ◽  
Sven Grüner ◽  
Oliver Mußhoff ◽  
Claudia Becker

We suggest twenty immediately actionable steps to reduce widespread inferential errors related to “statistical significance testing.” Our propositions refer first to the theoretical preconditions for using p-values. They furthermore include wording guidelines as well as structural and operative advice on how to present results, especially in multiple regression analysis. Our propositions aim at fostering the logical consistency of inferential arguments by avoiding false categorical reasoning. They are not aimed at dispensing with p-values or completely replacing frequentist approaches by Bayesian statistics.


2017 ◽  
Author(s):  
Norbert Hirschauer ◽  
Oliver Mußhoff ◽  
Claudia Becker ◽  
Sven Grüner

We suggest twenty immediately actionable steps to reduce widespread inferential errors related to “statistical significance testing.” Our propositions refer first to the theoretical preconditions for using p-values. They furthermore include wording guidelines as well as structural and operative advice on how to present results, especially in multiple regression analysis. Our propositions aim at fostering the logical consistency of inferential arguments by avoiding false categorical reasoning. They are not aimed at dispensing with p-values or completely replacing frequentist approaches by Bayesian statistics.


2019 ◽  
Vol 239 (4) ◽  
pp. 703-721 ◽  
Author(s):  
Norbert Hirschauer ◽  
Sven Grüner ◽  
Oliver Mußhoff ◽  
Claudia Becker

We suggest twenty immediately actionable steps to reduce widespread inferential errors related to “statistical significance testing.” Our propositions refer to the theoretical preconditions for using p-values. They furthermore include wording guidelines as well as structural and operative advice on how to present results, especially in research based on multiple regression analysis, the workhorse of empirical economists. Our propositions aim at fostering the logical consistency of inferential arguments by avoiding false categorical reasoning. They are not aimed at dispensing with p-values or completely replacing frequentist approaches by Bayesian statistics.
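One way to act on that presentational advice for a multiple regression is to report point estimates together with interval estimates rather than star-coded significance verdicts. The sketch below is an illustration, not the authors' own worked example; the data are simulated and the example assumes the statsmodels package is available:

```python
# Illustrative sketch (not from the paper): present a multiple regression by
# point estimates and 95% confidence intervals instead of significance stars.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)
y = 0.5 * x1 + 0.1 * x2 + rng.normal(size=n)   # simulated data for illustration

X = sm.add_constant(np.column_stack([x1, x2]))
fit = sm.OLS(y, X).fit()

estimates = fit.params                 # point estimates
intervals = fit.conf_int(alpha=0.05)   # 95% confidence intervals

for name, b, (lo, hi) in zip(["const", "x1", "x2"], estimates, intervals):
    # Report magnitude and uncertainty; the substantive judgement about
    # whether an effect of that size matters is left to the reader.
    print(f"{name}: {b:.2f} (95% CI {lo:.2f} to {hi:.2f})")
```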


2018 ◽  
Vol 5 (1) ◽  
pp. 171047 ◽  
Author(s):  
Robert A. J. Matthews

The inferential inadequacies of statistical significance testing are now widely recognized. There is, however, no consensus on how to move research into a ‘post p < 0.05’ era. We present a potential route forward via the Analysis of Credibility (AnCred), a novel methodology that allows researchers to go beyond the simplistic dichotomy of significance testing and extract more insight from new findings. Using standard summary statistics, AnCred assesses the credibility of significant and non-significant findings on the basis of their evidential weight, and in the context of existing knowledge. The outcome is expressed in quantitative terms of direct relevance to the substantive research question, providing greater protection against misinterpretation. Worked examples are given to illustrate how AnCred extracts additional insight from the outcome of typical research study designs. Its ability to cast light on the use of p-values, the interpretation of non-significant findings and the so-called ‘replication crisis’ is also discussed.
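The “standard summary statistics” that AnCred starts from can be recovered from a reported confidence interval under a normal approximation. The sketch below shows only that preliminary step, with illustrative numbers; the credibility calculation itself is defined in the paper and is not reproduced here:

```python
# Hedged sketch: recover normal-approximation summary inputs from a reported
# 95% CI. This is a standard preliminary step only, NOT the AnCred credibility
# calculation, which is defined in the paper itself.
from statistics import NormalDist

def summary_from_ci(lower: float, upper: float, level: float = 0.95):
    """Point estimate and standard error implied by a symmetric normal CI."""
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    estimate = (lower + upper) / 2.0
    se = (upper - lower) / (2.0 * z)
    return estimate, se

# Hypothetical reported interval for a mean difference (illustrative numbers)
est, se = summary_from_ci(0.2, 1.8)
z_stat = est / se
p_two_sided = 2.0 * (1.0 - NormalDist().cdf(abs(z_stat)))
print(f"estimate={est:.2f}, SE={se:.3f}, z={z_stat:.2f}, p ≈ {p_two_sided:.3f}")
```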


2019 ◽  
Author(s):  
Norbert Hirschauer ◽  
Sven Grüner ◽  
Oliver Mußhoff ◽  
Claudia Becker

We suggest twenty immediately actionable steps to reduce widespread inferential errors related to “statistical significance testing.” Our propositions refer first to the theoretical preconditions for using p-values. They furthermore include wording guidelines as well as structural and operative advice on how to present results, especially in research based on multiple regression analysis, the workhorse of empirical economists. Our propositions aim at fostering the logical consistency of inferential arguments by avoiding false categorical reasoning. They are not aimed at dispensing with p-values or completely replacing frequentist approaches by Bayesian statistics.

