American Vocational Education Research Association Members' Perceptions of Statistical Significance Tests and Other Statistical Controversies

2001 ◽  
Vol 26 (2) ◽  
pp. 244-271 ◽  
Author(s):  
Howard R. D. Gordon
1998 ◽  
Vol 21 (2) ◽  
pp. 221-222
Author(s):  
Louis G. Tassinary

Chow (1996) offers a reconceptualization of statistical significance that is reasoned and comprehensive. Despite a somewhat rough presentation, his arguments are compelling and deserve to be taken seriously by the scientific community. It is argued, however, that his characterizations of literal replication, types of research, effect size, and experimental control are in need of revision.


2015 ◽  
Vol 46 (3) ◽  
pp. 371-376
Author(s):  
Edna O. Schack ◽  
Molly H. Fisher ◽  
Jonathan N. Thomas

“Noticing matters” (p. 223). With these words in the concluding chapter, Alan Schoenfeld succinctly captures the theme of this seminal book, Mathematics Teacher Noticing: Seeing Through Teachers' Eyes. The book received the American Educational Research Association's 2013 Exemplary Research in Teaching and Teacher Education Award. It addresses a variety of meanings and interpretations of teacher noticing, from Dewey's earlier work on inner and outer attention to more specific variations such as professional noticing as defined by Jacobs, Lamb, and Philipp. The chapter contributors provide the foundation and framing of teacher noticing as a construct for studying and improving teaching.


2016 ◽  
Vol 21 (1) ◽  
pp. 102-115 ◽  
Author(s):  
Stephen Gorard

This paper reminds readers of the absurdity of statistical significance testing, despite its continued widespread use as a supposed method for analysing numeric data. There have been complaints about the poor quality of research employing significance tests for a hundred years, repeated calls for researchers to stop using and reporting them, and even attempted bans. Many thousands of papers have now been written, in all areas of research, explaining why significance tests do not work; there are too many for all to be cited here. This paper summarises the logical problems as described in over 100 of these prior pieces. It then presents a series of demonstrations showing that significance tests do not work in practice; in fact, they are more likely to produce a wrong answer than a right one. The confused use of significance testing has practical and damaging consequences for people's lives. Ending the use of significance tests is therefore a pressing ethical issue for research. Anyone who knows these problems, described for over one hundred years, and who continues to teach, use or publish significance tests is acting unethically, and knowingly risking the damage that ensues.
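
Gorard's own demonstrations are not reproduced here, but a minimal simulation of the kind often used to make this point (the sample size, true effect, and replication count below are assumptions, not values taken from the paper) shows how unstable p-values are across exact replications of the same study:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, true_diff = 30, 0.4            # per-group sample size and true effect (assumed values)
p_values = []
for _ in range(1000):             # 1000 exact replications of the same study design
    control = rng.normal(0.0, 1.0, n)
    treated = rng.normal(true_diff, 1.0, n)
    p_values.append(stats.ttest_ind(treated, control).pvalue)

p_values = np.array(p_values)
print(f"share of replications with p < .05: {np.mean(p_values < 0.05):.2f}")
print(f"p-values range from {p_values.min():.4f} to {p_values.max():.2f}")
```

With identical designs and the same underlying effect, some replications come out "significant" and others do not, which is one reason critics regard the dichotomous test outcome as an unreliable basis for conclusions.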


2013 ◽  
Vol 12 (3) ◽  
pp. 345-351 ◽  
Author(s):  
Jessica Middlemis Maher ◽  
Jonathan C. Markey ◽  
Diane Ebert-May

Statistical significance testing is the cornerstone of quantitative research, but studies that fail to report measures of effect size are potentially missing a robust part of the analysis. We provide a rationale for why effect size measures should be included in quantitative discipline-based education research. Examples from both biological and educational research demonstrate the utility of effect size for evaluating practical significance. We also provide details about some effect size indices that are paired with common statistical significance tests used in educational research and offer general suggestions for interpreting effect size measures. Finally, we discuss some inherent limitations of effect size measures and provide further recommendations about reporting confidence intervals.
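
As a minimal sketch of this pairing (the scores below are hypothetical, not data from the article), Cohen's d can be computed and reported alongside an independent-samples t-test:

```python
import numpy as np
from scipy import stats

def cohens_d(a, b):
    """Cohen's d: mean difference divided by the pooled standard deviation."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * np.var(a, ddof=1) + (nb - 1) * np.var(b, ddof=1)) / (na + nb - 2)
    return (np.mean(a) - np.mean(b)) / np.sqrt(pooled_var)

# Hypothetical exam scores for two instructional conditions (illustrative only).
active  = np.array([72, 85, 90, 78, 88, 95, 81, 84], dtype=float)
lecture = np.array([70, 75, 80, 72, 78, 85, 74, 76], dtype=float)

t_stat, p_value = stats.ttest_ind(active, lecture)
print(f"t = {t_stat:.2f}, p = {p_value:.3f}, Cohen's d = {cohens_d(active, lecture):.2f}")
```

Reporting d alongside p lets readers judge whether a statistically significant difference is also large enough to matter in practice.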


Author(s):  
H. S. Styn ◽  
S. M. Ellis

The determination of the significance of differences in means, and of relationships between variables, is important in many empirical studies. Usually only statistical significance is reported, which does not necessarily indicate an important (practically significant) difference or relationship. In studies based on probability samples, effect size indices should be reported in addition to statistical significance tests in order to comment on practical significance. When complete populations or convenience samples are used, the determination of statistical significance is, strictly speaking, no longer relevant, while effect size indices can still serve as a basis for judging practical significance. This article focuses on the use of effect size indices to establish practical significance. It also shows how these indices are used in a few fields of statistical application and how they are treated in the statistical literature and in computer packages. The use of effect sizes is illustrated with a few examples from the research literature.
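
A minimal sketch of the population case (the cohort scores below are hypothetical, not taken from the article): when the two groups are complete populations, a significance test adds nothing, and the standardized difference can be judged directly against Cohen's conventional benchmarks:

```python
import numpy as np

# Hypothetical scores for every member of two complete cohorts (populations),
# so no inference from sample to population is involved.
cohort_a = np.array([64, 71, 68, 75, 70, 66, 73, 69], dtype=float)
cohort_b = np.array([60, 65, 62, 68, 64, 61, 66, 63], dtype=float)

# Population standard deviations (ddof=0), since the groups are complete.
pooled_sd = np.sqrt((np.var(cohort_a) + np.var(cohort_b)) / 2)
d = (cohort_a.mean() - cohort_b.mean()) / pooled_sd

# Cohen's conventional benchmarks: roughly 0.2 small, 0.5 medium, 0.8 large.
print(f"standardized difference d = {d:.2f}")
```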


2021 ◽  
Vol 34 (3) ◽  
pp. 78-84
Author(s):  
Richard P. Phelps

Like many science-related professional associations founded on the principles of unbiased research, nonpartisanship, and best practices, the American Educational Research Association (AERA) has become thoroughly politicized.


2006 ◽  
Vol 2 (6) ◽  
pp. 1277-1292
Author(s):  
P. D. Ditlevsen ◽  
K. K. Andersen ◽  
A. Svensson

Abstract. The significance of the apparent 1470-year cycle in the recurrence of the Dansgaard-Oeschger (DO) events, observed in the Greenland ice cores, is debated. Here we present statistical significance tests of this periodicity. The detection of a periodicity relies strongly on the accuracy of the dating of the DO events. We use both the new NGRIP GICC05 time scale, based on multi-parameter annual layer counting, and the GISP2 time scale, on which the periodicity is most pronounced. For the NGRIP dating the recurrence times are indistinguishable from a random occurrence. This is also the case for the GISP2 dating, except when the DO9 event is omitted from the record. Whether or not the record shows a truly periodic beating has strong implications for identifying the underlying cause. If the recurrence is periodic, it suggests an external cause. If the recurrence of DO events is not periodic, it points to triggering mechanisms internal to the climate system, manifested at the millennial timescale.
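
The authors' exact procedure is not reproduced here, but a simple Monte Carlo test of phase clustering on a candidate 1470-year cycle, comparing the observed events against surrogate event times drawn uniformly over the record span, sketches the general idea (the event ages below are placeholders, not the NGRIP or GISP2 datings):

```python
import numpy as np

rng = np.random.default_rng(1)
period = 1470.0
# Placeholder event ages in years; replace with an actual DO event chronology.
events = np.array([11600., 14650., 23400., 27800., 28900., 32500.,
                   33700., 35500., 38200., 40200., 41500., 43400.])

def phase_clustering(times, period):
    """Mean resultant length of event phases on the candidate cycle (Rayleigh-type R)."""
    phases = 2.0 * np.pi * (times % period) / period
    return np.abs(np.mean(np.exp(1j * phases)))

observed = phase_clustering(events, period)

# Surrogates: the same number of events scattered uniformly over the record span.
n_surrogates = 10_000
low, high = events.min(), events.max()
surrogate_r = np.array([phase_clustering(rng.uniform(low, high, size=events.size), period)
                        for _ in range(n_surrogates)])

p = np.mean(surrogate_r >= observed)   # Monte Carlo p-value for the observed clustering
print(f"observed R = {observed:.2f}, Monte Carlo p = {p:.3f}")
```

The same scheme makes the dating sensitivity easy to probe: rerunning the test after removing a single event (such as DO9) or swapping in a different chronology can change the apparent significance of the cycle.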


1998 ◽  
Vol 15 (2) ◽  
pp. 103-118 ◽  
Author(s):  
Vinson H. Sutlive ◽  
Dale A. Ulrich

The unqualified use of statistical significance tests for interpreting the results of empirical research has been called into question by researchers in a number of behavioral disciplines. This paper reviews what statistical significance tells us and what it does not, with particular attention paid to criticisms of using the results of these tests as the sole basis for evaluating the overall significance of research findings. In addition, implications for adapted physical activity research are discussed. Based on the recent literature of other disciplines, several recommendations for evaluating and reporting research findings are made. They include calculating and reporting effect sizes, selecting an alpha level larger than the conventional .05 level, placing greater emphasis on replication of results, evaluating results in a sample size context, and employing simple research designs. Adapted physical activity researchers are encouraged to use specific modifiers when describing findings as significant.
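
As a minimal illustration of the "sample size context" recommendation (the effect size and sample sizes below are assumptions chosen for illustration, not values from the paper), the same modest true effect can fall on either side of the conventional .05 threshold depending only on how many participants were tested:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
true_diff = 0.3                     # fixed true effect in standard deviation units (assumed)
for n in (20, 80, 320):             # per-group sample sizes
    a = rng.normal(true_diff, 1.0, n)
    b = rng.normal(0.0, 1.0, n)
    d = (a.mean() - b.mean()) / np.sqrt((a.var(ddof=1) + b.var(ddof=1)) / 2)
    p = stats.ttest_ind(a, b).pvalue
    print(f"n per group = {n:>3}: d = {d:.2f}, p = {p:.4f}")
```

Because p depends on sample size while d does not, reporting both helps readers separate the size of an effect from the precision with which it was estimated.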

