Answering Two Criticisms of Hypothesis Testing

1999 ◽  
Vol 85 (1) ◽  
pp. 3-18 ◽  
Author(s):  
Les Leventhal

Two generations of methodologists have criticized hypothesis testing by claiming that most point null hypotheses are false and that hypothesis tests do not provide the probability that the null hypothesis is true. These criticisms are answered. (1) The point-null criticism, if correct, undermines only the traditional two-tailed test, not the one-tailed test or the little-known directional two-tailed test. The directional two-tailed test is the only hypothesis test that, properly used, provides for deciding the direction of a parameter, that is, deciding whether a parameter is positive or negative or whether it falls above or below some interesting nonzero value. The point-null criticism becomes unimportant if we replace traditional one- and two-tailed tests with the directional two-tailed test, a replacement already recommended for most purposes by previous writers. (2) If one interprets probability as a relative frequency, as most textbooks do, then the concept of probability cannot meaningfully be attached to the truth of an hypothesis; hence, it is meaningless to ask for the probability that the null is true. (3) Hypothesis tests provide the next best thing, namely, a relative frequency probability that the decision about the statistical hypotheses is correct. Two arguments are offered.
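
A minimal sketch of the three-decision logic behind the directional two-tailed test described above, assuming a one-sample t-test in Python; the data and the alpha level are illustrative, not from the article:

```python
# Sketch of the "directional two-tailed test" decision rule: split alpha across
# both tails and decide the DIRECTION of the parameter rather than merely
# rejecting a point null. Sample values and alpha are invented for illustration.
from scipy import stats

sample = [0.8, 1.2, -0.3, 0.9, 1.5, 0.4, 1.1, 0.7]   # hypothetical data
alpha = 0.05                                          # conventional level

t, p_two_sided = stats.ttest_1samp(sample, popmean=0.0)

if p_two_sided < alpha and t > 0:
    decision = "conclude the parameter is positive"
elif p_two_sided < alpha and t < 0:
    decision = "conclude the parameter is negative"
else:
    decision = "withhold judgment about the direction"

print(f"t = {t:.3f}, two-sided p = {p_two_sided:.3f}: {decision}")
```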

2019 ◽  
Vol 48 (4) ◽  
pp. 241-243
Author(s):  
Jordan Rickles ◽  
Jessica B. Heppen ◽  
Elaine Allensworth ◽  
Nicholas Sorensen ◽  
Kirk Walters

In response to the concerns White raises in his technical comment on Rickles, Heppen, Allensworth, Sorensen, and Walters (2018), we discuss whether it would have been appropriate to test for nominally equivalent outcomes, given that the study was initially conceived and designed to test for significant differences, and that the conclusion of no difference was not solely based on a null hypothesis test. To further support the article’s conclusion, confidence intervals for the null hypothesis tests and a test of equivalence are provided.
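
For illustration only, a hedged sketch of a two-one-sided-tests (TOST) equivalence check of the kind the reply describes; the scores, the equivalence margin, and the alpha level are invented assumptions, not the study's values:

```python
# TOST logic: the treatment-control difference is declared "equivalent to zero"
# only if it is significantly above the lower bound AND significantly below the
# upper bound. All numbers below are placeholders, not the study's data.
import numpy as np
from scipy import stats

treatment = np.array([71.0, 75, 69, 73, 70, 74, 72, 68])   # hypothetical scores
control   = np.array([70.0, 74, 71, 72, 69, 73, 70, 71])
delta = 3.0     # assumed equivalence margin in outcome units
alpha = 0.05

v1 = treatment.var(ddof=1) / len(treatment)
v2 = control.var(ddof=1) / len(control)
diff = treatment.mean() - control.mean()
se = np.sqrt(v1 + v2)
# Welch-Satterthwaite degrees of freedom
df = (v1 + v2) ** 2 / (v1 ** 2 / (len(treatment) - 1) + v2 ** 2 / (len(control) - 1))

p_lower = stats.t.sf((diff + delta) / se, df)    # H0: diff <= -delta
p_upper = stats.t.cdf((diff - delta) / se, df)   # H0: diff >= +delta

print(f"difference = {diff:.2f}")
print(f"equivalent within the margin: {max(p_lower, p_upper) < alpha}")
```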


Author(s):  
Richard McCleary ◽  
David McDowall ◽  
Bradley J. Bartos

Chapter 6 addresses the subcategory of internal validity that Shadish et al. define as statistical conclusion validity, or the “validity of inferences about the correlation (covariance) between treatment and outcome.” The common threats to statistical conclusion validity can arise, or become plausible, through either model misspecification or hypothesis testing. The risk of a serious model misspecification is inversely proportional to the length of the time series, for example, and so is the risk of misstating the Type I and Type II error rates. Threats to statistical conclusion validity arise from the classical and modern hybrid significance testing structures; the serious threats that weigh heavily in p-value tests are shown to be undefined in Bayesian tests. While the particularly vexing threats raised by modern null hypothesis testing are resolved by eliminating the modern null hypothesis test, threats to statistical conclusion validity would inevitably persist and new threats would arise.


1998 ◽  
Vol 21 (2) ◽  
pp. 215-216 ◽  
Author(s):  
David Rindskopf

Unfortunately, reading Chow's work is likely to leave the reader more confused than enlightened. My preferred solutions to the “controversy” about null-hypothesis testing are: (1) recognize that we really want to test the hypothesis that an effect is “small,” not null, and (2) use Bayesian methods, which are much more in keeping with the way humans naturally think than are classical statistical methods.


Author(s):  
D. Brynn Hibbert ◽  
J. Justin Gooding

• To understand the concept of the null hypothesis and the role of Type I and Type II errors.
• To test that data are normally distributed and whether a datum is an outlier.
• To determine whether there is systematic error in the mean of measurement results.
• To perform tests to compare the means of two sets of data. …

One of the uses to which data analysis is put is to answer questions about the data, or about the system that the data describes. In the former category are “is the data normally distributed?” and “are there any outliers in the data?” (see the discussions in chapter 1). Questions about the system might be “is the level of alcohol in the suspect’s blood greater than 0.05 g/100 mL?” or “does the new sensor give the same results as the traditional method?” In answering these questions we determine the probability of finding the data given the truth of a stated hypothesis; hence “hypothesis testing.” A hypothesis is a statement that might, or might not, be true. Usually the hypothesis is set up in such a way that it is possible to calculate the probability (P) of the data (or the test statistic calculated from the data) given the hypothesis, and then to make a decision about whether the hypothesis is to be accepted (high P) or rejected (low P). A particular case of a hypothesis test is one that determines whether or not the difference between two values is significant: a significance test. For this case we actually put forward the hypothesis that there is no real difference and the observed difference arises from random effects; it is called the null hypothesis (H0). If the probability that the data are consistent with the null hypothesis falls below a predetermined low value (say 0.05 or 0.01), then the hypothesis is rejected at that probability. Therefore, P < 0.05 means that if the null hypothesis were true we would find the observed data (or more accurately the value of the statistic, or greater, calculated from the data) in less than 5% of repeated experiments.
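
A minimal sketch of the significance-test decision rule described above, assuming a two-sample t-test in Python on hypothetical sensor readings; the measurement values and the 0.05 threshold are illustrative, not from the chapter:

```python
# H0: the new sensor and the traditional method give the same mean result.
# Reject H0 when the probability P of the data under H0 falls below 0.05.
from scipy import stats

sensor      = [0.051, 0.049, 0.053, 0.050, 0.052]   # hypothetical g/100 mL readings
traditional = [0.048, 0.047, 0.049, 0.048, 0.050]

t_stat, p_value = stats.ttest_ind(sensor, traditional)

if p_value < 0.05:
    print(f"P = {p_value:.3f}: reject H0, the methods appear to differ")
else:
    print(f"P = {p_value:.3f}: do not reject H0, no evidence of a difference")
```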


2021 ◽  
Author(s):  
◽  
Thuong Nguyen

For a long time, the goodness of fit (GOF) tests have been one of the main objects of the theory of testing of statistical hypotheses. These tests possess two essential properties. Firstly, the asymptotic distribution of GOF test statistics under the null hypothesis is free from the underlying distribution within the hypothetical family. Secondly, they are of omnibus nature, which means that they are sensitive to every alternative to the null hypothesis.

GOF tests are typically based on non-linear functionals from the empirical process. The first idea to change the focus from particular functionals to the transformation of the empirical process itself into another process, which will be asymptotically distribution free, was first formulated and accomplished by Khmaladze [Estate1]. Recently, the same author in consecutive papers [Estate] and [Estate2] introduced another method, called here the Khmaladze-2 transformation, which is distinct from the first Khmaladze transformation, can be used for an even wider class of hypothesis testing problems, and is simpler in implementation.

This thesis shows how the approach could be used to create the asymptotically distribution free empirical process in two well-known testing problems.

The first problem is the problem of testing independence of two discrete random variables/vectors in a contingency table context. Although this problem has a long history, the use of GOF tests for it has been restricted to only one possible choice: the chi-square test and its several modifications. We start our approach by viewing the problem as one of parametric hypothesis testing and suggest looking at the marginal distributions as parameters. The crucial difficulty is that when the dimension of the table is large, the dimension of the vector of parameters is large as well. Nevertheless, we demonstrate the efficiency of our approach and confirm by simulations the distribution free property of the new empirical process and the GOF tests based on it. The number of parameters is as big as 30. As an additional benefit, we point out some cases when the GOF tests based on the new process are more powerful than the traditional chi-square one.

The second problem is testing whether a distribution has a regularly varying tail. This problem is inspired mainly by the fact that regularly varying tail distributions play an essential role in characterization of the domain of attraction of extreme value distributions. While there are numerous studies on estimating the exponent of regular variation of the tail, using GOF tests for testing relevant distributions has appeared in few papers. We contribute to this latter aspect a construction of a class of GOF tests for testing regularly varying tail distributions.
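
As a point of reference for the “only one possible choice” mentioned above, here is a minimal sketch of the traditional chi-square test of independence on an invented contingency table; it is not the thesis's transformed empirical-process test:

```python
# Traditional chi-square test of independence for a contingency table.
# The observed counts below are placeholders for illustration only.
import numpy as np
from scipy.stats import chi2_contingency

observed = np.array([[30, 10, 20],
                     [20, 25, 15]])   # hypothetical 2x3 table of counts

chi2, p, dof, expected = chi2_contingency(observed)
print(f"chi2 = {chi2:.2f}, df = {dof}, p = {p:.4f}")
# A small p-value leads to rejecting independence of the row and column variables.
```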


2009 ◽  
Vol 33 (2) ◽  
pp. 81-86 ◽  
Author(s):  
Douglas Curran-Everett

Learning about statistics is a lot like learning about science: the learning is more meaningful if you can actively explore. This second installment of Explorations in Statistics delves into test statistics and P values, two concepts fundamental to the test of a scientific null hypothesis. The essence of a test statistic is that it compares what we observe in the experiment to what we expect to see if the null hypothesis is true. The P value associated with the magnitude of that test statistic answers this question: if the null hypothesis is true, what proportion of possible values of the test statistic are at least as extreme as the one I got? Although statisticians continue to stress the limitations of hypothesis tests, there are two realities we must acknowledge: hypothesis tests are ingrained within science, and the simple test of a null hypothesis can be useful. As a result, it behooves us to explore the notions of hypothesis tests, test statistics, and P values.
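
A minimal simulation sketch of these two ideas, with invented data and a sign-flip null distribution chosen purely for illustration:

```python
# A test statistic contrasts what we observed with what H0 expects; the P value
# is the proportion of null-hypothesis values of that statistic at least as
# extreme as the one we got. Data and simulation settings are assumptions.
import numpy as np

rng = np.random.default_rng(0)
sample = np.array([1.9, 0.4, 1.1, 2.3, 0.8, 1.6, 1.2, 0.5])  # hypothetical data

def t_stat(x):
    # Standardized distance of the sample mean from the H0 mean of zero.
    return x.mean() / (x.std(ddof=1) / np.sqrt(len(x)))

observed = t_stat(sample)

# Approximate the null distribution by sign-flipping the data, which imposes
# the H0 claim that the values are centered on zero.
null_stats = np.array([
    t_stat(sample * rng.choice([-1, 1], size=len(sample)))
    for _ in range(10000)
])

p_value = np.mean(np.abs(null_stats) >= abs(observed))
print(f"t = {observed:.2f}, simulated two-sided P = {p_value:.4f}")
```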


2000 ◽  
Vol 87 (2) ◽  
pp. 579-581 ◽  
Author(s):  
Ronald C. Serlin

In a recent article, Leventhal (1999) responds to two criticisms of hypothesis testing by showing that the one-tailed test and the directional two-tailed test are valid even if all point null hypotheses are false, and that hypothesis tests can provide the probability that decisions based on the tests are correct. Unfortunately, the falseness of all point null hypotheses affects the operating characteristics of the directional two-tailed test, seeming to weaken certain of Leventhal's arguments in favor of this procedure.


2018 ◽  
Vol 2 (2) ◽  
pp. 43-57
Author(s):  
M. Ridhwan ◽  
Muhammad Taufik Ihsan ◽  
Naskah Naskah

The purpose of this study was to investigate the effect of using a comic strips strategy on students’ reading comprehension and writing ability at MTsN 1 Pekanbaru. A quasi-experimental design with non-equivalent pre-test and post-test groups was applied. The sample was two classes (VIII 3 and VIII 4), consisting of 20 students in the treatment class and 20 students in the control class. The data were analyzed with SPSS 20.0 using independent-sample and paired-sample t-tests. The findings revealed a significant effect of the comic strips strategy on students’ reading comprehension: on the paired-sample t-test, the treatment class mean was 77 and the control class mean was 64.5, the post-test t-value was -7.149, and the sig. (2-tailed) value of 0.000 was smaller than the 0.05 significance level. The data also revealed a significant effect on students’ writing ability: the treatment class mean was 79.6 and the control class mean was 54.2, the post-test t-value was -21.9, and the sig. (2-tailed) value of 0.000 was again smaller than 0.05. Therefore, the null hypothesis was rejected and the alternative hypothesis was accepted. From these data it can be concluded that there is a significant effect of using the comic strips strategy on students’ reading comprehension and writing ability.
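
For illustration only (the study analyzed its own data in SPSS 20.0), a hedged sketch of the same decision rule with an independent-sample t-test on invented post-test scores:

```python
# Compare the two-tailed significance of an independent-sample t-test against
# the 0.05 level. The scores below are placeholders, not the study's data.
from scipy import stats

treatment_posttest = [80, 75, 78, 82, 77, 79, 76, 81]   # hypothetical scores
control_posttest   = [65, 62, 66, 64, 63, 67, 61, 68]

t_stat, sig_2_tailed = stats.ttest_ind(treatment_posttest, control_posttest)

if sig_2_tailed < 0.05:
    print(f"t = {t_stat:.2f}, p = {sig_2_tailed:.3f}: reject H0, significant effect")
else:
    print(f"t = {t_stat:.2f}, p = {sig_2_tailed:.3f}: fail to reject H0")
```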


2014 ◽  
Vol 6 (1) ◽  
pp. 1032-1035 ◽  
Author(s):  
Ramzi Suleiman

The research on quasi-luminal neutrinos has sparked several experimental studies testing the "speed of light limit" hypothesis. To date, the overall evidence favors the "null" hypothesis, stating that there is no significant difference between the observed velocities of light and neutrinos. Despite numerous theoretical models proposed to explain the neutrinos' behavior, no attempt has been undertaken to predict the experimentally produced results. This paper presents a simple novel extension of Newton's mechanics to the domain of relativistic velocities. For a typical neutrino-velocity experiment, the proposed model is utilized to derive a general expression for . Comparison of the model's predictions with the results of six neutrino-velocity experiments, conducted by five collaborations, reveals that the model predicts all the reported results with striking accuracy. Because the direction of the neutrino flight matters in the proposed model, its success in accounting for all the tested data indicates a complete collapse of the Lorentz symmetry principle in situations involving quasi-luminal particles moving in two opposite directions. This conclusion is supported by previous findings showing that a Sagnac effect identical to the one documented for radial motion also occurs in linear motion.

