Permutation tests for hypothesis testing with animal social data: problems and potential solutions

Author(s):  
Damien R. Farine
Gerald G. Carter

Abstract Generating insights about a null hypothesis requires not only a good dataset, but also statistical tests that are reliable and actually address the null hypothesis of interest. Recent studies have found that permutation tests, which are widely used to test hypotheses when working with animal social network data, can suffer from high rates of type I error (false positives) and type II error (false negatives). Here, we first outline why pre-network and node permutation tests have elevated type I and II error rates. We then propose a new procedure, the double permutation test, that addresses some of the limitations of existing approaches by combining pre-network and node permutations. We conduct a range of simulations, allowing us to estimate error rates under different scenarios, including errors caused by confounding effects of social or non-social structure in the raw data. We show that double permutation tests avoid elevated type I errors, while remaining sufficiently sensitive to avoid elevated type II errors. By contrast, the existing solutions we tested, including node permutations, pre-network permutations, and regression models with control variables, all exhibit elevated errors under at least one set of simulated conditions. Type I error rates from double permutation remain close to 5% in the same scenarios where type I error rates from pre-network permutation tests exceed 30%. The double permutation test provides a potential solution to issues arising from elevated type I and type II error rates when testing hypotheses with social network data. We also discuss other approaches, including restricted node permutations, testing multiple null hypotheses, and splitting large datasets to generate replicated networks, that can strengthen our ability to make robust inferences. Finally, we highlight ways that uncertainty can be explicitly considered during the analysis using permutation-based or Bayesian methods.
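A minimal sketch of how such a double permutation test can be structured, assuming a simple node-level analysis (a network metric regressed on an individual trait): pre-network (data-stream) permutations first give each node's metric a null expectation, and a node permutation test is then run on the corrected values. The group-by-individual matrix, association index, swap routine, and regression below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(42)

# Toy data: a group-by-individual (GBI) matrix and one node-level trait
n_groups, n_ind = 60, 15
gbi = (rng.random((n_groups, n_ind)) < 0.3).astype(int)  # 1 = individual seen in group
trait = rng.normal(size=n_ind)                            # e.g., body size

def node_strength(gbi):
    """Weighted degree from a simple-ratio association index."""
    seen = gbi.astype(bool)
    n = gbi.shape[1]
    strength = np.zeros(n)
    for i in range(n):
        for j in range(i + 1, n):
            together = np.sum(seen[:, i] & seen[:, j])
            either = np.sum(seen[:, i] | seen[:, j])
            sri = together / either if either else 0.0
            strength[i] += sri
            strength[j] += sri
    return strength

def datastream_swap(gbi, rng):
    """One pre-network swap: flip a 2x2 'checkerboard' so that group sizes
    and individual sighting counts are both preserved."""
    g = gbi.copy()
    for _ in range(1000):  # retry until a legal swap is found
        r = rng.choice(g.shape[0], 2, replace=False)
        c = rng.choice(g.shape[1], 2, replace=False)
        sub = g[np.ix_(r, c)]
        if sub[0, 0] == sub[1, 1] and sub[0, 1] == sub[1, 0] and sub[0, 0] != sub[0, 1]:
            g[np.ix_(r, c)] = 1 - sub
            break
    return g

def ols_slope(x, y):
    """Slope from regressing y on x."""
    xc = x - x.mean()
    return np.sum(xc * (y - y.mean())) / np.sum(xc * xc)

# Step 1: pre-network permutations give each node's metric a null expectation
observed = node_strength(gbi)
g, perm_metrics = gbi.copy(), []
for _ in range(500):  # in practice use many more swaps, with burn-in and thinning
    g = datastream_swap(g, rng)
    perm_metrics.append(node_strength(g))
corrected = observed - np.median(perm_metrics, axis=0)

# Step 2: node permutations on the corrected metric (corrected metric ~ trait)
obs_slope = ols_slope(trait, corrected)
null_slopes = np.array([ols_slope(rng.permutation(trait), corrected)
                        for _ in range(2000)])
p_value = np.mean(np.abs(null_slopes) >= np.abs(obs_slope))
print(f"slope = {obs_slope:.3f}, double-permutation p = {p_value:.3f}")
```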

1994
Vol 19 (2)
pp. 91-101
Author(s):  
Ralph A. Alexander
Diane M. Govern

A new approximation is proposed for testing the equality of k independent means in the face of heterogeneity of variance. Monte Carlo simulations show that the new procedure has Type I error rates that are very nearly nominal and Type II error rates that are quite close to those produced by James’s (1951) second-order approximation. In addition, it is computationally the simplest approximation yet to appear, and it is easily applied to Scheffé (1959)-type multiple contrasts and to the calculation of approximate tail probabilities.
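A minimal sketch of the statistic as it is commonly implemented: each group contributes a t statistic around a variance-weighted grand mean, these are mapped to approximate standard normal deviates by a normalizing transformation, and the sum of squares is referred to a chi-square distribution with k − 1 degrees of freedom. The normalization constants below follow the form commonly reproduced in the methods literature (treat them as quoted from secondary sources rather than from the paper itself); newer SciPy releases also ship scipy.stats.alexandergovern, which can serve as a cross-check.

```python
import numpy as np
from scipy.stats import chi2

def alexander_govern(*groups):
    """Approximate test of equal means under unequal variances,
    in the style of Alexander & Govern (1994)."""
    groups = [np.asarray(g, dtype=float) for g in groups]
    n = np.array([len(g) for g in groups])
    means = np.array([g.mean() for g in groups])
    se2 = np.array([g.var(ddof=1) / len(g) for g in groups])  # squared SE of each mean
    w = (1.0 / se2) / np.sum(1.0 / se2)                       # precision weights
    grand = np.sum(w * means)                                 # variance-weighted grand mean
    t = (means - grand) / np.sqrt(se2)                        # group t statistics
    nu = n - 1.0
    a = nu - 0.5
    b = 48.0 * a**2
    c = np.sqrt(a * np.log1p(t**2 / nu))
    # normalizing transformation of t to an approximate standard normal deviate
    z = (c + (c**3 + 3 * c) / b
         - (4 * c**7 + 33 * c**5 + 240 * c**3 + 855 * c)
           / (10 * b**2 + 8 * b * c**4 + 1000 * b))
    A = np.sum(z**2)                                          # test statistic
    p = chi2.sf(A, df=len(groups) - 1)
    return A, p

# example with strongly heterogeneous variances
rng = np.random.default_rng(0)
x1, x2, x3 = rng.normal(0, 1, 20), rng.normal(0, 3, 12), rng.normal(0, 6, 8)
print(alexander_govern(x1, x2, x3))
```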


Author(s):  
Riko Kelter

Abstract Testing for differences between two groups is among the most frequently used statistical procedures in empirical research. The traditional frequentist approach is to use null hypothesis significance tests, which rely on p values to reject a null hypothesis. Recently, a substantial body of research has proposed Bayesian versions of the most common parametric and nonparametric frequentist two-sample tests, including Student’s two-sample t-test and its nonparametric counterpart, the Mann–Whitney U test. In this paper, the underlying assumptions and models of these recently proposed Bayesian two-sample tests, and their implications for practical research, are explored and contrasted with the frequentist solutions. An extensive simulation study is provided, the results of which demonstrate that the proposed Bayesian tests achieve better type I error control at slightly increased type II error rates. These results are important because balancing type I and type II errors is a crucial goal across a variety of research settings, and shifting towards the Bayesian two-sample tests while simultaneously increasing the sample size yields smaller type I error rates. Moreover, the results highlight that the differences in type II error rates between frequentist and Bayesian two-sample tests depend on the magnitude of the underlying effect.
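The type I error comparison described here is easy to reproduce in miniature. The sketch below estimates null rejection rates for a frequentist t-test at α = 0.05 and for a Bayes-factor decision rule; for simplicity it uses the BIC approximation to the Bayes factor (Wagenmakers-style) rather than the specific Bayesian two-sample tests examined in the paper, so the BF10 > 3 threshold and the resulting rates are illustrative only.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)

def bic_bayes_factor_10(x, y):
    """BIC-approximate Bayes factor for 'means differ' vs 'means equal';
    a stand-in for fully Bayesian two-sample tests."""
    data = np.concatenate([x, y])
    n = data.size
    rss0 = np.sum((data - data.mean())**2)                        # one common mean
    rss1 = np.sum((x - x.mean())**2) + np.sum((y - y.mean())**2)  # separate means
    bic0 = n * np.log(rss0 / n) + 1 * np.log(n)
    bic1 = n * np.log(rss1 / n) + 2 * np.log(n)
    return np.exp((bic0 - bic1) / 2)                              # BF10

n_sim, n_per_group = 5000, 30
reject_freq = reject_bayes = 0
for _ in range(n_sim):
    x = rng.normal(size=n_per_group)   # null is true: both groups identical
    y = rng.normal(size=n_per_group)
    if stats.ttest_ind(x, y).pvalue < 0.05:
        reject_freq += 1
    if bic_bayes_factor_10(x, y) > 3:  # 'moderate evidence' threshold
        reject_bayes += 1

print(f"type I error, t-test:        {reject_freq / n_sim:.3f}")
print(f"type I error, BF10 > 3 rule: {reject_bayes / n_sim:.3f}")
```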


Methodology
2009
Vol 5 (2)
pp. 60-70
Author(s):  
W. Holmes Finch
Teresa Davenport

Permutation testing has been suggested as an alternative to the standard approximate F tests used in multivariate analysis of variance (MANOVA). These approximate tests, such as Wilks’ Lambda and Pillai’s Trace, have been shown to perform poorly when the assumptions of normally distributed dependent variables and homogeneity of group covariance matrices are violated. Because Monte Carlo permutation tests do not rely on distributional assumptions, they may be expected to work better than their approximate counterparts when the data do not conform to these assumptions. The current simulation study compared the performance of four standard MANOVA test statistics with their Monte Carlo permutation-based counterparts under a variety of small-sample conditions, both when the assumptions were met and when they were not. Results suggest that for sample sizes of 50 subjects, power is very low for all the statistics. In addition, Type I error rates for both the approximate F and Monte Carlo tests were inflated under the condition of nonnormal data and unequal covariance matrices. In general, the performance of the Monte Carlo permutation tests was slightly better in terms of Type I error rates and power when the assumptions of normality and homogeneous covariance matrices were both violated. It should be noted that these simulations were based on the three-group case only, so the results presented here generalize only to similar situations.
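A minimal sketch of a Monte Carlo permutation counterpart to one standard MANOVA statistic (Wilks’ Lambda): the statistic is recomputed after repeatedly shuffling group labels, and the p value is the proportion of permuted values at least as extreme as the observed one. The three-group, two-variable toy data below are illustrative and do not reproduce the study's simulation design.

```python
import numpy as np

rng = np.random.default_rng(3)

def wilks_lambda(X, labels):
    """Wilks' Lambda: det(within-group SSCP) / det(total SSCP)."""
    grand = X.mean(axis=0)
    T = (X - grand).T @ (X - grand)                      # total SSCP
    W = np.zeros_like(T)
    for g in np.unique(labels):
        Xg = X[labels == g]
        W += (Xg - Xg.mean(axis=0)).T @ (Xg - Xg.mean(axis=0))
    return np.linalg.det(W) / np.linalg.det(T)

# toy data: 3 groups, 2 correlated dependent variables, about 50 subjects in total
n_per_group, n_groups = 17, 3
labels = np.repeat(np.arange(n_groups), n_per_group)
X = rng.multivariate_normal([0, 0], [[1, 0.5], [0.5, 1]], size=n_groups * n_per_group)
X[labels == 2] += [0.6, 0.3]                             # a modest group effect

obs = wilks_lambda(X, labels)
# Monte Carlo permutation test: shuffle labels; smaller Lambda means a stronger effect
n_perm = 5000
null = np.array([wilks_lambda(X, rng.permutation(labels)) for _ in range(n_perm)])
p = (np.sum(null <= obs) + 1) / (n_perm + 1)
print(f"Wilks' Lambda = {obs:.3f}, permutation p = {p:.4f}")
```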


Genetics
1988
Vol 118 (4)
pp. 705-711
Author(s):  
J. A. Stoddart
J. F. Taylor

Abstract We show that a commonly used statistic of genotypic diversity can be used to detect one form of deviation from panmixia, viz. clonal reproduction, by comparing observed and predicted sample statistics. The characteristics of the statistic, in particular its relationship with population genotypic diversity, are formalised, and a method of predicting the genotypic diversity of a sample drawn from a panmictic population using allelic frequencies and sample size is developed. The sensitivity of some possible tests of significance of the deviation from panmictic expectations is examined using computer simulations. Goodness-of-fit tests are robust but produce an unacceptably high level of type II error. With means and variances calculated either from Monte Carlo simulations or from distributional and series approximations, t-tests perform better than goodness-of-fit tests. Under simulation, both forms of t-test exhibit acceptable rates of type I error. Rates of type II error are usually large when allele frequencies are severely skewed, although the latter form of t-test performs better under those conditions.
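A minimal sketch of the comparison of observed and predicted genotypic diversity, assuming the commonly cited form of the statistic (G = 1 / Σ p_g², with p_g the observed multilocus genotype frequencies) and a Monte Carlo null distribution generated from allele frequencies under random mating and independent loci. The t-style comparison mirrors the approach described above, but the allele frequencies and the partly "clonal" observed sample are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(5)

def genotypic_diversity(genotypes):
    """Genotypic diversity G = 1 / sum(p_g^2), with p_g the observed
    multilocus genotype frequencies."""
    _, counts = np.unique(genotypes, axis=0, return_counts=True)
    p = counts / counts.sum()
    return 1.0 / np.sum(p**2)

def simulate_panmictic_sample(allele_freqs, n, rng):
    """Draw n multilocus genotypes assuming random mating and
    independence among loci (the panmictic expectation)."""
    loci = []
    for freqs in allele_freqs:
        a1 = rng.choice(len(freqs), size=n, p=freqs)
        a2 = rng.choice(len(freqs), size=n, p=freqs)
        loci.append(np.sort(np.stack([a1, a2], axis=1), axis=1))  # unordered genotype
    return np.concatenate(loci, axis=1)

# illustrative allele frequencies at three loci, and an illustrative sample size
allele_freqs = [np.array([0.6, 0.4]), np.array([0.5, 0.3, 0.2]), np.array([0.7, 0.3])]
n = 50

# 'observed' sample: simulated, then given repeated genotypes as a crude stand-in for clones
observed = simulate_panmictic_sample(allele_freqs, n, rng)
observed[: n // 3] = observed[0]
G_obs = genotypic_diversity(observed)

# Monte Carlo null distribution of G under panmixia, summarized by mean and variance
G_null = np.array([genotypic_diversity(simulate_panmictic_sample(allele_freqs, n, rng))
                   for _ in range(2000)])
t = (G_obs - G_null.mean()) / G_null.std(ddof=1)
print(f"G_obs = {G_obs:.2f}, E[G | panmixia] = {G_null.mean():.2f}, t = {t:.2f}")
```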


1986
Vol 16 (4)
pp. 710-712
Author(s):  
B. Côté
C. Camiré

Data from dense plantings (33 × 33 cm) of black alder (Alnus glutinosa (L.) Gaertn.) and hybrid poplar (Populus nigra L. × Populus trichocarpa Torr. and Gray) illustrate a simple statistical procedure for assessing the extent of edge effects in small plots. A reference mean free of edge effect for the response variable under study must first be determined. Relative estimates corresponding to plot means produced by incremental removal of border rows are then screened for acceptability using a type I error criterion and an approximation of type II error. The procedure is applicable to any response variable and can be applied to databases having secondary maxima in inner rows.
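A minimal sketch of the screening idea, with invented inputs: a square plot of tree heights whose growth is depressed near the edges, a reference mean taken from the innermost core, and, for each number of border rows removed, a t-test against that reference (the type I error criterion) together with a crude normal-approximation power calculation (the approximation of type II error). The acceptance thresholds and the power formula are illustrative assumptions, not the paper's exact screening rule.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(11)

# toy plot: 12 x 12 trees, growth depressed near the edges (an assumed edge effect)
side = 12
rows, cols = np.meshgrid(np.arange(side), np.arange(side), indexing="ij")
dist_to_edge = np.minimum.reduce([rows, cols, side - 1 - rows, side - 1 - cols])
height = rng.normal(loc=4.0 + 0.4 * np.minimum(dist_to_edge, 3), scale=0.5)

# reference mean assumed free of edge effect: the innermost core of the plot
ref_mean = height[4:-4, 4:-4].mean()

alpha = 0.05                    # type I error criterion
delta = 0.15 * ref_mean         # difference considered biologically meaningful
for k in range(side // 2 - 1):
    inner = height[k: side - k, k: side - k].ravel()
    t, p = stats.ttest_1samp(inner, ref_mean)
    # crude power approximation for detecting a difference of size delta (type II screen)
    se = inner.std(ddof=1) / np.sqrt(inner.size)
    power = 1 - stats.norm.cdf(stats.norm.ppf(1 - alpha / 2) - delta / se)
    ok = (p >= alpha) and (power >= 0.8)
    print(f"remove {k} border rows: mean={inner.mean():.2f}, p={p:.3f}, "
          f"approx power={power:.2f}, acceptable={ok}")
```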


1996
Vol 26 (2)
pp. 149-160
Author(s):  
J. K. Belknap
S. R. Mitchell
L. A. O'Toole
M. L. Helms
J. C. Crabbe
