type i error rates Latest Research Papers

Correcting the Bias Correction for the Bootstrap Confidence Interval in Mediation Analysis

10.31234/osf.io/pe4m2 ◽

2021 ◽

Author(s):

Tristan Tibbe ◽

Amanda Kay Montoya

Keyword(s):

Confidence Interval ◽

Error Rate ◽

Indirect Effect ◽

Mediation Analysis ◽

Type I Error ◽

Error Rates ◽

Type I ◽

Bootstrap Confidence Interval ◽

Type I Error Rates ◽

Type I Error Rate

The bias-corrected bootstrap confidence interval (BCBCI) was once the method of choice for conducting inference on the indirect effect in mediation analysis due to its high power in small samples, but now it is criticized by methodologists for its inflated type I error rates. In its place, the percentile bootstrap confidence interval (PBCI), which does not adjust for bias, is currently the recommended inferential method for indirect effects. This study proposes two alternative bias-corrected bootstrap methods for creating confidence intervals around the indirect effect. Using a Monte Carlo simulation, these methods were compared to the BCBCI, PBCI, and a bias-corrected method introduced by Chen and Fritz (2021). The results showed that the methods perform on a continuum, where the BCBCI has the best balance (i.e., having closest to an equal proportion of CIs falling above and below the true effect), highest power, and highest type I error rate; the PBCI has the worst balance, lowest power, and lowest type I error rate; and the alternative bias-corrected methods fall between these two methods on all three performance criteria. An extension of the original simulation that compared the bias-corrected methods to the PBCI after controlling for type I error rate inflation suggests that the increased power of these methods might only be due to their higher type I error rates. Thus, if control over the type I error rate is desired, the PBCI is still the recommended method for use with the indirect effect. Future research should examine the performance of these methods in the presence of missing data, confounding variables, and other real-world complications to enhance the generalizability of these results.

A robust mean and variance test with application to high-dimensional phenotypes

European Journal of Epidemiology ◽

10.1007/s10654-021-00805-w ◽

2021 ◽

Author(s):

James R. Staley ◽

Frank Windmeijer ◽

Matthew Suderman ◽

Matthew S. Lyon ◽

George Davey Smith ◽

...

Keyword(s):

Dna Methylation ◽

Gestational Age ◽

Type I Error ◽

Association Studies ◽

Error Rates ◽

High Dimensional ◽

Type I ◽

Type I Error Rates ◽

Cpg Sites ◽

Using Data

AbstractMost studies of continuous health-related outcomes examine differences in mean levels (location) of the outcome by exposure. However, identifying effects on the variability (scale) of an outcome, and combining tests of mean and variability (location-and-scale), could provide additional insights into biological mechanisms. A joint test could improve power for studies of high-dimensional phenotypes, such as epigenome-wide association studies of DNA methylation at CpG sites. One possible cause of heterogeneity of variance is a variable interacting with exposure in its effect on outcome, so a joint test of mean and variability could help in the identification of effect modifiers. Here, we review a scale test, based on the Brown-Forsythe test, for analysing variability of a continuous outcome with respect to both categorical and continuous exposures, and develop a novel joint location-and-scale score (JLSsc) test. These tests were compared to alternatives in simulations and used to test associations of mean and variability of DNA methylation with gender and gestational age using data from the Accessible Resource for Integrated Epigenomics Studies (ARIES). In simulations, the Brown-Forsythe and JLSsc tests retained correct type I error rates when the outcome was not normally distributed in contrast to the other approaches tested which all had inflated type I error rates. These tests also identified > 7500 CpG sites for which either mean or variability in cord blood methylation differed according to gender or gestational age. The Brown-Forsythe test and JLSsc are robust tests that can be used to detect associations not solely driven by a mean effect.

Cluster Wild Bootstrapping to Handle Dependent Effect Sizes in Meta-Analysis with a Small Number of Studies

10.31222/osf.io/x6uhk ◽

2021 ◽

Author(s):

Megha Joshi ◽

James E Pustejovsky ◽

S. Natasha Beretvas

Keyword(s):

Effect Size ◽

Type I Error ◽

Meta Analysis ◽

Error Rates ◽

Small Sample ◽

Type I ◽

Hypothesis Tests ◽

Type I Error Rates ◽

Meta Analyses ◽

Small Sample Correction

The most common and well-known meta-regression models work under the assumption that there is only one effect size estimate per study and that the estimates are independent. However, meta-analytic reviews of social science research often include multiple effect size estimates per primary study, leading to dependence in the estimates. Some meta-analyses also include multiple studies conducted by the same lab or investigator, creating another potential source of dependence. An increasingly popular method to handle dependence is robust variance estimation (RVE), but this method can result in inflated Type I error rates when the number of studies is small. Small-sample correction methods for RVE have been shown to control Type I error rates adequately but may be overly conservative, especially for tests of multiple-contrast hypotheses. We evaluated an alternative method for handling dependence, cluster wild bootstrapping, which has been examined in the econometrics literature but not in the context of meta-analysis. Results from two simulation studies indicate that cluster wild bootstrapping maintains adequate Type I error rates and provides more power than extant small sample correction methods, particularly for multiple-contrast hypothesis tests. We recommend using cluster wild bootstrapping to conduct hypothesis tests for meta-analyses with a small number of studies. We have also created an R package that implements such tests.

Robust Chi-Square in Extreme and Boundary Conditions: Comments on Jak et al. (2021)

Psych ◽

10.3390/psych3030035 ◽

2021 ◽

Vol 3 (3) ◽

pp. 542-551

Author(s):

Tihomir Asparouhov ◽

Bengt Muthén

Keyword(s):

Boundary Conditions ◽

Type I Error ◽

Error Rates ◽

Type I ◽

Chi Square ◽

Type I Error Rates ◽

Chi Square Test

In this article we describe a modification of the robust chi-square test of fit that yields more accurate type I error rates when the estimated model is at the boundary of the admissible space.

Bayesian Hodges-Lehmann tests for statistical equivalence in the two-sample setting: Power analysis, type I error rates and equivalence boundary selection in biomedical research

BMC Medical Research Methodology ◽

10.1186/s12874-021-01341-7 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Riko Kelter

Keyword(s):

Biomedical Research ◽

Bayes Factor ◽

Type I Error ◽

Error Rates ◽

Type I ◽

Equivalence Testing ◽

Type I Error Rates ◽

Equivalence Tests ◽

Statistical Equivalence ◽

The Relationship

Abstract Background Null hypothesis significance testing (NHST) is among the most frequently employed methods in the biomedical sciences. However, the problems of NHST and p-values have been discussed widely and various Bayesian alternatives have been proposed. Some proposals focus on equivalence testing, which aims at testing an interval hypothesis instead of a precise hypothesis. An interval hypothesis includes a small range of parameter values instead of a single null value and the idea goes back to Hodges and Lehmann. As researchers can always expect to observe some (although often negligibly small) effect size, interval hypotheses are more realistic for biomedical research. However, the selection of an equivalence region (the interval boundaries) often seems arbitrary and several Bayesian approaches to equivalence testing coexist. Methods A new proposal is made how to determine the equivalence region for Bayesian equivalence tests based on objective criteria like type I error rate and power. Existing approaches to Bayesian equivalence testing in the two-sample setting are discussed with a focus on the Bayes factor and the region of practical equivalence (ROPE). A simulation study derives the necessary results to make use of the new method in the two-sample setting, which is among the most frequently carried out procedures in biomedical research. Results Bayesian Hodges-Lehmann tests for statistical equivalence differ in their sensitivity to the prior modeling, power, and the associated type I error rates. The relationship between type I error rates, power and sample sizes for existing Bayesian equivalence tests is identified in the two-sample setting. Results allow to determine the equivalence region based on the new method by incorporating such objective criteria. Importantly, results show that not only can prior selection influence the type I error rate and power, but the relationship is even reverse for the Bayes factor and ROPE based equivalence tests. Conclusion Based on the results, researchers can select between the existing Bayesian Hodges-Lehmann tests for statistical equivalence and determine the equivalence region based on objective criteria, thus improving the reproducibility of biomedical research.

Consequences of ignoring clustering in linear regression

BMC Medical Research Methodology ◽

10.1186/s12874-021-01333-7 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Georgia Ntani ◽

Hazel Inskip ◽

Clive Osmond ◽

David Coggon

Keyword(s):

Confidence Intervals ◽

Type I Error ◽

Explanatory Variable ◽

Simulated Data ◽

Error Rates ◽

Outcome Variable ◽

Type I ◽

Type I Error Rates ◽

True Value ◽

Coverage Rates

Abstract Background Clustering of observations is a common phenomenon in epidemiological and clinical research. Previous studies have highlighted the importance of using multilevel analysis to account for such clustering, but in practice, methods ignoring clustering are often employed. We used simulated data to explore the circumstances in which failure to account for clustering in linear regression could lead to importantly erroneous conclusions. Methods We simulated data following the random-intercept model specification under different scenarios of clustering of a continuous outcome and a single continuous or binary explanatory variable. We fitted random-intercept (RI) and ordinary least squares (OLS) models and compared effect estimates with the “true” value that had been used in simulation. We also assessed the relative precision of effect estimates, and explored the extent to which coverage by 95% confidence intervals and Type I error rates were appropriate. Results We found that effect estimates from both types of regression model were on average unbiased. However, deviations from the “true” value were greater when the outcome variable was more clustered. For a continuous explanatory variable, they tended also to be greater for the OLS than the RI model, and when the explanatory variable was less clustered. The precision of effect estimates from the OLS model was overestimated when the explanatory variable varied more between than within clusters, and was somewhat underestimated when the explanatory variable was less clustered. The cluster-unadjusted model gave poor coverage rates by 95% confidence intervals and high Type I error rates when the explanatory variable was continuous. With a binary explanatory variable, coverage rates by 95% confidence intervals and Type I error rates deviated from nominal values when the outcome variable was more clustered, but the direction of the deviation varied according to the overall prevalence of the explanatory variable, and the extent to which it was clustered. Conclusions In this study we identified circumstances in which application of an OLS regression model to clustered data is more likely to mislead statistical inference. The potential for error is greatest when the explanatory variable is continuous, and the outcome variable more clustered (intraclass correlation coefficient is ≥ 0.01).

Directional-sum test for nonparametric Behrens-Fisher problem with applications to the dietary intervention trial

Statistical Methods in Medical Research ◽

10.1177/09622802211002864 ◽

2021 ◽

pp. 096228022110028

Author(s):

Zhen Meng ◽

Qinglong Yang ◽

Qizhai Li ◽

Baoxue Zhang

Keyword(s):

Dietary Intervention ◽

Type I Error ◽

Statistical Significance ◽

Error Rates ◽

Type I ◽

Simulation Studies ◽

Intervention Trial ◽

Combination Strategy ◽

Type I Error Rates ◽

Dietary Intervention Trial

For a nonparametric Behrens-Fisher problem, a directional-sum test is proposed based on division-combination strategy. A one-layer wild bootstrap procedure is given to calculate its statistical significance. We conduct simulation studies with data generated from lognormal, t and Laplace distributions to show that the proposed test can control the type I error rates properly and is more powerful than the existing rank-sum and maximum-type tests under most of the considered scenarios. Applications to the dietary intervention trial further show the performance of the proposed test.

Tissue specificity-aware TWAS (TSA-TWAS) framework identifies novel associations with metabolic, immunologic, and virologic traits in HIV-positive adults

PLoS Genetics ◽

10.1371/journal.pgen.1009464 ◽

2021 ◽

Vol 17 (4) ◽

pp. e1009464

Author(s):

Binglan Li ◽

Yogasudha Veturi ◽

Anurag Verma ◽

Yuki Bradford ◽

Eric S. Daar ◽

...

Keyword(s):

Tissue Specificity ◽

Type I Error ◽

Density Lipoprotein ◽

Error Rates ◽

Gene Prioritization ◽

Type I ◽

Uridine Diphosphate ◽

Analytic Framework ◽

Type I Error Rates ◽

Transcriptional Regulatory Mechanisms

As a type of relatively new methodology, the transcriptome-wide association study (TWAS) has gained interest due to capacity for gene-level association testing. However, the development of TWAS has outpaced statistical evaluation of TWAS gene prioritization performance. Current TWAS methods vary in underlying biological assumptions about tissue specificity of transcriptional regulatory mechanisms. In a previous study from our group, this may have affected whether TWAS methods better identified associations in single tissues versus multiple tissues. We therefore designed simulation analyses to examine how the interplay between particular TWAS methods and tissue specificity of gene expression affects power and type I error rates for gene prioritization. We found that cross-tissue identification of expression quantitative trait loci (eQTLs) improved TWAS power. Single-tissue TWAS (i.e., PrediXcan) had robust power to identify genes expressed in single tissues, but, often found significant associations in the wrong tissues as well (therefore had high false positive rates). Cross-tissue TWAS (i.e., UTMOST) had overall equal or greater power and controlled type I error rates for genes expressed in multiple tissues. Based on these simulation results, we applied a tissue specificity-aware TWAS (TSA-TWAS) analytic framework to look for gene-based associations with pre-treatment laboratory values from AIDS Clinical Trial Group (ACTG) studies. We replicated several proof-of-concept transcriptionally regulated gene-trait associations, including UGT1A1 (encoding bilirubin uridine diphosphate glucuronosyltransferase enzyme) and total bilirubin levels (p = 3.59×10−12), and CETP (cholesteryl ester transfer protein) with high-density lipoprotein cholesterol (p = 4.49×10−12). We also identified several novel genes associated with metabolic and virologic traits, as well as pleiotropic genes that linked plasma viral load, absolute basophil count, and/or triglyceride levels. By highlighting the advantages of different TWAS methods, our simulation study promotes a tissue specificity-aware TWAS analytic framework that revealed novel aspects of HIV-related traits.

Robustness of T-test Based on Skewness and Kurtosis

Journal of Advances in Mathematics and Computer Science ◽

10.9734/jamcs/2021/v36i230342 ◽

2021 ◽

pp. 102-110

Author(s):

Steven T. Garren ◽

Kate McGann Osborne

Keyword(s):

Degrees Of Freedom ◽

Type I Error ◽

Error Rates ◽

T Test ◽

Type I ◽

Skewed Distributions ◽

Symmetric Distributions ◽

Type I Error Rates ◽

Coverage Probabilities ◽

Skewness And Kurtosis

Coverage probabilities of the two-sided one-sample t-test are simulated for some symmetric and right-skewed distributions. The symmetric distributions analyzed are Normal, Uniform, Laplace, and student-t with 5, 7, and 10 degrees of freedom. The right-skewed distributions analyzed are Exponential and Chi-square with 1, 2, and 3 degrees of freedom. Left-skewed distributions were not analyzed without loss of generality. The coverage probabilities for the symmetric distributions tend to achieve or just barely exceed the nominal values. The coverage probabilities for the skewed distributions tend to be too low, indicating high Type I error rates. Percentiles for the skewness and kurtosis statistics are simulated using Normal data. For sample sizes of 5, 10, 15 and 20 the skewness statistic does an excellent job of detecting non-Normal data, except for Uniform data. The kurtosis statistic also does an excellent job of detecting non-Normal data, including Uniform data. Examined herein are Type I error rates, but not power calculations. We nd that sample skewness is unhelpful when determining whether or not the t-test should be used, but low sample kurtosis is reason to avoid using the t-test.

Type I error control for cluster randomized trials under varying small sample structures

BMC Medical Research Methodology ◽

10.1186/s12874-021-01236-7 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Joshua R. Nugent ◽

Ken P. Kleinman

Keyword(s):

Cluster Size ◽

Error Control ◽

Type I Error ◽

Error Rates ◽

Small Sample ◽

Type I ◽

Number Of Clusters ◽

Type I Error Rates ◽

Wald Tests ◽

Cluster Randomized

Abstract Background Linear mixed models (LMM) are a common approach to analyzing data from cluster randomized trials (CRTs). Inference on parameters can be performed via Wald tests or likelihood ratio tests (LRT), but both approaches may give incorrect Type I error rates in common finite sample settings. The impact of different combinations of cluster size, number of clusters, intraclass correlation coefficient (ICC), and analysis approach on Type I error rates has not been well studied. Reviews of published CRTs find that small sample sizes are not uncommon, so the performance of different inferential approaches in these settings can guide data analysts to the best choices. Methods Using a random-intercept LMM stucture, we use simulations to study Type I error rates with the LRT and Wald test with different degrees of freedom (DF) choices across different combinations of cluster size, number of clusters, and ICC. Results Our simulations show that the LRT can be anti-conservative when the ICC is large and the number of clusters is small, with the effect most pronouced when the cluster size is relatively large. Wald tests with the between-within DF method or the Satterthwaite DF approximation maintain Type I error control at the stated level, though they are conservative when the number of clusters, the cluster size, and the ICC are small. Conclusions Depending on the structure of the CRT, analysts should choose a hypothesis testing approach that will maintain the appropriate Type I error rate for their data. Wald tests with the Satterthwaite DF approximation work well in many circumstances, but in other cases the LRT may have Type I error rates closer to the nominal level.

type i error rates
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Correcting the Bias Correction for the Bootstrap Confidence Interval in Mediation Analysis

A robust mean and variance test with application to high-dimensional phenotypes

Cluster Wild Bootstrapping to Handle Dependent Effect Sizes in Meta-Analysis with a Small Number of Studies

Robust Chi-Square in Extreme and Boundary Conditions: Comments on Jak et al. (2021)

Bayesian Hodges-Lehmann tests for statistical equivalence in the two-sample setting: Power analysis, type I error rates and equivalence boundary selection in biomedical research

Consequences of ignoring clustering in linear regression

Directional-sum test for nonparametric Behrens-Fisher problem with applications to the dietary intervention trial

Tissue specificity-aware TWAS (TSA-TWAS) framework identifies novel associations with metabolic, immunologic, and virologic traits in HIV-positive adults

Robustness of T-test Based on Skewness and Kurtosis

Type I error control for cluster randomized trials under varying small sample structures

Export Citation Format

type i error ratesRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Correcting the Bias Correction for the Bootstrap Confidence Interval in Mediation Analysis

A robust mean and variance test with application to high-dimensional phenotypes

Cluster Wild Bootstrapping to Handle Dependent Effect Sizes in Meta-Analysis with a Small Number of Studies

Robust Chi-Square in Extreme and Boundary Conditions: Comments on Jak et al. (2021)

Bayesian Hodges-Lehmann tests for statistical equivalence in the two-sample setting: Power analysis, type I error rates and equivalence boundary selection in biomedical research

Consequences of ignoring clustering in linear regression

Directional-sum test for nonparametric Behrens-Fisher problem with applications to the dietary intervention trial

Tissue specificity-aware TWAS (TSA-TWAS) framework identifies novel associations with metabolic, immunologic, and virologic traits in HIV-positive adults

Robustness of T-test Based on Skewness and Kurtosis

Type I error control for cluster randomized trials under varying small sample structures

type i error rates
Recently Published Documents