On “Field Significance” and the False Discovery Rate

2006 ◽  
Vol 45 (9) ◽  
pp. 1181-1189 ◽  
Author(s):  
D. S. Wilks

Abstract: The conventional approach to evaluating the joint statistical significance of multiple hypothesis tests (i.e., “field,” or “global,” significance) in meteorology and climatology is to count the number of individual (or “local”) tests yielding nominally significant results and then to judge the unusualness of this integer value in the context of the distribution of such counts that would occur if all local null hypotheses were true. The sensitivity (i.e., statistical power) of this approach is potentially compromised both by the discrete nature of the test statistic and by the fact that the approach ignores the confidence with which locally significant tests reject their null hypotheses. An alternative global test statistic that has neither of these problems is the minimum p value among all of the local tests. Evaluation of field significance using the minimum local p value as the global test statistic, which is also known as the Walker test, has strong connections to the joint evaluation of multiple tests in a way that controls the “false discovery rate” (FDR, or the expected fraction of local null hypothesis rejections that are incorrect). In particular, using the minimum local p value to evaluate field significance at a level α_global is nearly equivalent to the slightly more powerful global test based on the FDR criterion. An additional advantage shared by Walker’s test and the FDR approach is that both are robust to spatial dependence within the field of tests. The FDR method not only provides a more broadly applicable and generally more powerful field significance test than the conventional counting procedure but also allows better identification of locations with significant differences, because fewer than α_global × 100% (on average) of apparently significant local tests will have resulted from local null hypotheses that are true.
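
To make the contrast concrete, here is a minimal Python sketch (assuming NumPy; the function names, grid size, and planted signal are illustrative assumptions, not from the paper) of the two global tests: Walker's minimum-p-value criterion and the Benjamini-Hochberg step-up rule that underlies the FDR approach.

```python
import numpy as np

def walker_test(pvals, alpha_global=0.05):
    """Field significance via the minimum local p value (Walker's test):
    reject the global null if the smallest of the K local p values falls
    below p_Walker = 1 - (1 - alpha_global)**(1/K)."""
    K = len(pvals)
    p_walker = 1.0 - (1.0 - alpha_global) ** (1.0 / K)
    return pvals.min() <= p_walker

def fdr_rejections(pvals, alpha_global=0.05):
    """Benjamini-Hochberg step-up rule: with sorted p values
    p_(1) <= ... <= p_(K), reject the tests with the i* smallest p values,
    where i* is the largest i such that p_(i) <= (i / K) * alpha_global."""
    K = len(pvals)
    order = np.argsort(pvals)
    passed = pvals[order] <= (np.arange(1, K + 1) / K) * alpha_global
    n_reject = passed.nonzero()[0].max() + 1 if passed.any() else 0
    reject = np.zeros(K, dtype=bool)
    reject[order[:n_reject]] = True
    return reject

rng = np.random.default_rng(0)
pvals = rng.uniform(size=1000)              # 1000 local tests, all nulls true...
pvals[:50] = rng.uniform(0, 1e-5, size=50)  # ...except 50 with strong signal
print("field significant (Walker):", walker_test(pvals))
print("local rejections (FDR):", int(fdr_rejections(pvals).sum()))
```

Note that Walker's test only answers the global question, while the FDR rule additionally flags which locations are significant, which is the practical advantage the abstract emphasizes.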

Author(s):  
Glenn Heller ◽  
Jing Qin

An objective of microarray data analysis is to identify gene expressions that are associated with a disease-related outcome. For each gene, a test statistic is computed to determine if an association exists, and this statistic generates a marginal p-value. In an effort to pool this information across genes, a p-value density function is derived. The p-value density is modeled as a mixture of a uniform (0,1) density and a scaled ratio of normal densities derived from the asymptotic normality of the test statistic. The p-values are assumed to be weakly dependent, and a quasi-likelihood is used to estimate the parameters in the mixture density. The quasi-likelihood and the weak dependence assumption enable estimation and asymptotic inference on the false discovery rate for a given rejection region, and on its inverse, the p-value threshold parameter for a fixed false discovery rate. A false discovery rate analysis on a localized prostate cancer data set is used to illustrate the methodology. Simulations are performed to assess the performance of this methodology.
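
A simplified sketch of the mixture idea (Python with SciPy assumed): here the mixture is fitted by ordinary maximum likelihood treating the p-values as independent, whereas the paper uses a quasi-likelihood under weak dependence; the one-sided normal alternative and all names are illustrative assumptions.

```python
import numpy as np
from scipy.stats import norm
from scipy.optimize import minimize

def alt_density(p, mu):
    """p-value density under a one-sided N(mu, 1) alternative: a ratio of
    normal densities evaluated at z(p) = Phi^{-1}(1 - p)."""
    z = norm.ppf(1.0 - p)
    return norm.pdf(z - mu) / norm.pdf(z)

def mixture_density(p, pi0, mu):
    # pi0 * Uniform(0,1) density + (1 - pi0) * alternative density
    return pi0 + (1.0 - pi0) * alt_density(p, mu)

def fit_mixture(pvals):
    nll = lambda theta: -np.log(mixture_density(pvals, *theta)).sum()
    res = minimize(nll, x0=[0.9, 2.0],
                   bounds=[(1e-3, 1.0 - 1e-3), (0.1, 10.0)])
    return res.x  # (pi0_hat, mu_hat)

def fdr_estimate(t, pi0, pvals):
    """Estimated FDR for the rejection region [0, t]: pi0 * t / F_hat(t),
    with F_hat the empirical CDF of the observed p-values."""
    return pi0 * t / max((pvals <= t).mean(), 1.0 / len(pvals))

rng = np.random.default_rng(1)
p_null = rng.uniform(size=9000)
p_alt = 1.0 - norm.cdf(rng.normal(2.5, 1.0, size=1000))  # one-sided p under H1
pvals = np.concatenate([p_null, p_alt])
pi0_hat, mu_hat = fit_mixture(pvals)
print(f"pi0 ~ {pi0_hat:.3f}, mu ~ {mu_hat:.3f}, "
      f"estimated FDR at t=0.01: {fdr_estimate(0.01, pi0_hat, pvals):.3f}")
```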


Genetics ◽  
2002 ◽  
Vol 161 (2) ◽  
pp. 905-914 ◽  
Author(s):  
Hakkyo Lee ◽  
Jack C M Dekkers ◽  
M Soller ◽  
Massoud Malek ◽  
Rohan L Fernando ◽  
...  

Abstract: Controlling the false discovery rate (FDR) has been proposed as an alternative to controlling the genomewise error rate (GWER) for detecting quantitative trait loci (QTL) in genome scans. The objective here was to implement FDR in the context of regression interval mapping for multiple traits. Data on five traits from an F2 swine breed cross were used. FDR was implemented using tests at every 1 cM (FDR1) and using tests with the highest test statistic for each marker interval (FDRm). For the latter, a method was developed to predict comparison-wise error rates. At low error rates, FDR1 behaved erratically; FDRm was more stable but gave similar significance thresholds and numbers of QTL detected. At the same error rate, methods to control FDR gave less stringent significance thresholds and detected more QTL than methods to control GWER. Although testing across traits had limited impact on FDR, single-trait testing was recommended because there is no theoretical reason to pool tests across traits for FDR. FDR based on FDRm was recommended for QTL detection in interval mapping because it provides significance tests that are meaningful, yet not overly stringent, such that a more complete picture of QTL is revealed.
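
The contrast between GWER and FDR thresholds is easy to demonstrate on a toy genome scan (Python with NumPy; the layout, the numbers, and the Šidák-style adjustment standing in for the paper's predicted comparison-wise error rates are all illustrative assumptions).

```python
import numpy as np

def bh_threshold(pvals, q=0.05):
    """Data-dependent significance threshold from the Benjamini-Hochberg
    step-up rule controlling FDR at level q."""
    K = len(pvals)
    sp = np.sort(pvals)
    passed = sp <= (np.arange(1, K + 1) / K) * q
    return sp[passed.nonzero()[0].max()] if passed.any() else 0.0

rng = np.random.default_rng(2)
# Toy scan: 100 marker intervals x 10 test positions at 1-cM spacing.
p_all = rng.uniform(size=(100, 10))
p_all[:5] *= 1e-4                       # five intervals harboring QTL

p_fdr1 = p_all.ravel()                  # FDR1: a test at every 1 cM
# FDRm: most significant test per interval, with a Sidak-style adjustment
# as a crude stand-in for the paper's predicted comparison-wise error rates.
p_fdrm = 1.0 - (1.0 - p_all.min(axis=1)) ** 10

print("GWER (Bonferroni) threshold:", 0.05 / p_fdr1.size)
print("FDR1 threshold:", bh_threshold(p_fdr1))
print("FDRm threshold:", bh_threshold(p_fdrm))
```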


1998 ◽  
Vol 21 (2) ◽  
pp. 228-235 ◽  
Author(s):  
Siu L. Chow

Entertaining diverse assumptions about empirical research, commentators give a wide range of verdicts on the NHSTP defence in Statistical Significance. The null-hypothesis significance-test procedure (NHSTP) is defended in a framework in which deductive and inductive rules are deployed in theory corroboration in the spirit of Popper's Conjectures and Refutations (1968b). The defensible hypothetico-deductive structure of the framework is used to make explicit the distinctions between (1) substantive and statistical hypotheses, (2) statistical alternative and conceptual alternative hypotheses, and (3) making statistical decisions and drawing theoretical conclusions. These distinctions make it easier to show that (1) H0 can be true, (2) the effect size is irrelevant to theory corroboration, and (3) “strong” hypotheses make no difference to NHSTP. Reservations about statistical power, meta-analysis, and the Bayesian approach are still warranted.


1996 ◽  
Vol 1 (1) ◽  
pp. 25-28 ◽  
Author(s):  
Martin A. Weinstock

Background: Accurate understanding of certain basic statistical terms and principles is key to critical appraisal of published literature. Objective: This review describes type I error, type II error, null hypothesis, p value, statistical significance, α, two-tailed and one-tailed tests, effect size, alternate hypothesis, statistical power, β, publication bias, confidence interval, standard error, and standard deviation, while including examples from reports of dermatologic studies. Conclusion: The application of the results of published studies to individual patients should be informed by an understanding of certain basic statistical concepts.
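
Several of these quantities are easiest to grasp by simulation. Here is a toy Monte Carlo (Python with SciPy; the sample size, effect size, and repetition count are arbitrary choices) that recovers the type I error rate, statistical power, and β of a two-sample t test.

```python
import numpy as np
from scipy.stats import ttest_ind

rng = np.random.default_rng(3)
alpha, n, effect, reps = 0.05, 25, 0.8, 2000

# Type I error: both groups drawn from the same distribution.
type1 = np.mean([ttest_ind(rng.normal(size=n), rng.normal(size=n)).pvalue < alpha
                 for _ in range(reps)])
# Power: group means differ by `effect` standard deviations.
power = np.mean([ttest_ind(rng.normal(size=n),
                           rng.normal(effect, 1, size=n)).pvalue < alpha
                 for _ in range(reps)])
print(f"empirical type I error ~ {type1:.3f} (alpha = {alpha})")
print(f"empirical power ~ {power:.3f}; type II error beta ~ {1 - power:.3f}")
```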


2021 ◽  
Author(s):  
Ye Yue ◽  
Yijuan Hu

Abstract: Background: Understanding whether and which microbes play a mediating role between an exposure and a disease outcome is essential for researchers to develop clinical interventions that treat the disease by modulating the microbes. Existing methods for mediation analysis of the microbiome are often limited to a global test of community-level mediation or to selection of mediating microbes without control of the false discovery rate (FDR). Further, while the null hypothesis of no mediation at each microbe is a composite null that consists of three types of null (no exposure-microbe association, no microbe-outcome association given the exposure, or neither), most existing methods for the global test, such as MedTest and MODIMA, treat the microbes as if they were all under the same type of null. Results: We propose a new approach based on inverse regression that regresses the (possibly transformed) relative abundance of each taxon on the exposure and the exposure-adjusted outcome to assess the exposure-taxon and taxon-outcome associations simultaneously. The association p-values are then used to test mediation at both the community and individual taxon levels. This approach fits nicely into our Linear Decomposition Model (LDM) framework, so the new method is implemented in the LDM and enjoys all of its features, i.e., allowing an arbitrary number of taxa to be tested, supporting continuous, discrete, or multivariate exposures and outcomes as well as adjustment for confounding covariates, accommodating clustered data, and offering analysis at the relative abundance or presence-absence scale. We refer to this new method as LDM-med. Using extensive simulations, we showed that LDM-med always controlled the type I error of the global test and had compelling power relative to existing methods; LDM-med always preserved the FDR when testing individual taxa and had much better sensitivity than alternative approaches. In contrast, MedTest and MODIMA had severely inflated type I error when different taxa were under different types of null. The flexibility of LDM-med for a variety of mediation analyses is illustrated by an application to a murine microbiome dataset, which identified a plausible mediator. Conclusions: Inverse regression coupled with the LDM is a strategy that performs well and is capable of handling mediation analysis in a wide variety of microbiome studies.
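
LDM-med itself is implemented in the authors' LDM package; the following Python toy (assuming SciPy) only illustrates the inverse-regression idea in miniature. The residualization, the max-p combination for the composite null, and all names here are simplifying assumptions, not the paper's actual procedure.

```python
import numpy as np
from scipy import stats

def mediation_pvalues(X, Y, M):
    """Toy inverse regression: for each taxon m, regress its abundance on
    the exposure X and on the exposure-adjusted outcome, and take the
    larger of the two p-values as a conservative test of the composite
    mediation null (a simplification of LDM-med's combination)."""
    Y_adj = Y - np.polyval(np.polyfit(X, Y, 1), X)    # residualize Y on X
    pvals = []
    for m in M.T:
        p_xm = stats.linregress(X, m).pvalue          # exposure -> taxon
        p_my = stats.linregress(Y_adj, m).pvalue      # taxon -> outcome | exposure
        pvals.append(max(p_xm, p_my))
    return np.array(pvals)

rng = np.random.default_rng(4)
n, n_taxa = 200, 50
X = rng.normal(size=n)
M = rng.normal(size=(n, n_taxa))
M[:, 0] += 0.8 * X                       # taxon 0 is affected by the exposure...
Y = 0.8 * M[:, 0] + rng.normal(size=n)   # ...and affects the outcome: a mediator
print(mediation_pvalues(X, Y, M)[:3])    # taxon 0 should stand out
```

The resulting per-taxon p-values could then be fed to any FDR procedure, which is the step where the control described in the abstract enters.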


2018 ◽  
Author(s):  
Uri Keich ◽  
Kaipo Tamura ◽  
William Stafford Noble

Abstract: Decoy database search with target-decoy competition (TDC) provides an intuitive, easy-to-implement method for estimating the false discovery rate (FDR) associated with spectrum identifications from shotgun proteomics data. However, the procedure can yield different results for a fixed dataset analyzed with different decoy databases, and this decoy-induced variability is particularly problematic for smaller FDR thresholds, smaller datasets, or smaller databases. In such cases, the nominal FDR might be 1% but the true proportion of false discoveries might be 10%. The averaged TDC (aTDC) protocol combats this problem by exploiting multiple independently shuffled decoy databases to provide an FDR estimate with reduced variability. We provide a tutorial introduction to aTDC, describe an improved variant of the protocol that offers increased statistical power, and discuss how to deploy aTDC in practice using the Crux software toolkit.
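
A simplified sketch of the estimation step (Python with NumPy): real TDC has each spectrum compete against a paired decoy, which this skips, and the published aTDC combines the decoy draws more carefully than the plain averaging shown here.

```python
import numpy as np

def tdc_accept(target_scores, decoy_scores, fdr=0.05):
    """Accept the largest set of top-scoring targets whose estimated FDR,
    (decoy wins above threshold + 1) / (target wins above threshold),
    stays at or below `fdr`."""
    t_sorted = np.sort(target_scores)[::-1]
    best = 0
    for i, t in enumerate(t_sorted, start=1):
        est = ((decoy_scores >= t).sum() + 1) / i
        if est <= fdr:
            best = i
    return t_sorted[:best]

def averaged_tdc(target_scores, decoy_score_sets, fdr=0.05):
    """aTDC-flavored idea (simplified): average the accepted-target count
    across several independent decoy draws to damp the decoy-induced
    variability of a single TDC run."""
    counts = [len(tdc_accept(target_scores, d, fdr)) for d in decoy_score_sets]
    return np.mean(counts)

rng = np.random.default_rng(5)
targets = np.concatenate([rng.normal(3, 1, 300),   # correct identifications
                          rng.normal(0, 1, 700)])  # incorrect identifications
decoy_sets = [rng.normal(0, 1, 1000) for _ in range(10)]
print("single TDC acceptances:", len(tdc_accept(targets, decoy_sets[0])))
print("averaged TDC acceptances (simplified):", averaged_tdc(targets, decoy_sets))
```

Running the single-decoy version against each decoy set separately makes the decoy-induced variability the abstract describes directly visible.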


Genes ◽  
2020 ◽  
Vol 11 (2) ◽  
pp. 167 ◽  
Author(s):  
Qingyang Zhang

The nonparanormal graphical model has emerged as an important tool for modeling dependency structure between variables because it accommodates non-Gaussian data while maintaining the good interpretability and computational convenience of Gaussian graphical models. In this paper, we consider the problem of detecting differential substructure between two nonparanormal graphical models with false discovery rate control. We construct a new statistic based on a truncated estimator of the unknown transformation functions, together with a bias-corrected sample covariance. Furthermore, we show that the new test statistic converges to the same distribution as its oracle counterpart does. Both synthetic data and real cancer genomic data are used to illustrate the promise of the new method. Our proposed testing framework is simple and scalable, facilitating its applications to large-scale data. The computational pipeline has been implemented in the R package DNetFinder, which is freely available through the Comprehensive R Archive Network.
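
A sketch of the two ingredients in miniature (Python with SciPy): the truncated (Winsorized) nonparanormal transform, and a naive Fisher-z screen for differential edges with Benjamini-Hochberg control. This is not the paper's bias-corrected statistic, and it tests marginal rather than conditional dependence; DNetFinder is the authors' actual implementation.

```python
import numpy as np
from scipy.stats import norm

def npn_transform(X):
    """Truncated nonparanormal transform: map each variable through its
    Winsorized empirical CDF and the standard normal quantile function,
    using the usual truncation level delta_n."""
    n = len(X)
    delta = 1.0 / (4.0 * n**0.25 * np.sqrt(np.pi * np.log(n)))
    ranks = X.argsort(axis=0).argsort(axis=0) + 1
    return norm.ppf(np.clip(ranks / n, delta, 1.0 - delta))

def differential_edges(X1, X2, q=0.1):
    """Naive differential-edge screen: Fisher-z test on nonparanormal
    correlations, with BH control at level q (not the paper's method)."""
    Z1, Z2 = npn_transform(X1), npn_transform(X2)
    n1, n2 = len(Z1), len(Z2)
    r1 = np.corrcoef(Z1, rowvar=False)
    r2 = np.corrcoef(Z2, rowvar=False)
    iu = np.triu_indices_from(r1, k=1)
    z = (np.arctanh(r1[iu]) - np.arctanh(r2[iu])) \
        / np.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    p = 2.0 * norm.sf(np.abs(z))
    order, m = np.argsort(p), len(p)
    passed = p[order] <= (np.arange(1, m + 1) / m) * q
    k = passed.nonzero()[0].max() + 1 if passed.any() else 0
    return [(iu[0][j], iu[1][j]) for j in order[:k]]

rng = np.random.default_rng(6)
cov2 = np.eye(3); cov2[0, 1] = cov2[1, 0] = 0.6   # edge (0,1) differs
X1 = np.exp(rng.multivariate_normal(np.zeros(3), np.eye(3), size=200))
X2 = np.exp(rng.multivariate_normal(np.zeros(3), cov2, size=200))
print("differential edges:", differential_edges(X1, X2))
```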


mSystems ◽  
2017 ◽  
Vol 2 (6) ◽  
Author(s):  
Lingjing Jiang ◽  
Amnon Amir ◽  
James T. Morton ◽  
Ruth Heller ◽  
Ery Arias-Castro ◽  
...  

Abstract: Differential abundance testing is a critical task in microbiome studies that is complicated by the sparsity of data matrices. Here we adapt for microbiome studies a solution from the field of gene expression analysis to produce a new method, discrete false-discovery rate (DS-FDR), that greatly improves the power to detect differential taxa by exploiting the discreteness of the data. Additionally, DS-FDR is relatively robust to the number of noninformative features, and thus removes the problem of filtering taxonomy tables by an arbitrary abundance threshold. We show by using a combination of simulations and reanalysis of nine real-world microbiome data sets that this new method outperforms existing methods at the differential abundance testing task, producing a false-discovery rate that is up to threefold more accurate, and halves the number of samples required to find a given difference (thus increasing the efficiency of microbiome experiments considerably). We therefore expect DS-FDR to be widely applied in microbiome studies. Importance: DS-FDR can achieve higher statistical power to detect significant findings in sparse and noisy microbiome data compared to the commonly used Benjamini-Hochberg procedure and other FDR-controlling procedures.
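
A permutation-based sketch in the spirit of DS-FDR (Python with NumPy; the statistic, the threshold search, and the data are illustrative, and details differ from the published method): the key move is to calibrate the rejection threshold on the test-statistic scale against the permutation null, rather than first converting to p-values, whose discreteness wastes power on sparse data.

```python
import numpy as np

def ds_fdr_sketch(data, labels, n_perm=200, alpha=0.1, rng=None):
    """Pick the smallest test-statistic threshold whose estimated FDR
    (mean permutation discoveries / observed discoveries) is <= alpha,
    then reject every taxon scoring at or above it."""
    rng = rng or np.random.default_rng()
    stat = lambda lab: np.abs(data[:, lab == 0].mean(axis=1)
                              - data[:, lab == 1].mean(axis=1))
    obs = stat(labels)                                 # per-taxon statistic
    null = np.array([stat(rng.permutation(labels)) for _ in range(n_perm)])
    passing = [t for t in np.sort(obs)
               if (null >= t).sum(axis=1).mean()
               / max((obs >= t).sum(), 1) <= alpha]
    if not passing:
        return np.array([], dtype=int)
    return np.where(obs >= min(passing))[0]

rng = np.random.default_rng(7)
data = rng.poisson(0.3, size=(200, 40)).astype(float)      # sparse counts
labels = np.repeat([0, 1], 20)
data[:10, labels == 1] += rng.poisson(2.0, size=(10, 20))  # 10 true signals
print("discovered taxa:", ds_fdr_sketch(data, labels, rng=rng))
```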

