Random rotation for identifying differentially expressed genes with linear models following batch effect correction

Bioinformatics ◽

10.1093/bioinformatics/btab063 ◽

2021 ◽

Author(s):

Peter Hettegger ◽

Klemens Vierlinger ◽

Andreas Weinhaeusel

Keyword(s):

Data Analysis ◽

Supplementary Information ◽

Dependence Structure ◽

Test Statistics ◽

False Discovery Rates ◽

P Values ◽

Null Distributions ◽

False Discovery ◽

Random Rotation ◽

Discovery Rates

Abstract Motivation Data generated from high-throughput technologies such as sequencing, microarray and bead-chip technologies are unavoidably affected by batch effects (BEs). Large effort has been put into developing methods for correcting these effects. Often, BE correction and hypothesis testing cannot be done with one single model, but are done successively with separate models in data analysis pipelines. This potentially leads to biased P-values or false discovery rates due to the influence of BE correction on the data. Results We present a novel approach for estimating null distributions of test statistics in data analysis pipelines where BE correction is followed by linear model analysis. The approach is based on generating simulated datasets by random rotation and thereby retains the dependence structure of genes adequately. This allows estimating null distributions of dependent test statistics, and thus the calculation of resampling-based P-values and false-discovery rates following BE correction while maintaining the alpha level. Availability The described methods are implemented as randRotation package on Bioconductor: https://bioconductor.org/packages/randRotation/ Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Simple estimators of false discovery rates given as few as one or two p-values without strong parametric assumptions

Statistical Applications in Genetics and Molecular Biology ◽

10.1515/sagmb-2013-0003 ◽

2013 ◽

Vol 12 (4) ◽

Cited By ~ 7

Author(s):

David R. Bickel

Keyword(s):

False Discovery Rates ◽

P Values ◽

False Discovery ◽

Discovery Rates

Download Full-text

A Comparison of Two Classes of Methods for Estimating False Discovery Rates in Microarray Studies

Scientifica ◽

10.6064/2012/519394 ◽

2012 ◽

Vol 2012 ◽

pp. 1-9 ◽

Cited By ~ 2

Author(s):

Emily Hansen ◽

Kathleen F. Kerr

Keyword(s):

Differentially Expressed Genes ◽

Null Distribution ◽

Differentially Expressed ◽

Test Statistics ◽

False Discovery Rates ◽

Model Method ◽

False Discovery ◽

Microarray Studies ◽

Discovery Rates

The goal of many microarray studies is to identify genes that are differentially expressed between two classes or populations. Many data analysts choose to estimate the false discovery rate (FDR) associated with the list of genes declared differentially expressed. Estimating an FDR largely reduces to estimatingπ1, the proportion of differentially expressed genes among all analyzed genes. Estimatingπ1is usually done throughP-values, but computingP-values can be viewed as a nuisance and potentially problematic step. We evaluated methods for estimatingπ1directly from test statistics, circumventing the need to computeP-values. We adapted existing methodology for estimatingπ1fromt- andz-statistics so thatπ1could be estimated from other statistics. We compared the quality of these estimates to estimates generated by two established methods for estimatingπ1fromP-values. Overall, methods varied widely in bias and variability. The least biased and least variable estimates ofπ1, the proportion of differentially expressed genes, were produced by applying the “convest” mixture model method toP-values computed from a pooled permutation null distribution. Estimates computed directly from test statistics rather thanP-values did not reliably perform well.

Download Full-text

ipDMR: identification of differentially methylated regions with interval P-values

Bioinformatics ◽

10.1093/bioinformatics/btaa732 ◽

2020 ◽

Author(s):

Zongli Xu ◽

Changchun Xie ◽

Jack A Taylor ◽

Liang Niu

Keyword(s):

Software Tool ◽

Real Data ◽

Supplementary Information ◽

Sequencing Data ◽

Differentially Methylated Regions ◽

R Software ◽

False Discovery Rates ◽

P Values ◽

False Discovery ◽

Bisulfite Sequencing Data

Abstract Summary ipDMR is an R software tool for identification of differentially methylated regions (DMRs) using auto-correlated P-values for individual CpGs from epigenome-wide association analysis using array or bisulfite sequencing data. It summarizes P-values for adjacent CpGs, identifies association peaks and then extends peaks to find boundaries of DMRs. ipDMR uses BED format files as input and is easy to use. Simulations guided by real data found that ipDMR outperformed current available methods and provided slightly higher true positive rates and much lower false discovery rates. Availability and implementation ipDMR is available at https://bioconductor.org/packages/release/bioc/html/ENmix.html. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Corrigendum to: Simple estimators of false discovery rates given as few as one or two p-values without strong parametric assumptions

Statistical Applications in Genetics and Molecular Biology ◽

10.1515/sagmb-2014-0100 ◽

2015 ◽

Vol 14 (2) ◽

Cited By ~ 1

Author(s):

David R. Bickel

Keyword(s):

False Discovery Rates ◽

P Values ◽

False Discovery ◽

Discovery Rates

Download Full-text

FDRestimation: Flexible False Discovery Rate Computation in R

F1000Research ◽

10.12688/f1000research.52999.2 ◽

2021 ◽

Vol 10 ◽

pp. 441

Author(s):

Megan H. Murray ◽

Jeffrey D. Blume

Keyword(s):

False Discovery Rate ◽

Estimation Procedure ◽

False Discovery Rates ◽

P Values ◽

False Discovery ◽

Software Packages ◽

Broad Array ◽

Potential Impact ◽

User Friendly ◽

Discovery Rates

False discovery rates (FDR) are an essential component of statistical inference, representing the propensity for an observed result to be mistaken. FDR estimates should accompany observed results to help the user contextualize the relevance and potential impact of findings. This paper introduces a new user-friendly R pack-age for estimating FDRs and computing adjusted p-values for FDR control. The roles of these two quantities are often confused in practice and some software packages even report the adjusted p-values as the estimated FDRs. A key contribution of this package is that it distinguishes between these two quantities while also offering a broad array of refined algorithms for estimating them. For example, included are newly augmented methods for estimating the null proportion of findings - an important part of the FDR estimation procedure. The package is broad, encompassing a variety of adjustment methods for FDR estimation and FDR control, and includes plotting functions for easy display of results. Through extensive illustrations, we strongly encourage wider reporting of false discovery rates for observed findings.

Download Full-text

FDRestimation: Flexible False Discovery Rate Computation in R

F1000Research ◽

10.12688/f1000research.52999.1 ◽

2021 ◽

Vol 10 ◽

pp. 441

Author(s):

Megan H. Murray ◽

Jeffrey D. Blume

Keyword(s):

False Discovery Rate ◽

Estimation Procedure ◽

False Discovery Rates ◽

P Values ◽

False Discovery ◽

Software Packages ◽

Broad Array ◽

Potential Impact ◽

User Friendly ◽

Discovery Rates

Download Full-text

Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis

BMC Genomics ◽

10.1186/1471-2164-14-s8-s8 ◽

2013 ◽

Vol 14 (Suppl 8) ◽

pp. S8 ◽

Cited By ~ 1

Author(s):

Arindom Chakraborty ◽

Guanglong Jiang ◽

Malaz Boustani ◽

Yunlong Liu ◽

Todd Skaar ◽

...

Keyword(s):

Data Analysis ◽

Empirical Bayes ◽

False Discovery Rates ◽

Bayes Methods ◽

False Discovery ◽

Empirical Bayes Methods ◽

Discovery Rates ◽

Simultaneous Inferences

Download Full-text

False discovery rates and copy number variation

Biometrika ◽

10.1093/biomet/asr018 ◽

2011 ◽

Vol 98 (2) ◽

pp. 251-271 ◽

Cited By ~ 11

Author(s):

Bradley Efron ◽

Nancy R. Zhang

Keyword(s):

Copy Number Variation ◽

Copy Number ◽

False Discovery Rates ◽

False Discovery ◽

Number Variation ◽

Discovery Rates

Download Full-text

Signal identification for rare and weak features: higher criticism or false discovery rates?

Biostatistics ◽

10.1093/biostatistics/kxs030 ◽

2012 ◽

Vol 14 (1) ◽

pp. 129-143 ◽

Cited By ~ 14

Author(s):

Bernd Klaus ◽

Korbinian Strimmer

Keyword(s):

Signal Identification ◽

False Discovery Rates ◽

Higher Criticism ◽

False Discovery ◽

Discovery Rates

Download Full-text

False discovery rates: a new deal

Biostatistics ◽

10.1093/biostatistics/kxw041 ◽

2016 ◽

pp. kxw041 ◽

Cited By ~ 66

Author(s):

Matthew Stephens

Keyword(s):

New Deal ◽

False Discovery Rates ◽

False Discovery ◽

Discovery Rates

Download Full-text