scholarly journals The accuracy of absolute differential abundance analysis from relative count data

2021 ◽  
Author(s):  
Kimberly E. Roche ◽  
Sayan Mukherjee

AbstractConcerns have been raised about the use of relative abundance data derived from next generation sequencing as a proxy for absolute abundances. In the differential abundance setting compositional effects are hypothesized to contribute to increased rates of spurious differences (false positives). However in practice, partial reconstruction of total abundance can be imputed through renormalization of observed per-sample abundance. Given the renormalized data differential abundance need not be called on relative counts themselves but on estimates of absolute counts. We use simulated data to explore the consistency of differential abundance calls made on these adjusted relative abundances and find that while overall rates of false positive calls are low substantial error is possible. Conditions consistent with microbial community profiling are the most at risk of error induced by compositional effects. Increasing complexity of composition (i.e. increasing feature number) is generally protective against this effect. In real data sets drawn from 16S metabarcoding, expression array, bulk RNA-seq, and single-cell RNA-seq experiments, results are similar: though median accuracy is high, microbial community profiling and single-cell transcriptomic data sets can have poor outcomes. However, we show that problematic data sets can often be identified by summary characteristics of their relative abundances alone, giving researchers a means of anticipating problems and adjusting analysis strategies where appropriate.

2019 ◽  
Author(s):  
Marcus Alvarez ◽  
Elior Rahmani ◽  
Brandon Jew ◽  
Kristina M. Garske ◽  
Zong Miao ◽  
...  

AbstractSingle-nucleus RNA sequencing (snRNA-seq) measures gene expression in individual nuclei instead of cells, allowing for unbiased cell type characterization in solid tissues. Contrary to single-cell RNA seq (scRNA-seq), we observe that snRNA-seq is commonly subject to contamination by high amounts of extranuclear background RNA, which can lead to identification of spurious cell types in downstream clustering analyses if overlooked. We present a novel approach to remove debris-contaminated droplets in snRNA-seq experiments, called Debris Identification using Expectation Maximization (DIEM). Our likelihood-based approach models the gene expression distribution of debris and cell types, which are estimated using EM. We evaluated DIEM using three snRNA-seq data sets: 1) human differentiating preadipocytes in vitro, 2) fresh mouse brain tissue, and 3) human frozen adipose tissue (AT) from six individuals. All three data sets showed various degrees of extranuclear RNA contamination. We observed that existing methods fail to account for contaminated droplets and led to spurious cell types. When compared to filtering using these state of the art methods, DIEM better removed droplets containing high levels of extranuclear RNA and led to higher quality clusters. Although DIEM was designed for snRNA-seq data, we also successfully applied DIEM to single-cell data. To conclude, our novel method DIEM removes debris-contaminated droplets from single-cell-based data fast and effectively, leading to cleaner downstream analysis. Our code is freely available for use at https://github.com/marcalva/diem.


Microbiology ◽  
2018 ◽  
Vol 87 (1) ◽  
pp. 143-146 ◽  
Author(s):  
A. V. Mangrola ◽  
P. R. Dudhagara ◽  
P. G. Koringa ◽  
C. G. Joshi ◽  
R. K. Patel

2009 ◽  
Vol 58 (1) ◽  
pp. 199-211 ◽  
Author(s):  
Scott M. Geib ◽  
Maria del Mar Jimenez-Gasco ◽  
John E. Carlson ◽  
Ming Tien ◽  
Randa Jabbour ◽  
...  

2012 ◽  
Vol 9 (8) ◽  
pp. 811-814 ◽  
Author(s):  
Nicola Segata ◽  
Levi Waldron ◽  
Annalisa Ballarini ◽  
Vagheesh Narasimhan ◽  
Olivier Jousson ◽  
...  

2015 ◽  
Vol 119 ◽  
pp. 239-242 ◽  
Author(s):  
Angela Glassing ◽  
Scot E. Dowd ◽  
Susan Galandiuk ◽  
Brian Davis ◽  
Jeffrey R. Jorden ◽  
...  

2016 ◽  
Vol 6 (2) ◽  
pp. e00423 ◽  
Author(s):  
Dongping Wang ◽  
Hakim Boukhalfa ◽  
Oana Marina ◽  
Doug S. Ware ◽  
Tim J. Goering ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document