scholarly journals Statistical Assessment of Depth Normalization for Small RNA Sequencing

2020 ◽  
pp. 567-582 ◽  
Author(s):  
Li-Xuan Qin ◽  
Jian Zou ◽  
Jiejun Shi ◽  
Ann Lee ◽  
Aleksandra Mihailovic ◽  
...  

PURPOSE Methods for depth normalization have been assessed primarily with simulated data or cell-line–mixture data. There is a pressing need for benchmark data enabling a more realistic and objective assessment, especially in the context of small RNA sequencing. METHODS We collected a unique pair of microRNA sequencing data sets for the same set of tumor samples; one data set was collected with and the other without uniform handling and balanced design. The former provided a benchmark for evaluating evidence of differential expression and the latter served as a test bed for normalization. Next, we developed a data perturbation algorithm to simulate additional data set pairs. Last, we assembled a set of computational tools to visualize and quantify the assessment. RESULTS We validated the quality of the benchmark data and showed the need for normalization of the test data. For illustration, we applied the data and tools to assess the performance of 9 existing normalization methods. Among them, trimmed mean of M-values was a better scaling method, whereas the median and the upper quartiles were consistently the worst performers; one variation of remove unwanted variation had the best chance of capturing true positives but at the cost of increased false positives. In general, these methods were, at best, moderately helpful when the level of differential expression was extensive and asymmetric. CONCLUSION Our study (1) provides the much-needed benchmark data and computational tools for assessing depth normalization, (2) shows the dependence of normalization performance on the underlying pattern of differential expression, and (3) calls for continued research efforts to develop more effective normalization methods.

2019 ◽  
Vol 47 (W1) ◽  
pp. W530-W535 ◽  
Author(s):  
Ernesto Aparicio-Puerta ◽  
Ricardo Lebrón ◽  
Antonio Rueda ◽  
Cristina Gómez-Martín ◽  
Stavros Giannoukakos ◽  
...  

Abstract Since the original publication of sRNAtoolbox in 2015, small RNA research experienced notable advances in different directions. New protocols for small RNA sequencing have become available to address important issues such as adapter ligation bias, PCR amplification artefacts or to include internal controls such as spike-in sequences. New microRNA reference databases were developed with different foci, either prioritizing accuracy (low number of false positives) or completeness (low number of false negatives). Additionally, other small RNA molecules as well as microRNA sequence and length variants (isomiRs) have continued to gain importance. Finally, the number of microRNA sequencing studies deposited in GEO nearly triplicated from 2014 (280) to 2018 (764). These developments imply that fast and easy-to-use tools for expression profiling and subsequent downstream analysis of miRNA-seq data are essential to many researchers. Key features in this sRNAtoolbox release include addition of all major RNA library preparation protocols to sRNAbench and improvements in sRNAde, a tool that summarizes several aspects of small RNA sequencing studies including the detection of consensus differential expression. A special emphasis was put on the user-friendliness of the tools, for instance sRNAbench now supports parallel launching of several jobs to improve reproducibility and user time efficiency.


RNA ◽  
2014 ◽  
Vol 21 (2) ◽  
pp. 164-171 ◽  
Author(s):  
Joshua D. Campbell ◽  
Gang Liu ◽  
Lingqi Luo ◽  
Ji Xiao ◽  
Joseph Gerrein ◽  
...  

2019 ◽  
Author(s):  
Jiang Li ◽  
Alvin T. Kho ◽  
Robert P. Chase ◽  
Lorena Pantano-Rubino ◽  
Leanna Farnam ◽  
...  

Abstract Background Circulating RNAs are potential disease biomarkers and their function is being actively investigated. Next generation sequencing (NGS) is a common means to interrogate the small RNA'ome or the full spectrum of small RNAs (<200 nucleotide length) of a biological system. A pivotal problem in NGS based small RNA analysis is identifying and quantifying the small RNA'ome constituent components. Most existing NGS data analysis tools focus on the microRNA component and a few other small RNA types like piRNA, snRNA and snoRNA. A comprehensive platform is needed to interrogate the full small RNA'ome, a prerequisite for down-stream data analysis. Results We present COMPASS, a comprehensive modular stand-alone platform for identifying and quantifying small RNAs from small RNA sequencing data. COMPASS contains prebuilt customizable standard RNA databases and sequence processing tools to enable turnkey basic small RNA analysis. We evaluated COMPASS against comparable existing tools on small RNA sequencing data set from serum samples of 12 healthy human controls, and COMPASS identified a greater diversity and abundance of small RNA molecules. Conclusion COMPASS is modular, stand-alone and integrates multiple customizable RNA databases and sequence processing tool and is distributed under the GNU General Public License free to non-commercial registered users at https://regepi.bwh.harvard.edu/circurna/ and the source code is available at https://github.com/cougarlj/COMPASS.


2019 ◽  
Author(s):  
Jiang Li ◽  
Alvin T. Kho ◽  
Robert P. Chase ◽  
Lorena Pantano-Rubino ◽  
Leanna Farnam ◽  
...  

AbstractBackgroundCirculating RNAs are potential disease biomarkers and their function is being actively investigated. Next generation sequencing (NGS) is a common means to interrogate the small RNA’ome or the full spectrum of small RNAs (<200 nucleotide length) of a biological system. A pivotal problem in NGS based small RNA analysis is identifying and quantifying the small RNA’ome constituent components. Most existing NGS data analysis tools focus on the microRNA component and a few other small RNA types like piRNA, snRNA and snoRNA. A comprehensive platform is needed to interrogate the full small RNA’ome, a prerequisite for down-stream data analysis.ResultsWe present COMPASS, a comprehensive modular stand-alone platform for identifying and quantifying small RNAs from small RNA sequencing data. COMPASS contains prebuilt customizable standard RNA databases and sequence processing tools to enable turnkey basic small RNA analysis. We evaluated COMPASS against comparable existing tools on small RNA sequencing data set from serum samples of 12 healthy human controls, and COMPASS identified a greater diversity and abundance of small RNA molecules.ConclusionCOMPASS is modular, stand-alone and integrates multiple customizable RNA databases and sequence processing tool and is distributed under the GNU General Public License free to non-commercial registered users at https://regepi.bwh.harvard.edu/circurna/ and the source code is available at https://github.com/cougarlj/COMPASS.


2020 ◽  
pp. 109158182096151
Author(s):  
Jennifer C. Shing ◽  
Kai Schaefer ◽  
Shaun E. Grosskurth ◽  
Andy H. Vo ◽  
Tatiana Sharapova ◽  
...  

Predictive indicators of testicular toxicity could improve drug development by allowing early in-life screening for this adverse effect before it becomes severe. We hypothesized that circulating microRNAs (miRNAs) could serve as testicular toxicity biomarkers in dogs. Herein, we describe the results of an exploratory study conducted to discover biomarkers of drug-induced testicular injury. Following a dose-selection study using the testicular toxicant ethylene glycol monomethyl ether (EGME), we chose a dose of 50 mg/kg/d EGME to avoid systemic toxicity and treated 2 groups of dogs (castrated, non-castrated) for 14 to 28 days. Castrated animals were used as negative controls to identify biomarkers specific for testicular toxicity because EGME can cause toxicity to organ systems in addition to the testis. Blood was collected daily during the dosing period, followed by recovery for 29 to 43 days with less frequent sampling. Dosing was well tolerated, resulting in mild-to-moderate degeneration in testes and epididymides. Global profiling of serum miRNAs at selected dosing and recovery time points was completed by small RNA sequencing. Bioinformatics data analysis using linear modeling demonstrated several circulating miRNAs that were differentially abundant during the dosing period compared with baseline and/or castrated control samples. Confirmatory reverse transcription quantitative polymerase chain reaction data in these animals was unable to detect sustained alterations of miRNAs in serum, except for 1 potential candidate cfa-miR-146b. Taken together, we report the results of a comprehensive exploratory study and suggest future directions for follow-up research to address the challenge of developing diagnostic biomarkers of testicular toxicity.


Diagnostics ◽  
2021 ◽  
Vol 11 (6) ◽  
pp. 964
Author(s):  
Sarka Benesova ◽  
Mikael Kubista ◽  
Lukas Valihrach

MicroRNAs (miRNAs) are a class of small RNA molecules that have an important regulatory role in multiple physiological and pathological processes. Their disease-specific profiles and presence in biofluids are properties that enable miRNAs to be employed as non-invasive biomarkers. In the past decades, several methods have been developed for miRNA analysis, including small RNA sequencing (RNA-seq). Small RNA-seq enables genome-wide profiling and analysis of known, as well as novel, miRNA variants. Moreover, its high sensitivity allows for profiling of low input samples such as liquid biopsies, which have now found applications in diagnostics and prognostics. Still, due to technical bias and the limited ability to capture the true miRNA representation, its potential remains unfulfilled. The introduction of many new small RNA-seq approaches that tried to minimize this bias, has led to the existence of the many small RNA-seq protocols seen today. Here, we review all current approaches to cDNA library construction used during the small RNA-seq workflow, with particular focus on their implementation in commercially available protocols. We provide an overview of each protocol and discuss their applicability. We also review recent benchmarking studies comparing each protocol’s performance and summarize the major conclusions that can be gathered from their usage. The result documents variable performance of the protocols and highlights their different applications in miRNA research. Taken together, our review provides a comprehensive overview of all the current small RNA-seq approaches, summarizes their strengths and weaknesses, and provides guidelines for their applications in miRNA research.


2013 ◽  
Vol 12 (11) ◽  
pp. 2036-2044 ◽  
Author(s):  
Dong-qing SHI ◽  
Yuan ZHANG ◽  
Jin-hu MA ◽  
Yu-long LI ◽  
Jin XU

Sign in / Sign up

Export Citation Format

Share Document