scholarly journals Variance component score test for time-course gene set analysis of longitudinal RNA-seq data

Biostatistics ◽  
2017 ◽  
Vol 18 (4) ◽  
pp. 589-604 ◽  
Author(s):  
Denis Agniel ◽  
Boris P. Hejblum
2019 ◽  
Vol 17 (05) ◽  
pp. 1940010 ◽  
Author(s):  
Farhad Maleki ◽  
Katie L. Ovens ◽  
Daniel J. Hogan ◽  
Elham Rezaei ◽  
Alan M. Rosenberg ◽  
...  

Gene set analysis is a quantitative approach for generating biological insight from gene expression datasets. The abundance of gene set analysis methods speaks to their popularity, but raises the question of the extent to which results are affected by the choice of method. Our systematic analysis of 13 popular methods using 6 different datasets, from both DNA microarray and RNA-Seq origin, shows that this choice matters a great deal. We observed that the overall number of gene sets reported by each method differed by up to 2 orders of magnitude, and there was a bias toward reporting large gene sets with some methods. Furthermore, there was substantial disagreement between the 20 most statistically significant gene sets reported by the methods. This was also observed when expanding to the 100 most statistically significant reported gene sets. For different datasets of the same phenotype/condition, the top 20 and top 100 most significant results also showed little to no agreement even when using the same method. GAGE, PAGE, and ORA were the only methods able to achieve relatively high reproducibility when comparing the 20 and 100 most statistically significant gene sets. Biological validation on a juvenile idiopathic arthritis (JIA) dataset showed wide variation in terms of the relevance of the top 20 and top 100 most significant gene sets to known biology of the disease, where GAGE predicted the most relevant gene sets, followed by GSEA, ORA, and PAGE.


2015 ◽  
Vol 17 (3) ◽  
pp. 393-407 ◽  
Author(s):  
Yasir Rahmatallah ◽  
Frank Emmert-Streib ◽  
Galina Glazko

2014 ◽  
Vol 15 (1) ◽  
Author(s):  
Yasir Rahmatallah ◽  
Frank Emmert-Streib ◽  
Galina Glazko

2017 ◽  
Vol 10 (1) ◽  
Author(s):  
Xing Ren ◽  
Qiang Hu ◽  
Song Liu ◽  
Jianmin Wang ◽  
Jeffrey C. Miecznikowski

2013 ◽  
Vol 14 (1) ◽  
Author(s):  
Yen-Tsung Huang ◽  
Xihong Lin

Processes ◽  
2019 ◽  
Vol 7 (2) ◽  
pp. 79 ◽  
Author(s):  
Xiang Zhan

The Microbiome Regression-based Kernel Association Test (MiRKAT) is widely used in testing for the association between microbiome compositions and an outcome of interest. The MiRKAT statistic is derived as a variance-component score test in a kernel machine regression-based generalized linear mixed model. In this brief report, we show that the MiRKAT statistic is proportional to the R 2 (coefficient of determination) statistic in a similarity matrix regression, which characterizes the fraction of variability in outcome similarity, explained by microbiome similarity (up to a constant).


2013 ◽  
Vol 3 (1) ◽  
Author(s):  
Brooke L. Fridley ◽  
Gregory D. Jenkins ◽  
Diane E. Grill ◽  
Richard B. Kennedy ◽  
Gregory A. Poland ◽  
...  

2019 ◽  
Vol 21 (5) ◽  
pp. 1495-1508 ◽  
Author(s):  
Antonio Mora

Abstract Gene set analysis (GSA) is one of the methods of choice for analyzing the results of current omics studies; however, it has been mainly developed to analyze mRNA (microarray, RNA-Seq) data. The following review includes an update regarding general methods and resources for GSA and then emphasizes GSA methods and tools for non-mRNA omics datasets, specifically genomic range data (ChIP-Seq, SNP and methylation) and ncRNA data (miRNAs, lncRNAs and others). In the end, the state of the GSA field for non-mRNA datasets is discussed, and some current challenges and trends are highlighted, especially the use of network approaches to face complexity issues.


Sign in / Sign up

Export Citation Format

Share Document