scholarly journals multiGSEA: a GSEA-based pathway enrichment analysis for multi-omics data

2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Sebastian Canzler ◽  
Jörg Hackermüller

Abstract Background Gaining biological insights into molecular responses to treatments or diseases from omics data can be accomplished by gene set or pathway enrichment methods. A plethora of different tools and algorithms have been developed so far. Among those, the gene set enrichment analysis (GSEA) proved to control both type I and II errors well. In recent years the call for a combined analysis of multiple omics layers became prominent, giving rise to a few multi-omics enrichment tools. Each of these has its own drawbacks and restrictions regarding its universal application. Results Here, we present the package aiding to calculate a combined GSEA-based pathway enrichment on multiple omics layers. The package queries 8 different pathway databases and relies on the robust GSEA algorithm for a single-omics enrichment analysis. In a final step, those scores will be combined to create a robust composite multi-omics pathway enrichment measure. supports 11 different organisms and includes a comprehensive mapping of transcripts, proteins, and metabolite IDs. Conclusions With we introduce a highly versatile tool for multi-omics pathway integration that minimizes previous restrictions in terms of omics layer selection, pathway database availability, organism selection and the mapping of omics feature identifiers. is publicly available under the GPL-3 license at https://github.com/yigbt/multiGSEA and at bioconductor: https://bioconductor.org/packages/multiGSEA.

Author(s):  
Sebastian Canzler ◽  
Jörg Hackermüller

AbstractGaining biological insights into molecular responses to treatments or diseases from omics data can be accomplished by gene set or pathway enrichment methods. A plethora of different tools and algorithms have been developed so far. Among those, the gene set enrichment analysis (GSEA) proved to control both type I and II errors well.In recent years the call for a combined analysis of multiple omics layer became prominent, giving rise to a few multi-omics enrichment tools. Each of which has its own drawbacks and restrictions regarding its universal application.Here, we present the multiGSEA package aiding to calculate a combined GSEA-based pathway enrichment on multiple omics layer. The package queries 8 different pathway databases and relies on the robust GSEA algorithm for a single-omics enrichment analysis. In a final step, those scores will be combined to create a robust composite multi-omics pathway enrichment measure. multiGSEA supports 11 different organisms and includes a comprehensive mapping of transcripts, proteins, and metabolite IDs. It is publicly available under the GPL-3 license at https://github.com/yigbt/multiGSEA and at Bioconductor: https://bioconductor.org/packages/multiGSEA.


2019 ◽  
Author(s):  
Rani K. Powers ◽  
Anthony Sun ◽  
James C. Costello

AbstractSummaryGSEA-InContext Explorer is a Shiny app that allows users to perform two methods of gene set enrichment analysis (GSEA). The first, GSEAPreranked, applies the GSEA algorithm in which statistical significance is estimated from a null distribution of enrichment scores generated for randomly permuted gene sets. The second, GSEA-InContext, incorporates a user-defined set of background experiments to define the null distribution and calculate statistical significance. GSEA-InContext Explorer allows the user to build custom background sets from a compendium of over 5,700 curated experiments, run both GSEAPreranked and GSEA-InContext on their own uploaded experiment, and explore the results using an interactive interface. This tool will allow researchers to visualize gene sets that are commonly enriched across experiments and identify gene sets that are uniquely significant in their experiment, thus complementing current methods for interpreting gene set enrichment results.Availability and implementationThe code for GSEA-InContext Explorer is available at: https://github.com/CostelloLab/GSEA-InContext_Explorer and the interactive tool is at: http://gsea-incontext_explorer.ngrok.io


2018 ◽  
Author(s):  
Danyue Dong ◽  
Tian Yuan ◽  
Shijie C. Zheng ◽  
Andrew E. Teschendorff

AbstractMotivationThe biological interpretation of differentially methylated sites derived from Epigenome-Wide-Association Studies remains a significant challenge. Gene Set Enrichment Analysis (GSEA) is a general tool to help aid biological interpretation, yet its correct and unbiased implementation in the EWAS context is difficult due to the differential probe representation of Illumina Infinium DNA methylation beadchips.ResultsWe present a novel GSEA method, called ebayGSEA, which ranks genes, not CpGs, according to the overall level of differential methylation, as assessed using all the probes mapping to the given gene. Applied on simulated and real EWAS data, we show how ebayGSEA may exhibit higher sensitivity and specificity than the current state-of-the-art, whilst also avoiding differential probe representation bias. Thus, ebayGSEA will be a useful additional tool to aid the interpretation of EWAS data.Availability and implementationebayGSEA is available from https://github.com/aet21/ebayGSEA, and has been incorporated into the ChAMP Bioconductor package (https://www.bioconductor.org).


2018 ◽  
Author(s):  
Aldo Acevedo ◽  
Claudio Durán ◽  
Sara Ciucci ◽  
Mathias Gerl ◽  
Carlo Vittorio Cannistraci

AbstractMotivationAnalyzing associations among multiple omic variables to infer mechanisms that meaningfully link them is a crucial step in systems biology. Gene Set Enrichment Analysis (GSEA) was conceived to pursue this aim in computational genomics, unveiling significant pathways associated to certain gene signatures under investigation. Lipidomics is a rapidly growing omic field, and absolute quantification of lipid abundance by shotgun mass spectrometry is generating high-throughput datasets that depict lipid metabolism in a plethora of conditions and organisms. In addition, high-throughput lipidomics represents a new important ally to develop personalized medicine approaches, investigate the causes and predict effective biomarkers in metabolic diseases, and not only.ResultsHere, we present Lipid Pathway Enrichment Analysis (LIPEA), a web-tool for over-representation analysis of lipid signatures and detection of the biological pathways in which they are enriched. LIPEA is a new valid resource for biologists and physicians to mine pathways significantly associated to a set of lipids, helping them to discover whether common and collective mechanisms are hidden behind those lipids. LIPEA was extensively tested and we provide two examples where our system gave successfully results related with Major Depression Disease (MDD) and insulin re-sistance.AvailabilityThe tool is available as web platform at https://lipea.biotec.tu-dresden.de.


2019 ◽  
Author(s):  
Tao Fang ◽  
Iakov Davydov ◽  
Daniel Marbach ◽  
Jitao David Zhang

AbstractMotivationCanonical methods for gene-set enrichment analysis assume independence between gene-sets. In practice, heterogeneous gene-sets from diverse sources are frequently combined and used, resulting in gene-sets with overlapping genes. They compromise statistical modelling and complicate interpretation of results.ResultsWe rephrase gene-set enrichment as a regression problem. Given some genes of interest (e.g.a list of hits from an experiment) and gene-sets (e.g.functional annotations or pathways), we aim to identify a sparse list of gene-sets for the genes of interest. In a regression framework, this amounts to identifying a minimum set of gene-sets that optimally predicts whether any gene belongs to the given genes of interest. To accommodate redundancy between gene-sets, we propose regularized regression techniques such as theelastic net.We report that regression-based results are consistent with established gene-set enrichment methods but more parsimonious and interpretable.AvailabilityWe implement the model ingerr(gene-set enrichment with regularized regression), an R package freely available athttps://github.com/TaoDFang/gerrand submitted toBioconductor.Code and data required to reproduce the results of this study are available athttps://github.com/TaoDFang/GeneModuleAnnotationPaper.ContactJitao David Zhang ([email protected]), Roche Pharma Research and Early Development, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd. Grenzacherstrasse 124, 4070 Basel, Switzerland.


F1000Research ◽  
2015 ◽  
Vol 4 ◽  
pp. 167 ◽  
Author(s):  
Yan Tan ◽  
Felix Wu ◽  
Pablo Tamayo ◽  
W. Nicholas Haining ◽  
Jill P. Mesirov

Summary: Gene set enrichment analysis (GSEA) approaches are widely used to identify coordinately regulated genes associated with phenotypes of interest. Here, we present Constellation Map, a tool to visualize and interpret the results when enrichment analyses yield a long list of significantly enriched gene sets. Constellation Map identifies commonalities that explain the enrichment of multiple top-scoring gene sets and maps the relationships between them. Constellation Map can help investigators take full advantage of GSEA and facilitates the biological interpretation of enrichment results. Availability: Constellation Map is freely available as a GenePattern module at http://www.genepattern.org.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 586 ◽  
Author(s):  
Anthony R. Colombo ◽  
Timothy J. Triche Jr ◽  
Giridharan Ramsingh

The recently introduced Kallisto pseudoaligner has radically simplified the quantification of transcripts in RNA-sequencing experiments.  We offer cloud-scale RNAseq pipelines Arkas-Quantification, and Arkas-Analysis available within Illumina’s BaseSpace cloud application platform which expedites Kallisto preparatory routines, reliably calculates differential expression, and performs gene-set enrichment of REACTOME pathways.  Due to inherit inefficiencies of scale, Illumina's BaseSpace computing platform offers a massively parallel distributive environment improving data management services and data importing.  Arkas-Quantification deploys Kallisto for parallel cloud computations and is conveniently integrated downstream from the BaseSpace Sequence Read Archive (SRA) import/conversion application titled SRA Import.  Arkas-Analysis annotates the Kallisto results by extracting structured information directly from source FASTA files with per-contig metadata, calculates the differential expression and gene-set enrichment analysis on both coding genes and transcripts. The Arkas cloud pipeline supports ENSEMBL transcriptomes and can be used downstream from the SRA Import facilitating raw sequencing importing, SRA FASTQ conversion, RNA quantification and analysis steps.


2019 ◽  
Vol 8 (10) ◽  
pp. 1580 ◽  
Author(s):  
Kyoung Min Moon ◽  
Kyueng-Whan Min ◽  
Mi-Hye Kim ◽  
Dong-Hoon Kim ◽  
Byoung Kwan Son ◽  
...  

Ninety percent of patients with scrub typhus (SC) with vasculitis-like syndrome recover after mild symptoms; however, 10% can suffer serious complications, such as acute respiratory failure (ARF) and admission to the intensive care unit (ICU). Predictors for the progression of SC have not yet been established, and conventional scoring systems for ICU patients are insufficient to predict severity. We aimed to identify simple and robust indicators to predict aggressive behaviors of SC. We evaluated 91 patients with SC and 81 non-SC patients who were admitted to the ICU, and 32 cases from the public functional genomics data repository for gene expression analysis. We analyzed the relationships between several predictors and clinicopathological characteristics in patients with SC. We performed gene set enrichment analysis (GSEA) to identify SC-specific gene sets. The acid-base imbalance (ABI), measured 24 h before serious complications, was higher in patients with SC than in non-SC patients. A high ABI was associated with an increased incidence of ARF, leading to mechanical ventilation and worse survival. GSEA revealed that SC correlated to gene sets reflecting inflammation/apoptotic response and airway inflammation. ABI can be used to indicate ARF in patients with SC and assist with early detection.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Mike Fang ◽  
Brian Richardson ◽  
Cheryl M. Cameron ◽  
Jean-Eudes Dazard ◽  
Mark J. Cameron

Abstract Background In this study, we demonstrate that our modified Gene Set Enrichment Analysis (GSEA) method, drug perturbation GSEA (dpGSEA), can detect phenotypically relevant drug targets through a unique transcriptomic enrichment that emphasizes biological directionality of drug-derived gene sets. Results We detail our dpGSEA method and show its effectiveness in detecting specific perturbation of drugs in independent public datasets by confirming fluvastatin, paclitaxel, and rosiglitazone perturbation in gastroenteropancreatic neuroendocrine tumor cells. In drug discovery experiments, we found that dpGSEA was able to detect phenotypically relevant drug targets in previously published differentially expressed genes of CD4+T regulatory cells from immune responders and non-responders to antiviral therapy in HIV-infected individuals, such as those involved with virion replication, cell cycle dysfunction, and mitochondrial dysfunction. dpGSEA is publicly available at https://github.com/sxf296/drug_targeting. Conclusions dpGSEA is an approach that uniquely enriches on drug-defined gene sets while considering directionality of gene modulation. We recommend dpGSEA as an exploratory tool to screen for possible drug targeting molecules.


2011 ◽  
Vol 10 (4) ◽  
pp. 3856-3887 ◽  
Author(s):  
Q.Y. Ning ◽  
J.Z. Wu ◽  
N. Zang ◽  
J. Liang ◽  
Y.L. Hu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document