pathway databases
Recently Published Documents


TOTAL DOCUMENTS

86
(FIVE YEARS 31)

H-INDEX

17
(FIVE YEARS 2)

2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Prasansah Shrestha ◽  
Min-Su Kim ◽  
Ermal Elbasani ◽  
Jeong-Dong Kim ◽  
Tae-Jin Oh

Abstract Background Metabolism including anabolism and catabolism is a prerequisite phenomenon for all living organisms. Anabolism refers to the synthesis of the entire compound needed by a species. Catabolism refers to the breakdown of molecules to obtain energy. Many metabolic pathways are undisclosed and many organism-specific enzymes involved in metabolism are misplaced. When predicting a specific metabolic pathway of a microorganism, the first and foremost steps is to explore available online databases. Among many online databases, KEGG and MetaCyc pathway databases were used to deduce trehalose metabolic network for bacteria Variovorax sp. PAMC28711. Trehalose, a disaccharide, is used by the microorganism as an alternative carbon source. Results While using KEGG and MetaCyc databases, we found that the KEGG pathway database had one missing enzyme (maltooligosyl-trehalose synthase, EC 5.4.99.15). The MetaCyc pathway database also had some enzymes. However, when we used RAST to annotate the entire genome of Variovorax sp. PAMC28711, we found that all enzymes that were missing in KEGG and MetaCyc databases were involved in the trehalose metabolic pathway. Conclusions Findings of this study shed light on bioinformatics tools and raise awareness among researchers about the importance of conducting detailed investigation before proceeding with any further work. While such comparison for databases such as KEGG and MetaCyc has been done before, it has never been done with a specific microbial pathway. Such studies are useful for future improvement of bioinformatics tools to reduce limitations.


2021 ◽  
Author(s):  
Tobias Rubel ◽  
Pramesh Singh ◽  
Anna Ritz

A major goal of molecular systems biology is to understand the coordinated function of genes or proteins in response to cellular signals and to understand these dynamics in the context of disease. Signaling pathway databases such as KEGG, NetPath, NCI-PID, and Panther describe the molecular interactions involved in different cellular responses. While the same pathway may be present in different databases, prior work has shown that the particular proteins and interactions differ across database annotations. However, to our knowledge no one has attempted to quantify their structural differences. It is important to characterize artifacts or other biases within pathway databases, which can provide a more informed interpretation for downstream analyses. In this work, we consider signaling pathways as graphs and we use topological measures to study their structure. We find that topological characterization using graphlets (small, connected subgraphs) distinguishes signaling pathways from appropriate null models of interaction networks. Next, we quantify topological similarity across pathway databases. Our analysis reveals that the pathways harbor database-specific characteristics implying that even though these databases describe the same pathways, they tend to be systematically different from one another. We show that pathway-specific topology can be uncovered after accounting for database-specific structure. This work present the first step towards elucidating common pathway structure beyond their specific database annotations.


Life ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 785
Author(s):  
Mila Glavaški ◽  
Lazar Velicki

Hypertrophic cardiomyopathy (HCM) is the most common inherited cardiovascular disease with a prevalence of 1 in 500 people and varying clinical presentations. Although there is much research on HCM, underlying molecular mechanisms are poorly understood, and research on the molecular mechanisms of its specific clinical presentations is scarce. Our aim was to explore the molecular mechanisms shared by HCM and its clinical presentations through the automated extraction of molecular mechanisms. Molecular mechanisms were congregated by a query of the INDRA database, which aggregates knowledge from pathway databases and combines it with molecular mechanisms extracted from abstracts and open-access full articles by multiple machine-reading systems. The molecular mechanisms were extracted from 230,072 articles on HCM and 19 HCM clinical presentations, and their intersections were found. Shared molecular mechanisms of HCM and its clinical presentations were represented as networks; the most important elements in the intersections’ networks were found, centrality scores for each element of each network calculated, networks with reduced level of noise generated, and cooperatively working elements detected in each intersection network. The identified shared molecular mechanisms represent possible mechanisms underlying different HCM clinical presentations. Applied methodology produced results consistent with the information in the scientific literature.


2021 ◽  
Vol 3 (3) ◽  
Author(s):  
Sarah Mubeen ◽  
Vinay S Bharadhwaj ◽  
Yojana Gadiya ◽  
Martin Hofmann-Apitius ◽  
Alpha T Kodamullil ◽  
...  

Abstract The past decades have brought a steady growth of pathway databases and enrichment methods. However, the advent of pathway data has not been accompanied by an improvement in interoperability across databases, hampering the use of pathway knowledge from multiple databases for enrichment analysis. While integrative databases have attempted to address this issue, they often do not account for redundant information across resources. Furthermore, the majority of studies that employ pathway enrichment analysis still rely upon a single database or enrichment method, though the use of another could yield differing results. These shortcomings call for approaches that investigate the differences and agreements across databases and methods as their selection in the design of a pathway analysis can be a crucial step in ensuring the results of such an analysis are meaningful. Here we present DecoPath, a web application to assist in the interpretation of the results of pathway enrichment analysis. DecoPath provides an ecosystem to run enrichment analysis or directly upload results and facilitate the interpretation of results with custom visualizations that highlight the consensus and/or discrepancies at the pathway- and gene-levels. DecoPath is available at https://decopath.scai.fraunhofer.de, and its source code and documentation can be found on GitHub at https://github.com/DecoPath/DecoPath.


2021 ◽  
Author(s):  
Sarah Mubeen ◽  
Vinay Srinivas Bharadhwaj ◽  
Yojana Gadiya ◽  
Martin Hofmann-Apitius ◽  
Alpha Tom Kodamullil ◽  
...  

The past two decades have brought a steady growth of pathway databases and pathway enrichment methods. However, the advent of pathway data has not been accompanied by an improvement with regards to interoperability across databases, thus, hampering the use of pathway knowledge from multiple databases for pathway enrichment analyses. While integrative databases have attempted to address this issue by collating pathway knowledge from multiple resources, these approaches do not account for redundant information across them. On the other hand, the majority of studies that employ pathway enrichment analyses still rely upon a single database, though the use of another resource could yield differing results, which is similarly the case when different pathway enrichment methods are employed. These shortcomings call for approaches that investigate the differences and agreements across databases and enrichment methods as their selection in the experimental design of a pathway analysis can be a crucial first step in ensuring the results of such an analysis are meaningful. Here we present DecoPath, a web application to assist in the interpretation of the results of pathway enrichment analysis. DecoPath provides an ecosystem to run pathway enrichment analysis or directly upload results and facilitate the interpretation of these results with custom visualizations that highlight the consensus and/or discrepancies at the pathway- and gene-levels. DecoPath is available at https://decopath.scai.fraunhofer.de and its source code and documentation can be found on GitHub at https://github.com/DecoPath/DecoPath.


2021 ◽  
Author(s):  
Xiaoxi Dong ◽  
Kovidh Vegesna ◽  
Cory Brouwer ◽  
Weijun Luo

AbstractPathway analysis is widely used in genomics and omics research, but the data visualization has been highly limited in function, pathway coverage and data format. Here we develop SBGNview a comprehensive solution to address these needs. By adopting the standard SBGN format, SBGNview greatly extend the coverage of pathway based analysis and data visualization to essentially all major pathway databases beyond KEGG, including 5200 reference pathways and over 3000 species. In addition, SBGNview substantially extends current tools in both design and function, including standard coherent input/output formats, high quality graphics convenient for both computational and manual analysis, and flexible and open-end workflow. In addition to pathway analysis and data visualization, SBGNview provides essential infrastructure for SBGN data manipulation and processing. SBGNview is available online: https://github.com/datapplab/SBGNview.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Peter D. Karp ◽  
Peter E. Midford ◽  
Ron Caspi ◽  
Arkady Khodursky

Abstract Background Enrichment or over-representation analysis is a common method used in bioinformatics studies of transcriptomics, metabolomics, and microbiome datasets. The key idea behind enrichment analysis is: given a set of significantly expressed genes (or metabolites), use that set to infer a smaller set of perturbed biological pathways or processes, in which those genes (or metabolites) play a role. Enrichment computations rely on collections of defined biological pathways and/or processes, which are usually drawn from pathway databases. Although practitioners of enrichment analysis take great care to employ statistical corrections (e.g., for multiple testing), they appear unaware that enrichment results are quite sensitive to the pathway definitions that the calculation uses. Results We show that alternative pathway definitions can alter enrichment p-values by up to nine orders of magnitude, whereas statistical corrections typically alter enrichment p-values by only two orders of magnitude. We present multiple examples where the smaller pathway definitions used in the EcoCyc database produces stronger enrichment p-values than the much larger pathway definitions used in the KEGG database; we demonstrate that to attain a given enrichment p-value, KEGG-based enrichment analyses require 1.3–2.0 times as many significantly expressed genes as does EcoCyc-based enrichment analyses. The large pathways in KEGG are problematic for another reason: they blur together multiple (as many as 21) biological processes. When such a KEGG pathway receives a high enrichment p-value, which of its component processes is perturbed is unclear, and thus the biological conclusions drawn from enrichment of large pathways are also in question. Conclusions The choice of pathway database used in enrichment analyses can have a much stronger effect on the enrichment results than the statistical corrections used in these analyses.


2021 ◽  
Vol 15 (8) ◽  
pp. 803-820
Author(s):  
Ali Ghulam ◽  
Xiujuan Lei ◽  
Min Guo ◽  
Chen Bian

This study focused on describing the necessary information related to pathway mechanisms, characteristics, and databases feature annotations. Various difficulties related to data storage and retrieval in biological pathway databases are discussed. These focus on different techniques for retrieving annotations, features, and methods of digital pathway databases for biological pathway analysis. Furthermore, many pathway databases annotations, features, and search databases were also examined (which are reasonable for the integration into microarray examination). The investigation was performed on the databases, which contain human pathways to understand the hidden components of cells applied in this process. Three different domain-specific pathways were selected for this study and the information of pathway databases was extracted from the existing literature. The research compared different pathways and performed molecular level relations. Moreover, the associations between pathway networks were also evaluated. The study involved datasets for gene pathway matrices and pathway scoring techniques. Additionally, different pathways techniques, such as metabolomics and biochemical pathways, translation, control, and signaling pathways and signal transduction, were also considered. We also analyzed the list of gene sets and constructed a gene pathway network. This article will serve as a useful manual for storing a repository of specific biological data and disease pathways.


Sign in / Sign up

Export Citation Format

Share Document