scholarly journals Metabolic Pathway Prediction Using Non-Negative Matrix Factorization with Improved Precision

Author(s):  
Abdur Rahman M.A. Basher ◽  
Ryan J. Mclaughlin ◽  
Steven J. Hallam
2020 ◽  
Author(s):  
Abdur Rahman M. A. Basher ◽  
Ryan J. McLaughlin ◽  
Steven J. Hallam

AbstractMachine learning provides a probabilistic framework for metabolic pathway inference from genomic sequence information at different levels of complexity and completion. However, several challenges including pathway features engineering, multiple mapping of enzymatic reactions and emergent or distributed metabolism within populations or communities of cells can limit prediction performance. In this paper, we present triUMPF, triple non-negative matrix factorization (NMF) with community detection for metabolic pathway inference, that combines three stages of NMF to capture myriad relationships between enzymes and pathways within a graph network. This is followed by community detection to extract higher order structure based on the clustering of vertices which share similar statistical properties. We evaluated triUMPF performance using experimental datasets manifesting diverse multi-label properties, including Tier 1 genomes from the BioCyc collection of organismal Pathway/Genome Databases and low complexity microbial communities. Resulting performance metrics equaled or exceeded other prediction methods on organismal genomes with improved precision on multi-organismal datasets.


Sign in / Sign up

Export Citation Format

Share Document