Modeling Antibacterial Activity with Machine Learning and Fusion of Chemical Structure Information with Microorganism Metabolic Networks

Abstract Background The topology of metabolic networks is both well-studied and remarkably well-conserved across many species. The regulation of these networks, however, is much more poorly characterized, though it is known to be divergent across organisms—two characteristics that make it difficult to model metabolic networks accurately. While many computational methods have been built to unravel transcriptional regulation, there have been few approaches developed for systems-scale analysis and study of metabolic regulation. Here, we present a stepwise machine learning framework that applies established algorithms to identify regulatory interactions in metabolic systems based on metabolic data: stepwise classification of unknown regulation, or SCOUR. Results We evaluated our framework on both noiseless and noisy data, using several models of varying sizes and topologies to show that our approach is generalizable. We found that, when testing on data under the most realistic conditions (low sampling frequency and high noise), SCOUR could identify reaction fluxes controlled only by the concentration of a single metabolite (its primary substrate) with high accuracy. The positive predictive value (PPV) for identifying reactions controlled by the concentration of two metabolites ranged from 32 to 88% for noiseless data, 9.2 to 49% for either low sampling frequency/low noise or high sampling frequency/high noise data, and 6.6–27% for low sampling frequency/high noise data, with results typically sufficiently high for lab validation to be a practical endeavor. While the PPVs for reactions controlled by three metabolites were lower, they were still in most cases significantly better than random classification. Conclusions SCOUR uses a novel approach to synthetically generate the training data needed to identify regulators of reaction fluxes in a given metabolic system, enabling metabolomics and fluxomics data to be leveraged for regulatory structure inference. By identifying and triaging the most likely candidate regulatory interactions, SCOUR can drastically reduce the amount of time needed to identify and experimentally validate metabolic regulatory interactions. As high-throughput experimental methods for testing these interactions are further developed, SCOUR will provide critical impact in the development of predictive metabolic models in new organisms and pathways.

Download Full-text

Functional prediction of environmental variables using metabolic networks

Scientific Reports ◽

10.1038/s41598-021-91486-8 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Adèle Weber Zendrera ◽

Nataliya Sokolovska ◽

Hédi A. Soula

Keyword(s):

Machine Learning ◽

Growth Temperature ◽

Environmental Variables ◽

Metabolic Networks ◽

Machine Learning Techniques ◽

Underlying Structure ◽

Glutathione Biosynthesis ◽

Additional Information ◽

Cold Environments ◽

Novel Approach

AbstractIn this manuscript, we propose a novel approach to assess relationships between environment and metabolic networks. We used a comprehensive dataset of more than 5000 prokaryotic species from which we derived the metabolic networks. We compute the scope from the reconstructed graphs, which is the set of all metabolites and reactions that can potentially be synthesized when provided with external metabolites. We show using machine learning techniques that the scope is an excellent predictor of taxonomic and environmental variables, namely growth temperature, oxygen tolerance, and habitat. In the literature, metabolites and pathways are rarely used to discriminate species. We make use of the scope underlying structure—metabolites and pathways—to construct the predictive models, giving additional information on the important metabolic pathways needed to discriminate the species, which is often absent in other metabolic network properties. For example, in the particular case of growth temperature, glutathione biosynthesis pathways are specific to species growing in cold environments, whereas tungsten metabolism is specific to species in warm environments, as was hinted in current literature. From a machine learning perspective, the scope is able to reduce the dimension of our data, and can thus be considered as an interpretable graph embedding.

Download Full-text

ChemInform Abstract: Capturing Chemical Structure Information in a Relational Database System: The Chemical Software Component Approach.

ChemInform ◽

10.1002/chin.199604285 ◽

2010 ◽

Vol 27 (4) ◽

pp. no-no

Author(s):

T. R. HAGADONE ◽

M. W. SCHULZ

Keyword(s):

Relational Database ◽

Chemical Structure ◽

Database System ◽

Software Component ◽

Structure Information ◽

Relational Database System ◽

Component Approach

Download Full-text

Automated extraction of chemical structure information from digital raster images

Chemistry Central Journal ◽

10.1186/1752-153x-3-4 ◽

2009 ◽

Vol 3 (1) ◽

Cited By ~ 36

Author(s):

Jungkap Park ◽

Gus R Rosania ◽

Kerby A Shedden ◽

Mandee Nguyen ◽

Naesung Lyu ◽

...

Keyword(s):

Chemical Structure ◽

Automated Extraction ◽

Structure Information ◽

Digital Raster

Download Full-text

Machine learning in drug design: Use of artificial intelligence to explore the chemical structure–biological activity relationship

Wiley Interdisciplinary Reviews Computational Molecular Science ◽

10.1002/wcms.1568 ◽

2021 ◽

Author(s):

Maciej Staszak ◽

Katarzyna Staszak ◽

Karolina Wieszczycka ◽

Anna Bajek ◽

Krzysztof Roszkowski ◽

...

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Biological Activity ◽

Drug Design ◽

Chemical Structure ◽

Activity Relationship

Download Full-text

Chemical structure recognition and prediction: A machine learning technique

2018 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB) ◽

10.1109/cibcb.2018.8404964 ◽

2018 ◽

Cited By ~ 1

Author(s):

Fakheredine Keyrouz ◽

Lara Tauk ◽

Elias Feghali

Keyword(s):

Machine Learning ◽

Chemical Structure ◽

Machine Learning Technique ◽

Structure Recognition ◽

Learning Technique

Download Full-text

Chemical Structure Information at the Bench: A New Integrated Approach

ACS Symposium Series - Chemical Structure Information Systems ◽

10.1021/bk-1989-0400.ch009 ◽

1989 ◽

pp. 82-88

Author(s):

Robert M. Olszewski ◽

Everett A. Bruce ◽

Craig Leilous ◽

Rudy Potenzone

Keyword(s):

Chemical Structure ◽

Integrated Approach ◽

Structure Information

Download Full-text

Artificial Metabolic Networks: enabling neural computation with metabolic networks

10.1101/2022.01.09.475487 ◽

2022 ◽

Author(s):

Leon Faure ◽

Bastien Mollet ◽

Wolfram Liebermeister ◽

Jean-Loup Faulon

Keyword(s):

Machine Learning ◽

Recurrent Neural Networks ◽

Metabolic Networks ◽

Cross Validation ◽

Regression Coefficient ◽

Network Models ◽

Biotechnological Applications ◽

Neural Network Models ◽

Surrogate Constraint ◽

Constraint Based Modeling

Metabolic networks have largely been exploited as mechanistic tools to predict the behavior of microorganisms with a defined genotype in different environments. However, flux predictions by constraint-based modeling approaches are limited in quality unless labor-intensive experiments including the measurement of media intake fluxes, are performed. Using machine learning instead of an optimization of biomass flux - on which most existing constraint-based methods are based - provides ways to improve flux and growth rate predictions. In this paper, we show how Recurrent Neural Networks can surrogate constraint-based modeling and make metabolic networks suitable for backpropagation and consequently be used as an architecture for machine learning. We refer to our hybrid - mechanistic and neural network - models as Artificial Metabolic Networks (AMN). We showcase AMN and illustrate its performance with an experimental dataset of Escherichia coli growth rates in 73 different media compositions. We reach a regression coefficient of R2=0.78 on cross-validation sets. We expect AMNs to provide easier discovery of metabolic insights and prompt new biotechnological applications.

Download Full-text

Prediction of Compound-Protein Interactions with Machine Learning Methods

Chemoinformatics and Advanced Machine Learning Perspectives ◽

10.4018/978-1-61520-911-8.ch016 ◽

2011 ◽

pp. 304-317

Author(s):

Yoshihiro Yamanishi ◽

Hisashi Kashima

Keyword(s):

Machine Learning ◽

Protein Interactions ◽

Chemical Structure ◽

Genomic Sequence ◽

Sequence Data ◽

Binary Classification ◽

Biological Data ◽

Supervised Machine Learning ◽

Learning Methods ◽

Machine Learning Methods

In silico prediction of compound-protein interactions from heterogeneous biological data is critical in the process of drug development. In this chapter the authors review several supervised machine learning methods to predict unknown compound-protein interactions from chemical structure and genomic sequence information simultaneously. The authors review several kernel-based algorithms from two different viewpoints: binary classification and dimension reduction. In the results, they demonstrate the usefulness of the methods on the prediction of drug-target interactions and ligand-protein interactions from chemical structure data and genomic sequence data.

Download Full-text