scholarly journals Integrating thermodynamic and enzymatic constraints into genome-scale metabolic models

2020 ◽  
Author(s):  
Xue Yang ◽  
Zhitao Mao ◽  
Xin Zhao ◽  
Ruoyu Wang ◽  
Peiji Zhang ◽  
...  

AbstractStoichiometric genome-scale metabolic network models (GEMs) have been widely used to predict metabolic phenotypes. In addition to stoichiometric ratios, other constraints such as enzyme availability and thermodynamic feasibility can also limit the phenotype solution space. Extended GEM models considering either enzymatic or thermodynamic constraints have been shown to improve prediction accuracy. In this paper, we propose a novel method that integrates both enzymatic and thermodynamic constraints in a single Pyomo modeling framework (ETGEMs). We applied this method to construct the EcoETM, the E. coli metabolic model iML1515 with enzymatic and thermodynamic constraints. Using this model, we calculated the optimal pathways for cellular growth and the production of 22 metabolites. When comparing the results with those of iML1515 and models with one of the two constraints, we observed that many thermodynamically unfavorable and/or high enzyme cost pathways were excluded from EcoETM. For example, the synthesis pathway of carbamoyl-phosphate (Cbp) from iML1515 is both thermodynamically unfavorable and enzymatically costly. After introducing the new constraints, the production pathways and yields of several Cbp-derived products (e.g. L-arginine, orotate) calculated using EcoETM were more realistic. The results of this study demonstrate the great application potential of metabolic models with multiple constraints for pathway analysis and phenotype predication.

2019 ◽  
Author(s):  
Vikash Pandey ◽  
Daniel Hernandez Gardiol ◽  
Anush Chiappino-Pepe ◽  
Vassily Hatzimanikatis

AbstractA large number of genome-scale models of cellular metabolism are available for various organisms. These models include all known metabolic reactions based on the genome annotation. However, the reactions that are active are dependent on the cellular metabolic function or environmental condition. Constraint-based methods that integrate condition-specific transcriptomics data into models have been used extensively to investigate condition-specific metabolism. Here, we present a method (TEX-FBA) for modeling condition-specific metabolism that combines transcriptomics and reaction thermodynamics data to generate a thermodynamically-feasible condition-specific metabolic model. TEX-FBA is an extension of thermodynamic-based flux balance analysis (TFA), which allows the simultaneous integration of different stages of experimental data (e.g., absolute gene expression, metabolite concentrations, thermodynamic data, and fluxomics) and the identification of alternative metabolic states that maximize consistency between gene expression levels and condition-specific reaction fluxes. We applied TEX-FBA to a genome-scale metabolic model ofEscherichia coliby integrating available condition-specific experimental data and found a marked reduction in the flux solution space. Our analysis revealed a marked correlation between actual gene expression profile and experimental flux measurements compared to the one obtained from a randomly generated gene expression profile. We identified additional essential reactions from the membrane lipid and folate metabolism when we integrated transcriptomics data of the given condition on the top of metabolomics and thermodynamics data. These results show TEX-FBA is a promising new approach to study condition-specific metabolism when different types of experimental data are available.Author summaryCells utilize nutrients via biochemical reactions that are controlled by enzymes and synthesize required compounds for their survival and growth. Genome-scale models of metabolism representing these complex reaction networks have been reconstructed for a wide variety of organisms ranging from bacteria to human cells. These models comprise all possible biochemical reactions in a cell, but cells choose only a subset of reactions for their immediate needs and functions. Usually, these models allow for a large flux solution space and one can integrate experimental data in order to reduce it and potentially predict the physiology for a specific condition. We developed a method for integrating different types of omics data, such as fluxomics, transcriptomics, metabolomics into genome-scale metabolic models that reduces the flux solution space. Using gene expression data, the algorithm maximizes the consistency between the predicted and experimental flux for the reactions and predicts biologically relevant flux ranges for the remaining reactions in the network. This method is useful for determining fluxes of metabolic reactions with reduced uncertainty and suitable for performing context- and condition-specific analysis in metabolic models using different types of experimental data.


2020 ◽  
Author(s):  
Vetle Simensen ◽  
André Voigt ◽  
Eivind Almaas

AbstractThe long-chain, ω-3 polyunsaturated fatty acids (PUFAs) eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) are essential for humans and animals, including marine fish species. Presently, the primary source of these PUFAs is fish oils. As the global production of fish oils appears to be reaching its limits, alternative sources of high-quality ω-3 PUFAs is paramount to support the growing aquaculture industry. Thraustochytrids are a group of heterotrophic protists able to synthesize and accrue large amounts of essential ω-3 PUFAs, including EPA and DHA. Thus, the thraustochytrids are prime candidates to solve the increasing demand for ω-3 PUFAs using microbial cell factories. However, a systems-level understanding of their metabolic shift from cellular growth into lipid accumulation is, to a large extent, unclear. Here, we reconstructed a high-quality genome-scale metabolic model of the thraustochytrid Aurantiochytrium sp. T66 termed iVS1191. Through iterative rounds of model refinement and extensive manual curation, we significantly enhanced the metabolic scope and coverage of the reconstruction from that of previously published models, making considerable improvements with stoichiometric consistency, metabolic connectivity, and model annotations. We show that iVS1191 is highly consistent with experimental growth data, reproducing in vivo growth phenotypes as well as specific growth rates on minimal carbon media. The availability of iVS1191 provides a solid framework for further developing our understanding of T66’s metabolic properties, as well as exploring metabolic engineering and process-optimization strategies in silico for increased ω-3 PUFA production.


2021 ◽  
Author(s):  
Francisco Zorrilla ◽  
Kiran R. Patil ◽  
Aleksej Zelezniak

AbstractAdvances in genome-resolved metagenomic analysis of complex microbial communities have revealed a large degree of interspecies and intraspecies genetic diversity through the reconstruction of metagenome assembled genomes (MAGs). Yet, metabolic modeling efforts still tend to rely on reference genomes as the starting point for reconstruction and simulation of genome scale metabolic models (GEMs), neglecting the immense intra- and inter-species diversity present in microbial communities. Here we present metaGEM (https://github.com/franciscozorrilla/metaGEM), an end-to-end highly scalable pipeline enabling metabolic modeling of multi-species communities directly from metagenomic samples. The pipeline automates all steps from the extraction of context-specific prokaryotic GEMs from metagenome assembled genomes to community level flux balance simulations. To demonstrate the capabilities of the metaGEM pipeline, we analyzed 483 samples spanning lab culture, human gut, plant associated, soil, and ocean metagenomes, to reconstruct over 14 000 prokaryotic GEMs. We show that GEMs reconstructed from metagenomes have fully represented metabolism comparable to the GEMs reconstructed from reference genomes. We further demonstrate that metagenomic GEMs capture intraspecies metabolic diversity by identifying the differences between pathogenicity levels of type 2 diabetes at the level of gut bacterial metabolic exchanges. Overall, our pipeline enables simulation-ready metabolic model reconstruction directly from individual metagenomes, provides a resource of all reconstructed metabolic models, and showcases community-level modeling of microbiomes associated with disease conditions allowing generation of mechanistic hypotheses.


2021 ◽  
Author(s):  
Emanuel Cunha ◽  
Miguel Silva ◽  
Ines Chaves ◽  
Huseyin Demirci ◽  
Davide Lagoa ◽  
...  

AbstractIn the last decade, genome-scale metabolic models have been increasingly used to study plant metabolic behaviour at the tissue and multi-tissue level in different environmental conditions. Quercus suber (Q. suber), also known as the cork oak tree, is one of the most important forest communities of the Mediterranean/Iberian region. In this work, we present the genome-scale metabolic model of the Q. suber (iEC7871), the first of a woody plant. The metabolic model comprises 7871 genes, 6230 reactions, and 6481 metabolites across eight compartments. Transcriptomics data was integrated into the model to obtain tissue-specific models for the leaf, inner bark, and phellogen. Each tissue’s biomass composition was determined to improve model accuracy and merged into a diel multi-tissue metabolic model to predict interactions among the three tissues at the light and dark phases. The metabolic models were also used to analyze the pathways associated with the synthesis of suberin monomers. Nevertheless, the models developed in this work can provide insights about other aspects of the metabolism of Q. suber, such as its secondary metabolism and cork formation.


2018 ◽  
Vol 46 (4) ◽  
pp. 931-936 ◽  
Author(s):  
José P. Faria ◽  
Miguel Rocha ◽  
Isabel Rocha ◽  
Christopher S. Henry

In the era of next-generation sequencing and ubiquitous assembly and binning of metagenomes, new putative genome sequences are being produced from isolate and microbiome samples at ever-increasing rates. Genome-scale metabolic models have enormous utility for supporting the analysis and predictive characterization of these genomes based on sequence data. As a result, tools for rapid automated reconstruction of metabolic models are becoming critically important for supporting the analysis of new genome sequences. Many tools and algorithms have now emerged to support rapid model reconstruction and analysis. Here, we are comparing and contrasting the capabilities and output of a variety of these tools, including ModelSEED, Raven Toolbox, PathwayTools, SuBliMinal Toolbox and merlin.


2019 ◽  
Author(s):  
Miguel Ponce-de-León ◽  
Iñigo Apaolaza ◽  
Alfonso Valencia ◽  
Francisco J. Planes

ABSTRACTWith the publication of high-quality genome-scale metabolic models for several organisms, the Systems Biology community has developed a number of algorithms for their analysis making use of ever growing –omics data. In particular, the reconstruction of the first genome-scale human metabolic model, Recon1, promoted the development of Context-Specific Model (CS-Model) reconstruction methods. This family of algorithms aims to identify the set of metabolic reactions that are active in a cell in a given condition using omics data, such as gene expression levels. Different CS-Model reconstruction algorithms have their own strengths and weaknesses depending on the problem under study and omics data available. However, after careful inspection, we found that all of these algorithms share common issues in the way GPR rules and gene expression data are treated. The first issue is related with how gapfilling reactions are managed after the reconstruction is conducted. The second issue concerns the molecular context, which is used to build the CS-model but neglected for posterior analyses. To evaluate the effect of these issues, we reconstructed ∼400 CS-Models of cancer cell lines and conducted gene essentiality analysis, using CRISPR–Cas9 essentiality data for validation purposes. Altogether, our results illustrate the importance of correcting the errors introduced during the GPR translation in many of the published metabolic reconstructions.


2018 ◽  
Author(s):  
Marzia Di Filippo ◽  
Raúl A. Ortiz-Merino ◽  
Chiara Damiani ◽  
Gianni Frascotti ◽  
Danilo Porro ◽  
...  

Genome-scale metabolic models are powerful tools to understand and engineer cellular systems facilitating their use as cell factories. This is especially true for microorganisms with known genome sequences from which nearly complete sets of enzymes and metabolic pathways are determined, or can be inferred. Yeasts are highly diverse eukaryotes whose metabolic traits have long been exploited in industry, and although many of their genome sequences are available, few genome-scale metabolic models have so far been produced. For the first time, we reconstructed the genome-scale metabolic model of the hybrid yeast Zygosaccharomyces parabailii, which is a member of the Z. bailii sensu lato clade notorious for stress-tolerance and therefore relevant to industry. The model comprises 3096 reactions, 2091 metabolites, and 2413 genes. Our own laboratory data were then used to establish a biomass synthesis reaction, and constrain the extracellular environment. Through constraint-based modeling, our model reproduces the co-consumption and catabolism of acetate and glucose posing it as a promising platform for understanding and exploiting the metabolic potential of Z. parabailii.


2019 ◽  
Author(s):  
Sara A. Amin ◽  
Elizabeth Chavez ◽  
Nikhil U. Nair ◽  
Soha Hassoun

AbstractBackgroundMetabolic models are indispensable in guiding cellular engineering and in advancing our understanding of systems biology. As not all enzymatic activities are fully known and/or annotated, metabolic models remain incomplete, resulting in suboptimal computational analysis and leading to unexpected experimental results. We posit that one major source of unaccounted metabolism is promiscuous enzymatic activity. It is now well-accepted that most, if not all, enzymes are promiscuous – i.e., they transform substrates other than their primary substrate. However, there have been no systematic analyses of genome-scale metabolic models to predict putative reactions and/or metabolites that arise from enzyme promiscuity.ResultsOur workflow utilizes PROXIMAL – a tool that uses reactant-product transformation patterns from the KEGG database – to predict putative structural modifications due to promiscuous enzymes. Using iML1515 as a model system, we first utilized a computational workflow, referred to as Extended Metabolite Model Annotation (EMMA), to predict promiscuous reactions catalyzed, and metabolites produced, by natively encoded enzymes in E. coli. We predict hundreds of new metabolites that can be used to augment iML1515. We then validated our method by comparing predicted metabolites with the Escherichia coli Metabolome Database (ECMDB).ConclusionsWe utilized EMMA to augment the iML1515 metabolic model to more fully reflect cellular metabolic activity. This workflow uses enzyme promiscuity as basis to predict hundreds of reactions and metabolites that may exist in E. coli but have not been documented in iML1515 or other databases. Among these, we found that 17 metabolites have previously been documented in E. coli metabolomics studies. Further, 6 of these metabolites are not documented for any other E. coli metabolic model (e.g. KEGG, EcoCyc). The corresponding reactions should be added to iML1515 to create an Extended Metabolic Model (EMM). Other predicted metabolites and reactions can guide future experimental metabolomics studies. Further, our workflow can easily be applied to other organisms for which comprehensive genome-scale metabolic models are desirable.


Processes ◽  
2019 ◽  
Vol 7 (5) ◽  
pp. 286
Author(s):  
Amir Akbari ◽  
Paul I. Barton

Genome-scale models have become indispensable tools for the study of cellular growth. These models have been progressively improving over the past two decades, enabling accurate predictions of metabolic fluxes and key phenotypes under a variety of growth conditions. In this work, an efficient computational method is proposed to incorporate genome-scale models into superstructure optimization settings, introducing them as viable growth models to simulate the cultivation section of biorefinaries. We perform techno-economic and life-cycle analyses of an algal biorefinery with five processing sections to determine optimal processing pathways and technologies. Formulation of this problem results in a mixed-integer nonlinear program, in which the net present value is maximized with respect to mass flowrates and design parameters. We use a genome-scale metabolic model of Chlamydomonas reinhardtii to predict growth rates in the cultivation section. We study algae cultivation in open ponds, in which exchange fluxes of biomass and carbon dioxide are directly determined by the metabolic model. This formulation enables the coupling of flowrates and design parameters, leading to more accurate cultivation productivity estimates with respect to substrate concentration and light intensity.


2013 ◽  
Vol 105 (2) ◽  
pp. 512-522 ◽  
Author(s):  
Joshua J. Hamilton ◽  
Vivek Dwivedi ◽  
Jennifer L. Reed

Sign in / Sign up

Export Citation Format

Share Document