scholarly journals ProbAnnoWeb and ProbAnnoPy: probabilistic annotation and gap-filling of metabolic reconstructions

2017 ◽  
Author(s):  
Brendan King ◽  
Terry Farrah ◽  
Matthew Richards ◽  
Michael Mundy ◽  
Evangelos Simeonidis ◽  
...  

AbstractSummaryGap-filling is a necessary step to produce quality genome-scale metabolic reconstructions capable of flux-balance simulation. Most available gap-filling tools use an organism-agnostic approach, where reactions are selected from a database to fill gaps without consideration of the target organism. Conversely, our likelihood based gap-filling with probabilistic annotations selects candidate reactions based on a likelihood score derived specifically from the target organism’s genome. Here, we present two new implementations of probabilistic annotation and likelihood based gap-filling: a web service called ProbAnnoWeb, and a standalone python package called ProbAnnoPy.Availability and ImplementationOur tools are available as a web service with no installation needed (ProbAnnoWeb), available at http://probannoweb.systemsbiology.net, and as a local python package implementation (ProbAnnoPy), available for download at http://github.com/PriceLab/probannopy.Contacthttp://[email protected]; http://[email protected]

2019 ◽  
Author(s):  
Gregory L. Medlock ◽  
Jason A. Papin

AbstractUncertainty in the structure and parameters of networks is ubiquitous across computational biology. In constraint-based reconstruction and analysis of metabolic networks, this uncertainty is present both during the reconstruction of networks and in simulations performed with them. Here, we present Medusa, a Python package for the generation and analysis of ensembles of genome-scale metabolic network reconstructions. Medusa builds on the COBRApy package for constraint-based reconstruction and analysis by compressing a set of models into a compact ensemble object, providing functions for the generation of ensembles using experimental data, and extending constraint-based analyses to ensemble scale. We demonstrate how Medusa can be used to generate ensembles, perform ensemble simulations, and how machine learning can be used in conjunction with Medusa to guide the curation of genome-scale metabolic network reconstructions. Medusa is available under the permissive MIT license from the Python Packaging Index (https://pypi.org/) and from github (https://github.com/gregmedlock/Medusa/), and comprehensive documentation is available at https://medusa.readthedocs.io/en/latest/.


2017 ◽  
Author(s):  
Paul D. Blischak ◽  
Julia Chifman ◽  
Andrea D. Wolfe ◽  
Laura S. Kubatko

AbstractThe analysis of hybridization and gene flow among closely related taxa is a common goal for researchers studying speciation and phylogeography. Many methods for hybridization detection use simple site pattern frequencies from observed genomic data and compare them to null models that predict an absence of gene flow. The theory underlying the detection of hybridization using these site pattern probabilities exploits the relationship between the coalescent process for gene trees within population trees and the process of mutation along the branches of the gene trees. For certain models, site patterns are predicted to occur in equal frequency (i.e., their difference is 0), producing a set of functions called phylogenetic invariants. In this paper we introduce HyDe, a software package for detecting hybridization using phylogenetic invariants arising under the coalescent model with hybridization. HyDe is written in Python, and can be used interactively or through the command line using pre-packaged scripts. We demonstrate the use of HyDe on simulated data, as well as on two empirical data sets from the literature. We focus in particular on identifying individual hybrids within population samples and on distinguishing between hybrid speciation and gene flow. HyDe is freely available as an open source Python package under the GNU GPL v3 on both GitHub (https://github.com/pblischak/HyDe) and the Python Package Index (PyPI: https://pypi.python.org/pypi/phyde).


2018 ◽  
Author(s):  
Jean-Christophe Lachance ◽  
Jonathan M. Monk ◽  
Colton J. Lloyd ◽  
Yara Seif ◽  
Bernhard O. Palsson ◽  
...  

AbstractGenome-scale models (GEMs) rely on a biomass objective function (BOF) to predict phenotype from genotype. Here we present BOFdat, a Python package that offers functions to generate biomass objective function stoichiometric coefficients (BOFsc) from macromolecular cell composition and relative abundances of macromolecules obtained from omic datasets. Growth-associated and non-growth associated maintenance (GAM and NGAM) costs can also be calculated by BOFdat.BOFdat is freely available on the Python Package Index (pip install BOFdat). The source code and an example usage (Jupyter Notebook and example files) are available on GitHub (https://github.com/jclachance/BOFdat). The documentation and API are available through ReadTheDocs (https://bofdat.readthedocs.io)[email protected], [email protected], [email protected]


2021 ◽  
Author(s):  
Fernando Cruz ◽  
João Capela ◽  
Eugénio C. Ferreira ◽  
Miguel Rocha ◽  
Oscar Dias

AbstractAs the reconstruction of Genome-Scale Metabolic Models becomes standard practice in systems biology, the number of organisms having at least one metabolic model at the genome-scale is peaking at an unprecedented scale. The automation of several laborious tasks, such as gap-finding and gap-filling, allowed to develop GSMMs for poorly described organisms. However, such models’ quality can be compromised by the automation of several steps, which may lead to erroneous phenotype simulations.The Biological networks constraint-based In Silico Optimization (BioISO) is a computational tool aimed at accelerating the reconstruction of Genome-Scale Metabolic Models. This tool facilitates the manual curation steps by reducing the large search spaces often met when debugging in silico biological models. BioISO uses a recursive relation-like algorithm and Flux Balance Analysis to evaluate and guide debugging of in silico phenotype simulations. The potential of BioISO to guide the debugging of model reconstructions was showcased using GSMMs available in literature and compared with the results of two other state-of-the-art gap-filling tools (Meneco and fastGapFill). Furthermore, BioISO was used as Meneco’s gap-finding algorithm to reduce the number of proposed solutions (reaction sets) for filling the gaps.BioISO was implemented as a webserver available at https://bioiso.bio.di.uminho.pt; and integrated into merlin as a plugin. BioISO’s implementation as a Python™ package can also be retrieved from https://github.com/BioSystemsUM/BioISO.


Microbiome ◽  
2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Jack Jansma ◽  
Sahar El Aidy

AbstractThe human gut harbors an enormous number of symbiotic microbes, which is vital for human health. However, interactions within the complex microbiota community and between the microbiota and its host are challenging to elucidate, limiting development in the treatment for a variety of diseases associated with microbiota dysbiosis. Using in silico simulation methods based on flux balance analysis, those interactions can be better investigated. Flux balance analysis uses an annotated genome-scale reconstruction of a metabolic network to determine the distribution of metabolic fluxes that represent the complete metabolism of a bacterium in a certain metabolic environment such as the gut. Simulation of a set of bacterial species in a shared metabolic environment can enable the study of the effect of numerous perturbations, such as dietary changes or addition of a probiotic species in a personalized manner. This review aims to introduce to experimental biologists the possible applications of flux balance analysis in the host-microbiota interaction field and discusses its potential use to improve human health.


2010 ◽  
Vol 38 (5) ◽  
pp. 1225-1229 ◽  
Author(s):  
Evangelos Simeonidis ◽  
Ettore Murabito ◽  
Kieran Smallbone ◽  
Hans V. Westerhoff

Advances in biological techniques have led to the availability of genome-scale metabolic reconstructions for yeast. The size and complexity of such networks impose limits on what types of analyses one can perform. Constraint-based modelling overcomes some of these restrictions by using physicochemical constraints to describe the potential behaviour of an organism. FBA (flux balance analysis) highlights flux patterns through a network that serves to achieve a particular objective and requires a minimal amount of data to make quantitative inferences about network behaviour. Even though FBA is a powerful tool for system predictions, its general formulation sometimes results in unrealistic flux patterns. A typical example is fermentation in yeast: ethanol is produced during aerobic growth in excess glucose, but this pattern is not present in a typical FBA solution. In the present paper, we examine the issue of yeast fermentation against respiration during growth. We have studied a number of hypotheses from the modelling perspective, and novel formulations of the FBA approach have been tested. By making the observation that more respiration requires the synthesis of more mitochondria, an energy cost related to mitochondrial synthesis is added to the FBA formulation. Results, although still approximate, are closer to experimental observations than earlier FBA analyses, at least on the issue of fermentation.


2022 ◽  
Author(s):  
Javad Zamani ◽  
Sayed-Amir Marashi ◽  
Tahmineh Lohrasebi ◽  
Mohammad-Ali Malboobi ◽  
Esmail Foroozan

Genome-scale metabolic models (GSMMs) have enabled researchers to perform systems-level studies of living organisms. As a constraint-based technique, flux balance analysis (FBA) aids computation of reaction fluxes and prediction of...


2019 ◽  
Author(s):  
Kokulapalan Wimalanathan ◽  
Carolyn J. Lawrence-Dill

AbstractAnnotating gene structures and functions to genome assemblies is a must to make assembly resources useful for biological inference. Gene Ontology (GO) term assignment is the most pervasively used functional annotation system, and new methods for GO assignment have improved the quality of GO-based function predictions. GOMAP, the Gene Ontology Meta Annotator for Plants (GOMAP) is an optimized, high-throughput, and reproducible pipeline for genome-scale GO annotation for plant genomes. GOMAP’s methods have been shown to expand and improve the number of genes annotated and annotations assigned per gene as well as the quality (based on F-score) of GO assignments in maize. Here we report on the pipeline’s availability and performance for annotating large, repetitive plant genomes and describe how to deploy GOMAP to annotate additional plant genomes. We containerized GOMAP to increase portability and reproducibility, and optimized its performance for HPC environments. GOMAP has been used to annotate multiple maize lines, and is currently being deployed to annotate other species including wheat, rice, barley, cotton, soy, and others. Instructions along with access to the GOMAP Singularity container are freely available online at https://gomap-singularity.readthedocs.io/en/latest/. A list of annotated genomes and links to data is maintained at https://dill-picl.org/projects/gomap/gomap-datasets/.


Sign in / Sign up

Export Citation Format

Share Document