scholarly journals Leveraging structure for enzyme function prediction: methods, opportunities, and challenges

2014 ◽  
Vol 39 (8) ◽  
pp. 363-371 ◽  
Author(s):  
Matthew P. Jacobson ◽  
Chakrapani Kalyanaraman ◽  
Suwen Zhao ◽  
Boxue Tian
2014 ◽  
Vol 2014 ◽  
pp. 1-34 ◽  
Author(s):  
Giorgio Valentini

Protein function prediction is a complex multiclass multilabel classification problem, characterized by multiple issues such as the incompleteness of the available annotations, the integration of multiple sources of high dimensional biomolecular data, the unbalance of several functional classes, and the difficulty of univocally determining negative examples. Moreover, the hierarchical relationships between functional classes that characterize both the Gene Ontology and FunCat taxonomies motivate the development of hierarchy-aware prediction methods that showed significantly better performances than hierarchical-unaware “flat” prediction methods. In this paper, we provide a comprehensive review of hierarchical methods for protein function prediction based on ensembles of learning machines. According to this general approach, a separate learning machine is trained to learn a specific functional term and then the resulting predictions are assembled in a “consensus” ensemble decision, taking into account the hierarchical relationships between classes. The main hierarchical ensemble methods proposed in the literature are discussed in the context of existing computational methods for protein function prediction, highlighting their characteristics, advantages, and limitations. Open problems of this exciting research area of computational biology are finally considered, outlining novel perspectives for future research.


2016 ◽  
Vol 17 (1) ◽  
Author(s):  
Yuxiang Jiang ◽  
Tal Ronnen Oron ◽  
Wyatt T. Clark ◽  
Asma R. Bankapur ◽  
Daniel D’Andrea ◽  
...  

2020 ◽  
Vol 118 (3) ◽  
pp. 533a
Author(s):  
Safyan Aman Memon ◽  
Kinaan Aamir Khan ◽  
Hammad Naveed

2009 ◽  
Vol 25 (11) ◽  
pp. 1426-1427 ◽  
Author(s):  
R. Matthew Ward ◽  
E. Venner ◽  
B. Daines ◽  
S. Murray ◽  
S. Erdin ◽  
...  

2019 ◽  
Author(s):  
Cen Wan ◽  
David T. Jones

AbstractProtein function prediction is a challenging but important task in bioinformatics. Many prediction methods have been developed, but are still limited by the bottleneck on training sample quantity. Therefore, it is valuable to develop a data augmentation method that can generate high-quality synthetic samples to further improve the accuracy of prediction methods. In this work, we propose a novel generative adversarial networks-based method, namely FFPred-GAN, to accurately learn the high-dimensional distributions of protein sequence-based biophysical features and also generate high-quality synthetic protein feature samples. The experimental results suggest that the synthetic protein feature samples are successful in improving the prediction accuracy for all three domains of the Gene Ontology through augmentation of the original training protein feature samples.


2008 ◽  
Vol 9 (1) ◽  
Author(s):  
Aaron P Gabow ◽  
Sonia M Leach ◽  
William A Baumgartner ◽  
Lawrence E Hunter ◽  
Debra S Goldberg

Author(s):  
Dongliang Liu ◽  
Mengying Han ◽  
Yu Tian ◽  
Linlin Gong ◽  
Cancan Jia ◽  
...  

Abstract Summary Living cell strains have important applications in synthesizing their native compounds and potential for use in studies exploring the universal chemical space. Here, we present a web server named as Cell2Chem which accelerates the search for explored compounds in organisms, facilitating investigations of biosynthesis in unexplored chemical spaces. Cell2Chem uses co-occurrence networks and natural language processing to provide a systematic method for linking living organisms to biosynthesized compounds and the processes that produce these compounds. The Cell2Chem platform comprises 40 370 species and 125 212 compounds. Using reaction pathway and enzyme function in silico prediction methods, Cell2Chem reveals possible biosynthetic pathways of compounds and catalytic functions of proteins to expand unexplored biosynthetic chemical spaces. Cell2Chem can help improve biosynthesis research and enhance the efficiency of synthetic biology. Availability and implementation Cell2Chem is available at: http://www.rxnfinder.org/cell2chem/.


Sign in / Sign up

Export Citation Format

Share Document