NewDrosophilaLong-Term Memory Genes Revealed by Assessing Computational Function Prediction Methods

Protein function prediction is a complex multiclass multilabel classification problem, characterized by multiple issues such as the incompleteness of the available annotations, the integration of multiple sources of high dimensional biomolecular data, the unbalance of several functional classes, and the difficulty of univocally determining negative examples. Moreover, the hierarchical relationships between functional classes that characterize both the Gene Ontology and FunCat taxonomies motivate the development of hierarchy-aware prediction methods that showed significantly better performances than hierarchical-unaware “flat” prediction methods. In this paper, we provide a comprehensive review of hierarchical methods for protein function prediction based on ensembles of learning machines. According to this general approach, a separate learning machine is trained to learn a specific functional term and then the resulting predictions are assembled in a “consensus” ensemble decision, taking into account the hierarchical relationships between classes. The main hierarchical ensemble methods proposed in the literature are discussed in the context of existing computational methods for protein function prediction, highlighting their characteristics, advantages, and limitations. Open problems of this exciting research area of computational biology are finally considered, outlining novel perspectives for future research.

Download Full-text

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Genome Biology ◽

10.1186/s13059-016-1037-6 ◽

2016 ◽

Vol 17 (1) ◽

Cited By ~ 180

Author(s):

Yuxiang Jiang ◽

Tal Ronnen Oron ◽

Wyatt T. Clark ◽

Asma R. Bankapur ◽

Daniel D’Andrea ◽

...

Keyword(s):

Protein Function ◽

Protein Function Prediction ◽

Function Prediction ◽

Prediction Methods

Download Full-text

Community-Wide Evaluation of Computational Function Prediction

Methods in Molecular Biology - The Gene Ontology Handbook ◽

10.1007/978-1-4939-3743-1_10 ◽

2016 ◽

pp. 133-146 ◽

Cited By ~ 7

Author(s):

Iddo Friedberg ◽

Predrag Radivojac

Keyword(s):

Function Prediction ◽

Computational Function

Download Full-text

Extensive complementarity between gene function prediction methods

Bioinformatics ◽

10.1093/bioinformatics/btw532 ◽

2016 ◽

pp. btw532 ◽

Cited By ~ 2

Author(s):

Vedrana Vidulin ◽

Tomislav Šmuc ◽

Fran Supek

Keyword(s):

Gene Function ◽

Function Prediction ◽

Prediction Methods ◽

Gene Function Prediction

Download Full-text

Improving protein function prediction with synthetic feature samples created by generative adversarial networks

10.1101/730143 ◽

2019 ◽

Author(s):

Cen Wan ◽

David T. Jones

Keyword(s):

Protein Function ◽

Data Augmentation ◽

Protein Function Prediction ◽

Function Prediction ◽

Generative Adversarial Networks ◽

Prediction Methods ◽

High Quality ◽

Adversarial Networks ◽

Synthetic Protein ◽

Protein Feature

AbstractProtein function prediction is a challenging but important task in bioinformatics. Many prediction methods have been developed, but are still limited by the bottleneck on training sample quantity. Therefore, it is valuable to develop a data augmentation method that can generate high-quality synthetic samples to further improve the accuracy of prediction methods. In this work, we propose a novel generative adversarial networks-based method, namely FFPred-GAN, to accurately learn the high-dimensional distributions of protein sequence-based biophysical features and also generate high-quality synthetic protein feature samples. The experimental results suggest that the synthetic protein feature samples are successful in improving the prediction accuracy for all three domains of the Gene Ontology through augmentation of the original training protein feature samples.

Download Full-text

Enhanced Sequence-Based Function Prediction Methods and Application to Functional Similarity Networks

Protein Function Prediction for Omics Era ◽

10.1007/978-94-007-0881-5_2 ◽

2011 ◽

pp. 19-34

Author(s):

Meghana Chitale ◽

Daisuke Kihara

Keyword(s):

Function Prediction ◽

Functional Similarity ◽

Prediction Methods ◽

Similarity Networks

Download Full-text

Improving protein function prediction methods with integrated literature data

BMC Bioinformatics ◽

10.1186/1471-2105-9-198 ◽

2008 ◽

Vol 9 (1) ◽

Cited By ~ 19

Author(s):

Aaron P Gabow ◽

Sonia M Leach ◽

William A Baumgartner ◽

Lawrence E Hunter ◽

Debra S Goldberg

Keyword(s):

Protein Function ◽

Protein Function Prediction ◽

Function Prediction ◽

Prediction Methods

Download Full-text

Leveraging structure for enzyme function prediction: methods, opportunities, and challenges

Trends in Biochemical Sciences ◽

10.1016/j.tibs.2014.05.006 ◽

2014 ◽

Vol 39 (8) ◽

pp. 363-371 ◽

Cited By ~ 23

Author(s):

Matthew P. Jacobson ◽

Chakrapani Kalyanaraman ◽

Suwen Zhao ◽

Boxue Tian

Keyword(s):

Function Prediction ◽

Enzyme Function ◽

Prediction Methods

Download Full-text

Towards region-specific propagation of protein functions

10.1101/275487 ◽

2018 ◽

Author(s):

Da Chen Emily Koo ◽

Richard Bonneau

Keyword(s):

Protein Level ◽

Protein Function ◽

Predictive Performance ◽

Function Prediction ◽

Amino Acid Sequences ◽

Prediction Methods ◽

Functional Evaluation ◽

Domain Family ◽

Starting Point ◽

Go Terms

AbstractMotivationDue to the nature of experimental annotation, most protein function prediction methods operate at the protein-level, where functions are assigned to full-length proteins based on overall similarities. However, most proteins function by interacting with other proteins or molecules, and many functional associations should be limited to specific regions rather than the entire protein length. Most domain-centric function prediction methods depend on accurate domain family assignments to infer relationships between domains and functions, with regions that are unassigned to a known domain-family left out of functional evaluation. Given the abundance of residue-level annotations currently available, we present a function prediction methodology that automatically infers function labels of specific protein regions using protein-level annotations and multiple types of region-specific features.ResultsWe apply this method to local features obtained from InterPro, UniProtKB and amino acid sequences and show that this method improves both the accuracy and region-specificity of protein function transfer and prediction by testing on both human and yeast proteomes. We compare region-level predictive performance of our method against that of a whole-protein baseline method using a held-out dataset of proteins with structurally-verified binding sites and also compare protein-level temporal holdout predictive performances to expand the variety and specificity of GO terms we could evaluate. Our results can also serve as a starting point to categorize GO terms into site-specific and whole-protein terms and select prediction methods for different classes of GO terms.AvailabilityThe code is freely available at: https://github.com/ek1203/region_spec_func_pred

Download Full-text

Assessment of predicting hourly global solar radiation in Jordan based on Rules, Trees, Meta, Lazy and Function prediction methods

Sustainable Energy Technologies and Assessments ◽

10.1016/j.seta.2020.100923 ◽

2021 ◽

pp. 100923

Author(s):

Nadia Al-Rousan ◽

Hazem Al-Najjar ◽

Osama Alomari

Keyword(s):

Solar Radiation ◽

Global Solar Radiation ◽

Function Prediction ◽

Prediction Methods ◽

And Function

Download Full-text