A Survey of Computational Intelligence Techniques in Protein Function Prediction

During the past, there was a massive growth of knowledge of unknown proteins with the advancement of high throughput microarray technologies. Protein function prediction is the most challenging problem in bioinformatics. In the past, the homology based approaches were used to predict the protein function, but they failed when a new protein was different from the previous one. Therefore, to alleviate the problems associated with homology based traditional approaches, numerous computational intelligence techniques have been proposed in the recent past. This paper presents a state-of-the-art comprehensive review of various computational intelligence techniques for protein function predictions using sequence, structure, protein-protein interaction network, and gene expression data used in wide areas of applications such as prediction of DNA and RNA binding sites, subcellular localization, enzyme functions, signal peptides, catalytic residues, nuclear/G-protein coupled receptors, membrane proteins, and pathway analysis from gene expression datasets. This paper also summarizes the result obtained by many researchers to solve these problems by using computational intelligence techniques with appropriate datasets to improve the prediction performance. The summary shows that ensemble classifiers and integration of multiple heterogeneous data are useful for protein function prediction.

Download Full-text

De-novo protein function prediction using DNA binding and RNA binding proteins as a test case

Nature Communications ◽

10.1038/ncomms13424 ◽

2016 ◽

Vol 7 (1) ◽

Cited By ~ 9

Author(s):

Sapir Peled ◽

Olga Leiderman ◽

Rotem Charar ◽

Gilat Efroni ◽

Yaron Shav-Tal ◽

...

Keyword(s):

Dna Binding ◽

Protein Function ◽

Binding Proteins ◽

Rna Binding ◽

De Novo ◽

Rna Binding Proteins ◽

Protein Function Prediction ◽

Function Prediction ◽

Test Case

Download Full-text

Protein Function Prediction by Clustering of Protein-Protein Interaction Network

Advances in Intelligent and Soft Computing - ICT Innovations 2011 ◽

10.1007/978-3-642-28664-3_4 ◽

2012 ◽

pp. 39-49 ◽

Cited By ~ 1

Author(s):

Ivana Cingovska ◽

Aleksandra Bogojeska ◽

Kire Trivodaliev ◽

Slobodan Kalajdziski

Keyword(s):

Protein Interaction ◽

Protein Interaction Network ◽

Protein Function ◽

Protein Function Prediction ◽

Interaction Network ◽

Function Prediction ◽

Protein Protein Interaction ◽

Protein Protein Interaction Network

Download Full-text

Protein function prediction using neighbor relativity in protein–protein interaction network

Computational Biology and Chemistry ◽

10.1016/j.compbiolchem.2012.12.003 ◽

2013 ◽

Vol 43 ◽

pp. 11-16 ◽

Cited By ~ 16

Author(s):

Sobhan Moosavi ◽

Masoud Rahgozar ◽

Amir Rahimi

Keyword(s):

Protein Interaction ◽

Protein Interaction Network ◽

Protein Function ◽

Protein Function Prediction ◽

Interaction Network ◽

Function Prediction ◽

Protein Protein Interaction ◽

Protein Protein Interaction Network

Download Full-text

Peer Review #2 of "FunPred 3.0: improved protein function prediction using protein interaction network (v0.3)"

10.7287/peerj.6830v0.3/reviews/2 ◽

2019 ◽

Keyword(s):

Peer Review ◽

Protein Interaction ◽

Protein Interaction Network ◽

Protein Function ◽

Protein Function Prediction ◽

Interaction Network ◽

Function Prediction

Download Full-text

FunPred 3.0: improved protein function prediction using protein interaction network

PeerJ ◽

10.7717/peerj.6830 ◽

2019 ◽

Vol 7 ◽

pp. e6830 ◽

Cited By ~ 1

Author(s):

Sovan Saha ◽

Piyali Chatterjee ◽

Subhadip Basu ◽

Mita Nasipuri ◽

Dariusz Plewczynski

Keyword(s):

Protein Interaction ◽

Protein Interactions ◽

Protein Interaction Network ◽

Protein Function ◽

Protein Function Prediction ◽

Experimental Studies ◽

Interaction Network ◽

Function Prediction ◽

The Self ◽

Functional Annotations

Proteins are the most versatile macromolecules in living systems and perform crucial biological functions. In the advent of the post-genomic era, the next generation sequencing is done routinely at the population scale for a variety of species. The challenging problem is to massively determine the functions of proteins that are yet not characterized by detailed experimental studies. Identification of protein functions experimentally is a laborious and time-consuming task involving many resources. We therefore propose the automated protein function prediction methodology using in silico algorithms trained on carefully curated experimental datasets. We present the improved protein function prediction tool FunPred 3.0, an extended version of our previous methodology FunPred 2, which exploits neighborhood properties in protein–protein interaction network (PPIN) and physicochemical properties of amino acids. Our method is validated using the available functional annotations in the PPIN network of Saccharomyces cerevisiae in the latest Munich information center for protein (MIPS) dataset. The PPIN data of S. cerevisiae in MIPS dataset includes 4,554 unique proteins in 13,528 protein–protein interactions after the elimination of the self-replicating and the self-interacting protein pairs. Using the developed FunPred 3.0 tool, we are able to achieve the mean precision, the recall and the F-score values of 0.55, 0.82 and 0.66, respectively. FunPred 3.0 is then used to predict the functions of unpredicted protein pairs (incomplete and missing functional annotations) in MIPS dataset of S. cerevisiae. The method is also capable of predicting the subcellular localization of proteins along with its corresponding functions. The code and the complete prediction results are available freely at: https://github.com/SovanSaha/FunPred-3.0.git.

Download Full-text

Peer Review #2 of "FunPred 3.0: improved protein function prediction using protein interaction network (v0.4)"

10.7287/peerj.6830v0.4/reviews/2 ◽

2019 ◽

Keyword(s):

Peer Review ◽

Protein Interaction ◽

Protein Interaction Network ◽

Protein Function ◽

Protein Function Prediction ◽

Interaction Network ◽

Function Prediction

Download Full-text

FunPred-1: Protein function prediction from a protein interaction network using neighborhood analysis

Cellular & Molecular Biology Letters ◽

10.2478/s11658-014-0221-5 ◽

2014 ◽

Vol 19 (4) ◽

Cited By ~ 14

Author(s):

Sovan Saha ◽

Piyali Chatterjee ◽

Subhadip Basu ◽

Mahantapas Kundu ◽

Mita Nasipuri

Keyword(s):

Protein Interaction ◽

Protein Interaction Network ◽

Protein Function ◽

Sequence Data ◽

Biological Activities ◽

Protein Function Prediction ◽

Interaction Network ◽

Search Space ◽

Function Prediction ◽

Clustering Coefficient

AbstractProteins are responsible for all biological activities in living organisms. Thanks to genome sequencing projects, large amounts of DNA and protein sequence data are now available, but the biological functions of many proteins are still not annotated in most cases. The unknown function of such non-annotated proteins may be inferred or deduced from their neighbors in a protein interaction network. In this paper, we propose two new methods to predict protein functions based on network neighborhood properties. FunPred 1.1 uses a combination of three simple-yet-effective scoring techniques: the neighborhood ratio, the protein path connectivity and the relative functional similarity. FunPred 1.2 applies a heuristic approach using the edge clustering coefficient to reduce the search space by identifying densely connected neighborhood regions. The overall accuracy achieved in FunPred 1.2 over 8 functional groups involving hetero-interactions in 650 yeast proteins is around 87%, which is higher than the accuracy with FunPred 1.1. It is also higher than the accuracy of many of the state-of-the-art protein function prediction methods described in the literature. The test datasets and the complete source code of the developed software are now freely available at http://code.google.com/p/cmaterbioinfo/.

Download Full-text