Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences

Yanzhi Guo; Lezheng Yu; Zhining Wen; Menglong Li

doi:10.1093/nar/gkn159

Prediction of Protein-Protein Interactions Based on Molecular Interface Features and the Support Vector Machine

Current Bioinformatics ◽

10.2174/1574893611308010003 ◽

2013 ◽

Vol 8 (1) ◽

pp. 3-8 ◽

Cited By ~ 1

Author(s):

Weiqiang Zhou ◽

Hong Yan ◽

Xiaodan Fan ◽

Quan Hao

Keyword(s):

Support Vector Machine ◽

Protein Interactions ◽

Support Vector ◽

Protein Protein Interactions

Get full-text (via PubEx)

Prediction of Protein-Protein Interactions between HIV-1 and Human using Support Vector Machine Combined with Multivariate Mutual Information

2020 3rd International Conference on Biomedical Engineering (IBIOMED) ◽

10.1109/ibiomed50285.2020.9487598 ◽

2020 ◽

Author(s):

Mohamad Irlin Sunggawa ◽

Alhadi Bustamam ◽

Devvi Sarwinda ◽

Patuan Pangihutan Tampubolon ◽

Wibowo Mangunwardoyo

Keyword(s):

Support Vector Machine ◽

Mutual Information ◽

Protein Interactions ◽

Support Vector ◽

Protein Protein Interactions ◽

Multivariate Mutual Information ◽

Hiv 1

Get full-text (via PubEx)

Building and analysis of protein-protein interactions related to diabetes mellitus using support vector machine, biomedical text mining and network analysis

Computational Biology and Chemistry ◽

10.1016/j.compbiolchem.2016.09.011 ◽

2016 ◽

Vol 65 ◽

pp. 37-44 ◽

Cited By ~ 11

Author(s):

Renu Vyas ◽

Sanket Bapat ◽

Esha Jain ◽

Muthukumarasamy Karthikeyan ◽

Sanjeev Tambe ◽

...

Keyword(s):

Diabetes Mellitus ◽

Support Vector Machine ◽

Network Analysis ◽

Text Mining ◽

Protein Interactions ◽

Support Vector ◽

Biomedical Text ◽

Biomedical Text Mining ◽

Protein Protein Interactions

Get full-text (via PubEx)

Selecting Negative Samples for PPI Prediction Using Hierarchical Clustering Methodology

Journal of Applied Mathematics ◽

10.1155/2012/897289 ◽

2012 ◽

Vol 2012 ◽

pp. 1-23

Author(s):

J. M. Urquiza ◽

I. Rojas ◽

H. Pomares ◽

J. Herrera ◽

J. P. Florido ◽

...

Keyword(s):

Support Vector Machine ◽

Hierarchical Clustering ◽

Protein Interactions ◽

Support Vector Machine Model ◽

Support Vector ◽

Protein Protein Interactions ◽

Machine Model ◽

Cellular Processes ◽

Parallel Feature ◽

Ppi Prediction

Protein-protein interactions (PPIs) play a crucial role in cellular processes. In the present work, a new approach is proposed to construct a PPI predictor training a support vector machine model through a mutual information filter-wrapper parallel feature selection algorithm and an iterative and hierarchical clustering to select a relevance negative training set. By means of a selected suboptimum set of features, the constructed support vector machine model is able to classify PPIs with high accuracy in any positive and negative datasets.

Get full-text (via PubEx)

PPI_SVM: Prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables

Cellular & Molecular Biology Letters ◽

10.2478/s11658-011-0008-x ◽

2011 ◽

Vol 16 (2) ◽

Cited By ~ 41

Author(s):

Piyali Chatterjee ◽

Subhadip Basu ◽

Mahantapas Kundu ◽

Mita Nasipuri ◽

Dariusz Plewczynski

Keyword(s):

Machine Learning ◽

Protein Interactions ◽

Three Dimensional ◽

Prediction Method ◽

Protein Sequences ◽

Dimensional Structure ◽

Support Vector ◽

Interacting Proteins ◽

Protein Protein Interactions ◽

Protein Functions

AbstractProtein-protein interactions (PPI) control most of the biological processes in a living cell. In order to fully understand protein functions, a knowledge of protein-protein interactions is necessary. Prediction of PPI is challenging, especially when the three-dimensional structure of interacting partners is not known. Recently, a novel prediction method was proposed by exploiting physical interactions of constituent domains. We propose here a novel knowledge-based prediction method, namely PPI_SVM, which predicts interactions between two protein sequences by exploiting their domain information. We trained a two-class support vector machine on the benchmarking set of pairs of interacting proteins extracted from the Database of Interacting Proteins (DIP). The method considers all possible combinations of constituent domains between two protein sequences, unlike most of the existing approaches. Moreover, it deals with both single-domain proteins and multi domain proteins; therefore it can be applied to the whole proteome in high-throughput studies. Our machine learning classifier, following a brainstorming approach, achieves accuracy of 86%, with specificity of 95%, and sensitivity of 75%, which are better results than most previous methods that sacrifice recall values in order to boost the overall precision. Our method has on average better sensitivity combined with good selectivity on the benchmarking dataset. The PPI_SVM source code, train/test datasets and supplementary files are available freely in the public domain at: http://code.google.com/p/cmater-bioinfo/.

Get full-text (via PubEx)

Effect of training datasets on support vector machine prediction of protein-protein interactions

PROTEOMICS ◽

10.1002/pmic.200401118 ◽

2005 ◽

Vol 5 (4) ◽

pp. 876-884 ◽

Cited By ~ 51

Author(s):

Siaw Ling Lo ◽

Cong Zhong Cai ◽

Yu Zong Chen ◽

Maxey C. M. Chung

Keyword(s):

Support Vector Machine ◽

Protein Interactions ◽

Support Vector ◽

Protein Protein Interactions

Get full-text (via PubEx)

Inferring Protein-Protein Interactions Using a Hybrid Genetic Algorithm/Support Vector Machine Method

Protein and Peptide Letters ◽

10.2174/092986610791760379 ◽

2010 ◽

Vol 17 (9) ◽

pp. 1079-1084 ◽

Cited By ~ 5

Author(s):

Bing Wang ◽

Peng Chen ◽

Jun Zhang ◽

Guangxin Zhao ◽

Xiang Zhang

Keyword(s):

Genetic Algorithm ◽

Support Vector Machine ◽

Protein Interactions ◽

Hybrid Genetic Algorithm ◽

Support Vector ◽

Protein Protein Interactions ◽

Machine Method ◽

Support Vector Machine Method

Get full-text (via PubEx)

FWHT-RF: A Novel Computational Approach to Predict Plant Protein-Protein Interactions via an Ensemble Learning Method

Scientific Programming ◽

10.1155/2021/1607946 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Jie Pan ◽

Li-Ping Li ◽

Chang-Qing Yu ◽

Zhu-Hong You ◽

Zhong-Hao Ren ◽

...

Keyword(s):

Protein Interactions ◽

Nearest Neighbor ◽

Protein Sequences ◽

Evolutionary Information ◽

Support Vector ◽

Protein Protein Interactions ◽

K Nearest Neighbor ◽

Novel Approach ◽

Knn Classifier ◽

Scoring Matrix

Protein-protein interactions (PPIs) in plants are crucial for understanding biological processes. Although high-throughput techniques produced valuable information to identify PPIs in plants, they are usually expensive, inefficient, and extremely time-consuming. Hence, there is an urgent need to develop novel computational methods to predict PPIs in plants. In this article, we proposed a novel approach to predict PPIs in plants only using the information of protein sequences. Specifically, plants’ protein sequences are first converted as position-specific scoring matrix (PSSM); then, the fast Walsh–Hadamard transform (FWHT) algorithm is used to extract feature vectors from PSSM to obtain evolutionary information of plant proteins. Lastly, the rotation forest (RF) classifier is trained for prediction and produced a series of evaluation results. In this work, we named this approach FWHT-RF because FWHT and RF are used for feature extraction and classification, respectively. When applying FWHT-RF on three plants’ PPI datasets Maize, Rice, and Arabidopsis thaliana (Arabidopsis), the average accuracies of FWHT-RF using 5-fold cross validation were achieved as high as 95.20%, 94.42%, and 83.85%, respectively. To further evaluate the predictive power of FWHT-RF, we compared it with the state-of-art support vector machine (SVM) and K-nearest neighbor (KNN) classifier in different aspects. The experimental results demonstrated that FWHT-RF can be a useful supplementary method to predict potential PPIs in plants.

Get full-text (via PubEx)

2P2I HUNTER : a tool for filtering orthosteric protein–protein interaction modulators via a dedicated support vector machine

Journal of The Royal Society Interface ◽

10.1098/rsif.2013.0860 ◽

2014 ◽

Vol 11 (90) ◽

pp. 20130860 ◽

Cited By ~ 25

Author(s):

Véronique Hamon ◽

Raphael Bourgeas ◽

Pierre Ducrot ◽

Isabelle Theret ◽

Laura Xuereb ◽

...

Keyword(s):

Support Vector Machine ◽

Protein Interactions ◽

High Throughput Screening ◽

Chemical Space ◽

Support Vector ◽

Protein Protein Interactions ◽

Target Class ◽

Chemical Library ◽

Protein Protein Interaction ◽

Svm Model

Over the last 10 years, protein–protein interactions (PPIs) have shown increasing potential as new therapeutic targets. As a consequence, PPIs are today the most screened target class in high-throughput screening (HTS). The development of broad chemical libraries dedicated to these particular targets is essential; however, the chemical space associated with this ‘high-hanging fruit’ is still under debate. Here, we analyse the properties of 40 non-redundant small molecules present in the 2P2I database ( http://2p2idb.cnrs-mrs.fr/ ) to define a general profile of orthosteric inhibitors and propose an original protocol to filter general screening libraries using a support vector machine (SVM) with 11 standard D ragon molecular descriptors. The filtering protocol has been validated using external datasets from PubChem BioAssay and results from in-house screening campaigns . This external blind validation demonstrated the ability of the SVM model to reduce the size of the filtered chemical library by eliminating up to 96% of the compounds as well as enhancing the proportion of active compounds by up to a factor of 8. We believe that the resulting chemical space identified in this paper will provide the scientific community with a concrete support to search for PPI inhibitors during HTS campaigns.

Get full-text (via PubEx)

Combining protein-protein interactions information with support vector machine to identify chronic obstructive pulmonary disease related genes

Molecular Biology ◽

10.1134/s0026893314020101 ◽

2014 ◽

Vol 48 (2) ◽

pp. 287-296 ◽

Cited By ~ 1

Author(s):

Lin Hua ◽

Ping Zhou

Keyword(s):

Chronic Obstructive Pulmonary Disease ◽

Support Vector Machine ◽

Pulmonary Disease ◽

Protein Interactions ◽

Support Vector ◽

Chronic Obstructive ◽

Protein Protein Interactions ◽

Obstructive Pulmonary Disease ◽

Disease Related Genes

Get full-text (via PubEx)