Positome: A method for improving protein-protein interaction quality and prediction accuracy

Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidence. We postulated that by combining individual classifiers the accuracy of PPI prediction could be improved. We developed a method called protein-protein interaction prediction classifiers merger (PPCM), and this method combines output from two PPI prediction tools, GO2PPI and Phyloprof, using Random Forests algorithm. The performance of PPCM was tested by area under the curve (AUC) using an assembled Gold Standard database that contains both positive and negative PPI pairs. Our AUC test showed that PPCM significantly improved the PPI prediction accuracy over the corresponding individual classifiers. We found that additional classifiers incorporated into PPCM could lead to further improvement in the PPI prediction accuracy. Furthermore, cross species PPCM could achieve competitive and even better prediction accuracy compared to the single species PPCM. This study established a robust pipeline for PPI prediction by integrating multiple classifiers using Random Forests algorithm. This pipeline will be useful for predicting PPI in nonmodel species.

Download Full-text

Prediction Accuracy Evaluation of Domain and Domain Combination Based Prediction Methods for Protein-Protein Interaction

Interdisciplinary Bio Central ◽

10.4051/ibce.2009.2.0016 ◽

2006 ◽

Vol 1 (2) ◽

pp. 129-133

Author(s):

Dong-Soo Han ◽

Woo-Hyuk Jang ◽

Dong-Soo Han

Keyword(s):

Protein Interaction ◽

Prediction Accuracy ◽

Prediction Methods ◽

Accuracy Evaluation ◽

Domain Combination ◽

Protein Protein Interaction

Download Full-text

A Cross-Entropy-based Method for Essential Protein Identification in Yeast Protein–Protein Interaction Network

Current Bioinformatics ◽

10.2174/1574893615999201116210840 ◽

2020 ◽

Vol 15 ◽

Author(s):

Weimiao Sun ◽

Lei Wang ◽

Jiaxin Peng ◽

Zhen Zhang ◽

Tingrui Pei ◽

...

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Protein Interaction ◽

Prediction Accuracy ◽

Cross Entropy ◽

Expression Data ◽

Essential Proteins ◽

Protein Protein Interaction ◽

Simulation Results ◽

Initial Score

Background:: Research has shown that essential proteins play important roles in the development and survival of organisms. Because of the high costs of traditional biological experiments, several computational prediction methods based on known protein-protein interactions (PPIs) have been recently proposed to detect essential proteins. Objective:: Here, a novel prediction model called IoMCD is proposed to identify essential proteins by combining known PPIs with a variety of biological information about proteins, including gene expression data and homologous information of proteins. Methods:: Compared to the traditional state-of-the-art prediction models, IoMCD involves two kinds of weights that are obtained, respectively, by extracting topological features of proteins from the original known protein–protein interaction (PPI) networks and calculating the Pearson correlation coefficients (PCCs) between the gene expression data of proteins. Based on these two kinds of weights and adopting a cross-entropy method, a unique weight is assigned to each protein. Subsequently, the homologous information of proteins is used to calculate an initial score for each protein. Finally, based on the unique weights and initial score of proteins, an iterative method is designed to measure the essentialities of proteins. Results:: Intensive experiments were performed, and simulation results showed that the prediction accuracy of IoMCD, based on the dataset downloaded from the DIP and Gavin databases, was 92.16% and 89.71%, respectively, in the top 1% of the predicted essential proteins. Conclusion:: Both simulation results demonstrated that IoMCD can achieve excellent prediction accuracy and could be an effective method for essential protein prediction.

Download Full-text

Extraction of Protein-Protein Interaction from Scientific Articles by Predicting Dominant Keywords

BioMed Research International ◽

10.1155/2015/928531 ◽

2015 ◽

Vol 2015 ◽

pp. 1-13 ◽

Cited By ~ 4

Author(s):

Shun Koyabu ◽

Thi Thanh Thuy Phan ◽

Takenao Ohkawa

Keyword(s):

Protein Interaction ◽

Prediction Accuracy ◽

Protein Pair ◽

Training Data ◽

Learning Approach ◽

Automatic Extraction ◽

Evaluation Result ◽

Imbalanced Classification ◽

Protein Protein Interaction ◽

Machine Learning Approach

For the automatic extraction of protein-protein interaction information from scientific articles, a machine learning approach is useful. The classifier is generated from training data represented using several features to decide whether a protein pair in each sentence has an interaction. Such a specific keyword that is directly related to interaction as “bind” or “interact” plays an important role for training classifiers. We call it a dominant keyword that affects the capability of the classifier. Although it is important to identify the dominant keywords, whether a keyword is dominant depends on the context in which it occurs. Therefore, we propose a method for predicting whether a keyword is dominant for each instance. In this method, a keyword that derives imbalanced classification results is tentatively assumed to be a dominant keyword initially. Then the classifiers are separately trained from the instance with and without the assumed dominant keywords. The validity of the assumed dominant keyword is evaluated based on the classification results of the generated classifiers. The assumption is updated by the evaluation result. Repeating this process increases the prediction accuracy of the dominant keyword. Our experimental results using five corpora show the effectiveness of our proposed method with dominant keyword prediction.

Download Full-text

Protein protein interaction of p53 and p73 with YY1 in human hepatocellular carcinoma

Zeitschrift für Gastroenterologie ◽

10.1055/s-0030-1269561 ◽

2011 ◽

Vol 49 (01) ◽

Author(s):

LC Koenig ◽

C Sandig ◽

MH Bender ◽

W Stremmel ◽

M Müller-Schilling

Keyword(s):

Hepatocellular Carcinoma ◽

Protein Interaction ◽

Human Hepatocellular Carcinoma ◽

Protein Protein Interaction

Download Full-text

0513 Effect of γ radiation on physicochemical properties, protein-protein interaction, and microstructure of whey proteins

Journal of Animal Science ◽

10.2527/jam2016-0513 ◽

2016 ◽

Vol 94 (suppl_5) ◽

pp. 246-246

Author(s):

M. Guo ◽

X. Wang ◽

F. Lee ◽

J. Lv ◽

D. Zhang

Keyword(s):

Physicochemical Properties ◽

Protein Interaction ◽

Whey Proteins ◽

Γ Radiation ◽

Protein Protein Interaction

Download Full-text

Structure-based prediction of protein-protein interaction sites

10.31274/etd-180810-1200 ◽

2012 ◽

Author(s):

Rafael Armando Jordan

Keyword(s):

Protein Interaction ◽

Protein Protein Interaction ◽

Interaction Sites ◽

Protein Interaction Sites

Download Full-text

An efficient transient assays system using Agrobacterium-mediated transformation of onion (Allium cepa) epidermal cells

Indian Journal of Genetics and Plant Breeding (The) ◽

10.31742/ijgpb.80.3.17 ◽

2020 ◽

Vol 80 (03) ◽

Author(s):

Yu-Miao Zhang ◽

Jun Wang ◽

Tao Wu

Keyword(s):

Subcellular Localization ◽

Protein Interaction ◽

Protein Interactions ◽

Epidermal Cells ◽

Cyclin Dependent Kinase ◽

Protein Subcellular Localization ◽

Protein Protein Interactions ◽

Efficient System ◽

Protein Protein Interaction ◽

Onion Epidermal Cells

In this study, the Agrobacterium infection medium, infection duration, detergent, and cell density were optimized. The sorghum-based infection medium (SbIM), 10-20 min infection time, addition of 0.01% Silwet L-77, and Agrobacterium optical density at 600 nm (OD600), improved the competence of onion epidermal cells to support Agrobacterium infection at >90% efficiency. Cyclin-dependent kinase D-2 (CDKD-2) and cytochrome c-type biogenesis protein (CYCH), protein-protein interactions were localized. The optimized procedure is a quick and efficient system for examining protein subcellular localization and protein-protein interaction.

Download Full-text

Faculty Opinions recommendation of Analyzing yeast protein-protein interaction data obtained from different sources.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1010262.240599 ◽

2004 ◽

Author(s):

Golan Yona

Keyword(s):

Protein Interaction ◽

Protein Interaction Data ◽

Interaction Data ◽

Yeast Protein ◽

Protein Protein Interaction ◽

Different Sources

Download Full-text