How Far Are We from the Completion of the Human Protein Interactome Reconstruction?

Georgios N. Dimitrakopoulos; Maria I. Klapa; Nicholas K. Moschonas

doi:10.3390/biom12010140

How Far Are We from the Completion of the Human Protein Interactome Reconstruction?

Biomolecules ◽

10.3390/biom12010140 ◽

2022 ◽

Vol 12 (1) ◽

pp. 140

Author(s):

Georgios N. Dimitrakopoulos ◽

Maria I. Klapa ◽

Nicholas K. Moschonas

Keyword(s):

High Throughput ◽

Clustering Algorithms ◽

Human Protein ◽

Ppi Network ◽

Medicine Research ◽

Protein Protein Interaction ◽

Protein Interactome ◽

Addition Rate ◽

High Throughput Experiments ◽

Genome Scale

After more than fifteen years from the first high-throughput experiments for human protein–protein interaction (PPI) detection, we are still wondering how close the completion of the genome-scale human PPI network reconstruction is, what needs to be further explored and whether the biological insights gained from the holistic investigation of the current network are valid and useful. The unique structure of PICKLE, a meta-database of the human experimentally determined direct PPI network developed by our group, presently covering ~80% of the UniProtKB/Swiss-Prot reviewed human complete proteome, enables the evaluation of the interactome expansion by comparing the successive PICKLE releases since 2013. We observe a gradual overall increase of 39%, 182%, and 67% in protein nodes, PPIs, and supporting references, respectively. Our results indicate that, in recent years, (a) the PPI addition rate has decreased, (b) the new PPIs are largely determined by high-throughput experiments and mainly concern existing protein nodes and (c), as we had predicted earlier, most of the newly added protein nodes have a low degree. These observations, combined with a largely overlapping k-core between PICKLE releases and a network density increase, imply that an almost complete picture of a structurally defined network has been reached. The comparative unsupervised application of two clustering algorithms indicated that exploring the full interactome topology can reveal the protein neighborhoods involved in closely related biological processes as transcriptional regulation, cell signaling and multiprotein complexes such as the connexon complex associated with cancers. A well-reconstructed human protein interactome is a powerful tool in network biology and medicine research forming the basis for multi-omic and dynamic analyses.

Get full-text (via PubEx)

INFERRING PROTEIN-PROTEIN INTERACTIONS FROM MESSENGER RNA EXPRESSION PROFILES WITH SVM

Journal of Biological System ◽

10.1142/s0218339005001525 ◽

2005 ◽

Vol 13 (03) ◽

pp. 287-298 ◽

Cited By ~ 1

Author(s):

JUN CAI ◽

YING HUANG ◽

LIANG JI ◽

YANDA LI

Keyword(s):

High Throughput ◽

Protein Interactions ◽

Messenger Rna ◽

Expression Profiles ◽

Support Vector ◽

Svm Classifier ◽

Good Prediction ◽

Protein Protein Interactions ◽

Protein Protein Interaction ◽

High Throughput Experiments

In post-genomic biology, researchers in the field of proteome focus their attention on the networks of protein interactions that control the lives of cells and organisms. Protein-protein interactions play a useful role in dynamic cellular machinery. In this paper, we developed a method to infer protein-protein interactions based on the theory of support vector machine (SVM). For a given pair of proteins, a new strategy of calculating cross-correlation function of mRNA expression profiles was used to encode SVM vectors. We compared the performance with other methods of inferring protein-protein interaction. Results suggested that, through five-fold cross validation, our SVM model achieved a good prediction. It enables us to show that expression profiles in transcription level can be used to distinguish physical or functional interactions of proteins as well as sequence contents. Lastly, we applied our SVM classifier to evaluate data quality of interaction data sets from four high-throughput experiments. The results show that high-throughput experiments sacrifice some accuracy in determination of interactions because of limitation of experiment technologies.

Get full-text (via PubEx)

Integrative analysis of human omics data using biomolecular networks

Molecular BioSystems ◽

10.1039/c6mb00476h ◽

2016 ◽

Vol 12 (10) ◽

pp. 2953-2964 ◽

Cited By ~ 26

Author(s):

Jonathan L. Robinson ◽

Jens Nielsen

Keyword(s):

High Throughput ◽

Protein Interaction ◽

Protein Interaction Networks ◽

Interaction Networks ◽

Omics Data ◽

Protein Protein Interaction ◽

Biomolecular Networks ◽

New Information ◽

Protein Protein Interaction Networks ◽

Genome Scale

Biomolecular networks, such as genome-scale metabolic models and protein–protein interaction networks, facilitate the extraction of new information from high-throughput omics data.

Get full-text (via PubEx)

From the static interactome to dynamic protein complexes: Three challenges

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720015710018 ◽

2015 ◽

Vol 13 (02) ◽

pp. 1571001 ◽

Cited By ~ 14

Author(s):

Chern Han Yong ◽

Limsoon Wong

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Clustering Algorithms ◽

Ppi Network ◽

Protein Protein Interaction ◽

Static Interaction ◽

Ppi Networks ◽

Interaction Screening ◽

Discovery Algorithms ◽

Insight Into

Protein interactions and complexes behave in a dynamic fashion, but this dynamism is not captured by interaction screening technologies, and not preserved in protein–protein interaction (PPI) networks. The analysis of static interaction data to derive dynamic protein complexes leads to several challenges, of which we identify three. First, many proteins participate in multiple complexes, leading to overlapping complexes embedded within highly-connected regions of the PPI network. This makes it difficult to accurately delimit the boundaries of such complexes. Second, many condition- and location-specific PPIs are not detected, leading to sparsely-connected complexes that cannot be picked out by clustering algorithms. Third, the majority of complexes are small complexes (made up of two or three proteins), which are extra sensitive to the effects of extraneous edges and missing co-complex edges. We show that many existing complex-discovery algorithms have trouble predicting such complexes, and show that our insight into the disparity between the static interactome and dynamic protein complexes can be used to improve the performance of complex discovery.

Get full-text (via PubEx)

Protein Complex Discovery by Interaction Filtering from Protein Interaction Networks Using Mutual Rank Coexpression and Sequence Similarity

BioMed Research International ◽

10.1155/2015/165186 ◽

2015 ◽

Vol 2015 ◽

pp. 1-7 ◽

Cited By ~ 1

Author(s):

Ali Kazemi-Pour ◽

Bahram Goliaei ◽

Hamid Pezeshk

Keyword(s):

Protein Interaction ◽

Biological Networks ◽

Sequence Similarity ◽

Clustering Algorithms ◽

Edge Weight ◽

Clustering Methods ◽

Ppi Network ◽

Protein Protein Interaction ◽

New Methods ◽

Mutual Rank

The evaluation of the biological networks is considered the essential key to understanding the complex biological systems. Meanwhile, the graph clustering algorithms are mostly used in the protein-protein interaction (PPI) network analysis. The complexes introduced by the clustering algorithms include noise proteins. The error rate of the noise proteins in the PPI network researches is about 40–90%. However, only 30–40% of the existing interactions in the PPI databases depend on the specific biological function. It is essential to eliminate the noise proteins and the interactions from the complexes created via clustering methods. We have introduced new methods of weighting interactions in protein clusters and the splicing of noise interactions and proteins-based interactions on their weights. The coexpression and the sequence similarity of each pair of proteins are considered the edge weight of the proteins in the network. The results showed that the edge filtering based on the amount of coexpression acts similar to the node filtering via graph-based characteristics. Regarding the removal of the noise edges, the edge filtering has a significant advantage over the graph-based method. The edge filtering based on the amount of sequence similarity has the ability to remove the noise proteins and the noise interactions.

Get full-text (via PubEx)

idenPC-MIIP: identify protein complexes from weighted PPI networks using mutual important interacting partner relation

Briefings in Bioinformatics ◽

10.1093/bib/bbaa016 ◽

2020 ◽

Cited By ~ 1

Author(s):

Zhourun Wu ◽

Qing Liao ◽

Bin Liu

Keyword(s):

High Throughput ◽

State Of The Art ◽

Protein Complexes ◽

Cell System ◽

Protein Protein Interaction ◽

The Past ◽

Ppi Networks ◽

A Cell ◽

Relationship Of ◽

Genome Scale

Abstract Protein complexes are key units for studying a cell system. During the past decades, the genome-scale protein–protein interaction (PPI) data have been determined by high-throughput approaches, which enables the identification of protein complexes from PPI networks. However, the high-throughput approaches often produce considerable fraction of false positive and negative samples. In this study, we propose the mutual important interacting partner relation to reflect the co-complex relationship of two proteins based on their interaction neighborhoods. In addition, a new algorithm called idenPC-MIIP is developed to identify protein complexes from weighted PPI networks. The experimental results on two widely used datasets show that idenPC-MIIP outperforms 17 state-of-the-art methods, especially for identification of small protein complexes with only two or three proteins.

Get full-text (via PubEx)

Integrative COVID-19 Biological Network Inference with Probabilistic Core Decomposition

10.1101/2021.06.23.449535 ◽

2021 ◽

Author(s):

Yang Guo ◽

Fatemeh Esfahani ◽

Xiaojian Shao ◽

Venkatesh Srinivasan ◽

Alex Thomo ◽

...

Keyword(s):

Protein Interactions ◽

Network Inference ◽

Drug Repurposing ◽

Enrichment Analysis ◽

Human Protein ◽

Ppi Network ◽

Network Nodes ◽

Protein Protein Interaction ◽

Function Enrichment Analysis ◽

Encoding Genes

The SARS-CoV-2 coronavirus is responsible for millions of deaths around the world. To help contribute to the understanding of crucial knowledge and to further generate new hypotheses relevant to SARS-CoV-2 and human protein interactions, we make use of the information abundant Biomine probabilistic database and extend the experimentally identified SARS-CoV-2-human protein-protein interaction (PPI) network in silico. We generate an extended network by integrating information from the Biomine database and the PPI network. To generate novel hypotheses, we focus on the high-connectivity sub-communities that overlap most with the PPI network in the extended network. Therefore, we propose a new data analysis pipeline that can efficiently compute core decomposition on the extended network and identify dense subgraphs. We then evaluate the identified dense subgraph and the generated hypotheses in three contexts: literature validation for uncovered virus targeting genes and proteins, gene function enrichment analysis on subgraphs, and literature support on drug repurposing for identified tissues and diseases related to COVID-19. The majority types of the generated hypotheses are proteins with their encoding genes and we rank them by sorting their connections to known PPI network nodes. In addition, we compile a comprehensive list of novel genes, and proteins potentially related to COVID-19, as well as novel diseases which might be comorbidities. Together with the generated hypotheses, our results provide novel knowledge relevant to COVID-19 for further validation.

Get full-text (via PubEx)

IHP-PING—generating integrated human protein–protein interaction networks on-the-fly

Briefings in Bioinformatics ◽

10.1093/bib/bbaa277 ◽

2020 ◽

Author(s):

Gaston K Mazandu ◽

Christopher Hooper ◽

Kenneth Opap ◽

Funmilayo Makinde ◽

Victoria Nembaware ◽

...

Keyword(s):

Protein Interaction ◽

Protein Interactions ◽

High Throughput Sequencing ◽

Current Knowledge ◽

Interaction Network ◽

Human Protein ◽

Online Resources ◽

Ppi Network ◽

Genomic Context ◽

Protein Protein Interaction

Abstract Advances in high-throughput sequencing technologies have resulted in an exponential growth of publicly accessible biological datasets. In the ‘big data’ driven ‘post-genomic’ context, much work is being done to explore human protein–protein interactions (PPIs) for a systems level based analysis to uncover useful signals and gain more insights to advance current knowledge and answer specific biological and health questions. These PPIs are experimentally or computationally predicted, stored in different online databases and some of PPI resources are updated regularly. As with many biological datasets, such regular updates continuously render older PPI datasets potentially outdated. Moreover, while many of these interactions are shared between these online resources, each resource includes its own identified PPIs and none of these databases exhaustively contains all existing human PPI maps. In this context, it is essential to enable the integration of or combining interaction datasets from different resources, to generate a PPI map with increased coverage and confidence. To allow researchers to produce an integrated human PPI datasets in real-time, we introduce the integrated human protein–protein interaction network generator (IHP-PING) tool. IHP-PING is a flexible python package which generates a human PPI network from freely available online resources. This tool extracts and integrates heterogeneous PPI datasets to generate a unified PPI network, which is stored locally for further applications.

Get full-text (via PubEx)

Statistical Approaches for the Construction and Interpretation of Human Protein-Protein Interaction Network

BioMed Research International ◽

10.1155/2016/5313050 ◽

2016 ◽

Vol 2016 ◽

pp. 1-7 ◽

Cited By ~ 7

Author(s):

Yang Hu ◽

Ying Zhang ◽

Jun Ren ◽

Yadong Wang ◽

Zhenzhen Wang ◽

...

Keyword(s):

Experimental Data ◽

Protein Interaction ◽

Protein Interaction Network ◽

Latent Variable ◽

Interaction Network ◽

Human Protein ◽

Ppi Network ◽

Confidence Measure ◽

Protein Protein Interaction ◽

Protein Protein Interaction Network

The overall goal is to establish a reliable human protein-protein interaction network and develop computational tools to characterize a protein-protein interaction (PPI) network and the role of individual proteins in the context of the network topology and their expression status. A novel and unique feature of our approach is that we assigned confidence measure to each derived interacting pair and account for the confidence in our network analysis. We integrated experimental data to infer human PPI network. Our model treated the true interacting status (yes versus no) for any given pair of human proteins as a latent variable whose value was not observed. The experimental data were the manifestation of interacting status, which provided evidence as to the likelihood of the interaction. The confidence of interactions would depend on the strength and consistency of the evidence.

Get full-text (via PubEx)

Faculty Opinions recommendation of Interaction between intrinsically disordered proteins frequently occurs in a human protein-protein interaction network.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1164197.624888 ◽

2009 ◽

Author(s):

Vladimir Uversky

Keyword(s):

Protein Interaction ◽

Protein Interaction Network ◽

Intrinsically Disordered Proteins ◽

Interaction Network ◽

Human Protein ◽

Disordered Proteins ◽

Protein Protein Interaction ◽

Intrinsically Disordered ◽

Protein Protein Interaction Network

Get full-text (via PubEx)

A High Throughput Screen to Identify Inhibitors of the KIF15-TPX2 Protein-Protein Interaction for Ovarian Cancer

SSRN Electronic Journal ◽

10.2139/ssrn.3279412 ◽

2018 ◽

Author(s):

Rebecca Wates ◽

Anuradha Roy ◽

Frank J Schoenen ◽

Jeffrey Hirst ◽

Anne Cooper ◽

...

Keyword(s):

Ovarian Cancer ◽

High Throughput ◽

Protein Interaction ◽

High Throughput Screen ◽

Protein Protein Interaction

Get full-text (via PubEx)