scholarly journals Incomplete and noisy network data as a percolation process

2010 ◽  
Vol 7 (51) ◽  
pp. 1411-1419 ◽  
Author(s):  
Michael P. H. Stumpf ◽  
Carsten Wiuf

We discuss the ramifications of noisy and incomplete observations of network data on the existence of a giant connected component (GCC). The existence of a GCC in a random graph can be described in terms of a percolation process, and building on general results for classes of random graphs with specified degree distributions we derive percolation thresholds above which GCCs exist. We show that sampling and noise can have a profound effect on the perceived existence of a GCC and find that both processes can destroy it. We also show that the absence of a GCC puts a theoretical upper bound on the false-positive rate and relate our percolation analysis to experimental protein–protein interaction data.

Author(s):  
Hugo Willy

Recent breakthroughs in high throughput experiments to determine protein-protein interaction have generated a vast amount of protein interaction data. However, most of the experiments could only answer the question of whether two proteins interact but not the question on the mechanisms by which proteins interact. Such understanding is crucial for understanding the protein interaction of an organism as a whole (the interactome) and even predicting novel protein interactions. Protein interaction usually occurs at some specific sites on the proteins and, given their importance, they are usually well conserved throughout the evolution of the proteins of the same family. Based on this observation, a number of works on finding protein patterns/motifs conserved in interacting proteins have emerged in the last few years. Such motifs are collectively termed as the interaction motifs. This chapter provides a review on the different approaches on finding interaction motifs with a discussion on their implications, potentials and possible areas of improvements in the future.


2011 ◽  
Vol 16 (2) ◽  
Author(s):  
Dariusz Plewczynski ◽  
Tomas Klingström

AbstractStudying the interactome is one of the exciting frontiers of proteomics, as shown lately at the recent bioinformatics conferences (for example ISMB 2010, or ECCB 2010). Distribution of data is facilitated by a large number of databases. Metamining databases have been created in order to allow researchers access to several databases in one search, but there are serious difficulties for end users to evaluate the metamining effort. Therefore we suggest a new standard, “Good Interaction Data Metamining Practice” (GIDMP), which could be easily automated and requires only very minor inclusion of statistical data on each database homepage. Widespread adoption of the GIDMP standard would provide users with: a standardized way to evaluate the statistics provided by each metamining database, thus enhancing the end-user experiencea stable contact point for each database, allowing the smooth transition of statisticsa fully automated system, enhancing time- and cost-effectiveness.The proposed information can be presented as a few hidden lines of text on the source database www page, and a constantly updated table for a metamining database included in the source/credits web page.


Yeast ◽  
2001 ◽  
Vol 18 (6) ◽  
pp. 523-531 ◽  
Author(s):  
Haretsugu Hishigaki ◽  
Kenta Nakai ◽  
Toshihide Ono ◽  
Akira Tanigami ◽  
Toshihisa Takagi

Sign in / Sign up

Export Citation Format

Share Document