An Experimental Evaluation of Similarity-Based and Embedding-Based Link Prediction Methods on Graphs

Md Kamrul Islam; Sabeur Aridhi; Malika Smail-Tabbone

doi:10.5121/ijdkp.2021.11501

An Experimental Evaluation of Similarity-Based and Embedding-Based Link Prediction Methods on Graphs

International Journal of Data Mining & Knowledge Management Process ◽

10.5121/ijdkp2021.11501 ◽

2021 ◽

Vol 11 (05) ◽

pp. 1-18

Author(s):

Md Kamrul Islam ◽

Sabeur Aridhi ◽

Malika Smail-Tabbone

Keyword(s):

Link Prediction ◽

Graph Embedding ◽

Black Box ◽

Prediction Performance ◽

Prediction Methods ◽

Good Prediction ◽

Good Prediction Performance ◽

Node Similarity ◽

Category Comparison ◽

Selection Of

The task of inferring missing links or predicting future ones in a graph based on its current structure is referred to as link prediction. Link prediction methods that are based on pairwise node similarity are well-established approaches in the literature and show good prediction performance in many realworld graphs though they are heuristic. On the other hand, graph embedding approaches learn lowdimensional representation of nodes in graph and are capable of capturing inherent graph features, and thus support the subsequent link prediction task in graph. This paper studies a selection of methods from both categories on several benchmark (homogeneous) graphs with different properties from various domains. Beyond the intra and inter category comparison of the performances of the methods, our aim is also to uncover interesting connections between Graph Neural Network(GNN)- based methods and heuristic ones as a means to alleviate the black-box well-known limitation.

Download Full-text

Appraisal Study of Similarity-Based and Embedding-Based Link Prediction Methods on Graphs

10.5121/csit.2021.111106 ◽

2021 ◽

Author(s):

Md Kamrul Islam ◽

Sabeur Aridhi ◽

Malika Smail-Tabbone

Keyword(s):

Link Prediction ◽

Black Box ◽

Prediction Methods ◽

Good Prediction ◽

Dimensional Representation ◽

Good Prediction Performance ◽

Node Similarity ◽

Category Comparison ◽

Low Dimensional ◽

Selection Of

The task of inferring missing links or predicting future ones in a graph based on its current structure is referred to as link prediction. Link prediction methods that are based on pairwise node similarity are well-established approaches in the literature and show good prediction performance in many real-world graphs though they are heuristic. On the other hand, graph embedding approaches learn low-dimensional representation of nodes in graph and are capable of capturing inherent graph features, and thus support the subsequent link prediction task in graph. This appraisal paper studies a selection of methods from both categories on several benchmark (homogeneous) graphs with different properties from various domains. Beyond the intra and inter category comparison of the performances of the methods our aim is also to uncover interesting connections between Graph Neural Network(GNN)-based methods and heuristic ones as a means to alleviate the black-box well-known limitation.

Download Full-text

Candidate gene prioritization using graph embedding

10.1101/2020.02.03.927913 ◽

2020 ◽

Author(s):

Quan Do ◽

Pierre Larmande

Keyword(s):

Link Prediction ◽

Graph Embedding ◽

Prediction Performance ◽

Knowledge Graph ◽

Learning Techniques ◽

Number Of Genes ◽

Important Amount ◽

Candidate Gene Prioritization ◽

Gene Information ◽

Embedding Methods

AbstractCandidate genes prioritization allows to rank among a large number of genes, those that are strongly associated with a phenotype or a disease. Due to the important amount of data that needs to be integrate and analyse, gene-to-phenotype association is still a challenging task. In this paper, we evaluated a knowledge graph approach combined with embedding methods to overcome these challenges. We first introduced a dataset of rice genes created from several open-access databases. Then, we used the Translating Embedding model and Convolution Knowledge Base model, to vectorize gene information. Finally, we evaluated the results using link prediction performance and vectors representation using some unsupervised learning techniques.

Download Full-text

Link prediction via layer relevance of multiplex networks

International Journal of Modern Physics C ◽

10.1142/s0129183117501017 ◽

2017 ◽

Vol 28 (08) ◽

pp. 1750101 ◽

Cited By ~ 7

Author(s):

Yabing Yao ◽

Ruisheng Zhang ◽

Fan Yang ◽

Yongna Yuan ◽

Qingshuang Sun ◽

...

Keyword(s):

Structural Properties ◽

Link Prediction ◽

Structural Information ◽

Similarity Index ◽

Single Layer ◽

Structural Features ◽

Prediction Performance ◽

Multiplex Networks ◽

Multiplex Network ◽

Node Similarity

In complex networks, the existing link prediction methods primarily focus on the internal structural information derived from single-layer networks. However, the role of interlayer information is hardly recognized in multiplex networks, which provide more diverse structural features than single-layer networks. Actually, the structural properties and functions of one layer can affect that of other layers in multiplex networks. In this paper, the effect of interlayer structural properties on the link prediction performance is investigated in multiplex networks. By utilizing the intralayer and interlayer information, we propose a novel “Node Similarity Index” based on “Layer Relevance” (NSILR) of multiplex network for link prediction. The performance of NSILR index is validated on each layer of seven multiplex networks in real-world systems. Experimental results show that the NSILR index can significantly improve the prediction performance compared with the traditional methods, which only consider the intralayer information. Furthermore, the more relevant the layers are, the higher the performance is enhanced.

Download Full-text

Link prediction based on contribution of neighbors

International Journal of Modern Physics C ◽

10.1142/s0129183120501582 ◽

2020 ◽

Vol 31 (11) ◽

pp. 2050158

Author(s):

Xiang-Chun Liu ◽

Dian-Qing Meng ◽

Xu-Zhen Zhu ◽

Yang Tian

Keyword(s):

Complex Network ◽

Link Prediction ◽

Node Degree ◽

Prediction Methods ◽

Obvious Advantage ◽

Node Similarity ◽

The Difference

Link prediction based on node similarity has become one of the most effective prediction methods for complex network. When calculating the similarity between two unconnected endpoints in link prediction, most scholars evaluate the influence of endpoint based on the node degree. However, this method ignores the difference in contribution of neighbor (NC) nodes for endpoint. Through abundant investigations and analyses, the paper quantifies the NC nodes to endpoint, and conceives NC Index to evaluate the endpoint influence accurately. Extensive experiments on 12 real datasets indicate that our proposed algorithm can increase the accuracy of link prediction significantly and show an obvious advantage over traditional algorithms.

Download Full-text

Link prediction based on local weighted paths for complex networks

International Journal of Modern Physics C ◽

10.1142/s012918311750053x ◽

2017 ◽

Vol 28 (04) ◽

pp. 1750053

Author(s):

Yabing Yao ◽

Ruisheng Zhang ◽

Fan Yang ◽

Yongna Yuan ◽

Rongjing Hu ◽

...

Keyword(s):

Complex Networks ◽

Real World ◽

Link Prediction ◽

Structural Similarity ◽

Prediction Performance ◽

Topological Feature ◽

Topological Features ◽

Node Similarity ◽

Weighted Paths ◽

Path Dependent

As a significant problem in complex networks, link prediction aims to find the missing and future links between two unconnected nodes by estimating the existence likelihood of potential links. It plays an important role in understanding the evolution mechanism of networks and has broad applications in practice. In order to improve prediction performance, a variety of structural similarity-based methods that rely on different topological features have been put forward. As one topological feature, the path information between node pairs is utilized to calculate the node similarity. However, many path-dependent methods neglect the different contributions of paths for a pair of nodes. In this paper, a local weighted path (LWP) index is proposed to differentiate the contributions between paths. The LWP index considers the effect of the link degrees of intermediate links and the connectivity influence of intermediate nodes on paths to quantify the path weight in the prediction procedure. The experimental results on 12 real-world networks show that the LWP index outperforms other seven prediction baselines.

Download Full-text

GLEE: Geometric Laplacian Eigenmap Embedding

Journal of Complex Networks ◽

10.1093/comnet/cnaa007 ◽

2020 ◽

Vol 8 (2) ◽

Author(s):

Leo Torres ◽

Kevin S Chan ◽

Tina Eliassi-Rad

Keyword(s):

Link Prediction ◽

Graph Embedding ◽

Laplacian Matrix ◽

Dimensional Representation ◽

Laplacian Eigenmaps ◽

New Approach ◽

Graph Reconstruction ◽

Node Similarity ◽

Distance Minimization ◽

Low Dimensional

Abstract Graph embedding seeks to build a low-dimensional representation of a graph $G$. This low-dimensional representation is then used for various downstream tasks. One popular approach is Laplacian Eigenmaps (LE), which constructs a graph embedding based on the spectral properties of the Laplacian matrix of $G$. The intuition behind it, and many other embedding techniques, is that the embedding of a graph must respect node similarity: similar nodes must have embeddings that are close to one another. Here, we dispose of this distance-minimization assumption. Instead, we use the Laplacian matrix to find an embedding with geometric properties instead of spectral ones, by leveraging the so-called simplex geometry of $G$. We introduce a new approach, Geometric Laplacian Eigenmap Embedding, and demonstrate that it outperforms various other techniques (including LE) in the tasks of graph reconstruction and link prediction.

Download Full-text

Vis-NIR Hyperspectral Imaging for Online Quality Evaluation during Food Processing: A Case Study of Hot Air Drying of Purple-Speckled Cocoyam (Colocasia esculenta (L.) Schott)

Processes ◽

10.3390/pr9101804 ◽

2021 ◽

Vol 9 (10) ◽

pp. 1804

Author(s):

John Ndisya ◽

Ayub Gitau ◽

Duncan Mbuge ◽

Arman Arefi ◽

Liliana Bădulescu ◽

...

Keyword(s):

Hyperspectral Imaging ◽

Prediction Models ◽

Prediction Performance ◽

Good Prediction ◽

Hot Air Drying ◽

Full Spectrum ◽

Air Drying ◽

Hot Air ◽

Good Prediction Performance ◽

Hue Angle

In this study, hyperspectral imaging (HSI) and chemometrics were implemented to develop prediction models for moisture, colour, chemical and structural attributes of purple-speckled cocoyam slices subjected to hot-air drying. Since HSI systems are costly and computationally demanding, the selection of a narrow band of wavelengths can enable the utilisation of simpler multispectral systems. In this study, 19 optimal wavelengths in the spectral range 400–1700 nm were selected using PLS-BETA and PLS-VIP feature selection methods. Prediction models for the studied quality attributes were developed from the 19 wavelengths. Excellent prediction performance (RMSEP < 2.0, r2P > 0.90, RPDP > 3.5) was obtained for MC, RR, VS and aw. Good prediction performance (RMSEP < 8.0, r2P = 0.70–0.90, RPDP > 2.0) was obtained for PC, BI, CIELAB b*, chroma, TFC, TAA and hue angle. Additionally, PPA and WI were also predicted successfully. An assessment of the agreement between predictions from the non-invasive hyperspectral imaging technique and experimental results from the routine laboratory methods established the potential of the HSI technique to replace or be used interchangeably with laboratory measurements. Additionally, a comparison of full-spectrum model results and the reduced models demonstrated the potential replacement of HSI with simpler imaging systems.

Download Full-text

Adaptive Network Automata Modelling of Complex Networks

10.20944/preprints202012.0808.v2 ◽

2021 ◽

Author(s):

Alessandro Muscoloni ◽

Umberto Michieli ◽

Carlo Vittorio Cannistraci

Keyword(s):

Complex Networks ◽

Link Prediction ◽

Network Science ◽

Fundamental Problem ◽

Graph Embedding ◽

Prediction Performance ◽

Self Organization ◽

Deterministic Models ◽

Adaptive Network ◽

Wide Range

Many complex networks have a connectivity that might be only partially detected or that tends to grow over time, hence the prediction of non-observed links is a fundamental problem in network science. The aim of topological link prediction is to forecast these non-observed links by only exploiting features intrinsic to the network topology. It has a wide range of real applications, like suggesting friendships in social networks or predicting interactions in biological networks.The Cannistraci-Hebb theory is a recent achievement in network science that includes a theoretical framework to understand local-based link prediction on paths of length n. In this study we introduce two innovations: theory of modelling (science) and theory of realization (engineering). For the theory of modelling we first recall a definition of network automata as a general framework for modelling the growth of connectivity in complex networks. We then show that several deterministic models previously developed fall within this framework and we introduce novel network automata following the Cannistraci-Hebb rule. For the theory of realization, we present how to build adaptive network automata for link prediction, which incorporate multiple deterministic models of self-organization and automatically choose the rule that better explains the patterns of connectivity in the network under investigation. We compare Cannistraci-Hebb adaptive (CHA) network automaton against state-of-the-art link prediction methods such as structural perturbation method (SPM), stochastic block models (SBM) and artificial intelligence algorithms for graph embedding. CHA displays an overall higher link prediction performance across different evaluation frameworks on 1386 networks. Finally, we highlight that CHA offers the key advantage to explicitly explain the mechanistic rule of self-organization which leads to the link prediction performance, whereas SPM and graph embedding not. In comparison to CHA, SBM unfortunately shows irrelevant and unsatisfactory performance demonstrating that SBM modelling is not adequate for link prediction in real networks.

Download Full-text

Accumulation of Corresponding Colours Under Extreme Chromatic Illuminations and modification of CAM16

Color and Imaging Conference ◽

10.2352/issn.2169-2629.2021.29.368 ◽

2021 ◽

Vol 2021 (29) ◽

pp. 368-373

Author(s):

Yuechen Zhu ◽

Ming Ronnier Luo

Keyword(s):

Magnitude Estimation ◽

Estimation Method ◽

Prediction Performance ◽

Chromatic Adaptation ◽

Good Prediction ◽

Lighting Conditions ◽

Good Prediction Performance ◽

The Matrix ◽

D Values ◽

Magnitude Estimation Method

The goal of this study was to investigate the chromatic adaptation under extreme chromatic lighting conditions using the magnitude estimation method. The locations of the lightings on CIE1976 u′v′ plane were close to the spectrum locus, so the colour purity was far beyond the previous studies, and the data could test the limitations of the existing models. Two psychophysical experiments were carried out, and 1,470 estimations of corresponding colours were accumulated. The results showed that CAT16 gave a good prediction performance for all the chromatic lightings except for blue lighting, and the degree of adaptation was relatively high, that is, D was close to 1. The prediction for blue lightings was modified, the results showed the performance of CAM16 could be improved by correcting the matrix instead of the D values.

Download Full-text