A supervised similarity measure for link prediction based on KNN

As a research hotspot of complex network analysis, link prediction has received growing attention from various disciplines. Link prediction intends to determine the connecting probability of latent links based on the observed structure information. To this end, a host of similarity-based and learning-based link prediction methods have been proposed. To attain stable prediction performance on diverse networks, this paper proposes a supervised similarity-based method, which absorbs the advantages of both kinds of link prediction methods. In the proposed method, to capture the characteristics of a node pair, a collection of structural features is extracted from the network to represent the node pair as a vector. Then, the positive and negative [Formula: see text]-nearest neighbors are searched from existing and nonexisting links, respectively. The connection likelihood of a node pair is measured according to its distances to the local mean vectors of positive and negative [Formula: see text]-nearest neighbors. The prediction performance of the proposed method is experimentally evaluated on 10 benchmark networks. The results show that the proposed method is superior to the compared methods in terms of accuracy and stableness.

Download Full-text

Link prediction via layer relevance of multiplex networks

International Journal of Modern Physics C ◽

10.1142/s0129183117501017 ◽

2017 ◽

Vol 28 (08) ◽

pp. 1750101 ◽

Cited By ~ 7

Author(s):

Yabing Yao ◽

Ruisheng Zhang ◽

Fan Yang ◽

Yongna Yuan ◽

Qingshuang Sun ◽

...

Keyword(s):

Structural Properties ◽

Link Prediction ◽

Structural Information ◽

Similarity Index ◽

Single Layer ◽

Structural Features ◽

Prediction Performance ◽

Multiplex Networks ◽

Multiplex Network ◽

Node Similarity

In complex networks, the existing link prediction methods primarily focus on the internal structural information derived from single-layer networks. However, the role of interlayer information is hardly recognized in multiplex networks, which provide more diverse structural features than single-layer networks. Actually, the structural properties and functions of one layer can affect that of other layers in multiplex networks. In this paper, the effect of interlayer structural properties on the link prediction performance is investigated in multiplex networks. By utilizing the intralayer and interlayer information, we propose a novel “Node Similarity Index” based on “Layer Relevance” (NSILR) of multiplex network for link prediction. The performance of NSILR index is validated on each layer of seven multiplex networks in real-world systems. Experimental results show that the NSILR index can significantly improve the prediction performance compared with the traditional methods, which only consider the intralayer information. Furthermore, the more relevant the layers are, the higher the performance is enhanced.

Download Full-text

Link prediction in real-world multiplex networks via layer reconstruction method

Royal Society Open Science ◽

10.1098/rsos.191928 ◽

2020 ◽

Vol 7 (7) ◽

pp. 191928

Author(s):

Amir Mahdi Abdolhosseini-Qomi ◽

Seyed Hossein Jafari ◽

Amirheckmat Taghizadeh ◽

Naser Yazdani ◽

Masoud Asadpour ◽

...

Keyword(s):

Real World ◽

Link Prediction ◽

Large Body ◽

Single Layer ◽

Structural Features ◽

Prediction Performance ◽

Reconstruction Method ◽

Multiplex Networks ◽

Technological Complex ◽

Different Types

Networks are invaluable tools to study real biological, social and technological complex systems in which connected elements form a purposeful phenomenon. A higher resolution image of these systems shows that the connection types do not confine to one but to a variety of types. Multiplex networks encode this complexity with a set of nodes which are connected in different layers via different types of links. A large body of research on link prediction problem is devoted to finding missing links in single-layer (simplex) networks. In recent years, the problem of link prediction in multiplex networks has gained the attention of researchers from different scientific communities. Although most of these studies suggest that prediction performance can be enhanced by using the information contained in different layers of the network, the exact source of this enhancement remains obscure. Here, it is shown that similarity w.r.t. structural features (eigenvectors) is a major source of enhancements for link prediction task in multiplex networks using the proposed layer reconstruction method and experiments on real-world multiplex networks from different disciplines. Moreover, we characterize how low values of similarity w.r.t. structural features result in cases where improving prediction performance is substantially hard.

Download Full-text

An Experimental Evaluation of Similarity-Based and Embedding-Based Link Prediction Methods on Graphs

International Journal of Data Mining & Knowledge Management Process ◽

10.5121/ijdkp.2021.11501 ◽

2021 ◽

Vol 11 (5) ◽

pp. 1-18

Author(s):

Md Kamrul Islam ◽

Sabeur Aridhi ◽

Malika Smail-Tabbone

Keyword(s):

Link Prediction ◽

Graph Embedding ◽

Black Box ◽

Prediction Performance ◽

Prediction Methods ◽

Good Prediction ◽

Good Prediction Performance ◽

Node Similarity ◽

Category Comparison ◽

Selection Of

The task of inferring missing links or predicting future ones in a graph based on its current structure is referred to as link prediction. Link prediction methods that are based on pairwise node similarity are well-established approaches in the literature and show good prediction performance in many realworld graphs though they are heuristic. On the other hand, graph embedding approaches learn lowdimensional representation of nodes in graph and are capable of capturing inherent graph features, and thus support the subsequent link prediction task in graph. This paper studies a selection of methods from both categories on several benchmark (homogeneous) graphs with different properties from various domains. Beyond the intra and inter category comparison of the performances of the methods, our aim is also to uncover interesting connections between Graph Neural Network(GNN)- based methods and heuristic ones as a means to alleviate the black-box well-known limitation.

Download Full-text

An Experimental Evaluation of Similarity-Based and Embedding-Based Link Prediction Methods on Graphs

International Journal of Data Mining & Knowledge Management Process ◽

10.5121/ijdkp2021.11501 ◽

2021 ◽

Vol 11 (05) ◽

pp. 1-18

Author(s):

Md Kamrul Islam ◽

Sabeur Aridhi ◽

Malika Smail-Tabbone

Keyword(s):

Link Prediction ◽

Graph Embedding ◽

Black Box ◽

Prediction Performance ◽

Prediction Methods ◽

Good Prediction ◽

Good Prediction Performance ◽

Node Similarity ◽

Category Comparison ◽

Selection Of

Download Full-text

Towards effective link prediction: A hybrid similarity model

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200344 ◽

2020 ◽

pp. 1-14

Author(s):

Longjie Li ◽

Lu Wang ◽

Hongsheng Luo ◽

Xiaoyun Chen

Keyword(s):

Link Prediction ◽

Structural Similarity ◽

Research Direction ◽

Structural Features ◽

Proposed Model ◽

Similarity Model ◽

Weight Calculation ◽

Stable Performance ◽

Grey Relation ◽

Important Research Direction

Link prediction is an important research direction in complex network analysis and has drawn increasing attention from researchers in various fields. So far, a plethora of structural similarity-based methods have been proposed to solve the link prediction problem. To achieve stable performance on different networks, this paper proposes a hybrid similarity model to conduct link prediction. In the proposed model, the Grey Relation Analysis (GRA) approach is employed to integrate four carefully selected similarity indexes, which are designed according to different structural features. In addition, to adaptively estimate the weight for each index based on the observed network structures, a new weight calculation method is presented by considering the distribution of similarity scores. Due to taking separate similarity indexes into account, the proposed method is applicable to multiple different types of network. Experimental results show that the proposed method outperforms other prediction methods in terms of accuracy and stableness on 10 benchmark networks.

Download Full-text

Non-negative sparse Laplacian regularized latent multi-view subspace clustering

Journal of Algorithms & Computational Technology ◽

10.1177/17483026211024904 ◽

2021 ◽

Vol 15 ◽

pp. 174830262110249

Author(s):

Cong-Zhe You ◽

Zhen-Qiu Shu ◽

Hong-Hui Fan

Keyword(s):

Subspace Clustering ◽

Structural Features ◽

Space Representation ◽

Graph Regularization ◽

Structure Information ◽

Cluster Data ◽

Latent Space ◽

Low Dimensional ◽

Relationship Of ◽

The Relationship

Recently, in the area of artificial intelligence and machine learning, subspace clustering of multi-view data is a research hotspot. The goal is to divide data samples from different sources into different groups. We proposed a new subspace clustering method for multi-view data which termed as Non-negative Sparse Laplacian regularized Latent Multi-view Subspace Clustering (NSL2MSC) in this paper. The method proposed in this paper learns the latent space representation of multi view data samples, and performs the data reconstruction on the latent space. The algorithm can cluster data in the latent representation space and use the relationship of different views. However, the traditional representation-based method does not consider the non-linear geometry inside the data, and may lose the local and similar information between the data in the learning process. By using the graph regularization method, we can not only capture the global low dimensional structural features of data, but also fully capture the nonlinear geometric structure information of data. The experimental results show that the proposed method is effective and its performance is better than most of the existing alternatives.

Download Full-text

Candidate gene prioritization using graph embedding

10.1101/2020.02.03.927913 ◽

2020 ◽

Author(s):

Quan Do ◽

Pierre Larmande

Keyword(s):

Link Prediction ◽

Graph Embedding ◽

Prediction Performance ◽

Knowledge Graph ◽

Learning Techniques ◽

Number Of Genes ◽

Important Amount ◽

Candidate Gene Prioritization ◽

Gene Information ◽

Embedding Methods

AbstractCandidate genes prioritization allows to rank among a large number of genes, those that are strongly associated with a phenotype or a disease. Due to the important amount of data that needs to be integrate and analyse, gene-to-phenotype association is still a challenging task. In this paper, we evaluated a knowledge graph approach combined with embedding methods to overcome these challenges. We first introduced a dataset of rice genes created from several open-access databases. Then, we used the Translating Embedding model and Convolution Knowledge Base model, to vectorize gene information. Finally, we evaluated the results using link prediction performance and vectors representation using some unsupervised learning techniques.

Download Full-text

A novel similarity measure for missing link prediction in social networks

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v19.i2.pp1071-1077 ◽

2020 ◽

Vol 19 (2) ◽

pp. 1071

Author(s):

Gogulamudi Naga Chandrika ◽

E. Srinivasa Reddy

Keyword(s):

Social Networks ◽

Complex Networks ◽

Similarity Measure ◽

Network Theory ◽

Link Prediction ◽

Prediction Methods ◽

Missing Link ◽

The Social ◽

Hidden Patterns ◽

Over Time

Social Networks progress over time by the addition of new nodes and links, form associations with one community to the other community. Over a few decades, the fast expansion of Social Networks has attracted many researchers to pay more attention towards complex networks, the collection of social data, understand the social behaviors of complex networks and predict future conflicts. Thus, Link prediction is imperative to do research with social networks and network theory. The objective of this research is to find the hidden patterns and uncovered missing links over complex networks. Here, we developed a new similarity measure to predict missing links over social networks. The new method is computed on common neighbors with node-to-node distance to get better accuracy of missing link prediction. We tested the proposed measure on a variety of real-world linked datasets which are formed from various linked social networks. The proposed approach performance is compared with contemporary link prediction methods. Our measure makes very effective and intuitive in predicting disappeared links in linked social networks.

Download Full-text

The New Link Prediction Methods Based on Spectral Analysis

2017 13th International Conference on Semantics, Knowledge and Grids (SKG) ◽

10.1109/skg.2017.00025 ◽

2017 ◽

Cited By ~ 1

Author(s):

Yu Qiu ◽

Rongjing Hu ◽

Runzhen Yan ◽

Yabing Yao ◽

Mengyao Jiao

Keyword(s):

Spectral Analysis ◽

Link Prediction ◽

Prediction Methods

Download Full-text

Synonymy Expansion Using Link Prediction Methods: A Case Study of Assamese WordNet

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3467966 ◽

2022 ◽

Vol 21 (1) ◽

pp. 1-21

Author(s):

Bornali Phukon ◽

Akash Anil ◽

Sanasam Ranbir Singh ◽

Priyankoo Sarmah

Keyword(s):

Network Analysis ◽

Target Word ◽

Complex Network ◽

Link Prediction ◽

Expansion Method ◽

Prediction Methods ◽

Network Embedding ◽

Per Se ◽

Local Proximity

WordNets built for low-resource languages, such as Assamese, often use the expansion methodology. This may result in missing lexical entries and missing synonymy relations. As the Assamese WordNet is also built using the expansion method, using the Hindi WordNet, it also has missing synonymy relations. As WordNets can be visualized as a network of unique words connected by synonymy relations, link prediction in complex network analysis is an effective way of predicting missing relations in a network. Hence, to predict the missing synonyms in the Assamese WordNet, link prediction methods were used in the current work that proved effective. It is also observed that for discovering missing relations in the Assamese WordNet, simple local proximity-based methods might be more effective as compared to global and complex supervised models using network embedding. Further, it is noticed that though a set of retrieved words are not synonyms per se, they are semantically related to the target word and may be categorized as semantic cohorts.

Download Full-text