Gabriel Barbosa Fonseca
Zenilton K. G. Patrocínio Jr
Guillaume Gravier
Silvio Jamil F. Guimarães
The indexing of large datasets is a task of great importance, since it directly impacts the quality of information that can be retrieved from them. Unfortunately, some datasets grow so quickly that manual indexing becomes unfeasible. Automatic indexing techniques can be applied to overcome this issue, and in this study an unsupervised technique for multimodal person discovery is proposed, which consists of detecting persons who appear and speak simultaneously in a video and associating names with them. To achieve this, the data are modeled as a graph of speaking faces, and names are extracted via OCR and propagated through the graph based on audiovisual relations between speaking faces. To propagate labels, two graph-based methods are proposed: one based on random walks and the other based on a hierarchical approach. In order to assess the proposed approach, we use two graph-clustering baselines and different modality-fusion approaches. On the MediaEval MPD 2017 dataset, the proposed label-propagation methods outperform all literature methods except one, which uses a different approach in the pre-processing step. Even though the Kappa coefficient indicates that the random-walk and hierarchical label propagation produce highly equivalent results, the hierarchical propagation is more than six times faster than the random walk under the same configurations.
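To illustrate the general idea of propagating OCR-extracted names through a graph of speaking faces via random walks, the following is a minimal sketch. It is not the paper's implementation: the toy graph, its similarity weights, the seed names, and the restart parameter `alpha` are all hypothetical, and the update rule is a standard random-walk-with-restart label propagation.

```python
import numpy as np

# Hypothetical toy graph: 4 speaking-face nodes with symmetric
# audiovisual similarity weights (illustrative values only).
W = np.array([
    [0.0, 0.9, 0.1, 0.0],
    [0.9, 0.0, 0.2, 0.1],
    [0.1, 0.2, 0.0, 0.8],
    [0.0, 0.1, 0.8, 0.0],
])

# Seed labels as if extracted by OCR: node 0 -> "alice",
# node 3 -> "bob"; nodes 1 and 2 start unlabeled.
names = ["alice", "bob"]
Y = np.zeros((4, 2))
Y[0, 0] = 1.0
Y[3, 1] = 1.0

# Row-normalized transition matrix for the random walk.
P = W / W.sum(axis=1, keepdims=True)

# Iterative propagation with restart: each step mixes the
# neighbors' label distributions with the original seeds.
alpha = 0.85  # assumed restart trade-off, not from the paper
F = Y.copy()
for _ in range(100):
    F = alpha * (P @ F) + (1 - alpha) * Y

# Assign each speaking face the name with the highest score.
labels = [names[i] for i in F.argmax(axis=1)]
```

In this toy instance, node 1 is pulled toward the "alice" seed through its strong edge to node 0, and node 2 toward "bob" through node 3, showing how audiovisual edge weights drive the name assignment.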